Techdots

March 22, 2025

Deep Dive into Ruby’s Enumerator and Enumerable: Lazy Evaluation and Custom Iterators

Have you ever wondered how Ruby handles large datasets, infinite sequences, or streaming data so efficiently? 

The answer lies in its powerful Enumerable and Enumerator modules, which form the backbone of Ruby’s iteration capabilities. These tools enable developers to work with collections, streams, and even infinite sequences using techniques like lazy evaluation, external iterators, and internal iterators. 

Whether you’re processing massive CSV files, consuming streaming APIs, or implementing custom iteration patterns, Ruby’s Enumerable and Enumerator provide the flexibility and performance you need. 

In this guide, we’ll explore how to leverage these modules, dive into lazy evaluation, and create custom iterators using yield and fibers to write more efficient and expressive Ruby code. Let’s get started!

1. Understanding Enumerable and Enumerator

Let's have a look at Enumerable and Enumerator:

Enumerable

The Enumerable module is a mixin that provides collection classes with a suite of methods for traversal, searching, and sorting. It relies on the each method, which must be implemented by the including class. Common methods like map, select, reduce, and find are all part of Enumerable.

class MyCollection

 include Enumerable

 def initialize(items)

   @items = items

 end

 def each(&block)

   @items.each(&block)

 end

end

collection = MyCollection.new([1, 2, 3, 4])

collection.map { |x| x * 2 } # => [2, 4, 6, 8]

Enumerator

An Enumerator is an object that encapsulates iteration. It allows you to externalize the control of iteration, making it possible to pause, resume, or even rewind the iteration process. Enumerators are often created using the to_enum method or the Enumerator.new constructor.

enum = [1, 2, 3].to_enum

enum.next # => 1

enum.next # => 2

enum.next # => 3

enum.next # raises StopIteration

2. Internal vs. External Iterators

Internal Iterators

Internal iterators are the most common in Ruby. Methods like each, map, and select are internal iterators because they control the iteration process internally. You simply provide a block, and the method handles the iteration for you.

[1, 2, 3].each { |x| puts x }

External Iterators

External iterators, on the other hand, give you control over the iteration process. The Enumerator class is a prime example of this. You can manually call next to advance the iteration, allowing for more complex iteration patterns.

enum = [1, 2, 3].to_enum

puts enum.next # => 1

puts enum.next # => 2

External iterators are particularly useful when you need to coordinate iteration across multiple collections or when you want to implement custom iteration logic.

3. Lazy Evaluation with Enumerator::Lazy

Lazy evaluation is a technique where computation is deferred until absolutely necessary. In Ruby, the Enumerator::Lazy class enables lazy evaluation for enumerables. This is especially useful when working with large or infinite sequences, as it avoids loading the entire collection into memory.

Example: Infinite Sequence

infinite_sequence = (1..Float::INFINITY).lazy

squares = infinite_sequence.map { |x| x * x }

puts squares.first(10).to_a # => [1, 4, 9, 16, 25, 36, 49, 64, 81, 100]

Example: Large CSV Processing

When processing large CSV files, lazy evaluation can significantly reduce memory usage.

require 'csv'

CSV.foreach('large_file.csv').lazy

  .map { |row| process_row(row) }

  .take(100)

  .each { |processed_row| puts processed_row }

4. Creating Custom Iterators with Enumerator.new

The Enumerator.new method allows you to create custom iterators. This is particularly useful when you need to implement complex iteration logic or when working with external data sources.

Example: Custom Iterator for Fibonacci Sequence

fibonacci = Enumerator.new do |yielder|

 a = b = 1

 loop do

   yielder << a

   a, b = b, a + b

 end

end

puts fibonacci.take(10).to_a # => [1, 1, 2, 3, 5, 8, 13, 21, 34, 55]

Example: Custom Iterator for Streaming API

require 'net/http'

def api_stream

 Enumerator.new do |yielder|

   url = URI('https://api.example.com/stream')

   Net::HTTP.start(url.host, url.port, use_ssl: true) do |http|

     request = Net::HTTP::Get.new(url)

     http.request(request) do |response|

       response.read_body do |chunk|

         yielder << chunk

       end

     end

   end

 end

end

api_stream.lazy.each { |chunk| process_chunk(chunk) }

5. Real-World Examples

Processing Large CSV Files

Lazy evaluation is a game-changer when dealing with large CSV files. Instead of loading the entire file into memory, you can process it line by line.

require 'csv'

CSV.foreach('large_file.csv').lazy

  .map { |row| process_row(row) }

  .each { |processed_row| save_row(processed_row) }

Streaming APIs

When working with streaming APIs, lazy enumerators allow you to process data as it arrives, rather than waiting for the entire response.

require 'net/http'

def stream_data

 Enumerator.new do |yielder|

   url = URI('https://api.example.com/stream')

   Net::HTTP.start(url.host, url.port, use_ssl: true) do |http|

     request = Net::HTTP::Get.new(url)

     http.request(request) do |response|

       response.read_body do |chunk|

         yielder << chunk

       end

     end

   end

 end

end

stream_data.lazy.each { |chunk| process_chunk(chunk) }

Handling Infinite Sequences

Infinite sequences, such as the Fibonacci sequence or a stream of random numbers, can be handled efficiently using lazy enumerators.

random_numbers = Enumerator.new do |yielder|

 loop { yielder << rand(100) }

end

puts random_numbers.lazy.take(10).to_a

6. Performance Considerations

Memory Efficiency: Lazy enumerators are memory-efficient because they process items one at a time, avoiding the need to load the entire collection into memory.

Performance Trade-offs: While lazy evaluation can save memory, it may introduce slight overhead due to the additional abstraction layer. Always profile your code to ensure it meets performance requirements.

Infinite Sequences: Lazy enumerators are ideal for infinite sequences, as they allow you to work with sequences that would otherwise be impossible to handle.

Conclusion

Ruby’s Enumerable and Enumerator modules provide a powerful foundation for working with collections and streams. By leveraging lazy evaluation and custom iterators, you can optimize performance, handle large datasets, and even work with infinite sequences. 

Whether you’re processing large CSV files, consuming streaming APIs, or implementing complex iteration patterns, these tools will help you write more efficient and expressive Ruby code. By mastering these concepts, you’ll be well-equipped to tackle a wide range of challenges in your Ruby projects. 

If you’re looking to deepen your understanding of Ruby or need expert guidance to implement these techniques, Techdots is here to help. With our expertise in Ruby and advanced programming practices, we can assist you in building scalable, high-performance applications. Let Techdots be your partner in unlocking the full potential of Ruby!

Ready to start a project?

Let’s work together to ensure your digital space is inclusive and compliant. Reach out to our team and start building an application that works for everyone.

Book Meeting