Question 1

Explain the difference between an iterable and an iterator in Python.

Accepted Answer

An iterable in Python is any object capable of returning its elements one at a time, such as a list, tuple, or string. Iterables implement the __iter__() method, which returns an iterator object.

An iterator, on the other hand, is the object returned by calling __iter__() on an iterable. It maintains an internal state and implements the __next__() method, which produces the next value when called, raising StopIteration when the sequence ends.

Practically, you can loop over an iterable directly using a for loop, but the for loop internally converts it into an iterator and calls __next__() repeatedly until exhaustion.

Question 2

How can custom iterators be implemented in Python for a class that generates squares of numbers?

Accepted Answer

To implement a custom iterator, a class must define the __iter__() and __next__() methods. __iter__() typically returns self, while __next__() calculates the next value and maintains the iteration state.

For a class generating squares of numbers, you can maintain a counter that increments with each call to __next__(), returning the square of the counter until a predefined limit is reached, after which StopIteration is raised.

This approach allows the class to be used in any context that expects an iterator, such as for loops or comprehension expressions, providing a clean and memory-efficient iteration mechanism.

Question 3

Discuss how Python iterators handle large data streams and why they are preferred over lists in such scenarios.

Accepted Answer

Python iterators enable lazy evaluation, generating each element only when needed rather than storing the entire dataset in memory. This is crucial for large data streams where holding all elements at once is infeasible.

For example, reading a multi-gigabyte log file line by line can be efficiently handled using an iterator, whereas loading all lines into a list could consume excessive memory and degrade performance.

Iterators also integrate with generators and itertools to create complex, composable pipelines. This combination allows for filtering, mapping, and batching operations on-the-fly without materializing intermediate sequences, enhancing scalability and reducing memory footprint.

Question 4

Which of the following statements about Python iterators are true?

Accepted Answer

All iterators are inherently iterables because they implement the __iter__() method, allowing them to be used in a for loop or any context requiring an iterable.

Calling iter() on an iterator returns the iterator itself as part of the iterator protocol. Iterators do not store all elements in memory and cannot be reused after exhaustion without re-creating them.

Question 5

Identify valid ways to create iterators in Python.

Accepted Answer

iter() converts any iterable into an iterator. Custom classes implementing __iter__() and __next__() can also be iterators.

Generator functions with yield implicitly create an iterator object when called. List comprehensions create lists, which are iterables but not iterators.

Question 6

When working with Python iterators, which behaviors are correct?

Accepted Answer

Iterators internally track the current position, allowing successive calls to __next__() to return the next element.

When an iterator is exhausted, next() raises StopIteration. Iterators cannot reset automatically; they must be recreated.

itertools.chain() combines multiple iterables into a single iterator without storing all elements in memory.

Question 7

Write a Python iterator class that yields even numbers up to a given limit.

Accepted Answer

The class maintains a current value starting from 0 and increments by 2 with each call to __next__().

When the current value exceeds the specified limit, StopIteration is raised, signaling the end of iteration.

This design allows iteration over even numbers up to a given limit without precomputing a list, saving memory and providing lazy evaluation.

Question 8

Create a generator function that yields Fibonacci numbers up to n elements.

Accepted Answer

The generator maintains two variables a and b to track consecutive Fibonacci numbers.

Yield produces each Fibonacci number on-the-fly, allowing iteration without storing the entire sequence.

This is efficient for generating long sequences where memory usage is a concern.

Question 9

Write an iterator that traverses a nested list and yields all integers in a flattened sequence.

Accepted Answer

The iterator uses a stack to manage elements, processing nested lists in a LIFO manner to flatten them.

Lists encountered during iteration are reversed and extended onto the stack to preserve order in the flattened sequence.

This approach allows traversal of arbitrarily nested lists without recursion, making it suitable for large or deeply nested data structures.

Question 10

Implement a Python iterator that cycles indefinitely over a finite list.

Accepted Answer

The iterator maintains an index and wraps around using modulo arithmetic, producing an infinite repeating sequence.

StopIteration is only raised if the input list is empty, otherwise iteration continues indefinitely.

This pattern is useful in applications such as round-robin scheduling or repeated simulations where cyclic access is required.

Question 11

Why can iterators improve application performance when processing large database exports or log files?

Accepted Answer

Iterators process data one element at a time instead of loading the entire dataset into memory. When working with multi-gigabyte database exports, audit logs, or event streams, this significantly reduces memory consumption and startup time.

A common production pattern is reading records from a file, transforming them, and sending them to another system. Using iterators allows each record to be processed immediately after it is read, creating a streaming pipeline rather than a batch-loading approach.

This design also improves scalability because memory usage remains relatively constant regardless of the size of the source data. As datasets grow, iterator-based solutions typically remain stable while list-based approaches may encounter memory pressure or performance degradation.

Question 12

Which operations consume elements from an iterator?

Accepted Answer

Iterators are stateful objects. Every call to next() advances the iterator. A for loop repeatedly calls next() internally, consuming elements until StopIteration is raised.

Converting an iterator to a list also consumes all remaining elements. Most iterator objects do not support len() because the total number of remaining elements may be unknown or expensive to determine.

Question 13

Create a custom iterator that returns records in batches of a specified size.

Accepted Answer

Batch processing is a common requirement when sending records to APIs, databases, or message queues. Instead of handling one record at a time, the iterator returns groups of records.

The iterator maintains an index and slices the underlying collection on each iteration. This pattern is frequently used in ETL and integration workloads where systems impose batch size limits.

Question 14

What are the risks of passing the same iterator to multiple consumers?

Accepted Answer

Iterators maintain internal state. When multiple consumers share the same iterator, each consumer advances the iterator position. This can result in missing records, inconsistent processing, or difficult-to-debug behavior.

For example, if one component reads five records before another component starts processing, those five records are no longer available to the second consumer. Unlike lists, iterators do not automatically provide independent views of the same data.

In production systems, it is often safer to create separate iterators from the original iterable or use tools such as itertools.tee() when independent traversal is required. However, developers should understand the memory implications of duplicating iterator state.

Question 15

Which statements about StopIteration are correct?

Accepted Answer

StopIteration is the mechanism used by the iterator protocol to indicate exhaustion. When no additional values are available, __next__() raises StopIteration.

For loops catch this exception internally and terminate the loop gracefully. Developers usually interact with it indirectly through iteration constructs.

Question 16

Write code that manually iterates through a tuple using the iterator protocol.

Accepted Answer

This example demonstrates the low-level iterator protocol that powers every for loop in Python.

The iter() function creates an iterator, and next() retrieves successive values until StopIteration signals completion. Understanding this behavior helps when debugging custom iterators.

Question 17

Implement an iterator that reads a text file one chunk at a time instead of loading the entire file.

Accepted Answer

Large files can be processed incrementally using chunk-based iteration. This approach avoids loading the entire file into memory.

The iterator reads a fixed number of characters during each iteration and automatically stops when the end of the file is reached. Similar patterns are commonly used for log processing and file transfer systems.

Question 18

Which built-in Python functions return iterator objects in modern Python versions?

Accepted Answer

map(), filter(), and zip() produce lazy iterators that generate values on demand. This allows large datasets to be processed efficiently.

sorted() is different because it immediately creates and returns a list containing all sorted elements.

Question 19

When should a developer choose a generator instead of building a custom iterator class?

Accepted Answer

Generators are usually preferred when iteration logic is straightforward and does not require complex state management. They provide the same lazy behavior while significantly reducing boilerplate code.

A generator can often replace dozens of lines of iterator class implementation with a few yield statements. This improves readability and maintainability without sacrificing performance.

Custom iterator classes become more valuable when multiple state variables, configuration options, resource management requirements, or specialized behaviors need to be encapsulated within a reusable object.

Question 20

Create a generator-based iterator that filters only successful API response codes from a stream of status codes.

Accepted Answer

The generator evaluates each status code lazily and yields only successful HTTP responses. This avoids creating unnecessary intermediate collections.

Similar filtering pipelines are frequently used in API monitoring, integration platforms, observability systems, and event-processing applications where millions of records may pass through a workflow.

Question 21

Explain how Python's itertools module complements iterators in real-world applications.

Accepted Answer

The itertools module provides a suite of tools for building complex iterators that perform combinations, permutations, chaining, grouping, and infinite iteration without creating intermediate collections.

For example, itertools.cycle() can be used for round-robin scheduling, and itertools.islice() allows slicing an iterator efficiently, which is especially useful for large datasets or streaming data.

By leveraging itertools with custom iterators, developers can create memory-efficient pipelines for ETL, batch processing, or analytics tasks without the overhead of storing all intermediate results in memory.

Question 22

Which itertools functions return iterators in Python?

Accepted Answer

itertools.count() generates an infinite iterator of numbers. permutations() and combinations() produce iterators over all possible arrangements and selections, respectively.

sum() computes a value immediately and returns an integer, not an iterator, so it does not support lazy iteration.

Question 23

Write a Python iterator that flattens a dictionary of lists into individual key-value pairs.

Accepted Answer

The iterator keeps track of both the current key and index within the list associated with that key.

It moves to the next key when the inner list is exhausted, flattening the dictionary into a stream of key-value tuples.

This approach is useful when iterating over structured data from APIs, configuration files, or nested datasets in a memory-efficient manner.

Question 24

What is the difference between a generator expression and a list comprehension regarding iteration?

Accepted Answer

A generator expression uses lazy evaluation, creating an iterator that yields values one at a time, whereas a list comprehension evaluates immediately and returns a complete list.

Generator expressions are memory-efficient for large sequences because they produce items on-the-fly without storing the entire result in memory.

In practice, generator expressions are preferred when processing streams of data, while list comprehensions are convenient for small collections where immediate access to all items is required.

Question 25

Which of the following are valid ways to consume an iterator in Python?

Accepted Answer

For loops, list(), and sum() internally iterate through the iterator, consuming elements as they go.

reversed() requires a sequence with a known length and indexable elements, so it cannot directly operate on generic iterators.

Question 26

Implement a generator that yields only prime numbers up to a given limit.

Accepted Answer

The generator iterates from 2 up to the specified limit and checks each number for primality by testing divisibility up to its square root.

Using yield allows each prime number to be produced on demand, avoiding storage of all primes in memory and supporting efficient processing of large limits.

Question 27

How do Python iterators behave when combined with asynchronous operations or streams?

Accepted Answer

Standard iterators are synchronous and block until each element is available. When dealing with asynchronous streams, you must use async iterators and async for loops.

Python provides the __aiter__() and __anext__() methods for asynchronous iteration, allowing integration with async generators, network I/O, or event-driven streams without blocking the main thread.

This separation ensures that large-scale real-time data processing, such as consuming messages from a queue or streaming logs, can be efficiently handled using iterator patterns while maintaining non-blocking concurrency.

Question 28

Which scenarios require careful iterator handling to avoid unexpected behavior?

Accepted Answer

Iterators are stateful, so sharing them across threads or reusing them after exhaustion can lead to lost data or inconsistent processing.

Functions that consume iterators fully can leave the caller with an empty iterator, which may not be expected unless the developer explicitly accounts for it.

Question 29

Create a Python generator that yields an infinite arithmetic sequence with a given start and step.

Accepted Answer

The generator maintains a current value and increments it by the step size on each iteration.

Because it is infinite, it never raises StopIteration, and elements are produced lazily as needed.

This pattern is useful in simulations, scheduling, or generating predictable sequences in streaming applications.

Question 30

Write a Python iterator that merges two sorted iterators into a single sorted output.

Accepted Answer

The iterator maintains the next element from each input iterator and always yields the smaller one, advancing the corresponding iterator.

This allows efficient, memory-friendly merging of two sorted sequences without creating intermediate lists.

Such iterators are widely used in external sorting, merging logs, or streaming sorted datasets from multiple sources.

Python Iterators