@anthonymq anthonymq commented Jan 14, 2026

Summary

This PR fixes memory spikes that can cause OOM (Out of Memory) errors in the _interactions layer when handling file uploads and inline data.

Problem

Several functions in the _interactions layer were loading entire files into memory before processing, causing OOM errors for large files.

Note: The main aio.files.upload() API (files.py / _api_client.py) is already memory-efficient: it streams in 8MB chunks via anyio.Path().open('rb'). The OOM issues occur in the experimental _interactions layer.

Root Causes Identified

  1. _interactions/_files.py: Used path.read_bytes() to load entire files before passing to httpx
  2. _interactions/_utils/_transform.py: Base64 encoding loaded entire files with read_bytes() before encoding
  3. _interactions/_utils/_utils.py: file_from_path() loaded entire files with read_bytes()

Solution

1. File Handle Streaming (_interactions/_files.py)

  • Return open file handles (open(path, 'rb')) instead of loading bytes with read_bytes()
  • httpx natively supports IO[bytes] file handles, so this is a drop-in fix
  • Applies to both sync and async code paths
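The handle-based approach can be sketched as follows (the helper name `to_httpx_file` is hypothetical, not the PR's actual internal function; it only illustrates the bytes-vs-handle swap):

```python
from pathlib import Path
from typing import IO


def to_httpx_file(path: str) -> IO[bytes]:
    # Return an open binary handle instead of the file's bytes.
    # httpx accepts IO[bytes] in its `files=`/content arguments and
    # streams from the handle, so the whole file never needs to sit
    # in memory at once.
    return Path(path).open("rb")


# Before (peak memory ~ file size):
#   data = Path(path).read_bytes()
# After (peak memory ~ httpx's internal read chunk size):
#   handle = to_httpx_file(path)
```

Because httpx duck-types on `.read()`, swapping bytes for a handle is transparent to the request-building code.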

2. Chunked Base64 Encoding (_interactions/_utils/_transform.py)

  • Implement chunked reading with 3MB chunks for base64 encoding
  • Chunk size is a multiple of 3 (required for correct base64 encoding without padding issues)
  • Reduces peak memory from O(file_size) to O(chunk_size)
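A minimal sketch of the chunked-encoding idea (the function name `b64_encode_chunked` is illustrative, not the PR's actual helper):

```python
import base64
from pathlib import Path
from typing import Iterator

# Chunk size must be a multiple of 3: base64 maps every 3 input bytes
# to 4 output characters, so a multiple-of-3 chunk encodes with no
# internal '=' padding, and concatenating per-chunk outputs equals
# encoding the whole file at once.
_CHUNK_SIZE = 3 * 1024 * 1024  # 3MB, divisible by 3


def b64_encode_chunked(path: str) -> Iterator[bytes]:
    # Yield base64-encoded chunks; peak memory is O(chunk_size),
    # not O(file_size).
    with Path(path).open("rb") as f:
        while chunk := f.read(_CHUNK_SIZE):
            yield base64.b64encode(chunk)
```

Joining the yielded chunks (`b"".join(...)`) reproduces `base64.b64encode()` of the full file byte-for-byte.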

3. File Handle in Utility (_interactions/_utils/_utils.py)

  • file_from_path() now returns a file handle instead of loaded bytes

Memory Flow (Before vs After)

Before:

File path → read_bytes() (🔴 entire file in memory) → process

After:

File path → open() → file handle → stream in chunks (✅ memory-efficient)

Testing

  • Verified that the modified files compile without syntax errors
  • Verified that httpx accepts the file-handle types returned by the new code paths
  • Verified that chunked base64 encoding produces output identical to the original implementation

Backwards Compatibility

This is a fully backwards-compatible change:

  • Public API remains unchanged
  • Return types are compatible (httpx accepts both bytes and IO[bytes])
  • Base64 output is identical (chunked encoding with multiples of 3 produces same result)
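The multiple-of-3 claim is easy to check directly: a chunk whose length is a multiple of 3 encodes with no internal `=` padding, so concatenated per-chunk outputs match one-shot encoding, while any other chunk size injects padding mid-stream. A small self-contained demonstration (names are illustrative):

```python
import base64


def encode_in_chunks(data: bytes, chunk_size: int) -> bytes:
    # Encode each slice independently and concatenate the results.
    return b"".join(
        base64.b64encode(data[i:i + chunk_size])
        for i in range(0, len(data), chunk_size)
    )


data = b"x" * 10

# Chunk size that is a multiple of 3: identical to one-shot encoding.
assert encode_in_chunks(data, 6) == base64.b64encode(data)

# Chunk size NOT a multiple of 3: '=' padding is injected mid-stream,
# corrupting the combined output.
assert encode_in_chunks(data, 4) != base64.b64encode(data)
```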

@janasangeetha janasangeetha self-assigned this Jan 16, 2026
@janasangeetha janasangeetha added the size:XL Code changes > 100 lines label Jan 16, 2026
@janasangeetha
Collaborator

Hey @anthonymq
Thanks for contributing!
The branch is out of date. Kindly update it.

This change addresses memory spikes that can cause OOM errors when
uploading large files to the Gemini File API.

Changes:
1. _interactions/_files.py: Return open file handles instead of loading
   entire files into memory with read_bytes(). httpx supports IO[bytes]
   directly, so there's no need to pre-load file contents.

2. _interactions/_utils/_transform.py: Implement chunked base64 encoding
   using 3MB chunks (must be multiple of 3 for base64 correctness) to
   reduce peak memory usage when encoding files for inline data.

The existing chunked upload mechanism in _api_client.py (8MB chunks)
was already correct, but files were being loaded into memory before
reaching that code path. This fix ensures memory-efficient handling
from the start of the upload flow.
Additional fix for the _interactions layer - file_from_path was loading
entire files with read_bytes() when it can return a file handle instead.
@anthonymq anthonymq force-pushed the fix/memory-efficient-file-uploads branch from bf6914f to ccf8a52 on January 16, 2026 08:41
@anthonymq
Author

Branch updated,
Cheers
