@LifeJiggy

Summary

This PR adds a BatchProcessor utility class that groups multiple requests for efficient processing, helping developers reduce per-request overhead and make better use of the API.

Problem

When making multiple API calls, developers often need to batch requests for efficiency, but currently have no built-in way to do this. This leads to:

  • Inefficient API usage with many small requests
  • Manual batching logic scattered throughout applications
  • Difficulty managing timeouts and batch sizes
  • Poor performance for bulk operations

Solution

Add BatchProcessor class with:

  • Configurable batch size limits
  • Timeout-based automatic processing
  • Simple API for checking if requests can be made
  • Force processing capability for immediate batch handling
  • Callback-based batch processing
  • Thread-safe implementation using standard library

Key Features

  • Configurable Batch Size: Set maximum items per batch
  • Timeout Processing: Automatic processing after timeout
  • Force Processing: Immediate batch processing when needed
  • Callback Integration: Custom processing logic via callbacks
  • Thread Safe: Uses standard library only, no external dependencies
  • Simple API: Easy to integrate into existing workflows
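The feature set above can be sketched as a minimal, thread-safe implementation. This is an illustrative sketch based on the description only; internal names such as `_flush_locked` are assumptions, and the actual `gradient._utils.BatchProcessor` may differ:

```python
import threading

class BatchProcessor:
    """Illustrative sketch: buffers items and flushes them to a callback
    when the batch fills or a timeout elapses (stdlib only)."""

    def __init__(self, batch_size=10, timeout_seconds=5.0):
        self.batch_size = batch_size
        self.timeout_seconds = timeout_seconds
        self._items = []
        self._callback = None
        self._lock = threading.Lock()
        self._timer = None

    def set_callback(self, callback):
        # Callback receives the full list of buffered items.
        self._callback = callback

    def add(self, item):
        with self._lock:
            self._items.append(item)
            if len(self._items) >= self.batch_size:
                # Size limit reached: flush immediately.
                self._flush_locked()
            elif self._timer is None:
                # First item of a new batch: arm the timeout.
                self._timer = threading.Timer(
                    self.timeout_seconds, self.force_process
                )
                self._timer.daemon = True
                self._timer.start()

    def force_process(self):
        # Flush whatever is buffered right now.
        with self._lock:
            self._flush_locked()

    def _flush_locked(self):
        # Caller must hold self._lock.
        if self._timer is not None:
            self._timer.cancel()
            self._timer = None
        if self._items and self._callback:
            batch, self._items = self._items, []
            self._callback(batch)
```

The lock serializes `add` and timer-driven flushes, so the callback always sees each item exactly once even when the timeout fires concurrently with new additions.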

Benefits

  • Reduces API overhead for bulk operations
  • Improves request efficiency and throughput
  • Simplifies batch processing logic
  • Better resource utilization
  • Automatic timeout handling

Testing

Added comprehensive test suite covering:

  • Basic batch processing operations
  • Size-based automatic processing
  • Timeout-based processing
  • Force processing functionality
  • Multiple batch scenarios
  • Edge cases and error conditions

All tests pass with full coverage of batch processing functionality.

Usage Examples

from gradient._utils import BatchProcessor

# Create batch processor for API requests
processor = BatchProcessor(batch_size=10, timeout_seconds=5.0)

def process_batch(requests):
    # Process a batch of requests; `client` is an existing Gradient
    # client instance configured elsewhere.
    responses = []
    for req in requests:
        response = client.chat.completions.create(**req)
        responses.append(response)
    return responses

processor.set_callback(process_batch)

# Add requests to batch
for i in range(15):
    request = {
        "messages": [{"role": "user", "content": f"Question {i}"}],
        "model": "llama3.3-70b-instruct"
    }
    processor.add(request)
    # A full batch is processed automatically once 10 requests accumulate

# Flush the 5 requests still buffered after the loop
processor.force_process()
