Streaming with OpenAIChatCompletionClient raises an `empty_chunk` exception when usage is requested, fixed using max_consecutive_empty_chunk_tolerance
#5078
The model client documentation suggests this fix for missing token usage counts in an OpenAIChatCompletionClient streaming response:
set extra_create_args={"stream_options": {"include_usage": True}}
However, the final message from the server, which carries the requested usage information, raises an exception due to an 'empty chunk' check in the streaming processor implementation (openai/_openai_client.py).
Code to reproduce in autogen 0.41:
import asyncio
from autogen_core.models import UserMessage
from autogen_ext.models.openai import OpenAIChatCompletionClient

async def main() -> None:
    model_client = OpenAIChatCompletionClient(model="gpt-4o-mini", api_key=api_key)  # api_key defined elsewhere
    stream = model_client.create_stream(
        messages=[UserMessage(content="Tell me a story about pirates", source="user")],
        extra_create_args={"stream_options": {"include_usage": True}},
    )
    async for response in stream:
        print(response)  # ValueError raised by logic in _openai_client.py's create_stream

asyncio.run(main())
My workaround was to set max_consecutive_empty_chunk_tolerance to 2; per the comments in the source, that parameter was added to work around a problem with the Azure endpoint.
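For reference, here is the workaround applied to the reproduction above (a minimal sketch; it assumes the same model_client and imports as before, and that this autogen version accepts max_consecutive_empty_chunk_tolerance as a keyword argument to create_stream):

stream = model_client.create_stream(
    messages=[UserMessage(content="Tell me a story about pirates", source="user")],
    extra_create_args={"stream_options": {"include_usage": True}},
    # Tolerate the empty final chunk that carries only the usage information.
    max_consecutive_empty_chunk_tolerance=2,
)
async for response in stream:
    print(response)  # the final CreateResult should now carry usage instead of raising

With the tolerance raised, the stream completes and the usage counts arrive on the final result.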
Could you submit a PR to update the API docs of the create_stream methods of both OpenAIChatCompletionClient and AzureOpenAIChatCompletionClient with your error resolution? cc @MohMaz
ekzhu changed the title from "Streaming with OpenAIChatCompletionClient raises an exception when usage is requested" to "Streaming with OpenAIChatCompletionClient raises an `empty_chunk` exception when usage is requested, fixed using max_consecutive_empty_chunk_tolerance" on Jan 16, 2025.