feat: enable function calling support for streaming responses #102
Pavilion4ik wants to merge 2 commits into openedx:main from
Conversation
Thanks for the pull request, @Pavilion4ik! This repository is currently maintained by . Once you've gone through the following steps, feel free to tag them in a comment and let them know that your changes are ready for engineering review.

🔘 Get product approval
If you haven't already, check this list to see if your contribution needs to go through the product review process.

🔘 Provide context
To help your reviewers and other members of the community understand the purpose and larger context of your changes, feel free to add as much of the following information to the PR description as you can.

🔘 Get a green build
If one or more checks are failing, continue working on your changes until this is no longer the case and your build turns green.

Where can I find more information?
If you'd like to get more details on all aspects of the review process for open source pull requests (OSPRs), check out the following resources:

When can I expect my changes to be merged?
Our goal is to get community contributions seen and reviewed as efficiently as possible. However, the amount of time that it takes to review and merge a PR can vary significantly based on factors such as:

💡 As a result it may take up to several weeks or months to complete a review and merge your PR.
Codecov Report
✅ All modified and coverable lines are covered by tests.
Additional details and impacted files

@@ Coverage Diff @@
## main #102 +/- ##
==========================================
+ Coverage 91.22% 91.27% +0.05%
==========================================
Files 51 51
Lines 4547 4724 +177
Branches 276 298 +22
==========================================
+ Hits 4148 4312 +164
- Misses 311 320 +9
- Partials 88 92 +4
Flags with carried forward coverage won't be shown.
☔ View full report in Codecov by Sentry.
Hi @Pavilion4ik, I did not notice this was open. I assigned myself to take a look soon.
Hi @Pavilion4ik, I haven't checked the static code, but I tried to run it and got an error:
My profile's config:
- Removed the restriction in LitellmProcessor that disabled streaming when tools are present
- Implemented `_handle_streaming_tool_calls` in LLMProcessor to aggregate chunks, reconstruct tool calls, and handle recursion
- Updated `_completion_with_tools` to delegate to the streaming handler when `stream=True`
- Added unit tests covering streaming tool calls and recursive execution

# Conflicts:
# backend/tests/test_litellm_base_processor.py
# backend/tests/test_llm_processor.py
- Implemented recursive tool execution for streaming via the Responses API
- Refactored streaming logic into `_execute_stream_tool_call` to reduce nesting
- Added token usage tracking and session persistence for streamed threads
- Added unit tests for recursive streaming and tool call synchronization
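For readers skimming the commits, here is a minimal sketch of the delegation the first commit describes. The method names mirror the commit message, but the signatures and bodies below are assumptions, not the PR's actual code:

```python
class LLMProcessorSketch:
    """Illustrative only; shows the shape of the delegation, not the real processor."""

    def _completion_with_tools(self, messages, tools, stream=False):
        # When streaming is requested, hand off to the generator-based handler;
        # otherwise fall back to the original blocking path.
        if stream:
            return self._handle_streaming_tool_calls(messages, tools)
        return self._blocking_completion_with_tools(messages, tools)

    def _handle_streaming_tool_calls(self, messages, tools):
        # Generator: yields text deltas, buffers tool-call fragments, executes
        # the tools once the stream ends, then recurses to stream the final answer.
        yield from ()  # placeholder body

    def _blocking_completion_with_tools(self, messages, tools):
        # Single completion call plus a tool-execution loop, returned whole.
        raise NotImplementedError
```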
6c44596 to b518869


This PR refactors the LiteLLM-based processors to support streaming responses even when OpenAI function calling (tools) is enabled. Specifically, it includes:
- Chunk Aggregation: Added logic to buffer streaming chunks in LLMProcessor, reconstruct fragmented tool call arguments, and execute the tools once the stream for that specific call is complete (see the first sketch after this list).
- Recursive Streaming: Implemented `yield from` recursion in `_handle_streaming_tool_calls` so the LLM can call a function, receive the output, and continue streaming the final text response to the user.
- Educator Processor Update: Enabled streaming in EducatorAssistantProcessor for general chat, while explicitly forcing non-streaming mode for `generate_quiz_questions`, since it requires full-response JSON validation and retry logic (see the second sketch below).
- Unit Tests: Added comprehensive tests to verify that streaming works correctly with single and multiple tool calls.
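To make the flow concrete, here is a minimal, self-contained sketch of the streaming tool-call loop, assuming litellm's OpenAI-compatible chat completions API. The model name and the `execute_tool` dispatch are placeholders, and the real `_handle_streaming_tool_calls` differs in structure, token tracking, and error handling:

```python
import json

import litellm


def execute_tool(name, arguments):
    """Placeholder for the processor's actual tool dispatch."""
    raise NotImplementedError


def stream_with_tools(messages, tools, model="gpt-4o"):
    """Yield text deltas; if the model emits tool calls, run them and recurse."""
    response = litellm.completion(model=model, messages=messages, tools=tools, stream=True)

    pending = {}  # tool-call index -> accumulated {"id", "name", "arguments"}
    for chunk in response:
        delta = chunk.choices[0].delta
        if delta.content:
            yield delta.content  # immediate feedback: text streams as it arrives
        for tc in delta.tool_calls or []:
            entry = pending.setdefault(tc.index, {"id": "", "name": "", "arguments": ""})
            if tc.id:
                entry["id"] = tc.id
            if tc.function and tc.function.name:
                entry["name"] += tc.function.name
            if tc.function and tc.function.arguments:
                entry["arguments"] += tc.function.arguments  # fragments arrive piecewise

    if pending:
        calls = [pending[i] for i in sorted(pending)]
        # Record the assistant turn that requested the tools ...
        messages.append({
            "role": "assistant",
            "tool_calls": [
                {"id": c["id"], "type": "function",
                 "function": {"name": c["name"], "arguments": c["arguments"]}}
                for c in calls
            ],
        })
        # ... execute each tool and feed the results back ...
        for c in calls:
            result = execute_tool(c["name"], json.loads(c["arguments"]))
            messages.append({"role": "tool", "tool_call_id": c["id"], "content": str(result)})
        # ... then recurse so the model can stream its final answer over the tool output.
        yield from stream_with_tools(messages, tools, model=model)
```

Text deltas are yielded as soon as they arrive, tool-call fragments are buffered by index until the stream ends, and `yield from` re-enters the generator so the final response streams as well.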
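And a small sketch of the per-feature streaming decision in EducatorAssistantProcessor; the function name and signature here are hypothetical, but they capture the rule stated above:

```python
def should_stream(requested_stream: bool, action: str) -> bool:
    """Stream general chat; force non-streaming for quiz generation."""
    if action == "generate_quiz_questions":
        # The quiz flow validates (and may retry on) the complete JSON payload,
        # so it needs the whole response at once.
        return False
    return requested_stream
```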
Why?
Previously, LitellmProcessor explicitly disabled streaming if any tools were configured. This made for a poor user experience: users had to wait for the entire generation to finish before seeing any text, simply because a tool might have been used. This change gives the best of both worlds: immediate feedback via streaming for text responses, and correct execution of background functions when the model decides to use them.