refactor(streaming): remove ChatNVIDIA streaming patch #1607

Pouyanpi · 2026-01-29T12:40:21Z

Summary

Remove the custom _langchain_nvidia_ai_endpoints_patch.py module that patched ChatNVIDIA with streaming decorators
Update _init_nvidia_model to use standard ChatNVIDIA from langchain_nvidia_ai_endpoints directly
Remove associated tests and mock classes for the patched ChatNVIDIA
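
To make the second point concrete, here is a minimal sketch of what an initializer that uses the upstream class directly could look like. The function name `init_nvidia_model_sketch` and its signature are hypothetical and are not copied from the repository. The only things taken from this PR are the import path `langchain_nvidia_ai_endpoints.ChatNVIDIA` and the fact that no patched subclass sits in between.

```python
from typing import Any, Dict


def init_nvidia_model_sketch(model_name: str, kwargs: Dict[str, Any]) -> Any:
    """Hypothetical stand-in for an initializer that uses the upstream ChatNVIDIA."""
    try:
        # Import the upstream class directly; no local patched subclass is involved.
        from langchain_nvidia_ai_endpoints import ChatNVIDIA
    except ImportError as exc:
        raise ImportError(
            "Could not import langchain_nvidia_ai_endpoints; install it with "
            "`pip install langchain-nvidia-ai-endpoints`."
        ) from exc

    # Extra keyword arguments are forwarded unchanged. Whether a given key
    # (for example `streaming`) is accepted is decided by the upstream package.
    return ChatNVIDIA(model=model_name, **kwargs)
```

Keeping the import inside the function means a missing optional dependency only fails when an NVIDIA provider is actually requested.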

greptile-apps · 2026-01-29T12:43:23Z

Greptile Overview

Greptile Summary

This PR removes the custom ChatNVIDIA streaming patch that was previously needed to enable streaming functionality for NVIDIA AI Endpoints and NIM models. The refactor simplifies the codebase by:

Removing the custom patch module (_langchain_nvidia_ai_endpoints_patch.py) that wrapped ChatNVIDIA with streaming decorators
Updating _init_nvidia_model to import ChatNVIDIA directly from langchain_nvidia_ai_endpoints
Removing 379 lines of tests for the patch's custom streaming decorators
Cleaning up pyproject.toml to remove the coverage exclusion for the deleted file

The changes align with the base branch refactor/drop-streaming-callback which aims to remove LangChain callback dependencies from the streaming infrastructure. This refactor assumes the upstream langchain_nvidia_ai_endpoints package now natively supports streaming, making the custom patch obsolete.

Key considerations:

Verify that streaming still works with native ChatNVIDIA in integration tests
Check that the streaming parameter can still be passed through kwargs if needed
Ensure backward compatibility for existing configurations using nim and nvidia_ai_endpoints providers
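
The first consideration could be checked with a small helper along these lines. This is a sketch, not a test from this PR: `collect_stream` is a hypothetical name, and the only API it relies on is the standard LangChain `stream()` method, whose chunks expose a `content` attribute.

```python
from typing import Any, List


def collect_stream(llm: Any, prompt: str) -> List[str]:
    """Consume `llm.stream(prompt)` and return the text of each non-empty chunk."""
    pieces: List[str] = []
    for chunk in llm.stream(prompt):
        # Chunk content is usually a string; skip empty or non-string payloads.
        text = getattr(chunk, "content", "")
        if isinstance(text, str) and text:
            pieces.append(text)
    return pieces
```

In an integration test, `llm` would be a real `ChatNVIDIA` instance created for the `nim` or `nvidia_ai_endpoints` provider, and the assertion would be that the returned list is non-empty and arrives as more than one chunk.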

Confidence Score: 4/5

This PR is safe to merge with minor risk
The refactor cleanly removes custom streaming patch code that is no longer needed, assuming the upstream langchain_nvidia_ai_endpoints package now natively supports the streaming functionality that was previously patched in. The changes are well-scoped (removes patch module and its tests) and align with the base branch's goal of dropping LangChain callback dependencies. However, the PR lacks explicit verification that native streaming still works.
Verify that streaming functionality still works with the native ChatNVIDIA implementation in integration tests

Important Files Changed

Filename	Overview
nemoguardrails/llm/models/langchain_initializer.py	removed custom patch import, now uses standard `ChatNVIDIA` from `langchain_nvidia_ai_endpoints`, simplified docstring and error handling, added type hint to `_PROVIDER_INITIALIZERS`
nemoguardrails/llm/providers/_langchain_nvidia_ai_endpoints_patch.py	removed 107-line custom patch module that wrapped `ChatNVIDIA` with streaming decorators

Sequence Diagram

sequenceDiagram
    participant App as Application
    participant Init as langchain_initializer
    participant Patch as _langchain_nvidia_ai_endpoints_patch (REMOVED)
    participant Native as langchain_nvidia_ai_endpoints.ChatNVIDIA
    
    Note over App,Native: BEFORE: Custom Patch Flow
    App->>Init: Initialize NVIDIA model
    Init->>Patch: Import custom ChatNVIDIA
    Patch->>Native: Inherit from ChatNVIDIAOriginal
    Patch->>Patch: Apply stream_decorator
    Patch->>Patch: Apply async_stream_decorator
    Patch-->>Init: Return patched ChatNVIDIA
    Init-->>App: Return model with streaming support
    
    Note over App,Native: AFTER: Native Implementation Flow
    App->>Init: Initialize NVIDIA model
    Init->>Native: Import ChatNVIDIA directly
    Native-->>Init: Return ChatNVIDIA
    Init-->>App: Return model with native streaming

greptile-apps

2 files reviewed, no comments

Remove the custom ChatNVIDIA patch that added streaming decorators. Now using the standard ChatNVIDIA from langchain_nvidia_ai_endpoints directly since LangChain callback-based streaming has been dropped.

codecov · 2026-01-29T12:50:14Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

trebedea

👍 LGTM

greptile-apps bot reviewed Jan 29, 2026

refactor(streaming): remove ChatNVIDIA streaming patch

48a77d8

Pouyanpi force-pushed the refactor/remove-chatnvidia-patch branch from a97a9be to 48a77d8 Compare January 29, 2026 12:44

trebedea approved these changes Jan 29, 2026
