Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add envoy gateway streaming support #377

Merged
merged 18 commits into from
Nov 14, 2024
Merged

Add envoy gateway streaming support #377

merged 18 commits into from
Nov 14, 2024

Conversation

varungup90
Copy link
Collaborator

@varungup90 varungup90 commented Nov 12, 2024

Address #380

@varungup90 varungup90 changed the title Add completition chunk Add streaming support Nov 12, 2024
@Jeffwan Jeffwan changed the title Add streaming support Add envoy gateway streaming support Nov 12, 2024
@varungup90 varungup90 force-pushed the add-completition-chunk branch from be36bca to 8ac5876 Compare November 13, 2024 19:24
Copy link
Collaborator

@Jeffwan Jeffwan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

overall looks good to me. I left some questions

@Jeffwan Jeffwan merged commit 603ce51 into main Nov 14, 2024
10 checks passed
@Jeffwan Jeffwan deleted the add-completition-chunk branch November 14, 2024 20:00
varungup90 added a commit that referenced this pull request Jan 9, 2025
* Add reference grant to support httprouting for different namespace

* lint fix

* create reference grant per namespace

* refactor validate routing strategy

* test

* Add streaming support

* nit

* bug fix for streaming

* nit

* nit comment update

* comment model name check from cache till we use static lora
Jeffwan pushed a commit that referenced this pull request Jan 9, 2025
* Add envoy gateway streaming support (#377)

* Add reference grant to support httprouting for different namespace

* lint fix

* create reference grant per namespace

* refactor validate routing strategy

* test

* Add streaming support

* nit

* bug fix for streaming

* nit

* nit comment update

* comment model name check from cache till we use static lora

* Add client traffic policy to increase per connection buffer size from 32kb to 256kb (#395)

* Add client traffic policy to increase per connection buffer size

* rename client traffic policy
gangmuk pushed a commit that referenced this pull request Jan 25, 2025
* Add reference grant to support httprouting for different namespace

* lint fix

* create reference grant per namespace

* refactor validate routing strategy

* test

* Add streaming support

* nit

* bug fix for streaming

* nit

* nit comment update

* comment model name check from cache till we use static lora
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants