
[Feature]: Chat inputs to AsyncLLMEngine #14289

Open · 1 task done
sfc-gh-mkrubinski opened this issue Mar 5, 2025 · 1 comment
Labels: feature request (New feature or request)

@sfc-gh-mkrubinski
🚀 The feature, motivation and pitch

Currently, only the LLM class, which is meant for offline inference, supports the chat method.
Are there any plans to implement a similar method for AsyncLLMEngine, besides the existing generate?
Alternatively, is there any work on extending the PromptType accepted by generate to include more prompt variants, such as chat conversations?
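
For context, a minimal sketch of the asymmetry as it stands today (the model name is illustrative, and API details may differ across vLLM versions): LLM.chat applies the model's chat template for you, while with AsyncLLMEngine the template has to be rendered manually before calling generate with a plain-string PromptType.

```python
import asyncio
import uuid

from vllm import LLM, SamplingParams
from vllm.engine.arg_utils import AsyncEngineArgs
from vllm.engine.async_llm_engine import AsyncLLMEngine

MODEL = "meta-llama/Llama-3.1-8B-Instruct"  # illustrative model choice
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "What is vLLM?"},
]
params = SamplingParams(max_tokens=128)

# Offline path: LLM.chat applies the model's chat template internally.
llm = LLM(model=MODEL)
print(llm.chat(messages, params)[0].outputs[0].text)

# Async path: AsyncLLMEngine has no chat(), so the chat template must be
# rendered manually before calling generate() with a plain string prompt.
# (In practice you would run only one of these engines per process.)
async def main() -> None:
    engine = AsyncLLMEngine.from_engine_args(AsyncEngineArgs(model=MODEL))
    tokenizer = await engine.get_tokenizer()
    prompt = tokenizer.apply_chat_template(
        messages, tokenize=False, add_generation_prompt=True
    )
    final = None
    async for output in engine.generate(prompt, params, str(uuid.uuid4())):
        final = output  # generate() streams incremental RequestOutputs
    print(final.outputs[0].text)

asyncio.run(main())
```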

Alternatives

No response

Additional context

No response

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.
sfc-gh-mkrubinski added the feature request (New feature or request) label on Mar 5, 2025
@DarkLight1337
Member

I think there is not much reason to do this now, because the interface of the async engine will change significantly in V1. Maybe after the API is more stable? cc @robertgshaw2-redhat
