Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Pluggable Router Implementations #36

Open
danehans opened this issue Jan 29, 2025 · 3 comments
Open

Pluggable Router Implementations #36

danehans opened this issue Jan 29, 2025 · 3 comments
Labels
feature request New feature or request

Comments

@danehans
Copy link

I currently use a Gateway API implementation with an inference extension to perform similar functionality as the vLLM router. I would like to use vllm-stack but with my current router implementation. Will you consider integrating with inference extension as a supported router implementation?

@danehans danehans changed the title Pluggable router Implementations Pluggable Router Implementations Jan 29, 2025
@KuntaiDu
Copy link
Collaborator

Oh good suggestion. I am less familiar with k8s inference extension, so I'll involve @ApostaC and we will investigate and get back to you.

@ApostaC
Copy link
Collaborator

ApostaC commented Jan 29, 2025

Thanks for asking!
We don't have an immediate plan to integrate with the inference extension API for now. But will absolutely consider it as this project grows.

An alternative solution is to disable the router and directly connect the gateway API to the vLLM service. (Note that currently the helm chart does not support disabling router via values.yaml, so maybe create an issue for this if you want so that we can work on that?)

@danehans
Copy link
Author

danehans commented Feb 5, 2025

@ApostaC thanks for the feedback. I created #66 to support disabling the router via Helm values. I would like to keep this issue open so production-stack can consider supporting Gateway API with inference extension.

@ApostaC ApostaC added the feature request New feature or request label Feb 7, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants