I currently use a Gateway API implementation with the inference extension to perform functionality similar to the vLLM router. I would like to use vllm-stack, but with my current router implementation. Will you consider integrating with the inference extension as a supported router implementation?
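For context, a setup like this typically routes traffic through an HTTPRoute whose backendRef points at the inference extension's pool resource rather than a plain Service. The sketch below is illustrative only; the group/kind used for the inference extension backend and the resource names (`inference-gateway`, `vllm-pool`) are assumptions, not taken from this thread.

```yaml
# Illustrative sketch of the kind of setup described above (names are assumptions).
apiVersion: gateway.networking.k8s.io/v1
kind: HTTPRoute
metadata:
  name: llm-route
spec:
  parentRefs:
    - name: inference-gateway                    # assumed Gateway name
  rules:
    - backendRefs:
        - group: inference.networking.x-k8s.io   # inference extension API group (assumed)
          kind: InferencePool                    # pool of vLLM model-server pods (assumed kind)
          name: vllm-pool
```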
danehans changed the title from "Pluggable router Implementations" to "Pluggable Router Implementations" on Jan 29, 2025
Thanks for asking!
We don't have an immediate plan to integrate with the inference extension API, but we will absolutely consider it as this project grows.
An alternative is to disable the router and connect the Gateway API directly to the vLLM service. (Note that the Helm chart currently does not support disabling the router via values.yaml, so feel free to create an issue for that if you'd like, and we can work on it.)
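For reference, bypassing the router could look roughly like the following: the gateway's HTTPRoute sends traffic straight to the vLLM serving engine Service created by the chart. The Service name (`vllm-engine-service`) and port (8000, vLLM's default) are assumptions; check the names your release actually creates.

```yaml
# Rough sketch: route gateway traffic directly to the vLLM engine Service,
# skipping the stack's router. Service name and port are assumptions.
apiVersion: gateway.networking.k8s.io/v1
kind: HTTPRoute
metadata:
  name: vllm-direct
spec:
  parentRefs:
    - name: my-gateway               # your existing Gateway
  rules:
    - matches:
        - path:
            type: PathPrefix
            value: /v1               # OpenAI-compatible API prefix served by vLLM
      backendRefs:
        - name: vllm-engine-service  # assumed name of the chart's engine Service
          port: 8000                 # vLLM's default serving port
```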
@ApostaC thanks for the feedback. I created #66 to track disabling the router via Helm values. I would like to keep this issue open so production-stack can consider supporting the Gateway API with the inference extension.
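A values.yaml toggle for this might look something like the sketch below; the key name is purely hypothetical until #66 is resolved.

```yaml
# Hypothetical values.yaml snippet for #66 -- the actual key name is not yet decided.
routerSpec:
  enableRouter: false   # would skip rendering the router Deployment/Service
```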