[RoadMap] Mooncake Roadmap Q1 & Q2 2025 #44
Comments
The roadmap looks fantastic! What does "Cluster reconfiguration" mean? Does it refer to dynamically adjusting the P/D role type?
Hello, I am interested in the "Transport support for CXL/shared memory" item, and I may be able to implement it within a month.
Yes, it has two meanings. Firstly, any GPU server can freely join or leave the Mooncake KVCache pool. Secondly, a Prefill or Decoding Instance can change its role type.
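To make the two meanings concrete, here is a minimal sketch in Python. It is purely illustrative: `ClusterView`, `Role`, and every method name are hypothetical and are not part of Mooncake's API. It assumes pool membership is tracked per server and role type per instance.

```python
from dataclasses import dataclass, field
from enum import Enum


class Role(Enum):
    PREFILL = "prefill"
    DECODE = "decode"


@dataclass
class ClusterView:
    """Hypothetical in-memory view of a cluster (not a Mooncake class)."""
    pool_servers: set = field(default_factory=set)      # servers in the KVCache pool
    instance_roles: dict = field(default_factory=dict)  # instance name -> Role

    # Meaning 1: a GPU server can join or leave the KVCache pool at any time.
    def join_pool(self, server: str) -> None:
        self.pool_servers.add(server)

    def leave_pool(self, server: str) -> None:
        self.pool_servers.discard(server)

    # Meaning 2: a Prefill or Decoding instance can change its role type.
    def set_role(self, instance: str, role: Role) -> None:
        self.instance_roles[instance] = role

    def switch_role(self, instance: str) -> Role:
        current = self.instance_roles[instance]
        new_role = Role.DECODE if current is Role.PREFILL else Role.PREFILL
        self.instance_roles[instance] = new_role
        return new_role


view = ClusterView()
view.join_pool("gpu-0")
view.join_pool("gpu-1")
view.leave_pool("gpu-0")            # the pool shrinks without a restart
view.set_role("inst-a", Role.PREFILL)
view.switch_role("inst-a")          # inst-a now serves decoding
```

A real implementation would also have to migrate or invalidate KVCache entries held by a departing server, which this sketch leaves out.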
Thank you for your contribution. I look forward to seeing the pull request on GitHub.
Thank you for your contribution! Looking forward to seeing the pull request on GitHub.
I think it is quite hard to implement "ZeroCopy from vLLM to RDMA memory" and "Layer-by-layer pipeline" without heavily modifying core components of vLLM. Is there an easier way to implement "ZeroCopy" and "Layer-by-layer pipeline"?
Is the functionality provided by this Mooncake Managed Object Store similar to that of an in-memory database like Redis?
We categorized our roadmap into two major themes: New Component (Mooncake Managed Object Store) and New Features of Mooncake. As we are seeing more.
New Component: Mooncake Managed Object Store
25Q1
25Q2+
New Features of Mooncake
Transfer Engine
P2P Store
LLM Framework Integration
If an item you want is not on the roadmap, your suggestions and contributions are still welcome! Please feel free to comment in this thread, open a feature request, or create an RFC.