[RoadMap] Mooncake Roadmap Q1 & Q2 2025 #44

stmatengss · 2024-12-18T02:42:28Z

VegetaPn · 2024-12-20T12:06:43Z

The roadmap looks fantastic! What does "Cluster reconfiguration" mean? Does it refer to dynamically adjusting the P/D role type?

SkylarKBKB · 2024-12-22T11:08:25Z

Hello, I am interested in the Transport support CXL/shared memory and maybe I can do this in one month.

stmatengss · 2024-12-22T13:06:37Z

> The roadmap looks fantastic! What does "Cluster reconfiguration" mean? Does it refer to dynamically adjusting the P/D role type?

Yes, it has two meanings. Firstly, any GPU server can freely join or leave the Mooncake KVCache pool. Secondly, a Prefill or Decoding Instance can change its role type.
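
To make the two meanings concrete, here is a minimal sketch of a membership view that supports both operations. Everything in it (`ClusterView`, `Role`, the method names, the node ids) is hypothetical and only for illustration; it is not a Mooncake API, and the real design may look quite different.

```python
from dataclasses import dataclass, field
from enum import Enum


class Role(Enum):
    PREFILL = "prefill"
    DECODE = "decode"


@dataclass
class ClusterView:
    """Hypothetical in-memory view of cluster membership (not a Mooncake API)."""

    nodes: dict = field(default_factory=dict)  # node id -> Role

    def join(self, node_id: str, role: Role) -> None:
        # Meaning 1: a GPU server joins the KVCache pool with an initial role.
        self.nodes[node_id] = role

    def leave(self, node_id: str) -> None:
        # Meaning 1: a server leaves the pool; unknown ids are ignored.
        self.nodes.pop(node_id, None)

    def switch_role(self, node_id: str, role: Role) -> None:
        # Meaning 2: an existing instance changes between prefill and decode.
        if node_id not in self.nodes:
            raise KeyError(f"unknown node: {node_id}")
        self.nodes[node_id] = role

    def members(self, role: Role) -> list:
        # Sorted ids of all nodes currently holding the given role.
        return sorted(n for n, r in self.nodes.items() if r is role)


view = ClusterView()
view.join("gpu-0", Role.PREFILL)
view.join("gpu-1", Role.DECODE)
view.switch_role("gpu-0", Role.DECODE)  # role change
view.leave("gpu-1")                     # server leaves the pool
print(view.members(Role.DECODE))        # prints ['gpu-0']
```

A real implementation would also have to handle KVCache that lives on a departing node and requests in flight during a role change, which this sketch leaves out.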

stmatengss · 2024-12-22T13:07:43Z

> Hello, I am interested in the Transport support CXL/shared memory and maybe I can do this in one month.

Thank you for your contribution. I look forward to seeing the pull request on GitHub.

doujiang24 · 2024-12-25T12:46:06Z

> Check & revise error handling (problems from device/connection/software)
Hello, I'm happy to take this one in Q1.

alogfans · 2024-12-26T01:56:58Z

> > Check & revise error handling (problems from device/connection/software)
>
> Hello, I'm happy to take this one in Q1.

Thank you for your contribution! Looking forward to seeing the pull request on GitHub.

ANormalMan12 · 2025-01-06T09:46:14Z

I think it is quite hard to implement "ZeroCopy from vLLM to RDMA memory" and "Layer-by-layer pipeline" without heavily modifying core components of vLLM. Is there an easier way to implement "ZeroCopy" and "Layer-by-layer pipeline"?
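
For readers unfamiliar with the term, the general idea behind a layer-by-layer pipeline can be sketched as follows: each layer's KV cache is handed to a transfer worker as soon as that layer finishes, so the transfer of layer i overlaps with the computation of layer i+1. This is only a toy illustration under stated assumptions. `compute_layer`, `send_kv`, and `prefill_layer_by_layer` are hypothetical stand-ins, not vLLM or Mooncake functions, and the sketch does not answer how invasive the change would be inside vLLM.

```python
from concurrent.futures import ThreadPoolExecutor


def compute_layer(layer: int, tokens: list) -> list:
    # Stand-in for one transformer layer's prefill; returns that layer's "KV cache".
    return [(layer, t) for t in tokens]


def send_kv(layer: int, kv: list, sink: dict) -> int:
    # Stand-in for a transfer to the decode side (for example, over RDMA).
    sink[layer] = list(kv)
    return layer


def prefill_layer_by_layer(num_layers: int, tokens: list) -> dict:
    sink = {}
    # A single worker keeps transfers in layer order while overlapping with compute.
    with ThreadPoolExecutor(max_workers=1) as pool:
        pending = []
        for layer in range(num_layers):
            kv = compute_layer(layer, tokens)
            # Hand off this layer's KV and continue to the next layer immediately.
            pending.append(pool.submit(send_kv, layer, kv, sink))
        for fut in pending:
            fut.result()  # Surface any transfer error before returning.
    return sink


received = prefill_layer_by_layer(3, ["a", "b"])
print(sorted(received))  # layers that reached the sink
```

The alternative, sending the whole KV cache after the last layer completes, is simpler to integrate but leaves the network idle during prefill.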

cherhh · 2025-01-10T14:12:58Z

Is the functionality provided by this Mooncake Managed Object Store similar to that of an in-memory database like Redis?

stmatengss pinned this issue Dec 18, 2024

james0zan added the Roadmap label ("Future roadmap or plan for new features") Dec 18, 2024

james0zan self-assigned this Dec 18, 2024

ShangmingCai mentioned this issue Dec 19, 2024

[Core] Support disaggregated prefill with Mooncake Transfer Engine vllm-project/vllm#10884

Merged

ShangmingCai mentioned this issue Jan 7, 2025

Why is ITL's first token so long? #62

Open

wx-csy unpinned this issue Jan 7, 2025

stmatengss pinned this issue Jan 21, 2025

stmatengss mentioned this issue Jan 22, 2025

Libfabric transport layer support #76

Open


stmatengss commented Dec 18, 2024 (edited)

New Component: Mooncake Managed Object Store

25Q1

25Q2+

New Features of Mooncake

Transfer Engine

P2P Store

LLM Framework Integration
