sgl-project / sglang Public

Notifications You must be signed in to change notification settings
Fork 720
Star 7.5k

Code
Issues 193
Pull requests 41
Discussions
Actions
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Security
Insights

Pull requests: sgl-project/sglang

Labels 27 Milestones 0

New pull request New

41 Open 1,954 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Enable Cohere2 Models

#3018 opened Jan 20, 2025 by hliuca

Loading…

enable kv_scale remap

#3017 opened Jan 20, 2025 by hliuca

Loading…

deepseek v3 and r1 chat template

#3015 opened Jan 20, 2025 by qeternity

Loading…

4 tasks

Fix flush_cache and bench_serving for eagle

#3014 opened Jan 20, 2025 by merrymercy

Loading…

support telechat2 model

#3000 opened Jan 20, 2025 by shunxing12345

Loading…

4 tasks

Integrate turbomind into sgl-kernel

#2999 opened Jan 20, 2025 by bjmsong

Loading…

4 tasks

[devcontainer] add non-root user

#2989 opened Jan 19, 2025 by ByronHsu • Draft

4 tasks

[Fix] Address remain issues of supporting MiniCPMV

#2977 opened Jan 19, 2025 by mickqian • Draft

3 tasks done

[MOE] try to optimize cu kernel single block execution - distribute cumsum workload from thread 0 to other threads

#2970 opened Jan 19, 2025 by yiakwy-xpu-ml-framework-team

Loading…

3 of 4 tasks

[Core] Optimize the delay scheduling of in batch prefix caching

#2962 opened Jan 18, 2025 by MrAta • Draft

4 tasks

[EAGLE] Fix some boundary situation when retract reqs and req's max token = 1

#2939 opened Jan 17, 2025 by josephydu

Loading…

Test removing a branch logic

#2905 opened Jan 15, 2025 by rkooo567

Loading…

3 tasks

Integration of TurboMind AWQ

#2900 opened Jan 15, 2025 by bjmsong

Loading…

3 tasks

[Feature] Support dynamic loading and unloading of Lora adapters

#2891 opened Jan 14, 2025 by Fridge003 • Draft

1 of 3 tasks

support triton backend int8 kvcache

#2864 opened Jan 13, 2025 by sleepcoo

Loading…

[DO NOT MERGE] Merged PRs for verl integration

#2849 opened Jan 13, 2025 by fzyzcjy • Draft

3 tasks

Support direct weight loading

#2845 opened Jan 12, 2025 by fzyzcjy

Loading…

3 tasks done

Support distributed tensor when updating weights

#2831 opened Jan 10, 2025 by fzyzcjy

Loading…

3 tasks done

Support custom device mesh for tensor parallel workers

#2827 opened Jan 10, 2025 by fzyzcjy

Loading…

3 tasks done

Use CUDA_VISIBLE_DEVICES instead of gpu_id variables everywhere.

#2824 opened Jan 10, 2025 by heiner

Loading…

1 task done

Improve the mixed chunk prefill by lanuch two kernels

#2811 opened Jan 9, 2025 by libratiger • Draft

1 of 3 tasks

[WIP] [Feature] Support Deepseek-VL2 enhancement

New feature or request

#2798 opened Jan 8, 2025 by ccw1996 • Draft

3 tasks

Add endpoint for file support, purely to speed up processing of input_embeds.

#2797 opened Jan 8, 2025 by RinRin-32

Loading…

2 of 3 tasks

Allow multi SGLang engines to coordinate

#2791 opened Jan 8, 2025 by fzyzcjy

Loading…

3 tasks done

Speculative decoding with lookahead enhancement

New feature or request

high priority

#2790 opened Jan 8, 2025 by jjjjohnson

Loading…

3 tasks done

Previous 1 2 Next

Previous Next

ProTip! Type g p on any issue or pull request to go back to the pull request listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly