-
Notifications
You must be signed in to change notification settings - Fork 4.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adding DS Feature API in accelerator #5423
base: master
Are you sure you want to change the base?
Changes from 16 commits
04611e4
0703d79
7293308
0acfaf9
e2327a1
49cbda7
77567b5
a103c83
0f196a0
d571aef
7662492
7ccc117
993f35f
7588dc6
159c8e4
6ad67c7
89b9f55
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,31 @@ | ||
# Copyright (c) Microsoft Corporation. | ||
# SPDX-License-Identifier: Apache-2.0 | ||
|
||
# DeepSpeed Team | ||
|
||
#A list of constants used in the DeepSpeed feature dictionary | ||
|
||
OP_ASYNC_IO = "async_io" | ||
OP_CCL_COMM = "deepspeed_ccl_comm" | ||
OP_CPU_ADAGRAD = "cpu_adagrad" | ||
OP_CPU_ADAM = "cpu_adam" | ||
OP_CPU_LION = "cpu_lion" | ||
OP_EVOFORMER_ATTN = "evoformer_attn" | ||
OP_FP_QUANTIZER = "fp_quantizer" | ||
OP_FUSED_ADAM = "fused_adam" | ||
OP_FUSED_LAMB = "fused_lamb" | ||
OP_FUSED_LION = "fused_lion" | ||
OP_INFERENCE_CORE_OPS = "inference_core_ops" | ||
OP_CUTLASS_OPS = "cutlass_ops" | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. A general name is needed to cover the non-cuda devices? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Hi @rogerxfeng8 - is there a specific one you're referencing? I believe we call all devices (cuda and non-cuda) accelerators. |
||
OP_QUANTIZER = "quantizer" | ||
OP_RAGGED_DEVICE_OPS = "ragged_device_ops" | ||
OP_RAGGED_OPS = "ragged_ops" | ||
OP_RANDOM_LTD = "random_ltd" | ||
OP_SPARSE_ATTN = "sparse_attn" | ||
OP_SPATIAL_INFERENCE = "spatial_inference" | ||
OP_STOCHASTIC_TRANSFORMER = "stochastic_transformer" | ||
OP_TRANSFORMER = "transformer" | ||
OP_TRANSFORMER_INFERENCE = "transformer_inference" | ||
ZERO_1 = "zero1" | ||
ZERO_2 = "zero2" | ||
ZERO_3 = "zero3" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This reflection mechanism better be lazy initialized. Otherwise there might be circular dependence because this init function be called before
__compatible_ops__
being initialized.