Skip to content

Commit

Permalink
Merge pull request #432 from luislh-dev/main
Browse files Browse the repository at this point in the history
remove redundant asterisks in README
  • Loading branch information
GeeeekExplorer authored Feb 5, 2025
2 parents 09d1086 + 97b35f1 commit 1d7d440
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -111,7 +111,7 @@ Throughout the entire training process, we did not experience any irrecoverable
</div>

> [!NOTE]
> The total size of DeepSeek-V3 models on Hugging Face is 685B, which includes 671B of the Main Model weights and 14B of the Multi-Token Prediction (MTP) Module weights.**
> The total size of DeepSeek-V3 models on Hugging Face is 685B, which includes 671B of the Main Model weights and 14B of the Multi-Token Prediction (MTP) Module weights.
To ensure optimal performance and flexibility, we have partnered with open-source communities and hardware vendors to provide multiple ways to run the model locally. For step-by-step guidance, check out Section 6: [How_to Run_Locally](#6-how-to-run-locally).

Expand Down Expand Up @@ -261,7 +261,7 @@ python fp8_cast_bf16.py --input-fp8-hf-path /path/to/fp8_weights --output-bf16-h
```

> [!NOTE]
> Hugging Face's Transformers has not been directly supported yet.**
> Hugging Face's Transformers has not been directly supported yet.
### 6.1 Inference with DeepSeek-Infer Demo (example only)

Expand Down

0 comments on commit 1d7d440

Please sign in to comment.