Popular repositories Loading
-
prompts.chat
prompts.chat PublicForked from f/prompts.chat
f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.
HTML
-
cuda-samples
cuda-samples PublicForked from NVIDIA/cuda-samples
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
C
-
triton-tutorial
triton-tutorial PublicForked from dsl-learn/triton-tutorial
Getting Started with Triton: A Tutorial for Python Beginners
HTML
-
CUDA-Programs
CUDA-Programs PublicForked from RichardAns/CUDA-Programs
Examples from Programming in Parallel with CUDA
Cuda
-
triton
triton PublicForked from triton-lang/triton
Development repository for the Triton language and compiler
MLIR
-
fastllm
fastllm PublicForked from ztxz16/fastllm
fastllm是后端无依赖的高性能大模型推理库。同时支持张量并行推理稠密模型和混合模式推理MOE模型,任意10G以上显卡即可推理满血DeepSeek。双路9004/9005服务器+单显卡部署DeepSeek满血满精度原版模型,单并发20tps;INT4量化模型单并发30tps,多并发可达60+。
C++
If the problem persists, check the GitHub status page or contact support.