Change the repository type filter
All
Repositories list
25 repositories
- Fork from https://github.com/deepseek-ai/FlashMLA
- A refactored codebase for Gaussian Splatting. Faster(3.5x)!! Modular!! Pure Python or CUDA Extension
- MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not limited to end-to-end speech interaction, end-to-end speech translation and speech recognition.
- The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
- Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
- Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the linux kernel, CPU, disks, Intel PT, GPUs etc. Dynolog also integrates with pytorch and can trigger traces for distributed training applications.