@thu-ml

TSAIL group

Tsinghua Statistical Artificial Intelligence & Learning Group

Pinned

  1. zhusuan Public

    A probabilistic programming library for Bayesian deep learning and generative models, based on TensorFlow

    Python · 2.2k stars · 419 forks

  2. SageAttention Public

    Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.

    Cuda · 1.1k stars · 66 forks

  3. unidiffuser Public

    Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"

    Python · 1.4k stars · 88 forks

  4. prolificdreamer Public

    ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (NeurIPS 2023 Spotlight)

    Python · 1.5k stars · 45 forks

  5. ares Public

    A Python library for adversarial machine learning focusing on benchmarking adversarial robustness.

    Python · 501 stars · 87 forks

  6. tianshou Public

    An elegant PyTorch deep reinforcement learning library.

    Python · 8.3k stars · 1.1k forks

Repositories

Showing 10 of 72 repositories
  • tianshou Public

    An elegant PyTorch deep reinforcement learning library.

    Python · 8,277 stars · MIT · 1,137 forks · 145 issues (1 needs help) · 5 pull requests · Updated Mar 12, 2025
  • SpargeAttn Public

    SpargeAttention: A training-free sparse attention that can accelerate any model inference.

    Cuda · 264 stars · Apache-2.0 · 10 forks · 7 issues · 1 pull request · Updated Mar 12, 2025
  • DiffusionBridge Public

    Official codebase for "Diffusion Bridge Implicit Models" (ICLR 2025) and "Consistency Diffusion Bridge Models" (NeurIPS 2024)

    Python · 32 stars · 2 forks · 2 issues · 0 pull requests · Updated Mar 10, 2025
  • GFT Public
    Python · 26 stars · MIT · 0 forks · 3 issues · 0 pull requests · Updated Mar 8, 2025
  • i-DODE Public

    Official code for "Improved Techniques for Maximum Likelihood Estimation for Diffusion ODEs" (ICML 2023)

    Python · 17 stars · Apache-2.0 · 1 fork · 1 issue · 0 pull requests · Updated Mar 4, 2025
  • MMTrustEval Public

    A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks)

    Python · 133 stars · CC-BY-SA-4.0 · 8 forks · 3 issues · 0 pull requests · Updated Mar 4, 2025
  • RIFLEx Public

    Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers"

    Python · 391 stars · Apache-2.0 · 42 forks · 9 issues · 0 pull requests · Updated Mar 3, 2025
  • TetraJet-MXFP4Training Public

    PyTorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training

    Python · 6 stars · Apache-2.0 · 1 fork · 0 issues · 0 pull requests · Updated Mar 3, 2025
  • SageAttention Public

    Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.

    Cuda · 1,112 stars · Apache-2.0 · 66 forks · 38 issues · 1 pull request · Updated Feb 28, 2025
  • STAIR Public

    Official codebase for "STAIR: Improving Safety Alignment with Introspective Reasoning"

    Python · 26 stars · MIT · 1 fork · 0 issues · 0 pull requests · Updated Feb 26, 2025