AI-Hypercomputer

Reference implementations, benchmarks, recipes, and all things Google Cloud AI Hypercomputer

Popular repositories

  1. maxtext Public

    A simple, performant, and scalable JAX LLM!

    Python · 1.6k stars · 311 forks

  2. JetStream Public

    JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in the future; PRs welcome).

    Python · 267 stars · 33 forks

  3. maxdiffusion Public

    Python · 182 stars · 20 forks

  4. xpk Public

    xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool that helps Cloud developers orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.

    Python · 98 stars · 30 forks

  5. jetstream-pytorch Public

    PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference

    Python · 48 stars · 17 forks

  6. gpu-recipes Public

    Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.

    Dockerfile · 31 stars · 3 forks

Repositories

Showing 10 of 16 repositories
  • maxdiffusion Public
    Python · 182 stars · Apache-2.0 · 20 forks · 4 issues (1 issue needs help) · 8 pull requests · Updated Jan 31, 2025
  • xpk Public

    xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool that helps Cloud developers orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.

    Python · 98 stars · Apache-2.0 · 30 forks · 16 issues · 27 pull requests · Updated Jan 31, 2025
  • maxtext Public

    A simple, performant, and scalable JAX LLM!

    Python · 1,607 stars · Apache-2.0 · 311 forks · 30 issues (2 issues need help) · 119 pull requests · Updated Jan 31, 2025
  • torchprime Public

    TorchPrime is a reference model implementation for PyTorch on TPU/GPU.

    Python · 2 stars · 0 forks · 24 issues · 3 pull requests · Updated Jan 31, 2025
  • JetStream Public

    JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in the future; PRs welcome).

    Python · 267 stars · Apache-2.0 · 33 forks · 11 issues · 6 pull requests · Updated Jan 30, 2025
  • tpu-recipes Public
    Shell · 7 stars · Apache-2.0 · 7 forks · 3 issues · 4 pull requests · Updated Jan 30, 2025
  • pathways-utils Public

    Package of Pathways-on-Cloud utilities

    Python · 7 stars · Apache-2.0 · 2 forks · 0 issues · 2 pull requests · Updated Jan 30, 2025
  • gpu-recipes Public

    Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.

    Dockerfile · 31 stars · Apache-2.0 · 3 forks · 0 issues · 0 pull requests · Updated Jan 29, 2025
  • ml-goodput-measurement Public
    Python · 10 stars · Apache-2.0 · 0 forks · 0 issues · 0 pull requests · Updated Jan 27, 2025
  • ray-tpu Public
    Python · 1 star · Apache-2.0 · 2 forks · 1 issue · 0 pull requests · Updated Jan 25, 2025
