inference server and endpoints for elle-elle-aime #43

monperrus · 2023-04-27T11:45:42Z

A Triton inference server might be useful for the open-source models

monperrus · 2023-05-30T06:54:54Z

we may get an instance soon with StarCoder

monperrus · 2024-06-07T09:06:11Z

Huggingface inference with zeto-gpus

@GGmorello has set up RepairLLama over HuggingFace Spaces thanks to our zero-gpus account

monperrus · 2024-06-07T09:06:14Z

@FredBonux is able to use Mixtral and LLama over groq for free

monperrus · 2024-11-23T06:12:10Z

OpenRouter: A unified interface for LLMs
https://openrouter.ai/

Used in repairbench

monperrus · 2024-11-23T06:12:34Z

Replicate
Run and fine-tune open-source models. Deploy custom models at scale. All with one line of code.
https://replicate.com/

monperrus · 2024-11-23T06:12:58Z

Together.ai
Train, fine-tune, and run inference on AI models blazing fast, at low cost, and at production scale.
https://www.together.ai/

monperrus mentioned this issue Apr 27, 2023

Future work: serverless structure #40

Closed

monperrus changed the title ~~inference server?~~ self-hosted inference server for elle-elle-aime May 30, 2023

monperrus changed the title ~~self-hosted inference server for elle-elle-aime~~ inference server and endpoints for elle-elle-aime Jun 7, 2024

Provide feedback