-
Notifications
You must be signed in to change notification settings - Fork 73
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Muxing Support for Model Routing #566
Comments
Moving this to ready, as I think we can at least start prototyping and thinking about this, albeit we should land #454 first to really make it useful. |
Filter by file: can do support wildcards (eg doc*.txt), as well as specific file names? |
Would we want to specify a version of a model, or something broader (eg claude_sonnet vs claude_sonnet_3.5). |
Would users select their repo, or just type it in (eg can we see a list or is it just manual?) |
Would this be on a global level or per workspace? |
I think it would be nice, I can envision being OK to send
We would want to but I don't think it was easy. Tagging @aponcedeleonch and @JAORMX who were looking into this. I suspect it's not fully possible until codegate is hooked into the IDE, but I don't know 100% |
(Similar to the other reply, this is my 2 öre, not a canonical reply)
This is a question about model aliases right_
I think we could start with just being specific - in other words, just require the full model name to by typed. Many providers have aliases anyway, e.g. anthropic supports But I can also see that codegate having some logic to pick the best model might be good, e.g. "for simple requests, use anthropic haiku, for more complex changes use anthropic sonnet, for very complex tasks use anthropic opus". |
That's right, codegate can't know unless it's actually hooked into the IDE, which it isn't |
First pass at model routing UI, love to hear your feedback, or if I have misunderstood anything Designs |
The work on this issue has finished. The next enhancement is tracked in #1059 |
Introduce support for "Muxing" in CodeGate, enabling users to route specific types of logic to different large language models (LLMs) based on file type, individual files and workspace. This would provide cost savings , a common complaint from users is the expense of tokens and how to optimise those costs along with a way of isolation use to specific models (based on privacy, security , protection of IP)
It would also enable context switching (context window refers to the amount of text data a language model can consider at one time when generating responses) , whereby we could switch models during a long protracted session of prompt / response.
For example:
.md
,.rst
) could be processed by a free, local model, while source code files (e.g.,.py
,.js
) are handled by an advanced model like Claude Sonnet 3.5.Why is this feature important?
Possible Solution
Expand CodeGate’s functionality to include:
Repository-Level Configuration:
local_model
, Repository B ->advanced_model
.File-Type-Based Routing:
.md
->local_model
,.py
->claude_sonnet_3.5
.Per-File Routing:
config.py
->local_model_v2
.Challenges and Considerations
The text was updated successfully, but these errors were encountered: