Question about "Auto" model selection logic in Copilot Chat #189945
Replies: 5 comments
-
|
OLha, tem sido um dilema grande isso. E, pior que mesmo quando usamos ele, não tem um equilibrio entre o free da conta e os modelos premium. Acaba não compensando, nenhum pouco, usar ele. Até dá mais dor de cabeça, para arrumar a baguncinha que ele vai fazendo. |
Beta Was this translation helpful? Give feedback.
-
|
Topic Area: Copilot Chat Hi, This is a really valid concern, and you’re not imagining things — “Auto” can feel overly aggressive at times. Why “Auto” may pick a premium model (even for simple prompts)The “Auto” selection isn’t based only on how simple your message looks. It typically considers a mix of factors such as:
Because of this, even something like “Who are you?” can still get routed to a premium model if:
So yes — it can feel like overkill, but it’s often intentional rather than a bug. Does this consume premium quota?In many cases, yes — if a premium model is selected, it may count against your premium request quota depending on the multiplier (e.g., 1×, 3×). That said, the exact accounting can vary slightly depending on:
How to avoid wasting premium requestsRight now, the most reliable workaround is:
Why GitHub designed it this wayThe goal of “Auto” is to:
But the trade-off is exactly what you noticed: less cost control and occasional over-selection of premium models. Bottom line
If GitHub refines this, a smarter “cost-aware Auto” mode would definitely be a welcome improvement. Hope this helps clarify things 👍 |
Beta Was this translation helpful? Give feedback.
-
|
From what I’ve seen, Auto mode does not only look at how simple the current message is. It also considers the whole conversation context, enabled features/tools, and tries to prioritize response quality. That’s probably why even simple prompts can sometimes get routed to GPT-5.x models. I don’t think it necessarily means every basic question will waste premium quota intentionally. Sometimes the system just keeps using the same higher-tier model during an active chat session for consistency. If you want to save premium requests, it’s usually safer to manually select a lower/base model instead of relying completely on Auto mode. |
Beta Was this translation helpful? Give feedback.
-
|
This is probably expected behavior. The “Auto” mode doesn’t only look at how simple the prompt appears — it also considers response quality, latency, current system routing, and available models. Because of that, even very basic prompts can sometimes be routed to a premium model. If you want to avoid using premium requests for simple tasks, the safest option is usually to manually select a standard/base model instead of relying on Auto mode. |
Beta Was this translation helpful? Give feedback.
-
|
Totally valid concern. The "Auto" mode doesn't just look at how simple your message is it factors in your entire conversation context, response quality targets, and internal routing logic. So even "Who are you?" can get routed to a premium model if it's part of an ongoing chat or the system defaults to consistency over cost-saving. Unfortunately right now there's no way to make Auto "quota-aware." The only reliable fix is to manually select a base model for lightweight questions instead of trusting Auto to do it cheaply. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Select Topic Area
Question
Copilot Feature Area
General
Body
Hi GitHub Support Team,
I have a question regarding how the "Auto" model selection works in Copilot Chat.
I noticed that when I set the model to "Auto", even a very simple and trivial question like "Who are you?" triggered the use of GPT-5.4 (or a similar premium model).
Since premium models often have a higher multiplier (like 1x or 3x) against my monthly 300 premium requests quota, I am concerned that the "Auto" mode is being too aggressive in selecting high-end models for basic tasks that could easily be handled by a 0x (base) model.
Could you please clarify:
Why does the "Auto" logic choose a premium model for such a simple query?
I want to avoid wasting my premium quota on simple greetings or basic questions. Looking forward to your response.
Beta Was this translation helpful? Give feedback.
All reactions