[nu-demo] hero_aibroker modelsconfig.yml ships Groq-only models with no fallback chain #137
Symptom
When the user's Groq account hit a spend limit during the demo, agent.chat routed to groq-primary models and failed completely with 401/400 (no fallback). 10 of 42 bundled models in modelsconfig.yml were groq-only or groq-priority-1 with no secondary provider.
Root cause
The bundled hero_aibroker/modelsconfig.yml includes: llama-3.3-70b-versatile, llama-3.1-8b-instant, and 5 others. The auto-routing logic does not degrade to the next model on 401/403/429 from the primary; it fails the whole request.
Demo workaround (applied 2026-04-23)
Edited modelsconfig.yml to drop all groq backends (both in the repo copy AND the var/ copy; see companion issue on reseeding clobbering edits). After that, chat went through openrouter only.
Proper fix
(a) Every multi-provider model should list openrouter as a priority-2 fallback in modelsconfig.yml.
(b) hero_aibroker's auto-routing should automatically degrade to the next available model (or the next provider for the same model) when the primary returns a retryable-like error (401/403/429).
(c) Consider removing groq-only models from the bundled default, or gating them behind a groq-enabled=true marker that is only set when a valid Groq key is present at startup.
Filed 2026-04-23, nu-shell demo bring-up. Signed-off-by: mik-tf
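For discussion, here is a minimal sketch of the behaviour proposed in (a) to (c). It is not hero_aibroker's actual code. The names Backend, ProviderError, route_with_fallback and enabled_providers are hypothetical, as is the in-memory chain standing in for a modelsconfig.yml entry, whose real schema is not shown in this issue. The idea is to try backends in priority order, skip providers that are not enabled, and move to the next backend only on 401/403/429.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Optional, Set

# Status codes this issue proposes treating as "degrade to the next backend".
FALLBACK_STATUSES = frozenset({401, 403, 429})


@dataclass(frozen=True)
class Backend:
    """Hypothetical (provider, model) entry; lower priority is tried first."""
    provider: str
    model: str
    priority: int


class ProviderError(Exception):
    """Hypothetical error carrying the HTTP status returned by a provider."""

    def __init__(self, status: int, message: str = "") -> None:
        super().__init__(f"{status} {message}".strip())
        self.status = status


def route_with_fallback(
    backends: Iterable[Backend],
    send: Callable[[Backend], str],
    enabled_providers: Set[str],
) -> str:
    """Try enabled backends in priority order, degrading on 401/403/429."""
    # (c) Gate providers: only keep backends whose provider is enabled,
    # e.g. because a valid key was found at startup.
    candidates = sorted(
        (b for b in backends if b.provider in enabled_providers),
        key=lambda b: b.priority,
    )
    last_error: Optional[ProviderError] = None
    for backend in candidates:
        try:
            return send(backend)
        except ProviderError as err:
            if err.status not in FALLBACK_STATUSES:
                raise  # other errors still fail the request immediately
            last_error = err  # (b) remember the error and try the next backend
    if last_error is None:
        raise LookupError("no enabled backend for this model")
    raise last_error  # every candidate failed with a fallback-eligible status


def fake_send(backend: Backend) -> str:
    """Stand-in for a real provider call: groq is over its spend limit."""
    if backend.provider == "groq":
        raise ProviderError(401, "spend limit reached")
    return f"ok via {backend.provider}"


# (a) groq stays priority 1, openrouter is listed as the priority-2 fallback.
# The model id is reused for both providers purely for illustration.
chain = [
    Backend("groq", "llama-3.3-70b-versatile", 1),
    Backend("openrouter", "llama-3.3-70b-versatile", 2),
]

result = route_with_fallback(chain, fake_send, {"groq", "openrouter"})
print(result)
```

With only groq enabled, the same call re-raises the 401 instead of returning a reply. That matches the failure seen in the demo, but here it only happens once the chain is exhausted.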
Moved to hero_demo#18 — see lhumina_code/hero_demo#18