brokencode 3 days ago

I personally like being able to choose because I understand the tradeoffs and want to choose the best one for what I’m asking. So I hope this doesn’t go away.

But I agree that they probably need some kind of basic mode to make things easier for the average person. The basic mode should decide automatically what model to use and hide this from the user.

2
FergusArgyll 3 days ago

I thought sama said that that's the plan for gpt-5: a router which'll choose the right model and thinking level for you

CaptainFever 3 days ago

Would that be considered a Mixture of Experts system?

simonw 3 days ago

No, Mixture of Experts is a really confusing term.

It sounds like it means "have a bunch of models, one that's an expert in physics, one that's an expert in health etc and then pick the one that's a best fit for the user's query".

It's not that. The "experts" are each another giant opaque blob of weights. The model is trained to select one of those blobs, but they don't have any form of human-understandable "expertise". It's an optimization that lets you avoid using ALL of the weights for every run through the model, which helps with performance.

https://huggingface.co/blog/moe#what-is-a-mixture-of-experts... is a decent explanation.