There are any number of reasons why a user may want to switch the model they are using for generation

  • Different models may be more prone to hallucinations and errors, based on their training data and foundational prompt
  • Newer models are more likely to contain updated references and more data points, leading to better outputs (but as a result, may come with a premium price)
  • Commercially, they may not want to spend the money or tokens to use a newer model. Older models can be used to hone a prompt before it is applied to updated data
  • Image generators appreciate the different aesthetics they can tap into with different models, much like someone may choose to listen to a specific album on vinyl for its aesthetic vibe despite the higher audio quality of digital recordings
  • Some image generators allow remixing across models, capturing the aesthetics of one model, and then remixing it in a different model that might return more predictable results against their prompt
  • For security reasons, users may avoid using certain models with delicate or proprietary data due to how the model provider handles this type of information
  • Researchers, engineers, etc may want to move between models to compare results

Whatever the reason, giving users the ability to adjust the model they are prompting with has become a standardized pattern.

If you are working on an interface that allows for this setting to be changed, consider who should have that permission to change it. Companies may wish to restrict or enforce the use of certain models for compliance reasons.

Consider also that regulations related to AI are in flux. Be prepared for entire models to be restricted from use in certain geopolitlcal areas do to their policies.

Details and variations

  • The model version can be changed in settings, when starting a new conversation, or at the prompt level within a conversation
  • Let users change the model on subsequent regenerations and versions to explore different directions
  • Encourage users to use cheaper / older models for drafting at a lower cost while newer models might be used for final drafts
  • Store the model version in the meta data of a conversation or generation so users can recall which they used

Considerations

Positives

Shape the clay
Giving someone the ability to see and shape the operational stuff behind the scenes can help them become more advanced users of the tool. By exploring how different models affect their results, users can learn to tune their results to get predictable results across models, or take advances of differences within them

Co-ownership
By allowing users the ability to change their model, or even eventually upload their own, you will learn things about your own software interface that you couldn't easily uncover with this scale of use. Converting users into co-owners of the model through feedback and prompt results improves the model overall.

Commercial opportunities
Will we see a market for LLMs or SLMs? There every reason to think we will. Allowing users to manipulate models across prompts makes this a more viable possibility. Imagine someone being able to blend the model of a large research institution with their layer of information. Think of the ways this could be use in UX research, academic, education, etc. Could they operate similar to how child themes operate on parent themes in large CRMs? Could people test a model before buying it? We should expect to see these business models emerge, and prepare our software and our experiences.

Potential risks

Access does not absolve accountability
Just because something is available doesn't mean its producer loses responsibility for the outcome. Think about how many beta products exist that lead to poor customer experiences and missed expectations because those limitations were not clear. If you do provide the ability for people to switch models, make it obvious to the user which model they are using, and if possible the differences between the two so they can expect how it will impact their experience.

Use when:
Users have needs for different models, whether to use money responsibly or use the nuances of different models creatively.

Examples

Claude, GPT, and other model creators allow users to select from their range of available models for different needs
Products like Julius allow users to switch between models and providers
Midjourney gives you the option to change the model in your prompt, even when remixing on an image created in a different model
ChatGPT lets you change the model within a conversation to explore different results
No items found.