AI UX Patterns | Remix | ShapeofAI.com

A picture is worth a thousand words, or in this case, tokens.

When existing content is used as the basis for further generation, its underlying tokens can operate as a prompt in of itself. Using prompt layering or references, users can build multiple iterations off of a single input, remixing it countless times until they find what they are seeking.

Remixing is a broad category of actions that change the structural form of an existing piece of content while retaining its modality. If you intend to change the modality itself (e.g., convert a text file to a presentation), use transforming patterns. If you only want to change the style of the content (tone, mood, etc) but leave everything below the surface layer in tact, using restyling.

Sample forms of remixing include:

Adjust the lead or thesis of an essay
Swap POV or audience
Change narrative pattern: essay to Q&A, debate, case study, FAQ
Add new visual tokens or a new reference to an image
Condense a long video into a collection of clips for quick review

Pairing remixing with inline actions

People struggle to write effective prompts–at this point, we should simply take this as true, and look for ways to allows users to communicate the maximum amount of information without having to prompt engineer. Pre-set remixing actions allow users to work with AI to improve a piece of content with a specific intent, on demand. For example, Descript supports remix actions like "remove filler words." Users don't have to define what these words are in order to run the action. The system simple knows, and acts intelligently on their behalf.

‍

Example from Descript showing the number of use cases for remixing that they support

‍

How to design a remixing flow

Before prompt:

Let users attach multiple references. Consider allowing them to adjust the weight of those references (see: Midjourney's Omni reference approach)
Start from an existing piece of content or let users upload/construct something new
Consider using token transparency so users know what they are working with (see: FloraFauna's "enhance prompt" feature to show the tokens behind an image or video in the input card)

During generation

Use previews to show where the remixed output is coming along so users can stop the generation or submit a new adjustment without waiting for the final result
Deploy variations with intensity sliders, which creates multiple branches for users to select from and work with
Allow remixing in place via inpainting if only parts of the content are being adjusted. This way users can target specific areas of the canvas without recreating the entire composition

After generation

For code (or other users) show a structured diff: what moved, what was added, what stance changed.
Consider forcing users to verify the change if the remix occurred via inpainting vs. generating new variations
Let users promote a remix to a new working draft and start a new branch
Use footprints to let users trace back through the results of remixing to find the original source

Details and variations

References are used to inject new prompt tokens into the generation
Variations create branches for users to choose from and remix further
Remixing can be applied to the full canvas or to select areas using inpainting
Maintain visibility into the inputs using an open canvas or footprints off of the generated asset

Considerations

Positives

Avoids the cold start problem

By working off of an existing piece of content, users can collaborate with the AI refine and enhance their generation through remixing, instead of relying on the user to perfectly describe their ideal output via their initial prompt. Remixing can therefore be a compelling opening to onboarding, guiding users to identify with the power they have using the product.

‍

Concerns

Watch for overspending

Remixing can quickly deplete tokens, leaving users feeling frustrated or stuck. This is especially true when it's used in onboarding. FloraFauna solves this by awarding 500 bonus tokens after onboarding, which includes remixing to demonstrate the capabilities of the product. Use transparent spending to let users anticipate the cost of their generation (although remixing is a compelling use case for upselling. What if you exposed the cost up front and invited users to purchase a larger package ahead of remixing? This way if they hit a limit later, they have already been primed.

Remix