A picture is worth a thousand words, or in this case, tokens.

When existing content is used as the basis for further generation, its underlying tokens can operate as a prompt in of itself. Using prompt layering or references, users can build multiple iterations off of a single input, remixing it countless times until they find what they are seeking.

Remixing is a broad category of actions that change the structural form of an existing piece of content while retaining its modality. If you intend to change the modality itself (e.g., convert a text file to a presentation), use transforming patterns. If you only want to change the style of the content (tone, mood, etc) but leave everything below the surface layer in tact, using restyling.

Sample forms of remixing include:

  • Adjust the lead or thesis of an essay
  • Swap POV or audience
  • Change narrative pattern: essay to Q&A, debate, case study, FAQ
  • Add new visual tokens or a new reference to an image
  • Condense a long video into a collection of clips for quick review

Pairing remixing with inline actions

People struggle to write effective prompts–at this point, we should simply take this as true, and look for ways to allows users to communicate the maximum amount of information without having to prompt engineer. Pre-set remixing actions allow users to work with AI to improve a piece of content with a specific intent, on demand. For example, Descript supports remix actions like "remove filler words." Users don't have to define what these words are in order to run the action. The system simple knows, and acts intelligently on their behalf.

Example from Descript showing the number of use cases for remixing that they support

How to design a remixing flow

  • Before prompt:
    • Let users attach multiple references. Consider allowing them to adjust the weight of those references (see: Midjourney's Omni reference approach)
    • Start from an existing piece of content or let users upload/construct something new
    • Consider using token transparency so users know what they are working with (see: FloraFauna's "enhance prompt" feature to show the tokens behind an image or video in the input card)
  • During generation
    • Use previews to show where the remixed output is coming along so users can stop the generation or submit a new adjustment without waiting for the final result
    • Deploy variations with intensity sliders, which creates multiple branches for users to select from and work with
    • Allow remixing in place via inpainting if only parts of the content are being adjusted. This way users can target specific areas of the canvas without recreating the entire composition
  • After generation
    • For code (or other users) show a structured diff: what moved, what was added, what stance changed.
    • Consider forcing users to verify the change if the remix occurred via inpainting vs. generating new variations
    • Let users promote a remix to a new working draft and start a new branch
    • Use footprints to let users trace back through the results of remixing to find the original source
  • Details and variations

    • References are used to inject new prompt tokens into the generation
    • Variations create branches for users to choose from and remix further
    • Remixing can be applied to the full canvas or to select areas using inpainting
    • Maintain visibility into the inputs using an open canvas or footprints off of the generated asset

    Considerations

    Positives

    Avoids the cold start problem

    By working off of an existing piece of content, users can collaborate with the AI refine and enhance their generation through remixing, instead of relying on the user to perfectly describe their ideal output via their initial prompt. Remixing can therefore be a compelling opening to onboarding, guiding users to identify with the power they have using the product.

    Concerns

    Watch for overspending

    Remixing can quickly deplete tokens, leaving users feeling frustrated or stuck. This is especially true when it's used in onboarding. FloraFauna solves this by awarding 500 bonus tokens after onboarding, which includes remixing to demonstrate the capabilities of the product. Use transparent spending to let users anticipate the cost of their generation (although remixing is a compelling use case for upselling. What if you exposed the cost up front and invited users to purchase a larger package ahead of remixing? This way if they hit a limit later, they have already been primed.

    Use when:
    You don't want to start from scratch. Content already contains tokens that can inform the generated output versus relying on users to construct them themselves