Draft mode

Draft mode lets users begin work using less details or less powerful processing power before committing to a full, resource-intensive run. This functionality supports an efficient, iterative process. Early generations go faster and users spend less compute or credits to get a preview, and users can easily switch to a higher-quality model once ready.

Versions of this pattern can be found across modalities:

  • Image and video generators often generate low-scaled versions for early iterations.
  • Audio generators produce short previews of voices or mixes before rendering the full track.
  • Coding editors surface plans of action or “design mode” previews to show intended steps or rough interfaces before deeper interactivity and integrations are added.
  • Text editors may start with an outline before generating full documents.
  • Presentation generators often default to slide outlines for early confirmation.

Explicit vs. implicit drafting

While some products support an explicit draft mode, others offer a combination of parameters that creates a similar environment without being direct.

Visual mediums benefit from being specific, since a less powerful model will impact the overall quality of the generation. This guards user expectations and frames draft mode as a positive means of saving time and money in their creative process.

Implicit experiences across all content types allow users to decrease the step count of a generation, which reduces the fidelity of the AI's reasoning while drawing less compute power. This is a technical concept, though, so these options are often buried in settings for more savvy user to explore.

Decreasing the initial length or size of content also has the effect of creating within a drafting context. Audio and video generations, for example, often produce a short snippet of a high quality run for the user to review before expanding.

Similarly, text-based tools don't necessarily require an explicit mode since an outline is also a common writing tool and method of reducing the quality of the output before upscaling into a full document. Notably, text files use less compute power than other types of media, but making drafting mode explicit can still give users a sense of command over the amount of time spent. Jasper and other content editors mimic this choice by offering a parameter in the input box for users to choose to optimize for speed (draft) or quality (final).

Using model routing for drafting

As automatic model routing becomes more common, users increasingly rely on the system to select the best model for the task and iterative state. A form of drafting can be incorporated by defaulting to the less powerful alternative until instructed otherwise. This benefits the user by decreasing compute and credit consumption, but not all users will be wise to what is happening behind the scenes. However, less savvy users may be unaware that they can force the AI to use a more powerful model when needed.

Consider introducing this concept during onboarding in an interactive way, demonstrating that less powerful models can be the better tool earlier in the design process or while chatting about a topic before running a more powerful command like deep research.

Design considerations

  • Make drafts explicit. Not everything needs to be labeled drafting mode but users should never be surprised when the system produces a lower scaled version of what they expect. Whether producing audio snippets instead of full files, outlines instead of documents, or lower quality images, ensure users retain a sense of control and understanding.
  • Make the trade-off explicit. Describe what is reduced in draft mode, for example model tier, steps, resolution, or duration. Pair the label with expected speed and cost so users can decide quickly without guessing.
  • Keep upgrade paths deterministic. Preserve seeds, prompts, and key parameters between draft and final. If the pipeline changes settings, disclose them and let users lock the seed to maintain composition and layout continuity.
  • Design for rapid loops. Draft mode exists to compress iteration time. Keep “upgrade,” “duplicate to final,” and “compare” one click away. Avoid flows that force users to recreate settings to get a comparable final.
  • Prevent accidental downgrades. Never silently route heavy tasks into draft quality. If you auto-switch for performance, surface a clear notice with a one-click override so users retain control.
  • Expose cost controls near commit points. Indicate token, time, or credit impacts right where the user chooses draft versus final. People plan differently when they see the real cost of pressing “Generate.”

Examples

Chronicle produces a draft outline of each slide as a plan of action for the user to confirm before generating the full presentation.
Leonardi.ai’s realtime canvas allows users to sketch out a rough draft of their intended image and see it rendered against the prompt in real time.
Midjourney draft mode retains the functionality of the full generator but uses a lower scaled model to reduce compute cost for rapid iteration.
Replit gives the option to create a visual preview of a generation to review before committing to running a full application build.
Runway explicitly recommends working with a less powerful model to draft your video before upscaling to the most powerful model.
Synthesia produces a draft video walkthrough from the editing canvas for users to review before exporting. Video animations and dubbing are suppressed in draft mode to decrease processing needs.
While not an explicit draft mode, Udio defaults to 30second clips for new generations. Users can change this length up front or lengthen the draft clip once they are ready.