Voice and tone

For AI to be used professionally, people need confidence that its outputs can be shaped and kept consistent. This includes ensuring that a consistent voice and tone can be maintained across channels and content:

  • A marketing team can’t afford one group sounding casual while another sounds formal.
  • An individual doesn’t want writing that feels off-brand or unlike them.
  • A creative group wants to produce consistent artifacts across an account while splitting up the work.

Most generative platforms begin with simple presets. Text editing tools allow users to select whether the AI should be witty, professional, concise, while image generators allow you to select from a style gallery. These quick toggles work well at the moment of generation, but they aren’t enough for teams that need precision.

Custom voice and tone modifiers allow users and teams to define their own presets and save them to use later. These presets can be saved and reused, modified, and allow for rapid and consistent generation across multiple prompts or account users.

Sample traits

Depending on the modality, custom styles can incorporate a broad array of defining traits:

  • General tone and perspective (formal, casual, witty, empathetic, academic)
  • Vocabulary preferences (preferred terms, banned terms, jargon level)
  • Sentence length and structure (short/concise vs. long/complex)
  • Level of detail or depth (high-level summary vs. in-depth explanation)
  • Formatting style (headings, bullet use, citation format, code commenting style)
  • Visual aesthetic (hand-drawn, cinematic, photorealistic, minimalist)
  • Audio qualities (accent, pitch, pacing, warmth, formality of speech)
  • Coding conventions (indentation, naming schemes, documentation style)
  • Error tolerance and strictness (lenient vs. precise in adhering to rules)
  • Instructional stance (encouraging coach, critical reviewer, neutral explainer)
  • Cultural or regional variants (US vs. UK spelling, metric vs. imperial units, localized idioms)

Configured voices vs. native personalities

Configuring tone of voice is different from the AI’s own personality. Personality comes from training data and system prompts, shaping how the AI interacts with you. Modifying the AI's default voice and tone shapes how it reflects you back in its outputs.

Keeping these roles distinct matters. People rarely care about the AI’s character, but they do care whether its writing sounds right for the context.

Design considerations

  • Start with clear entry points. Users shouldn’t have to dig through settings to shape how the AI sounds or writes. Lightweight selectors at the point of generation (make this more formal / more casual) lower friction and make voice controls visible from the start.
  • Layer in depth for custom voices or styles. Once people see the value, they’ll want richer control. Provide a dedicated space where they can define rules, set phrases to use or avoid, and specify tonal markers. Treat this like a brand kit, with previews that show how outputs will look in practice.
  • Handle persistence with intention. Decide if voices should follow users globally or stay tied to specific projects. Global persistence is convenient but risky if tone carries into the wrong context. Project-scoped settings reduce this risk but require more setup. If you support both, make the scope obvious and easy to switch.
  • Make conflicts visible. If multiple voices could apply, such as a personal default and a team brand voice, don’t apply overrides silently. Show which voice is active and why. A simple label like “Using: Team Brand Voice” prevents confusion.
  • Always provide a reset. Users need confidence that they can return to neutral. An explicit “Reset to default voice” gives them permission to experiment without worrying about permanent changes.

Examples

Cofounder creates a personal voice for you during onboarding based on your emails, and generates a sample email it might send on your behalf to confirm  it got it right.
The Dia browser accepts details such as references and influences and specific instructions for how you code in its settings panel.
Midjourney allows users to create multiple custom profiles trained on preferences between styles, or generated from scratch using moodboards. Each profile can be added and blended as a parameter to influence the final generation.