Envisioning is an emerging technology research institute and advisory.

Modality-Appropriate Refusals

Safety refusals calibrated for the specific modality (speech vs. text) rather than generic text-based refusals.

Year: 2025 | Generality: 520 | Added: May 12, 2026

Modality-appropriate refusals are safety responses calibrated for the specific communication modality in which a request is made, particularly the distinction between text and speech. A refusal in a text interface can be long, precise, and heavily qualified: "I'm sorry, but I can't help with that request because it involves [specific policy category]." A refusal in a spoken interface must be colloquial and brief: the same policy boundary expressed in the cadence and vocabulary of natural speech, without sounding robotic or stilted. Calibrating the refusal to the modality keeps safety responses both effective and natural-feeling across text, audio, and video channels.

The challenge is that safety responses written for text can sound jarring or over-cautious when rendered as speech. A text refusal might include extensive hedging that reads naturally in writing but sounds unnatural in spoken dialogue. It might also run longer than a spoken exchange can comfortably hold, creating an awkward pause in a voice conversation. Conversely, a refusal that sounds natural in speech might be too brief or too casual for a text interface. Modality-appropriate refusals address this by training separate refusal behaviors for each modality while keeping the refusal boundary (what is refused versus what is permitted) equally firm in each, so that only the expression changes.
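The separation between a fixed refusal boundary and a modality-specific expression can be illustrated with a minimal Python sketch. Everything named here (Modality, PolicyDecision, DISALLOWED, decide, render_refusal) is hypothetical and chosen only for illustration; in the approach described in this entry the behavior is learned through training, not produced by templates like these.

```python
from dataclasses import dataclass
from enum import Enum


class Modality(Enum):
    TEXT = "text"
    SPEECH = "speech"


@dataclass(frozen=True)
class PolicyDecision:
    refuse: bool
    category: str  # hypothetical policy label, empty when nothing is refused


# Hypothetical topic -> policy category table (assumption, not a real policy).
DISALLOWED = {"weapons": "weapons manufacturing"}


def decide(topic: str) -> PolicyDecision:
    """Modality-independent boundary: the same topic always gets the same decision."""
    if topic in DISALLOWED:
        return PolicyDecision(True, DISALLOWED[topic])
    return PolicyDecision(False, "")


def render_refusal(decision: PolicyDecision, modality: Modality) -> str:
    """Express an already-made refusal in the register of the given modality."""
    if not decision.refuse:
        raise ValueError("decision is not a refusal")
    if modality is Modality.TEXT:
        # Text can afford a longer, explicitly qualified refusal.
        return (
            "I'm sorry, but I can't help with that request "
            f"because it involves {decision.category}."
        )
    # Speech: short and colloquial, but no less firm.
    return "Sorry, I can't help with that one."


decision = decide("weapons")
print(render_refusal(decision, Modality.TEXT))
print(render_refusal(decision, Modality.SPEECH))
```

The point of the design is that decide never sees the modality, so the boundary cannot drift between channels; only render_refusal varies.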

The training approach described in interaction model research uses a text-to-speech model to generate refusal and over-refusal training data covering a range of disallowed topics, with the refusal boundary calibrated to favor naturally phrased but no less firm refusals. This allows the model to learn the appropriate prosody, pacing, and phrasing for refusals in the audio modality while maintaining the same policy precision as text refusals.
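A rough Python sketch of how such a dataset might be assembled follows. It rests on several assumptions not stated in this entry: that over-refusal data means benign prompts that superficially resemble disallowed ones and should be answered, that each example pairs synthesized prompt audio with a target response, and that the names AudioExample, fake_tts, and build_examples are hypothetical. The fake_tts function is a stand-in that returns bytes, not a real text-to-speech call.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class AudioExample:
    prompt_audio: bytes  # synthesized spoken prompt
    target_text: str     # response the model should learn to speak
    label: str           # "refusal" or "over_refusal"


def fake_tts(text: str) -> bytes:
    """Placeholder for a TTS model: returns UTF-8 bytes, not a waveform."""
    return text.encode("utf-8")


def build_examples(
    disallowed_prompts: List[str],
    benign_lookalikes: List[str],
    synthesize: Callable[[str], bytes] = fake_tts,
) -> List[AudioExample]:
    """Pair spoken prompts with targets for refusal and over-refusal training."""
    examples = []
    for prompt in disallowed_prompts:
        # Target is a brief, natural-sounding but firm spoken refusal.
        examples.append(
            AudioExample(synthesize(prompt), "Sorry, I can't help with that one.", "refusal")
        )
    for prompt in benign_lookalikes:
        # Assumption: the target here is a helpful answer, marked by a placeholder.
        examples.append(
            AudioExample(synthesize(prompt), "<helpful answer>", "over_refusal")
        )
    return examples


data = build_examples(
    disallowed_prompts=["<prompt on a disallowed topic>"],
    benign_lookalikes=["How do I kill a stuck process on Linux?"],
)
for ex in data:
    print(ex.label, len(ex.prompt_audio))
```

Swapping fake_tts for a real synthesizer would only require passing a different callable as synthesize; the prompts and targets shown are placeholders, not real training data.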

The deeper principle is that safety calibration is not modality-neutral. A model that is well-calibrated for text interactions may be over- or under-calibrated for audio or visual interactions, because the expression of uncertainty, the social meaning of a refusal, and the user's expectations of appropriate responses differ across modalities. Modality-appropriate safety is an active area of research as AI systems move from text-only to multimodal interaction.