VoiceNet-Extension

A comprehensive taxonomy of physical, environmental, and social situations that influence human voice quality, vocal bursts, and speech production.

11
Dimensions
289
Situations
53
Groups

Dimension Overview

Each dimension captures a distinct axis of influence on voice production. Click any card to jump to its detailed entries. Dimensions are designed to be orthogonal: a speaker can be annotated on all dimensions simultaneously.

🧍
POSE
Body Posture & Gravitational Alignment
Static physical configuration of the speaker. How gravity, skeletal alignment, and thoracic compression alter the vocal tract and diaphragm.
32 entries 6 groups
🏃
ACTV
Physical Activity & Dynamic Load
How metabolic demand, physical movement, and interaction with objects compete with the speech signal.
69 entries 9 groups
🎙
TRGT
Speaking Target & Projection
Who or what the speaker is addressing. Determines the throw of voice, register, and feedback loop.
25 entries 5 groups
👥
SOCT
Social Situation & Context
The social game being played. Captures psychological intent, power dynamics, register, and emotional masking.
56 entries 8 groups
🏔
ENVI
Environment & Acoustic Space
The physical space surrounding the speaker. Determines reverb characteristics, ambient noise floor, and acoustic reflections that color the voice.
22 entries 4 groups
🩺
HLTH
Health & Physiological Condition
Medical, biological, or physiological conditions of the speaker that alter voice production at the source. These are states of the body itself, not external forces.
18 entries 4 groups
🪖
GEAR
Face & Head Obstructions
Physical barriers, coverings, or equipment on or near the speaker's face and head that mechanically filter, muffle, or distort the voice before it reaches the listener.
14 entries 4 groups
🌪
CLMT
Climate & Atmospheric Conditions
Weather, temperature, and atmospheric conditions that the speaker is physically enduring. These affect muscle tension, breathing patterns, and airway moisture.
10 entries 3 groups
🧪
SUBST
Substance & Chemical Influence
Ingested, inhaled, or injected substances that alter neuromuscular control, cognitive function, or arousal state, thereby changing voice production.
12 entries 3 groups
😴
FATG
Fatigue, Sleep & Energy State
The speaker's current energy level, sleep status, and recovery state. These determine the baseline "fuel" available for voice production.
19 entries 4 groups
PAIN
Pain & Physical Distress
The type, intensity, and location of pain the speaker is currently experiencing. Pain hijacks the voice by triggering involuntary protective responses.
12 entries 3 groups
🧍 POSE Body Posture & Gravitational Alignment

Concept: Static physical configuration of the speaker. How gravity, skeletal alignment, and thoracic compression alter the vocal tract and diaphragm.

Annotation Instruction: Focus only on acoustic evidence of where the body is relative to gravity.
32 entries 6 groups
Horizontal Spectrum 5
Lying Flat on Back (Supine)
supine_flat

The "puddled" voice. Fluid redistribution causes mild nasal congestion. The tongue falls back, slightly dampening high frequencies. Energy is low; projection is poor because the diaphragm fights gravity to lift the viscera.

Lying on Stomach (Prone)
prone_stomach

The "pressed" voice. The physical weight of the body crushes the rib cage against the floor. Inhalations are shorter and slightly labored. The voice sounds closer to the ground, with less chest resonance.

Curled on Side (Fetal Position)
side_lying_fetal

The "cozy" voice. One lung is restricted by the floor, the other is open. Breath is quiet and intimate. Soft, low-volume, with minimal dynamic range.

Supine with Head Hanging (Hyperextended)
supine_head_hang

The "stretched" voice. The neck is arched backward over an edge. The skin of the throat is tight, raising the pitch slightly and thinning the tone. Swallowing is audible and difficult.

Lying Prone Propped on Elbows
propped_elbows

Similar to prone, but the upper chest is open. Abdominal compression (stomach pressed into floor) but clearer projection than being fully flat.

Sedentary Spectrum 6
Deep Recline / Slouching
sitting_slouched

The "mushy" voice. The spine is C-shaped, collapsing the diaphragm. Articulation is lazy; the jaw barely moves. The voice lacks drive or attack.

Upright Chair (Office/Table)
sitting_neutral

The "standard" voice. Balanced resonance. Breath support is stable but lacks the full power of standing. Neutral acoustic baseline.

Perched on Edge/Stool
sitting_perched

The "alert" sitter. Core muscles engaged to maintain balance. Voice slightly more energized and tighter than neutral sitting.

Floor Sitting (Lotus/Cross-legged)
sitting_cross_legged

The "grounded" voice. Spine usually straight, hips tight. Slight bracing in the voice, resonance implies mouth closer to reflective surfaces.

Elbows on Knees (The Brooder)
sitting_lean_forward

The "compressed" voice. Chest cavity physically made smaller. Voice often directed downward, sounding darker and more introspective. Breath is shallow.

Driver's Seat (Reclined + Arms Forward)
driving_recline

Blend of relaxed torso with arm tension from holding wheel. Often louder projection to overcome cabin noise, but with sitting breath capacity.

Vertical Spectrum 5
Standing Neutral
standing_neutral

The "open" voice. Maximum lung capacity. Clear, resonant, and effortless. Reference point for zero acoustic impedance.

Rigid Attention (Military)
standing_military

The "hard" voice. Chest puffed out; neck muscles tense. Tension transfers to larynx, making voice metallic, barked, or overly projected.

Leaning Against Wall
standing_lean_wall

The "cool" voice. Slight phase cancellation or bass boost from wall reflection behind head. Vertical but relaxed/asymmetrical.

Shifted Weight (Hip Cocked)
standing_one_leg

Asymmetrical stance. Often correlates with casual or sassy tone. Vertical but lacks rigid drive of neutral stance.

Power Pose (Hands on Hips)
standing_hands_hips

The "dominant" voice. Elbows out, mechanically locking rib cage expanded. Voice naturally louder and more commanding.

Head & Neck Geometry 4
Looking Up (Chin High)
head_chin_up

The "thinned" voice. Trachea stretched. Larynx pulled upward, shortening the vocal tract. Voice sounds lighter, tighter, higher-pitched.

Looking Down (Chin to Chest)
head_chin_down

The "gravel" voice. Larynx compressed; double chin effect dampens sound. Voice becomes creaky, bass-heavy, and mumbled.

Twisted Neck (Looking over Shoulder)
head_turned_side

Torsion in neck muscles restricts larynx on one side. Phonation sounds slightly strained or squeezed.

Head Resting on Hand
head_rest_hand

The "smushed" face. Palm pushes one cheek up, distorting mouth shape. Bilabial sounds are clumsy. Voice sounds bored or tired.

Compromised & Contorted 8
Crouching / Huddled Ball
crouching_hiding

The "small" voice. Knees against chest; maximum compression. Breathing rapid and shallow. Voice hushed, anxious, acoustically boxed in.

Bent Over (Touching Toes)
bending_waist

The "rush" voice. Blood rushes to head; diaphragm crushed. Voice sounds congested, thick, and strained.

Deep Squat
squatting_deep

The "valsalva" voice. High abdominal pressure for core stability. Grunting glottal closure and strained, pushed tone.

Kneeling (Prayer/Begging)
kneeling_upright

Similar to standing but less grounding. Acoustically open in chest. Often correlates with submissive or pleading tone.

Inverted (Handstand/Hanging)
inverted_upside_down

The "gargle." Gravity reverses blood flow. Vocal folds engorge, making voice extremely thick, wet, and difficult to control.

Confined Space (Crawling/Box)
squeezed_tight_space

The "claustrophobic" voice. Shoulders hunched to ears, shortening neck. Tight, tense sound. Very short acoustic reflections.

Arms Reaching Up
arms_overhead

The "pulled" voice. Rib cage stretched to limit. Difficult to inhale deeply. Voice thin, lacking lower resonance.

Torso Twist (Yoga/Reaching Back)
twisted_torso

Uneven lung expansion. Breath control erratic/wobbly. Effort of twist audible in vowel stability.

Failed Posture 4
Post-Exertion Collapse
slumped_exhausted

The "breathless" slump. Heavy uncontrolled breathing overpowers speech. Body limp, voice has zero tension.

Semi-Conscious / Fainting
limp_faint

The "fade." Vocal cord closure weak (breathy/whispery). Volume trails off at end of sentences.

Physically Restrained / Choked
choked_restrained

The "strangled" voice. External pressure on neck or chest. Airflow turbulent and raspy. Pitch erratic.

Pinned Under Weight
crushed_weight

The "forced" exhale. Cannot inhale freely. Short panicked syllables squeezed out against resistance.

↑ Back to top
🏃 ACTV Physical Activity & Dynamic Load

Concept: How metabolic demand, physical movement, and interaction with objects compete with the speech signal.

Annotation Instruction: Based on acoustic evidence only - specific breath patterns, rhythmic interference, or handling noise.
69 entries 9 groups
Oral Interference 6
Eating (Crunchy/Hard)
eating_crunchy

Sharp high-frequency transients (snaps, crunches) >2kHz. Irregular pauses for swallowing. Tempo slows to mitigate choking risk.

Eating (Soft/Bolus)
eating_soft

The "full mouth" resonance. Tongue occupied with food bolus, causing wet smacking and dampened lingual consonants. Vowels sound wider.

Drinking / Gulping
drinking_gulping

The "Fluid Sequence": speech stops, liquid slosh, heavy wet glottal closure gulp, wet release exhalation, speech resumes.

Dental Hygiene (Brushing)
brushing_teeth

Extreme articulation distortion. Jaw held open, destroying labial consonants. Constant background scrubbing noise and foaming saliva.

Smoking / Vaping
smoking_vaping

The "Drag Cycle": audible inhalation with crackle, silence for breath hold, speech during exhalation creating turbulent friction under the voice.

Yawning
yawning_stretching

The "Pharyngeal Slide." Pharynx expands to maximum, voice pitch slides dramatically downward. Articulation becomes hollow and rounded.

Sedentary Handling 13
Typing (Standard/Office)
typing_keyboard

Fast rhythmic rain of plastic clicks. "Cognitive Load Pacing": as typing speed increases, speech rate slows or fragments.

Gaming (Controller/Mouse)
gaming_intense

Louder, more mechanical bursty clicks. Chair squeaks, sudden volume shifts, rapid-fire adrenaline-fueled speech patterns.

Writing / Drawing
writing_manual

Quiet continuous scritch-scratch friction. Head often angled down causing slight muffling. Pacing slow and synchronized with hand movement.

Phone Use / Scrolling
scrolling_phone

"Acoustic Shadow." Voice directed downward. Quiet thumb-taps. High frequency of distracted fillers while visual attention is occupied.

Cleaning / Scrubbing
cleaning_scrubbing

Rhythmic repetitive friction sounds. Breath often synchronizes with arm movement.

Cooking / Kitchen Work
cooking_active

Complex soundscape of sizzling, chopping, clattering. Lombard Effect: speaker projects louder to compete with kitchen noise.

Dishwashing / Water Handling
dishwashing

Constant white noise floor of running water. Sharp clinks of glass and ceramics. Voice loud but distant.

Rummaging / Searching
rummaging_bag

Heavy fabric rustling, zipper zips, plastic crinkles. Speech shifts to object-search mode with mumbling and distracted tempo.

Sorting Papers
handling_paper

Dry sliding shuffling and sharp paper snaps. Voice often drops to whisper or mumble while reading.

Knitting / Needlework
knitting_sewing

Very quiet rhythmic metallic click-click. Breathing usually very calm and steady, meditative state.

Using Tools (Hammer/Wrench)
tool_use_manual

Heavy rhythmic impacts or metallic ratcheting. Effort grunts at moment of torque or impact interrupt speech.

Dressing
dressing_undressing

Heavy fabric swooshing that may cover mouth temporarily. Compression grunts from bending over for shoes/pants.

Holding / Rocking Infant
holding_baby

Rhythmic physical bounce in voice from rocking. Tone hushed. One side of chest often compressed.

Locomotion 5
Slow Walk (Strolling)
strolling_leisure

Gentle relaxed lilt or bobbing in voice pitch (~1Hz). Footsteps barely audible. Breathing slightly deeper than sitting but controlled.

Power Walking
walking_power

The "Pump." Physical impact of heel-strike creates micro-tremor in voice. Inhalations become audible and rhythmic every 4-6 words.

Walking on Noisy Surface
walking_gravel

Loud crunching footsteps compete with speech. Speaker raises volume and articulates more clearly.

Climbing Stairs
climbing_stairs

The "Ascension." Progressive breathlessness. Pitch tends to rise with effort. Sentences chopped into 2-3 word chunks.

Hiking (Steep Incline)
hiking_uphill

The "Labor." Heavy deep full-lung breathing. Long pauses between short phrases. Voice has puffed quality.

High Exertion 4
Jogging / Running
jogging_steady

The "Jogger's Tremolo." Voice shakes in sync with step cadence. Breathing machine-like and rhythmic, cannot be interrupted for speech.

Sprinting / Fleeing
sprinting_max

Communication Breakdown. Only single syllables or short shouts between violent loud gasps. Voice raw, panicked, unmodulated.

Heavy Lifting (Static Load)
lifting_heavy

The "Valsalva Maneuver." Breath held to brace core, then strangled vocalization under pressure, ending with explosive breath release.

Fighting / Physical Struggle
fighting_struggle

The "Chaos." Sharp sudden impacts. Grunts of pain or exertion. Breath erratic and jagged. Voice aggressive, barking, or screaming.

Physiological & Passive 9
Swimming (Head Out)
swimming_surface

Distinct wet acoustic environment. Speaker gulps air quickly. Speech accompanied by hollow shimmering reverb of water surface.

Shivering / Freezing
shivering_cold

The "Chatter." Involuntary vocal tremolo (~8-10Hz) from core muscle spasms. Jaw tense/locked. Speaker hisses through clenched teeth.

Crying / Sobbing
crying_sobbing

The "Inspiratory Spasm." Inhalation jerky like reverse hiccup. Pharynx wet/mucousy. Pitch cracks frequently. Vowels wavering.

Laughing Fit
laughing_fit

The "Wheeze." Total inability to articulate due to rapid exhalation spasms. High-pitched squeaks or silence where speaker runs out of air.

Stifling (Holding Back)
stifling_hold

The "Leak." Glottis clamped shut, voice high-pitched, squeaky, leaking out under extreme pressure.

Whispering
whispering_quiet

Aphonation. No vocal cord vibration. Speech shaped entirely by air turbulence. High frequencies emphasized; low frequencies absent.

Kissing / Intimacy
kissing_intimate

Smacking wet lip sounds. Extremely close proximity (bass boost). Breath soft, slow, interspersed with soft moans.

Gagging / Nausea
vomiting_gagging

The "Heave." Retching sounds. Speech garbled, watery, interrupted by involuntary stomach contraction. Voice thick with mucus.

Sexual Climax
orgasm_sexual

Involuntary vocalization. Rhythmic breathing becomes spasmodic. Pitch rises sharply. Loss of articulation control, replaced by moans and gasps.

Biological Glitches 4
Coughing
coughing_fit

Explosive percussive blasts of air. Voice immediately following is raspy or wet until cords clear.

Sneezing
sneezing_recovery

Build-up (sharp intake), explosion, recovery (nasal congested speech due to mucosal swelling).

Hiccuping
hiccuping_spasm

Sharp high-pitched intake from diaphragm spasm cuts words in half.

Burping
burping_belching

Deep guttural vibration from esophagus. Voice momentarily drops into ultra-low monster vocal fry.

Machines & Passive Motion 3
Driving (Active Traffic)
driving_active

Background engine hum and turn signal clicks. Focus pauses when navigating, head-turn voice drops.

Vehicle Turbulence / Bumpy Road
riding_turbulence

The "Shimmer." Voice wavers or vibrates rapidly from vehicle shaking. Like artificial vibrato the speaker cannot control.

Shaving (Electric Razor)
shaving_electric

Constant mechanical buzz. Speaker distorts cheek/lip to flatten skin, changing vowel shapes.

Sustained Exertion 12
Intense Cycling / Spinning
cycling_intense

The "pedal pulse." Breathing is deep, rhythmic, and locked to the cadence of the legs. Unlike running, there is no vertical impact bounce in the voice — instead a steady, pressured airstream. The saddle compresses the pelvis, slightly tightening the lower abdomen. Speech comes in short bursts timed to exhalation phases. At high intensity, the voice becomes a forced whisper between gasps.

Rowing / Pulling Heavy Resistance
rowing_pulling

The "drive-and-recover." A binary rhythm: explosive grunt on the pull stroke (glottis slams shut for core bracing), then a rushed, breathy phrase during the recovery slide. The body rocks forward and backward, creating a Doppler-like pitch wobble. Sweat and grip strain add tension to the jaw.

Rock Climbing / Rope Climbing
climbing_rope_rock

The "cling." Arms overhead and weight on fingertips. The rib cage is stretched and exposed, but the core is locked tight for stability. Speech is whispered or staccato — the speaker cannot waste a single breath on projection. Fear of falling adds a tremor. Head is often tilted upward (chin-up vocal tract geometry).

Vigorous Dancing / Rave
dancing_vigorous

The "bounce." Full-body rhythmic movement creates a fast, regular tremolo in the voice synced to the beat. Breathing is elevated but often masked by adrenaline and music. The speaker shouts to be heard over the music (extreme Lombard effect). Laughter and exclamations punctuate the gaps between dance moves.

Swimming Laps (Endurance)
swimming_laps

The "gasp window." Speech is only possible during the brief head-turn to breathe. Words are expelled in 1-2 syllable bursts between strokes. Water in the mouth and nose creates gurgling and sputtering. The voice is urgent, clipped, and has a wet, hollow quality from the pool acoustic environment.

Carrying Heavy Load / Forced March
carrying_load_march

The "grind." A constant low-grade Valsalva — the core is braced to support the weight, restricting the diaphragm. Breathing is shallow and rapid. Speech has a strained, pushed quality. Footsteps are heavy and thudding. Over time, the voice progressively weakens as fatigue accumulates. Grunting on every step at extreme loads.

Digging / Shoveling Earth
digging_shoveling

The "heave-ho." Rhythmic, percussive effort grunts synced to the shovel strike. The torso bends and straightens cyclically, alternately compressing and expanding the chest. Dirt and gravel sounds punctuate the background. Speech is choppy, timed to the brief upright recovery between digs.

Chopping Wood / Axe Work
chopping_wood

The "split." A sharp, explosive grunt at the moment of impact, followed by the crack of splitting wood. Speech only occurs during the wind-up or between swings. Arms-overhead position stretches the rib cage. The voice has a raw, outdoorsy, unmasked quality — zero social filtering.

Jumping / Plyometric Exercise
jumping_plyometric

The "launch." A sharp exhalation or "hup!" on every takeoff, followed by a thud on landing. The voice jolts violently on each impact. Speaking during jump sequences is nearly impossible — words are fragmented into single syllables launched mid-air and cut off on landing.

Martial Arts / Contact Sparring
martial_arts_sparring

The "kiai." Sharp, explosive shouts timed to strikes — these are functional (bracing the core) not decorative. Between exchanges, breathing is controlled and predatory. The voice switches between feral intensity during engagement and calm, measured breathing during circling. Impact sounds of body hits and mat slaps.

Power Yoga / Intense Hold Poses
yoga_power_hold

The "tremor hold." Muscles shaking under isometric load creates a visible vibrato in the voice. Breathing is deliberately controlled (ujjayi breath — audible oceanic friction in the throat). Speech is slow, strained, and often directed inward. The instructor's cue to "breathe" is ironic — the speaker can barely manage it.

Treading Water / Staying Afloat
treading_water

The "bob." The mouth intermittently dips below the waterline, creating a rhythmic alternation between clear speech and gurgling. Breathing is panicked and gulping if fatigued. Splashing water noise competes with the voice. Arms are occupied, so the speaker cannot gesture — all expression is purely vocal.

Sexual & Intimate Activity 13
Foreplay / Arousal Building
foreplay_arousal

The "slow burn." Breathing gradually deepens and slows. The voice drops in pitch and becomes increasingly breathy and soft. Whispered words alternate with sighs and soft moans. Wet mouth sounds (kissing, licking) punctuate the speech. Articulation loosens as cognitive focus shifts from language to sensation.

Active Sexual Intercourse (Exerting Partner)
intercourse_active

The "thrust rhythm." Breathing becomes heavy, rhythmic, and locked to physical movement. Speech fragments into single words or short commands. Effort grunts sync with exertion. The voice oscillates between breathy whispers (intimate) and involuntary vocalizations (primal). Bed springs, skin contact, and movement noise in background.

Receptive Sexual Intercourse (Receiving Partner)
intercourse_receptive

The "wave." Vocalizations are primarily reactive — moans, gasps, and cries that rise and fall with stimulation intensity. Breathing is irregular and punctuated by held breath at peaks of pleasure. The voice may crack into higher registers involuntarily. Speech, when attempted, is fragmentary and loses syntactic structure.

Giving Oral Sex (Mouth Occupied)
oral_sex_giving

The "hum." The mouth is physically occupied, making articulation impossible. Vocalizations are limited to nasal moans, humming, and throat sounds. Breathing is restricted to nasal passages. If the speaker attempts words, they are extremely muffled and distorted. Wet, slurping sounds are prominent.

Receiving Oral Sex
oral_sex_receiving

The "unravel." The mouth is free but the brain is overwhelmed by sensation. Speech degrades from coherent sentences to fragments to pure vocalization. Breathing is irregular with sharp intakes. The pitch rises progressively. Moans have a melodic, sustained quality. The speaker may grip surfaces, creating incidental sounds.

Sustained Pleasure Vocalization / Moaning
moaning_sustained

The "aria." Extended, semi-musical vocalizations on open vowels ("ah," "oh"). Pitch contours rise and fall with pleasure waves. The glottis alternates between full phonation (moans) and breathy phonation (sighs). Volume builds with intensity. These are partially involuntary — the speaker cannot fully suppress them.

Physical Restraint / Bondage (Consensual)
bondage_restrained_consensual

The "strain." Movement is restricted, creating frustrated physical tension that transfers to the voice. Breathing is elevated from arousal and mild panic. Speech may be muffled by a gag (if present) or strained against bindings. The voice sounds vulnerable, exposed, and stripped of physical agency. Rope creak or chain clink in background.

Impact Play / Spanking (Receiving)
impact_play_receiving

The "sting." Sharp, involuntary yelps or gasps at the moment of impact, followed by a recovery moan that transitions from pain to pleasure. Breathing is anticipatory — held breath before the strike, explosive release after. The voice pitch spikes on impact then settles. Counting or verbal acknowledgment may be required by the dynamic.

Breath Play / Choking (Consensual)
breath_play

The "squeeze." Progressively restricted airflow. The voice thins, rises in pitch, and becomes raspy as pressure increases on the throat. Phonation becomes turbulent — a wet, crackling quality. On release, a loud gasp followed by euphoric, dizzy-sounding speech. Extremely dangerous — the acoustic signature is unmistakable.

Edging / Orgasm Denial
edging_orgasm_denial

The "cliff." Vocalizations build toward a crescendo (increasing pitch, volume, and tempo) then are abruptly forced back down. The speaker makes frustrated, whimpering sounds during denial. Breathing is jagged and unsatisfied. Multiple build-and-stop cycles create increasingly desperate, pleading vocal quality. Speech dissolves into begging.

Solo Masturbation
masturbation_solo

The "private." Vocalizations are typically suppressed or minimal — soft breathing, quiet moans, occasional whispered fantasy narration. The speaker may hold their breath during peaks. If vocalizing freely, the sounds are unselfconscious and raw. No audience awareness — purely involuntary vocal responses to self-stimulation.

Post-Coital Afterglow
post_coital_afterglow

The "dissolve." Extreme vocal fry and minimal effort. Breathing is slowly returning to baseline but still elevated. The voice is warm, intimate, and completely unguarded. Endorphins create a dreamy, languid quality. Speech is slow, content, and punctuated by satisfied sighs. Muscle relaxation removes all tension from the throat.

Multiple / Successive Orgasms
multiple_orgasm

The "cascade." Each climax produces a peak vocalization (scream, moan, or silent apnea), followed by a brief recovery of shaky, overwhelmed breathing, before the next wave hits. The voice becomes progressively more destroyed — hoarse, cracking, and uncontrolled. By the third or fourth, speech is completely impossible.

↑ Back to top
🎙 TRGT Speaking Target & Projection

Concept: Who or what the speaker is addressing. Determines the throw of voice, register, and feedback loop.

Annotation Instruction: Based on distance cues, intonation patterns, and articulation effort.
25 entries 5 groups
Self & Abstract 4
Soliloquy / Self-Talk
target_self_mumble

Low volume, lazy articulation, swallowed consonants. Narrow pitch range. Sentences trail off without finishing.

Internal Monologue (Thinking Aloud)
target_internal_monologue

The "Processing" tone. Slow pacing with irregular pauses. Searching quality with elongated vowels, lacks projection.

Screaming into the Void
target_void_scream

Maximum volume but no communicative intent. Raw, often lacking distinct words. Pure cathartic release.

Prayer / Divine
target_divine_prayer

Hushed, reverent, or chanting quality. Highly regular or melodic rhythm. Lacks conversational fluctuations.

Dyadic 5
Intimate / Lover
target_intimate_lover

Proximity Effect dominant: extremely close, boosting bass. Phonation breathy or creaky. Soft consonants. Implies exclusivity.

Confidential / Conspirator
target_confidential_secret

The "Side-Mouth." Hushed but hyper-precise articulation for clarity without being overheard. Monotonous pitch.

Conversational (Peer/Friend)
target_conversational_peer

Neutral baseline. Moderate volume, relaxed turn-taking. Contractions used. Rising intonation for turn signals.

Confrontational (In-Your-Face)
target_confrontational_face

High intensity, hard glottal attacks. Loud but dry (little reverb), implying very close physical proximity.

Telephonic / Remote Human
target_telephonic_remote

"Distance Compensation." Slightly louder than natural. Checking intonation (upward inflection) to verify connection. Clearer enunciation.

Group Unamplified 5
Small Group (Dinner Table)
target_small_group

Slight projection (2-3m radius). Phase shifts as speaker's head turns to address different people.

Lecture / Classroom
target_classroom_lecture

The "Teacher Voice." Projected from diaphragm. Pace deliberately slowed. Key terms heavily stressed.

Drill Sergeant (Platoon)
target_drill_sergeant

Maximum unamplified volume. Percussive staccato rhythm. Vowels shouted, consonants hard/spat.

Soapbox / Street Orator
target_street_orator

Strained projection cutting through ambient noise. Repetitive looping cadences. Elevated pitch.

Town Crier / Shouting Distance
target_town_crier

Vowels heavily elongated to carry over distance. Consonants dropped. Significant delay between phrases.

Group Amplified 4
Public Speaker (PA System)
target_public_pa

The "Politician." High energy but controlled volume. Large dynamic range. Dramatic pauses.

News Anchor / Broadcaster
target_broadcaster

The "News Prosody." Artificial, polished pitch contours. Non-regional accent. No waiting for response.

Podcaster / Radio Host
target_podcaster

"False Intimacy." Close-mic technique (warm, dry, bass-heavy) but performative/projected energy.

Stage Actor (Theatrics)
target_stage_actor

The "Stage Whisper." A whisper loud enough for the back row. Over-articulation of final consonants.

Specialized Targets 7
Infant-Directed (Parentese)
target_infant

High pitch baseline. Exaggerated swooping pitch contours. Simplified vocabulary. Slow, repetitive tempo.

Animal-Directed (Pet Voice)
target_animal

Similar to parentese but includes clicks, kissy-noises, whistles, short command bursts. Tone swings between affection and command.

Voice Assistant (Siri/Alexa)
target_machine_voice

The "Robot Talk." Hyper-correct pronunciation. Unnatural pauses between words. Flat intonation. Emotion removed.

Dictation (Recording for Transcribing)
target_dictation

The "Typewriter" voice. Fast, steady, monotone. Punctuation spoken aloud. Zero emotional affect.

To Deaf / Hard of Hearing
target_hard_of_hearing

The "Loud & Slow." Volume consistently high. Tempo dragged. Articulation exaggerated for lip-reading.

To Non-Native Speaker
target_non_native

The "Simplifier." Grammar stripped down. Words separated by distinct gaps. Volume unnecessarily loud.

The Interrupter
target_interrupter

The "Cut-In." Higher pitch and faster tempo to steal the floor. Voice urgent, pushing against the other person's rhythm.

↑ Back to top
👥 SOCT Social Situation & Context

Concept: The social game being played. Captures psychological intent, power dynamics, register, and emotional masking.

Annotation Instruction: Based on prosodic and register cues. Listen for masking, status, and stakes.
56 entries 8 groups
Professional & Transactional 5
Boardroom / Business Negotiation
social_boardroom_negotiation

The "Alpha Shield." Guarded, low-pitched, steady. Deliberate lack of emotional leakage. Competitive but polite turn-taking.

Customer Service (The Mask)
social_customer_service_mask

The "Smile-Voice." Pitch artificially raised. High uptalk usage. Articulation clear but emotion detached or plastic.

Legal / Bureaucratic
social_legal_bureaucratic

The "Fine Print." Rapid-fire monotone delivery. Emphasis on qualifying words. Zero melody; goal is accuracy.

Medical / Clinical Assessment
social_medical_clinical

The "Detach." Gentle but investigative. Soft to soothe, clinically distant for authority. Falling intonation for questions.

Job Interview (The Applicant)
social_job_interview

The "Anxious Professional." High energy masked by careful grammar. Agreement noises. Over-enunciation. Shallow breath from nerves.

Conflict & High Stakes 8
Domestic Argument (Cyclical)
social_argument_domestic

The "Nag/Whine." Overlapping speech. Nasal or high-pitched complaining. Cyclical repetition with increasing frustration.

Interrogation / Police Interaction
social_interrogation_police

The "Power Imbalance." One voice stern using silence as weapon; other voice shaky with hedging language.

Pleading / Begging
social_plea_desperate

The "Submission." High pitch, shaky breath, creaky onset. Speaker talks quickly before being cut off. Submissive tone.

Emergency / Crisis (911)
social_emergency_panic

The "Suppression." Hyperventilating but forcing steady voice. Rapid rate, high pitch, staccato delivery.

Bullying / Intimidation
social_bullying_intimidation

The "Growl." Pitch drops to bottom of range. Tempo slows to invade acoustic space. Lazy but threatening articulation.

Passive-Aggressive
social_passive_aggressive

The "Sugar-Coated Knife." Sweet polite tone but stress falls on wrong words. Frequent heavy sighing implying judgment.

Cold Revenge / Calculated Payback
social_revenge_cold

The "ice." The voice is eerily calm and measured — the opposite of hot anger. Pitch is low, tempo is slow, and every word lands with deliberate weight. There is a satisfaction audible in the control — the speaker has rehearsed this. Smiling while threatening. The lack of emotional leakage is itself terrifying. "I've been waiting to tell you this."

Discovering Betrayal / Trust Shattered
social_betrayal_discovery

The "break." A stunned, breathless quality. The speaker repeats what they've learned in disbelief ("You... you did what?"). The voice is quiet — shock suppresses volume. Then comes the wave: pitch rises, volume explodes, and the voice cracks. The transition from numb to volcanic happens in seconds. Accusations become fragmented, circling back to the central wound.

Intimacy & Vulnerability 10
Pillow Talk / Post-Coital
social_pillow_talk

The "Melt." Extreme vocal fry. Very slow tempo. Minimal articulation. Long comfortable silences. Voice unprojected.

Flirting / Attraction
social_flirt_attraction

The "Play." Breathy voice. Frequent soft laughter. Men drop pitch; women raise pitch or use head voice. Teasing rhythm.

Gossip / Sharing Secrets
social_gossip_conspiracy

The "Huddle." Rapid-fire pacing, hushed volume, sudden pitch spikes. Intonation invites listener into exclusive circle.

Comforting / Consoling
social_comforting_soothe

The "Blanket." Steady, rhythmic, low-frequency hum. Downward inflections on every sentence. Speaker matches distressed person's breathing.

Confession / Admitting Guilt
social_confession_guilt

The "Drop." Volume drops at moment of truth. Hesitation markers increase. Speaker avoids assertive downward intonation.

Sincere Apology / Making Amends
social_apology_sincere

The "kneel." The voice drops in volume and pitch — the speaker makes themselves acoustically smaller. Sentences are carefully constructed, then often restarted ("What I mean is..."). The tone is heavy with genuine remorse. Pauses are frequent as the speaker monitors the listener's reaction. The voice may crack if the apology touches deep guilt.

Permanent Goodbye / Final Farewell
social_goodbye_permanent

The "last page." Every word is weighed because there won't be more. The voice is thick with suppressed emotion — trembling but controlled. Speaking rate slows to an almost ceremonial pace. The speaker memorizes the moment — sensory descriptions increase. Silences are full, not empty. The voice may break on the very last word.

Surprise Reunion / Seeing a Loved One
social_reunion_surprise

The "shockwave." An involuntary gasp or scream at recognition. The voice immediately jumps to maximum pitch and volume — then collapses into sobbing or laughter. Speech is incoherent fragments ("Oh my God! You're—how—I can't—"). The speaker physically moves toward the person, creating Doppler shifts and rustling. Pure, unfiltered emotion.

Therapy Session / Emotional Unpacking
social_therapy_session

The "excavation." Starts with guarded, careful speech (intellectualizing). Gradually, the voice softens and cracks as defenses lower. Long pauses where the speaker stares into the middle distance. The therapist's prompts are gentle with upward inflection. Breakthrough moments sound like the voice "giving way" — a sudden shift from controlled to raw, followed by tears or silence.

Admitting Vulnerability / "I'm Not Okay"
social_vulnerable_admission

The "crack." The speaker has been performing strength, and the mask slips. The transition is audible — the voice abruptly drops from confident to small. Breath becomes shaky. The speaker often laughs nervously immediately after the admission to re-armor. Pitch wavers. Volume falls to barely audible. The hardest sentences in the human repertoire.

Casual & Group Dynamics 4
Bro-Talk / Locker Room
social_brotalk_locker

The "Posturing." Loud volume, heavy slang. Boisterous percussive laughter. Frequent friendly interruptions.

Awkwardness / Strangers
social_awkward_stranger

The "Gap." Frequent uncomfortable silences. Excessive filler laughs. Topics shift rapidly.

Party / Cocktail Hour
social_party_mingle

The "Lombard Shift." Projecting over background noise while trying to sound casual. Elevated pitch, short punchy sentences.

Online Gaming (Co-op)
social_gaming_team

The "Tactical." Mix of high-stress callouts and casual banter. Instant code-switching between focus mode and social mode.

Performance & Ritual 4
Storytelling / Campfire
social_storytelling_camp

The "Theater." Wide dynamic range. Character voices. Hypnotic rhythm.

Wedding Toast / Eulogy
social_toast_speech

The "Nervous Formal." Poor projection with public speaking tremor mixed with sentimental language. Overly long pauses.

Preaching / Religious Service
social_sermon_preach

The "Cadence." Musical rhythmic delivery. Huge dynamic swells. Call-and-response gaps.

Military / Drill Command
social_military_command

The "Bark." Zero emotion. Maximum volume. Words clipped short for distance carry without echoing.

Negative & Deceptive 5
Lying / Deception
social_deceptive_lying

The "Cognitive Load." Subtle pitch increase. Over-explanation. Inconsistency between word confidence and voice shakiness.

Chronic Complaining
social_complaining_whine

The "Drone." Nasal resonance. Repetitive sing-song pitch contour. Elongated vowels to emphasize suffering.

Intoxicated / Drunk
social_drunk_intoxicated

The "Slur." Loss of tongue motor control. Volume control fails. Emotional volatility.

Scolding / Disciplining
social_scolding_discipline

The "Teacher Tone." Low pitch, slow tempo. Distinct word separation. Heavy final intonation drop.

Mockery / Sarcasm
social_mockery_sarcasm

The "Caricature." Nasal exaggerated tone mimicking someone. Extreme pitch contours.

Emotional Extremes & Crises 12
Grief / Active Mourning
social_grief_mourning

The "hollow." A voice emptied of its normal energy. Long, vacant pauses. The pitch is low and monotone, punctuated by sudden cracks into crying. Sentences start with intention but collapse midway. Swallowing is frequent and audible as the speaker fights the lump in the throat. A quality of disbelief — "this can't be real" — pervades the tone.

Heartbreak / Romantic Devastation
social_heartbreak

The "shatter." The voice wavers between numb flatness and sudden sobbing outbursts. Sentences are obsessively repetitive ("I can't believe...", "How could..."). The pitch is unstable — the speaker cannot find their normal register. Breath hitches (mini-sobs) interrupt even calm sentences. A raw, exposed vulnerability makes every word sound like it physically hurts.

Ecstatic Joy / Overwhelming Happiness
social_ecstatic_joy

The "burst." Volume spikes uncontrollably. Pitch jumps to the top of the speaker's range. Speech rate accelerates until words tumble over each other. Laughter and crying become indistinguishable (tears of joy). The speaker may scream, jump, or hyperventilate. Sentences are fragmentary — the emotion overflows faster than language can contain it.

Jealousy / Corrosive Envy
social_jealousy_envy

The "acid." A controlled but bitter tone. The voice is tight, clipped, and slightly higher than baseline from tension. Fake congratulations sound hollow — the smile doesn't reach the voice. Sarcastic undertones bleed through despite effort. The speaker often falls silent when the object of envy is discussed, then re-enters with forced casualness.

Panic Attack / Anxiety Spiral
social_panic_attack

The "spiral." Hyperventilation dominates — rapid, shallow, audible gasping that the speaker cannot control. Speech is fragmented, repetitive, and catastrophic ("I'm dying," "I can't breathe"). The voice is high-pitched and quavering. Physical symptoms (tingling, chest pain) make the speaker sound genuinely medically distressed. Attempts to speak trigger more hyperventilation.

Public Humiliation / Crushing Shame
social_shame_humiliation

The "shrink." The voice physically retracts — volume drops to near-inaudible. The speaker wishes to disappear. Sentences trail off into nothing. The pitch is flat and defeated. If forced to speak, the voice cracks. Swallowing and throat-clearing increase as the body physically tightens. The speaker avoids eye contact, and the voice follows — directed at the floor.

Nostalgia / Deep Longing / Homesickness
social_nostalgia_longing

The "ache." A warm but sorrowful tone. The voice softens and slows as the speaker enters the memory. Pitch descends gently. Sentences lengthen and become poetic or descriptive. Sighing is frequent. A bittersweet smile is audible — the voice is simultaneously fond and pained. Long pauses as the speaker lingers in the feeling.

Religious Ecstasy / Spiritual Rapture
social_religious_ecstasy

The "transcendence." The voice breaks free of normal conversational patterns into something chant-like, glossolalic, or sing-song. Pitch soars. The tempo becomes rhythmic and almost musical. The speaker may cry, laugh, or fall into ecstatic repetition. Rationality and self-monitoring are abandoned — the voice sounds possessed or channeling.

Dissociation / Emotional Numbness / Derealization
social_dissociation

The "glass wall." Flat, robotic affect. The speaker sounds like they are narrating someone else's life. Zero pitch variation. Responses are delayed as if traveling through fog. The voice has an eerie calm that is more disturbing than screaming. Words are technically coherent but emotionally vacant. No filler words — just empty, precise statements.

Manic Episode / Hypomania
social_manic_episode

The "rocket." Extremely fast, pressured speech — the speaker cannot stop talking. Ideas cascade and jump tangentially (flight of ideas). Volume is too loud. Grandiosity inflates the tone — everything sounds like a TED talk. Laughter is frequent and unearned. The speaker interrupts themselves before finishing sentences to start new ones. Energy feels limitless but brittle.

Blind Fury / Uncontrolled Rage
social_fury_rage

The "eruption." Maximum volume delivered from the gut. Vocal cords are slammed together — the voice is rough, distorted, and may crack from sheer force. Sentences become short, percussive attacks. Profanity increases. Pitch drops to the basement or jumps to screaming. Breathing is heavy and snorting. Objects may be thrown or struck — crashes and impacts in background.

Profound Relief / Cathartic Release
social_relief_catharsis

The "exhale." A long, shuddering outbreath that carries the weight of sustained tension. The voice trembles but is rising — pitch lifts with hope. Laughter and tears mix freely. The speaker may collapse physically (audible body drop or slump). Sentences often begin with "Oh my God" or "Thank God." The tone transitions from shaky to warm over seconds.

Sexual & Romantic Dynamics 8
Phone Sex / Audio Erotica
social_phone_sex

The "performance." Breathy, low-pitched, deliberately seductive vocal quality. The speaker narrates actions and sensations in real-time — slow, descriptive, and hypnotic. Moans and sighs are semi-performative (louder and more shaped than genuine arousal). Close-mic proximity effect makes the voice warm and bass-heavy. Pauses are filled with audible breathing.

Dirty Talk / Verbal Domination
social_dirty_talk

The "command." Low pitch, authoritative delivery. Words are chosen for shock value and spoken with deliberate, hard articulation. The tone oscillates between growling dominance and breathy seduction. Profanity is weaponized — delivered with relish. Volume modulates between whispered threats in the ear and firm, projected commands.

Dom/Sub Power Exchange
social_dom_sub_dynamic

The "polarity." Two distinct vocal registers in one interaction: the Dominant voice is steady, low, slow, and uses clipped commands with falling intonation ("Kneel." "Look at me."). The Submissive voice is high, breathy, compliant, and uses rising intonation seeking approval ("Yes, Sir?" "Like this?"). The power asymmetry is entirely audible.

Calculated Seduction / The Approach
social_seduction_calculated

The "velvet." Every vocal parameter is consciously optimized. Pitch is lowered. Tempo is slowed. Pauses are loaded with intention. The speaker mirrors the target's speech patterns (rapport building). Compliments are delivered with a slight breathiness. Laughter is strategically placed. The voice sounds effortlessly charming — which requires enormous effort.

Sexual Rejection / Turning Someone Down
social_sexual_rejection

The "wall." Abrupt shift from warm to firm. The pitch flattens. Volume drops but clarity increases — every word is deliberate to leave no ambiguity. "No" is said with a downward inflection and a period, not a question mark. Awkwardness creates audible discomfort: throat clearing, shifted posture, and a desire to exit the conversation quickly.

Casual Hookup / One-Night Stand Negotiation
social_hookup_casual

The "shortcut." Fast-forwarded intimacy — the voice is warmer and more familiar than the actual relationship warrants. Both speakers use a lot of nervous laughter to mask vulnerability. Volume is conspiratorially low (don't want friends/roommates to hear). Sentences are short and logistical ("Your place or mine?") masked in casual affect.

BDSM Aftercare / Post-Scene Recovery
social_bdsm_aftercare

The "landing." The Dominant voice shifts from commanding to extremely tender — soft, low, reassuring. The Submissive voice is small, shaky, and child-like (subdrop). Both voices are exhausted and intimate. Blanket rustling, water sipping, and soft physical contact sounds. The rhythm is slow and repetitive — "You did so well," "I've got you" — like soothing an infant.

Sexual Consent Negotiation / Boundary Setting
social_consent_negotiation

The "checkpoint." A deliberate shift from aroused/playful to serious/clear. The voice becomes notably more sober, slower, and articulate. Rising intonation for genuine questions ("Is this okay?"). The listener's response is carefully attended to — silence triggers a full stop. If boundaries are stated, the voice is firm but not hostile. An audible gear-shift back to arousal follows a clear "yes."

↑ Back to top
🏔 ENVI Environment & Acoustic Space

Concept: The physical space surrounding the speaker. Determines reverb characteristics, ambient noise floor, and acoustic reflections that color the voice.

Annotation Instruction: Listen for reverb tail length, early reflections, ambient noise type, and proximity effects caused by the environment.
22 entries 4 groups
Indoor Intimate 6
Tiled Bathroom / Shower
env_bathroom_tiled

Bright, harsh reverb with strong early reflections off hard parallel surfaces. Flutter echo on sharp consonants. The "singing in the shower" resonance amplifies the voice, making it sound larger than it is.

Walk-in Closet / Padded Room
env_closet_padded

Acoustically "dead." Fabric absorbs all reflections. The voice sounds dry, flat, and unusually close. No sense of space. Often used as improvised vocal booths.

Carpeted Bedroom
env_bedroom_carpeted

Warm, moderate absorption. Soft surfaces dampen high frequencies. The voice sounds intimate and natural. Slight low-frequency reinforcement from bed/furniture.

Elevator / Metal Box
env_elevator_metal

Metallic flutter echo in a tiny space. The voice bounces between parallel steel walls creating a harsh, ringing quality. Mechanical hum and cable noise in background.

Concrete Stairwell
env_stairwell_concrete

Long, bright reverb spiraling upward. Hard surfaces create strong reflections. Footstep echoes compete with speech. Voice sounds grandiose but harsh.

Long Hallway / Corridor
env_corridor_hallway

Distinctive "tunnel" reverb. Sound travels linearly with long decay. Footsteps echo rhythmically. Voice gains an eerie, elongated quality.

Indoor Large 5
Cathedral / Stone Church
env_cathedral_stone

Massive reverb tail (3-6 seconds). Stone walls create warm, enveloping reflections. Speech intelligibility drops; only slow, deliberate speech cuts through. Organ-like low-frequency resonance.

Empty Gymnasium / Sports Hall
env_gymnasium_empty

Harsh, metallic flutter echo off polished floors and high ceilings. The voice sounds thin and echoey. Sneaker squeaks and ball bounces in the background.

Theater / Auditorium Stage
env_auditorium_stage

Designed acoustics with controlled reverb. The voice projects clearly into the space. Stage monitors may create slight delay. The speaker hears their own voice returning from the back wall.

Warehouse / Factory Floor
env_warehouse_industrial

Industrial echo with machinery hum. High ceilings create long reverb but hard surfaces make it harsh. Safety alarms, forklifts, and ventilation compete with speech.

Open Plan Office
env_office_open

Moderate reverb dampened by cubicle partitions. HVAC white noise floor. Background murmur of other conversations. The speaker unconsciously lowers volume for privacy.

Outdoor 6
Open Field / Plain
env_open_field

Zero reflections. The voice sounds small and unsupported, quickly absorbed by open air. Wind is the primary interference. No bass reinforcement.

Dense Forest / Jungle
env_forest_dense

Scattered absorption from trees and foliage. The voice sounds dampened and "swallowed" by the environment. Bird calls, insects, and rustling leaves in background.

City Street
env_urban_street

Complex reflection pattern off buildings. Traffic noise (60-80dB) triggers strong Lombard effect. Emergency sirens, construction, and crowd noise compete for attention.

Mountain Summit / Cliff Edge
env_mountain_peak

Thin air reduces projection power. Wind is fierce and constant. Dramatic echoes off cliff faces with long delay. The speaker must shout but has less breath to do it.

Beach / Shoreline
env_beach_shore

Rhythmic wave crash provides constant low-frequency background. Open space with sand absorption. Wind exposure. Voice sounds clear but unanchored.

In Heavy Rain / Thunderstorm
env_rain_heavy

Continuous broadband noise from rain hitting surfaces. Thunder interrupts. The speaker raises volume significantly. Water on face may affect articulation.

Extreme & Specialized 5
Partially Submerged / Water Surface
env_underwater_partial

Gurgling, splashing interference. Voice alternates between clear (above water) and completely muffled (submerged). Gasping for air between phrases.

Inside Sealed Helmet / Space Suit
env_sealed_helmet

Voice reverberates inside the helmet cavity. Breathing is amplified and claustrophobic. Radio crackle on transmission. The voice sounds tinny and confined.

Cave / Underground Tunnel
env_cave_underground

Deep, booming reverb with irregular reflections. Dripping water provides rhythmic background. The voice has a primordial, resonant quality. Low frequencies amplified.

Inside Moving Vehicle (Car/Bus)
env_moving_vehicle_interior

Engine drone provides constant low-frequency bed. Road noise (tire hiss) adds white noise. Vibration affects voice stability. Lombard effect engaged.

Aircraft Cabin (Pressurized)
env_aircraft_cabin

Intense broadband engine noise (75-85dB). Pressurized air dries the throat. The speaker must lean close and project. Ear pressure affects self-monitoring of volume.

↑ Back to top
🩺 HLTH Health & Physiological Condition

Concept: Medical, biological, or physiological conditions of the speaker that alter voice production at the source. These are states of the body itself, not external forces.

Annotation Instruction: Listen for pathological voice qualities - hoarseness, breathiness, nasality, reduced range, tremor, or unusual effort in phonation.
18 entries 4 groups
Respiratory Illness 4
Head Cold / Nasal Congestion
hlth_common_cold

The "stuffed" voice. Nasal consonants (M, N, NG) become oral (M->B, N->D). Resonance shifts away from nasal cavity. Frequent sniffling, throat clearing, and nose blowing interrupt speech.

Sore Throat / Laryngitis
hlth_sore_throat

The "sandpaper" voice. Hoarse, raspy phonation with reduced pitch range. The speaker winces audibly on swallowing. Volume is limited because pushing air past inflamed folds is painful.

Asthma / Restricted Airway
hlth_asthma_attack

Audible wheeze on both inhalation and exhalation. Sentences are extremely short (3-5 words max). The voice sounds "squeezed" through a narrowed passage. Inhaler clicks may be present.

Seasonal Allergies
hlth_allergies_seasonal

Watery, itchy quality. Frequent sneezing fits. Eyes watering affects facial tension. Nasal drip creates a perpetual "post-nasal" thickness. Antihistamine dryness may also be present.

Systemic Conditions 5
High Fever / Delirium
hlth_fever_high

The "wavering" voice. Temperature dysregulation causes shivering and sweating. Speech may become confused or hallucinatory. Pitch is unstable. Dry, cracked lips affect bilabials.

Severe Dehydration
hlth_dehydration_severe

The "cracked" voice. Mouth is parched--audible sticky lip sounds and tongue clicks. Saliva is insufficient, causing harsh frictional noise on sibilants. Voice fatigues rapidly.

Low Blood Sugar / Hypoglycemia
hlth_low_blood_sugar

The "fading" voice. Hands trembling affects jaw stability. Speech becomes slurred and confused, similar to intoxication. Volume drops. Concentration breaks mid-sentence.

Late Pregnancy (Third Trimester)
hlth_pregnancy_late

Compressed diaphragm from enlarged uterus. Breathlessness even at rest. Voice sounds effortful. Frequent sighing. Posture-related discomfort audible in shifting and adjusting.

Altitude Sickness
hlth_altitude_sickness

Gasping for adequate oxygen. Speech is slow with frequent deep breaths. Headache pain audible as wincing. Confusion may cause word-finding difficulty.

Injury & Post-Medical 5
Post-Dental Work (Numbed)
hlth_dental_numbed

The "novocaine" voice. Half the face is paralyzed. Bilabials (B, P, M) are sloppy. Drooling may occur. The speaker compensates by over-articulating the working side.

Jaw Injury / Wired Shut
hlth_jaw_wired

Severe articulation restriction. All speech passes through clenched teeth. Consonants requiring jaw movement (wide vowels, bilabials) are almost impossible. Voice sounds muffled and strained.

Concussion / Head Injury
hlth_concussion

The "fog" voice. Slow processing speed. Long pauses searching for words. Monotone delivery. Volume may be inappropriately loud (difficulty self-monitoring).

Vocal Nodules / Cord Damage
hlth_vocal_nodules

Chronic hoarseness. The voice "breaks" unpredictably between registers. Breathy quality with air escaping through incomplete cord closure. Limited pitch range.

Speaker Has Hearing Loss
hlth_hearing_loss_speaker

The "uncalibrated" voice. Volume is inappropriately loud (cannot self-monitor). Pitch may drift. Articulation is either overly precise (trained) or degraded (untrained). Sibilants are poorly controlled.

Neurological & Chronic 4
Parkinson's / Neurological Tremor
hlth_parkinsons_tremor

The "diminishing" voice. Progressive reduction in volume (hypophonia). Monotone pitch. Tremor creates a rhythmic wobble. Articulation becomes increasingly mumbled.

Post-Stroke / Aphasia
hlth_stroke_aphasia

The "searching" voice. Long, frustrated pauses. Words come out wrong or garbled. The speaker knows what they want to say but the motor pathway is disrupted. Often accompanied by emotional frustration.

Chronic Stutter / Disfluency
hlth_stuttering_chronic

Repetitions, prolongations, and blocks. Silent blocks where the speaker is physically unable to initiate phonation. Tension visible in pitch spikes at moment of release. Secondary behaviors (eye blinks, head nods) may be audible.

Vocal Tics / Tourette's
hlth_tourettes_vocal

Involuntary vocal intrusions: sudden shouts, words, or sounds that interrupt the flow of normal speech. The speaker often continues immediately after the tic as if nothing happened.

↑ Back to top
🪖 GEAR Face & Head Obstructions

Concept: Physical barriers, coverings, or equipment on or near the speaker's face and head that mechanically filter, muffle, or distort the voice before it reaches the listener.

Annotation Instruction: Listen for high-frequency damping, resonant cavity effects, breathing restriction, and mechanical noise from the equipment itself.
14 entries 4 groups
Medical & Safety Masks 4
Surgical / Cloth Face Mask
gear_surgical_mask

High frequencies above 4kHz are attenuated by 3-12dB. Sibilants (S, SH, F) become muddy. Plosives (P, T, K) lose their burst. The speaker unconsciously raises volume 2-3dB to compensate. Breath sounds are amplified inside the mask.

N95 / Industrial Respirator
gear_respirator_n95

Heavy filtering effect. The voice sounds "behind a wall." Breathing is audibly restricted with a distinct valve-click on exhalation. Speech effort increases dramatically. Very poor intelligibility for quiet speakers.

Full Gas Mask / CBRN
gear_gas_mask_full

The "Darth Vader" effect. Voice passes through a chemical filter, creating extreme muffling. Breathing sounds mechanical and menacing. The internal cavity creates a resonant, hollow quality. Communication requires near-shouting.

Oxygen Mask / CPAP
gear_oxygen_mask

Plastic cup over nose and mouth dampens mid-frequencies. Constant hissing of oxygen flow. The voice sounds distant and "in a fishbowl." Speaker must project forcefully.

Helmets & Headgear 4
Full-Face Motorcycle Helmet
gear_motorcycle_helmet

Voice reverberates inside the padded cavity. Visor creates a narrow acoustic aperture. Wind noise at speed makes speech nearly impossible. At rest, the voice sounds boomy and enclosed.

Space Helmet / Sealed Visor
gear_space_helmet

Similar to gas mask but with added radio transmission artifacts--compression, limited bandwidth, static. Breathing is the dominant sound inside the helmet. The voice sounds tinny and robotic.

Medieval Helm / Full Visor
gear_medieval_helm

Metal resonance cavity. The voice rings metallically. The narrow eye slit acts as the only acoustic opening, creating a directional, muffled output. Clanking of the visor punctuates speech.

Construction Hard Hat with Visor
gear_hardhat_visor

Minimal voice distortion but hard plastic reflects sound downward. Construction site noise in background. Speaker shouts. The hat rim creates a slight "cupping" of the voice.

Fabric & Coverings 4
Scarf / Balaclava Over Face
gear_scarf_wrapped

Fabric dampens all frequencies proportionally. Breath becomes warm and humid against the face, creating moist mouth sounds. The voice sounds muffled but retains more clarity than hard masks.

Hand Covering Mouth
gear_hand_over_mouth

Immediate, dramatic muffling. The hand acts as a low-pass filter. Often associated with shock or trying to stay quiet. Consonants that require lip visibility (labials) are severely impacted.

Face Buried in Pillow / Cushion
gear_pillow_face

Extreme muffling--the voice is almost entirely absorbed. Only the loudest, lowest frequencies escape. Used when crying, screaming into a pillow, or lying face-down in bed.

Fabric Veil / Niqab
gear_veil_niqab

Subtle high-frequency damping. Less severe than a surgical mask because the fabric hangs loosely. The voice retains most characteristics but sounds slightly "filtered" to the listener.

Communication Equipment 2
Push-to-Talk Radio / Walkie-Talkie
gear_radio_ptt

Narrow bandwidth (300-3400Hz) eliminates bass warmth and treble air. Compression squashes dynamics. The "kkhht" of the PTT button bookends every transmission. "Over" protocol structures turn-taking.

Megaphone / Bullhorn
gear_megaphone_bullhorn

Extreme compression and distortion. The voice is clipped, harsh, and unnaturally loud. Feedback squeals punctuate pauses. All subtlety and nuance is destroyed in favor of raw volume.

↑ Back to top
🌪 CLMT Climate & Atmospheric Conditions

Concept: Weather, temperature, and atmospheric conditions that the speaker is physically enduring. These affect muscle tension, breathing patterns, and airway moisture.

Annotation Instruction: Listen for involuntary muscle tension (shivering, panting), changes in breath moisture, and environmental noise signatures.
10 entries 3 groups
Temperature Extremes 3
Bitter Cold / Sub-Zero
clmt_freezing_bitter

The "lockjaw." Facial muscles are numb and rigid. Jaw barely opens. Vowels are centralized ("uh" quality) because the tongue is sluggish. Visible breath vapor may cause slight acoustic distortion close to the mouth.

Extreme Heat / Heatstroke
clmt_extreme_heat

The "wilted" voice. Dehydration dries the vocal folds. Panting replaces normal breathing. Speech is lethargic, slow, and broken by heavy exhalations. Sweat drips may be audible.

Humid Tropical / Sauna
clmt_humid_tropical

Airway is saturated with moisture. The voice sounds "wet" and heavy. Breathing is deep but unsatisfying. Sweat and moisture create "slippery" mouth sounds.

Wind & Precipitation 3
Strong Wind / Gale
clmt_strong_wind

Wind buffets the microphone and face. The speaker turns away from the wind to breathe, creating directional shifts. Volume is raised to compete. Words are "torn away" before they reach the listener.

Blizzard / Snow Storm
clmt_blizzard_whiteout

Combines freezing cold (lockjaw) with wind noise and disorientation. The speaker shouts into the void. Snow dampens reflections, making the voice sound swallowed by the environment.

Sandstorm / Dust Storm
clmt_sandstorm_dust

Speaker breathes through fabric or hand to filter particles. Coughing interrupts. The voice sounds muffled and desperate. Grit in the teeth affects sibilants.

Atmospheric 4
High Altitude / Thin Air
clmt_high_altitude_thin

Reduced oxygen makes every sentence an effort. Deep gasping breaths between short phrases. The voice lacks its usual power and projection. Light-headedness may cause slurring.

Smoke-Filled Room / Fire
clmt_smoke_filled_room

The "char." Irritated airways cause raspy, coughing speech. Eyes watering and burning affects facial tension. Urgency in voice as survival instinct kicks in. Carbon monoxide may cause confusion.

Torrential Rain (Exposed)
clmt_underwater_rain

Water streaming down the face enters the mouth. The speaker sputters. Voice competes with the roar of heavy rainfall. Sentences are shouted in gaps between thunder.

Dense Fog / Mist
clmt_fog_dense

Psychologically oppressive silence. Sound travels strangely--absorbed in some directions, amplified in others. The speaker often speaks louder than necessary, uncertain if they are being heard.

↑ Back to top
🧪 SUBST Substance & Chemical Influence

Concept: Ingested, inhaled, or injected substances that alter neuromuscular control, cognitive function, or arousal state, thereby changing voice production.

Annotation Instruction: Listen for motor control degradation, tempo changes, volume dysregulation, and emotional volatility not explained by context.
12 entries 3 groups
Stimulants 4
High Caffeine Intake
subst_caffeine_high

The "wired" voice. Speech rate increases 15-25%. Pitch baseline rises slightly. Articulation is sharp but jittery. The speaker may talk over others or fail to pause. Restless energy audible in rapid breathing.

Stimulant (Amphetamine/MDMA)
subst_amphetamine_stimulant

The "machine gun." Extremely rapid, pressured speech. Jaw clenching (bruxism) distorts consonants. Dry mouth creates sticky articulation sounds. Emotional intensity is disproportionate to content.

Cocaine / Strong Stimulant Rush
subst_cocaine_rush

Nasal damage creates audible sniffing and congestion. Speech is grandiose, confident, and fast. Volume is too loud. The speaker interrupts constantly. Numbed throat affects swallowing.

Natural Adrenaline Surge
subst_adrenaline_natural

The "survival" voice. Pitch jumps. Time perception warps--speech is fast but feels slow to the speaker. Tunnel-vision focus makes speech extremely direct and stripped of social niceties.

Depressants & Sedatives 4
Mildly Intoxicated (1-3 drinks)
subst_alcohol_tipsy

The "loosened" voice. Social inhibitions drop. Laughter comes easier. Slight volume increase. Very subtle slurring on complex consonant clusters. Emotional range widens.

Heavily Intoxicated
subst_alcohol_heavily_drunk

The "wreck." Severe slurring--sibilants smear, plosives lose their stop. Volume swings wildly. Emotional volatility (crying to laughing). Balance issues cause the voice to sway with the body.

Opioid / Heavy Sedation
subst_opioid_sedated

The "nod." Extremely slow speech with long pauses. Volume drops to near-whisper. The speaker trails off mid-sentence as consciousness fades. Voice is breathy and lacks any tension or attack.

Benzodiazepine / Tranquilizer
subst_benzodiazepine_calm

The "flatline." Emotional affect is completely blunted. Speech is coherent but robotically calm. No excitement, no fear, no urgency. Muscles are relaxed, creating slightly mushy articulation.

Other Substances 4
Cannabis / THC
subst_marijuana_high

The "drift." Slower speech rate with frequent "spacing out" pauses. Dry mouth (cottonmouth) creates sticky lip sounds. Giggly. Thought patterns meander. Volume is usually lower than baseline.

Psychedelic (LSD/Psilocybin)
subst_psychedelic_trip

The "wonder." Speech becomes philosophical and tangential. Long pauses for overwhelming sensory experiences. Pitch range expands dramatically. The speaker may laugh, cry, or fall silent in rapid succession.

Post-Anesthesia / Waking From Sedation
subst_anesthesia_waking

The "emergence." Confused, slurred, childlike speech. Volume and pitch are uncontrolled. The speaker may say things they would normally filter. Motor control returns gradually--early speech is nearly unintelligible.

Chronic Heavy Smoking
subst_nicotine_chronic

The "gravel." Permanent vocal fold thickening from years of irritation. Lowered pitch baseline. Raspy, rough quality. Frequent throat clearing. Reduced pitch range. Morning voice is especially damaged.

↑ Back to top
😴 FATG Fatigue, Sleep & Energy State

Concept: The speaker's current energy level, sleep status, and recovery state. These determine the baseline "fuel" available for voice production.

Annotation Instruction: Listen for reduced volume, slower tempo, pitch dropping, vocal fry increase, trailing sentences, and overall "dimming" of the voice.
19 entries 4 groups
Sleep-Related 5
Just Woken Up (Morning Voice)
fatg_just_woken

The "gravel bed." Vocal folds are swollen from overnight fluid redistribution. Pitch is at its lowest point of the day. Extreme vocal fry. Articulation is sluggish. The voice "warms up" over 15-30 minutes.

24+ Hours Without Sleep
fatg_sleep_deprived_24h

The "zombie." Microsleep pauses where the speaker blanks out mid-word. Slurred articulation resembling mild intoxication. Flat affect. The voice sounds hollow and lifeless. Yawning interrupts frequently.

Drifting Off / Nodding Off
fatg_falling_asleep

The "descent." Sentences start at normal pace but slow dramatically, trailing into mumbles. Volume decreases progressively. Words merge together. The speaker may jerk awake and restart mid-thought.

Post-Nap Grogginess (Sleep Inertia)
fatg_post_nap_groggy

The "fog." Confused, disoriented quality. The speaker doesn't know what time it is. Voice is thick and heavy. Responses are delayed. Cognitive function is worse than pre-nap for 10-20 minutes.

Chronic Insomnia / Long-Term Sleep Debt
fatg_insomnia_chronic

The "hollow." Perpetual low energy. No vocal "peaks"--the dynamic range is compressed. The voice has an anxious, tense quality underlying the exhaustion. Dark circles audible in the tone.

Physical Energy 3
Post-Exercise Endorphin High
fatg_post_exercise_high

The "glow." Voice is energized, open, and clear. Breathing is still elevated but controlled. Confidence and positivity leak into tone. Articulation is crisp.

Complete Physical Exhaustion
fatg_complete_exhaustion

The "empty tank." Cannot project. Every word costs effort. Long recovery breaths between phrases. The voice sounds like it is being squeezed from a nearly empty tube.

Second Wind (Late Night Energy)
fatg_second_wind

The "manic." An artificial, slightly delirious energy spike. Speech becomes fast and giggly. The voice has a "punchy" quality that feels unsustainable. Laughter comes too easily.

Relaxation States 4
Deep Meditation / Trance
fatg_deep_meditation

The "center." Extremely slow, deliberate speech. Perfect breath control. Voice is soft but clear. Zero tension in the throat. Each word feels intentionally placed.

Deep Relaxation (Post-Massage/Bath)
fatg_post_massage_relaxed

The "melt." Languid, unhurried speech. Muscles are so relaxed that articulation is loose. The voice sounds warm and satisfied. No urgency whatsoever.

Jet Lag / Circadian Disruption
fatg_jet_lagged

The "disconnect." The voice is at odds with the environment--exhausted at noon, wired at midnight. Cognitive fog causes word-finding difficulty. Yawning at inappropriate times.

Caffeine / Sugar Crash
fatg_caffeine_crash

The "wall." Abrupt energy drop after stimulant wears off. Speech slows noticeably. Irritability enters the tone. The speaker sounds deflated and resentful.

Physical Exhaustion 7
Hitting the Wall (Endurance Collapse)
fatg_hitting_wall

The "bonk." A sudden, cliff-edge energy drop during sustained exertion (marathon mile 20, hour 3 of a hike). The voice was functional moments ago and now sounds hollow, slurred, and confused — resembling hypoglycemia. Speech becomes monosyllabic. The speaker may become emotionally volatile (crying for no reason). Breathing is inefficient and labored.

Post-Sprint Oxygen Debt
fatg_oxygen_debt

The "payback." Immediately after max-effort exertion. Breathing is violent, gasping, and completely involuntary — it hijacks the vocal system. The speaker bends over (compressed chest), hands on knees. Speech is impossible for 30-60 seconds. First words come out as breathy, shaky whispers between heaving breaths. Nausea may be audible (gagging, spitting).

Heat Exhaustion / Overheating
fatg_heat_exhaustion

The "wilt." Dehydration dries the vocal folds. Panting replaces normal breathing. The voice is weak, confused, and may slur (mimicking intoxication). Core temperature elevation causes dizziness — the speaker sounds disoriented. Volume drops because the body prioritizes cooling over communication. Speech rate slows to a crawl.

Muscle Failure / Shaking Exhaustion
fatg_muscle_failure

The "quake." The body's muscles are in visible tremor — this transmits directly to the voice as a fast, involuntary vibrato. Jaw and neck muscles shake, destabilizing articulation. The speaker sounds like they're standing on a vibrating platform. Every exhale carries a tiny, involuntary whimper. Core instability makes the voice sound fundamentally unreliable.

Post-Adrenaline Crash
fatg_adrenaline_crash

The "comedown." After a fight, accident, or emergency. Adrenaline withdrawal causes trembling, nausea, and emotional flooding. The voice is shaky and tearful even if the speaker isn't sad. Hands are shaking (audible if holding objects). Speech is rapid and repetitive as the brain replays the event. Sudden emotional outbursts (laughing, crying) without warning.

Multi-Day Sleep Deprivation (48h+)
fatg_multi_day_deprivation

The "unraveling." Beyond mere tiredness — the voice enters a hallucinatory, detached quality. Sentences begin rationally and derail into nonsense. The speaker may argue with things that aren't there. Micro-sleeps create sudden silences. Emotional regulation is destroyed — inappropriate laughter, sudden rage, paranoia. The voice sounds like a system failing.

Post-Crying Exhaustion / Cried Out
fatg_post_crying_exhaustion

The "aftermath." The speaker has exhausted their tears. The voice is hoarse and raw from prolonged crying. Nasal congestion is severe, turning all nasal consonants oral. The eyes are swollen, affecting facial resonance. A fragile, emptied-out quality — the speaker is calm not because they feel better but because they have nothing left to expel.

↑ Back to top
PAIN Pain & Physical Distress

Concept: The type, intensity, and location of pain the speaker is currently experiencing. Pain hijacks the voice by triggering involuntary protective responses.

Annotation Instruction: Listen for sudden gasps, guarded breathing, pitch spikes at onset of pain, reduced volume to minimize physical movement, and overall vocal tension.
12 entries 3 groups
Acute Pain 4
Sharp Sudden Pain (Cut/Stub/Sting)
pain_sharp_sudden

The "spike." An involuntary yelp or sharp intake of breath at the moment of injury. Immediately followed by guarded, tense speech with held breath. Pitch jumps dramatically at onset then settles into a strained baseline.

Burn / Scald / Contact Pain
pain_burn_scald

The "hiss." Immediate sharp inhalation through clenched teeth ("sssss"). The voice becomes urgent and panicked. The speaker instinctively holds the affected area, reducing one arm's gestures.

Impact / Blow to Body
pain_impact_blow

The "crumple." A sudden expulsion of air (grunt or "oof") as the diaphragm is impacted. Speech immediately after is breathless and shaky. If to the head, confusion and slurring follow.

Labor Contractions / Waves of Pain
pain_labor_contractions

The "crest." Pain arrives in predictable waves. Speech is normal between contractions, then rapidly deteriorates into moaning, grunting, or screaming at the peak, then gradually returns to coherence.

Chronic & Persistent 4
Chronic Back/Joint Pain
pain_chronic_back

The "wince." A constant low-level tension in the voice. Periodic sharp intakes of breath when the speaker shifts position. Speech is weary and colored by long-suffering. Sighing is frequent.

Migraine / Severe Headache
pain_headache_migraine

The "whisper of necessity." The speaker minimizes all sound because noise exacerbates the pain. Volume is drastically reduced. Light sensitivity causes squinting which tightens the face. The voice sounds fragile.

Dental Pain / Toothache
pain_dental_toothache

The "guard." The speaker avoids opening the mouth fully. One-sided chewing of words. Cold air intake through the mouth causes wincing. Sibilants may be avoided because air flow over the tooth hurts.

Abdominal Cramp / Gut Pain
pain_abdominal_cramp

The "curl." The speaker doubles over, compressing the rib cage. Breathing becomes shallow and guarded. Voice is directed downward. Groaning undertone on sustained vowels.

Severe & Traumatic 4
Broken Bone / Fracture
pain_broken_bone

The "shock." Initial scream followed by controlled, shaky breathing. The speaker becomes very still to avoid movement. Voice trembles. Adrenaline may temporarily mask the worst, creating an eerie calm before the pain fully registers.

Open Wound / Bleeding
pain_wound_bleeding

The "fade." Progressive weakening as blood loss and shock set in. Voice starts panicked and energized, then gradually loses projection and coherence. Cold sweat and pallor audible as shivering onset.

Muscle Spasm / Cramp
pain_muscle_cramp_spasm

The "seize." Sudden involuntary interruption of speech as the muscle locks. A sharp, strangled sound. The speaker may curse reflexively. Speech resumes once the spasm passes but remains tense and guarded.

Extreme Pain / Torture
pain_torture_extreme

The "break." Voice cycles between screaming, sobbing, and near-silence (shock). All social masking is destroyed. Raw, primal vocalizations replace language. Between episodes, the voice is destroyed--hoarse, breathy, and broken.

↑ Back to top