All posts

If you are a therapist, you already know the documentation problem. You see 6-8 clients a day. Each session generates a note that needs to be written: subjective observations, objective data, assessment, plan. Maybe you use SOAP format. Maybe DAP. Maybe your EHR has its own template. Regardless of format, the notes take time. Studies consistently show that mental health clinicians spend 1.5 to 2.5 hours per day on documentation. That is time you are not seeing clients, not resting between sessions, and not doing the work you trained for.

Voice-to-text can cut documentation time by 60-70%. Instead of typing a progress note in 10-15 minutes, you dictate it in 2-4 minutes. Instead of saving all your notes for the end of the day (when details are fading), you dictate each note in the 5-10 minute gap between sessions while the clinical content is fresh.

This guide covers how to use voice-to-text for therapy documentation, which tools work best for clinical use, and the privacy considerations you need to think about.

The Session Notes Problem

Documentation burden is the leading cause of burnout among mental health professionals. The irony is brutal: therapists enter the field to help people, and the thing that burns them out is not the emotional weight of the work but the paperwork that follows it.

The typical documentation workflow looks like this:

  1. See a client for 50 minutes
  2. Have a 10-minute gap before the next client
  3. Use that gap for the restroom, water, and mental transition
  4. See the next client
  5. Repeat 6-8 times
  6. At the end of the day, sit down with a stack of notes to write
  7. Try to remember the details of sessions from 4-6 hours ago
  8. Spend 1.5-2 hours typing notes
  9. Go home exhausted, having worked an extra 2 hours beyond your clinical schedule

The problem is not just the time. It is the quality. Notes written hours after sessions contain less clinical detail, fewer direct observations, and more generic language. By the end of the day, session 2 and session 5 blur together. You write what you remember, which is less than what you observed.

Voice-to-text changes this by making it practical to dictate each note immediately after the session. You have 5-10 minutes between clients. Dictating a note takes 2-4 minutes. That leaves time for a bathroom break and a moment to collect yourself. The notes are better because the session is fresh. And at the end of the day, you are done. No note stack. No 2-hour documentation marathon.

Quick Comparison

ToolPriceClinical VocabOn-DeviceEHR CompatibleAI CleanupHIPAA
Voice Keyboard ProFree / $4.99/moStrongYesYes (system-wide)YesOn-device processing
Dragon Medical One~$99/moExcellentNo (cloud)Deep EHR integrationNoBAA available
Apple DictationFreePoorPartialYes (system-wide)NoNo BAA
Freed$99/moExcellentNo (cloud)Some EHRsAuto-generates notesBAA available
Nuance DAXEnterpriseExcellentNo (cloud)Deep integrationAuto-generates notesBAA available

Dictating SOAP Notes by Voice

Here is what dictating a therapy session note actually sounds like. This example uses SOAP format:

Subjective: "Client reports increased anxiety over the past week, rating it 7 out of 10 compared to 5 out of 10 at last session. States the anxiety is primarily related to an upcoming performance review at work. Reports difficulty sleeping, falling asleep taking approximately 45 minutes compared to usual 15 minutes. Denies suicidal ideation. Reports continued use of diaphragmatic breathing exercises learned in previous sessions with moderate effectiveness."

Objective: "Client appeared alert and oriented. Affect was anxious with fidgeting and rapid speech noted in the first 15 minutes. Eye contact was maintained. Thought process was logical and goal-directed. No evidence of psychotic symptoms. Client engaged actively in session and demonstrated understanding of cognitive restructuring techniques introduced today."

Assessment: "Generalized Anxiety Disorder, moderate severity. Anxiety symptoms are situationally exacerbated by occupational stressor. Client is making progress with behavioral interventions but cognitive distortions related to perfectionism continue to drive anticipatory anxiety. PHQ-9 score remains stable at 8, indicating mild depressive symptoms."

Plan: "Continue weekly CBT sessions. Introduced cognitive restructuring worksheet for automatic thoughts related to performance evaluation. Assigned daily thought record focusing on situations that trigger anticipatory anxiety. Will review sleep hygiene strategies at next session. Consider referral to psychiatry for medication evaluation if sleep disturbance persists beyond two weeks. Next session scheduled for April 24."

That entire note took about 2 minutes to speak. It contains the clinical detail that makes a note useful: specific ratings, observed behaviors, treatment interventions, and measurable plans. Typing the same note takes 10-12 minutes. The difference over 7 clients per day is substantial: 14 minutes by voice versus 70-84 minutes by keyboard.

Detailed Tool Reviews

Voice Keyboard Pro

Voice Keyboard Pro is a dictation app for Mac and iPhone that works in every application on your computer. For therapists, this means you can dictate directly into your EHR's note field, whether you use SimplePractice, TherapyNotes, Jane App, or a web-based system. Press the keyboard shortcut, speak your note, release. The text appears in the field.

Voice Keyboard Pro's profession-aware vocabulary adapts to clinical terminology. It handles DSM-5 diagnoses (Generalized Anxiety Disorder, Major Depressive Disorder, PTSD), medication names (sertraline, buspirone, escitalopram), therapeutic modalities (CBT, DBT, EMDR, ACT), and assessment instruments (PHQ-9, GAD-7, PCL-5) with high accuracy. This eliminates the constant corrections that plague generic dictation tools when used for clinical documentation.

The critical feature for therapists is on-device processing. Voice Keyboard Pro processes audio on your Mac. Your dictation of session notes, which contains protected health information, never leaves your computer. There is no cloud server storing recordings of your clinical observations. This sidesteps the primary HIPAA concern with voice-to-text tools.

Voice Keyboard Pro's AI cleanup actions are useful for clinical writing. After dictating a note quickly, you can clean up filler words and fix grammar without changing the clinical content. The "professional tone" action can help shift casual dictation into formal clinical language.

Best for: Therapists in private practice who want affordable, private, accurate clinical dictation. Clinicians who use web-based EHR systems. Therapists who want one tool for session notes, emails, reports, and all other writing. For a broader look at medical dictation, see our medical dictation software guide.

Honest limitation: Voice Keyboard Pro is not purpose-built for clinical documentation. It does not auto-generate notes from session recordings. It does not have SOAP/DAP templates built in. It does not integrate directly with EHR systems via API. It is a dictation tool that works in your EHR, not a clinical documentation platform. If you want AI-generated notes from session audio, look at Freed or Nuance DAX.

Dragon Medical One

Dragon Medical One is the healthcare-specific version of Dragon NaturallySpeaking. It is the incumbent in medical dictation, used by hospitals and clinics for decades. The clinical vocabulary is the best in the industry. It handles medical terminology across all specialties with consistently high accuracy.

Dragon Medical One integrates deeply with major EHR systems. In some configurations, it adds a dictation button directly into the EHR interface. It supports custom macros and templates: you can say "insert SOAP template" and it fills in the section headings, or "insert depression screening" and it populates a PHQ-9 scoring template.

The downside is cost. At approximately $99 per month per user, Dragon Medical One is designed for large practices and hospital systems that negotiate enterprise pricing. For a solo therapist, that is $1,200 per year. It also processes audio in the cloud, which means you need a Business Associate Agreement with Nuance/Microsoft to maintain HIPAA compliance.

Best for: Large group practices and healthcare organizations. Therapists whose employer provides Dragon as part of the practice's software stack. Clinicians who need deep EHR integration and extensive clinical vocabulary.

Honest limitation: Expensive for individual practitioners. Cloud-based processing means audio leaves your device. The interface feels dated compared to modern tools. Overkill for therapists who primarily write progress notes rather than lengthy medical reports.

Apple Dictation

Free on every Mac, iPhone, and iPad. Works in every app. For therapists on a tight budget, it is the starting point.

Apple Dictation handles basic clinical documentation adequately. Simple sentences about client presentation, mood observations, and treatment plans transcribe well. It struggles with clinical terminology: DSM-5 diagnoses, medication names, and assessment instruments are frequently mis-transcribed.

On newer Macs and iPhones, Apple Dictation processes on-device, which keeps audio off Apple's servers. On older devices, audio is sent to Apple for processing. Apple does not offer Business Associate Agreements for Dictation, which creates a gray area for HIPAA compliance.

Best for: Therapists who need a free, zero-setup option. Simple progress notes without heavy clinical terminology.

Honest limitation: No clinical vocabulary. Expect frequent errors on medical terms, medication names, and diagnostic language. No AI cleanup. No custom vocabulary. No HIPAA compliance guarantees. You will spend time correcting, which partially defeats the purpose of dictation.

AI Note Generators (Freed, Nuance DAX)

A newer category of tools goes beyond dictation: they record the entire session and use AI to generate a complete clinical note. You do not dictate anything. The AI listens to the therapeutic conversation and produces a SOAP or DAP note from the session content.

These tools are impressive but raise significant ethical and clinical questions. They record client conversations, which requires informed consent. They send session audio to cloud servers for processing. They use AI to make clinical judgments about what to include in the note. And they insert themselves into the therapeutic space in a way that some clinicians and clients find uncomfortable.

Best for: Clinicians who want fully automated documentation and are comfortable with the privacy and clinical implications. Practices that have already integrated AI documentation into their workflow and obtained client consent.

Honest limitation: Expensive ($99+/month). Records client sessions, requiring explicit informed consent. Cloud processing of sensitive clinical conversations. The AI-generated note may not capture nuances that the clinician would include. Still requires clinician review and editing. Not appropriate for all therapeutic modalities or client populations.

The Between-Session Workflow

The most practical way to use dictation for therapy notes is the between-session workflow. Here is how it works:

  1. During the session: Be fully present with your client. If you must, jot 2-3 keywords on a notepad: specific topics discussed, direct quotes, observable behaviors, assessment scores. Do not dictate during the session.
  2. Session ends: Walk your client to the door. Return to your desk.
  3. Immediately dictate (2-3 minutes): Open your EHR to the client's chart. Click into the note field. Press your dictation shortcut and speak the note. Use your keywords as prompts. The session is fresh and details are vivid. Speak in complete sentences using clinical language.
  4. Quick review (1-2 minutes): Scan the transcribed note. Fix any errors. Use AI cleanup if available. Verify that clinical terms transcribed correctly. Sign the note.
  5. Transition (2-3 minutes): Use the remaining time before your next client for water, restroom, and mental preparation. You are done with documentation for that session.

This workflow means you finish your last session of the day and you are done. No note stack. No evening documentation. No weekend catch-up. That alone is worth the investment in a dictation tool.

Clinical Terminology Accuracy

The difference between generic and clinical dictation tools shows up most clearly in terminology:

Spoken TermApple DictationVoice Keyboard ProDragon Medical
"PHQ-9 score"PHQ nine scorePHQ-9 scorePHQ-9 score
"sertraline 50 mg"Sir trailing 50 mgsertraline 50 mgsertraline 50 mg
"EMDR processing"EMD are processingEMDR processingEMDR processing
"GAD-7 elevated"GAD seven elevatedGAD-7 elevatedGAD-7 elevated
"affect was dysphoric"effect was dysphoricaffect was dysphoricaffect was dysphoric
"CBT and ACT"CBT and actCBT and ACTCBT and ACT

The "affect vs effect" distinction is a perfect example of why clinical vocabulary matters. In clinical documentation, "affect" (noun, meaning emotional expression) is used constantly. Generic dictation tools default to "effect" every time because it is far more common in general English. Clinical tools learn that in a therapy note context, "affect" is almost always the correct transcription.

Privacy and HIPAA Considerations

Voice-to-text for therapy notes involves dictating protected health information. This means HIPAA rules apply. Here is a practical breakdown:

On-device processing (Voice Keyboard Pro, newer Apple Dictation)

Audio is processed on your Mac or iPhone. No third party ever hears or stores the audio. No server receives your clinical observations. From a HIPAA perspective, this is the simplest option because there is no "business associate" handling PHI. Your device is processing your data just like your keyboard does when you type.

Cloud processing with BAA (Dragon Medical One, Freed)

Audio is sent to the vendor's servers for processing. The vendor signs a Business Associate Agreement, making them legally responsible for protecting the PHI. This is HIPAA-compliant but requires that you have a signed BAA on file. If you use one of these tools, make sure the BAA covers voice data specifically, not just text records.

Cloud processing without BAA (most consumer tools)

Generic dictation tools like Google Voice Typing, most free transcription apps, and older Apple Dictation send audio to servers without a BAA. Using these for clinical documentation creates a HIPAA risk. The audio containing PHI is processed by a company with no legal obligation to protect it under HIPAA rules.

For more on privacy-first dictation, see our offline voice-to-text guide. For the clinician-specific workflow, see our clinician workflow guide.

Tips for Clinical Dictation

Frequently Asked Questions

Is voice-to-text HIPAA compliant for therapy notes?

It depends on where audio is processed. Tools that process on-device (like Voice Keyboard Pro) never send PHI to external servers, which avoids the main HIPAA concern. Cloud-based tools like Dragon Medical One offer HIPAA-compliant plans with Business Associate Agreements. Always verify the specific tool's data handling before using it for patient documentation.

Can voice-to-text handle clinical terminology accurately?

Generic tools struggle with clinical terms. Apps designed for healthcare (Dragon Medical) or that adapt to your profession (Voice Keyboard Pro) handle clinical vocabulary well: DSM-5 diagnoses, medication names, therapeutic modalities, and assessment instruments transcribe correctly. Custom vocabulary features help with practice-specific terms.

Should I dictate during the therapy session or after?

After. Dictating during a session reduces therapeutic presence and can make clients uncomfortable. Most therapists dictate in the 5-10 minute gap between sessions. Some jot brief keywords during the session, then use those as prompts for dictation afterward.

How long does it take to dictate a therapy session note?

Most therapists report dictating a complete SOAP or DAP note in 2-4 minutes, compared to 10-15 minutes of typing. A progress note for a 50-minute session typically runs 200-400 words, which takes about 90 seconds to speak. With review and corrections, expect 3-5 minutes per note.

What is the best free dictation tool for therapists?

Apple Dictation. It is free, works in every app, and handles basic documentation. It struggles with clinical terminology, so expect more corrections. For therapists in private practice with limited budgets, it is a reasonable starting point before investing in a clinical tool.

Can I use voice-to-text with my EHR system?

Yes. System-wide dictation tools (Voice Keyboard Pro, Apple Dictation, Dragon) work in any text field on your computer, including web-based EHR systems like SimplePractice, TherapyNotes, and Jane App. Place your cursor in the note field and dictate. No special integration required.

You became a therapist to help people, not to spend your evenings typing notes. Dictation gives you those evenings back.

Try Voice Keyboard Pro for clinical dictation: voicekeyboardpro.com. On-device processing, clinical vocabulary, and works directly in your EHR.