Voice Typing for Dissertation Writing: How PhD Students Beat the Blank Page

All posts

Every PhD student knows the feeling. You sit down in front of your dissertation, open the chapter you have been avoiding, and produce 200 words in three hours. Most of those words you delete by the end. The next morning you tell yourself today will be different, and it usually is not. The problem is rarely that you do not know your topic. The problem is that the writing process itself is hostile to the way you actually think.

Voice typing is not a magic solution to the dissertation problem. It is, however, one of the few tools that addresses the actual bottleneck: the gap between thinking speed and typing speed.

Why Dissertations Get Stuck

Dissertation chapters fail in a predictable pattern. You have read the literature. You have your data or your theoretical argument. You can explain your contribution clearly to a friend over coffee. But the moment you try to put it into formal written prose, three things happen at once.

First, you start editing while drafting. You write a sentence, immediately look at it, and revise it before the paragraph is finished. The internal critic and the internal author are both in the room, fighting for the same chair. Second, your typing speed cannot keep up with your thinking, so by the time you have typed the first half of an argument, you have lost track of where the second half was going. Third, the cognitive load of formatting, citation insertion, and academic register pulls you out of the flow state where good ideas live.

The result is that you write at maybe 15 percent of the rate at which you can speak about the same material. A PhD student who can verbally walk through their argument for an hour might produce just a page or two of text in the same hour at the keyboard.

How Voice Typing Closes the Gap

Most people speak at 130 to 160 words per minute and type at 35 to 60. That ratio is the mechanical reason voice typing works. But the cognitive reason is more interesting. When you speak, you are forced to commit to a sentence and move on. You cannot easily backspace your speech. This constraint, which feels like a limitation at first, is actually what unlocks momentum.

Voice typing reframes drafting as a closer cousin to talking, and talking is something you have done your whole life without freezing. You can explain your hypothesis to a colleague in a hallway without writer's block. Voice typing essentially reproduces that hallway conversation, except the words land directly in your document.

A Workflow That Actually Works for Dissertation Drafting

The PhD students who get the most out of voice typing tend to converge on a similar pattern. It looks like this.

1. Outline First, in Writing

Voice typing is for prose generation, not for outlining. Build the structural skeleton of your chapter the normal way, with headings, subheadings, and a sentence under each one describing what that section will argue. The skeleton serves as the railing you grip while dictating.

2. Speak in Paragraph-Sized Chunks

Open Steno, hold the hotkey, and dictate one paragraph at a time. Do not try to dictate an entire section in one breath. The paragraph is the natural unit of an argument, and treating it as the dictation unit gives you a moment between bursts to glance at the outline and remember where you are headed. After each paragraph, release the key, watch the text appear, take a breath, and start the next.

3. Do Not Edit During the First Pass

Resist the urge to fix the transcription as it appears. Voice typed first drafts will have stylistic roughness, repeated words, and the verbal tics you do not realize you have. That is fine. The goal of the dictation pass is to get the argument out of your head. Editing happens later, in a separate session, with the keyboard.

4. Save Citations for Editing

Do not try to dictate citations during your draft pass. Use placeholders like [CITE Smith 2019] or just say "cite Smith twenty nineteen here" and move on. When you switch to editing mode, you can search for the placeholders, look up the references, and insert them properly using your reference manager.

Handling Academic Vocabulary

One concern that comes up immediately is whether voice typing can handle the technical vocabulary of an academic field. The honest answer is that it depends on the field, and that custom vocabulary is the lever that solves it.

Steno includes a custom vocabulary feature where you can paste in a list of terms specific to your field. A neuroscience PhD might add hippocampus, prefrontal cortex, dorsolateral, GABAergic, and the names of the assays they use most often. A philosophy student might add Heidegger, phenomenology, intentionality, and the German technical terms that come up regularly. Once added, these terms are recognized consistently. The transcription engine learns the vocabulary you actually use rather than guessing from the general English distribution.

For author and citation names, the workflow above (placeholders during draft, real citations during edit) sidesteps the problem entirely. You should not be trying to dictate "as Føllesdal argues in his 1968 paper" anyway, because you would be looking up the spelling either way.

What About Footnotes, Equations, and Tables?

Voice typing handles prose. It does not handle equations, tables, or LaTeX commands well, and it is not supposed to. The natural division of labor is to dictate the prose sections and switch back to the keyboard for the structured content. In a typical chapter that is 80 to 90 percent prose, this division still represents an enormous time savings.

For LaTeX users, the workflow is the same. Dictate paragraphs into your editor, then keyboard in the math environments and citation commands during the editing pass. Steno works in any text editor, including TeXShop, VS Code, Overleaf in the browser, and Vim, because it operates at the keyboard level rather than requiring an integration.

The Discipline of Talking to Yourself

The strangest thing about voice typing is the social dimension. You will feel self-conscious the first time you sit alone in your office talking to your laptop. That self-consciousness disappears within about two days, and it is replaced by something more interesting: a much sharper awareness of how you actually sound when you make an argument.

This turns out to be useful. Many dissertation chapters are written in a stiff, defensive register that distances the reader from the argument. When you dictate the same content, your natural speaking voice produces prose that is clearer, more direct, and easier to read. Your committee will thank you.

Getting Started Tomorrow Morning

Pick the section of your current chapter that you have been avoiding most. Open Steno, find a quiet room, hold the hotkey, and try to explain that section out loud as if you were talking to a fellow grad student in your program. Do one paragraph. Then another. After 30 minutes, look at what you have produced. It will not be your final draft. It will, almost certainly, be more than you have written all week.

Steno is available as a free download for macOS at stenofast.com. The free tier covers enough daily dictation for most graduate students to draft a full chapter section per day, which is roughly the rate at which dissertations actually get finished.

The dissertation does not require you to type more. It requires you to think clearly and commit your thoughts to the page. Voice typing removes the friction between those two acts and lets the chapter you have been carrying around in your head finally land on the screen.