All posts

Photographers spend a strange amount of time typing. The shoot is the visible part of the job, but the invisible part — captions, keywords, client galleries, invoices, location scout notes, retouch instructions — often takes longer than the shutter clicks themselves. A wedding photographer might spend forty hours editing for every eight-hour event, and a sizeable chunk of those forty hours is text production, not image production.

The strange thing is that most of this writing happens with your eyes locked on a 6K monitor, comparing images, evaluating skin tones, or scanning a contact sheet. Looking down at the keyboard to type a caption pulls your attention away from the very thing you are trying to describe. Dictation removes that break in attention, letting you describe images while you are still looking at them.

Where Photographers Lose Hours to Typing

Lightroom Keywording and Metadata

Stock photographers, editorial shooters, and anyone who licenses work knows the pain of keywording. Every image needs ten to thirty descriptive keywords for it to be searchable in agency databases. A 1,500-image travel shoot can take a full day of keywording alone. Dictation makes this dramatically faster because you can keep your eyes on the image and just say what you see. "Beach sunset Bali Indonesia, two figures silhouette, palm trees foreground, orange sky, golden hour, vertical composition." That sentence becomes a clean keyword list in seconds rather than a minute of typing.

Client Gallery Descriptions

Premium clients increasingly expect a short narrative with their delivered galleries — a paragraph or two about the day, the location, the moments captured. These are easy to write but tedious to type, especially when you have already spent ten hours at the screen. Dictating a heartfelt note about a wedding while looking at the couple's portraits produces warmer, more specific writing than fighting the keyboard at midnight.

Retouching Instructions for Outsourced Edits

Many working photographers send selects to a remote retouching team. The notes that go with each image — "remove power line top-left, even skin under the eyes, lift shadows in the dress" — are exactly the kind of short, technical, observation-driven writing that dictation handles best. The photographer points at what they see, voices the instruction, and moves to the next image without ever lifting their hands from the wacom pen or trackpad.

Shot Lists and Pre-Shoot Briefs

Before a commercial or wedding shoot, the photographer typically writes out a detailed shot list. This work happens in the car, in coffee shops, between meetings — places where laptop typing is awkward. A photographer who can dictate shot lists into a notes app on the way to a venue saves an evening of catch-up work.

Email and Client Communication

The hidden writing work of a working photographer is email. Inquiry replies, follow-ups, contracts, scheduling, late-night reassurances to anxious brides. Voice typing turns most of these into thirty-second tasks. For a sole proprietor, that alone can save five to ten hours a week.

What Makes Dictation Work in a Photography Workflow

It Has to Sit in Front of Every App

Photographers do not live in one application. Lightroom, Capture One, Photo Mechanic, Photoshop, Bridge, a CRM, a gallery delivery platform, email, accounting software — each tool needs text dropped into it. A dictation system that only works inside one app is useless. Voice Keyboard Pro was built to work as a system-wide voice keyboard on macOS, so when your cursor is in a Lightroom keyword field, that is where the text lands. Same for Capture One IPTC fields, the caption box in a gallery uploader, or a client email draft in Mail.

It Has to Handle Photographic Vocabulary

Generic dictation tools mangle photography terms. They turn "bokeh" into "bouquet," "rim lighting" into "ring lighting," and "tilt-shift" into nonsense. A dictation tool that lets you add custom vocabulary — lens names, lighting equipment, color science terms, the names of your regular clients — produces clean output the first time. The minute you spend building a vocabulary list pays back across every shoot you ever caption.

It Has to Respect Concentration

Editing is a deep-focus activity. A dictation system that requires you to click menus, manage modes, or remember what state it is in breaks concentration. A hold-to-speak hotkey, where you press a key, speak, and release, is the only interaction model that survives a long editing session intact. You never have to think about the dictation system. It is just there when you need it.

A Sample Editing Session

Imagine a wedding photographer culling 4,200 images down to 800 selects on a Tuesday morning. They are in Photo Mechanic, tagging keepers. As they work, they hold down the hotkey and dictate a caption into the IPTC field for each hero shot: "First look in the garden, groom turns, bride's mother in background." They move on. Three hours later, the cull is done and every hero shot already has a caption. That same task without dictation would have meant either typing every caption (slow) or doing a separate caption pass later (a whole extra session). The dictation pass folded an evening of work into the cull itself.

The same photographer then opens an email to the bride: "Hi Sarah, your gallery is on track for Friday delivery. I wanted to send a few favorites tonight while they're fresh." She holds the hotkey, speaks the rest of the message, releases, and the email is done. Total time: under a minute for a message that would normally have taken five.

Privacy Considerations for Photographers

Photographers often work with confidential material — celebrity portraits under embargo, wedding photos before delivery, commercial campaigns under NDA. Any dictation tool you bring into the workflow should be one you trust with that content. Look for tools that do not store your audio long-term and do not use your transcriptions to train models. Voice Keyboard Pro processes audio for transcription only, does not retain recordings on its servers, and does not train on user data. That matters for a profession built on discretion.

Getting Started

Most photographers start with the workflow that hurts most. For a stock shooter, that is keywording. For a wedding photographer, it is gallery descriptions and client email. For a commercial shooter, it is retouching notes. Pick one and use dictation for it exclusively for a week. By the end of the week, you will have a sense of where it earns its keep in your specific business.

Voice Keyboard Pro is a free download for macOS, with a Pro tier that unlocks unlimited dictation and a custom vocabulary tuned to your gear, your clients, and your style. The hold-to-speak interaction works inside Lightroom, Capture One, Photo Mechanic, Photoshop, every browser, every email client, and any other Mac app. You can install it and be dictating into Lightroom in under a minute from voicekeyboardpro.com.

Your eyes belong on the image, not the keyboard. Dictation closes the gap between what you see and what you write about it.