Productivity

Voice first workflow: a day in our voice-first workday

A practical founder walkthrough of where voice saves time, where it fails, and how we recover.

Shuran Zhou, Founder · 2026-05-03 ·6 min ·Updated 2026-05-03

TL;DR

A voice first workflow is not voice-only work. Loqua is a Mac-native voice typing tool, and our practical pattern is voice for first-pass intent, keyboard for exact manipulation, and app-aware formatting everywhere in between. This is a normal voice-first workday using Loqua across inbox, standup, code review, specs, Slack, and journaling.

I am Shuran, founder of Loqua.ai. This is the honest version of our voice first workflow: where it saves time, where it fails, and what I do instead. The goal is not to perform productivity. The goal is to reduce the distance between a thought and the artifact that moves work forward. By the end of a day like this one, the typing-versus-speaking question is mostly invisible; the work itself is what I notice.

8am inbox

I start with Spark Mail and Slack. Voice works well here because replies are mostly intent and tone. I dictate the rough answer, Loqua removes filler, and the app-aware layer keeps the output shorter in Slack than in email.

I say

"tell her Thursday morning works and I can be flexible on timing but I'd prefer before noon if possible"

Loqua writes (in Spark)

Thursday morning works for me, and I can be flexible on timing. Before noon would be ideal if possible.

The inbox block usually takes fifteen to twenty minutes. About two-thirds of replies are dictated; the rest are typed because they need exact wording, a list of links, or a delicate explanation. The split is not a rule. It is what the morning ends up being once I stop forcing one tool to do the other tool's job.

Where voice fails: confidential threads in public spaces. If I am in a café and the message contains sensitive context, I type. Voice first does not mean privacy second.

Standup

For standup, I speak three slots: shipped, today, blocker. This is one of the highest-return daily voice typing workflow patterns because the structure repeats and the content changes.

I say

"standup shipped validator phase today writing productivity posts blocker only H4 forward refs until phase three"

Loqua writes (in Slack)

Shipped: validator phase.
Today: writing productivity posts.
Blocker: expected H4 forward references until Phase 3.

The trick is not to ramble. I say the slot names out loud. Loqua uses them as structure. The standup itself takes maybe thirty seconds to write and another minute to read what the rest of the team posted. Voice does not save much time on each individual standup; it saves the cost of putting it off, which used to be the larger drag on the day.

Code review

Code review is mixed. Voice is excellent for explaining the concern; keyboard is better for line-level precision. I dictate review summaries and high-level comments, then use the mouse and keyboard for exact file navigation.

A good voice review comment says why the issue matters, not just what line is wrong. For example: "This cache key ignores locale, so Chinese and English results can collide. Can we include locale in the key and add a regression test?" That comment is much easier to speak than to type while reading code.

I say

"comment this guard handles the missing token case but we still leak the partial state into the metrics path can we move the early return above the increment"

Loqua writes (in GitHub PR comment)

This guard handles the missing-token case, but we still leak partial state into the metrics path. Can we move the early return above the increment?

Spec draft

Spec drafting is where voice first workflow feels closest to thinking. I dictate goal, non-goals, user flow, edge cases, and acceptance criteria. Then I edit headings with the keyboard. For a larger spec, I start in Obsidian; for implementation tasks, I move the cleaned version into Claude Code or Cursor.

I say

"spec goal add blog productivity cluster non goal no template changes acceptance validator passes once H4 exists and hub has exactly one featured"

Loqua writes (in Obsidian)

## Goal
Add the blog productivity cluster.

## Non-goals
No template changes in this phase.

## Acceptance
- Validator passes once H4 exists.
- Hub has exactly one featured post.

The first dictated draft is rarely the spec we ship. It is the scaffold that makes the keyboard editing obvious. The trick I keep relearning is that the dictated version surfaces missing sections faster than a typed outline would, because saying "acceptance" out loud forces me to follow it with at least one criterion.

Slack threads

Slack is where tone matters. Voice can sound too blunt if the cleanup model only transcribes. Loqua's destination formatting keeps replies short but not cold. I still read before sending; voice should accelerate judgment, not replace it.

One pattern that took a while to learn: dictate the warm version, not the efficient version. Slack reads better when the first sentence acknowledges the person and the second sentence gets to the point. A typed reply tends to skip the first sentence. A dictated one usually keeps it, and the thread is healthier for it.

Where voice fails: when a thread requires careful quoting or multiple links. I type those. The hybrid rule is simple: use voice for the argument, keyboard for references.

End-of-day journal

At the end of the day, I dictate what surprised me. This is not a status update. It is memory capture: what changed my mind, what was harder than expected, and what I should not forget tomorrow. Obsidian is the destination because it is searchable and linkable.

A typical journal entry runs three short paragraphs and takes about five minutes. The interesting pattern is that the most valuable entries are about the small surprises, not the big decisions. The big decisions get written down anyway, often more than once. The small surprise — the API that returned in a different shape than the docs implied, the user comment that contradicted my model — is the one that disappears by morning if not captured.

When voice didn't work today

Two examples from the same day. First, a dense code refactor in a busy file. I tried dictating the renaming plan into the editor and the model kept getting one identifier wrong because the visible context was scrolling faster than the listener could catch up. I switched to typing. Voice was the wrong tool because the cursor was moving too quickly for context to stabilize.

Second, a tense Slack thread where the right reply was three sentences and zero adjectives. I dictated, the cleanup added one polite hedge, and the message ended up reading softer than I wanted. I rewrote it by hand. The lesson is that voice is good for warmth and bad for deliberate coldness; when you need a flat message, type it.

For broader stack details, see our voice productivity stack. For the argument behind the habit, see why your keyboard is the wrong tool for thinking with AI. External references that shaped our Mac workflow include Apple Dictation and the Linear docs.

Frequently asked questions

What is a voice first workflow?

A voice first workflow uses speech as the default capture method for intent, drafts, replies, and status updates. It is not voice-only. In practice, voice handles first-pass thinking and structured text, while keyboard and mouse handle exact edits and navigation.

What parts of the workday are best for voice?

Inbox replies, standups, code review summaries, spec drafts, Slack updates, and end-of-day journals are all strong fits. They involve natural-language explanation and repeated formats, which lets Loqua turn rough speech into useful text quickly.

Where does voice fail during the day?

Voice fails when privacy is risky, when the task requires exact line-level edits, or when you need to insert many links and quotes. In those cases I switch to keyboard. A mature voice workflow includes explicit fallback points.

Do you use voice for code itself?

Sometimes for comments, docstrings, commit messages, and prompts to code agents. I do not dictate large code blocks by voice. Code still benefits from keyboard precision, editor completions, and tests.

How do you keep dictated Slack from sounding weird?

I speak the honest version, then Loqua cleans tone for the destination. I still read before sending. The goal is to remove friction, not to automate judgment or send unreviewed text.

How should a team adopt voice workflows?

Start with low-risk repeated artifacts: standups, PR descriptions, meeting follow-ups, and issue descriptions. Do not mandate voice. Let each person decide where it helps and where typing remains better.

Does voice work in an open office?

Partially. The most useful slots become the ones you can dictate quietly: standup, journal entry, and a couple of focused prompt blocks. The high-frequency Slack replies and inbox messages tend to shift to typing. The workflow still survives; the mix just changes.

Try Loqua today

Free to start. Mac native. Built by algorithm researchers who use it every day.

Download

More from the Loqua Blog

Productivity

Voice for thinking with AI: why your keyboard is the wrong tool

Productivity

Voice productivity stack: 9 tools we actually use to write, ship, and think

How-to

Mac meeting notes voice: from voice to done with notes and action items

Engineering

Omni-modal voice typing: multimodal understanding, MoE, and streaming text output

Compare

Loqua vs Wispr Flow: a Mac-first Wispr Flow alternative for context, coding, and privacy