Topics

Why your voice shouldn't transit a US cloud

The argument isn't paranoid. It's about what happens to audio that mentions a child's name, a patient, a source - on infrastructure where the rules are written by the host.

Privacy
Cloud
Sovereignty
Julia Yukovich
Julia YukovichCo-Founder + CEO
·May 2, 2026·
2 min read

A voice clip is the densest piece of personal data you can produce in one tap. Treating it the way we treat a Google Doc is a category error.

What 'cloud transcription' actually means

When you hit record in a typical transcription app, the audio bytes leave your device, traverse a CDN, land in object storage in a US AWS region, and are read by a transcription service hosted in the same region. The app you see in the browser is a thin viewer over that pipeline. The contract you sign in the Terms of Service is the only thing standing between that audio and any other use the host wants to put it to.

Why a voice clip is denser than a document

A typed document gives away words. A voice clip gives away: words, vocal stress, accent, age, sometimes gender, ambient noise (so the room, possibly the city), background voices (other people in the room, unconsented), and timing patterns that biometrically identify the speaker. We are collectively unprepared for how much information an unrehearsed 90-second voice memo contains.

What the law actually does and doesn't do

GDPR is strong when both the data subject and the controller are in the EU. It gets weaker the further the data physically travels. The post-Schrems-II legal landscape technically allows EU-to-US transfer under a patchwork of mechanisms, but the practical recourse if something goes wrong - notification, takedown, damages - depends on which jurisdiction's court you're in front of and how much money you can spend. The cheaper option is: don't let the data leave.

The alternative is boring and works

Sprachmemo runs on Hetzner in Falkenstein, Germany. The Vosk transcription model runs in the same datacentre as your audio file - no network hop between storage and inference. The transcription engine is open source so anyone can audit what it does and doesn't send anywhere. There's nothing exotic about this; we just chose a stack where 'audio stays where you put it' is the default instead of a paid feature.

FAQ

Frequently asked

Try Sprachmemo

Free plan, no credit card. We host in Germany. You can export and delete everything self-serve.

Julia Yukovich

Written by

Julia Yukovich

Co-Founder + CEO

Julia is one of the Co-Founders. She handles design, product direction, and most of the support replies that arrive in the morning.

julia.yukovich at aicuflow dot comLinkedIn