MEETINGS & CALLS

Real-time meeting translation, without a bot in your call.

VoxisLive captures system audio directly using WASAPI loopback, so nothing joins your Zoom, Teams or Google Meet session. Translation is private, spoken, and works with every conferencing app.

The problem with meeting translation today

Sitting in a call conducted in a language you don't fully understand is exhausting: you catch fragments, miss the nuance of a question, and spend half the meeting reading machine subtitles that scroll past before you can parse them.

The standard industry answer — a translation bot that joins the call as a participant — creates its own problems. Your colleagues can see it. Your host may not allow third-party attendees. And you've handed a recording of your conversation to a third-party server the moment the bot connects.

VoxisLive takes a different approach: it sits on your Windows machine, listens to what your speakers are already playing, and speaks the translated audio back to you — privately, without ever touching the call itself.

How does it translate Zoom, Teams and Meet audio?

VoxisLive reads audio from your Windows output device via WASAPI loopback. When someone speaks, the audio travels from the conferencing app to your speakers as it normally would; VoxisLive captures a copy of that stream at the OS level, isolates speech with on-device detection, and returns a spoken translation within seconds.

Because capture happens at the OS audio layer — not inside the conferencing app — the process is invisible to every participant. No bot joins, no extra attendee appears, no notification fires. Zoom, Microsoft Teams, Google Meet, Webex, Slack Huddles and Discord all work identically.
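
For the technically curious, the "isolate speech on-device" step can be pictured with a small sketch. This is an illustration under stated assumptions, not VoxisLive's actual detector: it assumes the captured audio arrives as 16-bit mono PCM frames at 16 kHz, and it uses a plain energy threshold where a real product would likely use something more robust. The names `frame_rms` and `speech_segments`, the threshold and the hangover length are all hypothetical.

```python
import math
import struct
from typing import Iterable, Iterator, List

SAMPLE_RATE = 16_000                             # assumed capture rate in Hz
FRAME_MS = 20                                    # assumed frame length
FRAME_SAMPLES = SAMPLE_RATE * FRAME_MS // 1000   # 320 samples per frame


def frame_rms(frame: bytes) -> float:
    """Root-mean-square amplitude of a 16-bit little-endian mono PCM frame."""
    count = len(frame) // 2
    if count == 0:
        return 0.0
    samples = struct.unpack(f"<{count}h", frame[: count * 2])
    return math.sqrt(sum(s * s for s in samples) / count)


def speech_segments(
    frames: Iterable[bytes],
    threshold: float = 500.0,   # hypothetical energy threshold
    hangover: int = 5,          # quiet frames tolerated before a segment ends
) -> Iterator[bytes]:
    """Yield runs of loud frames; leading and trailing quiet frames are dropped."""
    current: List[bytes] = []
    quiet = 0
    for frame in frames:
        if frame_rms(frame) >= threshold:
            current.append(frame)
            quiet = 0
        elif current:
            # Keep short pauses inside a segment so words are not clipped.
            current.append(frame)
            quiet += 1
            if quiet >= hangover:
                yield b"".join(current[: len(current) - quiet])
                current, quiet = [], 0
    if current:
        yield b"".join(current[: len(current) - quiet])
```

In a pipeline shaped like this, only the yielded segments would be passed on for translation, so silence between speakers never has to leave the machine.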

Why bot-free matters

Meeting bots used by competing translation services connect a second participant to your call: the bot receives the meeting's audio feed, typically generates a recording, and is visible in the participant list. Several enterprise platforms flag or block unrecognized bots by default, and many corporate security policies prohibit third-party bots outright.

With no bot, none of that arises: no IT whitelisting, no new data processor for legal review, no unknown attendee for the host to admit. For regulated industries, healthcare, legal proceedings or any genuinely confidential meeting, bot-free translation is not a convenience — it is the minimum viable standard.

Two-way conversations

Meeting mode handles both directions at once: incoming speech is translated into your language through your speakers, and your own responses are translated into the meeting language and injected through a virtual microphone. Both sides speak naturally — no pausing, no interpreter line. Ideal for international sales calls, cross-border interviews and support sessions.

Getting started takes under five minutes

  1. Install VoxisLive from the Microsoft Store.
  2. Select your meeting output device as the capture source.
  3. Choose your translation direction (79 languages).
  4. Start the session, then join your meeting as you normally would.

The app runs quietly alongside your call. Prefer your own key? The open-source BYOK (bring your own key) build keeps the full data flow under policies you set directly with Google. See how it works and pricing for details.

FAQ

Common questions

01. Can other participants see that I'm using VoxisLive?
No. VoxisLive captures audio from your Windows sound output using WASAPI loopback and never connects to the meeting itself. Nothing joins the call, no participant entry appears, and no platform notification fires.

02. Does it work with Google Meet or Webex, or only Zoom and Teams?
It works with every conferencing app on Windows, because it operates at the system audio layer rather than inside any specific app: Zoom, Teams, Meet, Webex, Discord, Slack Huddles and browser-based tools alike.

03. What's the difference between VoxisLive and a meeting translation bot?
A bot joins your meeting as a second participant, receives the platform's audio feed and typically records on its own infrastructure. VoxisLive reads the audio your speakers are already playing: no second participant, no third-party recording, no platform permission.

04. Is it suitable for confidential meetings?
Audio is captured locally and on-device speech detection runs before anything leaves your machine; no audio is retained after the session. For the highest-sensitivity cases, the open-source BYOK build sends audio directly to Google under your own key.

Free to try · 10 minutes on us

Hear every language, in real time.

Runs on Windows 10 and 11 — no drivers, no setup ritual, no bot in your call.