VoxisLive Download

**VoxisLive for Meetings — Real-Time Audio Translation Without a Bot**

VoxisLive translates your meeting audio into spoken English — or any supported language — in real time on Windows, without sending a bot into the call. It captures system audio directly using WASAPI loopback, so nothing joins your Zoom, Teams, or Google Meet session. Translation is private, instant, and works with every conferencing app.

The problem with meeting translation today

Sitting inside a conference call conducted in a language you don't fully understand is exhausting. You catch fragments, you miss the nuance of a question, and you spend half the meeting reading machine-generated subtitles that scroll past before you can parse them. The standard industry answer — a translation bot that joins the call as a participant — creates its own set of problems. Your colleagues can see it. Your host may not allow third-party attendees. And you've just handed a recording of your conversation to a third-party server the moment the bot connects.

VoxisLive takes a different approach. It sits on your Windows machine, listens to what your speakers are already playing, and speaks the translated audio back to you — privately, without ever touching the call itself.

How does VoxisLive translate Zoom, Teams, and Google Meet audio on Windows?

VoxisLive uses Windows WASAPI loopback capture to read audio directly from your sound card's output stream. When someone speaks in a meeting, the audio travels from the conference app to your speakers or headphones as it normally would. VoxisLive intercepts that stream at the OS level, passes it through on-device speech detection to isolate speech from background noise, then sends the speech segment to Gemini Live for translation. The result comes back as spoken audio in your chosen language within seconds.

Because capture happens at the OS audio layer — not inside the conferencing app — the process is invisible to every participant in the call. No bot joins. No extra attendee appears in the participant list. No meeting notification fires. Zoom, Microsoft Teams, Google Meet, Webex, and any other conferencing software running on Windows work identically; VoxisLive does not distinguish between them.

Why does it matter that no bot joins the call?

Meeting bots used by competing translation services — including JotMe, Transync, Wordly, and DeepL Voice — work by programmatically connecting a second participant to your call. That participant has a microphone feed, generates a recording, and is visible in your meeting. Several enterprise video platforms flag or block unrecognized bot accounts by default. Corporate security policies at many organizations explicitly prohibit third-party bots from joining internal calls.

When no bot joins, none of those problems arise. Your IT administrator does not need to whitelist a service. Your legal team does not need to review a new data processor. The call host does not need to grant entry to an unknown attendee. And the audio never leaves your local machine until you initiate a translation request under your own Gemini API key — or under the managed-minutes plan you control on your account.

For regulated industries, healthcare conversations, legal proceedings, or any meeting where confidentiality is a genuine concern, bot-free translation is not a convenience feature. It is the minimum viable standard.

Does VoxisLive support two-way conversation during a meeting?

Yes. VoxisLive includes a two-way meeting mode designed for live conversations rather than passive listening. In this mode, the app handles both directions: it translates incoming speech from other participants into your language via the system audio stream, and it listens to your microphone input to translate your spoken responses into the meeting language before you speak. You can hold a full bilingual conversation without either party needing to switch languages, pause, or use a separate interpreter line.

The mode is particularly useful for international sales calls, cross-border HR interviews, and technical support sessions where both participants need to respond naturally rather than waiting for a human interpreter.

What languages and meeting platforms does VoxisLive support?

VoxisLive supports the language pairs made available through Gemini Live, which covers the major European, East Asian, South Asian, and Middle Eastern languages used most frequently in international business. The translation engine is not tied to any specific conferencing platform. Because the app works at the audio-output layer of Windows, it translates whatever your speakers produce — meaning it works equally well with Zoom, Microsoft Teams, Google Meet, Webex, Slack Huddles, Discord calls, or a browser-based conferencing tool you've never heard of.

No plugin installation, no browser extension, and no meeting-platform integration is required. Install VoxisLive on your Windows machine, select the audio output device you're using for the meeting, choose your target language, and start translation.

How to get started with meeting translation on VoxisLive

Getting VoxisLive running for your next meeting takes under five minutes. Download the Windows installer, run it, and open the app. Select your audio output as the capture source — VoxisLive will list every active playback device on your system. Choose your translation direction. If you are on the Developer plan, paste your Gemini API key into settings; if you are on Creator or Pro, your managed minutes are already allocated and no key is needed. Start a translation session, then open your meeting as you normally would.

The app runs in the background in your system tray and produces no visible overlay on your screen, so your meeting window remains uncluttered. You hear the translated voice through a secondary audio channel — either a second output device or a software mixer — while the original audio continues simultaneously if you prefer to follow both.

Learn more about how the capture and translation pipeline works on the how it works page, compare plans and minute allocations on the pricing page, or go straight to the download page to get the installer.

Common questions

Can other participants in my Zoom or Teams meeting see that I'm using VoxisLive?

A: No. VoxisLive captures audio from your Windows sound output using WASAPI loopback — a standard OS audio API — and never connects to the meeting itself. Nothing joins the call, no participant entry appears, and no meeting-platform notification fires. The translation process is entirely local to your machine until a Gemini API request is made.

Does VoxisLive work with platforms like Google Meet or Webex, or only Zoom and Teams?

A: VoxisLive works with every conferencing app on Windows because it operates at the system audio layer, not inside any specific app. Zoom, Microsoft Teams, Google Meet, Webex, Discord, Slack Huddles, and browser-based meeting tools all produce audio through your Windows sound card. VoxisLive captures that audio regardless of which app generated it.

What is the difference between VoxisLive and a meeting translation bot?

A: A translation bot joins your meeting as a second participant, receives the audio feed from the conferencing platform's servers, and typically records the session on its own infrastructure. VoxisLive instead reads the audio your speakers are already playing via the Windows WASAPI loopback API. No second participant joins, no recording is made by a third party, and no conferencing-platform permission is required.

Is VoxisLive suitable for confidential or legally sensitive meetings?

A: VoxisLive's architecture is designed with privacy in mind. Audio is captured locally using WASAPI; on-device speech detection runs on your machine before any data leaves it. Translation requests go to Gemini Live under either your own Gemini API key (BYOK, Developer plan) or under VoxisLive's managed-minutes plans, where you retain control of your account data. No bot connects to the call. For the highest-sensitivity use cases, the BYOK Developer plan keeps the full data flow under the account policies you set directly with Google.

---

Hear every language, in real time.

Download