Tap2Talk
Voice transcription at your fingertips - Instantly convert speech to text with a simple keyboard shortcut
README
Tap2Talk
โโโโโโโโโ โโโโโโ โโโโโโโ โโโโโโโ โโโโโโโโโ โโโโโโ โโโ โโโ โโโ
โโโโโโโโโ โโโโโโโโ โโโโโโโโ โโโโโโโโ โโโโโโโโโ โโโโโโโโ โโโ โโโ โโโโ
โโโ โโโโโโโโ โโโโโโโโ โโโโโโโ โโโ โโโโโโโโ โโโ โโโโโโโ
โโโ โโโโโโโโ โโโโโโโ โโโโโโโ โโโ โโโโโโโโ โโโ โโโโโโโ
โโโ โโโ โโโ โโโ โโโโโโโโ โโโ โโโ โโโ โโโโโโโโ โโโ โโโ
โโโ โโโ โโโ โโโ โโโโโโโโ โโโ โโโ โโโ โโโโโโโโ โโโ โโโ
Talk in ANY app, type nothing. Works everywhere you type - VSCode, Terminal, Slack, Browser, anywhere. Press a key, speak, and watch your words appear instantly where you're working.
"I believe talking to computers is the future. With AI getting smarter, why are we still typing everything? I built Tap2Talk for myself because I wanted to speak naturally in ANY application without switching tools. Now I'm sharing it with you." - @unclecode
Built by @unclecode, creator of Crawl4AI (50K+ stars).
Works Everywhere
โ
Code Editors - VSCode, Sublime, Vim, IntelliJ, any IDE
โ
Terminal - Dictate complex commands without typing
โ
Chat Apps - Slack, Discord, Teams, WhatsApp Web
โ
Browsers - Gmail, ChatGPT, Google Docs, anywhere on the web
โ
Note Taking - Notion, Obsidian, Apple Notes, OneNote
โ
Literally Anywhere - If you can type there, you can talk there
Features
- One Shortcut for Everything: Press
Ctrl+Space(double-tap Space) to start recording anywhere - Instant Paste: Your words appear right where your cursor is - no copy/paste needed
- Powered by Groq Whisper: Fast and accurate transcription using state-of-the-art AI
- Customizable Shortcuts: Set your own recording and abort shortcuts
- Cross-Platform: Works on macOS, Windows, and Linux
- Background Service: Runs quietly in the background, always ready
- System Tray Integration: Clean interface with status indicators
Groq API Setup
Tap2Talk currently uses Groq's lightning-fast Whisper API for transcription. Support for OpenAI Whisper, local models, and other providers coming soon!
Why Groq?
- โ Free tier available - Perfect for personal use (30 requests/min, 14,400 requests/day)
- โ Blazing fast - Near-instant transcription
- โ High accuracy - Powered by Whisper Large v3
- โ No credit card required - Start immediately
Get Your API Key (2 minutes)
- Create Account: Visit console.groq.com and sign up with Google or GitHub
- Generate API Key: Navigate to "API Keys" โ Click "Create API Key" โ Name it "Tap2Talk"
- Copy Your Key: Copy the key (starts with
gsk_) - Tap2Talk will ask for it on first run
The free tier includes 30 requests per minute and 14,400 requests per day - more than enough for regular use!
Coming Soon
- OpenAI Whisper API support
- Local Whisper models (no internet required)
- Other transcription providers
Installation
npm install -g tap2talk
Quick Start
-
Run interactively (recommended for first time):
tap2talk -
Start as background service:
tap2talk start -
Check status:
tap2talk status
Commands
tap2talk- Interactive mode with menutap2talk start- Start as background servicetap2talk stop- Stop background servicetap2talk status- Check service statustap2talk restart- Restart servicetap2talk logs- Show recent logstap2talk update- Check for updates
Setup
On first run, you'll need to:
- Get a free API key from Groq Console
- Enter it when prompted
- Grant accessibility permissions (macOS) for global shortcuts
Default Shortcuts
- Start Recording:
Ctrl+Space(hold Ctrl, double-tap Space) - Abort Recording: Double-tap
Escape
Both shortcuts are fully customizable in Settings.
How It Works
- Press your recording shortcut
- Speak naturally
- Recording stops automatically or press shortcut again
- Text is transcribed and pasted instantly
Status Indicators
The system tray shows real-time status:
Tap- Ready[*] REC- Recording[~] PRO- Processing[OK] DON- Done[X] ERR- Error[_] ABT- Aborted
Configuration
Settings are stored in ~/.tap2talk/config.json
Requirements
- Node.js 14 or higher
- Groq API key (free)
- Microphone access
Privacy
- Audio is processed via Groq API
- No audio is stored locally after transcription
- Settings and logs stay on your machine
Troubleshooting
Service won't start
tap2talk stop
tap2talk start
Permission issues (macOS)
Go to System Preferences โ Security & Privacy โ Privacy โ Accessibility
Add and enable Terminal/iTerm
Check logs
tap2talk logs
Contributing
Issues and PRs welcome at github.com/unclecode/tap2talk
License
MIT ยฉ unclecode
Built with inspiration from productivity needs and powered by Groq Whisper API.
