DeskTalk listens when you hold a button, transcribes what you said, and types it where your cursor is.
- Run
DeskTalk_0.2.0_x64-setup.exe - Follow installer prompts
- Launch from Start Menu
Option A - Winget (Easiest):
winget install Gyan.FFmpegOption B - Scoop:
scoop install ffmpegOption C - Manual:
- Download from https://ffmpeg.org/download.html
- Extract to
C:\ffmpeg - Add
C:\ffmpeg\binto PATH
- Open DeskTalk → License tab
- Click "Buy License" ($100 one-time)
- Complete payment via Stripe
- Check your email for license key
- Paste key in DeskTalk → Click "Activate"
For OpenAI (Recommended for Accuracy):
- Go to https://platform.openai.com/api-keys
- Create new secret key
- Copy the key (starts with
sk-...) - Paste in DeskTalk → Transcription tab
- Click "Validate"
For Local Model (Offline, Free):
- Select "Local Model" in Transcription tab
- Choose a model (e.g.,
base.en) - First use will download the model
- Open DeskTalk → General tab
- Click dropdown next to "Push-to-Talk Key"
- Select a key (e.g.,
F15,F13,CapsLock) - Click "Save Settings"
- Status should show "Running" (green)
- Click in any text field
- Hold your PTT key → Speak → Release
- Your words appear! 🎉
- Choose an unused key (F13-F24 recommended)
- Avoid keys you use often (don't use Space, Enter, etc.)
- Best options:
F15- Rarely used function keyF13- Good if you have extended keyboardCapsLock- If you never use it
- "Default Device" - Uses Windows default mic
- Or select specific microphone
- Test by recording and checking quality
- Capitalize first letter - Start transcriptions with capital
- Add space after - Auto-space between transcriptions
- Type characters instead of pasting ⓘ
- Slower but works in apps that block Ctrl+V
- Leave off for best speed
- Start with Windows - Launch DeskTalk on boot
- Start minimized to tray - Hide window on startup
- Dark mode - Easy on the eyes
Pros:
- Most accurate transcription
- Handles accents, background noise
- Fast processing
Cons:
- Costs money (~$0.006 per minute)
- Requires internet
- Sends audio to OpenAI
Setup:
- Get key from https://platform.openai.com/api-keys
- Paste in field
- Click "Validate" to test
- Click "View Usage & Billing" to monitor costs
Cost Example:
- 1000 minutes = ~$6
- Average user: $2-5/month
Pros:
- Free forever
- Works offline
- Private (audio stays on your PC)
Cons:
- Less accurate than OpenAI
- Slower on older PCs
- Larger download (~1GB for best model)
Models:
tiny.en- Fastest, least accurate (75 MB)base.en- Good balance (142 MB)small.en- Better accuracy (466 MB)medium.en- Best accuracy (1.5 GB)
- Active (Green) - Everything works
- Expired (Yellow) - Renew license
- Inactive (Red) - Need to activate
- Paste your license key
- Click "Activate"
- Key tied to this PC (3 devices max per license)
- Click "Deactivate" to free up device slot
- Useful for switching PCs or testing
Track your usage:
- Total Words - Words transcribed all-time
- Average WPM - Your speaking speed
- Transcriptions - Number of recordings
- Recording Time - Total time speaking
- Speak clearly (but naturally)
- Use a good microphone (not laptop built-in)
- Quiet environment (reduce background noise)
- Hold button first, then speak (avoid cutting off start)
- Pause briefly before releasing (avoid cutting off end)
Writing Emails:
- Use PTT for long paragraphs
- Faster than typing
- Natural conversational tone
Coding Comments:
- Quickly document code
- Explain complex functions
- Add TODOs
Documentation:
- Draft documents quickly
- Brainstorm ideas
- Meeting notes
Accessibility:
- RSI/carpal tunnel relief
- Hands-free input
- Faster than typing
- Click tray icon - Open settings
- Right-click tray icon - Quick menu
- Restart Engine
- Stop Engine
- Settings
- Quit
No transcription appearing:
- Check status is "Running" (green)
- Verify PTT key is configured
- Ensure cursor is in a text field
- Try longer recording (>1 second)
Transcription is wrong:
- Speak more clearly
- Try OpenAI instead of Local
- Check microphone quality
- Reduce background noise
Sound plays but nothing types:
- Click in text field first
- Check "Type characters" option
- Try different application
- Restart engine (right-click tray)
DeskTalk respects these env vars:
OPENAI_API_KEY- Override API keyRUST_LOG=debug- Enable debug logging
%APPDATA%\com.desk-talk.app\config.json
%APPDATA%\com.desk-talk.app\statistics.json
%APPDATA%\com.desk-talk.app\logs\
"Windows protected your PC"
- Click "More info" → "Run anyway"
- Normal for apps without code signing
Installation fails
- Run as Administrator
- Disable antivirus temporarily
- Check disk space (need ~50 MB)
"FFmpeg not found"
# Test if FFmpeg is installed
ffmpeg -version
# If not found, install:
winget install Gyan.FFmpeg
# Restart DeskTalk after installing"License validation failed"
- Check internet connection
- Verify key was copied correctly
- Contact support@desktalk.app
"Device already activated"
- You've activated 3 devices max
- Deactivate on another PC first
- Or contact support for device reset
"Invalid API key"
- Get new key from OpenAI
- Ensure you copied full key (starts with
sk-) - Check OpenAI account has credits
Transcription too slow
- Use
tiny.enorbase.enmodel - Or switch to OpenAI API (faster)
- Close other heavy applications
Poor accuracy
- Use OpenAI instead of Local
- Upgrade to larger local model
- Improve microphone quality
No recording
- Check microphone permissions in Windows
- Select correct audio device in settings
- Test mic in Windows Sound settings
Echo or feedback
- Disable microphone monitoring
- Use headphones
- Reduce speaker volume
- Pro License: $100 (one-time)
- Includes:
- Unlimited transcriptions
- 3 device activations
- Lifetime updates
- Email support
- Whisper API: ~$0.006/minute
- Pay-as-you-go (no subscription)
- Example costs:
- Light user (100 min/month): ~$0.60
- Medium user (500 min/month): ~$3.00
- Heavy user (2000 min/month): ~$12.00
- $0 forever
- No per-minute costs
- No internet required
- Local Model: Audio never leaves your PC
- OpenAI API: Audio sent to OpenAI for processing
- See: https://openai.com/policies/privacy-policy
- OpenAI doesn't train on API data (by default)
- Config: Settings only, no audio
- Statistics: Word counts, times
- License: License key (encrypted in Windows Credential Manager)
- Audio: Temporary files deleted after transcription
- License validation: Keygen.sh (on startup/activation)
- OpenAI API: Only when transcribing (if using cloud)
- Updates: None (manual updates only for now)
- Email: support@desktalk.app
- Billing: billing@desktalk.app
Include:
- Windows version
- DeskTalk version (see About)
- Error message
- Steps to reproduce
Getting Started with DeskTalk: https://youtu.be/SzPE_AE0eEo
How to Get an OpenAI API Key: https://youtu.be/SzPE_AE0eEo
Current Version: 0.2.0
Last Updated: October 1, 2025
What's New:
- Initial public release
- Keygen licensing system
- Dark mode support
- Statistics tracking
- Error sound feedback
Coming Soon:
- Auto-updates
- macOS support
- Bundled FFmpeg
- Code signing
Copyright © 2025 DeskTalk
See LICENSE file for details
Enjoy faster, hands-free typing with DeskTalk! 🎤✨