Push-to-talk Vosk extension for Pi with live editor transcription
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
Matteo Benedetto 5f950fc59a fix: prevent Enter from sending message during recording, play error cue 2 months ago
assets Add smooth and hacker logo variants 2 months ago
extensions fix: prevent Enter from sending message during recording, play error cue 2 months ago
.gitignore Initial import of Vosk Pi voice extension 2 months ago
README.md Add smooth and hacker logo variants 2 months ago
package.json Initial import of Vosk Pi voice extension 2 months ago

README.md

pi-vosk-voice logo

pi-vosk-voice

Push-to-talk voice input for Pi using Vosk offline speech recognition.

Features

  • Hold-to-talk recording
  • Live transcription while speaking
  • Partial stabilization on pauses
  • Writes into the Pi editor in paste mode
  • Optional send mode
  • Start/stop audio cues via SoX
  • Global or project-local Pi configuration

Requirements

  • sox (rec and play)
  • Python 3
  • vosk Python package
  • A downloaded Vosk model, e.g. vosk-model-it-0.22

Configuration

Put this in ~/.pi/vosk-voice.json or .pi/vosk-voice.json:

{
  "shortcut": "f12",
  "enabled": true,
  "mode": "paste",
  "modelPath": "/home/you/.cache/vosk/vosk-model-it-0.22",
  "soundsEnabled": true
}

Install in Pi

pi install /path/to/pi-vosk-voice

Then reload Pi:

/reload

Commands

  • /voice
  • /voice config
  • /voice stop
  • /voice download-model
  • /voice set shortcut ctrl+t
  • /voice set mode paste
  • /voice set sounds false

Notes

The extension uses Vosk streaming partial/final results and updates the editor progressively.

Logo Variants

Smooth / vector-like

Smooth logo

Terminal / hacker style

Hacker logo