SkillJavaScriptv1.0.1

venice-transcribe

Transcribe audio to text using Venice AI's Whisper-based speech recognition.

0 downloads

sabrinaaquino

Updated Feb 20, 2026

Venice Transcribe

Transcribe audio files to text using Venice AI's speech recognition (Whisper-based).

API Base URL: https://api.venice.ai/api/v1 Documentation: docs.venice.ai

Setup

Get your API key from venice.ai → Settings → API Keys
Set the environment variable:

export VENICE_API_KEY="your_api_key_here"

Transcribe Audio

Convert audio files to text.

uv run {baseDir}/scripts/transcribe.py --file recording.mp3

Options:

--file (required): Audio file path
--output: Save transcription to file (default: prints to stdout)
--model: ASR model (default: openai/whisper-large-v3)
--format: Output format: json or text (default: json)
--timestamps: Include word/segment timestamps
--language: Language hint (ISO 639-1 code, e.g., en, es, fr)

Supported audio formats:

WAV, WAVE
MP3
FLAC
M4A, AAC
MP4 (audio track)

Examples

Basic transcription:

uv run {baseDir}/scripts/transcribe.py --file meeting.mp3

Get just the text (no JSON):

uv run {baseDir}/scripts/transcribe.py --file audio.wav --format text

With timestamps:

uv run {baseDir}/scripts/transcribe.py --file podcast.mp3 --timestamps

Spanish audio with language hint:

uv run {baseDir}/scripts/transcribe.py --file spanish.mp3 --language es

Save to file:

uv run {baseDir}/scripts/transcribe.py --file interview.mp3 --output transcript.json

Output Format

JSON format (default):

{
  "text": "Hello, this is a transcription test.",
  "duration": 3.5
}

JSON with timestamps:

{
  "text": "Hello world",
  "duration": 2.1,
  "timestamps": {
    "word": [
      {"word": "Hello", "start": 0.0, "end": 0.5},
      {"word": "world", "start": 0.6, "end": 1.0}
    ],
    "segment": [
      {"text": "Hello world", "start": 0.0, "end": 1.0}
    ]
  }
}

Text format:

Hello, this is a transcription test.

Runtime Note

This skill uses uv run which automatically installs Python dependencies (httpx) via PEP 723 inline script metadata. No manual Python package installation required - uv handles everything.

API Reference

Endpoint	Description	Method
`/audio/transcriptions`	Transcribe audio to text	POST (multipart)

Full API docs: docs.venice.ai

Free

Installation

Reviews

No reviews yet. Be the first.