What This Skill Does
Transcribe audio files to text with optional speaker diarization (identifying who said what) and known-speaker hints for better accuracy.
When to Use It
- Transcribing audio or video recordings to text
- Interview transcription with speaker labels
- Meeting transcription with multiple participants
- Extracting text from any audio source
Key Features
Diarization
Automatically identify and label different speakers in the recording.Known Speakers
Provide speaker hints (names, voice samples) for more accurate identification.Format Support
Works with common audio and video formats.Best Practices
- Provide speaker hints when you know who's talking for better accuracy
- Use high-quality audio when possible — background noise degrades results
- Review diarization boundaries for critical transcripts
- Break very long recordings into segments for better processing