k9cli/video-vec2wav2-tokenizer
General NLPEnglish
The k9cli/video-vec2wav2-tokenizer dataset is a English General NLP resource from k9cli at 2026. With 137.9K downloads and 1 likes, it is actively used by the community.
About k9cli/video-vec2wav2-tokenizer
video-vec2wav2-tokenizer
Production-ready pipeline (Python package video_vec2wav2_tokenizer, CLI command
video2dataset) that turns a folder of videos into clean AI training datasets
for speech recognition (ASR) and text-to-speech (TTS).
videos ...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- k9cli
- Year
- 2026
- Downloads
- 137946
- Likes
- 1