Qwen3-Omni-30B-A3B
AlibabaLanguage modeling/generationQuestion answeringVisual question answeringImage captioningVideo descriptionSpeech recognition (ASR)Speech synthesisSpeech-to-textText-to-speech (TTS)Open weights
Developed by Alibaba in 2025, Qwen3-Omni-30B-A3B is a language modeling/generation model with 35300000000.0 parameters with openly available weights.
About Qwen3-Omni-30B-A3B
We present Qwen3-Omni, a single multimodal model that, for the first time, maintains state-of-the-art performance across text, image, audio, and video without any degradation relative to single-modal counterparts. Qwen3-Omni matches the performance o
Details
- Provider
- Alibaba
- Task
- Language modeling/generation,Question answering,Visual question answering,Image captioning,Video description,Speech recognition (ASR),Speech synthesis,Speech-to-text,Text-to-speech (TTS)
- Parameters
- 35300000000.0
- Released
- 2025-09-22
- Open weights
- Yes