
Qwen3-Omni-30B-A3B

Alibaba, Language modeling/generation, Question answering, Visual question answering, Image captioning, Video description, Speech recognition (ASR), Speech synthesis, Speech-to-text, Text-to-speech (TTS), Open weights

Developed by Alibaba in 2025, Qwen3-Omni-30B-A3B is a language modeling/generation model with 35.3 billion parameters and openly available weights.

About Qwen3-Omni-30B-A3B

We present Qwen3-Omni, a single multimodal model that, for the first time, maintains state-of-the-art performance across text, image, audio, and video without any degradation relative to single-modal counterparts. Qwen3-Omni matches the performance o

Details

Provider
Alibaba
Task
Language modeling/generation, Question answering, Visual question answering, Image captioning, Video description, Speech recognition (ASR), Speech synthesis, Speech-to-text, Text-to-speech (TTS)
Parameters
35.3 billion (35,300,000,000)
Released
2025-09-22
Open weights
Yes
