Llama-3.1-Nemotron-Ultra-253B-v1
Nebius Token FactoryOpen weights
Llama-3.1-Nemotron-Ultra-253B-v1 by Nebius Token Factory costs $0.60 per 1M input tokens and $1.80 per 1M output tokens, with a 128K-token context window.
Pricing
Input (per 1M tokens)
$0.60
Output (per 1M tokens)
$1.80
Cached input (per 1M)
$0.06
Specifications
- Provider
- Nebius Token Factory
- Context window
- 128K tokens
- Parameters
- —
- Released
- Jan 2025
- Open weights
- Yes
- Frontier model
- No
Benchmarks
- LMArena (Chatbot Arena) Elohuman-preference
- 1319.5
Compare Llama-3.1-Nemotron-Ultra-253B-v1 with…
Llama-3.1-Nemotron-Ultra-253B-v1 vs Qwen3.5-397B-A17B$3.60/1MLlama-3.1-Nemotron-Ultra-253B-v1 vs Kimi-K2.5$2.50/1MLlama-3.1-Nemotron-Ultra-253B-v1 vs DeepSeek V4 Pro$3.50/1MLlama-3.1-Nemotron-Ultra-253B-v1 vs gpt-oss-120b$0.60/1MLlama-3.1-Nemotron-Ultra-253B-v1 vs Nemotron-3-Nano-30B-A3B$0.24/1MLlama-3.1-Nemotron-Ultra-253B-v1 vs GLM-5$3.20/1M
FAQ
Pricing is per 1M tokens (USD); confirm with the provider before production use.