NVIDIA · Single modalityReleased 2026-03-18

nemotron-3-super-120b

nvidia/nemotron-3-super-120b-a12b id

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. It delivers up to 7x higher throughput, providing fast, cost-efficient inference for agentic tasks. Additionally, a long context window gives the model long-term memory, preventing AI agents from losing focus on long, multi-step tasks and ensuring high-accuracy results. Fully open with weights, datasets, and recipes, Super allows easy customization and secure deployment anywhere.

ChatOpen weightsLow cost
Type
Use nemotron-3-super-120b
# Drop-in OpenAI-compatible client
$ import { generateText } from 'ai'
$
$ const { text } = await generateText({
$ model: 'nvidia/nemotron-3-super-120b-a12b',
$ baseURL: 'https://synapse.garden/api/v1',
$ apiKey: process.env.MG_KEY,
$ prompt: 'Why is the sky blue?',
$ })
256K
CONTEXT WINDOW
32K
MAX OUTPUT
$0.165/M
INPUT · PER M
$0.715/M
OUTPUT · PER M
PRICING

List prices, every modality.

RatePer million tokens · USD
Input$0.165/M
Output$0.715/M
Honest list pricesHow we calculate prices
MORE FROM NVIDIA

Other NVIDIA models

See all 4
FAQ · NEMOTRON-3-SUPER-120B

Frequently asked

01 / 04

How do I call nemotron-3-super-120b from my code?

Use the OpenAI or Anthropic SDK and point baseURL at https://synapse.garden/api/v1. Set model: ‘nvidia/nemotron-3-super-120b-a12b and supply your Synapse Garden API key. No code changes beyond the base URL.

02 / 04

How much does nemotron-3-super-120b cost?

Input: $0.165/M per million tokens. Output: $0.715/M per million tokens. The free tier includes a million tokens every month at no cost.

03 / 04

What's the context window for nemotron-3-super-120b?

nemotron-3-super-120b supports a context window of 256K tokens, with a maximum output of 32K tokens.

04 / 04

Do I need a separate Anthropic or OpenAI account?

No. Synapse Garden is the single API surface — one key gives you OpenAI, Anthropic, Google, Meta, Mistral, DeepSeek, xAI, Cohere, and more. Billing, rate limits, and audit logs are unified.

READY

Try nemotron-3-super-120b in three minutes.

Sign up, create a key, drop our base URL into your existing client. The free tier includes a million tokens every month — no credit card.