NVIDIA open-sources the 120B Nemotron 3 Super intelligent agent model, with throughput increased by up to 5 times

Gate News, March 12 — NVIDIA announced the open-source large language model Nemotron 3 Super, designed for multi-agent application scenarios. The model has a total of 120 billion parameters, utilizing a hybrid Mamba-Transformer MoE architecture. During inference, only 12 billion parameters are activated per token. Its core technology, “Latent MoE,” compresses token embeddings into a low-rank latent space before routing to expert networks, enabling four experts to be activated simultaneously at the cost of a single expert’s computation. This results in up to a fivefold increase in inference throughput compared to the previous Nemotron Super. The model natively supports a context window of 1 million tokens, suitable for autonomous agents that need to maintain long-term workflow states. In the PinchBench benchmark for evaluating agent workloads, Nemotron 3 Super scored 85.6%, the highest among similar open-source models. NVIDIA also open-sourced over 10 trillion tokens of training data, 15 reinforcement learning training environments, and evaluation schemes, under the NVIDIA Nemotron Open Model License. The model is now available on platforms such as Hugging Face, build.nvidia.com, Perplexity, and OpenRouter, and supports deployment via Google Cloud, Oracle, AWS Bedrock, Azure, and other cloud services. Companies like Perplexity, CodeRabbit, Cadence, Dassault Systèmes, and Siemens have already adopted it.

Disclaimer: The information on this page may come from third parties and does not represent the views or opinions of Gate. The content displayed on this page is for reference only and does not constitute any financial, investment, or legal advice. Gate does not guarantee the accuracy or completeness of the information and shall not be liable for any losses arising from the use of this information. Virtual asset investments carry high risks and are subject to significant price volatility. You may lose all of your invested principal. Please fully understand the relevant risks and make prudent decisions based on your own financial situation and risk tolerance. For details, please refer to Disclaimer.

Related Articles

DeFi hackers stole $600 million in April; Kelp DAO and Drift accounted for 95% of the monthly losses

In April 2026, within just 20 days, cryptocurrency protocols suffered losses of more than $606 million due to hacker attacks, becoming the worst single-month loss record since the February 2025 exchange incident in which $1.4 billion in data was leaked. The two attacks by KelpDAO and Drift Protocol accounted for 95% of April’s losses, and 75% of the total $771.8 million losses as of now in 2026.

MarketWhisper1h ago

Moody's: Stablecoin Market Exceeds $315.8B, but Near-Term Bank Threat Remains Limited

Moody's report shows stablecoins have reached a market value of $315.8 billion, primarily dominated by USDT. While near-term risks to banks are limited due to narrow adoption and regulation, long-term growth may challenge traditional banking.

GateNews2h ago

Crypto VC Funding Standards Rise; 2026-2027 Expected as Strongest Investment Period Since 2018

The crypto venture capital landscape is shifting, with investors now demanding proven user bases and revenue before funding. The appeal of token models is waning, while the AI sector diverts resources. However, improved conditions may enhance investments in stablecoins and financial infrastructure by 2026-2027.

GateNews5h ago

Nomura institutional investor survey: 65% view crypto assets as a key diversification allocation, and nearly 80% plan to enter within three years

According to a survey by Nomura Holdings and Laser Digital, 65% of Japanese institutional investors view crypto assets as an important diversification allocation tool, highlighting their focus on low correlation. Nearly 80% of respondents plan to allocate 2%-5% of assets to crypto over the next three years, and they show strong interest in diversified sub-themes such as staking and lending. Improvements in Japan’s regulatory environment have driven this trend, but barriers still remain, such as taxation and internal risk control.

ChainNewsAbmedia17h ago

AI consumes 80% of global venture capital; Q1 2026 sees a pull of $242 billion: How crypto players should respond to the reallocation of capital

According to reports, in Q1 2026 the total global venture capital funding is close to $300 billion, of which AI-related companies account for about $242 billion alone, or 80% of venture capital. This shows that AI has become the primary focus of venture capital. As funding concentrates on AI, other areas such as crypto have been squeezed; industry players need to adjust their strategies, integrate AI more deeply into their businesses, and expect a trend of infrastructure consolidation.

ChainNewsAbmedia22h ago

Nomura Securities survey: Eight in ten institutional investors plan to allocate 2% to 5% of AUM to crypto assets

A 2026 digital asset institutional investor survey by Nomura Securities (Nomura) and its crypto subsidiary, Laser Digital, shows that nearly four-fifths of surveyed institutional investors plan to allocate 2% to 5% of their total assets under management (AUM) to the crypto market. Most institutions say they plan to do so within the next year rather than investing immediately.

MarketWhisper04-17 03:05
Comment
0/400
No comments