The most important thing today is Nvidia's GTC conference, practically an AI version of a brief history of humankind.

robot
Abstract generation in progress

Today’s biggest event is NVIDIA GTC, basically an AI version of A Brief History of Humankind.

Jensen Huang hasn’t even taken the stage yet, but the pre-release information alone is enough to fill a book.

Tonight, I’ve summarized three main highlights. Let’s go, friends, follow me.

  1. AI Computing Power Costs Cut in Half

The previous generation Blackwell was already impressive, right? Soon, the new Vera Rubin chip will go into mass production.

What makes Vera Rubin so powerful? Simply put, two words: cheap.

Running the same AI models, chip count reduced to a quarter, inference computation costs cut by 90%. Ninety percent reduction, friends. AWS, Microsoft, and Google’s top cloud providers are already on board.

  1. The Groq Acquisition from Last Year, Now Delivering Results

Previously, Jensen Huang said at the earnings call that Groq would be integrated into NVIDIA’s ecosystem as an expansion architecture, just like Mellanox was acquired to enhance networking capabilities.

Groq’s LPU (Low Power Unit) and NVIDIA GPUs are housed in the same data center—GPUs handle understanding problems, while LPUs quickly produce answers.

This division of labor between the two chips, working together, directly reduces latency in AI agent scenarios.

AI agents do tasks for people—each task might require dozens of model adjustments, burning inference power each time, and users are waiting. A slower experience could cause a crash.

Inference involves two steps: first understanding your question, then generating the answer word by word.

GPUs excel at the first step, but for the second—producing words quickly and reliably—Groq’s LPU is better.

Is 20 billion dollars expensive?

Think about it—every company in the future will run hundreds of agents, each adjusting models thousands of times a day.

  1. NVIDIA’s OpenClaw Launches as NemoClaw

It’s an open-source platform that companies can deploy to have AI employees handle workflows, data processing, and project management. Rumor has it they’re already talking with Salesforce and Adobe.

What’s interesting is that NemoClaw doesn’t require NVIDIA chips. Think about this logic. Selling chips only earns hardware-level profits; setting the rules allows earning from the entire chain. Jensen Huang has a clear grasp of this.

  1. Jensen Huang Says He Will Showcase “Chips the World Has Never Seen”

Most likely, the next-generation architecture, Feynman, will make its debut, with mass production expected in 2028 using TSMC’s most advanced 1.6nm process.

There’s also a lesser-known rumor I find quite interesting.

NVIDIA is releasing laptop processors—two models, aimed at gaming. This means graphics card makers are now competing for CPU market share.

Tonight, I feel Jensen Huang is destined to become a legendary figure.

View Original
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • Comment
  • Repost
  • Share
Comment
Add a comment
Add a comment
No comments
  • Pin