Karpathy: AI capability perception has a major gap; the free tier and the cutting-edge agent are “completely different products”

ChainNewsAbmedia

Former Tesla AI Chief Architect and OpenAI founding member Andrej Karpathy published a long post on X on April 9, pointing out that the public’s understanding of AI capabilities is becoming severely split. He believes that people using the free version of ChatGPT and technical professionals using cutting-edge agent tools like Codex and Claude Code every day are actually discussing “completely different products,” yet both sides think they’re seeing the full picture of AI.

Two worlds, two types of AI understanding

Karpathy currently divides AI users into two groups.

The first group tried the free version of ChatGPT at some point last year, and formed their overall impression of AI from that. What they see are various failures of the model—hallucinations, absurd search results, and even simple questions like whether the voice mode should “drive or walk to get a car wash.” Karpathy admits these problems do exist, but emphasizes that the free version and outdated models can’t represent the real capabilities of cutting-edge agent models before 2026.

The second group satisfies two conditions at the same time: they pay to use the latest cutting-edge agent models (such as OpenAI Codex or Claude Code), and they use them professionally in technical fields like software development, mathematics, and research. Karpathy says this group is experiencing a high level of “AI psychosis,” because the recent progress of these models in technical areas can only be described as astonishing—you can literally watch them solve in an hour programming architecture problems that previously would have taken days or even weeks.

Why progress is concentrated in technical fields

Karpathy explains why improvements in AI capabilities are particularly noticeable in technical fields like software development, but less so in general uses such as search, writing, and making recommendations.

There are two reasons: first, technical fields provide a verifiable reward function (for example, whether unit tests pass), which makes reinforcement learning training work effectively; by contrast, it’s hard to determine objectively how good writing quality is. Second, technical fields have greater commercial value in B2B scenarios, so AI companies put the largest share of team resources into these directions.

The two groups can’t understand what the other is saying

Karpathy concludes that these two groups are “talking past each other.” OpenAI’s free voice mode botches everyday problems, while OpenAI’s top-tier paid Codex can restructure an entire codebase or discover system vulnerabilities within an hour—both of these things are simultaneously true.

In a follow-up reply, he added that someone offered him an observation: the OpenClaw incident drew so much social attention precisely because it introduced a large number of non-technical people to the latest agent models for the first time, and these people previously only knew that AI equals ChatGPT’s web version.

This article by Karpathy: AI capability recognition shows a severe gap; the free version and the cutting-edge Agent are “completely different products.” First appeared on Chain News ABMedia.

Disclaimer: The information on this page may come from third parties and does not represent the views or opinions of Gate. The content displayed on this page is for reference only and does not constitute any financial, investment, or legal advice. Gate does not guarantee the accuracy or completeness of the information and shall not be liable for any losses arising from the use of this information. Virtual asset investments carry high risks and are subject to significant price volatility. You may lose all of your invested principal. Please fully understand the relevant risks and make prudent decisions based on your own financial situation and risk tolerance. For details, please refer to Disclaimer.

Related Articles

Web3 AI Infrastructure AIW3 Raises $2M in Seed Funding Led by Buffalo Capital

Gate News message, April 24 — Web3 AI infrastructure platform AIW3 announced the completion of a $2 million seed round funding. The round was led by Buffalo Capital, with GalaXin Capital and Three-stones Ventures participating as co-investors. AIW3 is transitioning toward an Agent-as-a-Service

GateNews49m ago

The UAE government announced the rollout of AI agents, aiming to complete automation of half of its business operations by 2028 at the latest

The UAE announced that within two years, 50% of federal government departments, services, and operations will be run by autonomous AI agents, making it a global first. AI will become a government implementation partner, helping with decision-making, improving services, and self-optimizing. All civil servants will be required to undergo training, minister performance and the effectiveness of AI adoption will be linked, and the initiative will be driven by a dedicated task force supervised by the president. This move stems from more than a decade of policy accumulation and an AI strategy, with a core focus on people.

ChainNewsAbmedia2h ago

OristaPay Launches AI-Powered Payment System on Telegram, Enables Instant USDT Settlements on TON

Gate News message, April 24 — OristaPay, a brand operating under RD Technologies, announced a complete payment pathway enabling AI agents to execute transactions within the Telegram ecosystem during the Hong Kong Web3 Festival. The system allows users to trigger digital asset transactions through na

GateNews4h ago

Jeff Bezos' Project Prometheus Raises $10B at $38B Valuation

Gate News message, April 24 — Project Prometheus, an AI lab founded by Amazon founder Jeff Bezos and former Google executive Vik Bajaj, has closed a $10 billion funding round at a $38 billion valuation. JPMorgan Chase and BlackRock are

GateNews9h ago

OpenAI Launches GPT-5.5, Designed for Agent Tasks and Complex Workflows

Gate News message, April 24 — OpenAI has officially released GPT-5.5, a next-generation AI model designed to handle complex objectives, tool integration, self-verification, and multi-step task completion. The model excels at code writing and debugging, online research, data analysis, document

GateNews9h ago

AI Agent Startup Band Raises $17M Seed Round Led by Sierra Ventures, Hetz Ventures, Team8

Gate News message, April 24 — Band, a startup building a communication and collaboration platform for AI agents, has closed a $17 million seed round led by Sierra Ventures, Hetz Ventures, and Team8. Founded in mid-2025 by CEO Arick Goomanovsky and CTO Vlad Luzin, the company develops software for re

GateNews10h ago
Comment
0/400
No comments