Show full content
Insights from Redpoint’s “Unsupervised Learning” Podcast
Few figures in AI are as quietly pivotal as Bob McGrew. After a decade at Palantir, he joined OpenAI, where he served as Chief Research Officer until late 2023, helping shape GPT-3, GPT-4, and what he calls the “o1” model. In this conversation, he discusses everything from AI model limits and form factors to enterprise adoption, robotics, and the slow path toward AGI.
1. The Big Debate: Have AI Models Hit a Wall?
A recurring theme is whether large language models (LLMs) like GPT-4 have maxed out their capabilities.
Inside vs. Outside PerspectivesBob: “From the outside, it feels like everything’s accelerating. From the inside, it looks different.”
- Outside: AI seems to have exploded overnight with ChatGPT in 2022, then GPT-4 in 2023.
- Inside: Each step (GPT-3 to GPT-4) required a 100x compute scale-up, heavy R&D, and massive data centers—costly, slow, and complex to orchestrate.
Bob notes that moving beyond GPT-4 will require even larger data centers and new algorithmic approaches, but don’t expect any immediate “GPT-5” label. Labs can rebrand future models as they see fit.
2. Beyond Pre-Training: Reinforcement Learning & Test-Time Compute
A major new direction for improving LLMs—without always requiring bigger data centers—is using reinforcement learning (RL) at inference time. Bob points to OpenAI’s “o1” model (internally a successor to GPT-4) that harnesses RL to generate longer, more coherent reasoning.
Extended Reasoning at InferenceBob: “o1 is effectively a new generation. It’s a 100x compute increase at test-time, though we didn’t call it GPT-5.”
- Instead of a quick one-second response, the model can “think” for 30–60 seconds, amplifying its ability to handle complex prompts.
- This approach sidesteps building another gargantuan data center. You leverage the same GPU cluster but let the model “run longer.”
Bob believes 2025 will see more emphasis on test-time compute for deeper chains of thought—rather than solely relying on bigger pre-trained models.
3. AI in 2025: Slowing Progress or Shifting Focus?
Bob predicts AI’s trajectory will continue but shift away from “release a bigger model every few months” to focusing on:
- New form factors beyond simple chat interfaces.
- Reliability and enterprise trust.
- Deeper integration into real-world workflows (robotics, enterprise apps, specialized tools).
Bob: “Reliability becomes a huge deal once a system is acting in the world on your behalf.”
4. New Form Factors: Where Do We Go After Chatbots?
Chatbots shine for quick Q&A, brainstorming, or coding, but they’re limiting for:
- Long-term tasks (e.g., project management).
- Enterprise workflows where an AI needs to act autonomously.
Bob envisions agents that:
- Connect to APIs and tools.
- Maintain a long chain of thought over hours or days.
- Run autonomously, refining work step by step.
Bob’s example: Programming is a great fit for an agent that can iterate on code over minutes or hours. A policy analyst might do likewise for complex briefs.
5. Reliability & Enterprise Integration Why Reliability Is Critical
In consumer contexts, an AI glitch is often a small annoyance. But in enterprise settings, errors can lead to:
- Financial losses
- Regulatory issues
- Security breaches
- Damaged relationships
Improving reliability by each additional “9” (e.g., 99.9% up-time vs. 99%) is an order-of-magnitude engineering leap.
Next Big Challenge: Integration- Enterprises store info in Slack, Docs, or specialized tools.
- AI must be secure and handle complex permissioning.
- Bob sees many startups building connectors so LLMs can access data safely—akin to Palantir’s early mission of unifying enterprise data.
6. Going “Hands-On”: Agents and Computer-Use
Anthropic’s “Computer-Use” feature (AI controlling a virtual mouse/keyboard) intrigues Bob. Instead of building specialized APIs for each app, the AI uses the universal interface of a screen and cursor.
Bob: “It’s one big hammer that can do anything, but it’s expensive—models have to parse enormous UI steps.”
He predicts that technology demos of AI “controlling” a computer today will be 10x better in a year, and in two years, potentially mainstream—though cost and reliability will remain hurdles.
7. Multimodal, Video, and Creativity: The Sora Launch
Bob discusses Sora, a video-focused model that merges with OpenAI’s ecosystem. Video is notably expensive and complex (30+ frames per second), demanding huge resources to train and run.
Bob: “Sora is widely available through ChatGPT Plus. It’s a milestone since generating or editing video was beyond typical LLM capabilities.”
He expects a full-length AI-generated movie within two years, albeit more of a “co-production” between a director and AI rather than a fully autonomous creation.
8. Robotics: Five Years Away (Again) Why It’s So Challenging
Bob joined OpenAI initially to focus on robotics. Progress has been slower than anticipated—hardware constraints, tricky real-world physics, and bridging simulation with messy real environments (like cloth or cardboard) present huge obstacles.
Where We StandBob: “Simulators excel at rigid objects, but real warehouses have floppy materials. That’s tough.”
- Vision and RL have improved, enabling more advanced warehouse robots.
- Consumer-level home robots remain far off due to unstructured environments and safety concerns.
- Within five years, Bob expects more advanced robots in retail or manufacturing—though the “home butler robot” may take longer.
9. The Complexity of Automating Jobs
Despite GPT-4 and beyond surpassing many human benchmarks, jobs aren’t vanishing overnight. A job comprises many tasks, and AI typically handles only some.
Boring Problems, Big OpportunitiesBob: “Tasks are fractal. Even if you automate a core part, there’s still a leftover ‘human’ component that’s tough to remove.”
Bob sees massive potential in automating “boring” tasks—procurement, compliance, data validation—giving companies “infinite patient workers” that handle grunt work. It’s less flashy but extremely impactful.
10. The Impact on Productivity—Especially Consultants
Consultants benefit greatly from AI, since much of their job is synthesizing large volumes of information—exactly what LLMs excel at. Bob notes bottom-half performers see the biggest productivity boost, narrowing skill gaps.
Still, there’s always a more nuanced layer demanding human strategic decision-making, ensuring that AI doesn’t completely replace the consultant’s role.
11. The Traits of Top AI Researchers
Bob has worked closely with world-class researchers (Ilya Sutskever, Alec Radford, etc.). A defining trait is grit—they persist through countless failed experiments until something finally works.
Bob: “Aditya Ramesh spent 18 months trying to generate ‘a pink panda skating on ice.’ At first, it was all blurs, but eventually, it clicked.”
12. The Evolution of OpenAI’s Mission
OpenAI has “refounded” itself multiple times:
- Nonprofit to Capped For-Profit: Massive capital was essential.
- Microsoft Partnership: Controversial but crucial for compute resources.
- API & Product Focus: Moving from pure research to revenue.
- Consumer & Enterprise: ChatGPT and robust enterprise integrations.
These shifts happened rapidly, redefining OpenAI’s culture and strategy each time.
13. The Challenge of Scaling: On the Road to AGI
Bob is cautious about the term AGI, viewing it as a continuum, not a sudden arrival. Each new generation demands huge system engineering feats—in hardware, data, optimization, and distribution.
Bob: “We solved a big chunk of reasoning. Now the challenge is scaling it, which is very hard—but it’s the path forward.”
He imagines a future with self-driving commutes and AI-run offices that still feel mundane, even though they’re powered by astonishing technology.
14. The Future of AI & Human Agency
As intelligence gets cheaper, Bob argues that human agency—the ability to decide what to do—remains our most valuable resource.
Bob: “We might ask an AI to build a ‘cool product,’ but we still have to define what ‘cool’ means. That part is still very human.”
15. AI in Social Sciences and Academia
Originally trained in Game Theory, Bob left academia due to slow research cycles and career incentives he found unaligned. He sees product development as “applied social science” at scale.
One intriguing idea: “fake users” powered by advanced LLMs for product teams to do rapid prototyping or user research—before releasing features to real customers.
16. Reflections and Future Plans Why Bob Left OpenAI
After eight intense years—during which he helped launch GPT-3, GPT-4, and o1—Bob felt he’d accomplished what he wanted. Shipping o1 was a capstone.
Bob: “It’s a hard job. Once I felt done, it was time to hand off.”
He’s now exploring new ventures, particularly in robotics, and taking a break after a long sprint at OpenAI.
Parenting in an AI WorldThough AI can solve math and write code, Bob still teaches his child those fundamentals because they foster thinking skills—essential in any future.
17. Rapid-Fire Questions & Closing Thoughts
- Overhyped: Claims of “radical new AI architectures.” At scale, many flashy ideas don’t pan out.
- Underhyped: The “o1” model—already a generational leap, yet overshadowed by the name.
- Looking Ahead: Bob expects continued progress, shifting focus toward reliability, integration, and scaling rather than purely bigger models.
Bob: “Keep working on it. Progress won’t slow—it’ll just change direction.”
Key Takeaways & Final Reflections
-
Scaling Continues, but Differently
- Huge leaps require big compute, but future releases may appear under new names.
-
Reinforcement Learning at Inference
- “o1” proves we can boost reasoning by letting models “think longer.”
-
Reliability Matters
- Each extra “9” of reliability is a 10x engineering challenge—crucial for enterprise adoption.
-
Video & Robotics
- Sora marks a leap in multimodal AI.
- Robotics is edging forward, especially in structured industrial contexts, but still lags in homes.
-
Human Work Is Still Needed
- The truly “human” part—setting goals, deciding next steps—remains.
-
OpenAI’s Transformations
- From nonprofit to a “platform juggernaut,” pushing the whole field forward.
-
Agency as the Scarce Resource
- Intelligence may be plentiful, but deciding how to use it—that’s still on us.
Further Reading & Listening
- Full Podcast Episode: Ex-OpenAI Chief Research Officer: What Comes Next for AI?
- Bob’s Twitter: @bobmcgrewai
Final Word
Bob McGrew’s path—from early Palantir to OpenAI’s cutting-edge labs—reflects how AI has grown from niche projects to a force reshaping nearly every industry. For all the talk of AI “surpassing humans,” Bob emphasizes that true breakthroughs still require massive engineering, reliability leaps, and thoughtful integration into everyday workflows.
Ultimately, the tech might feel mundane once it’s baked into self-driving cars or AI-run offices, but that ordinariness masks tremendous complexity. Whether you’re a researcher, founder, or enthusiast, Bob’s story underlines how much is yet to be built—and how crucial human agency remains in charting AI’s path forward.
“It’s not slowing; it’s just changing—fast.” – Bob McGrew
Now is the time to experiment, build, and embrace the next wave of powerful, generative, and integrated AI systems. As Bob would say, find that “Hill” you’re willing to climb because the breakthroughs keep on coming.
Thank you for reading. For more in-depth discussions, subscribe to Redpoint’s “Unsupervised Learning” podcast.


















































