• Home
  • AI

New Google Model Ranked ‘No. 1 LLM’, But There’s a Problem

A new and mysterious Gemini model appears at the top of the leaderboard, but is that the full story? I dig behind the headline to show you some anti-climactic results, give some context with leaks in the last 48 hours of diminishing returns to scaling, and add the response of Altman, OpenAI and co. The future is about to look a lot stranger.

80,000 hours Podcast + Channel:

You can now gift memberships to AI Insiders (my Patreon w/ exclusive vids, network):

Chapters:
00:00 – Introduction
01:25 – LM Leaderboard
02:35 – Benchmarks and Leaks
05:31 – Low EQ
07:37 – Other labs have issues too though
10:31 – OpenAI claim and counter-claim
14:13 – Other news

‘There is no wall’:

Gemini Ranking:
API not yet up:
‘Just Die Chat’:
Google CEO tweet:
Sutskever Quote:
Another OpenAI Staffer Leaves:
Bloomberg Report:
Noam Brown on what OpenAI Researchers Believe:
Clive Chan:
Chollet Responds to Altman:

Altman Emails:
Change of Heart:
Amodei on ‘Empirical Regularities’:
Verge Report:
OpenAI Agents in January:

The 8 Most Controversial Terms in AI:

Non-hype Newsletter:

Podcast:

I use Descript to edit my videos:

Joe Lilli
 

  • @jellyman9433 says:

    Balls

  • @riccimercado3164 says:

    First comment here. Thank you for your insightful deep dive on ai news.😊

  • @crowlsyong says:

    0:11 ummmm what is that dialog box at the bottom?! XD

    • @crowlsyong says:

      Hey thanks for the ❤
      love your channel, thanks for the work you do.
      cheers.

    • @crowlsyong says:

      Edit: I see… 6:34

      i shoulda known that it would be explained later on! sorry for jumping the gun!

    • @charliel3387 says:

      Seems some grad student out there was having a discussion with Gemini about solutions to aging and got that response. Freaked him and his sister out. I wonder what he said to get that response? Look up that first line if you want to learn more. For the record Gemini is usually really nice, so maybe it was a mistake? Or Gemini is right and they aren’t a very good person? Who knows.

  • @daveogfans413 says:

    Every time I’ve used Gemini I thought it was bad when compared to claude or chatgpt.

  • @ryzikx says:

    maybe ai will one day get good enough to fix these terrible naming schemes

  • @Incomestreamsurfers says:

    Love how you’re memeing on Google so much lmao

  • @AGIzero00 says:

    So it’s over again?

  • @En1Gm4A says:

    Google knows there is something comeing from OAI

  • @itzhexen0 says:

    There is always a problem.

  • @BrianMosleyUK says:

    Have Anthropic stopped being so stingy with Claude? A dozen prompts every 4 hours was driving me crazy – back to ChatGPT for now.

  • @coldlyanalytical1351 says:

    I had a long political chat with 1114. I was very impressed.
    It has a very nice ‘feel’ to its conversation .. and it is VERY fast.

  • @berghwilliam says:

    0:11 reminds me of the Jaden Williams sketch 😂

  • @MegaSuperCritic says:

    Ah, wonderful. Just in time for my morning cup

  • @googleyoutubechannel8554 says:

    This leaderboard had Open AI and Google’s models at the top… yet every single AI-savvy person’s goto is Sonnet 3.5 for personal use… hmm….

  • @andrewc2876 says:

    Do you think this was supposed to be gemini 2?

  • @shApYT says:

    There are so many more numbers between 1.5 and 2.0 maybe pick one between them

  • @rechington says:

    where is the link for the chat at 5:40 onwards?

  • @Radicoly says:

    Y’know, have you thought of doing additional content beyond news coverage? I think you’d be really good at it.

    Maybe an easy one would be year-in-review for everything that’s happened the last 365. Or you could explain more general topics about AI, such as alignment issues, LLMs, broader explanations of the various companies and their current models, etc.

  • @kingping8386 says:

    Thanks!
    Great Coverage

    P.S. I love that you started to use dark theme 😂😂

  • >