• Home
  • AI

Leak: ‘GPT-5 exhibits diminishing returns’, Sam Altman: ‘lol’

The last few days have seen two narratives emerge. The first, derived from yesterday’s OpenAI leak in The Information, is that GPT-5/Orion is a disappointment, and less of a leap than GPT-3 to GPT-4 was. The second comes from a series of four clips (shown in this video) of Sam Altman discussing the ‘clear path’ to AGI. Let’s go beyond the headlines (and through papers like FrontierMath) to get closer to the ground truth…

Plus Universal-2, Sora comments, Claude 3.5 Haiku SimpleBench update, and a great new AI video.

Assembly AI Speech to Text:

AI Insiders:

Chapters:
00:00 – Introduction
00:39 – Bear Case, The Information Leak
04:01 – Bull Case, Sam Altman
06:20 – FrontierMath
11:29 – o1 Paradigm
13:11 – Text to Video Greatness and Universal-2

The Information Leak:
Noam Brown Replies:
Sam Altman Y-Combinator Interview:
Altman Reply:

FrontierMath Paper:
FrontierMath Blog Post:
Tao:
MMLU Are We Done (cites me!):
Universal-2:
Noam Brown ‘We don’t know’:
Anthropic Founder Response:
Sora (Runway Comment):
Sora New Vid:
Darri3D Video:

The 8 Most Controversial Terms in AI:

Non-hype Newsletter:

Podcast:

I use Descript to edit my videos:

Many people expense AI Insiders for work. Feel free to use the Template in the 'About Section' of my Patreon.

  • @arrogantprickly says:

    Orion’s performance will be a good test to see if we can trust future Altman hype at all.

    • @calliped-co5mj says:

      I think it will be underwhelming like o1; sure, it may be PhD-level or even superhuman on benchmarks,

      but that doesn’t mean much if they don’t solve hallucinations and long-term coherence. Agents need reliability.

    • @arrogantprickly says:

      @@calliped-co5mj I think o1-preview/mini is pretty great, personally, but it depends on the use case (instruction following and debugging for me). I’m just skeptical of how much smarter a model can be without CoT. I think a Perplexity-like model that references an external source of truth (but with tools like calculators and code execution) could solve or greatly improve hallucinations, but I don’t think that gets solved with LLMs alone.

    • @timwang4659 says:

      @@calliped-co5mj These models hallucinate by design, which is why they can come up with creative ideas. There needs to be some new “grounded” architecture that can verify the outputs of the generative model.

    • @Caldaron says:

      @@calliped-co5mj 110%

    • @arrogantprickly says:

      @@timwang4659 Right. Something like perplexity works very well (external source of truth). Also, the ability to use tools (calculators, code execution, etc.) is essential.

  • @evodevo420 says:

    It’s a blessing to be able to see through all the smoke and constant headlines

    • @maciejbala477 says:

      that’s definitely my favorite aspect of following this channel. It feels like a dude who is very interested and well-versed in the field trying to give a measured overview of the latest advances in it. None of the fake sensationalism. By far the greatest strength that separates it from the lot

    • @lionelmessisburner7393 says:

      @@maciejbala477 But I still always leave optimistic. I guess reality is often NOT disappointing.

  • @AGIzero00 says:

    Not only is it over, we’re also so back

  • @012vinc says:

    To be honest, I find the AI hype talk from Sam Altman quite irritating, and it makes him seem untrustworthy in my opinion. It appears he takes on more of a marketing role than that of a technical guy.

    • @minimal3734 says:

      He isn’t a technical guy, he’s the CEO.

    • @VictorKing144 says:

      He’s not a technical guy; he’s a hype man and a grifter who’s consistently been lying. If we were to believe his words, 2024 should have had way more impressive models than we’ve seen this year. 2024 was supposed to be the year AI took the world by storm. It is now November and I have not seen a storm yet, only a bubble.

    • @BryanBortz says:

      He should go back to wearing two collared shirts at the same time.

    • @teaveins1466 says:

      That’ll be because he does

  • @ryzikx says:

    We’re not gonna get superhuman performance (excluding speed) by training on human data. We need to solve self-play on language; only then will we surpass our limits.

    • @alexandermoody1946 says:

      The library of Babel is by definition incomprehensible.

    • @ossian882 says:

      One more exception: knowledge breadth (already superhuman).

    • @richardfredlund8846 says:

      The models actually need to ‘know’ things to improve. We operate from a level of functional certainty about certain things in the world, which we can then use to reason.

    • @Skoomar-sg7ej says:

      It will be superhuman, since obviously no human has mastered all the skills present in human data. But you are right that AI will be limited in considering possibilities that humans have never considered.

    • @alexandermoody1946 says:

      @@ossian882 The key definitions and value definitions are critical components; without them, the speed at which everything becomes a generative mess is immense.

      Even if, all things considered, humans are subpar at estimation, there are a lot of us, and we have the knowledge and experience to share meaningful insights for which language alone provides no nuanced explanation.

      We have to work together to resolve this and provide the key definitions and meanings for this to work out in the interests of all.

  • @DentoxRaindrops says:

    Thanks for your constant high quality updates, Philip! Always makes my day 🔥

  • @p-j-y-d says:

    4:34 “[In order to build AGI] I think we basically know what to do: just mining all minerals in the solar system and building a Dyson Sphere. It’ll take a while, it’ll be hard, but that’s tremendously exciting!”

    • @monad_tcp says:

      There’s a better way: just use humans and plug their brains directly into the internet, like the Borg do.
      That way you save billions of years of training at planetary scale done by evolution.
      We already have AGI; it’s called humans.
      The problem is that they have to be paid, isn’t it?

    • @leonardosoto4603 says:

      AGI is not that far off; actually, by some definitions o1 is already AGI.

    • @lucasbrant9856 says:

      @@leonardosoto4603 By those definitions, AGI is pretty underwhelming.

      o1 is cool, but it’s not as revolutionary as AGI should be.

    • @Imperial_Squid says:

      Right? It’s like NASA saying “well, we know how to get to the moon” in the 1950s/60s: technically true, but probably covering for an unimaginable amount of hard work that needs to happen to get there.

    • @leonardosoto4603 says:

      @@lucasbrant9856 How do you define AGI?

  • @kailohre9336 says:

    “… O1 – that might come in the next two weeks…” That was said with a mischievous smile 🤣

  • @AllisterVinris says:

    See? That’s why this is the only channel to watch on the topic of AI. Good research, an informed and nuanced stance, clearly conveyed information, and impressive reactivity.
    Welcome, newcomers, to AI Explained; take a seat, we’ll probably be here a while.

  • @Slayer666th says:

    All this news makes it obvious to me that LLMs will probably never become AGI, but LLMs might be part of the complex that allows AGI.
    I still firmly believe that the only way to achieve AGI is a continuous input-output process that interacts with the physics of the real world.
    If that system utilizes an LLM to get all the world’s knowledge, while gaining logic capabilities that result from interacting with the real world, we will get AGI.

    • @danielchoritz1903 says:

      LLMs need a personal mini-model of the world, like every human has, known as beliefs. There is a reason why LLMs now act like genius kids.

    • @trevordohm6762 says:

      No matter what we do to these models, they will not become AGI. We need look no further than the underlying architecture. I could give them all the information in the universe and they would still be performing statistical inference.

    • @squamish4244 says:

      @@trevordohm6762 I don’t know why this is so shocking to some people. The least hyped, and least in need of hype, public AI expert who is also a developer, Demis Hassabis, has always said that LLMs will not lead to AGI, since long before he was widely known. DeepMind has never bet the farm on LLMs.

    • @Xjaychax9 says:

      @trevordohm6762 You are performing statistical inference.

    • @furtherback6131 says:

      THIS MAN SOLVED IT

  • @awsmith1007 says:

    Llama 4 will probably tell us a lot about the future of vanilla LLMs.

  • @youriwatson says:

    You again proved to be the best AI channel. Really well done

  • @rubberducky5990 says:

    Altman is a sales guy who gives the impression of having a PhD without actually having one.

    • @AdmiralValdemar says:

      He needs to pump this stock to get another sports car or house out of it, before the bubble bursts on this dumb tech fad.

    • @EveDe-ug3zv says:

      While that is fair, Noam Brown is not a sales guy, and he seconded Altman’s statements on X yesterday.

    • @unityman3133 says:

      @@AdmiralValdemar Not a tech fad. Transformers have already been implemented into existing products one way or another.

    • @Caldaron says:

      Reminds me of who Elon Musk was 10 years ago…

    • @rubberducky5990 says:

      @@EveDe-ug3zv All these maths evaluations are BS. It is like memorising past interview questions through unethical means and then claiming job proficiency through a fair process; it is not correlated with actual performance. Altman can’t say LLMs have no value, as no business is implementing them outside the toy use cases of summarisation and fragile RAG. So he says we are reaching limits. But to pump the stock, he has to say AGI is near in the next sentence. I would rather wait to see a concrete use case where it really works, instead of gaming the system.

  • @zeol6766 says:

    Interesting that you didn’t bring up the part of that Y Combinator interview where Sam said that AGI was coming next year:
    – Interviewer: “What are you excited about in 2025? What’s to come?”
    – Sam: “AGI. Excited for that.”
    Needless to say, claims like these should be taken with a grain of salt.

    • @aiexplained-official says:

      Think that was misconstrued

    • @zeol6766 says:

      @@aiexplained-official Thanks for responding. I also saw some people make that claim, and since English is my second language, maybe I did misconstrue what he was trying to convey. But isn’t the answer “AGI” a direct response to the question “What’s to come?”, meaning “AGI is what’s to come”? And from the context of the previous line, “… in 2025?”, we assume he’s referring to next year. But maybe he just meant that next year he’s excited for AGI. I guess time will tell; anyway, have a wonderful day.

  • @GeneralKenobi69420 says:

    When Terence Tao says even he can’t solve it, you know it’s the real deal.

  • @GrindThisGame says:

    I think AGI / ASI is going to happen but this also feels like the tail end of an LLM bubble which will crash. There will be new breakthroughs though.

  • @toadlguy says:

    13:29 If Sora doesn’t have enough real-world knowledge to know that flamingo legs cannot pass through one another, it will remain a novelty item, whether released in two weeks or not. All the videos OpenAI have released of Sora are just creepy, and no one but avant-garde artists would consider them actually useful.

    • @midoavdagic9069 says:

      “Everyone but avant-garde artists”

      Great quote.

    • @maciejbala477 says:

      It’s the same issue as always: it can do amazing things… on the surface, and not reliably. That is basically the case for every AI to date. I’d truly take notice if it reliably performed its tasks and could be left to its own devices without needing constant supervision.

  • @stefano94103 says:

    That’s why research papers are so important. Most scientists use data and evaluation, with less hype or fear.

  • @peersvensson9253 says:

    As a physicist, I have to interject and say that the idea of solving physics through brute intelligence is rooted in a misunderstanding of how progress is made in the natural sciences. Physics uses math, but just as math requires axioms, physics needs facts about the real world to constrain the set of all possible theories to a theory of the world we actually inhabit. The problems facing physics today (specifically the subset of physics you read about in popular science) have more to do with a lack of experimental and observational data than with the limitations of our feeble intellects. It is also worth pointing out that the math used in physics tends to lack the kind of rigour seen in mathematics, and is often motivated by intuition or slightly loose arguments. I don’t know whether that will help or hurt the utility of LLMs.

    • @dejanp8558 says:

      That’s why it seems more and more realistic to me that progress from AI in science will probably come (as Dario Amodei said in his blog post about the field of biology) mostly from accelerating discoveries related to measurement tools or techniques.

    • @bahroum69 says:

      Hence the need for world models that can run millions of simulations to experiment orders of magnitude faster.

    • @technologist6102 says:

      I think somewhat like you do. I believe that the human brain, or teams of well-trained human brains, are capable of discovering all of science and understanding it deeply, through the invention of ever more advanced technologies. I don’t believe a cognitive capacity greater than that of Homo sapiens is needed to understand what we still don’t know about the universe, and it’s only a matter of time (150–200 years???) before the physics of the world is fully understood. In my view, saying that an artificial superintelligence is needed for this is superfluous, because our species does not lack the cognitive abilities necessary to solve the open problems of physics or other disciplines. Those abilities are certainly lacking in gorillas, chimpanzees, or ancestors of ours like the Neanderthals, but not in Sapiens. That is my view. We hear that machines may surpass us in capability, but in reality we don’t know: our cerebral cortex is the most advanced on planet Earth and is probably closely tied to the cognitive abilities needed to solve the problems listed above; remember that a small increase in the number of neurons in the cortex allowed Sapiens to eliminate the Neanderthals and take over planet Earth; that small increment made all the difference.
      That said, it is theoretically possible to build AlphaGo-style software for mathematics and physics. Just as it is theoretically possible, once the human brain is very well understood, to build software neural networks that reproduce in detail the biological neural networks of the cortex and the other parts of the brain. Finally, one could go further still and build artificial brains: recreating in hardware, with materials different from biological ones, neurons, synapses and the rest, identically and in the same numbers as in the human brain. Who knows what would come out of the last two experiments I have listed here!!

  • @lucaveneri313 says:

    Funny that researchers still think that training data is “all we need”, when a standard university education was enough to bring out all the maths/engineering/physics geniuses in history. The basic knowledge bricks are already there in LLMs; it is the path to reasoning that is lacking…

    • @drakey6617 says:

      People don’t understand this. Humans have so, so much less knowledge, yet are better at research. Imagine a human with the knowledge of ChatGPT.

      That is why I also dislike that basketball comment about benchmarks. What is the point of current benchmarks if the models know the solutions to basically all the problems humans have ever solved, without needing to think about them?

      These tests only make sense for humans, because we assume the students have not seen the answers before.

    • @bahroum69 says:

      Thank you for explaining this so clearly. It has been my opinion since 2022.
