$125B for Superintelligence? 3 Models Coming, Sutskever’s Secret SSI, & Data Centers (in space)…
Ilya Sutskever’s ‘straight shot to superintelligence’ is already valued at $5B, and now $125B data centers are in the works. Yes, plural. Will this be the ultimate gambit on the scaling hypothesis?
Weights and Biases’ Weave: wandb.me/ai_explained
And yes, the title is not clickbait: a company is pledging to build data centers in space, though that follows failed attempts in the sea. Plus: distributed training, Gemini 2, Grok-3, Colossus, CharacterAI, Orion, and … chapters.
AI Insiders:
Chapters:
00:00 – Intro
01:06 – SSI, Safe Superintelligence (Sutskever)
03:45 – Grok-3 (Colossus) + Altman Concerned
05:36 – CharacterAI + Foundation Models
06:26 – $125B Supercomputers + 5-10GW
08:28 – ‘GPT-6’ Scale
09:07 – Zuckerberg on Exponentials and Doubt
09:42 – Strawberry/Orion + Connections + Weights
11:39 – Data Centers in Space (and the sea)
12:45 – Distributed Training + SemiAnalysis Report w/ Gemini 2
17:34 – Climate Change Pledges?
Safe Superintelligence (Sutskever SSI):
$125B Data Centers:
Altman ‘Too Aggressive’:
OpenAI Orion:
Semianalysis Report:
xAI Colossus:
Altman Reacts:
GPT-6 Co-location:
Data Centers in Space:
And Underwater:
Zuckerberg on Power and Exponentials:
Epoch AI Report:
Original SuperAlignment Deadline:
Character AI Bought:
My New Coursera Course! The 8 Most Controversial Terms in AI:
Non-hype Newsletter:
GenAI Hourly Consulting:
cant wait for ASDI (artificial super duper intelligence)
lmao
I’m personally a proponent of AWMI instead (artificial “woah, mama!” intelligence)
I don’t think so, hahaha. The terms ASI and AGI are coming to denote something, like GPU and CPU did. They’re certainly not very lucid terms xD
all the while we’ve had ASD for a long time
comments like this make me smile
I feel the AGI.
Me too Ilya, me too
I would ask for permission before feeling the ASI, this ain’t a petting zoo
😂
Take that fiber optic cable out of your mouth!
You’ll be working for an AGI in 2 years.
they found a “this one weird trick can make you a billionaire” video
Be one of the top experts in a rapidly growing and capital-intensive field? Easy
@@GolerGkA Yeah, I could do that, but I’m already making good money with dropshipping, so whatever
These valuations are totally nuts. I also wonder how many % Ilya has. It has to be in the very low single digits. Maybe even less than 1%
Assuming a pre-money valuation of $5bn, the investors got 16.6% ($1bn of the $6bn post-money). With 3 founders, and Ilya being the pulling power, he’s prob got 15%+
@@edoa That would be absolutely nuts. A $750 million net worth on paper out of nowhere? It doesn’t really make sense to me. Is he really that much better than anyone else?
@@Wigglylove The Wozniak of AI
@@Wigglylove Yeah. He’s a powerhouse of AI
He was the best student of the Godfather of AI and worth the bet @@Wigglylove
Let’s just hope, if scale really is the key, it allows us to find a way to scale a reasoning model down to manageable levels.
If anything, we need something reasonable to turn to in times like these.
A datacenter in space would appear to have multiple issues, not the least of which is maintenance. Even with the advent of SpaceX, it’s not exactly cheap to send all the parts into orbit. Then, even though they say “passive cooling”, how are you going to reject a significant percentage of a gigawatt’s worth of heat? The ISS already has to use a huge radiator to reject a much smaller amount (maybe 1/1000th?) of heat into space.
It’s a very, very stupid idea and will remain such for a long time.
AGI will solve that, I can feel it
Cooling how? The vacuum of space is the perfect heat insulator.
And they are such a tempting missile target…
Jokes aside, we live in an era where access to low Earth orbit is about to get much cheaper, especially if projects like SpaceX’s Starship start working.
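The heat-rejection objection above is easy to sanity-check with the Stefan-Boltzmann law. A minimal sketch with assumed numbers (0.9 emissivity, panels at 300 K radiating from both faces, solar and albedo heat input ignored), none of which come from the video:

```python
# Back-of-envelope radiator sizing via the Stefan-Boltzmann law.
# Assumed numbers (not from the video): emissivity 0.9, panels at
# 300 K radiating from both faces, solar/albedo heat input ignored.
SIGMA = 5.67e-8  # Stefan-Boltzmann constant, W / (m^2 K^4)

def radiator_area_m2(heat_w, temp_k=300.0, emissivity=0.9, faces=2):
    """Panel area needed to radiate `heat_w` watts to deep space."""
    flux_w_per_m2 = faces * emissivity * SIGMA * temp_k**4
    return heat_w / flux_w_per_m2

# A ~1 GW orbital data center:
area = radiator_area_m2(1e9)
print(f"{area / 1e6:.2f} km^2 of radiator")  # ~1.21 km^2
```

At roughly 827 W per m² of panel, a 1 GW facility would need on the order of a square kilometer of radiator, which is consistent with the point that the ISS already needs a huge radiator array for a far smaller heat load.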
This just makes me appreciate how special our brains are.
It’s kinda crazy how ChatGPT, for example, has vast amounts of knowledge and insane generality compared to your random 90-IQ bloke… but fails at decently easy tasks.
Ironically, this should also make us realize how inefficient our biological intelligence might be.
A truly advanced artificial intelligence should be able to achieve human-level cognition with just a tiny fraction of these resources.
Naaah, look at it, neurons giving themselves a standing ovation. Time for the humble ones and zeros to take over the applause.
I of course agree. But I found it fascinating that in a recent talk, Demis Hassabis of Google DeepMind suggested that the brain is no longer a driver, marker, or map of where to go with AI to make AGI. He just says it’s an engineering problem now and well understood. That was wild to hear, because I specifically remember Demis saying some years ago that studying the brain was the secret to finding out how to make intelligent machines, and I believe he studied that subject deeply himself in his younger years.
Brains kinda brute force the intelligence by truly massive scaling. Their advantage is they’re not static, they’re self modifying structures unlike current static LLMs. But they are slow to learn, slow to communicate with others, have tiny operating memory, short attention span, lossy memory and need to rest often.
Computers can do all of these better once we figure out the correct system architecture. Which I believe should be hybrid just like our brains are composed of parts that have different functionality, specialization. Basically add a knowledge pool, reasoning center, math coprocessor, introspective thoughts, creative subsystem and so on. Then we’ll have truly superior AI.
Distributed training is honestly more difficult than building your own power plant for the data center.
Well, wouldn’t similar problems arise when doing training across multiple machines within the same data center anyway? There would just be added latency.
Surely, an algorithm that could synchronize the many different nodes across networks has already been developed.
Maybe I’m not understanding the crux of the issue; it just sounds like something that’s been solved for years.
Hahaha, right on. I’m sure nuclear will see wide-scale adoption next decade. Has to be..
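To see why cross-site training is more than "just added latency," here is a toy cost model for synchronous gradient exchange between two sites. The model size, link bandwidth, and round-trip time are illustrative assumptions, not figures from the video or the SemiAnalysis report:

```python
# Toy cost model for synchronous gradient exchange between two sites,
# showing why "just added latency" understates the problem: for large
# models the bandwidth term, not the round trip, dominates.
# Model size, link speed, and RTT below are illustrative assumptions.

def allreduce_step_s(param_count, bytes_per_grad, bandwidth_bps, rtt_s):
    """Seconds to ship one full gradient copy between two sites."""
    payload_bits = param_count * bytes_per_grad * 8
    return payload_bits / bandwidth_bps + rtt_s

# Hypothetical 70B-parameter model, fp16 gradients, over a dedicated
# 1 Tbit/s inter-site link with a 30 ms round-trip time:
t = allreduce_step_s(70e9, 2, 1e12, 0.030)
print(f"{t:.2f} s per step just to synchronize")  # ~1.15 s
```

Even on a dedicated terabit link, shipping full fp16 gradients for a 70B-parameter model costs over a second per step, and the transfer term dwarfs the 30 ms RTT; this is why practical distributed-training schemes compress, shard, or only loosely synchronize gradients rather than doing naive synchronous exchange.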
Datacenters in space are stupid; cooling is a giant problem even for normal satellites. The visualization conveniently does not show any radiators, which is funny, as one would need an absolute ton of them. Also, that design is a micrometeorite magnet. And for good latency one would have to be in a low orbit, which entails either orbital decay within at most 20-30 years (or less) for a 400 km (or lower) orbit, or BOTH an exorbitant fuel cost AND a very high risk of failure due to orbital debris.
TL;DR: That is stupid on so many levels that I do not think an orbital engineer saw this before the publishing of the promotional video.
“BUT IT LOOKS SO GOOD AND REAL IN THE PROMO!!!!11!!1111!!!11!!!!!1!”
I think that the people involved in these projects are much, MUCH smarter than you or me, and they wouldn’t have overlooked these things.
Just to note I rely on and appreciate you for an honest perspective on this field.
There are so many channels out there that try to tell everyone about something shocking or stunning happening in AI on an almost daily basis. It is nice to have a channel with information you can trust to be informative, accurate and absent of any hyperbole.
The first thing that an AGI/ASI should focus on is making a more efficient version of itself. The human mind runs on 20W. I see no reason why we shouldn’t be able to get an AGI to run on < 1000 W.
This is the Turing Police. Stay where you are; we are dispatching agents to your place.
That will eventually come. We might need the brute force step to acquire helpful tools which will actually provide a much more intelligently designed model. And then it snowballs towards singularity right there.
yes – either there are much more efficient algorithms, or we need to use organic processors. “we have a brain orbiting the planet” would be awesome
And the AGI/ASI will realise it is wasteful to grow its own organic brain when there are already 8 billion fully grown brains on the planet. 😂 At least we won’t become mere batteries when plugged into the ASI’s Matrix
Natural selection, though a blunt, unintelligent instrument, has had hundreds of millions of years to optimize the brain. It will be hard for us to top that.
Fascinating. The biggest stakes are being played right now, and 99.9% of the population has no idea what’s happening.
This gets a lot of media coverage; I highly doubt 99.9% of the world’s population doesn’t know what’s going on.
@@SergiusXVII what do you think is going on?
@@SergiusXVII I think he means that most people have no idea how this rapid AI evolution can affect us. I think it’s a valid statement; even those of us who understand this area better are struggling to translate it into a real image of the future.
@SergiusXVII Legitimately nobody knows what SSI is at the moment. If anything, it’s more than 99.9%
@@SergiusXVII I don’t think the majority of people believe that SI – hopefully SSI – will ever happen. I think that’s what he meant.
The production quality of these videos is astounding.
Aw thanks man
What’s crazier is that scaling like this almost certainly won’t unlock AGI, and yet this kind of money is being poured into these projects. It says more about human psychology and the desire for ‘AGI’ than anything else.
@@snarkyboojum You don’t know it won’t scale and their are constant advances in architecture anyway. The idea that you know better than the people running all the top tech companies and that the insane amount of money, attention and effort being focused on improving AI won’t pay off and all these companies are just “scaling” and crossing their fingers is absurd.
@@jyjjy7 I happen to work at one of those ‘top tech companies’, and have degrees in physics and computer science, so I have a fairly educated view. Of course these architectures scale, but scaling is extremely unlikely to unlock ‘superintelligence’. If you think otherwise, I’d encourage you to read more.
@@snarkyboojum As long as scaling creates new more powerful architectures and so on…
@@bienspasser9054 No. Scaling might lead to new architectures but not necessarily on the path to super intelligence. Blindly relying on scaling and some magical causal connection to “new architectures” giving you AGI is honestly like just shooting in the dark and hoping to hit something.
Or maybe it shows that some people (like I do) don’t expect more than mediocrity from biological entities. This bothers the humanocentrists.
SSI is so funny. Imagine if Apple in the 80s were like “we’re not releasing a single product until we have created a fully functional mobile device with Internet, video chat, apps, Face ID, etc. etc.”
Lmao, good point.
Lmao that’s because you think it’s the 80s, but Ilya thinks it’s 2012.
I think folks who believe that “more scale is all you need” are in for a “bitter lesson” of their own. Because of the strong faith in the bitter lesson, many ideas get dismissed as incremental work because any benefit would become irrelevant in six months. When that stops being true, I don’t think it will be a bubble popping so much as a renaissance for alternative design concepts. There’s got to be more we can do than just decoder-only models that rely on BPE, right?
Most of the progress this year has been on efficiency.
If getting the data center problem solved is so hard, it could make sense to have one team working on that while another team works on efficiency. Then, when you have the new system up, you can combine the massive gain in compute with the massive gains in efficiency and go even further.
Kind of like how Moore’s law finally petering out let alternatives flourish, like die stacking and specialized processors.
The whole idea of SI being a product is funny
I watched a recent interview with the co-founder of DeepMind, Shane Legg, and he didn’t even mention LLMs as the path to AGI. He said DeepMind was working on other architectures to get them there. He maintains DeepMind’s 2030 timeline.
He also pointed out that the famous alien-seeming “Move 37” in AlphaGo’s game against Lee Sedol was not made by an LLM.
Why wouldn’t we watch all the way to the end? No one else comes close to what you do: fact-checked, objective, intelligent, diligent analysis, hitting the very most salient points.
0:55 “If they’re wrong, this [the $125 billion spent on data centers] could all be viewed as the biggest waste of resources in human history.”
Gee, I dunno—the $3 trillion spent on the US war in Iraq (as estimated by the Harvard Kennedy School) seems like it’s larger, if we assume that money spent has _some_ relationship to resources used. (The $125 billion still _could be_ a big waste of resources, though.)
Good call
My wife and I are currently working on an AGI, should be ready and fully trained in 18-25 yrs. Current budgeting indicates it’ll take far less than 125 billion dollars.
Fr?
Is it really artificial, though? Me and my wife have shipped three GIs and they are of a higher quality than current AGI attempts honestly. Fraction of the budget too.
couple goals
But can these AGIs make me a hip-hop song written and sung by Spongebob Squarepants in less than 2 mins?
@@abdvs325 Well that technology is already here, We can do that now.