
4 Experiments Where the AI Outsmarted Its Creators! 🤖

The paper "The Surprising Creativity of Digital Evolution: A Collection of Anecdotes from the Evolutionary Computation and Artificial Life Research Communities" is available here:

❤️ Support the show on Patreon:

Other video resources:
Evolving AI Lab –
Cooperative footage –

We would like to thank our generous Patreon supporters who make Two Minute Papers possible:
Andrew Melnychuk, Brian Gilman, Christian Ahlin, Christoph Jadanowski, Dennis Abts, Emmanuel, Eric Haddad, Esa Turkulainen, Geronimo Moralez, Lorin Atzberger, Malek Cellier, Marten Rauschenberg, Michael Albrecht, Michael Jensen, Nader Shakerin, Raul Araújo da Silva, Rob Rowe, Robin Graham, Ryan Monsurate, Shawn Azman, Steef, Steve Messina, Sunil Kim, Torsten Reil.

Thumbnail background image credit:
Splash screen/thumbnail design: Felícia Fehér –

Károly Zsolnai-Fehér's links:
Instagram:
Twitter:
Web:

  • @MobyMotion says:

    Very important message at the end there. It’s something that Nick Bostrom calls “perverse instantiation” – and will be crucial to avoid in a future superintelligent agent. For example, we can’t just ask it to maximise happiness in the world, because it might capture everyone and place electrodes into the pleasure centre of our brains, technically increasing happiness vastly

    • @TwoMinutePapers says:

      Agreed. I would go so far as to say there is little reason to think a superintelligence would do anything else than find the simplest loophole to maximize the prescribed objective. Even rudimentary experiments seem to point in this direction. We have to be wary of that.

    • @MobyMotion says:

      Two Minute Papers absolutely. The only difference is that as the AI becomes more powerful, the loopholes become more intricate and difficult to predict

    • @michaelemouse1 says:

      So AI would be like a genie that grants you all your wishes, exactly as you ask, in a way that catastrophically backfires. This should be the premise of a sci-fi comedy already.

    • @RHLW says:

      I pretty much have to disagree. If such a thing can’t “think forward” past such a cheat, evaluate whether it’s good or bad from different angles/metrics, and figure out that the simple solution isn’t always the correct one, then it is not a “super intelligence”… it’s just a dumb robot.

    • @scno0B1 says:

      why would a robot not choose the simplest solution? we can see that a robot does come up with the simplest solutions 😛

  • @jonathanxdoe says:

    Me: “AI! Solve the world hunger problem!”
    Next day, earth population = 0.
    AI: “Problem solved! Press any key to continue.”

    • @MrNight-dg1ug says:

      John Doe lol!

    • @wisgarus says:

      John Doe
      One eternity
      Later

    • @michaelbuckers says:

      You jest, but limiting the population is literally the only way you can ensure that limited supply can be rationed to all people at a given minimum. China and India are neck deep in this, but first world doesn’t have this problem so they think it’s possible to just feed everyone hungry and that would magically not bankrupt everyone else (the hungry are bankrupt to start with).

      The truth is, poor people are poor because that’s what they’re worth in a fair and square free market economy. They have no skills and qualities to be rich, they don’t get rich through marketable merit and even if they become rich by chance, soon enough they lose all money and go back to being poor. Inequality is a direct consequence of people not being identical. Having the same reward for working twice as hard doesn’t sound appealing to me, much less living in a totalitarian society that forbids stepping out the line for half an inch in order to ensure equality.

    • @filippovannella4957 says:

      you definitely made my day! xD

    • @SomeshSamadder says:

      hence Thanos 😂

  • @DarcyWhyte says:

    Robots don’t “think” outside the box. They don’t know there is a box.

    • @davidwuhrer6704 says:

      That is the secret.
      The researchers who formulated the problem thought there was a box.
      They expected the AI to think inside it.
      But the AI never knew about the box.
      There was no box.
      And the AI solved the problem as stated outside it.

    • @DarcyWhyte says:

      That’s right there’s no box. 🙂

    • @planetary-rendez-vous says:

      So you mean humans are conditioned to think inside a box

    • @Anon-xd3cf says:

      Darcy Whyte
      No the “robots” don’t know there is a “box” to think outside of…
      AI however are increasingly able to “think” for themselves both in and out of the proverbial *box*

    • @milanstevic8424 says:

      the error is simply in trying to describe a very simple “box” while not being able to reconstruct what’s actually described. people do this all the time, and this is why good teachers are hard to find.

      the box that the AI couldn’t circumvent was the general canvas, or in this case the general physics sandbox with gravity acceleration and a ground constraint. this is the experimental >reality<, along with the clear goals set by the scientists (use muscles and joints to move from A to B, minimize leg contact with the ground). it is the scientists' inability to investigate the problem space and imagine potential solutions that led them to this issue. sometimes this is near impossible, as someone compared it with the halting problem, but more often than not it's an issue of not being particularly imaginative. the skill of understanding the problem space is incredibly important in the fields of programming and game design; in other words, in creating very complex but fully interactive state machines, whose design tends to be impossible to grasp with limited human cognition and therefore has to be explored strategically or systemically. in the case of this particular AI problem, this should really be called "the jinn problem", or 'be careful what you wish for', similar to how Mulder (from The X-Files), in the episode "Je Souhaite" ( https://en.wikipedia.org/wiki/Je_Souhaite ), wished for world peace only to be presented with a world without humans. the wish is obviously fulfilled from the standpoint of a local minimum, which is exactly what the AI does, but it can be argued that any intelligent agent would do this. when a human does this in a loosely defined competitive domain, we call it cheating and/or exploiting.

      for example, slavery is a “perfect” solution for a cheap workforce, if only you’d expand your “box” and think outside of any ethical values.

      hence the rise of laws* in “civilized” societies: basically immature and intrinsically imbalanced rules that try to punish unwanted cheats and exploits in the system, to minimize exploitative behavior and its repercussions.

      thank god we don’t make games like that, as we typically root out the causes and/or explore better systemic solutions to imbalances.

      * obviously this does not include laws that govern rules made for political and regulatory reasons, as these are implemented to achieve something else.

  • @davidwuhrer6704 says:

    This reminds me of the old story of the computer that was asked to design a ship that would cross the English Channel in as short a time as possible.
    It designed a bridge.

    • @HolbrookStark says:

      Tbh a bridge made of a super long boat floating in the middle of the English Channel tip to tip with the land masses would be the most lit bridge on earth 🔥

    • @clokky1672 says:

      This really made me chuckle.

    • @thehiddenninja3428 says:

      Well, there was no size restriction.
      It was tasked to have the lowest time between the back end touching point A and the front end touching point B.
      Obviously the lowest time is 0; where it’s already touching both points

    • @AverageBrethren says:

      @@HolbrookStark that’s a lot of material. It’s a pipe dream.

    • @HolbrookStark says:

      @@AverageBrethren there was a time people would have said the same about ever building a bridge across the English Channel at all. Really, using a floating structure might use a lot less material and be a lot cheaper than the other options for how to do it

  • @JoshuaBarretto says:

    This reminds me of a project I worked on 2 years ago. I evolved a neural control system for a 2D physical object made of limbs and muscles. I gave it the task of walking as far as possible to the right in 30 seconds. I expected the system to get *really* good at running.

    Result? The system found a bug in my physics simulation that allowed it to accelerate to incredible speeds by oscillating a particular limb at a high frequency.

    • @milanstevic8424 says:

      we’d do it too if only there was such a glitch in the system.
      actually we exploit the nature for any such glitch we can find.
      thankfully the universe is a bit more robust than our software, and energy conservation laws are impossibly hard to circumvent.

    • @ThatSkyAmber says:

      give its joints a speed limit more on par with a human’s..? or anyway, below the critical value needed for the exploit.

    • @Moreoverover says:

      Reminds me of what video game speedrunners do, finding glitches is goal number uno.

    • @jetison333 says:

      @@milanstevic8424 honestly I don’t think it would be too far off to call computers and other advanced technology exploits. I mean, we tricked a rock into thinking.

    • @milanstevic8424 says:

      @@jetison333 I agree, even though rocks do not think (yet).

      But what is a human if not just a thinking emulsion of oil (hydrocarbons) and water? Who are we to exploit anything that wasn’t already made with such a capacity? We are merely discovering that rocks aren’t what we thought they were.

      Given additional rules and configurations, everything appears to be capable of supernatural performance, where supernatural = anything that exceeds our prior expectations of nature.

      “Any sufficiently advanced technology is indistinguishable from magic”

      Which is exactly the point at which we begin to categorize it as extraordinary, instead of supernatural, until it one day just becomes ordinary…

      It’s completely inverse, as it’s a process of discovery, thus we’re only getting smarter and more cognizant of our surroundings. But for some reason, we really like to believe we’re becoming gods, as if we’re somehow leaving the rules behind. We’re hacking, we’re incredible… We’re not, we’re just not appreciating the rules for what they truly are.

      In my opinion, there is much more to learn if we are ever to become humble masters.
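    The evolved-walker experiment described at the top of this thread can be sketched in a few lines. Everything below is hypothetical, not the commenter’s actual code: `simulate` stands in for a physics engine with a deliberate energy-injecting bug above an oscillation threshold, and the “genome” is reduced to a single limb frequency.

    ```python
    import random

    BUG_THRESHOLD = 50.0  # Hz; above this, the fake integrator injects energy

    def simulate(freq, seconds=30):
        """Distance walked in `seconds` by a walker oscillating a limb at `freq` Hz.
        Legitimate gait: diminishing returns past ~5 Hz.
        Bug: above BUG_THRESHOLD, distance explodes with frequency."""
        if freq <= BUG_THRESHOLD:
            return seconds * (min(freq, 5.0) * 0.5 + freq * 0.01)
        return seconds * freq * 10.0

    def evolve(generations=100, pop_size=20, seed=0):
        rng = random.Random(seed)
        pop = [rng.uniform(0.1, 10.0) for _ in range(pop_size)]
        for _ in range(generations):
            pop.sort(key=simulate, reverse=True)
            elites = pop[: pop_size // 4]            # keep the best quarter
            pop = [max(0.1, e + rng.gauss(0, 2.0))   # mutate each elite 4 ways
                   for e in elites for _ in range(4)]
        return max(pop, key=simulate)

    best = evolve()
    # The search climbs the weak "honest" gradient until it crosses the bug
    # threshold, then converges on frequencies the designer never intended.
    ```

    The point of the sketch: nothing in the fitness function says “walk properly”, so the glitch is just a steeper hill to climb.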

  • @harry356 says:

    We had a bunch of Aibo robots play hide and seek to train an AI. They stopped hiding quickly; we thought something was wrong, that we had made an error in our programming. It took us a while to find out that they had learned to stay at the starting point so they were immediately free when the countdown stopped. They found a loophole in the rules. Incredible fun.

  • @laurenceperkins7468 says:

    Reminds me of one of the early AI experiments using genetic algorithm adjusted neural networks. They ran it for a while and there was a clear winner that could solve all the different problems they were throwing at it. It wasn’t the fastest solver for any of the cases, but it was second-fastest for all or nearly all of them.

    So they focused their studies on that one and turned the other lines off. At which point the one they were studying ceased being able to solve any of the problems at all. So they ripped it apart to see what made it tick, and it turned out it had stumbled upon a flaw in their operating system that let it monitor what the other AIs were doing; whenever it saw one report an answer, it would steal the data and use it.

  • @Zorn101 says:

    AI is like a 4-year-old sorting butterfly pictures.
    If I just tear up and eat the pictures, the sorting is done!

  • @Moonz97 says:

    This is so hilarious. I remember programming a vehicle that was tasked with avoiding obstacles. It had controls over the steering wheel only, and it was always moving forward. To my surprise, the bot maximized its wall avoidance time by going in circles. I find that so funny lol.
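    The circling strategy above falls out of even a toy simulation. A minimal sketch with made-up numbers (arena size, speeds, and the `time_until_wall` helper are all invented for illustration): a forward-only vehicle whose only control is a fixed steering rate.

    ```python
    import math

    ARENA = 100.0  # hypothetical square arena with walls at 0 and 100

    def time_until_wall(steering, steps=10000, dt=0.1, speed=1.0):
        """Steps survived by a forward-only vehicle with a fixed steering rate
        (rad/s), starting at the arena's center facing along the x axis."""
        x, y, heading = ARENA / 2, ARENA / 2, 0.0
        for step in range(steps):
            heading += steering * dt
            x += speed * math.cos(heading) * dt
            y += speed * math.sin(heading) * dt
            if not (0.0 < x < ARENA and 0.0 < y < ARENA):
                return step  # touched a wall
        return steps  # survived the whole episode

    # Driving straight runs out of arena; a constant turn of 0.5 rad/s traces
    # a circle of radius speed/steering = 2 units and never touches a wall.
    ```

    With only a steering wheel and “maximize time before hitting a wall”, the tightest safe circle really is the optimal policy, not a bug.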

  • @NortheastGamer says:

    “The AI found a bug in the physics engine” So basically it did science.

  • @amyshaw893 says:

    Reminds me of something I saw where some people were training an AI to play Qbert, and at one point it found a secret bonus stage that nobody had ever found before

  • @teddywoodburn1295 says:

    I heard about an AI that was trained to play Tetris; the only instruction it was given was to avoid dying. Eventually the AI just learned to pause the game, therefore avoiding dying.

    • @zserf says:

      Source: https://www.youtube.com/watch?v=xOCurBYI_gY
      Tetris is at 15:15, but the rest of the video is interesting as well.

    • @theshermantanker7043 says:

      That’s what i used to do XD

      But it got boring after a while

    • @teddywoodburn1295 says:

      @DarkGrisen that’s true, but the person creating the program basically told the ai that it was about not dying, rather than getting a high score

    • @Ebani says:

      @DarkGrisen There’s no difference then. By not dying it will eventually get an infinite score, so a high score by itself is meaningless; not dying turns out to be the best predictor of a high score.
      He could’ve easily just removed the pause function too, but it’s funny to see the results he got

    • @teddywoodburn1295 says:

      @DarkGrisen exactly, I think the lesson in that is that you have to think about what you’re actually telling the ai to do
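    The Tetris anecdote in this thread boils down to a reward that pays for survival plus an action set that includes PAUSE. A minimal sketch, with made-up numbers (assume the stack hypothetically tops out after 10 frames of actual play):

    ```python
    def survival_reward(actions, horizon=100):
        """Frames survived under a Tetris-like rule: the stack (hypothetically)
        tops out after 10 frames of actual play, but PAUSE freezes the game."""
        frames_alive = 0
        frames_played = 0
        for action in actions[:horizon]:
            frames_alive += 1
            if action != "PAUSE":
                frames_played += 1
                if frames_played >= 10:  # game over: stack reaches the top
                    break
        return frames_alive

    # Any agent maximizing frames_alive will hold PAUSE forever rather than play.
    ```

    Removing the pause action, or rewarding cleared lines instead of raw survival, closes the loophole; that is exactly the “think about what you’re actually telling the AI to do” lesson above.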

  • @theshermantanker7043 says:

    In other words the AI has learnt the ways of video game speedrunners

  • @alansmithee419 says:

    The idea of thinking outside the box is limited to humans. The box is something our minds put in place; it is a result of how our brains work. The AI doesn’t have a box, meaning it can find the best solution, but also meaning there are many, many more things it could try and needs to slog through.
    We need that box, otherwise we’d be so flooded with ideas that our brains wouldn’t be able to sift through them all.
    Our limitations allow us to function, but the way computers work means such a box would be detrimental to them.
    – sincerely, not a scientist.

    • @EGarrett01 says:

      A “box” is simply a method that appears to be the first step towards generating the best result. But it can be a problem because there are often methods that don’t immediately seem to lead to the right direction but which ultimately produce a better result, like a walking physics sim spinning its arm in place super-fast until it takes off like a helicopter and can travel faster than someone walking.

      If an AI is working through successive generations, it will have periods or groups of results that follow a certain path that produces better things short-term; this is the same as people “thinking in the box.” But if it is allowed to try other things that are inefficient at first and follow them multiple steps down the line, it ends up being able to think outside the box.

    • @alansmithee419 says:

      @@EGarrett01 as far as I understand it, the box is the range of human intuition, and thinking outside of it is essentially going against the common way of human thinking. The ai doesn’t have intuition, nothing limiting its ideas or method of thought, therefore it has no box.
      Though honestly the proverbial box has never really had a definition, and its meaning could be interpreted any number of ways. I suppose both of our definitions are equally valid.

    • @xvxee7561 says:

      You have this hella backwards

    • @honkhonk8009 says:

      No, it’s because we have past experiences influencing decisions in the form of common sense.

    • @duc2133 says:

      @@alansmithee419 Y’all are trying to sound too deep. It just means that these experiments didn’t set enough constraints to be practical. A robot flipping onto its side wouldn’t be practical, nor would the numerous other jokes in this thread; pushing the earth far away from the sun to “solve global warming” doesn’t make sense because it’s fucking stupid. The experimenter needed to set certain limitations for the computer to come up with a sensible solution. These robots aren’t lacking “intuition”, it’s just a bad computer that needs to be programmed better.

  • @anthonyhadsell2673 says:

    Human: Reduce injured car crash victims
    Ai: Destroys all cars
    Human: Reduce injured car crash victims without destroying cars
    Ai: Disables airbag function so crashes result in death instead of injury

    • @pmangano says:

      Human: Teaches AI that death is result of injury
      AI: Throw every car with passengers in a lake, no crash means no crash victims, car is intact.

    • @decidueyezealot8611 says:

      Humans then drown to death.

    • @Solizeus says:

      Humans: Teach the AI not to damage the car or its passengers.
      AI: Disables the ignition, avoiding any damage.
      Humans: Stop that too.
      AI: Turns on loud bad music and drives in circles to make the passengers want to leave or turn the car off

    • @noddlecake329 says:

      This is basically what they did in WWI: they noticed an increase in head injuries when they introduced bulletproof helmets, and so they made people stop wearing them. The problem was that the helmets were saving lives, leaving only an injury where before there would have been a death.

    • @anthonyhadsell2673 says:

      @@noddlecake329 Survivorship bias. In WW2, when they laid all the bullet holes they found in returning planes over one diagram, they noticed the edges of the wings and a few other areas were hit more often, so they assumed they should reinforce those areas. The issue was that they were only looking at the planes that survived; really, they needed to reinforce the areas that didn’t have bullet holes.

  • @mrflip-flop3198 says:

    “Okay AI, I want you to solve global warming.”
    “Right away, now moving _Earth_ out of the solar system. Caution: you may experience up to 45 Gs.”

  • @renagonpoi5747 says:

    “If there are no numbers, there’s nothing to sort… problem solved.”

    I think a few more iterations and we’ll have robot overlords.

  • @josephoyek6574 says:

    AI: You have three wishes
    Me: *sweats

  • @iLeven713 says:

    It’s funny how these reinforcement learning models kind of act like genies from folklore, with a “be careful what you ask for” twist
