NVIDIA Cosmos – A Video AI…For Free!
❤️ Check out Lambda here and sign up for their GPU Cloud:
Cosmos platform:
Hugging Face models:
More:
📝 The paper "Cosmos World Foundation Model Platform for Physical AI" is available here:
📝 My paper on simulations that look almost like reality is available for free here:
Or this is the orig. Nature Physics link with clickable citations:
🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:
Alex Balfanz, Alex Haro, B Shang, Benji Rabhan, Gaston Ingaramo, Gordon Child, John Le, Juan Benet, Kyle Davis, Loyal Alchemist, Lukas Biewald, Martin, Michael Albrecht, Michael Tedder, Owen Skarpness, Richard Sundvall, Taras Bobrovytsky, Thomas Krcmar, Tybie Fitzhugh, Ueli Gallizzi.
If you wish to appear here or pick up other perks, click here:
My research:
X/Twitter:
Thumbnail design: Felícia Zsolnai-Fehér –
#nvidia
Hold your papers!
🙌📜
More artificial intelligence news videos:
It’s time for AI to hold my papers for me
The truck transporting traffic lights was the first time I really thought about people trying to mess up autonomous vehicles. What if someone has a vinyl sticker of a pedestrian on the back of their car? What if someone throws confetti out their window? There have to be a bunch of scenarios that not every manufacturer will have thought of.
I predict that’s what the anti-AI terrorists have in store for us in the future. They already tried to crash computers of people using image generative AI.
We need to prepare to strike first, when the time comes.
That’s why we need lidar/radar and not only cameras (looking at you Elon…)
We just need to make them smarter. It’s easy for a human to see it’s an obvious trick, so it should be doable with cameras alone.
There is a video of a dude wearing a shirt with a stop sign on it, and it stopped some Teslas.
@AndyMcBlane What you call “just” is the hard part, which some of us see as impossible (as of today) for certain scenarios.
It feels like we’re looking into the imagination of a robot to understand how it perceives the world. Incredible time to be alive.
Hey there Károly! Great videos, as usual! But… could you please give us more old-school type videos about simulations and light transport stuff? I’m kind of getting bored with all the AI stuff, and I think a lot of people here share this opinion. I understand it’s the craze right now, but there is other interesting stuff out there that isn’t AI. Thank you for reading, and I hope you have a great day!
Seems to me like he may just be really fascinated by AI; perhaps his interests changed and this is the topic he’d like to make videos about now?
I think with the recent advancements in AI, fewer papers about those topics are coming out or catching attention, and I think even simulations and light transport are utilizing more and more AI as well.
You two both bring up valid points, and I am in no position to tell him what kind of content he should be creating.
And no doubt, AI can be fascinating and is an interesting and extensive topic. With all the new research, the amount of AI papers just trumps all the other things, and improvements in the field are truly astonishing.
Not to discredit the researchers; they are doing fantastic work. Nor do I want to discredit Károly’s work, as his videos are always full of great information, presented fantastically.
But I can’t help but shrug when I see a semi-realistic, temporally unstable picture, video, or piece of text that is, pretty much, a weighted average of reality.
Maybe it’s just AI fatigue, maybe it’s just me. Just thought of voicing my opinion and what I’ve seen in the comments here and there.
But it’s just that, an opinion. Whatever video Károly produces, I know it will be a good one that explains the topic well.
You are all too kind, thank you! Every now and then I try one of these simulation videos – they still exist! However, unfortunately very few of you Fellow Scholars are clicking on them, so YouTube does not recommend them to others. It has gotten to the point where I am not sure if we can keep on doing them. It breaks my heart, and I really hope some kind of solution will present itself over time!
@TwoMinutePapers Thank you for the explanatory response, Károly!
I have suspected something like this could be going on behind the scenes, but was wondering about other reasons too.
I guess the algorithm does what the algorithm does and that’s truly the simplest and most obvious explanation. It’s completely understandable given how much of a hot topic AI/ML has become recently.
However, your videos keep me entertained, no matter the content, and it’s always pleasant to see a new upload notification pop up!
Have a great day, and thank you once again!
AI plays the first traffic light rhythm game! Love it! 1:47
Filmmaking has changed forever. Creativity is now unlocked.
But we lock our creativity away more and more behind terms like “cultural appropriation”, “stereotype”, etc.
Super cool, and it’s the same approach that Comma AI is taking for their self-driving car / robotics agents.
The weights are really on HF!!! NVIDIA actually did something! 😆 Hunyuan looks better at a glance, but I’m excited to compare them. Cosmos is supposed to have better physics, so if nothing else, we should be able to do image -> Cosmos (physics) -> Hunyuan (vid2vid) for detail.
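For anyone who wants to try that comparison, here is a minimal sketch of pulling the Cosmos weights down with the huggingface_hub library. The repo id below is an assumption for illustration; check NVIDIA’s Hugging Face page for the exact model names, and accept the license on the model card first.

```python
# Minimal sketch, assuming the repo id below exists and your Hugging Face
# account has accepted the license on the model card. Check NVIDIA's page
# on Hugging Face for the exact model names before running this.
from huggingface_hub import snapshot_download

repo_id = "nvidia/Cosmos-1.0-Diffusion-7B-Text2World"  # example / assumed repo id

local_path = snapshot_download(
    repo_id=repo_id,
    local_dir="./cosmos-7b-text2world",  # where the checkpoint files land on disk
)
print(f"Weights downloaded to: {local_path}")
```

From there, follow whatever inference instructions the model card gives; the comparison with Hunyuan is then just a matter of running the same prompt or image through both.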
I have worked on self-driving-related projects in automotive. I remarked that we should improve the algorithms to recognize “fake” or “unreal” signs and traffic lights, and my proposal was rejected as unrealistic.
I ROFL-ed at the streaming traffic lights 😀 and that’s exactly what I was worried about.
Diving into the Cosmos platform. Physical AI just got real. Thanks for the share.
Another way to look at this is to look at where we were two papers before this. This would’ve been unthinkable to attempt on consumer hardware.
What we need for perfect video AI is for the objects in the scene to be tracked as sub-images or even 3D models instead of just as pixels. The first option prevents objects from disappearing or merging together for no reason; the second option also prevents them from shapeshifting for no reason. (A rough sketch of what I mean is below this thread.)
That’s literally what this is
@MrTmansmooth No it isn’t? I might’ve missed it, but nowhere in the video did it say it does that.
@ It literally is. The keynote is like 3 hours long, did you watch it?
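To make the object-level idea from the comment above concrete, here is a toy sketch (Python 3.10+). All names and fields are made up for illustration; this is not how Cosmos actually represents scenes.

```python
# Toy sketch: represent a scene as a list of tracked objects (stable id, class
# label, 3D pose, optional mesh) instead of raw pixels, so an object cannot
# vanish or merge between frames without an explicit event. All names and
# fields here are illustrative, not taken from the Cosmos paper.
from dataclasses import dataclass, field


@dataclass
class TrackedObject:
    object_id: int                        # stable identity across frames
    label: str                            # e.g. "car", "traffic light"
    position: tuple[float, float, float]  # world-space position
    rotation: tuple[float, float, float]  # orientation as Euler angles (radians)
    mesh_id: str | None = None            # optional link to a 3D asset


@dataclass
class SceneFrame:
    timestamp: float
    objects: list[TrackedObject] = field(default_factory=list)

    def vanished_since(self, previous: "SceneFrame") -> list[int]:
        """IDs present in the previous frame but missing now: the kind of
        consistency check that would catch objects disappearing for no reason."""
        current_ids = {obj.object_id for obj in self.objects}
        return [obj.object_id for obj in previous.objects
                if obj.object_id not in current_ids]
```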
I’m looking forward to the day when an AI is released that can correct spoken text to make it sound like complete, well-formed sentences instead of just a series of words strung together.
“You have to wait for 5 minutes”
With my hardware, I waited longer for a 640×480 photo. 🙂
This is the better AI; it doesn’t steal independent artists’ work.
But they must still make laws about them for the future.
AI-generated narration on this video?
If AI-created data is being used to train other AI, what are the measures in place to prevent errors in AI-created data from being passed down and tainting the other AI that is being trained?
The NVIDIA keynote explained it better, but essentially there is a physics simulation engine under all this which grounds the model.
people who fight for open sourcing things have my eternal respect
You can “run it at home for free”, except you need their closed-source hardware to run it at home.
this AI narrator is so “Wow” and “bang!”
What a time to be alive!