YOLO Object Detection (TensorFlow tutorial)

You Only Look Once (YOLO): this object detection algorithm is currently the state of the art, outperforming R-CNN and its variants. I'll go over how object detection algorithms have improved over the years, then dive into YOLO theory and a programmatic implementation using TensorFlow!

Joe Lilli

@gabrielvoss6251 says:

November 15, 2017 at 8:32 pm

Yeeeee I waited for so long for yolo

@RiteshKumarMaurya says:

November 16, 2017 at 1:00 pm

The Magic V, do you want to have a tutorial on Google Speech API, i.e., convert your speech into text!
Watch this:
https://youtu.be/jc_-AIYvfKs

Reply

@georgebockari289 says:

November 15, 2017 at 8:41 pm

Bro you might not know this…but you’re pretty good at this Youtube thing lol. Thanks man you’re the best

@xavdel0 says:

November 15, 2017 at 10:17 pm

The secret is use deeplearning to improve the video

Reply
@RiteshKumarMaurya says:

November 16, 2017 at 12:58 pm

Watch me man!
https://youtu.be/jc_-AIYvfKs

Reply
@SirajRaval says:

November 20, 2017 at 8:23 am

Thanks George lots of practice

Reply
@holychipotle says:

February 12, 2018 at 4:25 am

teaching is the best way to learn

Reply

@yet2BnAm3d says:

November 15, 2017 at 8:44 pm

I literally just sat down to do an assignment on this. Siraj, your timing is impeccable

@SirajRaval says:

November 20, 2017 at 8:23 am

thanks!

Reply
@DuhBroadcaster says:

December 26, 2017 at 2:45 am

@Siraj Raval, can you comment or make a video on how YOLO is trained? Are the two parts trained on different networks and then combined? Or are they all trained in one go? More info would be appreciated.

Reply
@sethagastya says:

May 26, 2019 at 3:20 pm

I just liked this comment to bring the total to 69 😀

Reply
@tejaschaudhari3259 says:

November 2, 2019 at 8:18 am

Hfish21, please can you tell me how you did all this work? It's for my project work. I need it at any cost, please.

Reply
@tonystark8493 says:

July 11, 2020 at 5:25 am

Hey my name is naazim I have made this video on detecting actions in basketball match with Yolo, tensorflow etc

Pls check it out if you are interested in this topic

https://youtu.be/0X6yTkXn-qQ

Reply

@JossWhittle says:

November 15, 2017 at 9:51 pm

At 4:10, HOG does actually mean Gradient in the same way as backprop does. An image is just a discrete representation of a continuous 2D signal, the gradient of the continuous signal at a point can be approximated from the discrete representation by taking the finite difference between neighbouring pixels.

@DavidSaintloth says:

November 16, 2017 at 2:14 am

yeah I was surprised that Siraj didn’t know that this was identical to a gradient.

Reply
@mike61890 says:

November 16, 2017 at 2:55 am

I think he meant the gradients don’t have the same function as they do in backprop, i.e. representing an error value

Reply
@MasterNeiXD says:

November 16, 2017 at 12:54 pm

So pretty much like a vector in physics.

Reply
@tioguerra says:

November 16, 2017 at 3:14 pm

Joss Whittle is right, and Siraj's comment startled me as well the first time I watched. The gradient always points in the direction of steepest ascent, that is, toward larger values of the function (possibly a local maximum). The gradient definition used in the context of backprop is no different. Even though in HOG it does not represent an error to be minimized, the property still holds.

Reply
@Vancha112 says:

November 16, 2017 at 11:10 pm

Yes one is gradient as in describing a slope, the other is gradient as in color. I think thats what he means by different 🙂

Reply
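
The finite-difference idea discussed in the thread above can be sketched in a few lines. This is only a minimal illustration, not the video's code and not a full HOG implementation: it assumes a grayscale image stored as a list of rows, uses central differences on interior pixels, and builds a single magnitude-weighted orientation histogram. The function names `image_gradients` and `orientation_histogram` are made up for this example, and real HOG additionally splits the image into cells, interpolates between bins, and normalizes over blocks.

```python
import math

def image_gradients(img):
    """Central-difference gradient (gx, gy) at each interior pixel.

    img is a 2D list of grayscale intensities; border pixels are left at 0.
    """
    h, w = len(img), len(img[0])
    gx = [[0.0] * w for _ in range(h)]
    gy = [[0.0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            # Difference between the right/left and lower/upper neighbours
            gx[y][x] = (img[y][x + 1] - img[y][x - 1]) / 2.0
            gy[y][x] = (img[y + 1][x] - img[y - 1][x]) / 2.0
    return gx, gy

def orientation_histogram(gx, gy, bins=9):
    """Magnitude-weighted histogram of unsigned orientations (0 to 180 degrees)."""
    hist = [0.0] * bins
    bin_width = 180.0 / bins
    for row_x, row_y in zip(gx, gy):
        for dx, dy in zip(row_x, row_y):
            mag = math.hypot(dx, dy)
            if mag == 0.0:
                continue  # flat region: no orientation to record
            angle = math.degrees(math.atan2(dy, dx)) % 180.0
            hist[int(angle // bin_width) % bins] += mag
    return hist

# A horizontal ramp: intensity grows by 10 per pixel from left to right
ramp = [[10 * x for x in range(5)] for _ in range(5)]
gx, gy = image_gradients(ramp)
hist = orientation_histogram(gx, gy)
```

On the ramp image every interior pixel has the same gradient, pointing along the x axis, so all of the histogram weight lands in the first orientation bin. The quantity being computed is the same mathematical gradient used in backprop; what differs is what it is the gradient of (image intensity over position here, loss over weights there).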

@Lunsterful says:

November 15, 2017 at 10:42 pm

Gotta send a link of this to my ex-wife! Maybe she can finally detect that I am a person.

@theAppleWizz says:

November 16, 2017 at 3:03 am

Way too much info, too much, but it's good you're venting.

Reply
@contentity says:

November 16, 2017 at 7:17 am

Never marry a lizard person

Reply
@SirajRaval says:

November 20, 2017 at 8:20 am

haha wow thats real af

Reply
@mulindwajoseph5176 says:

January 5, 2018 at 11:07 pm

#LIZARD PERSON REALLY?/@#

Reply
@bluebear25519 says:

February 15, 2018 at 3:01 am

Lol, i wish in future it can detect and read mind

Reply

@schulca says:

November 16, 2017 at 5:56 am

These videos are great! also a lot easier to focus on when there aren’t memes popping up all the time. I enjoy the lecture style.

@SirajRaval says:

November 20, 2017 at 8:20 am

thanks Carl noted

Reply

@tonycatman says:

November 18, 2017 at 1:26 am

10/10 for this. I’d never heard of YOLO, and this is a really great introduction.

@MrZouzan says:

November 18, 2017 at 5:20 pm

I was looking for this just a few days ago and was a great coincidence that you decided to upload this video , thanks!!

@DannyJulian77 says:

November 28, 2017 at 11:59 pm

Siraj! Thank you so much! When you explain step by step like this I can understand everything! Love this video!

@RatherBeCancelledThanHandled says:

November 30, 2017 at 2:54 am

I thank God, that I started studying programming/math, so much fun and so fascinating to be able to take part in such cool technological advancements.

@jazzpote4316 says:

December 4, 2017 at 11:43 pm

Your videos are so amazing. You cover all the fields of CS practically, with a state of the art approach.
So helpful, keep it up

@myperspective5091 says:

December 19, 2017 at 8:37 am

I saw YOLO about a year or two ago, and it seems to have gotten even better since then. Good to see them still improving their product.

@Loopyengineeringco says:

January 5, 2018 at 11:17 am

TBH, I only clicked this because it said YOLO. Now my brain is exploding.
But joking aside, you’re a great explainer and this is all starting to make sense. Thanks for the video!

@Lavimoe says:

January 16, 2018 at 5:08 pm

The whole video is very thorough and comprehensive, which makes such an intimidating subject a no-brainer for beginners. Not sure how I will use YOLO in my future projects, but I really learned a lot from this video!

@CAGonRiv says:

December 14, 2023 at 8:19 am

Its been five years. How about now?

Reply

@oliviersaint-jean6330 says:

March 19, 2018 at 7:48 pm

For videos, I think the algorithms should take the time dimension into account (i.e., increasing the probability that an object detected in one frame is there again in the next frame) to decrease computation cost.
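
One way the idea in this comment might look in code is sketched below. It is an assumption-laden illustration, not something from the video or from YOLO itself: detections are taken to be `(box, label, score)` tuples with boxes as `(x1, y1, x2, y2)`, and the names `iou` and `boost_with_previous` as well as the `iou_thresh` and `weight` parameters are invented for this example. The sketch only covers the "increase the probability" part; it does not by itself reduce computation.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def boost_with_previous(prev, curr, iou_thresh=0.5, weight=0.5):
    """Raise the score of current detections that overlap a same-class
    detection from the previous frame (hypothetical helper)."""
    out = []
    for box, label, score in curr:
        best = 0.0
        for pbox, plabel, pscore in prev:
            if plabel == label and iou(box, pbox) >= iou_thresh:
                best = max(best, pscore)
        # Noisy-OR style combination: never lowers the score, never exceeds 1
        boosted = 1.0 - (1.0 - score) * (1.0 - weight * best)
        out.append((box, label, boosted))
    return out

# Previous frame: one confident person. Current frame: a weaker person
# detection in nearly the same place, plus an unrelated dog detection.
prev_frame = [((0, 0, 10, 10), "person", 0.8)]
curr_frame = [((1, 1, 11, 11), "person", 0.5), ((50, 50, 60, 60), "dog", 0.5)]
smoothed = boost_with_previous(prev_frame, curr_frame)
```

Here the person detection overlaps its predecessor and gets a higher score, while the dog detection has no match and keeps its original score. Saving computation, as the comment suggests, would need something more, such as running the detector only on some frames and tracking boxes in between.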

@med12med says:

July 15, 2018 at 12:57 pm

Man! You are amazing. your kind of presentation makes me stay completely focused!

@yannickmolinghen3425 says:

August 15, 2018 at 11:00 am

Thanks for your work. It is the first time I have found proper and clear explanations of how to interpret the network output!

@josephfoltz2423 says:

November 29, 2018 at 12:57 am

You, sir, are the reason my company is headed into software development, coding, and programming. This video is worth more than gold.

@jbuist says:

January 13, 2019 at 9:14 pm

That was an excellent description of a topic that has been confusing the heck out of me for many hours. Thank you!

@yashchandraverma3131 says:

March 13, 2019 at 2:51 pm

CNNs work this time because of:
1- Computation
2- Large amounts of images available
