#110 – Jitendra Malik: Computer Vision
from Lex Fridman Podcast
by Lex Fridman
Published: Tue Jul 21 2020
Show Notes
Jitendra Malik is a professor at Berkeley and one of the seminal figures in the field of computer vision, the kind before the deep learning revolution, and the kind after. He has been cited over 180,000 times and has mentored many world-class researchers in computer science.
Support this podcast by supporting our sponsors:
– BetterHelp: http://betterhelp.com/lex
– ExpressVPN: https://www.expressvpn.com/lexpod
If you would like to get more information about this podcast go to https://lexfridman.com/ai or connect with @lexfridman on Twitter, LinkedIn, Facebook, Medium, or YouTube where you can watch the video versions of these conversations. If you enjoy the podcast, please rate it 5 stars on ApplePodcasts, follow on Spotify, or support it on Patreon.
Here’s the outline of the episode. On some podcast players you should be able to click the timestamp to jump to that time.
OUTLINE:
– Introduction
– Computer vision is hard
– Tesla Autopilot
– Human brain vs computers
– The general problem of computer vision
– Images vs video in computer vision
– Benchmarks in computer vision
– Active learning
– From pixels to semantics
– Semantic segmentation
– The three R’s of computer vision
– End-to-end learning in computer vision
– 6 lessons we can learn from children
– Vision and language
– Turing test
– Open problems in computer vision
– AGI
– Pick the right problem