Study 10-15 hrs/week and complete in 3 months.
From computer graphics to social robotics to autonomous vehicles, computer vision is powering world-changing new technologies. In this program, you’ll write code to perform everything from facial recognition to scene-understanding to object tracking; by the end of this program, you’ll have a broad portfolio of applications that you’ve built!
total
Learn the essentials of computer vision, including image transformation, neural network architectures, and object recognition
*Sign-up for the program by April 24 to save AED 650!
Computer vision is a rapidly growing field that powers a variety of emerging technologies—from facial recognition to augmented reality to self-driving cars. Learn the latest deep learning architectures and image processing techniques today!
We collaborated with industry leaders from NVIDIA to Affectiva to build a program that showcases how computer vision is being applied on the front-lines of technology today.
You’ll learn how to program computer vision techniques in Python, and then use that knowledge to create your own applications! You’ll complete three major computer vision projects, and build a strong portfolio in the process.
Get personalized feedback on your computer vision projects from a team of technical reviewers. The invaluable reviews you receive mirror the experience of working on a team of engineers and mentors, and this feedback offers you unique and actionable insights as to how you should develop code!
Udacity President
Sebastian Thrun is a scientist, educator, inventor, and entrepreneur. Prior to founding Udacity, he launched Google’s self-driving car project.
Curriculum Lead
Cezanne is an expert in computer vision with a Masters in Electrical Engineering from Stanford University. As a former researcher in genomics and biomedical imaging, she’s applied computer vision and deep learning to medical diagnostic applications.
Content Developer
Alexis is an applied mathematician with a Masters in computer science from Brown University and a Masters in applied mathematics from the University of Michigan. She was formerly a National Science Foundation Graduate Research Fellow.
Content Developer
Juan is a computational physicist with a Masters in Astronomy. He is finishing his PhD in Biophysics. He previously worked at NASA developing space instruments and writing software to analyze large amounts of scientific data using machine learning techniques.
Content Developer
Jay has a degree in computer science, loves visualizing machine learning concepts, and is the Investment Principal at STV, a $500 million venture capital fund focused on high-technology startups.
Content Developer
Ortal Arel has a PhD in Computer Engineering, and has been professor and researcher in the field of applied cryptography and embedded platforms. She has worked on design and analysis of intelligent algorithms for high-speed custom digital architectures.
Content Developer
Luis was formerly a Machine Learning Engineer at Google. He holds a PhD in mathematics from the University of Michigan, and a Postdoctoral Fellowship at the University of Quebec at Montreal.
See fewer details
This program requires experience with Python, statistics, machine learning, and deep learning. See detailed requirements.
Master computer vision and image processing essentials. Learn to extract important features from image data, and apply deep learning techniques to classification tasks.
Learn to apply deep learning architectures to computer vision tasks. Discover how to combine CNN and RNN networks to build an automatic image captioning application.
Learn how to locate an object and track it over time. These techniques are used in a variety of moving systems, such as self-driving car navigation and drone flight.
Basic shell scripting:
Basic statistical knowledge, including:
Intermediate differential calculus and linear algebra, including:
We have a number of courses and programs we can recommend that will help prepare you for the program, depending on the areas you need to address. For example: