Andrej Karpathy

Career

Timeline from Toronto to OpenAI and Tesla

A chronological view of roles, research, and milestones — aligned with the biography on karpathy.ai.

2024 –

Educational videos on AI

I create educational videos on AI on my YouTube channel. The videos come in two parallel tracks: a technical track and a general audience track.

Technical track: Follow the Zero to Hero playlist.
General audience track:
Deep Dive into LLMs like ChatGPT covers under-the-hood fundamentals of LLMs.
How I use LLMs is a practical guide with examples from my own life.
Intro to Large Language Models is an earlier, parallel introduction.

For all the latest, I spend most of my time on X/Twitter or GitHub.

2023 – 2024

OpenAI

I came back to OpenAI where I built a new team working on midtraining and synthetic data generation.

2017 – 2022

Director of AI at Tesla

I was the Director of AI at Tesla, where I led the computer vision team of Tesla Autopilot and (very briefly) Tesla Optimus. My team handled in-house data labeling, neural network training, and deployment on Tesla's custom inference chip. See Tesla AI Day 2021 for more.

2015 – 2017

OpenAI founding member

I was a research scientist and a founding member at OpenAI.

2011 – 2015

PhD at Stanford

My PhD focused on convolutional/recurrent neural networks in computer vision, NLP, and their intersection. Adviser: Fei-Fei Li. I designed and was the primary instructor for CS 231n: Convolutional Neural Networks for Visual Recognition, a class that grew from 150 students in 2015 to 750 in 2017. Internships at Google Brain (2011), Google Research (2013), and DeepMind (2015).

2009 – 2011

MSc, University of British Columbia

Worked with Michiel van de Panne on learning controllers for physically-simulated figures — machine learning for agile robotics in simulation.

2005 – 2009

BSc, University of Toronto

Double major in computer science and physics with a minor in math. This is where I first got into deep learning, attending Geoff Hinton's class and reading groups.

Teaching

Courses, lectures, and learning resources

From the first Stanford deep learning class to today's YouTube lectures on LLMs and neural networks from scratch.

01

Neural Networks: Zero to Hero

Technical YouTube playlist building micrograd, makemore, and GPT-style models from scratch. Full series page.

02

Stanford CS231n

Convolutional Neural Networks for Visual Recognition — course notes, syllabus, and 2016 lecture videos.

03

General-audience LLM videos

Deep dives into how ChatGPT-style systems work and practical guides for using LLMs in everyday workflows.

Zero to Hero playlist
CS231n course notes
CS231n syllabus
r/cs231n community

Pet projects

Open-source tools and experiments

Educational and research code that helped many developers understand autograd, RNNs, arXiv discovery, and in-browser deep learning.

micrograd

A tiny scalar-valued autograd engine implementing reverse-mode autodiff over a dynamically built DAG, with a small neural net library and PyTorch-like API.
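
To make the description concrete, here is a minimal sketch of reverse-mode autodiff over a dynamically built DAG of scalars. It is an illustration written for this page, not the code from the micrograd repository: the `Value` class below, its `_parents` and `_backward` attributes, and the choice to support only `+` and `*` are all assumptions made to keep the example short.

```python
class Value:
    """A scalar that remembers how it was computed (illustrative sketch)."""

    def __init__(self, data, parents=()):
        self.data = float(data)
        self.grad = 0.0                    # d(output)/d(self), filled in by backward()
        self._parents = parents            # nodes this value was computed from
        self._backward = lambda: None      # pushes self.grad to the parents

    def __add__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data + other.data, (self, other))

        def _backward():
            # d(a + b)/da = 1 and d(a + b)/db = 1
            self.grad += out.grad
            other.grad += out.grad

        out._backward = _backward
        return out

    def __mul__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data * other.data, (self, other))

        def _backward():
            # d(a * b)/da = b and d(a * b)/db = a
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad

        out._backward = _backward
        return out

    def backward(self):
        # Topologically sort the graph so each node is processed
        # only after every node that depends on it.
        order, seen = [], set()

        def visit(node):
            if node not in seen:
                seen.add(node)
                for parent in node._parents:
                    visit(parent)
                order.append(node)

        visit(self)
        self.grad = 1.0
        for node in reversed(order):
            node._backward()


x = Value(2.0)
y = Value(3.0)
z = x * y + x      # the graph is built as the expression is evaluated
z.backward()       # fills x.grad and y.grad with dz/dx and dz/dy
```

Gradients are accumulated with `+=` because `x` feeds into `z` along two paths, and the contributions from both paths have to be summed.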

char-rnn

Torch character-level language model built from LSTMs/GRUs/RNNs — paired with the famous blog post on RNN effectiveness.

arxiv-sanity-lite

Discover relevant arXiv papers, search by similarity, and get recommendations — a from-scratch rewrite of the original arxiv-sanity project.

neuraltalk2

Early image captioning in (lua)Torch, extended later with Justin Johnson to dense captioning.

ConvNetJS

Deep learning in JavaScript — train CNNs entirely in the browser with many interactive demos.

ulogme

Privacy-first productivity tracker — an alternative to cloud-based time tracking for developers.

Sometimes jokingly called the reference human for ImageNet after competing against an early ConvNet on 1,000-way classification. See the blog post and Wired article.

FAQ

Common questions about Andrej Karpathy

Quick answers on background, teaching, and where to follow new work.

Who is Andrej Karpathy?

Andrej Karpathy is an AI researcher and educator. He was a founding member of OpenAI and later Director of AI at Tesla, where he led the computer vision team for Autopilot. At Stanford he designed and taught CS231n, one of the university's most popular deep learning courses.

What is he working on now?

He creates educational videos on AI on YouTube, including the Zero to Hero technical series and general-audience explainers on large language models. He posts updates most often on X/Twitter and GitHub.

Where should I start learning from his content?

Beginners interested in how LLMs work can start with the general-audience videos on YouTube, such as Intro to Large Language Models and Deep Dive into LLMs like ChatGPT. Developers building intuition from scratch should follow the Zero to Hero playlist and the micrograd repository.

What is Software 2.0?

Software 2.0 is Karpathy's framing in which neural networks become the primary program: optimization finds the weights instead of humans hand-writing all the logic. The essay remains influential in ML engineering culture.
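
A toy sketch of the contrast may help. It is not taken from the essay: the function names `rule_v1` and `train`, the tiny dataset, and the use of a two-parameter linear model in place of a neural network are all assumptions chosen to keep the example small. The first function is logic a human wrote; the second recovers equivalent behavior by optimizing weights against examples.

```python
def rule_v1(x):
    # "Software 1.0": a human writes the logic explicitly.
    return 2.0 * x + 1.0


# Example input/output pairs stand in for a dataset of desired behavior.
xs = [-1.0, -0.5, 0.0, 0.5, 1.0]
ys = [rule_v1(x) for x in xs]


def train(xs, ys, lr=0.1, steps=500):
    # "Software 2.0": gradient descent on mean squared error
    # searches for weights that reproduce the examples.
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        grad_w = sum(2 * (w * x + b - y) * x for x, y in zip(xs, ys)) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in zip(xs, ys)) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


w, b = train(xs, ys)


def rule_v2(x):
    # The "program" is now the learned weights, not hand-written logic.
    return w * x + b
```

After training, `w` and `b` end up very close to the hand-written constants 2 and 1, so `rule_v2` behaves like `rule_v1` even though nobody typed those numbers into it.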

What did he do at Tesla?

As Director of AI, he led Autopilot vision: in-house labeling, training, and deployment on Tesla's custom inference hardware, with the long-term goal of scalable full self-driving.

What defines Karpathy's work in AI

Research at scale

Teaching & clarity

Build in public