Home/Roadmaps/AI and Data Scientist
Roadmap · Updated May 2026

The AI and Data Scientist trek

Classical ML to generative AI. Statistics, Python, deep learning, NLP, LLMs, RAG, computer vision, and production ML systems. The unified path for the modern AI/DS role.

Stages
13
Estimated time
9 months
Level
Beginner → Advanced
Maintained by
3 practitioners
01
Stage 01

Python & math foundations

The language and mathematical toolkit that everything else builds on.

PythonMathBeginner
02
Stage 02

Data wrangling & EDA

Cleaning, exploring, and understanding data before modeling. The work that determines whether a model is useful.

EDApandasVisualization
03
Stage 03

Classical machine learning

The algorithms that work well on tabular data — and the intuitions that make neural networks make sense.

scikit-learnMLXGBoost
04
Stage 04

Deep learning with PyTorch

Neural networks from autograd up: build CNNs and RNNs before touching pretrained models.

PyTorchDeep LearningNeural Networks
05
Stage 05

NLP & transformers

Text processing, embeddings, the transformer architecture, and fine-tuning pretrained models for real tasks.

NLPTransformersHuggingFace
06
Stage 06

Computer vision

Image classification, object detection, segmentation, and the vision models used in production.

Computer VisionPyTorchYOLO
07
Stage 07

Large language models

How modern LLMs work, what RLHF does, inference parameters, cost modeling, and prompt engineering at depth.

LLMsPromptingRLHF
08
Stage 08

RAG systems

Retrieval-augmented generation end-to-end: embeddings, vector stores, chunking, reranking, and evaluation.

RAGEmbeddingsVector DB
09
Stage 09

MLOps & model deployment

Taking models from notebooks to production: versioning, serving, monitoring, and retraining pipelines.

MLOpsMLflowModel Serving
10
Stage 10

Experimentation & causal inference

Designing experiments that actually change decisions and going beyond correlation to understand what causes what.

ExperimentationCausal InferenceA/B Testing
11
Stage 11

AI agents & tool use

Building agentic systems that can plan, use tools, and complete multi-step tasks reliably.

AgentsTool UseLLMs
12
Stage 12

Research skills & paper reading

Reading papers efficiently, implementing ideas from scratch, and staying current in a field that moves weekly.

ResearchPaper ReadingAdvanced
13
Stage 13

Capstone — end-to-end AI system

Research → prototype → production → evaluation. Build something real that combines classical ML, deep learning, and LLM capabilities.

CapstoneAdvancedPortfolio

Trek complete. What's next?

You've walked the full roadmap. Now ship the capstone, write about it, and share the path with the next engineer who needs it.

Read the blogExplore more roadmaps