Home/Roadmaps/AI and Data Scientist

Roadmap · Updated May 2026

The AI and Data Scientist trek

Classical ML to generative AI. Statistics, Python, deep learning, NLP, LLMs, RAG, computer vision, and production ML systems. The unified path for the modern AI/DS role.

Stages

Estimated time

9 months

Level

Beginner → Advanced

Maintained by

3 practitioners

Stage 01

Python & math foundations

The language and mathematical toolkit that everything else builds on.

PythonMathBeginner

Stage 02

Data wrangling & EDA

Cleaning, exploring, and understanding data before modeling. The work that determines whether a model is useful.

EDApandasVisualization

Stage 03

Classical machine learning

The algorithms that work well on tabular data — and the intuitions that make neural networks make sense.

scikit-learnMLXGBoost

Stage 04

Deep learning with PyTorch

Neural networks from autograd up: build CNNs and RNNs before touching pretrained models.

PyTorchDeep LearningNeural Networks

Stage 05

NLP & transformers

Text processing, embeddings, the transformer architecture, and fine-tuning pretrained models for real tasks.

NLPTransformersHuggingFace

Stage 06

Computer vision

Image classification, object detection, segmentation, and the vision models used in production.

Computer VisionPyTorchYOLO

Stage 07

Large language models

How modern LLMs work, what RLHF does, inference parameters, cost modeling, and prompt engineering at depth.

LLMsPromptingRLHF

Stage 08

RAG systems

Retrieval-augmented generation end-to-end: embeddings, vector stores, chunking, reranking, and evaluation.

RAGEmbeddingsVector DB

Stage 09

MLOps & model deployment

Taking models from notebooks to production: versioning, serving, monitoring, and retraining pipelines.

MLOpsMLflowModel Serving

Stage 10

Experimentation & causal inference

Designing experiments that actually change decisions and going beyond correlation to understand what causes what.

ExperimentationCausal InferenceA/B Testing

Stage 11

AI agents & tool use

Building agentic systems that can plan, use tools, and complete multi-step tasks reliably.

AgentsTool UseLLMs

Stage 12

Research skills & paper reading

Reading papers efficiently, implementing ideas from scratch, and staying current in a field that moves weekly.

ResearchPaper ReadingAdvanced

Stage 13

Capstone — end-to-end AI system

Research → prototype → production → evaluation. Build something real that combines classical ML, deep learning, and LLM capabilities.

CapstoneAdvancedPortfolio

Trek complete. What's next?

You've walked the full roadmap. Now ship the capstone, write about it, and share the path with the next engineer who needs it.

Read the blog Explore more roadmaps