Data Science vs. Statistics vs. AI vs. Machine Learning: A Comparative Overview
Robert Daniels
Data Scientist and Statistician
This presentation is licensed under the Creative Commons Attribution-ShareAlike 4.0 International License (CC BY-SA 4.0).
Please give appropriate credit if you reuse or adapt this work.
Rob Daniels, MS, MPH, CPH
Experience: 13+ years in applied statistics, data science, machine learning
Former roles:
Morris et al. (2024). Current and future applications of artificial intelligence in surgery: implications for clinical practice and research. Frontiers in Surgery, 11. https://doi.org/10.3389/fsurg.2024.1393898.
Simulation of human intelligence in machines that are programmed to think, learn, and make decisions
Artificial narrow intelligence (ANI)
• “One-trick-pony” applications
• E.g., smart speaker, self-driving car, web search
Generative AI
• More general-purpose uses
• E.g., ChatGPT, Claude, Gemini, DeepSeek, DALL-E 2
“You won’t lose your job to AI, but you may lose your job to someone who knows how to use AI”
Agentic AI
• More autonomous decisions/actions
• E.g., OpenAI’s Operator
Artificial general intelligence (AGI)
• Anything a human can do
Ng, A. (2021). What is AI? [Video lecture]. AI for Everyone. Coursera. https://www.coursera.org/learn/ai-for-everyone
“The science of getting computers to learn without being explicitly programmed”
Supervised learning
• Maps input to output: X → Y labels
• Classification, regression, neural networks
Examples:
– X-ray image → tumor/no tumor
– Words → next word (LLM)
Unsupervised learning
• No labels
• Clustering, anomaly detection, data reduction
Examples:
– Customer segmentation
– Fraud detection
Reinforcement learning
• Rewards and punishments guide learning
Examples:
– Self-driving cars
– Robotics
Subset of machine learning using multi-layered neural networks to model complex patterns
Commonly used types of NN:
Recurrent neural networks (RNNs) (natural language processing, speech recognition)
Generative adversarial networks (GANs) (image generation, style transfer)
Convolutional neural networks (CNNs) (image processing, object detection)
Convolutional neural network diagram created by Claude AI (Anthropic, 2025). https://claude.ai.
Data Science
Statistics
Scope
Core principles
Problem-solving approach
(1/2)
Data Science
Statistics
Tools and techniques
Data
Applications
(2/2)
Inference involves drawing conclusions about a population based on a sample of data; it is concerned with understanding relationships, estimating parameters, and testing hypotheses
Prediction refers to using a model to forecast future outcomes based on current or historical data
Harrell, Frank. (2018, April 30). Road map for choosing between statistical modeling and machine learning. Statistical Thinking. https://www.fharrell.com/post/stat-ml/.
ML tends to work poorly when:
Harrell, Frank. (2018, April 30). Road map for choosing between statistical modeling and machine learning. Statistical Thinking. https://www.fharrell.com/post/stat-ml/.
| What is it? | If a human can complete a task in about one second of thought, AI may be able to automate it |
| Why it works | • Pattern recognition: AI learns from data like human intuition • Low complexity: quick decisions are ideal for AI • Automation potential: repetitive, structured tasks fit well |
| Examples | • Object recognition in images • Text prediction • Document classification |
| When the rule breaks down | • Deep reasoning required • Ambiguity and subjectivity |
Retrieval-augmented generation (RAG)
Fine-tuning pre-trained models
© 2025 R Daniels — Licensed under CC BY-SA 4.0