04. Term-by-Term Definitions
Artificial Intelligence (AI):
The field of computer science concerned with building systems that can perform tasks typically requiring human intelligence, including reasoning, learning, perception, and language understanding (Google Cloud, 2024). AI predates machine learning: rule-based expert systems, symbolic logic programs, and search algorithms are all AI. The field began formally at the Dartmouth Summer Research Project in 1956.
Machine Learning (ML):
The subset of AI focused on algorithms that learn patterns from data and generalize those patterns to new inputs, without being explicitly programmed for every case (IBM, 2024). The key difference from classical AI: the logic is not hard-coded; it is inferred from examples. Arthur Samuel coined the term in 1959.
See Machine Learning Basics for how supervised, unsupervised, and reinforcement learning work.
Deep Learning (DL):
A subset of ML that uses neural networks with many hidden layers (IBM, 2024). The "depth" refers to the number of layers between input and output. Deep learning automates feature extraction: the network figures out which aspects of the raw data matter, rather than requiring human engineers to define those features in advance. GPUs made deep learning practical at scale in the early 2010s.
Neural Network:
The computational structure underlying deep learning. Loosely inspired by the brain, it consists of interconnected layers of nodes (neurons), each performing a nonlinear mathematical operation. Connections have numerical weights; those weights are what training adjusts. A shallow neural network has one or two hidden layers. A deep neural network has many. Neural networks are not a separate category from deep learning; they are the mechanism by which deep learning operates.
Large Language Model (LLM):
A deep learning model built on the transformer architecture and trained on large volumes of text data, using next-token prediction as its primary training objective (IBM, 2024). After training, LLMs can understand context, generate fluent text, answer questions, translate, summarize, and write code. Examples include GPT-4, Claude, Gemini, and Llama. LLMs are also foundation models.
For a detailed treatment, see What Is a Large Language Model?.
Generative AI:
A category of AI models that produce new content (text, images, audio, video, code) rather than only classifying or predicting. Generative AI is not a separate architecture; it describes the output type. LLMs are generative AI. Diffusion models for images (Stable Diffusion, DALL-E) are generative AI. Not all generative models are LLMs, and not all LLMs are used purely generatively. The contrasting category is discriminative or predictive AI, which outputs a label, category, or numerical prediction rather than generating new content. A spam filter is discriminative. ChatGPT is generative.
See Generative AI.
Foundation Model:
A large model trained on broad data at scale such that it can be adapted to a wide range of downstream tasks (Stanford HAI / CRFM, 2021). The term was coined by Stanford researchers to describe a new paradigm where a single pre-trained model, such as BERT or GPT-3, serves as a reusable base for many applications via fine-tuning or prompting. LLMs are foundation models. So are large vision models, multimodal models, and some code models. Not all foundation models are LLMs.
Data Science:
A field that combines statistics, programming, and domain knowledge to extract insights from data. Data science overlaps with ML: data scientists use ML algorithms as tools, and ML engineers productionize the models data scientists build. The key difference is orientation: data science is primarily about understanding data and answering questions; ML is about building systems that learn and generalize. A data scientist might use linear regression to forecast quarterly revenue without caring whether the result is deployed in production. An ML engineer cares deeply about deployment, monitoring, and model drift. In practice, many practitioners work across both.
Natural Language Processing (NLP):
A subfield of AI and computer science concerned with enabling machines to understand, interpret, and generate human language (IBM, 2024). NLP is the problem domain. Deep learning and transformers are current dominant methods for solving NLP tasks. LLMs are the most powerful NLP tools available today but NLP also includes older approaches such as rule-based parsers and statistical methods. Translation, sentiment analysis, speech recognition, and question answering are all NLP tasks.
Computer Vision:
The subfield of AI concerned with enabling machines to extract meaning from images and video (Google Cloud, 2024). Tasks include image classification, object detection, image segmentation, and optical character recognition. Like NLP, computer vision is a problem domain. Deep learning (specifically convolutional neural networks, and more recently transformers) provides the primary solution methods.