01. What It Is
Federated learning is a decentralized training approach in which a global model is improved by aggregating parameter updates computed locally on each participating node, rather than by collecting all data in one place. The term was introduced by McMahan et al. at Google in 2016 (arXiv:1602.05629) and first deployed in Gboard, the Google Keyboard for Android, in 2017.
The key architectural property: raw data never leaves the device or silo where it was generated. Only model weights, gradients, or parameter deltas are transmitted, and only after local training has occurred. The central server sees no individual training record.
Federated learning divides into two primary deployment patterns:
Cross-device:
Millions of consumer devices (phones, tablets, IoT sensors) participate intermittently. Each device holds a small local dataset. Devices participate only when idle, plugged in, and on a free wireless connection. Scale is massive but individual device resources are constrained and connectivity is unreliable.
Cross-silo:
A small number of institutions (hospitals, banks, research consortia) participate, each holding a large, well-maintained local dataset. Connectivity and compute are reliable. The main concern is inter-institutional privacy and regulatory compliance rather than resource constraints.