TL;DR
A machine learning paradigm where a model temporarily updates its own parameters during inference using unlabeled test data to adapt to novel contexts or domain shifts.
Unlike traditional static inference, Test-Time Training utilizes a self-supervised auxiliary task, such as reconstruction, to optimize a subset of the model's weights on the specific test input before generating a final prediction. This allows the system to establish custom task-specific representations and correct for unexpected data distributions without changing the core model weights permanently. By doing so, it significantly improves robustness, few-shot reasoning, and generalization on structurally novel tasks.
Why this matters for your business
It bridges the gap between fixed training regimes and dynamic, real-world deployment environments, enabling AI to handle edge cases and unpredictable data flows gracefully.