MIT researchers achieved 61.9% on ARC tasks by updating model parameters during inference. Is this key to AGI? We might reach the 85% AGI doorstep by scaling and integrating it with COT (Chain of thought) next year. Test-time training (TTT) for large language models typically requires additional compute resources during inference compared to standard inference. …