[2201.02177] Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets #KI #research #grokking #overgeneralization · arxiv.org · Mar 5, 2024
Large Language Models: A Survey #KI #research #LLM #overview · arxiv.org · Feb 13, 2024