Karmakar, P., & Chattopadhyay, K. (2024). Six Thinking Hats: An Educational Technique to Enhance Cognitive Abilities in Education. Asian Journal of Education and Social Studies.
The Surprising Effectiveness of Test-Time Training for Abstract Reasoning
Scaling Proprioceptive-Visual Learning with Heterogeneous...
The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions
Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias
ZeRO++: Extremely Efficient Collective Communication for Giant Model Training
StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners
The Curse of Recursion: Training on Generated Data Makes Models Forget
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes
Hyena Hierarchy: Towards Larger Convolutional Language Models
Meet in the Middle: A New Pre-training Paradigm
Extracting Training Data from Diffusion Models