Karmakar, P., & Chattopadhyay, K. (2024). Six Thinking Hats: A Educational Technique to Enhance Cognitive Abilities in Education. Asian Journal of Education and Social Studies.
Greenblatt, R. et al. (2024). Alignment faking in large language models.
The Surprising Effectiveness of Test-Time Training for Abstract Reasoning
Scaling Proprioceptive-Visual Learning with Heterogeneous...
Simple probes can catch sleeper agents \ Anthropic
The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions
Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias
ZeRO++: Extremely Efficient Collective Communication for Giant Model Training - Microsoft Research
StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners
PDF
The Curse of Recursion: Training on Generated Data Makes Models Forget
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
PDF
Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes
Hyena Hierarchy: Towards Larger Convolutional Language Models
Meet in the Middle: A New Pre-training Paradigm
Extracting Training Data from Diffusion Models