Karmakar, P., & Chattopadhyay, K. (2024). Six Thinking Hats: A Educational Technique to Enhance Cognitive Abilities in Education. Asian Journal of Education and Social Studies. #Cognition #Pattern Recognition #Creativity #Training #Paper #PDF · public.paper4promo.com · Jan 5, 2025
Greenblatt, R. et al. (2024). Alignment faking in large language models. #Alignment #Paper #Training #Anthropic · assets.anthropic.com · Dec 18, 2024
The Surprising Effectiveness of Test-Time Training for Abstract Reasoning #Large Language Models #Training #Testing #Paper #PDF · arxiv.org · Dec 9, 2024
Scaling Proprioceptive-Visual Learning with Heterogeneous... #Robotics #Training #Paper #PDF #Transfer Learning · arxiv.org · Nov 3, 2024
Simple probes can catch sleeper agents \ Anthropic #Training #Large Language Models #Anthropic #Paper #Classification #Cybersecurity · anthropic.com · Apr 24, 2024
The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions #Large Language Models #Priorities #Instruction #Paper #PDF #Training · arxiv.org · Apr 24, 2024
Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias #Large Language Models #Training #Machine Learning #Paper #PDF · arxiv.org · Jul 4, 2023
ZeRO++: Extremely Efficient Collective Communication for Giant Model Training - Microsoft Research #Large Language Models #Training #Microsoft #Paper #PDF · microsoft.com · Jun 22, 2023
The Curse of Recursion: Training on Generated Data Makes Models Forget #Large Language Models #Training #Paper #PDF · arxiv.org · Jun 14, 2023
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day #Large Language Models #Biomedical #Training #Paper #PDF #Microsoft · arxiv.org · Jun 13, 2023
StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners #Machine Learning #Stable Diffusion #Training #Computer Vision #Paper #PDF · arxiv.org · Jun 4, 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward Model #Large Language Models #Preferences #Reward #Training #Paper #PDF · arxiv.org · Jun 4, 2023
Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes #Large Language Models #Training #Coding #Paper #PDF · arxiv.org · Apr 30, 2023
Hyena Hierarchy: Towards Larger Convolutional Language Models #Large Language Models #Architecture #Training #Paper #PDF #Machine Learning #CNN · arxiv.org · Apr 25, 2023
Meet in the Middle: A New Pre-training Paradigm #Training #Paper #PDF #Large Language Models · arxiv.org · Mar 19, 2023
Extracting Training Data from Diffusion Models #Training #Machine Learning #Diffusion #Paper #PDF #Analysis · arxiv.org · Feb 2, 2023