LEACE: Perfect linear concept erasure in closed form
Enabling Scalable AI Computational Lithography with Physics-Inspired Models
Faith and Fate: Limits of Transformers on Compositionality
CodeTF: One-stop Transformer Library for State-of-the-art Code LLM
Bytes Are All You Need: Transformers Operating Directly On File Bytes
Fine-Tuning Language Models with Just Forward Passes
The Impact of Positional Encoding on Length Generalization in Transformers
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training
Thought Cloning: Learning to Think while Acting by Imitating Human Thinking
BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks
Neuralangelo: High-Fidelity Neural Surface Reconstruction (CVPR 2023)
Large Language Models as Tool Makers
Stronger Together: on the Articulation of Ethical Charters, Legal Tools, and Technical Documentation in ML
Intriguing Properties of Quantization at Scale
Improving Mathematical Reasoning with Process Supervision
AutoML for neuromorphic computing and application-driven co-design: asynchronous, massively parallel optimization of spiking architectures
Improving Factuality and Reasoning in Language Models through Multiagent Debate
Learning to Generate Novel Scientific Directions with Contextualized Literature-based Discovery
Deep learning-guided discovery of an antibiotic targeting Acinetobacter baumannii
LIMA: Less Is More for Alignment
IndoorSim-to-OutdoorReal: Learning to navigate outdoors without any outdoor experience
To Compress or Not to Compress -- Self-Supervised Learning and Information Theory: A Review
A Guide to ICLR 2023 — 10 Topics and 50 papers you shouldn't miss
Hyena Hierarchy: Towards Larger Convolutional Language Models
Language Models can Solve Computer Tasks
TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings
Visualizing the Implicit Model Selection Tradeoff
DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking
Reflexion: an autonomous agent with dynamic memory and self-reflection
Magnushammer: A Transformer-based Approach to Premise Selection