On the Challenges of Using Black-Box APIs for Toxicity Evaluation in Research
Test Information Space
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models
HuggingChat
ACROCPoLis: A Descriptive Framework for Making Sense of Fairness
Visual Blocks for ML: Accelerating machine learning prototyping with interactive tools
A Cookbook of Self-Supervised Learning
Hyena Hierarchy: Towards Larger Convolutional Language Models
Nvidia releases a toolkit to make text-generating AI 'safer'
16 of the best AI and ChatGPT content detectors compared
Using the Veil of Ignorance to align AI systems with principles of justice | Proceedings of the National Academy of Sciences
Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models
Transformer Math 101
Emergent and Predictable Memorization in Large Language Models
Scaling Vision Transformers to 22 Billion Parameters
Augmented Language Models: a Survey
Learning to Compress Prompts with Gist Tokens
Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study
Neural networks: from the perceptron to deep nets
Neuromorphic learning, working memory, and metaplasticity in nanowire networks
Responsible Artificial Intelligence -- from Principles to Practice
AI Alignment Forum
Researching Alignment Research: Unsupervised Analysis
In Conversation with Artificial Intelligence: Aligning language Models with Human Values - Philosophy & Technology
Evaluating Verifiability in Generative Search Engines
How to train your own Large Language Models
AI in Hiring and Evaluating Workers: What Americans Think
Instruction Tuning with GPT-4
OpenAssistant Conversations -- Democratizing Large Language Model Alignment
Inference with Reference: Lossless Acceleration of Large Language Models
Don't "Fake It Till You Make It"