Search Test Information Space

Found 3 bookmarks

Newest

2309

#Interpretability #Agents #Explainability #Paper #PDF

·arxiv.org·Jan 8, 2024

Language models can explain neurons in language models

#Interpretability #Large Language Models #Paper #PDF

·openaipublic.blob.core.windows.net·May 27, 2023

Language models can explain neurons in language models

Progress measures for grokking via mechanistic interpretability

#Machine Learning #Interpretability #Paper #PDF #Explainability #Deep Learning

·arxiv.org·Feb 5, 2023

Progress measures for grokking via mechanistic interpretability