Search Test Information Space

Found 219 bookmarks

Newest

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

#Large Language Models #Evaluation #Peer Review #Paper #PDF #Cohere

·arxiv.org·Apr 30, 2024

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Large language models use a surprisingly simple mechanism to retrieve some stored knowledge - MIT Schwarzman College of Computing

paper detailing these findings

#Large Language Models #Reverse Engineering #Paper

·computing.mit.edu·Apr 27, 2024

Large language models use a surprisingly simple mechanism to retrieve some stored knowledge - MIT Schwarzman College of Computing

SnapKV: LLM Knows What You are Looking for Before Generation

View PDF

#Large Language Models #Paper #PDF

·arxiv.org·Apr 24, 2024

SnapKV: LLM Knows What You are Looking for Before Generation

Simple probes can catch sleeper agents \ Anthropic

#Training #Large Language Models #Anthropic #Paper #Classification #Cybersecurity

·anthropic.com·Apr 24, 2024

Simple probes can catch sleeper agents \ Anthropic

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

#Large Language Models #Opensource #Apple #Paper #PDF

·arxiv.org·Apr 24, 2024

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions

#Large Language Models #Priorities #Instruction #Paper #PDF #Training

·arxiv.org·Apr 24, 2024

The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

View PDF

#Large Language Models #Microsoft #Edge Computing #Smartphone #Paper #PDF #Small Language Models

·arxiv.org·Apr 23, 2024

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Anil, C., Durmus, E., Sharma, M., Benton, J., Kundu, S., Batson, J., ... & Duvenaud, D. (2024). Many-shot Jailbreaking.

Long contexts represent a new front in the struggle to control LLMs. We explored a family of attacks that are newly feasible due to longer context lengths, as well as candidate mitigations. We found that the effectiveness of attacks, and of in-context learning more generally, could be characterized by simple power laws. This provides a richer source of feedback for mitigating long-context attacks than the standard approach of measuring frequency of success

#Anthropic #Prompt Engineering #Large Language Models #Paper #PDF

·www-cdn.anthropic.com·Apr 10, 2024

Anil, C., Durmus, E., Sharma, M., Benton, J., Kundu, S., Batson, J., ... & Duvenaud, D. (2024). Many-shot Jailbreaking.

Does Transformer Interpretability Transfer to RNNs?

#RNN #Transformers #Large Language Models #Paper #PDF #EleutherAI

·arxiv.org·Apr 10, 2024

Does Transformer Interpretability Transfer to RNNs?

Codegemma report

#Gemma #Coding #DeepMind #Google #Large Language Models #Paper #PDF #Opensource

·storage.googleapis.com·Apr 10, 2024

Codegemma report

ReALM: Reference Resolution As Language Modeling

#Large Language Models #Paper #PDF #Apple

·arxiv.org·Apr 1, 2024

ReALM: Reference Resolution As Language Modeling

Jamba: A Hybrid Transformer-Mamba Language Model

#Large Language Models #Paper #PDF #Mixture of Experts

·arxiv.org·Apr 1, 2024

Jamba: A Hybrid Transformer-Mamba Language Model

Nay, J. J., Karamardian, D., Lawsky, S. B., Tao, W., Bhat, M., Jain, R., ... & Kasai, J. (2024). Large language models as tax attorneys: a case study in legal capabilities emergence. Philosophical Transactions of the Royal Society A, 382(2270), 20230159.

#Legal #Taxation #Large Language Models #Paper #PDF

·royalsocietypublishing.org·Mar 31, 2024

Long-form factuality in large language models

#Large Language Models #Accuracy #Fact-checking #Paper #PDF

·arxiv.org·Mar 29, 2024

Long-form factuality in large language models

Language Models Can Reduce Asymmetry in Information Markets

#Economics #Paper #PDF #Large Language Models

·arxiv.org·Mar 27, 2024

Language Models Can Reduce Asymmetry in Information Markets