Search Test Information Space

Found 747 bookmarks

Newest

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

#Large Language Models #Evaluation #Peer Review #Paper #PDF #Cohere

·arxiv.org·Apr 30, 2024

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Large language models use a surprisingly simple mechanism to retrieve some stored knowledge - MIT Schwarzman College of Computing

paper detailing these findings

#Large Language Models #Reverse Engineering #Paper

·computing.mit.edu·Apr 27, 2024

Large language models use a surprisingly simple mechanism to retrieve some stored knowledge - MIT Schwarzman College of Computing

Retrieval Head Mechanistically Explains Long-Context Factuality

View PDF

#Transformers #Machine Learning #Paper #PDF

·arxiv.org·Apr 25, 2024

Retrieval Head Mechanistically Explains Long-Context Factuality

SnapKV: LLM Knows What You are Looking for Before Generation

View PDF

#Large Language Models #Paper #PDF

·arxiv.org·Apr 24, 2024

SnapKV: LLM Knows What You are Looking for Before Generation

Simple probes can catch sleeper agents \ Anthropic

#Training #Large Language Models #Anthropic #Paper #Classification #Cybersecurity

·anthropic.com·Apr 24, 2024

Simple probes can catch sleeper agents \ Anthropic

Instructors as Innovators: a Future-focused Approach to New AI Learning Opportunities, With Prompts

#Education #AI #Personalization #Paper #PDF

·papers.ssrn.com·Apr 24, 2024

Instructors as Innovators: a Future-focused Approach to New AI Learning Opportunities, With Prompts

FlashSpeech: Efficient Zero-Shot Speech Synthesis

#Generative Speech #Machine Learning #Audio #Paper #PDF

·arxiv.org·Apr 24, 2024

FlashSpeech: Efficient Zero-Shot Speech Synthesis

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

#Large Language Models #Opensource #Apple #Paper #PDF

·arxiv.org·Apr 24, 2024

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Multi-Head Mixture-of-Experts

View PDF

#Mixture of Experts #Machine Learning #Microsoft #Paper #PDF

·arxiv.org·Apr 24, 2024

Multi-Head Mixture-of-Experts

The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions

#Large Language Models #Priorities #Instruction #Paper #PDF #Training

·arxiv.org·Apr 24, 2024

The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

View PDF

#Large Language Models #Microsoft #Edge Computing #Smartphone #Paper #PDF #Small Language Models

·arxiv.org·Apr 23, 2024

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Design of highly functional genome editors by modeling the universe of CRISPR-Cas sequences

Download PDF

#CRISPR #Opensource #Biology #Editing #Paper #PDF #Generative AI

·biorxiv.org·Apr 23, 2024

Design of highly functional genome editors by modeling the universe of CRISPR-Cas sequences

LLM Agents can Autonomously Exploit One-day Vulnerabilities

#GPT-4 #Paper #PDF #Cybersecurity

·arxiv.org·Apr 22, 2024

LLM Agents can Autonomously Exploit One-day Vulnerabilities

🥇Top ML Papers of the Week

#Machine Learning #Research #Paper

·nlp.elvissaravia.com·Apr 21, 2024

🥇Top ML Papers of the Week

The ethics of advanced ai assistants 2024 i

#AI #Digital Assistants #Ethics #DeepMind #Paper #PDF

·storage.googleapis.com·Apr 19, 2024

The ethics of advanced ai assistants 2024 i

Researchers want a ‘nutrition label’ for academic-paper facts

#Labels #Recommendations #Paper #Research #Academics #Science

·nature.com·Apr 18, 2024

Researchers want a ‘nutrition label’ for academic-paper facts

AI Index Report 2024 – Artificial Intelligence Index

#AI #Report #Stanford #Paper #PDF

·aiindex.stanford.edu·Apr 15, 2024

AI Index Report 2024 – Artificial Intelligence Index

Is artificial intelligence the great filter that makes advanced technical civilisations rare in the universe?

(Conversely, substrates are like Fermi. Where are they found?)

#AI #Fermi Paradox #Paper #PDF

·sciencedirect.com·Apr 14, 2024

Is artificial intelligence the great filter that makes advanced technical civilisations rare in the universe?

🥇Top ML Papers of the Week

#Machine Learning #Research #Paper

·nlp.elvissaravia.com·Apr 14, 2024

🥇Top ML Papers of the Week

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

#Transformers #Memory #Paper #PDF

·arxiv.org·Apr 13, 2024

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

OpenEQA: From word models to world models

OpenEQA combines challenging open-vocabulary questions with the ability to answer in natural language. This results in a straightforward benchmark that demonstrates a strong understanding of the environment—and poses a considerable challenge to current foundational models. We hope this work motivates additional research into helping AI understand and communicate about the world it sees.

#Meta #Questions and Answers #Benchmark #Blog #Paper #PDF

·ai.meta.com·Apr 12, 2024

OpenEQA: From word models to world models

(PDF) Ethics of Quantum Technologies: A Scoping Review

The majority of the research has focused on the potential impact of quantum technologies on privacy and security, the potential impact of quantum technologies on the trust of those systems, and the potential for creating new forms of inequality in access to the technology.

#Literature Review #Quantum Computing #Technology #Paper #PDF #Ethics

·researchgate.net·Apr 11, 2024

(PDF) Ethics of Quantum Technologies: A Scoping Review

Anil, C., Durmus, E., Sharma, M., Benton, J., Kundu, S., Batson, J., ... & Duvenaud, D. (2024). Many-shot Jailbreaking.

Long contexts represent a new front in the struggle to control LLMs. We explored a family of attacks that are newly feasible due to longer context lengths, as well as candidate mitigations. We found that the effectiveness of attacks, and of in-context learning more generally, could be characterized by simple power laws. This provides a richer source of feedback for mitigating long-context attacks than the standard approach of measuring frequency of success

#Anthropic #Prompt Engineering #Large Language Models #Paper #PDF

·www-cdn.anthropic.com·Apr 10, 2024

Anil, C., Durmus, E., Sharma, M., Benton, J., Kundu, S., Batson, J., ... & Duvenaud, D. (2024). Many-shot Jailbreaking.

Does Transformer Interpretability Transfer to RNNs?

#RNN #Transformers #Large Language Models #Paper #PDF #EleutherAI

·arxiv.org·Apr 10, 2024

Does Transformer Interpretability Transfer to RNNs?

Codegemma report

#Gemma #Coding #DeepMind #Google #Large Language Models #Paper #PDF #Opensource

·storage.googleapis.com·Apr 10, 2024

Codegemma report

ReALM: Reference Resolution As Language Modeling

#Large Language Models #Paper #PDF #Apple

·arxiv.org·Apr 1, 2024

ReALM: Reference Resolution As Language Modeling

Jamba: A Hybrid Transformer-Mamba Language Model

#Large Language Models #Paper #PDF #Mixture of Experts

·arxiv.org·Apr 1, 2024

Jamba: A Hybrid Transformer-Mamba Language Model

Nay, J. J., Karamardian, D., Lawsky, S. B., Tao, W., Bhat, M., Jain, R., ... & Kasai, J. (2024). Large language models as tax attorneys: a case study in legal capabilities emergence. Philosophical Transactions of the Royal Society A, 382(2270), 20230159.

#Legal #Taxation #Large Language Models #Paper #PDF

·royalsocietypublishing.org·Mar 31, 2024

🥇Top ML Papers of the Week

#Machine Learning #Paper

·nlp.elvissaravia.com·Mar 31, 2024

🥇Top ML Papers of the Week

Long-form factuality in large language models

#Large Language Models #Accuracy #Fact-checking #Paper #PDF

·arxiv.org·Mar 29, 2024

Long-form factuality in large language models