Search Test Information Space

Found 10 bookmarks

When Personalization Meets Reality: A Multi-Faceted Analysis of...

·arxiv.org·today at 6:21 PM

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

·arxiv.org·Apr 30, 2024

Investigating Continual Pretraining in Large Language Models: Insights and Implications

·arxiv.org·Mar 15, 2024

On The Fairness Impacts of Hardware Selection in Machine Learning

·arxiv.org·Dec 13, 2023

The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs

·openreview.net·Dec 13, 2023

Elo Uncovered: Robustness and Best Practices in Language Model Evaluation

·arxiv.org·Dec 1, 2023

Which Prompts Make The Difference? Data Prioritization For Efficient Human LLM Evaluation

·arxiv.org·Oct 27, 2023

Goodtriever: Adaptive Toxicity Mitigation with Retrieval-augmented Models

·arxiv.org·Oct 17, 2023

Evaluating the Social Impact of Generative AI Systems in Systems and Society

·arxiv.org·Jun 19, 2023

Intriguing Properties of Quantization at Scale

·arxiv.org·Jun 1, 2023