LIBMoE: A Library for comprehensive benchmarking Mixture of... · #Mixture of Experts #Benchmark #Large Language Models #Paper #PDF · arxiv.org · Nov 6, 2024
Multi-Head Mixture-of-Experts · #Mixture of Experts #Machine Learning #Microsoft #Paper #PDF · arxiv.org · Apr 24, 2024
Jamba: A Hybrid Transformer-Mamba Language Model · #Large Language Models #Paper #PDF #Mixture of Experts · arxiv.org · Apr 1, 2024
Hinton vs LeCun vs Ng vs Tegmark vs O · #Regulation #Mixture of Experts #Blog · garymarcus.substack.com · Nov 27, 2023
CS25 I Stanford Seminar - Mixture of Experts (MoE) paradigm and the Switch Transformer · #Transformers #Mixture of Experts #Machine Learning #Large Language Models · youtube.com · Jul 18, 2022