openelm/README-pretraining.md
Apple released something big three hours ago, and I'm still trying to get my head around exactly what it is. The parent project is called CoreNet, described as "A library …
·simonwillison.net·
Command R
Command R is a conversational model that excels in language tasks and supports multiple languages, making it ideal for coding use cases that require instruction models. It responds well to preambles that follow a specific structure and format, enhancing its performance.
·docs.cohere.com·
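A minimal sketch of that structured-preamble pattern, assuming the cohere Python SDK's chat endpoint and its preamble parameter; the API key and the section headings are placeholders:

```python
import cohere

co = cohere.Client("YOUR_API_KEY")  # placeholder key

# A structured preamble of the kind the docs describe: task context and
# style guidance in clearly labelled sections.
preamble = (
    "## Task and Context\n"
    "You help developers write small, well-documented Python utilities.\n\n"
    "## Style Guide\n"
    "Reply with one code block followed by a one-sentence explanation."
)

response = co.chat(
    model="command-r",
    preamble=preamble,
    message="Write a function that deduplicates a list, preserving order.",
)
print(response.text)
```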
Introduction | Ragas
Ragas is a framework that helps you evaluate your Retrieval Augmented Generation (RAG) pipelines. RAG denotes a class of LLM applications that use external data to augment the LLM’s context. Existing tools and frameworks help you build these pipelines, but evaluating them and quantifying their performance can be hard. This is where Ragas (RAG Assessment) comes in.
·docs.ragas.io·
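A sketch of what an evaluation run might look like, assuming Ragas's evaluate() entry point, its metrics module, and the question/answer/contexts column convention from the docs (scoring also requires an LLM to be configured, e.g. an OpenAI key):

```python
from datasets import Dataset
from ragas import evaluate
from ragas.metrics import faithfulness, answer_relevancy

# One toy record from a RAG pipeline: the user question, the generated
# answer, and the retrieved contexts the answer was grounded in.
samples = Dataset.from_dict({
    "question": ["What does Ragas evaluate?"],
    "answer": ["It scores Retrieval Augmented Generation pipelines."],
    "contexts": [["Ragas is a framework for evaluating RAG pipelines."]],
})

# Both metrics judge the answer against the retrieved contexts, so no
# reference ground truth is needed for this sketch.
results = evaluate(samples, metrics=[faithfulness, answer_relevancy])
print(results)
```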
The GPT-4 barrier has finally been broken
Four weeks ago, GPT-4 remained the undisputed champion: consistently at the top of every key benchmark, but more importantly the clear winner in terms of “vibes”. Almost everyone investing serious …
·simonwillison.net·
stanfordnlp/dspy at bramadams.dev
DSPy is a framework for algorithmically optimizing LM prompts and weights, especially when LMs are used one or more times within a pipeline. To use LMs to build a complex system without DSPy, you generally have to: (1) break the problem down into steps, (2) prompt your LM well until each step works well in isolation, (3) tweak the steps to work well together, (4) generate synthetic examples to tune each step, and (5) use these examples to finetune smaller LMs to cut costs. Currently, this is hard and messy: every time you change your pipeline, your LM, or your data, all prompts (or finetuning steps) may need to change.
·github.com·
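A hedged sketch of that workflow, assuming dspy's Signature/ChainOfThought API and an OpenAI-backed client: you declare what a step takes and returns, and leave the prompt wording to the framework rather than hand-tuning it.

```python
import dspy

# Assumes an OpenAI backend; any dspy-supported LM client works here.
dspy.settings.configure(lm=dspy.OpenAI(model="gpt-3.5-turbo"))

class GenerateAnswer(dspy.Signature):
    """Answer the question using the provided context."""
    context = dspy.InputField()
    question = dspy.InputField()
    answer = dspy.OutputField()

# ChainOfThought wraps the signature in a reason-then-answer prompt; its
# exact wording is managed (and optimizable) by DSPy, not written by hand.
qa = dspy.ChainOfThought(GenerateAnswer)
prediction = qa(
    context="DSPy algorithmically optimizes LM prompts and weights.",
    question="What does DSPy optimize?",
)
print(prediction.answer)
```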
Stable Code 3B: Coding on the Edge — Stability AI
Stable Code, an upgrade from Stable Code Alpha 3B, specializes in code completion and outperforms its predecessors in efficiency and multi-language support. It runs on standard laptops, including machines without a GPU, and adds capabilities such as fill-in-the-middle (FIM) and an expanded context size. Trained on multiple …
·stability.ai·
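A sketch of the FIM capability via Hugging Face transformers; the stabilityai/stable-code-3b checkpoint id and the StarCoder-style sentinel tokens are assumptions taken from the model card, so check there for the exact format:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "stabilityai/stable-code-3b"
tokenizer = AutoTokenizer.from_pretrained(MODEL_ID, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID, torch_dtype=torch.bfloat16, trust_remote_code=True
)

# FIM prompt: the model fills the gap between the prefix and the suffix.
prompt = (
    "<fim_prefix>def fib(n):\n"
    "<fim_suffix>\n    return fib(n - 1) + fib(n - 2)<fim_middle>"
)
inputs = tokenizer(prompt, return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=48)
print(tokenizer.decode(out[0][inputs.input_ids.shape[1]:]))
```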
The Revenge of the Cataloguers
Over the past 15 years or so, libraries around the world have de-emphasized cataloguing. While budgetary concerns and technological efficien...
·go-to-hellman.blogspot.com·
RedPajama-Data-v2: an Open Dataset with 30 Trillion Tokens for Training Large Language Models — Together AI
Releasing a new version of the RedPajama dataset, with 30 trillion filtered and deduplicated tokens (100+ trillion raw) from 84 CommonCrawl dumps covering 5 languages, along with 40+ pre-computed data quality annotations that can be used for further filtering and weighting.
·together.ai·
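A hedged sketch of pulling a slice of the corpus from the Hugging Face Hub; the dataset id and the name/partition/snapshots/languages arguments follow the dataset card and should be treated as assumptions:

```python
from datasets import load_dataset

# Stream rather than download: the full corpus is far too large to
# fetch whole.
ds = load_dataset(
    "togethercomputer/RedPajama-Data-V2",
    name="default",
    partition="head_middle",   # the slice that ships quality annotations
    snapshots=["2023-06"],     # one CommonCrawl dump out of the 84
    languages=["en"],          # one of the 5 covered languages
    split="train",
    streaming=True,
)
for row in ds.take(1):
    print(row.keys())  # raw text plus the pre-computed quality signals
```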