RedPajama, a project to create leading open-source models, starts by reproducing the LLaMA training dataset of over 1.2 trillion tokens — TOGETHER
RedPajama is a project to create a set of leading, fully open-source models. Today, we are excited to announce the completion of the first step of this project: the reproduction of the LLaMA training dataset of over 1.2 trillion tokens.
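For readers who want to poke at the data itself, here is a minimal sketch of streaming a few documents with the Hugging Face datasets library; the dataset identifier, configuration, and field name are assumptions rather than details confirmed by the announcement.

```python
# Minimal sketch: stream a handful of documents instead of downloading the full ~1.2T-token corpus.
# Assumes the data is published as "togethercomputer/RedPajama-Data-1T" with a "text" field.
from datasets import load_dataset

ds = load_dataset("togethercomputer/RedPajama-Data-1T", split="train", streaming=True)
for i, example in enumerate(ds):
    print(example["text"][:200])  # first 200 characters of each document
    if i >= 4:
        break
```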
This bot is trained on over 200k comments and posts from BuyItForLife subreddits to embody the collective knowledge of the Reddit BuyItForLife community.
Make loading weights 10-100x faster by jart · Pull Request #613 · ggerganov/llama.cpp
This is a breaking change that's going to give us three benefits:
1. Your inference commands should load 100x faster
2. You may be able to safely load models 2x larger
3. You can run many concurrent infere...
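The benefits listed are the classic signature of memory-mapped loading: the OS pages the weights in lazily rather than copying them into freshly allocated buffers, and processes mapping the same file share physical pages. A rough Python illustration of that general idea, not the PR's actual C++ code; the file name and dtype here are placeholders.

```python
# Rough illustration of mmap-style weight loading: nothing is bulk-copied up front,
# and only the pages actually touched are faulted in from disk by the OS.
# "model-weights.bin" and float32 are placeholders, not llama.cpp's real format.
import numpy as np

weights = np.memmap("model-weights.bin", dtype=np.float32, mode="r")
print(weights.shape)   # available immediately; no full read has happened yet
print(weights[:8])     # touching data faults in just those pages
```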
Cerebras-GPT: A Family of Open, Compute-efficient, Large Language Models - Cerebras
Cerebras open sources seven GPT-3 models from 111 million to 13 billion parameters. Trained using the Chinchilla formula, these models set new benchmarks for accuracy and compute efficiency.
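As a back-of-the-envelope check on what the Chinchilla formula implies, the usual heuristic is roughly 20 training tokens per parameter, so the compute-optimal token budget grows linearly with model size; the snippet below works that out for a few sizes in the announced range (the exact ratios Cerebras used are not stated here).

```python
# Chinchilla rule of thumb: ~20 training tokens per parameter (assumed ratio).
# 111M and 13B come from the announcement; 1.3B is just an illustrative midpoint.
TOKENS_PER_PARAM = 20

for params in (111e6, 1.3e9, 13e9):
    tokens = params * TOKENS_PER_PARAM
    print(f"{params / 1e9:6.3f}B params -> ~{tokens / 1e9:.0f}B tokens")
```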
Lightning-AI/lit-llama: Implementation of the LLaMA language model based on nanoGPT. Supports quantization, LoRA fine-tuning, pre-training. Apache 2.0-licensed.
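Since LoRA fine-tuning is one of the repo's headline features, here is a generic sketch of the LoRA idea, a frozen weight matrix plus a small trainable low-rank update; this is plain PyTorch to show the concept, not lit-llama's own API.

```python
# Generic LoRA sketch (not lit-llama's API): keep W frozen and learn a low-rank update B @ A.
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    def __init__(self, in_features, out_features, rank=8, alpha=16):
        super().__init__()
        # Frozen base weight; in real fine-tuning this would come from the pretrained model.
        self.weight = nn.Parameter(torch.randn(out_features, in_features), requires_grad=False)
        self.lora_A = nn.Parameter(torch.randn(rank, in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(out_features, rank))  # zero init: starts as a no-op
        self.scale = alpha / rank

    def forward(self, x):
        return x @ self.weight.T + (x @ self.lora_A.T @ self.lora_B.T) * self.scale

layer = LoRALinear(512, 512)
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
print(trainable)  # only the low-rank A and B matrices are trained
```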
Hello Dolly: Democratizing the magic of ChatGPT with open models
Introducing Dolly, a breakthrough LLM from Databricks. Learn how Databricks open-sourced the model and all of its training code, enabling organizations to re-create Dolly at minimal cost.
tatsu-lab/stanford_alpaca: Code and documentation to train Stanford's Alpaca models, and generate the data.
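The repo covers both generating the instruction data and fine-tuning on it; below is a minimal sketch of rendering one instruction/input/output record into a training prompt. The field names follow the released data's structure, while the prompt wording is paraphrased rather than quoted from the repo.

```python
# Sketch of an Alpaca-style record and prompt construction.
# Record fields (instruction/input/output) follow the released data's structure;
# the prompt text below is paraphrased, not the repo's exact template.
example = {
    "instruction": "Summarize the following sentence.",
    "input": "Large language models are having their Stable Diffusion moment.",
    "output": "Capable language models are suddenly cheap and easy to build on.",
}

def build_prompt(rec):
    if rec["input"]:
        return ("Below is an instruction paired with an input. Write a response.\n\n"
                f"### Instruction:\n{rec['instruction']}\n\n"
                f"### Input:\n{rec['input']}\n\n### Response:\n")
    return ("Below is an instruction. Write a response.\n\n"
            f"### Instruction:\n{rec['instruction']}\n\n### Response:\n")

print(build_prompt(example) + example["output"])
```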
Stanford Alpaca, and the acceleration of on-device large language model development
On Saturday 11th March I wrote about how Large language models are having their Stable Diffusion moment. Today is Monday. Let’s look at what’s happened in the past three days. …
Wizard of Wikipedia is a large dataset of conversations directly grounded in knowledge retrieved from Wikipedia. It is used to train and evaluate dialogue systems for knowledgeable open dialogue with clear grounding.