Search AI/ML

Found 5 bookmarks

Sumandora/remove-refusals-with-transformers: Implements harmful/harmless refusal removal using pure HF Transformers

#transformers #model training #fine tuning

·github.com·Mar 5, 2025

Hello from Transformer Lab | Transformer Lab

Documentation for LLM Toolkit, Transformer Lab

#model training #fine tuning #transformers #local model

·transformerlab.ai·Feb 14, 2025

MIT spinoff Liquid debuts non-transformer AI models and they’re already state-of-the-art

The startup from MIT's CSAIL says its Liquid Foundation Models have smaller memory needs thanks to a post-transformer architecture.

#transformers #model training

·venturebeat.com·Oct 2, 2024

Beyond Self-Attention: How a Small Language Model Predicts the Next Token | Shyam's Blog

A deep dive into the internals of a small transformer model to learn how it turns self-attention calculations into accurate predictions for the next token.

#model training #transformers

·shyam.blog·Feb 6, 2024

explosion/curated-transformers: 🤖 A PyTorch library of curated Transformer models and their composable components

#transformers #model training

·github.com·Dec 21, 2023
