Sumandora/remove-refusals-with-transformers: Implements harmful/harmless refusal removal using pure HF TransformersImplements harmful/harmless refusal removal using pure HF Transformers - Sumandora/remove-refusals-with-transformers#transformers#model training#fine tuning·github.com·Mar 5, 2025Sumandora/remove-refusals-with-transformers: Implements harmful/harmless refusal removal using pure HF Transformers
Hello from Transformer Lab | Transformer LabDocumentation for LLM Toolkit, Transformer Lab#model training#fine tuning#transformers#local model·transformerlab.ai·Feb 14, 2025Hello from Transformer Lab | Transformer Lab
MIT spinoff Liquid debuts non-transformer AI models and they’re already state-of-the-artThe startup from MIT's CSAIL says its Liquid Foundation Models have smaller memory needs thanks to a post-transformer architecture.#transformers#model training·venturebeat.com·Oct 2, 2024MIT spinoff Liquid debuts non-transformer AI models and they’re already state-of-the-art
Beyond Self-Attention: How a Small Language Model Predicts the Next Token | Shyam's BlogA deep dive into the internals of a small transformer model to learn how it turns self-attention calculations into accurate predictions for the next token.#model training#transformers·shyam.blog·Feb 6, 2024Beyond Self-Attention: How a Small Language Model Predicts the Next Token | Shyam's Blog
explosion/curated-transformers: 🤖 A PyTorch library of curated Transformer models and their composable components#transformers#model training·github.com·Dec 21, 2023explosion/curated-transformers: 🤖 A PyTorch library of curated Transformer models and their composable components