AI/ML

37 bookmarks

Custom sorting

AddyOsmani.com - An Engineer's Guide to AI Code Model Evals

A deep dive into evals, goldens, and hill climbing for improving coding-capable AI models.

#AI/ML #training #guides #evals #models

·addyosmani.com·Jul 29, 2025

AddyOsmani.com - An Engineer's Guide to AI Code Model Evals

Introducing pay per crawl: Enabling content owners to charge AI crawlers for access

Pay per crawl is a new feature to allow content creators to charge AI crawlers for access to their content.

#AI/ML #cloudflare #payments #paid #content #services

·blog.cloudflare.com·Jul 28, 2025

Introducing pay per crawl: Enabling content owners to charge AI crawlers for access

Prompt Engineering | Kaggle

Kaggle is the world’s largest data science community with powerful tools and resources to help you achieve your data science goals.

#AI/ML #prompt #engineering #whitepaper #kaggle

·kaggle.com·Apr 12, 2025

Prompt Engineering | Kaggle

a-practical-guide-to-building-agents.pdf

#AI/ML #agent #openai

·cdn.openai.com·Apr 19, 2025

a-practical-guide-to-building-agents.pdf

This Chatgpt Prompt= $20k growth consultant. : r/PromptEngineering

Explore this post and more from the PromptEngineering community

#AI/ML #chatGPT #prompt #engineering #growth #consultant #reddit

·reddit.com·May 29, 2025

This Chatgpt Prompt= $20k growth consultant. : r/PromptEngineering

blog/Lora-for-sequence-classification-with-Roberta-Llama-Mistral.md at main · huggingface/blog

Public repo for HF blog posts. Contribute to huggingface/blog development by creating an account on GitHub.

#AI/ML #LLM #roberta #llama #mistral #disaster #tweets #analysis #lora

·github.com·Mar 4, 2025

blog/Lora-for-sequence-classification-with-Roberta-Llama-Mistral.md at main · huggingface/blog

Training LLM on 1000s of GPUs made simple : r/LocalLLaMA

#AI/ML #reddit #LLM #thousands #GPUs #guides

·reddit.com·Feb 23, 2025

Training LLM on 1000s of GPUs made simple : r/LocalLLaMA

Open-source DeepResearch – Freeing our search agents

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

#AI/ML #DeepResearch #search #agent #openai

·huggingface.co·Feb 5, 2025

Open-source DeepResearch – Freeing our search agents

How To Scale Your Model

Training LLMs often feels like alchemy, but understanding and optimizing the performance of your models doesn't have to. This book aims to demystify the science of scaling language models on TPUs: how TPUs work and how they communicate with each other, how LLMs run on real hardware, and how to parallelize your models during training and inference so they run efficiently at massive scale. If you've ever wondered “how expensive should this LLM be to train or “how much memory do I need to serve this model myself” or “what's an AllGather”, we hope this will be useful to you.

#AI/ML #LLM #scaling #TPUs

·jax-ml.github.io·Feb 6, 2025

How To Scale Your Model

A friendly introduction to distributed training (ML Tech Talks)

Google Cloud Developer Advocate Nikita Namjoshi introduces how distributed training models can dramatically reduce machine learning training times, explains ...

#AI/ML #distributed #training #tensorflow

·youtube.com·Feb 12, 2025

A friendly introduction to distributed training (ML Tech Talks)

GPU System Requirements for Running DeepSeek-R1

Discover the GPU system requirements to run DeepSeek-R1 and its distilled models effectively, along with recommendations for choosing the right hardware for your needs.

#AI/ML #LLM #DeepSeek #GPUs

·apxml.com·Feb 3, 2025

GPU System Requirements for Running DeepSeek-R1

Embeddings are underrated

#AI/ML #embeddings #models #python

·technicalwriting.dev·Nov 1, 2024

Embeddings are underrated

Terence Tao, mathematician: ‘It’s not good for something as important as AI to be a monopoly held by one or two companies’ | Science | EL PAÍS English

The Fields Medal winner is attempting to solve one of the Millennium Problems, with a reward of $1 million, but he also applies his analysis to topical enigmas such as the Venezuelan election and the advance of artificial intelligence

#AI/ML #Terence Tao #Venezuela #math #corruption #elections

·english.elpais.com·Oct 16, 2024

Terence Tao, mathematician: ‘It’s not good for something as important as AI to be a monopoly held by one or two companies’ | Science | EL PAÍS English

What Is ChatGPT Doing … and Why Does It Work?

Stephen Wolfram explores the broader picture of what's going on inside ChatGPT and why it produces meaningful text. Discusses models, training neural nets, embeddings, tokens, transformers, language syntax.

#AI/ML #GPT #recognition #LLM #transformer

·writings.stephenwolfram.com·Oct 9, 2024

What Is ChatGPT Doing … and Why Does It Work?

The Chinese Room - 60-Second Adventures in Thought (3/6)

An argument against computers ever being truly intelligent. (Part 3 of 6) Playlist link - https://www.youtube.com/playlist?list=PL73A886F2DD959FF1 Transcript link - http://podcast.open.ac.uk/feeds/thoughtexperiments-01/transcript/60second03_01691_16695.pdf Study a free course on Introducing philosophy at the Open University https://www.open.edu/openlearn/history-the-arts/culture/philosophy/introducing-philosophy/content-section-0?active-tab=description-tab Study R14 BA (Honours) Arts and Humanities (Philosophy) http://www.open.ac.uk/courses/qualifications/r14-p Explore qualifications in Philosophy with the OU http://www.open.ac.uk/courses/find/philosophy The Open University is the world’s leading provider of flexible, high-quality online degrees and distance learning, serving students across the globe with highly respected degree qualifications, and the triple-accredited MBA. The OU teaches through its own unique method of distance learning, called ‘supported open learning’ and you do not need any formal qualifications to study with us, just commitment and a desire to find out what you are capable of. Free learning from The Open University http://www.open.edu/openlearn/ For more like this subscribe to the Open University channel https://www.youtube.com/channel/UCXsH4hSV_kEdAOsupMMm4Qw Like us on Facebook: https://www.facebook.com/ouopenlearn/ Follow us on Twitter: https://twitter.com/OUFreeLearning #OpenUniversity #paradox

#AI/ML #philosophy #turing #testings

·youtube.com·Oct 9, 2024

The Chinese Room - 60-Second Adventures in Thought (3/6)

So you want to build your own open source ChatGPT-style chatbot… – Mozilla Hacks - the Web developer blog

A small team within Mozilla’s innovation group recently undertook a hackathon to build a trustworthy internal chatbot prototype.

#AI/ML #mozilla #chatbot #guides

·hacks.mozilla.org·Oct 9, 2024

So you want to build your own open source ChatGPT-style chatbot… – Mozilla Hacks - the Web developer blog

Thread by @altryne on Thread Reader App

@altryne: Watching @karpathy presentation from today and taking twitter notes, come along for the ride: If you're like only the practical tips, skip to #32 @karpathy starts with stages: 1 - Pre-training - months x th...…

#AI/ML #GPT #training #GPUs

·threadreaderapp.com·Oct 9, 2024

Thread by @altryne on Thread Reader App

Understanding GPT tokenizers

Large language models such as GPT-3/4, LLaMA and PaLM work in terms of tokens. They take text, convert it into tokens (integers), then predict which tokens should come next. Playing …

#AI/ML #tokens #GPT

·simonwillison.net·Oct 9, 2024

Understanding GPT tokenizers

Report on Frontier Model Training — LessWrong

Understanding what drives the rising capabilities of AI is important for those who work to forecast, regulate, or ensure the safety of AI. Regulation…

#AI/ML #GPUs #frontier #training #costs

·lesswrong.com·Oct 9, 2024

Report on Frontier Model Training — LessWrong

Patterns for Building LLM-based Systems & Products

Evals, RAG, fine-tuning, caching, guardrails, defensive UX, and collecting user feedback.

#LLM #production #cache #localStorage #fine-tuning #guardrails #defensive #feedback #evalutations

·eugeneyan.com·Oct 8, 2024

Patterns for Building LLM-based Systems & Products

Prompt Engineering

Prompt Engineering, also known as In-Context Prompting, refers to methods for how to communicate with LLM to steer its behavior for desired outcomes without updating the model weights. It is an empirical science and the effect of prompt engineering methods can vary a lot among models, thus requiring heavy experimentation and heuristics. This post only focuses on prompt engineering for autoregressive language models, so nothing with Cloze tests, image generation or multimodality models.

#LLM #prompt #engineering

·lilianweng.github.io·Oct 8, 2024

Prompt Engineering

Nvidia H100 GPUs: Supply and Demand

This post is an exploration of the supply and demand of GPUs, particularly Nvidia H100s.

#AI/ML #LLM #nvidia #H100 #GPUs

·gpus.llm-utils.org·Oct 8, 2024

Nvidia H100 GPUs: Supply and Demand

Hot Chips 34 – Tesla’s Dojo Microarchitecture

To say Tesla is merely interested in machine learning is an understatement.

#AI/ML #microarchitecture #tesla

·chipsandcheese.com·Oct 8, 2024

Hot Chips 34 – Tesla’s Dojo Microarchitecture

How does GPT Obtain its Ability? Tracing Emergent Abilities of Language Models to their Sources

A new tool that blends your everyday work apps into one. It's the all-in-one workspace for you and your team

#LLM #GPT #history #timeline #story

·yaofu.notion.site·Oct 8, 2024

How does GPT Obtain its Ability? Tracing Emergent Abilities of Language Models to their Sources

We’re Entering Uncharted Territory for Math - The Atlantic

Terence Tao, the world’s greatest living mathematician, has a vision for AI.

#AI/ML #math #Terence Tao #Q&A

·theatlantic.com·Oct 8, 2024

We’re Entering Uncharted Territory for Math - The Atlantic

Inside the AI Factory

How many humans does it take to make tech seem human? Millions.

ew recruits would file into an office building in Nairobi to begin their apprenticeships. There seemed to be limitless demand for the work. They would be

#AI/ML #story #training #annotation #RLHF #datasets

·theverge.com·Oct 7, 2024

Inside the AI Factory

How to make LLMs go fast

Blog about linguistics, programming, and my projects

#LLM #performance

·vgel.me·Dec 28, 2023

How to make LLMs go fast

Advanced AI Guide by The Rundown.

Thanks for signing up to The Rundown built by @therundownai and @rowancheung! Enjoy our free Advanced ChatGPT Guide as a warm welcome into the world of AI!

#AI/ML #guides

·vaulted-polonium-23c.notion.site·Dec 18, 2023

Advanced AI Guide by The Rundown.