AI/ML

1896 bookmarks
pytudes/ipynb/CherylMind.ipynb at main · norvig/pytudes
There has been much debate on the degree to which Large Language Models (LLMs) have a theory of mind: a way of understanding what other people know and don't know. In this notebook I explore one small part of the issue by asking nine LLM chatbots to solve the Cheryl's Birthday Problem, a well-known logic puzzle in which different characters have different states of knowledge at different times.
·github.com·
mlx-vlm
The MLX ecosystem of libraries for running machine learning models on Apple Silicon continues to expand. Prince Canuma is actively developing this library for running vision models such as Qwen-2 …
·simonwillison.net·
Curiosity - AI search for everything
The ultimate AI productivity app that protects your privacy. Bring all your apps and data into one AI-powered search and assistant. Get it for you and for your teams today.
·curiosity.ai·
How streaming LLM APIs work | Simon Willison’s TILs
I decided to have a poke around and see if I could figure out how the HTTP streaming APIs from the various hosted LLM providers actually worked. Here are my notes so far.
·til.simonwillison.net·
Introducing Contextual Retrieval \ Anthropic
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
·anthropic.com·
GitHub - ictnlp/LLaMA-Omni: LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
·github.com·
Sourcetable | Your AI Data Analyst
Sourcetable is an AI spreadsheet that helps you analyze data and create reports. Chat with your data, create charts and graphs, build financial models, + more.
·sourcetable.com·
Introducing Contextual Retrieval
Here's an interesting new embedding/RAG technique, described by Anthropic but it should work for any embedding model against any other LLM. One of the big challenges in implementing semantic search …
·simonwillison.net·
Reverse engineering OpenAI’s o1
What productionizing test-time compute shows us about the future of AI. Exploration has landed in language model training.
·interconnects.ai·