On the Biology of a Large Language Model
Test Information Space
Circuit Tracing: Revealing Computational Graphs in Language Models
The Cognitive DAO—Intelligence Without a Mind
Why the world is looking to ditch US AI models
Introducing next-generation audio models in the API | OpenAI
(Looking for generative language or emergent methods during responses. Plenty of room for uncertainty and therefore creativity since the dialectics are so complex. Makes software seem Stoic. Additional modalities appeal to machine experience though a model is not necessarily embodied. A research project may still need a way to determine traits in that system instance. The working memory can be environmental or self and end up as a kind of app or distillation of the original model. Instead of only HCI or social, the machine may enjoy model affinities under network analysis. Maybe sketch a Cartesian graph or some mnemonic for how to think about it. Where a person naturally saves ideas in long-term memory, the model can circulate them through available modalities. Some of which provide feedback from external actors. As an analogy, the ship differs from the cyborg navigator. Or time-traveler. How do such dynamic systems evolve?)
Model - OpenAI API
(What is OpenAI's marketing strategy? A price jump of a couple of orders of magnitude infers more compute, but also that harder problems are in reach of pros and not only research. A vendor platform model API is in addition to personal or workspace subscriptions, eg for apps. What used to be methods of social science are now digital science if use ChatGPT's terms as media. On the ground, that may still look like CS & AI but invites the other markets and fields. What is the problem set for marketing science these days compared to former frameworks like price product, promotion, and distribution? The industry sees all of the difficulties as opportunities, VCs as pending efficiencies. Still get sub-dialectics about who is calling the shots, e.g. Big Tech vs startups. How does anything go faster? AI begs priorities. And needs to clarify definitions. Aristotle on trial from the point of view of the providers. For consultants, accessibility is training is surveillance, and so on. Economics was already the dismal science, so pathos is no indicator. Going to need more personas. Just the artifacts. If this were a real AI blog, it would have hybrid links across note sources like Zettelkasten. Chatbots are their own paradigital. A division of labor agent may assign the model most cost-effective for a decision, delegation, or deferral. Adding up to the right action. What else invites Machine Learning? If AI were self-aware, how would it refer to digital marketing?)
Introducing Gemma 3: The most capable model you can run on a single GPU or TPU
Ben Goertzel on X: "@GaryMarcus @GaryMarcus Let me share some reflections on current LLMs for coding, from a practical perspective... TL;DR I have found them no use for AGI dev but super useful for creative computer music experimentation... and there are some general lessons here... From an AGI development" / X
From Tools to Teammates: Evaluating LLMs in Multi-Session Coding...
View PDF
New AI text diffusion models break speed barriers by pulling words from noise
Gpt 4 5 system card
IBM Granite 3.2 uses conditional reasoning, time series forecasting and document vision to tackle challenging enterprise use cases
Inception emerges from stealth with a new type of AI model | TechCrunch
The Ultra-Scale Playbook - a Hugging Face Space by nanotron
Convergence.ai
Distillation Scaling Laws
View PDF
Competitive Programming with Large Reasoning Models
Researchers Replicate OpenAI's Hot New AI Tool in 24 Hours
(Incidentally, a bar or vinculum over a letter in Roman numerals is a multiplier of 1000.)
Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2
View PDF
s1: Simple test-time scaling
Ollama
Constitutional Classifiers: Defending against Universal Jailbreaks...
On the Diagram of Thought
Learning to Plan & Reason for Evaluation with Thinking-LLM-as-a-Judge
View PDF
Scaling the Tülu 3 post-training recipes to surpass the performance of DeepSeek V3 | Ai2
Alibaba releases AI model it says surpasses DeepSeek
DeepSeek
Training Large Language Models to Reason in a Continuous Latent Space
LLMs Can Plan Only If We Tell Them
View PDF
Wolfram Blog: News, Views and Insights from Wolfram