LAION-5B is an open-source foundation dataset used to train AI models such as Stable Diffusion. It contains 5.8 billion image and text pairs—a size too large to make sense of. In this visual investigation, we follow the construction of the dataset to better understand its contents, implications and entanglements.
Why the Great AI Backlash Came for a Tiny Startup You’ve Probably Never Heard Of
A literary analytics project called Prosecraft has shuttered after backlash from the writing community. It's a harbinger of a bigger cultural tide shift.
Climate Justice and Labor Rights | Part I: AI Supply Chains and Workflows
Tamara Kneese, Data & Society Research Institute Download full report here. Introduction In the second half of 2023, generative AI is dominating headlines. Policymakers, technologists, and activists are all grappling with its potential implications for communities and the planet. Integrating LLMs (large language models) into search engines may multiply the carbon emissions associated with each […]
Using generative AI (like ChatGPT) in business improves users’ performance by 66%, averaged across 3 case studies. More complex tasks have bigger gains, and less-skilled workers benefit the most from AI use.
Self-healing code is the future of software development
Developers love automating solutions to their problems, and with the rise of generative AI, this concept is likely to be applied to both the creation, maintenance, and the improvement of code at an entirely new level.
AI is introducing the third user-interface paradigm in computing history, shifting to a new interaction mechanism where users tell the computer what they want, not how to do it — thus reversing the locus of control.
EU Parliament vote on AI Act: Lawmakers chose to protect people against harms of AI systems - AlgorithmWatch
After long months of intense negotiations, members of the European Parliament voted on the EU’s Artificial Intelligence Act (AI Act). AlgorithmWatch applauds the Parliament for strengthening fundamental rights protection against the negative impacts of AI, such as that of face recognition in public spaces. Yet, the Parliament missed the opportunity to enhance protection for some people who would need it most.
This post consists of three parts.
Part 1 discusses the key challenges of productionizing LLM applications and the solutions that I’ve seen.
Part 2 discusses how to compose multiple tasks with control flows (e.g. if statement, for loop) and incorporate tools (e.g. SQL executor, bash, web browsers, third-party APIs) for more complex and powerful applications.
Part 3 covers some of the promising use cases that I’ve seen companies building on top of LLMs and how to construct them from smaller tasks.
This post consists of three parts.
Part 1 discusses the key challenges of productionizing LLM applications and the solutions that I’ve seen.
Part 2 discusses how to compose multiple tasks with control flows (e.g. if statement, for loop) and incorporate tools (e.g. SQL executor, bash, web browsers, third-party APIs) for more complex and powerful applications.
Part 3 covers some of the promising use cases that I’ve seen companies building on top of LLMs and how to construct them from smaller tasks.
Stephen Wolfram explores the broader picture of what's going on inside ChatGPT and why it produces meaningful text. Discusses models, training neural nets, embeddings, tokens, transformers, language syntax.
Learn about the training pipeline of GPT assistants like ChatGPT, from tokenization to pretraining, supervised finetuning, and Reinforcement Learning from Hu...