LLMs

LLMs

318 bookmarks
Custom sorting
Will AI hamper our ability to crawl the web for useful data?
Will AI hamper our ability to crawl the web for useful data?
As websites start to block Common Crawl, and as the project leans in to its role in training LLMs, will it become harder to use data from the web for other purposes?
·blog.ldodds.com·
Will AI hamper our ability to crawl the web for useful data?
Prompt Engineering
Prompt Engineering
Prompt Engineering, also known as In-Context Prompting, refers to methods for how to communicate with LLM to steer its behavior for desired outcomes without updating the model weights. It is an empirical science and the effect of prompt engineering methods can vary a lot among models, thus requiring heavy experimentation and heuristics. This post only focuses on prompt engineering for autoregressive language models, so nothing with Cloze tests, image generation or multimodality models.
·lilianweng.github.io·
Prompt Engineering
Why gzip Just Beat a Large Language Model
Why gzip Just Beat a Large Language Model
A paper has shown that a compression algorithm – gzip – outperforms some large language models (LLMs) in some tasks. This has the NLP community …
·hendrik-erz.de·
Why gzip Just Beat a Large Language Model
What Is a Large Language Model?
What Is a Large Language Model?
A primer on what large language models are, why they are used, the different types, and what the future may hold for LLM applications.
·thenewstack.io·
What Is a Large Language Model?
Meeting the Operational Challenges of Training LLMs
Meeting the Operational Challenges of Training LLMs
To train a large language model, you must overcome three big challenges: data, hardware and legal. It helps to be a large organization, too.
·thenewstack.io·
Meeting the Operational Challenges of Training LLMs
A new series on LLM-assisted coding
A new series on LLM-assisted coding
In the 20th episode of my Mastodon series I pivoted to a new topic: LLM-assisted coding. After three posts in the new series, it got picked up by The New Stack. Here’s the full list so far, I…
·blog.jonudell.net·
A new series on LLM-assisted coding
Managing LLM Context Is a Knapsack Problem
Managing LLM Context Is a Knapsack Problem
LLMs can be more useful and less prone to hallucination when they’re able to read relevant documents, webpages, and prior conversations before responding to a new user question.
·awelm.com·
Managing LLM Context Is a Knapsack Problem
LMQL: Programming Large Language Models
LMQL: Programming Large Language Models
LMQL is a query language for large language models (LLMs). It facilitates LLM interaction by combining the benefits of natural language prompting with the expressiveness of Python.
·lmql.ai·
LMQL: Programming Large Language Models
How Containers, LLMs, and GPUs Fit with Data Apps
How Containers, LLMs, and GPUs Fit with Data Apps
Containers, large language models (LLMs), and GPUs provide a foundation for developers to build services for what Nvidia CEO Jensen Huang describes as an "AI Factory."
·thenewstack.io·
How Containers, LLMs, and GPUs Fit with Data Apps
What Large Language Models Can Do Well Now, and What They Can't
What Large Language Models Can Do Well Now, and What They Can't
At QCon New York earlier this month, two OpenAI engineers demonstrated ChatGPT's newest feature, Functions, in one session. Another talk, however, pointed to the inherent limitations of LLMs.
·thenewstack.io·
What Large Language Models Can Do Well Now, and What They Can't