MACAW: An Accessible Tool for Molecular Embedding and Inverse Molecular Design
The growing capabilities of synthetic biology and organic chemistry demand tools to guide syntheses toward useful molecules. Here, we present Molecular AutoenCoding Auto-Workaround (MACAW), a tool that uses a novel approach to generate molecules predicted to meet a desired property specification (e.g., a binding affinity of 50 nM or an octane number of 90). MACAW describes molecules by embedding them into a smooth multidimensional numerical space, avoiding the uninformative dimensions that previous methods often introduce. The coordinates in this embedding provide a natural choice of features for accurately predicting molecular properties, which we demonstrate with examples for cetane and octane numbers, flash points, and histamine H1 receptor binding affinity. The approach is computationally efficient and well suited to the small- and medium-size datasets commonly used in the biosciences. We showcase the utility of MACAW for virtual screening by identifying molecules with high predicted binding affinity to the histamine H1 receptor and limited affinity to the muscarinic M2 receptor, both targets of medicinal relevance. Combining these predictive capabilities with a novel generative algorithm for molecules allows us to recommend molecules with a desired property value (i.e., inverse molecular design). We demonstrate this capability by recommending molecules with predicted octane numbers of 40, 80, and 120; octane number is an important characteristic of biofuels. Thus, MACAW augments classical retrosynthesis tools by recommending molecules made to specification.
More than an OpenAI Wrapper: Perplexity Pivots to Open Source
Perplexity CEO Aravind Srinivas is a big Larry Page fan. But he thinks he's found a way to compete not only with Google search, but with OpenAI's GPT too.
How to Build a Retrieval Augmented Generative AI Application
RAG AI is a cutting-edge application that marries a Flask backend with a Streamlit frontend, creating a dynamic and interactive user experience. At its core,...
You Can Build an App in 60 Minutes with ChatGPT - Ep. 5 with Geoffrey Litt
This show might be a first in the history of podcasts:
Researcher Geoffrey Litt and I built an app together using the ChatGPT app and Replit in under 60 minutes—while we talked.
We wanted to show how AI and ChatGPT change who gets to build software and how they usher in a world where everyone can modify and remix the apps they use every day.
So we did it live, and ChatGPT delivered a working prototype at the end of the episode.
It was a tiny glimpse of the future—and it pushes the boundaries of what a show can be. It honestly left me speechless and it'll change the way you think about software. If it does, make sure to subscribe, share, and leave us a review!
Timestamps:
00:01:03 - Intro
00:01:36 - What is malleable software?
00:08:06 - Who gets to make software on the web?
00:14:50 - Deciding what app to build
00:22:06 - Starting on our app
00:31:07 - Don’t read the code first
00:47:55 - Starting from scratch could soon be a thing of the past
00:55:50 - Getting past those final error messages
01:03:31 - Voila! An app
01:04:50 - Effortless flow
Links:
https://www.geoffreylitt.com/2023/03/25/llm-end-user-programming.html
https://every.to/chain-of-thought/what-comes-after-saas
https://chat.openai.com/g/g-qPeu5SFW6-micro-web-app-coder
All computer users may soon have the ability to author small bits of code. What structural changes does this imply for the production and distribution of software?
This year has felt distinctly different. I've been working in, on, and with machine learning and AI for over a decade, yet I can't recall a time when these fields were as popular and rapidly evolving as they have been this year. To conclude an eventful 2023 in machine learning and AI research, I'm excited to share 10 noteworthy papers I've read this year. My personal focus has been more on large language models, so you'll find a heavier emphasis on large language model (LLM) papers than computer vision papers this year.
Explore the intriguing history of Eliza, a pioneering chatbot, and learn how to implement a basic version in Go, unraveling the roots of conversational AI.
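The article walks through building ELIZA in Go. As a rough, independent sketch (not the article's code), the core idea is keyword matching plus pronoun "reflection" so a fragment of the user's input can be echoed back as a question; the specific rules and phrasings below are illustrative assumptions:

```go
package main

import (
	"fmt"
	"strings"
)

// reflect swaps first- and second-person words so an echoed
// fragment reads naturally ("my" -> "your", "i" -> "you", ...).
func reflect(fragment string) string {
	swaps := map[string]string{
		"i": "you", "me": "you", "my": "your",
		"am": "are", "you": "I", "your": "my",
	}
	words := strings.Fields(strings.ToLower(fragment))
	for i, w := range words {
		if r, ok := swaps[w]; ok {
			words[i] = r
		}
	}
	return strings.Join(words, " ")
}

// Respond applies simple keyword rules, the heart of the original ELIZA:
// match a trigger phrase, reflect the remainder, and fall back to a
// content-free prompt when nothing matches.
func Respond(input string) string {
	lower := strings.ToLower(strings.TrimRight(input, ".!? "))
	for _, kw := range []string{"i am ", "i feel "} {
		if idx := strings.Index(lower, kw); idx >= 0 {
			rest := lower[idx+len(kw):]
			return fmt.Sprintf("Why are you %s?", reflect(rest))
		}
	}
	if strings.Contains(lower, "mother") || strings.Contains(lower, "father") {
		return "Tell me more about your family."
	}
	return "Please go on."
}

func main() {
	fmt.Println(Respond("I am sad about my job"))
	// prints: Why are you sad about your job?
}
```

A full ELIZA adds many more rules, ranked keywords, and memory, but this loop is enough to show why the original program felt conversational despite understanding nothing.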
AI/ML has seen rapidly accelerating model improvement in the last few years. The majority of the state-of-the-art models in the field are based on the Transformer architecture. Examples include models like BERT (which, when applied to Google Search, resulted in what Google calls "one of the biggest leaps forward in the history of Search") and OpenAI's GPT-2 and GPT-3 (which are able to generate coherent text and essays).
This video by the author of the popular "Illustrated Transformer" guide will introduce the Transformer architecture and its various applications. This is a visual presentation accessible to people with various levels of ML experience.
Intro (0:00)
The Architecture of the Transformer (4:18)
Model Training (7:11)
Transformer LM Component 1: FFNN (10:01)
Transformer LM Component 2: Self-Attention (12:27)
Tokenization: Words to Token Ids (14:59)
Embedding: Breathe meaning into tokens (19:42)
Projecting the Output: Turning Computation into Language (24:11)
Final Note: Visualizing Probabilities (25:51)
The Illustrated Transformer:
https://jalammar.github.io/illustrated-transformer/
Simple transformer language model notebook:
https://github.com/jalammar/jalammar.github.io/blob/master/notebooks/Simple_Transformer_Language_Model.ipynb
Philosophers On GPT-3 (updated with replies by GPT-3):
https://dailynous.com/2020/07/30/philosophers-gpt-3/
-----
Twitter: https://twitter.com/JayAlammar
Blog: https://jalammar.github.io/
Mailing List: https://jayalammar.substack.com/
More videos by Jay:
Jay's Visual Intro to AI
https://www.youtube.com/watch?v=mSTCzNgDJy4
How GPT-3 Works - Easily Explained with Animations
https://www.youtube.com/watch?v=MQnJZuBGmSQ
Watch: MIT’s Deep Learning State of the Art lecture referencing this post
Featured in courses at Stanford, Harvard, MIT, Princeton, CMU and others
In the previous post, we looked at Attention – a ubiquitous method in modern deep learning models. Attention is a concept that helped improve the performance of neural machine translation applications. In this post, we will look at The Transformer – a model that uses attention to boost the speed with which these models can be trained. The Transformer outperforms the Google Neural Machine Translation model in specific tasks. The biggest benefit, however, comes from how The Transformer lends itself to parallelization. Google Cloud in fact recommends The Transformer as the reference model for its Cloud TPU offering. So let's try to break the model apart and look at how it functions.
The Transformer was proposed in the paper Attention is All You Need. A TensorFlow implementation of it is available as part of the Tensor2Tensor package. Harvard's NLP group created a guide annotating the paper with a PyTorch implementation. In this post, we will attempt to oversimplify things a bit and introduce the concepts one by one, to hopefully make them easier to understand for people without in-depth knowledge of the subject matter.
2020 Update: I’ve created a “Narrated Transformer” video which is a gentler approach to the topic:
A High-Level Look
Let’s begin by looking at the model as a single black box. In a machine translation application, it would take a sentence in one language, and output its translation in another.