Quarto 1.3 adds support for embedding cells from a Jupyter Notebook into a Quarto document via an embed shortcode. In HTML documents, links are automatically added that point to a rendered version of the external notebook.
purrr 1.0.0 brings a basket of updates. We deprecated a number of seldom-used functions to home in on the core purpose of purrr and implemented a swath of new features including progress bars, improved error reporting, and much, much more!
In dplyr 1.1.0, joins have been greatly reworked, including a new way to specify join columns, support for inequality, rolling, and overlap joins, and two new quality control arguments.
Inside the secret list of websites that make AI like ChatGPT sound smart
An analysis of a chatbot data set by The Washington Post reveals the proprietary, personal, and often offensive websites that go into an AI’s training data.
When performance becomes an issue for code using tidy interfaces, switching to the backend tools used by tidy developers can offer substantial speedups.
AI-enhanced development makes me more ambitious with my projects
The thing I’m most excited about in our weird new AI-enhanced reality is the way it allows me to be more ambitious with my projects. As an experienced developer, ChatGPT …
Visualizing A Neural Machine Translation Model (Mechanics of Seq2seq Models With Attention)
Translations: Chinese (Simplified), French, Japanese, Korean, Persian, Russian, Turkish
Watch: MIT’s Deep Learning State of the Art lecture referencing this post
May 25th update: New graphics (RNN animation, word embedding graph), color coding, elaborated on the final attention example.
Note: The animations below are videos. Touch or hover on them (if you’re using a mouse) to get play controls so you can pause if needed.
Sequence-to-sequence models are deep learning models that have achieved a lot of success in tasks like machine translation, text summarization, and image captioning. Google Translate started using such a model in production in late 2016. These models are explained in the two pioneering papers (Sutskever et al., 2014; Cho et al., 2014).
I found, however, that understanding the model well enough to implement it requires unraveling a series of concepts that build on top of each other. I thought that a bunch of these ideas would be more accessible if expressed visually. That’s what I aim to do in this post. You’ll need some previous understanding of deep learning to get through this post. I hope it can be a useful companion to reading the papers mentioned above (and the attention papers linked later in the post).
A sequence-to-sequence model is a model that takes a sequence of items (words, letters, features of an image, etc.) and outputs another sequence of items. A trained model would work like this:
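The input/output contract described above can be sketched in a few lines of plain Python. This is only a stub that mirrors the interface, not the model itself: the `encode`/`decode` functions and the uppercase "translation" task are invented for illustration, standing in for the learned RNN encoder and decoder the post goes on to explain.

```python
# A minimal sketch of the sequence-to-sequence interface: a variable-length
# sequence of items goes in, another variable-length sequence comes out.
# The encode/decode split mirrors the encoder-decoder structure of real
# seq2seq models; the actual transformation here (uppercasing) is a toy
# placeholder for a learned translation.

def encode(tokens):
    # A real encoder compresses the input sequence into context vector(s);
    # this stub just passes the tokens through unchanged.
    return list(tokens)

def decode(context):
    # A real decoder emits output tokens one at a time, conditioned on the
    # context (and, with attention, on every encoder hidden state).
    return [item.upper() for item in context]

def seq2seq(tokens):
    return decode(encode(tokens))

print(seq2seq(["je", "suis", "étudiant"]))
# → ['JE', 'SUIS', 'ÉTUDIANT']
```

A trained translation model would follow the same shape, but `decode` would emit tokens in the target language (and the output length need not match the input length).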
Significance magazine - Enter our 2023 writing competition for early-career statisticians and data scientists
If you read Significance, then you are definitely interested in stories about statistics and data science, and fascinated by what data can tell us about the world we live in. So, how would you like