Tiktokenizer

Development
Cursor – Working with Context
How to work with context in Cursor
Intent context defines what the user wants to get out of the model. For example, a system prompt usually serves as high-level instructions for how the user wants the model to behave. Most of the “prompting” done in Cursor is intent context. “Turn that button from blue to green” is an example of stated intent; it is prescriptive.
State context describes the state of the current world. Providing Cursor with error messages, console logs, images, and chunks of code are examples of context related to state. It is descriptive, not prescriptive.
Together, these two types of context work in harmony by describing the current state and desired future state, enabling Cursor to make useful coding suggestions.
Azure Files documentation
Mount file shares in the cloud or on-premises on Windows, Linux, and macOS. Cache Azure file shares on Windows Servers with Azure File Sync for local access.
Use storage mounts in Azure Container Apps
Learn to use temporary and permanent storage mounts in Azure Container Apps
MCP Use Cloud
Highlight AI | Master your world
Get instant answers about anything you've seen, heard or said. Download free: highlightai.com
Agora Protocol · Scalable Communication Between Agents
Scalable communication between agents
Get started - xlcharts
How to make data pipelines idempotent
Unable to find practical examples of idempotent data pipelines? Then, this post is for you. In this post, we go over a technique that you can use to make your data pipelines professional and data reprocessing a breeze.
Shell and A.I - Steven Bucher - PSConfEU 2024
In this extensive lecture, I, Steven Bucher, a product manager on the PowerShell team, discuss the integration of AI into the shell environment. Over the pas...
You Don’t Need Airflow: Orchestrate Many Data Flows in R with Maestro – data-in-flight
Shiny App Workflows
This is a book that covers the standard shiny app
workflow.
Customisable Icon Markers for leaflet
Use modern Icon libraries to construct customisable leaflet marker icons.
autodb: Automatic Database Normalisation for Data Frames
Automatic normalisation of a data frame to third normal form, with the intention of easing the process of data cleaning. (Usage to design your actual database for you is not advised.) Originally inspired by the 'AutoNormalize' library for 'Python' by 'Alteryx' (<a href="https://github.com/alteryx/autonormalize" target="_top"https://github.com/alteryx/autonormalize/a>), with various changes and improvements. Automatic discovery of functional or approximate dependencies, normalisation based on those, and plotting of the resulting "database" via 'Graphviz', with options to exclude some attributes at discovery time, or remove discovered dependencies at normalisation time.
Data Pipeline Design Patterns - #1. Data flow patterns
Data pipelines built (and added on to) without a solid foundation will suffer from poor efficiency, slow development speed, long times to triage production issues, and hard testability. What if your data pipelines are elegant and enable you to deliver features quickly? An easy-to-maintain and extendable data pipeline significantly increase developer morale, stakeholder trust, and the business bottom line! Using the correct design pattern will increase feature delivery speed and developer value (allowing devs to do more in less time), decrease toil during pipeline failures, and build trust with stakeholders. This post goes over the most commonly used data flow design patterns, what they do, when to use them, and, more importantly, when not to use them. By the end of this post, you will have an overview of the typical data flow patterns and be able to choose the right one for your use case.
Nuitka the Python Compiler — Nuitka the Python Compiler
With the Python compiler Nuitka, you create protected binaries from your Python source code.
Catalog of Patterns of Distributed Systems
A catalog of patterns to better understand, communicate, and teach the design of distributed systems
Logging - Engineering Fundamentals Playbook
ISE Engineering Fundamentals Engineering Playbook
Logs vs Metrics vs Traces - Engineering Fundamentals Playbook
ISE Engineering Fundamentals Engineering Playbook
REST API Design Guidance - Engineering Fundamentals Playbook
ISE Engineering Fundamentals Engineering Playbook
Non-Functional Requirements Capture - Engineering Fundamentals Playbook
ISE Engineering Fundamentals Engineering Playbook
Data Transfer Object
An object that carries data between processes in order to
reduce the number of method calls.
How to Split Address in Excel: A Step-by-Step Guide
If you want to split address in Excel, this comprehensive guide is your ticket to mastering it as You’ll explore through different functions.
Advanced Tidyverse
Use piped workflows for efficient data cleaning and visualization.
Technical Guidelines for R
Best practices with R around select topics.
Summarizing and Querying Data from Excel Spreadsheets Using eparse and a Large Language Model
Editor's Note: This post was written by Chris Pappalardo, a Senior Director at Alvarez & Marsal, a leading global professional services firm. The standard processes for building with LLM work well for documents that contain mostly text, but do not work as well for documents that contain tabular data (like spreadsheets). We wrote about our latest thinking on Q&A over csvs on the blog a couple weeks ago, and we loved reading Chris's exploration of working with csvs and LangChain using agents, chai
Overview - Cellm
Use LLMs in Excel formulas
Generating Structured Output with LLMs (Part 1)
LLMs are great at generating text, but how do you get them to generate structured output?
BillPetti/baseballr: A package written for R focused on baseball analysis. Currently in development.
A package written for R focused on baseball analysis. Currently in development. - BillPetti/baseballr
Ploomber AI Editor
Create custom Streamlit and Shiny R apps effortlessly with AI assistance. Design, code, and deploy data apps in minutes.