awslabs/aws-dataall: A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.
A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS. - GitHub - awslabs/...
Monitor data quality in your data lake using PyDeequ and AWS Glue | AWS Big Data Blog
In our previous post, we introduced PyDeequ, an open-source Python wrapper over Deequ, which enables you to write unit tests on your data to ensure data quality. The use case we ran through was on static, historical data, but most datasets are dynamic, so how can you quantify how your data is changing and detect […]
haciduru/kal: This is a script that you can use to encrypt data using Rscript command line tool. It would be impossibly difficult to decrypt the data if you had not seen the code in this file.
This is a script that you can use to encrypt data using Rscript command line tool. It would be impossibly difficult to decrypt the data if you had not seen the code in this file. - GitHub - hacidur...
msberends/hashing: Simple Shiny app to encrypt data for safe, anonymised data transfer
Simple Shiny app to encrypt data for safe, anonymised data transfer - GitHub - msberends/hashing: Simple Shiny app to encrypt data for safe, anonymised data transfer
phildias/cryptR: R code that allows you to encrypt and decrypt large sensitive files
R code that allows you to encrypt and decrypt large sensitive files - GitHub - phildias/cryptR: R code that allows you to encrypt and decrypt large sensitive files
Welcome to WikiOD - Wiki Online Documentation WikiOD is a collaborative writing project to build the highest quality online documentation of all programming languages, education-related subjects and concepts.
When developing a package it is essential to track the changes you make to your code. This is especially vital if they are breaking changes which have implications for any code written that depends on your package, i.e. a major version bump. Although you can always look back at your version control history in git, it is also convenient to have documentation which summarises the changes. This is where the NEWS file comes in.
Automatic Codebooks from Metadata Encoded in Dataset Attributes
Easily automate the following tasks to describe data frames: Summarise the distributions, and labelled missings of variables graphically and using descriptive statistics. For surveys, compute and summarise reliabilities (internal consistencies, retest, multilevel) for psychological scales. Combine this information with metadata (such as item labels and labelled values) that is derived from R attributes. To do so, the package relies on rmarkdown partials, so you can generate HTML, PDF, and Word documents. Codebooks are also available as tables (CSV, Excel, etc.) and in JSON-LD, so that search engines can find your data and index the metadata. The metadata are also available at your fingertips via RStudio Addins.
Install R in Ubuntu: A Complete Guide to Setting up R Development Environment - LinuxForDevices
In this guide, we'll learn to install R in Ubuntu. R is an open-source programming language used extensively for statistical and scientific computing. Due to
{rspm}: easy access to RSPM binary packages with automatic management of system requirements
There are many community projects out there that provide binary R packages for various distributions. You may know Michael Rutter’s legendary c2d4u.team/c2d4u4.0+ PPA, but this situation has …
Powerful Terminal And Command-Line (CLI) Tools For Modern Web Development — Smashing Magazine
What’s your favorite command-line tool? In this post, Louis Lazaris shares a collection of relevant command-line apps and utilities that he has personally come across in the past few years. If there’s a useful one that hasn’t been mentioned and one you use regularly, please do share it in the comments.