ALWAYS Use Option Explicit, Work with Arrays, Not With Ranges, Switch off properties during macro execution, Use VB constants when available, Avoid using SELECT or ACTIVATE, Always define and set references to all Workbooks and Sheets, Use descriptive variable naming, Document Your Work, Error Handling, Never Assume The Worksheet, Avoid using ActiveCell or ActiveSheet in Excel, WorksheetFunction object executes faster than a UDF equivalent, Avoid re-purposing the names of Properties or Methods as your variables
Without the Data Model and tools like Talend, data can completely fail to provide business value, or worse impede its success through inaccuracy, misuse, or misunderstanding.
Data Modeling - Relational Databases (SQL) vs Data Lake (File Based) - Confessions of a Data Guy
Data Modeling is a topic that never goes away. Sometimes I do reminisce about the good ol’ days of Kimball-style data models, it was so simple, straightforward, just the same thing for years. Then Big Data happened, Spark happened. Things just changed. There is a lot of new content coming out around Data Lakes and […]
Ever heard the phrase “Read the source, Luke”? It’s a play on “Use the force, Luke” from Star Wars, with no definite source 😉 that we could find^[We erroneously first linked to a rather recent blog post but Robert Link corrected us in a comment that we reproduce here in case the post gets separated from its comments: ““Use the Source, Luke” goes way back before 2012, and probably even before blogs were a thing.
Why Every Entrepreneur Should Be Doing A Weekly Review
Ever get the feeling nagging at the back of your head that there’s something you’re supposed to do, but you can't remember what it is? Or you know you're supposed to ask something specific to the person you're talking to? Or you've got so many open loops in that you aren't sure what the logical next step is? I used to. I don't very much anymore. I've always been a bit of the absent minded professor type, prone to getting lost in my head or on a walk. That means that I've always had a problem losing or forgetting things that need to get taken care of.
awslabs/aws-dataall: A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.
A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS. - GitHub - awslabs/...
Monitor data quality in your data lake using PyDeequ and AWS Glue | AWS Big Data Blog
In our previous post, we introduced PyDeequ, an open-source Python wrapper over Deequ, which enables you to write unit tests on your data to ensure data quality. The use case we ran through was on static, historical data, but most datasets are dynamic, so how can you quantify how your data is changing and detect […]
haciduru/kal: This is a script that you can use to encrypt data using Rscript command line tool. It would be impossibly difficult to decrypt the data if you had not seen the code in this file.
This is a script that you can use to encrypt data using Rscript command line tool. It would be impossibly difficult to decrypt the data if you had not seen the code in this file. - GitHub - hacidur...
msberends/hashing: Simple Shiny app to encrypt data for safe, anonymised data transfer
Simple Shiny app to encrypt data for safe, anonymised data transfer - GitHub - msberends/hashing: Simple Shiny app to encrypt data for safe, anonymised data transfer
phildias/cryptR: R code that allows you to encrypt and decrypt large sensitive files
R code that allows you to encrypt and decrypt large sensitive files - GitHub - phildias/cryptR: R code that allows you to encrypt and decrypt large sensitive files
Welcome to WikiOD - Wiki Online Documentation WikiOD is a collaborative writing project to build the highest quality online documentation of all programming languages, education-related subjects and concepts.
When developing a package it is essential to track the changes you make to your code. This is especially vital if they are breaking changes which have implications for any code written that depends on your package, i.e. a major version bump. Although you can always look back at your version control history in git, it is also convenient to have documentation which summarises the changes. This is where the NEWS file comes in.
Automatic Codebooks from Metadata Encoded in Dataset Attributes
Easily automate the following tasks to describe data frames: Summarise the distributions, and labelled missings of variables graphically and using descriptive statistics. For surveys, compute and summarise reliabilities (internal consistencies, retest, multilevel) for psychological scales. Combine this information with metadata (such as item labels and labelled values) that is derived from R attributes. To do so, the package relies on rmarkdown partials, so you can generate HTML, PDF, and Word documents. Codebooks are also available as tables (CSV, Excel, etc.) and in JSON-LD, so that search engines can find your data and index the metadata. The metadata are also available at your fingertips via RStudio Addins.