GitHub - Quartz/bad-data-guide: An exhaustive reference to problems seen in real-world data along with suggestions on how to resolve them.
bad-data-guide - An exhaustive reference to problems seen in real-world data along with suggestions on how to resolve them.
7 command-line tools for data science
In this post I would like to share seven command-line tools that I have found useful in my day-to-day work as data scientist. The tools are: jq, json2csv, csvkit, scrape, xml2json, sample, and Rio.
Being a Data Scientist: My Experience and Toolset · Jefferson Heard
Personal and professional website of Jefferson Heard