GeistHaus
log in · sign up

Welcome | Data Science at the Command Line, 2e

jeroenjanssens.com

This thoroughly revised guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You’ll learn how to combine small yet powerful command-line tools to quickly obtain, scrub, explore, and model your data. To get you started, author Jeroen Janssens provides a Docker image packed with over 100 Unix power tools—useful whether you work with Windows, macOS, or Linux.

1 page links to this URL
Text processing on the Command Line - sharing my tools

In this article I share my enthusiasm for text processing on the unix command line. First I just list some of the underlying principles and common tools; then I share the various text processing tools I myself have written over the years. Some of the larger tools are accompanied by screencast videos to quickly give you an impression of what they can do.

0 inbound links website en CC BY-NC-SA 4.0