Data Science at the Command Line

Data Science at the Command Line

This guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You'll learn how to combine small yet powerful command-line tools to quickly obtain, scrub, explore, and model your data.

Publication date: 07 Sep 2021

ISBN-10: 1492087912

ISBN-13: 9781492087915

Paperback: 282 pages

Views: 14,126

Data Science at the Command Line

Data Science at the Command Line This guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You'll learn how to combine small yet powerful command-line tools to quickly obtain, scrub, explore, and model your data.
Tag(s): Data Science Unix
Publication date: 07 Sep 2021
ISBN-10: 1492087912
ISBN-13: 9781492087915
Paperback: 282 pages
Views: 14,126
Document Type: Book
Publisher: O’Reilly Media, Inc.
License: Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International
Post time: 11 Mar 2022 08:00:00
Summary/Excerpts of (and not a substitute for) the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International:
You are free to:

Share — copy and redistribute the material in any medium or format

The licensor cannot revoke these freedoms as long as you follow the license terms.

Click here to read the full license.
Excerpts from the Preface:
Jeroen Janssens wrote:In this book, we’re going to obtain, scrub, explore, and model data—a lot of it. This book is not so much about how to become better at those data science tasks. There are already great resources available that discuss, for example, when to apply which statistical test or how data can best be visualized. Instead, this practical book aims to make you more efficient and productive by teaching you how to perform those data science tasks at the command line.

While this book discusses more than 90 command-line tools, it’s not the tools themselves that matter most. Some command-line tools have been around for a very long time, while others will be replaced by better ones. New command-line tools are being created even as you’re reading this. Over the years, I have discovered many amazing command-line tools. Unfortunately, some of them were discovered too late to be included in the book. In short, command-line tools come and go. But that’s OK.




About The Author(s)


Jeroen Janssens is an independent data science consultant and instructor. He enjoys visualizing data, implementing machine learning models, and building solutions using Python, R, JavaScript, and Bash. Jeroen manages Data Science Workshops, a training and coaching firm that organizes open enrollment workshops, in-company courses, inspiration sessions, hackathons, and meetups.

Jeroen Janssens

Jeroen Janssens is an independent data science consultant and instructor. He enjoys visualizing data, implementing machine learning models, and building solutions using Python, R, JavaScript, and Bash. Jeroen manages Data Science Workshops, a training and coaching firm that organizes open enrollment workshops, in-company courses, inspiration sessions, hackathons, and meetups.


Book Categories
Sponsors