KDnuggets Top Blog Winner

Linux for Data Science Cheatsheet

KDnuggets is back with another exclusive cheatsheet, this time sharing a Linux quick reference for data science.



 

Why Bother with Linux?

 
Are you a data scientist? Are you familiar with the Linux command line? If not, you should be.

Linux, in its various flavors, is an incredibly modular operating system. Arguing for it over other operating systems would descend quickly into flame war territory, so I won't try to convince you that you should be using Linux (though you should be), and will instead focus on giving a helping hand to those who want to use it, or already do. Pair that modularity with the power of a command line and the fact that a plethora of open source Linux flavors exist, and you have the ingredients for your ideal, tailored workstation with all the OS horsepower you need.

The command line can be intimidating, but it doesn't have to be. Just think of it as constructing short combinations of programming commands in a REPL. You can code, right? Then you can use the command line.
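To make the "short combinations of commands" idea concrete, here is a minimal sketch of a pipeline that counts how many rows in a small CSV carry a given label. The file name sample.csv, its contents, and the variable names are made up for illustration; nothing here comes from the cheatsheet itself.

```shell
# Work in a throwaway directory so nothing real is touched.
workdir=$(mktemp -d)

# Hypothetical sample data: a header line plus three rows.
printf 'id,label\n1,cat\n2,dog\n3,cat\n' > "$workdir/sample.csv"

# grep keeps the lines ending in ",cat", wc -l counts them,
# and tr strips any padding some wc implementations add.
cat_rows=$(grep ',cat$' "$workdir/sample.csv" | wc -l | tr -d ' ')

echo "rows labelled cat: $cat_rows"
```

Each command does one small job, and the pipe character hands the output of one to the next, much like chaining function calls in a REPL.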

 

Linux for Data Science Cheatsheet

 
The real trick to becoming proficient with the Linux command line, beyond learning the commands that are available, is keeping reference material close at hand. This is where the latest KDnuggets cheatsheet comes into play.

This cheatsheet covers 16 of the most useful Linux terminal commands, enough to get you doing much of your day-to-day OS business from the command line right now. With a little practice, you could find that this makes your daily activity quicker, easier, and more flexible than pointing and clicking.

You can download the cheatsheet here.

 


In the cheatsheet, you will learn (and have a reference for) commands such as:

  • ls
  • cd
  • wc
  • grep
  • cp
  • diff
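As a rough taste of the six commands listed above, the following sketch runs each one once against a tiny scratch dataset. The file names data.csv and backup.csv and their contents are hypothetical, chosen only for illustration, and the cheatsheet itself may present these commands with different options.

```shell
# Scratch directory with a small, made-up dataset.
workdir=$(mktemp -d)
cd "$workdir"                       # cd: change the working directory
printf 'id,label\n1,cat\n2,dog\n3,cat\n' > data.csv

ls                                  # ls: list the files in this directory
line_count=$(wc -l < data.csv)      # wc -l: count the lines in a file
grep 'dog' data.csv                 # grep: print lines matching a pattern
cp data.csv backup.csv              # cp: copy a file
echo '4,bird' >> data.csv           # append a row to the original only

# diff: show the lines that differ between two files.
# diff exits with status 1 when the files differ, so "|| true"
# keeps that from being treated as a failure.
diff backup.csv data.csv || true
```

Everything happens inside a temporary directory, so it is safe to paste into a terminal and experiment with.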

Unsure of what some of those commands are for? Then this cheatsheet is for you!

Check it out now, and check back soon for more.