Python has rapidly became a leading language for Data Science and Machine Learning.
In the latest KDnuggets Poll
.
This page brings you the latest KDnuggets Opinions and Tutorials related to Python, as well as our most popular - gold and silver-badge winning content. Enjoy!
- Shapash: Making Machine Learning Models Understandable [Gold Blog]
Establishing an expectation for trust around AI technologies may soon become one of the most important skills provided by Data Scientists. Significant research investments are underway in this area, and new tools are being developed, such as Shapash, an open-source Python library that helps Data Scientists make machine learning models more transparent and understandable.
Tags: Explainability, Machine Learning, Python, SHAP
- Top 10 Python Libraries Data Scientists should know in 2021 [Gold Blog]
So many Python libraries exist that offer powerful and efficient foundations for supporting your data science work and machine learning model development. While the list may seem overwhelming, there are certain libraries you should focus your time on, as they are some of the most commonly used today.
Tags: Data Science, Keras, numpy, Pandas, Python, scikit-learn, Seaborn, TensorFlow
- Know your data much faster with the new Sweetviz Python library [Silver Blog]
One of the latest exploratory data analysis libraries is a new open-source Python library called Sweetviz, for just the purposes of finding out data types, missing information, distribution of values, correlations, etc. Find out more about the library and how to use it here.
Tags: Data Analysis, Data Exploration, Data Visualization, Python
- Are You Still Using Pandas to Process Big Data in 2021? Here are two better options [Silver Blog]
When its time to handle a lot of data -- so much that you are in the realm of Big Data -- what tools can you use to wrangle the data, especially in a notebook environment? Pandas doesn’t handle really Big Data very well, but two other libraries do. So, which one is better and faster?
Tags: Big Data, Dask, Data Preparation, Pandas, Python, Vaex
- Data Science Learning Roadmap for 2021 [Gold Blog]
Venturing into the world of Data Science is an exciting, interesting, and rewarding path to consider. There is a great deal to master, and this self-learning recommendation plan will guide you toward establishing a solid understanding of all that is foundational to data science as well as a solid portfolio to showcase your developed expertise.
Tags: Data Engineering, Data Preparation, Data Science, Data Science Education, Python, Roadmap, SQL
- Powerful Exploratory Data Analysis in just two lines of code [Gold Blog]
EDA is a fundamental early process for any Data Science investigation. Typical approaches for visualization and exploration are powerful, but can be cumbersome for getting to the heart of your data. Now, you can get to know your data much faster with only a few lines of code... and it might even be fun!
Tags: Data Analysis, Data Exploration, Data Visualization, Python
- Approaching (Almost) Any Machine Learning Problem [Silver Blog]
This freely-available book is a fantastic walkthrough of practical approaches to machine learning problems.
Tags: Deep Learning, Free ebook, Machine Learning, Python
- Essential Math for Data Science: Introduction to Matrices and the Matrix Product [Silver Blog]
As vectors, matrices are data structures allowing you to organize numbers. They are square or rectangular arrays containing values organized in two dimensions: as rows and columns. You can think of them as a spreadsheet. Learn more here.
Tags: Data Science, Linear Algebra, Mathematics, numpy, Python
- Build Your First Data Science Application [Silver Blog]
Check out these seven Python libraries to make your first data science MVP application.
Tags: API, Data Science, Jupyter, Keras, numpy, Pandas, Plotly, Python, PyTorch, scikit-learn
- How to create stunning visualizations using python from scratch [Platinum Blog]
Data science and data analytics can be beautiful things. Not only because of the insights and enhancements to decision-making they can provide, but because of the rich visualizations about the data that can be created. Following this step-by-step guide using the Matplotlib and Seaborn libraries will help you improve the presentation and effective communication of your work.
Tags: Data Visualization, Matplotlib, Python, Seaborn
- Getting Started with 5 Essential Natural Language Processing Libraries [Silver Blog]
This article is an overview of how to get started with 5 popular Python NLP libraries, from those for linguistic data visualization, to data preprocessing, to multi-task functionality, to state of the art language modeling, and beyond.
Tags: Data Preparation, Data Preprocessing, Data Visualization, Hugging Face, NLP, Python, spaCy, Text Analytics, Transformer
- Cleaner Data Analysis with Pandas Using Pipes [Silver Blog]
Check out this practical guide on Pandas pipes.
Tags: Data Analysis, Data Cleaning, Pandas, Pipeline, Python
- Best Python IDEs and Code Editors You Should Know [Platinum Blog]
Developing machine learning algorithms requires implementing countless libraries and integrating many supporting tools and software packages. All this magic must be written by you in yet another tool -- the IDE -- that is fundamental to all your code work and can drive your productivity. These top Python IDEs and code editors are among the best tools available for you to consider, and are reviewed with their noteworthy features.
Tags: IDE, Jupyter, PyCharm, Python, Visual Studio Code
- 10 Underappreciated Python Packages for Machine Learning Practitioners [Gold Blog]
Here are 10 underappreciated Python packages covering neural architecture design, calibration, UI creation and dissemination.
Tags: Deployment, Neural Networks, Python, UI/UX
- 15 Free Data Science, Machine Learning & Statistics eBooks for 2021 [Platinum Blog]
We present a curated list of 15 free eBooks compiled in a single location to close out the year.
Tags: Automated Machine Learning, Data Science, Deep Learning, Free ebook, Machine Learning, NLP, Python, R, Statistics
- Generating Beautiful Neural Network Visualizations [Gold Blog]
If you are looking to easily generate visualizations of neural network architectures, PlotNeuralNet is a project you should check out.
Tags: Neural Networks, Python, Visualization
- Monte Carlo integration in Python [Gold Blog]
A famous Casino-inspired trick for data science, statistics, and all of science. How to do it in Python?
Tags: Monte Carlo, Python, Simulation, Statistics
- A Rising Library Beating Pandas in Performance [Gold Blog]
This article compares the performance of the well-known pandas library with pypolars, a rising DataFrame library written in Rust. See how they compare.
Tags: Data Processing, Pandas, Performance, Python
- R or Python? Why Not Both? [Silver Blog]
Do you use both R and Python, either in different projects or in the same? Check out prython, an IDE designed to handle your needs.
Tags: Data Analysis, Data Science, IDE, Programming, Python, R
- Object-Oriented Programming Explained Simply for Data Scientists [Gold Blog]
Read this simple but effective guide to start using Classes in Python 3.
Tags: Data Science, Data Scientist, Explained, Programming, Python
- TabPy: Combining Python and Tableau [Platinum Blog]
This article demonstrates how to get started using Python in Tableau.
Tags: Data Visualization, Python, Tableau