-
Easy Synthetic Data in Python with Faker
Faker is a Python library that generates fake data to supplement or take the place of real world data. See how it can be used for data science.
-
Simple Text Scraping, Parsing, and Processing with this Python Library
Scraping, parsing, and processing text data from the web can be difficult. But it can also be easy, using Newspaper3k.
-
Introducing TensorFlow Similarity
TensorFlow Similarity is a newly-released library from Google that facilitates the training, indexing and querying of similarity models. Check out more here.
-
How to Create an AutoML Pipeline Optimization Sandbox
In this article, we will implement an automated machine learning pipeline optimization sandbox web app using Streamlit and TPOT.
-
A Python Data Processing Script Template
Here's a skeleton general purpose template for getting a Python command line script fleshed out as quickly as possible.
-
3 Data Acquisition, Annotation, and Augmentation Tools
Check out these 3 projects found around GitHub that can help with your data acquisition, annotation, and augmentation tasks.
-
Stack Overflow Survey Data Science Highlights
The results of the 2021 Stack Overflow Developer Survey were recently released, which is a fascinating snapshot of today's developers and the tools they are using. Have a look at some selections from the report, particularly those which may be of interest to data professionals.
-
Introduction to Statistical Learning Second Edition
The second edition of the classic "An Introduction to Statistical Learning, with Applications in R" was published very recently, and is now freely-available via PDF on the book's website.
-
Python Data Structures Compared
Let's take a look at 5 different Python data structures and see how they could be used to store data we might be processing in our everyday tasks, as well as the relative memory they use for storage and time they take to create and access.
-
The Best SOTA NLP Course is Free!
Hugging Face has recently released a course on using its libraries and ecosystem for practical NLP, and it appears to be very comprehensive. Have a look for yourself.
|