2020 May Tutorials, Overviews
- Model Evaluation Metrics in Machine Learning - May 28, 2020.
A detailed explanation of model evaluation metrics to evaluate a classification machine learning model.
- Taming Complexity in MLOps - May 28, 2020.
A greatly expanded v2.0 of the open-source Orbyter toolkit helps data science teams continue to streamline machine learning delivery pipelines, with an emphasis on seamless deployment to production.
- 5 Machine Learning Papers on Face Recognition - May 28, 2020.
This article will highlight some of that research and introduce five machine learning papers on face recognition.
- Faster machine learning on larger graphs with NumPy and Pandas - May 27, 2020.
One of the most exciting features of StellarGraph 1.0 is a new graph data structure — built using NumPy and Pandas — that results in significantly lower memory usage and faster construction times.
- Dataset Splitting Best Practices in Python - May 26, 2020.
If you are splitting your dataset into training and testing data, you need to keep some things in mind. This discussion of 3 best practices includes a demonstration of how to implement each of them in Python.
- Interactive Machine Learning Experiments - May 26, 2020.
Dive into experimenting with machine learning techniques using this open-source collection of interactive demos built on multilayer perceptrons, convolutional neural networks, and recurrent neural networks. Each package consists of ready-to-try web browser interfaces and fully developed notebooks for you to fine-tune the training for better performance.
- Machine Fairness: How to assess AI system’s fairness and mitigate any observed unfairness issues - May 26, 2020.
Microsoft is bringing the latest research in responsible AI to Azure (both Azure Machine Learning and their open source toolkits), to empower data scientists and developers to understand machine learning models, protect people and their data, and control the end-to-end machine learning process.
- LinkedIn Open Sources a Small Component to Simplify the TensorFlow-Spark Interoperability - May 25, 2020.
Spark-TFRecord enables the processing of TensorFlow’s TFRecord structures in Apache Spark.
- 10 Useful Machine Learning Practices For Python Developers - May 25, 2020.
While you may be a data scientist, you are still a developer at the core. This means your code should be skillful. Follow these 10 tips to make sure you quickly deliver bug-free machine learning solutions.
- Python For Everybody: The Free eBook - May 25, 2020.
Get back to fundamentals with this free eBook, Python For Everybody, approaching the learning of programming from a data analysis perspective.
- The Best NLP with Deep Learning Course is Free - May 22, 2020.
Stanford's Natural Language Processing with Deep Learning is one of the most respected courses on the topic that you will find anywhere, and the course materials are freely available online.
- Appropriately Handling Missing Values for Statistical Modelling and Prediction - May 22, 2020.
Many statisticians in industry agree that blindly imputing the missing values in your dataset is dangerous and should not be done without first understanding why the data is missing.
- A Holistic Framework for Managing Data Analytics Projects - May 22, 2020.
Agile project management for Data Science development continues to be an effective framework that enables flexibility and productivity in a field that can experience continuous changes in data and evolving stakeholder expectations. Learn more about the leading approaches for developing Data Science models, and apply them to your next project.
- Build and deploy your first machine learning web app - May 22, 2020.
A beginner’s guide to train and deploy machine learning pipelines in Python using PyCaret.
- Dimensionality Reduction with Principal Component Analysis (PCA) - May 21, 2020.
This article focuses on design principles of the PCA algorithm for dimensionality reduction and its implementation in Python from scratch.
- Complex logic at breakneck speed: Try Julia for data science - May 20, 2020.
We benchmark Julia against equivalent Python code to show why Julia is great for data science and machine learning.
- 13 must-read papers from AI experts - May 20, 2020.
What research articles do top AI experts in the field recommend? Find out which ones and why, then be sure to add each to your reading to-do list.
- Looking Normal(ly Distributed) - May 20, 2020.
This article investigates when some probability distributions look normal "enough" for a statistical test.
- Google Unveils TAPAS, a BERT-Based Neural Network for Querying Tables Using Natural Language - May 19, 2020.
The new neural network extends BERT to interact with tabular datasets.
- Sparse Matrix Representation in Python - May 19, 2020.
Using sparse matrix representations for your data when appropriate can save you memory. Have a look at the reasons why, see how to create sparse matrices in Python using SciPy, and compare the memory requirements for standard and sparse representations of the same data.
- Easy Text-to-Speech with Python - May 18, 2020.
Python comes with a lot of handy and easily accessible libraries and we’re going to look at how we can deliver text-to-speech with Python in this article.
- Evidence Counterfactuals for explaining predictive models on Big Data - May 18, 2020.
Big Data generated by people -- such as social media posts, mobile phone GPS locations, and browsing history -- provides enormous predictive value for AI systems. However, explaining how these models predict with the data remains challenging. This interesting explanation approach considers how a model would behave if it didn't have the original set of data to work with.
- Automated Machine Learning: The Free eBook - May 18, 2020.
There is a lot to learn about automated machine learning theory and practice. This free eBook can get you started the right way.
- 5 Great New Features in Scikit-learn 0.23 - May 15, 2020.
Check out 5 new features of the latest Scikit-learn release, including the ability to visualize estimators in notebooks, improvements to both k-means and gradient boosting, some new linear model implementations, and sample weight support for a pair of existing regressors.
- Facebook Open Sources Blender, the Largest-Ever Open Domain Chatbot - May 15, 2020.
The new conversational agent exhibits human-like behavior in conversations about almost any topic.
- AI and Machine Learning for Healthcare - May 14, 2020.
Traditional business and technology sectors are not the only fields being impacted by AI. Healthcare is a field that is thought to be highly suitable for the applications of AI tools and techniques.
- Satellite Image Analysis with fast.ai for Disaster Recovery - May 14, 2020.
We were asked to build ML models using the novel xBD dataset provided by the organizers to estimate damage to infrastructure with the goal of reducing the amount of human labour and time required to plan an appropriate response. This article will focus on the technical aspects of our solution and share our experiences.
- DeepMind’s Suggestions for Learning #AtHomeWithAI - May 13, 2020.
DeepMind has been sharing resources for learning AI at home on their Twitter account. Check out a few of these suggestions here, and keep your eye on the #AtHomeWithAI hashtag for more.
- I Designed My Own Machine Learning and AI Degree - May 13, 2020.
With so many pioneering online resources for open education, check out this organized collection of courses you can follow to become a well-rounded machine learning and AI engineer.
- Customer Churn Prediction: A Global Performance Study - May 13, 2020.
This article details an automated machine-learned approach to predict customer churn and its results across selected communication service providers around the globe.
- Machine Learning in Power BI using PyCaret - May 12, 2020.
Check out this step-by-step tutorial for implementing machine learning in Power BI within minutes.
- What You Need to Know About Deep Reinforcement Learning - May 12, 2020.
How does deep learning solve the challenges of scale and complexity in reinforcement learning? Learn how combining these approaches will make more progress toward the notion of Artificial General Intelligence.
- Text Mining in Python: Steps and Examples - May 12, 2020.
The majority of data exists in textual form, which is a highly unstructured format. To produce meaningful insights from text data, we need to follow a method called Text Analysis.
- The Elements of Statistical Learning: The Free eBook - May 11, 2020.
Check out this free ebook covering the elements of statistical learning, appropriately titled "The Elements of Statistical Learning."
- Start Your Machine Learning Career in Quarantine - May 11, 2020.
While this quarantine can last two months, make the most of it by starting your career in Machine Learning with this 60-day learning plan.
- The Architecture Used at LinkedIn to Improve Feature Management in Machine Learning Models - May 11, 2020.
The new typed feature schema streamlined the reusability of features across thousands of machine learning models.
- Forecasting Stories 3: Each Time-series Component Sings a Different Song - May 8, 2020.
With time-series decomposition, we were able to infer that the consumers were waiting for the highest sale of the year rather than buying up-front.
- 5 Concepts You Should Know About Gradient Descent and Cost Function - May 7, 2020.
Why is Gradient Descent so important in Machine Learning? Learn more about this iterative optimization algorithm and how it is used to minimize a loss function.
- Hyperparameter Optimization for Machine Learning Models - May 7, 2020.
Check out this comprehensive guide to model optimization techniques.
- Explaining “Blackbox” Machine Learning Models: Practical Application of SHAP - May 6, 2020.
Train a "blackbox" GBM model on a real dataset and make it explainable with SHAP.
- Best Coronavirus Projections, Predictions, Dashboards and Data Resources - May 6, 2020.
Check out this curated collection of coronavirus-related projections, dashboards, visualizations, and data that we have encountered on the internet.
- Beginners Learning Path for Machine Learning - May 5, 2020.
So, you are interested in machine learning? Here is your complete learning path to start your career in the field.
- Getting Started with Spectral Clustering - May 5, 2020.
This post will unravel a practical example to illustrate and motivate the intuition behind each step of the spectral clustering algorithm.
- Top 10 Data Visualization Tools for Every Data Scientist - May 5, 2020.
At present, data scientist is one of the most sought-after professions. That’s one of the main reasons why we decided to cover the latest data visualization tools that every data scientist can use to make their work more effective.
- Microsoft Research Unveils Three Efforts to Advance Deep Generative Models - May 4, 2020.
Optimus, FQ-GAN and Prevalent bring new ideas to apply generative models at large scale.
- Deep Learning: The Free eBook - May 4, 2020.
"Deep Learning" is the quintessential book for understanding deep learning theory, and you can still read it freely online.
- Demystifying the AI Infrastructure Stack - May 1, 2020.
AI tools and services are expanding at a rapid clip, and keeping a handle on this evolving ecosystem is crucial for the success of your AI projects. This framework will help you build your technical stack to deploy AI projects faster and at scale.
- Optimize Response Time of your Machine Learning API In Production - May 1, 2020.
This article demonstrates how building a smarter API serving Deep Learning models minimizes the response time.
- Natural Language Processing Recipes: Best Practices and Examples - May 1, 2020.
Here is an overview of another great natural language processing resource, this time from Microsoft, which demonstrates best practices and implementation guidelines for a variety of tasks and scenarios.