Search results for Natural Language Processing
-
PyCaret 101: An introduction for beginners
This article is a great overview of how to get started with PyCaret for all your machine learning projects.https://www.kdnuggets.com/2021/06/pycaret-101-introduction-beginners.html
-
How to Create and Deploy a Simple Sentiment Analysis App via API
In this article we will create a simple sentiment analysis app using the HuggingFace Transformers library, and deploy it using FastAPI.https://www.kdnuggets.com/2021/06/create-deploy-sentiment-analysis-app-api.html
-
Supercharge Your Machine Learning Experiments with PyCaret and Gradio
A step-by-step tutorial to develop and interact with machine learning pipelines rapidly.https://www.kdnuggets.com/2021/05/supercharge-machine-learning-experiments-pycaret-gradio.html
-
4 Tips for Dataset Curation for NLP Projects
You have heard it before, and you will hear it again. It's all about the data. Curating the right data is also so important than just curating any data. When dealing with text data, many hard-earned lessons have been learned by others over the years, and here are four data curation tips that you should be sure to follow during your next NLP project.https://www.kdnuggets.com/2021/05/4-tips-dataset-curation-nlp-projects.html
-
Topic Modeling with Streamlit
What does it take to create and deploy a topic modeling web application quickly? Read this post to see how the author uses Python NLP packages for topic modeling, Streamlit for the web application framework, and Streamlit Sharing for deployment.https://www.kdnuggets.com/2021/05/topic-modeling-streamlit.html
-
Write and train your own custom machine learning models using PyCaret
A step-by-step, beginner-friendly tutorial on how to write and train custom machine learning models in PyCaret.https://www.kdnuggets.com/2021/05/pycaret-write-train-custom-machine-learning-models.html
-
Data Validation in Machine Learning is Imperative, Not Optional
Before we reach model training in the pipeline, there are various components like data ingestion, data versioning, data validation, and data pre-processing that need to be executed. In this article, we will discuss data validation, why it is important, its challenges, and more.https://www.kdnuggets.com/2021/05/data-validation-machine-learning-imperative.html
-
Awesome list of datasets in 100+ categories
With an estimated 44 zettabytes of data in existence in our digital world today and approximately 2.5 quintillion bytes of new data generated daily, there is a lot of data out there you could tap into for your data science projects. It's pretty hard to curate through such a massive universe of data, but this collection is a great start. Here, you can find data from cancer genomes to UFO reports, as well as years of air quality data to 200,000 jokes. Dive into this ocean of data to explore as you learn how to apply data science techniques or leverage your expertise to discover something new.https://www.kdnuggets.com/2021/05/awesome-list-datasets.html
-
Easy MLOps with PyCaret + MLflow
A beginner-friendly, step-by-step tutorial on integrating MLOps in your Machine Learning experiments using PyCaret.https://www.kdnuggets.com/2021/05/easy-mlops-pycaret-mlflow.html
-
Machine Translation in a Nutshell
Marketing scientist Kevin Gray asks Dr. Anna Farzindar of the University of Southern California for a snapshot of machine translation. Dr. Farzindar also provided the original art for this article.https://www.kdnuggets.com/2021/05/machine-translation-nutshell.html
-
Best Python Books for Beginners and Advanced Programmers
Let's take a look at nine of the best Python books for both beginners and advanced programmers, covering topics such as data science, machine learning, deep learning, NLP, and more.https://www.kdnuggets.com/2021/05/best-python-books-beginner-advanced.html
-
What Makes AI Trustworthy?
This blog pertains to the importance of why AI needs to be trustworthy as well as what makes it trustworthy. AI predictions/suggestions should not just be taken at face value, but rather delved into at a deeper level. We need to understand how an AI system makes its predictions to put our trust in it. Trust should not be built on prediction accuracy alone.https://www.kdnuggets.com/2021/05/what-makes-ai-trustworthy.html
-
Similarity Metrics in NLP
This post covers the use of euclidean distance, dot product, and cosine similarity as NLP similarity metrics.https://www.kdnuggets.com/2021/05/similarity-metrics-nlp.html
-
What is Neural Search?
And how to get started with it with no prior experience in Machine Learning.https://www.kdnuggets.com/2021/05/what-neural-search.html
-
Introducing The NLP Index
The NLP Index is a brand new resource for NLP code discovery, combining and indexing more than 3,000 paper and code pairs at launch. If you are interested in NLP research and locating the code and papers needed to understand an implement the latest research, you should check it out.https://www.kdnuggets.com/2021/04/nlp-index.html
-
Multiple Time Series Forecasting with PyCaret
A step-by-step tutorial to forecast multiple time series with PyCaret.https://www.kdnuggets.com/2021/04/multiple-time-series-forecasting-pycaret.html
-
Time Series Forecasting with PyCaret Regression Module
PyCaret is an alternate low-code library that can be used to replace hundreds of lines of code with few lines only. See how to use PyCaret's Regression Module for Time Series Forecasting.https://www.kdnuggets.com/2021/04/time-series-forecasting-pycaret-regression-module.html
-
Data Science 101: Normalization, Standardization, and Regularization
Normalization, standardization, and regularization all sound similar. However, each plays a unique role in your data preparation and model building process, so you must know when and how to use these important procedures.https://www.kdnuggets.com/2021/04/data-science-101-normalization-standardization-regularization.html
-
How to Apply Transformers to Any Length of Text
Read on to find how to restore the power of NLP for long sequences.https://www.kdnuggets.com/2021/04/apply-transformers-any-length-text.html
-
Interpretable Machine Learning: The Free eBook">Interpretable Machine Learning: The Free eBook
Interested in learning more about interpretability in machine learning? Check out this free eBook to learn about the basics, simple interpretable models, and strategies for interpreting more complex black box models.https://www.kdnuggets.com/2021/04/interpretable-machine-learning-free-ebook.html
-
Automated Text Classification with EvalML
Learn how EvalML leverages Woodwork, Featuretools and the nlp-primitives library to process text data and create a machine learning model that can detect spam text messages.https://www.kdnuggets.com/2021/04/automated-text-classification-evalml.html
-
Multilingual CLIP with Huggingface + PyTorch Lightning
An overview of training OpenAI's CLIP on Google Colab.https://www.kdnuggets.com/2021/03/multilingual-clip--huggingface-pytorch-lightning.html
-
Top YouTube Machine Learning Channels
These are the top 15 YouTube channels for machine learning as determined by our stated criteria, along with some additional data on the channels to help you decide if they may have some content useful for you.https://www.kdnuggets.com/2021/03/top-youtube-machine-learning-channels.html
-
The Best Machine Learning Frameworks & Extensions for Scikit-learn">The Best Machine Learning Frameworks & Extensions for Scikit-learn
Learn how to use a selection of packages to extend the functionality of Scikit-learn estimators.https://www.kdnuggets.com/2021/03/best-machine-learning-frameworks-extensions-scikit-learn.html
-
How to Begin Your NLP Journey
In this blog post, learn how to process text using Python.https://www.kdnuggets.com/2021/03/begin-nlp-journey.html
-
DeepMind’s AlphaFold & the Protein Folding Problem
Recently, DeepMind's AlphaFold made impressive headway in the protein structure prediction problem. Read this for an overview and explanation.https://www.kdnuggets.com/2021/03/deepmind-alphafold-protein-folding-problem.html
-
Reducing the High Cost of Training NLP Models With SRU++
The increasing computation time and costs of training natural language models (NLP) highlight the importance of inventing computationally efficient models that retain top modeling power with reduced or accelerated computation. A single experiment training a top-performing language model on the 'Billion Word' benchmark would take 384 GPU days and as much as $36,000 using AWS on-demand instances.https://www.kdnuggets.com/2021/03/reducing-high-cost-training-nlp-models-sru.html
-
Speech to Text with Wav2Vec 2.0
Facebook recently introduced and open-sourced their new framework for self-supervised learning of representations from raw audio data called Wav2Vec 2.0. Learn more about it and how to use it here.https://www.kdnuggets.com/2021/03/speech-text-wav2vec.html
-
Top YouTube Channels for Data Science">Top YouTube Channels for Data Science
Have a look at the top 15 YouTube channels for data science by number of subscribers, along with some additional data on the channels to help you decide if they may have some content useful for you.https://www.kdnuggets.com/2021/03/top-youtube-channels-data-science.html
-
How Reading Papers Helps You Be a More Effective Data Scientist"> How Reading Papers Helps You Be a More Effective Data Scientist
By reading papers, we were able to learn what others (e.g., LinkedIn) have found to work (and not work). We can then adapt their approach and not have to reinvent the rocket. This helps us deliver a working solution with lesser time and effort.https://www.kdnuggets.com/2021/02/reading-papers-effective-data-scientist.html
-
Using NLP to improve your Resume
This article discusses performing keyword matching and text analysis on job descriptions.https://www.kdnuggets.com/2021/02/nlp-improve-resume.html
-
An overview of synthetic data types and generation methods
Synthetic data can be used to test new products and services, validate models, or test performances because it mimics the statistical property of production data. Today you'll find different types of structured and unstructured synthetic data.https://www.kdnuggets.com/2021/02/overview-synthetic-data-types-generation-methods.html
-
Multidimensional multi-sensor time-series data analysis framework
This blog post provides an overview of the package “msda” useful for time-series sensor data analysis. A quick introduction about time-series data is also provided.https://www.kdnuggets.com/2021/02/multidimensional-multi-sensor-time-series-data-analysis-framework.html
-
GPT-2 vs GPT-3: The OpenAI Showdown
Thanks to the diversity of the dataset used in the training process, we can obtain adequate text generation for text from a variety of domains. GPT-2 is 10x the parameters and 10x the data of its predecessor GPT.https://www.kdnuggets.com/2021/02/gpt2-gpt3-openai-showdown.html
-
A Critical Comparison of Machine Learning Platforms in an Evolving Market
There’s a clear inclination towards the MLaaS model across industries, given the fact that companies today have an option to select from a wide range of solutions that can cater to diverse business needs. Here is a look at 3 of the top ML platforms for data excellence.https://www.kdnuggets.com/2021/02/critical-comparison-machine-learning-platforms-evolving-market.html
-
Build Your First Data Science Application">Build Your First Data Science Application
Check out these seven Python libraries to make your first data science MVP application.https://www.kdnuggets.com/2021/02/build-first-data-science-application.html
-
2011: DanNet triggers deep CNN revolution
In 2021, we are celebrating the 10-year anniversary of DanNet, which, in 2011, was the first pure deep convolutional neural network (CNN) to win computer vision contests. Read about its history here.https://www.kdnuggets.com/2021/02/dannet-triggers-deep-cnn-revolution.html
-
Past 2021 Meetings / Online Events on AI, Analytics, Big Data, Data Science, and Machine Learning
Past | Jan | Feb | Mar | Apr | May | Jun | Jul | Aug | Sep | Oct | Nov | Dec Read more »https://www.kdnuggets.com/meetings/past-meetings-2021.html
-
Popular Machine Learning Interview Questions, part 2
Get ready for your next job interview requiring domain knowledge in machine learning with answers to these thirteen common questions.https://www.kdnuggets.com/2021/01/popular-machine-learning-interview-questions-part2.html
-
Is M.Tech in Data Science Worth It?
Is M.Tech in Data Science worth it or should you learn using just online courses and projects. Let's try to find the answer to that question.https://www.kdnuggets.com/2021/01/greatlearning-mtech-data-science.html
-
Deep Learning Pioneer Geoff Hinton on his Latest Research and the Future of AI
Geoff Hinton has lived at the outer reaches of machine learning research since an aborted attempt at a carpentry career a half century ago. He spoke to Craig Smith about his work In 2020 and what he sees on the horizon for AI.https://www.kdnuggets.com/2021/01/deep-learning-pioneer-geoff-hinton-research-future-ai.html
-
5 Tools for Effortless Data Science
The sixth tool is coffee.https://www.kdnuggets.com/2021/01/5-tools-effortless-data-science.html
-
MLOps: Model Monitoring 101
Model monitoring using a model metric stack is essential to put a feedback loop from a deployed ML model back to the model building stage so that ML models can constantly improve themselves under different scenarios.https://www.kdnuggets.com/2021/01/mlops-model-monitoring-101.html
-
Where is Marketing Data Science Headed?
Marketing data science - data science related to marketing - is now a significant part of marketing. Some of it directly competes with traditional marketing research and many marketing researchers may wonder what the future holds in store for it.https://www.kdnuggets.com/2021/01/marketing-data-science-headed.html
-
15 Free Data Science, Machine Learning & Statistics eBooks for 2021">15 Free Data Science, Machine Learning & Statistics eBooks for 2021
We present a curated list of 15 free eBooks compiled in a single location to close out the year.https://www.kdnuggets.com/2020/12/15-free-data-science-machine-learning-statistics-ebooks-2021.html
-
Key Data Science Algorithms Explained: From k-means to k-medoids clustering">Key Data Science Algorithms Explained: From k-means to k-medoids clustering
As a core method in the Data Scientist's toolbox, k-means clustering is valuable but can be limited based on the structure of the data. Can expanded methods like PAM (partitioning around medoids), CLARA, and CLARANS provide better solutions, and what is the future of these algorithms?https://www.kdnuggets.com/2020/12/algorithms-explained-k-means-k-medoids-clustering.html
-
Navigate the road to Responsible AI
Deploying AI ethically and responsibly will involve cross-functional team collaboration, new tools and processes, and proper support from key stakeholders.https://www.kdnuggets.com/2020/12/navigate-road-responsible-ai.html
-
8 Places for Data Professionals to Find Datasets
Here is a curated list of sites and resources invaluable for data professionals to acquire practice datasets.https://www.kdnuggets.com/2020/12/8-places-data-professionals-find-datasets.html
-
10 Python Skills They Don’t Teach in Bootcamp
Ascend to new heights in Data Science and Machine Learning with this thrilling list of coding tips.https://www.kdnuggets.com/2020/12/10-python-skills-dont-teach-bootcamp.html
-
Main 2020 Developments and Key 2021 Trends in AI, Data Science, Machine Learning Technology">Main 2020 Developments and Key 2021 Trends in AI, Data Science, Machine Learning Technology
Our panel of leading experts reviews 2020 main developments and examines the key trends in AI, Data Science, Machine Learning, and Deep Learning Technology.https://www.kdnuggets.com/2020/12/developments-trends-ai-data-science-machine-learning-technology.html
-
Top November Stories: Top Python Libraries for Data Science, Data Visualization & Machine Learning; The Best Data Science Certification You’ve Never Heard Of
Also: TabPy: Combining Python and Tableau; How to Acquire the Most Wanted Data Science Skills.https://www.kdnuggets.com/2020/12/top-stories-2020-nov.html
-
AI, Analytics, Machine Learning, Data Science, Deep Learning Research Main Developments in 2020 and Key Trends for 2021">AI, Analytics, Machine Learning, Data Science, Deep Learning Research Main Developments in 2020 and Key Trends for 2021
2020 is finally coming to a close. While likely not to register as anyone's favorite year, 2020 did have some noteworthy advancements in our field, and 2021 promises some important key trends to look forward to. As has become a year-end tradition, our collection of experts have once again contributed their thoughts. Read on to find out more.https://www.kdnuggets.com/2020/12/predictions-ai-machine-learning-data-science-research.html
-
10 Python Skills for Beginners
Python is the fastest growing, most-beloved programming language. Get started with these Data Science tips.https://www.kdnuggets.com/2020/12/10-python-skills-beginners.html
-
Simple & Intuitive Ensemble Learning in R
Read about metaEnsembleR, an R package for heterogeneous ensemble meta-learning (classification and regression) that is fully-automated.https://www.kdnuggets.com/2020/12/simple-intuitive-meta-learning-r.html
-
A Friendly Introduction to Graph Neural Networks
Despite being what can be a confusing topic, graph neural networks can be distilled into just a handful of simple concepts. Read on to find out more.https://www.kdnuggets.com/2020/11/friendly-introduction-graph-neural-networks.html
-
How to Know if a Neural Network is Right for Your Machine Learning Initiative
It is important to remember that there must be a business reason for even considering neural nets and it should not be because the C-Suite is feeling a bad case of FOMO.https://www.kdnuggets.com/2020/11/neural-network-right-machine-learning-initiative.html
-
Simple Python Package for Comparing, Plotting & Evaluating Regression Models
This package is aimed to help users plot the evaluation metric graph with single line code for different widely used regression model metrics comparing them at a glance. With this utility package, it also significantly lowers the barrier for the practitioners to evaluate the different machine learning algorithms in an amateur fashion by applying it to their everyday predictive regression problems.https://www.kdnuggets.com/2020/11/simple-python-package-comparing-plotting-evaluating-regression-models.html
-
15 Exciting AI Project Ideas for Beginners">15 Exciting AI Project Ideas for Beginners
There are many branches to AI to learn, but a project-based approach can keep things interesting. Here is a list of 15 such projects you can get started on implementing today.https://www.kdnuggets.com/2020/11/greatlearning-ai-project-ideas-beginners.html
-
Computer Vision at Scale With Dask And PyTorch
A tutorial on conducting image classification inference using the Resnet50 deep learning model at scale with using GPU clusters on Saturn Cloud. The results were: 40x faster computer vision that made a 3+ hour PyTorch model run in just 5 minutes.https://www.kdnuggets.com/2020/11/computer-vision-scale-dask-pytorch.html
-
Compute Goes Brrr: Revisiting Sutton’s Bitter Lesson for AI
"It's just about having more compute." Wait, is that really all there is to AI? As Richard Sutton's 'bitter lesson' sinks in for more AI researchers, a debate has stirred that considers a potentially more subtle relationship between advancements in AI based on ever-more-clever algorithms and massively scaled computational power.https://www.kdnuggets.com/2020/11/revisiting-sutton-bitter-lesson-ai.html
-
5 Most Useful Machine Learning Tools every lazy full-stack data scientist should use
If you consider yourself a Data Scientist who can take any project from data curation to solution deployment, then you know there are many tools available today to help you get the job done. The trouble is that there are too many choices. Here is a review of five sets of tools that should turn you into the most efficient full-stack data scientist possible.https://www.kdnuggets.com/2020/11/5-useful-machine-learning-tools.html
-
5 Things You Are Doing Wrong in PyCaret
PyCaret is an alternate low-code library that can be used to replace hundreds of lines of code with few words only. This makes experiments exponentially fast and efficient. Find out 5 ways to improve your usage of the library.https://www.kdnuggets.com/2020/11/5-things-doing-wrong-pycaret.html
-
How to Get Into Data Science Without a Degree">How to Get Into Data Science Without a Degree
Breaking into any new field or slogging through a career change is always a challenge, and requires focus and even a little grit. While transitioning to becoming a Data Scientist is no different, aspiring to this role is possible, even without a formal post-secondary degree, largely due to the vast amount of quality learning resources available today.https://www.kdnuggets.com/2020/11/data-science-without-degree.html
-
How to Acquire the Most Wanted Data Science Skills">How to Acquire the Most Wanted Data Science Skills
We recently surveyed KDnuggets readers to determine the "most wanted" data science skills. Since they seem to be those most in demand from practitioners, here is a collection of resources for getting started with this learning.https://www.kdnuggets.com/2020/11/acquire-most-wanted-data-science-skills.html
-
Moving from Data Science to Machine Learning Engineering
The world of machine learning — and software — is changing. Read this article to find out how, and what you can do to stay ahead of it.https://www.kdnuggets.com/2020/11/moving-data-science-machine-learning-engineering.html
-
Doing the impossible? Machine learning with less than one example
Machine learning algorithms are notoriously known for needing data, a lot of data -- the more data the better. But, much research has gone into developing new methods that need fewer examples to train a model, such as "few-shot" or "one-shot" learning that require only a handful or a few as one example for effective learning. Now, this lower boundary on training examples is being taken to the next extreme.https://www.kdnuggets.com/2020/11/machine-learning-less-than-one-example.html
-
Interpretability, Explainability, and Machine Learning – What Data Scientists Need to Know
The terms “interpretability,” “explainability” and “black box” are tossed about a lot in the context of machine learning, but what do they really mean, and why do they matter?https://www.kdnuggets.com/2020/11/interpretability-explainability-machine-learning.html
-
Building Neural Networks with PyTorch in Google Colab">Building Neural Networks with PyTorch in Google Colab
Combining PyTorch and Google's cloud-based Colab notebook environment can be a good solution for building neural networks with free access to GPUs. This article demonstrates how to do just that.https://www.kdnuggets.com/2020/10/building-neural-networks-pytorch-google-colab.html
-
An Introduction to AI, updated">An Introduction to AI, updated
We provide an introduction to key concepts and methods in AI, covering Machine Learning and Deep Learning, with an updated extensive list that includes Narrow AI, Super Intelligence, and Classic Artificial Intelligence, as well as recent ideas of NeuroSymbolic AI, Neuroevolution, and Federated Learning.https://www.kdnuggets.com/2020/10/introduction-ai-updated.html
-
KDnuggets™ News 20:n41, Oct 28: Difference Between Junior and Senior Data Scientists; Ain’t No Such a Thing as a Citizen Data Scientist
The unspoken difference between junior and senior data scientists; Ain't No Such a Thing as a Citizen Data Scientist; How to become a Data Scientist: a step-by-step guide; Good-bye Big Data. Hello, Massive Data!; DeepMind Relies on this Old Statistical Method to Build Fair Machine Learning Modelshttps://www.kdnuggets.com/2020/n41.html
-
Can AI Learn Human Values?
OpenAI believes that the path to safe AI requires social sciences.https://www.kdnuggets.com/2020/10/ai-learn-human-values.html
-
Roadmap to Computer Vision
Read this introduction to the main steps which compose a computer vision system, starting from how images are pre-processed, features extracted and predictions are made.https://www.kdnuggets.com/2020/10/roadmap-computer-vision.html
-
Which flavor of BERT should you use for your QA task?
Check out this guide to choosing and benchmarking BERT models for question answering.https://www.kdnuggets.com/2020/10/flavor-bert-use-qa-task.html
-
10 Underrated Python Skills
Tips for feature analysis, hyperparameter tuning, data visualization and more.https://www.kdnuggets.com/2020/10/10-underrated-python-skills.html
-
The Ethics of AI
Marketing scientist Kevin Gray asks Dr. Anna Farzindar of the University of Southern California about a very important subject - the ethics of AI.https://www.kdnuggets.com/2020/10/ethics-ai-qa-farzindar.html
-
5 Must-Read Data Science Papers (and How to Use Them)
Keeping ahead of the latest developments in a field is key to advancing your skills and your career. Five foundational ideas from recent data science papers are highlighted here with tips on how to leverage these advancements in your work, and keep you on top of the machine learning game.https://www.kdnuggets.com/2020/10/5-must-read-data-science-papers.html
-
Optimizing the Levenshtein Distance for Measuring Text Similarity
For speeding up the calculation of the Levenshtein distance, this tutorial works on calculating using a vector rather than a matrix, which saves a lot of time. We’ll be coding in Java for this implementation.https://www.kdnuggets.com/2020/10/optimizing-levenshtein-distance-measuring-text-similarity.html
-
Text Mining with R: The Free eBook">Text Mining with R: The Free eBook
This freely-available book will show you how to perform text analytics in R, using packages from the tidyverse.https://www.kdnuggets.com/2020/10/text-mining-r-free-ebook.html
-
The Future of Fake News
Let's talk about misleading communications in the digital era.https://www.kdnuggets.com/2020/10/future-fake-news.html
-
Uber Open Sources the Third Release of Ludwig, its Code-Free Machine Learning Platform
The new release makes Ludwig one of the most complete open source AutoML stacks in the market.https://www.kdnuggets.com/2020/10/uber-open-source-ludwig-code-free-machine-learning-platform.html
-
Strategies of Docker Images Optimization
Large Docker images lengthen the time it takes to build and share images between clusters and cloud providers. When creating applications, it’s therefore worth optimizing Docker Images and Dockerfiles to help teams share smaller images, improve performance, and debug problems.https://www.kdnuggets.com/2020/10/strategies-docker-images-optimization.html
-
6 Lessons Learned in 6 Months as a Data Scientist
When transitioning into a Data Science career, a new mindset toward collaboration, data, and reporting is required. Learn from these recommendations on approaches you should consider to successfully develop into your dream job.https://www.kdnuggets.com/2020/10/6-lessons-6-months-data-scientist.html
-
Free Introductory Machine Learning Course From Amazon">Free Introductory Machine Learning Course From Amazon
Amazon's Machine Learning University offers an introductory course titled Accelerated Machine Learning, which is a good starting place for those looking for a foundation in generalized practical ML.https://www.kdnuggets.com/2020/10/machine-learning-free-course-amazon.html
-
10 Best Machine Learning Courses in 2020">10 Best Machine Learning Courses in 2020
If you are ready to take your career in machine learning to the next level, then these top 10 Machine Learning Courses covering both practical and theoretical work will help you excel.https://www.kdnuggets.com/2020/10/10-best-machine-learning-courses-2020.html
-
Understanding Transformers, the Data Science Way
Read this accessible and conversational article about understanding transformers, the data science way — by asking a lot of questions that is.https://www.kdnuggets.com/2020/10/understanding-transformers-data-science-way.html
-
The Best Free Data Science eBooks: 2020 Update">The Best Free Data Science eBooks: 2020 Update
The author has updated their list of best free data science books for 2020. Read on to see what books you should grab.https://www.kdnuggets.com/2020/09/best-free-data-science-ebooks-2020-update.html
-
Artificial Intelligence for Precision Medicine and Better Healthcare
In this article, we will focus on various machine learning, deep learning models, and applications of AI which can pave the way for a new data-centric era of discovery in healthcare.https://www.kdnuggets.com/2020/09/artificial-intelligence-precision-medicine-better-healthcare.html
-
MathWorks Deep learning workflow: tips, tricks, and often forgotten steps
Getting started in deep learning – and adopting an organized, sustainable, and reproducible workflow – can be challenging. This blog post will share some tips and tricks to help you develop a systematic, effective, attainable, and scalable deep learning workflow as you experiment with different deep learning models, datasets, and applications.https://www.kdnuggets.com/2020/09/mathworks-deep-learning-workflow.html
-
The Insiders’ Guide to Generative and Discriminative Machine Learning Models
In this article, we will look at the difference between generative and discriminative models, how they contrast, and one another.https://www.kdnuggets.com/2020/09/insiders-guide-generative-discriminative-machine-learning-models.html
-
Online Certificates/Courses in AI, Data Science, Machine Learning from Top Universities">Online Certificates/Courses in AI, Data Science, Machine Learning from Top Universities
We present the online courses and certificates in AI, Data Science, Machine Learning, and related topics from the top 20 universities in the world.https://www.kdnuggets.com/2020/09/online-certificates-ai-data-science-machine-learning-top.html
-
Autograd: The Best Machine Learning Library You’re Not Using?">Autograd: The Best Machine Learning Library You’re Not Using?
If there is a Python library that is emblematic of the simplicity, flexibility, and utility of differentiable programming it has to be Autograd.https://www.kdnuggets.com/2020/09/autograd-best-machine-learning-library-not-using.html
-
Top August Stories: Know What Employers are Expecting for a Data Scientist Role in 2020; If I had to start learning Data Science again, how would I do it?
Also: Netflix's Polynote is a New Open Source Framework to Build Better Data Science Notebooks; Must-read NLP and Deep Learning articles for Data Scientists.https://www.kdnuggets.com/2020/09/top-stories-2020-aug.html
-
An Introduction to NLP and 5 Tips for Raising Your Game
This article is a collection of things the author would like to have known when they started out in NLP. Perhaps it will be useful for you.https://www.kdnuggets.com/2020/09/introduction-nlp-5-tips-raising-your-game.html
-
8 AI/Machine Learning Projects To Make Your Portfolio Stand Out">8 AI/Machine Learning Projects To Make Your Portfolio Stand Out
If you are just starting down a path toward a career in Data Science, or you are already a seasoned practitioner, then keeping active to advance your experience through side projects is invaluable to take you to the next professional level. These eight interesting project ideas with source code and reference articles will jump start you to thinking outside of the box.https://www.kdnuggets.com/2020/09/8-ml-ai-projects-stand-out.html
-
KDnuggets™ News 20:n34, Sep 9: Top Online Data Science Masters Degrees; Modern Data Science Skills: 8 Categories, Core Skills, and Hot Skills
Also: Creating Powerful Animated Visualizations in Tableau; PyCaret 2.1 is here: What's new?; How To Decide What Data Skills To Learn; How to Evaluate the Performance of Your Machine Learning Modelhttps://www.kdnuggets.com/2020/n34.html
-
Top Stories, Aug 31 – Sep 6: Top Online Masters in Analytics, Business Analytics, Data Science – Updated
Also: How to Evaluate the Performance of Your Machine Learning Model; Which methods should be used for solving linear regression?; A Curious Theory About the Consciousness Debate in AI; If I had to start learning Data Science again, how would I do it?; PyCaret 2.1 is here: Whats new?https://www.kdnuggets.com/2020/09/top-news-week-0831-0906.html
-
Computer Vision Recipes: Best Practices and Examples
This is an overview of a great computer vision resource from Microsoft, which demonstrates best practices and implementation guidelines for a variety of tasks and scenarios.https://www.kdnuggets.com/2020/09/computer-vision-recipes-best-practices-examples.html
-
Showcasing the Benefits of Software Optimizations for AI Workloads on Intel® Xeon® Scalable Platforms
The focus of this blog is to bring to light that continued software optimizations can boost performance not only for the latest platforms, but also for the current install base from prior generations. This means customers can continue to extract value from their current platform investments.https://www.kdnuggets.com/2020/09/showcasing-benefits-software-optimizations-ai-workloads-intel.html
-
Accelerated Computer Vision: A Free Course From Amazon
Amazon's Machine Learning University is making its online courses available to the public, and this time we look at its Accelerated Computer Vision offering.https://www.kdnuggets.com/2020/08/accelerated-computer-vision-free-course-amazon.html
-
A Deep Dive Into the Transformer Architecture – The Development of Transformer Models
Even though transformers for NLP were introduced only a few years ago, they have delivered major impacts to a variety of fields from reinforcement learning to chemistry. Now is the time to better understand the inner workings of transformer architectures to give you the intuition you need to effectively work with these powerful tools.https://www.kdnuggets.com/2020/08/transformer-architecture-development-transformer-models.html
-
The NLP Model Forge: Generate Model Code On Demand">The NLP Model Forge: Generate Model Code On Demand
You've seen their Big Bad NLP Database and The Super Duper NLP Repo. Now Quantum Stat is back with its most ambitious NLP product yet: The NLP Model Forge.https://www.kdnuggets.com/2020/08/nlp-model-forge.html
-
Performance Testing on Big Data Applications
You can use performance testing in any application you’re working on but it’s especially useful for big data applications. Let’s see why.https://www.kdnuggets.com/2020/08/performance-testing-big-data-applications.html
-
Must-read NLP and Deep Learning articles for Data Scientists">Must-read NLP and Deep Learning articles for Data Scientists
NLP and deep learning continue to advance, nearly on a daily basis. Check out these recent must-read guides, feature articles, and other resources to keep you on top of the latest advancements and ahead of the curve.https://www.kdnuggets.com/2020/08/must-read-nlp-deep-learning-articles.html
-
Build Your Own AutoML Using PyCaret 2.0
In this post we present a step-by-step tutorial on how PyCaret can be used to build an Automated Machine Learning Solution within Power BI, thus allowing data scientists and analysts to add a layer of machine learning to their Dashboards without any additional license or software costs.https://www.kdnuggets.com/2020/08/build-automl-pycaret.html
-
Hypothesis Test for Real Problems
Hypothesis tests are significant for evaluating answers to questions concerning samples of data.https://www.kdnuggets.com/2020/08/hypothesis-test-real-problems.html
-
Going Beyond Superficial: Data Science MOOCs with Substance">Going Beyond Superficial: Data Science MOOCs with Substance
Data science MOOCs are superficial. At least, a lot of them are. What are your options when looking for something more substantive?https://www.kdnuggets.com/2020/08/beyond-superficial-data-science-moocs-substance.html
-
GitHub is the Best AutoML You Will Ever Need
This article uses PyCaret 2.0, an open source, low-code machine learning library in Python to develop a simple AutoML solution and deploy it as a Docker container using GitHub actions.https://www.kdnuggets.com/2020/08/github-best-automl-ever-need.html
-
Data Scientist Job Market 2020
With an analysis of over a thousand Data Scientist job descriptions in the USA, check out the trends for 2020 and current expectations on new positions in the field, including credentials, experience, and programming languages.https://www.kdnuggets.com/2020/08/data-scientist-job-market-2020.html
-
Batch Normalization in Deep Neural Networks
Batch normalization is a technique for training very deep neural networks that normalizes the contributions to a layer for every mini batch.https://www.kdnuggets.com/2020/08/batch-normalization-deep-neural-networks.html
-
Metrics to Use to Evaluate Deep Learning Object Detectors
It's important to understand which metric should be used to evaluate trained object detectors and which one is more important. Is mAP alone enough to evaluate the objector models? Can the same metric be used to evaluate object detectors on validation set and test set?https://www.kdnuggets.com/2020/08/metrics-evaluate-deep-learning-object-detectors.html
-
Word Embedding Fairness Evaluation
With word embeddings being such a crucial component of NLP, the reported social biases resulting from the training corpora could limit their application. The framework introduced here intends to measure the fairness in word embeddings to better understand these potential biases.https://www.kdnuggets.com/2020/08/word-embedding-fairness-evaluation.html
-
Awesome Machine Learning and AI Courses">Awesome Machine Learning and AI Courses
Check out this list of awesome, free machine learning and artificial intelligence courses with video lectures.https://www.kdnuggets.com/2020/07/awesome-machine-learning-ai-courses.html
-
5 Big Trends in Data Analytics
Data analytics is the process by which data is deconstructed and examined for useful patterns and trends. Here we explore five trends making data analytics even more useful.https://www.kdnuggets.com/2020/07/5-big-trends-data-analytics.html
-
Is depth useful for self-attention?
Learn about recent research that is the first to explain a surprising phenomenon where in BERT/Transformer-like architectures, deepening the network does not seem to be better than widening (or, increasing the representation dimension). This empirical observation is in contrast to a fundamental premise in deep learning.https://www.kdnuggets.com/2020/07/depth-useful-self-attention.html
-
Labelling Data Using Snorkel
In this tutorial, we walk through the process of using Snorkel to generate labels for an unlabelled dataset. We will provide you examples of basic Snorkel components by guiding you through a real clinical application of Snorkel.https://www.kdnuggets.com/2020/07/labelling-data-using-snorkel.html
-
The Bitter Lesson of Machine Learning">The Bitter Lesson of Machine Learning
Since that renowned conference at Dartmouth College in 1956, AI research has experienced many crests and troughs of progress through the years. From the many lessons learned during this time, some have needed to be re-learned -- repeatedly -- and the most important of which has also been the most difficult to accept by many researchers.https://www.kdnuggets.com/2020/07/bitter-lesson-machine-learning.html
-
5 Things You Don’t Know About PyCaret
In comparison with the other open source machine learning libraries, PyCaret is an alternate low-code library that can be used to replace hundreds of lines of code with a few words only.https://www.kdnuggets.com/2020/07/5-things-pycaret.html
-
A Layman’s Guide to Data Science. Part 3: Data Science Workflow">A Layman’s Guide to Data Science. Part 3: Data Science Workflow
Learn and appreciate the typical workflow for a data science project, including data preparation (extraction, cleaning, and understanding), analysis (modeling), reflection (finding new paths), and communication of the results to others.https://www.kdnuggets.com/2020/07/laymans-guide-data-science-workflow.html
-
Deploy Machine Learning Pipeline on AWS Fargate">Deploy Machine Learning Pipeline on AWS Fargate
A step-by-step beginner’s guide to containerize and deploy ML pipeline serverless on AWS Fargate.https://www.kdnuggets.com/2020/07/deploy-machine-learning-pipeline-aws-fargate.html
-
Generating cooking recipes using TensorFlow and LSTM Recurrent Neural Network: A step-by-step guide
A character-level LSTM (Long short-term memory) RNN (Recurrent Neural Network) is trained on ~100k recipes dataset using TensorFlow. The model suggested the recipes "Cream Soda with Onions", "Puff Pastry Strawberry Soup", "Zucchini flavor Tea", and "Salmon Mousse of Beef and Stilton Salad with Jalapenos". Yum!? Follow along this detailed guide with code to create your own recipe-generating chef.https://www.kdnuggets.com/2020/07/generating-cooking-recipes-using-tensorflow.html
-
Free Economics & Finance Courses for Data Scientists
Here is a selection of courses for those interested in diversifying their domain knowledge into the related realms of economics and finance, with the goal of being able to apply your data science skills to these domains.https://www.kdnuggets.com/2020/06/free-economics-finance-courses-data-scientists.html
-
Bias in AI: A Primer
Those interested in studying AI bias, but who lack a starting point, would do well to check out this introductory set of slides and the accompanying talk on the subject from Google researcher Margaret Mitchell.https://www.kdnuggets.com/2020/06/bias-ai-primer.html
-
What is emotion AI and why should you care?
What is emotion AI, why is it relevant, and what do you need to know about it?https://www.kdnuggets.com/2020/06/emotion-ai.html
-
Deploy a Machine Learning Pipeline to the Cloud Using a Docker Container
In this tutorial, we will use a previously-built machine learning pipeline and Flask app to demonstrate how to deploy a machine learning pipeline as a web app using the Microsoft Azure Web App Service.https://www.kdnuggets.com/2020/06/deploy-machine-learning-pipeline-cloud-docker.html
-
Top May Stories: The Best NLP with Deep Learning Course is Free
Also: How to Think Like a Data Scientist; Python For Everybody: The Free eBook; Automated Machine Learning: The Free eBook.https://www.kdnuggets.com/2020/06/top-stories-2020-may.html
-
5 Essential Papers on Sentiment Analysis
To highlight some of the work being done in the field, here are five essential papers on sentiment analysis and sentiment classification.https://www.kdnuggets.com/2020/06/5-essential-papers-sentiment-analysis.html
-
Deep Learning for Coders with fastai and PyTorch: The Free eBook">Deep Learning for Coders with fastai and PyTorch: The Free eBook
If you are interested in a top-down, example-driven book on deep learning, check out the draft of the upcoming Deep Learning for Coders with fastai & PyTorch from fast.ai team.https://www.kdnuggets.com/2020/06/fastai-book-free-ebook.html
-
How to Think Like a Data Scientist">How to Think Like a Data Scientist
So what does it take to become a data scientist? For some pointers on the skills for success, I interviewed Ben Chu, who is a Senior Data Scientist at Refinitiv Labs.https://www.kdnuggets.com/2020/05/think-like-data-scientist-data-analyst.html
-
Top Stories, May 18-24: The Best NLP with Deep Learning Course is Free
Also: Automated Machine Learning: The Free eBook; Sparse Matrix Representation in Python; Build and deploy your first machine learning web app; Complex logic at breakneck speed: Try Julia for data sciencehttps://www.kdnuggets.com/2020/05/top-news-week-0518-0524.html
-
Build and deploy your first machine learning web app">Build and deploy your first machine learning web app
A beginner’s guide to train and deploy machine learning pipelines in Python using PyCaret.https://www.kdnuggets.com/2020/05/build-deploy-machine-learning-web-app.html