Search results for
-
Machine learning does not produce value for my business. Why?
What is going on when machine learning can't make the jump from testing to production, and so doesn't add any business value?https://www.kdnuggets.com/2021/12/machine-learning-produce-value-business.html
-
KDnuggets™ News 21:n48, Dec 22: Write Clean Python Code Using Pipes; 5 Key Skills Needed To Become a Great Data Scientist
Write Clean Python Code Using Pipes; 5 Key Skills Needed To Become a Great Data Scientist; A Full End-to-End Deployment of a Machine Learning Algorithm into a Live Production Environment; The 5 Characteristics of a Successful Data Scientist; Top Resources for Learning Statistics for Data Sciencehttps://www.kdnuggets.com/2021/n48.html
-
The Best ETL Tools in 2021">The Best ETL Tools in 2021
If you have clear, well-defined objectives, it won’t be hard to identify the ETL technology that best meets your needs. Here are some of the best ETL tools you can use in your business.https://www.kdnuggets.com/2021/12/mozart-best-etl-tools-2021.html
-
Federated Learning: Collaborative Machine Learning with a Tutorial on How to Get Started
Read on to learn more about the intricacies of federated learning and what it can do for machine learning on sensitive data.https://www.kdnuggets.com/2021/12/federated-learning-collaborative-machine-learning-tutorial-get-started.html
-
Why we will always need humans to train AI — sometimes in real-time
Customizable, real-time data labeling pipelines that can continuously receive and process unlabeled data are necessary to train and perfect the AI that impacts our lives and daily conveniences.https://www.kdnuggets.com/2021/12/why-we-need-humans-training-ai.html
-
The Chatbot Transformation: From Failure to the Future
The all-knowing chatbots we once thought to be the future have been replaced by specialized bots, and the results are outstanding.https://www.kdnuggets.com/2021/12/chatbot-transformation-failure-future.html
-
A Faster Way to Prepare Time-Series Data with the AI & Analytics Engine
Many real-world datasets consist of records of events that occur at arbitrary and irregular intervals. These datasets then need to be processed into regular time series for further analysis. We will use the AI & Analytics Engine to illustrate how you can prepare your time-series data in just 1 step.https://www.kdnuggets.com/2021/12/piexchange-faster-way-prepare-timeseries-data-ai-analytics-engine.html
-
Three R Libraries Every Data Scientist Should Know (Even if You Use Python)">Three R Libraries Every Data Scientist Should Know (Even if You Use Python)
Check out these powerful R libraries built by the world’s biggest tech companies.https://www.kdnuggets.com/2021/12/three-r-libraries-every-data-scientist-know-even-python.html
-
How to Get Into Data Analytics If You Don’t Have the Right Degree
So, is a career in data analytics a good fit for you?https://www.kdnuggets.com/2021/12/how-to-get-into-data-analytics.html
-
How to Speed Up XGBoost Model Training
XGBoost is an open-source implementation of gradient boosting designed for speed and performance. However, even XGBoost training can sometimes be slow. This article will review the advantages and disadvantages of each approach as well as go over how to get started.https://www.kdnuggets.com/2021/12/speed-xgboost-model-training.html
-
The 5 Characteristics of a Successful Data Scientist">The 5 Characteristics of a Successful Data Scientist
I've put some thought into this, and come up with the 5 characteristics of a what I believe define a successful data scientist. Do you agree?https://www.kdnuggets.com/2021/12/5-characteristics-successful-data-scientist.html
-
10 Key AI & Data Analytics Trends for 2022 and Beyond
What AI and data analytics trends are taking the industry by storm this year? This comprehensive review highlights upcoming directions in AI to carefully watch and consider implementing in your personal work or organization.https://www.kdnuggets.com/2021/12/10-key-ai-trends-for-2022.html
-
Top 2021 Stories: We Don’t Need Data Scientists, We Need Data Engineers; A Guide On How To Become A Data Scientist (Step By Step Approach); How I Tripled My Income With Data Science in 18 Months
Most viewed KDnuggets stories in 2021 focused on Data Scientists vs Data Engineers; How to become a Data Scientist; Increase income with Data Science; Stunning visualizations using python; and more.https://www.kdnuggets.com/2021/12/top-stories-2021.html
-
Top Resources for Learning Statistics for Data Science">Top Resources for Learning Statistics for Data Science
Let’s take a look at the current state of statistics in data science, and what you can do to accelerate your learning.https://www.kdnuggets.com/2021/12/springboard-top-resources-learn-data-science-statistics.html
-
Cloud ML In Perspective: Surprises of 2021, Projections for 2022
Let’s take a closer look on Cloud ML market in 2021 in retrospective (with occasional drills into realities of 2020, too). Read this in-depth analysis.https://www.kdnuggets.com/2021/12/cloud-ml-perspective-surprises-2021-projections-2022.html
-
How I 14Xed my salary in 14 years as a data analytics/science professional
Learn how one data scientist increased their full-time job salary 14 times in 14 years of a career, with highlights on experiencing an IPO, RSUs, start-ups and working at FAANG companies.https://www.kdnuggets.com/2021/12/14x-salary-in-14-years-data-professional.html
-
5 Key Skills Needed To Become a Great Data Scientist">5 Key Skills Needed To Become a Great Data Scientist
Based on 10 years of my experience (learn to build those skills).https://www.kdnuggets.com/2021/12/5-key-skills-needed-become-great-data-scientist.html
-
Write Clean Python Code Using Pipes">Write Clean Python Code Using Pipes
A short and clean approach to processing iterables.https://www.kdnuggets.com/2021/12/write-clean-python-code-pipes.html
-
KDnuggets™ News 21:n47, Dec 15: Building a solid data team; Stop Learning Data Science to Find Purpose and Find Purpose to Learn Data Science; AI, Analytics, Machine Learning, Data Science Main Developments in 2021 and Key Trends for 2022
In this issue: Building a solid data team; Stop Learning Data Science to Find Purpose and Find Purpose to Learn Data Science; AI, Analytics, Machine Learning, Data Science, Deep Learning Main Developments in 2021 and Key Trends for 2022 - Research, Technology, and Industry perspectives.https://www.kdnuggets.com/2021/n47.html
-
Software Mistakes and Tradeoffs: New book by Tomasz Lelek and StackOverflow guru Jon Skeet
Flexibility versus maintainability—every decision you make in software engineering involves balancing tradeoffs. Software Mistakes and Tradeoffs is available in early access from its publisher Manning. Pre-order now and start reading immediately as part of the Manning Early Access Program (MEAP).https://www.kdnuggets.com/2021/12/manning-software-mistakes-tradeoffs-book.html
-
Data Science & Analytics Industry Main Developments in 2021 and Key Trends for 2022
We have solicited insights from experts at industry-leading companies, asking: "What were the main AI, Data Science, Machine Learning Developments in 2021 and what key trends do you expect in 2022?" Read their opinions here.https://www.kdnuggets.com/2021/12/developments-predictions-data-science-analytics-industry.html
-
12 Tips: From Data Analyst to Startup Co-Founder
Thinking about taking your data science expertise to a new level of creating a start-up company? These tips -- learned from experience -- can help you forge an early path toward success.https://www.kdnuggets.com/2021/12/12-tips-data-analyst-to-co-founder.html
-
Feature Selection: Where Science Meets Art
From heuristic to algorithmic feature selection techniques for data science projects.https://www.kdnuggets.com/2021/12/feature-selection-science-meets-art.html
-
What Is AI Model Governance?
How exactly does AI model governance help tackle these issues? And how can you ensure you’re using it to best fit your needs? Read on.https://www.kdnuggets.com/2021/12/ai-model-governance.html
-
Data Labeling for Machine Learning: Market Overview, Approaches, and Tools
So much of data science and machine learning is founded on having clean and well-understood data sources that it is unsurprising that the data labeling market is growing faster than ever. Here, we highlight many of the top players in this industry and the techniques they use to help you consider which might make a good partner for your needs.https://www.kdnuggets.com/2021/12/data-labeling-ml-overview-and-tools.html
-
My First Six Months as a Data Scientist
The technical and non-technical lessons I’ve learned.https://www.kdnuggets.com/2021/12/first-six-months-data-scientist.html
-
test
https://www.kdnuggets.com/test
-
Introduction to Clustering in Python with PyCaret
A step-by-step, beginner-friendly tutorial for unsupervised clustering tasks in Python using PyCaret.https://www.kdnuggets.com/2021/12/introduction-clustering-python-pycaret.html
-
Stop Learning Data Science to Find Purpose and Find Purpose to Learn Data Science">Stop Learning Data Science to Find Purpose and Find Purpose to Learn Data Science
How I flipped the educational model to become a more effective data scientist.https://www.kdnuggets.com/2021/12/stop-learning-data-science-find-purpose.html
-
Main 2021 Developments and Key 2022 Trends in AI, Data Science, Machine Learning Technology
Our panel of leading experts reviews 2021 main developments and examines the key trends in AI, Data Science, Machine Learning, and Deep Learning Technology.https://www.kdnuggets.com/2021/12/trends-ai-data-science-ml-technology.html
-
Inside DeepMind’s New Efforts to Use Deep Learning to Advance Mathematics
Using deep learning techniques can help mathematicians develop intuitions about the toughest problems in the field.https://www.kdnuggets.com/2021/12/inside-deepmind-new-efforts-deep-learning-advance-mathematics.html
-
Deep Neural Networks Don’t Lead Us Towards AGI
Machine learning techniques continue to evolve with increased efficiency for recognition problems. But, they still lack the critical element of intelligence, so we remain a long way from attaining AGI.https://www.kdnuggets.com/2021/12/deep-neural-networks-not-toward-agi.html
-
Analyzing Scientific Articles with fine-tuned SciBERT NER Model and Neo4j
In this article, we will be analyzing a dataset of scientific abstracts using the Neo4j Graph database and a fine-tuned SciBERT model.https://www.kdnuggets.com/2021/12/analyzing-scientific-articles-finetuned-scibert-ner-model-neo4j.html
-
Should You Become a Freelance Artificial Intelligence Engineer?
Take the first step towards your machine learning engineering career and explore the UC San Diego Extension Machine Learning Engineering Bootcamp today. Those with prior software engineering or data science experience are encouraged to apply.https://www.kdnuggets.com/2021/12/ucsd-become-freelance-artificial-intelligence-engineer.html
-
AI, Analytics, Machine Learning, Data Science, Deep Learning Research Main Developments in 2021 and Key Trends for 2022
2021 has almost come and gone. We saw some standout advancements in AI, Analytics, Machine Learning, Data Science, Deep Learning Research this past year, and the future, starting with 2022, looks bright. As per KDnuggets tradition, our collection of experts have contributed their insights on the matter. Read on to find out more.https://www.kdnuggets.com/2021/12/developments-predictions-ai-machine-learning-data-science-research.html
-
Building a solid data team">Building a solid data team
How do you put together a solid data science team when it comes to developing data-driven products? A variety of roles are available to consider, so which ones do you need and which are most crucial?https://www.kdnuggets.com/2021/12/build-solid-data-team.html
-
How Data Scientists Can Get the Ear of CFOs (And Why You Want It)
Hey, data scientists! Here’s how to bend your CFO’s ear, equip your company with high-quality analysis, and boost your value and career in the process.https://www.kdnuggets.com/2021/12/data-scientists-get-ear-cfos-want.html
-
Advance your data science career to the next level
SAS offers a wide range of hands-on courses for data science professionals to help you get ahead – and stay ahead – in your data science career.https://www.kdnuggets.com/2021/12/sas-advance-data-science-career-next-level.html
-
Introduction to Binary Classification with PyCaret
PyCaret is an alternate low-code library that can be used to replace hundreds of lines of code with few lines only. See how to use it for binary classification.https://www.kdnuggets.com/2021/12/introduction-binary-classification-pycaret.html
-
A $9B AI Failure, Examined">A $9B AI Failure, Examined
What happened at Zillow? An important real-world lesson in... just because you have a cool AI tool, doesn't mean that alone becomes your business model.https://www.kdnuggets.com/2021/12/9b-ai-failure-examined.html
-
Using Datawig, an AWS Deep Learning Library for Missing Value Imputation
A lot of missing values in the dataset can affect the quality of prediction in the long run. Several methods can be used to fill the missing values and Datawig is one of the most efficient ones.https://www.kdnuggets.com/2021/12/datawig-aws-deep-learning-library-missing-value-imputation.html
-
10 Simple Things to Try Before Neural Networks
Below are 10 simple things you should remember to try first before throwing in the towel and jumping straight to neural networks.https://www.kdnuggets.com/2021/12/10-simple-things-try-neural-networks.html
-
What Does a Data Scientist Do?
This guide provides you with the best possible, most direct, and clear answers to "What is data science?" and "What does a data scientist do?".https://www.kdnuggets.com/2021/12/what-does-a-data-scientist-do.html
-
A Beginner’s Guide to End to End Machine Learning
Learn to train, tune, deploy and monitor machine learning models.https://www.kdnuggets.com/2021/12/beginner-guide-end-end-machine-learning.html
-
How to Get Certified as a Data Scientist">How to Get Certified as a Data Scientist
If you are early in your journey to becoming a Data Scientist, an interesting option is to earn certification by DataCamp, and this guide offers tips that will help beginners complete the challenges.https://www.kdnuggets.com/2021/12/get-certified-data-science.html
-
Using PyCaret’s New Time Series Module
PyCaret’s new time series module is now available in beta. Staying true to the simplicity of PyCaret, it is consistent with the existing API and comes with a lot of functionalities.https://www.kdnuggets.com/2021/12/pycaret-new-time-series-module.html
-
Avoid These Mistakes with Time Series Forecasting
A few checks to make before training a Machine Learning model on data that could be random.https://www.kdnuggets.com/2021/12/avoid-mistakes-time-series-forecasting.html
-
2021: A Year Full of Amazing AI papers — A Review
A curated list of the latest breakthroughs in AI by release date with a clear video explanation, link to a more in-depth article, and code.https://www.kdnuggets.com/2021/12/2021-year-review-amazing-ai-papers.html
-
How to Use Permutation Tests
A walkthrough of permutation tests and how they can be applied to time series data.https://www.kdnuggets.com/2021/12/use-permutation-tests.html
-
The Seven Best ELT Tools for Data Warehouses
ELT helps to streamline the process of modern data warehousing and managing a business’ data. In this post, we’ll discuss some of the best ELT tools to help you clean and transfer important data to your data warehouse.https://www.kdnuggets.com/2021/12/mozart-seven-best-elt-tools-data-warehouses.html
-
5 Practical Data Science Projects That Will Help You Solve Real Business Problems for 2022">5 Practical Data Science Projects That Will Help You Solve Real Business Problems for 2022
This curated list of data science projects offers real-life problems that will help you master skills to demonstration that you are technically sound and know how to conduct data science projects that add business value.https://www.kdnuggets.com/2021/12/5-practical-data-science-projects.html
-
Movie Recommendations with Spark Collaborative Filtering
Not sure what movie to watch? Ask your recommender system.https://www.kdnuggets.com/2021/12/movie-recommendations-spark-collaborative-filtering.html
-
KDnuggets™ News 21:n45, Dec 1: Most Common SQL Mistakes on Data Science Interviews; Why Machine Learning Engineers are Replacing Data Scientists
Most Common SQL Mistakes on Data Science Interviews; Why Machine Learning Engineers are Replacing Data Scientists; Vote in new KDnuggets Poll: What Percentage of Your Machine Learning Models Have Been Deployed? KDnuggets: Personal History and Nuggets of Experience.https://www.kdnuggets.com/2021/n45.html
-
Sentiment Analysis API vs Custom Text Classification: Which one to choose?
In this article, we are going to compare the sentiment extraction performance between Sentiment Analysis engines and Custom Text classification engines. The idea is to show pros and cons of these two types of engines on a concrete dataset.https://www.kdnuggets.com/2021/11/sentiment-analysis-api-custom-text-classification.html
-
KDnuggets: Personal History and Nuggets of Experience
After 28+ years of publishing and editing KDnuggets, I am retiring and transitioning KDnuggets to Matthew Mayo, who will become the new editor-in-chief. I want to share with you my story of KDnuggets and highlight some of the useful nuggets of experience I learned along this amazing journey.https://www.kdnuggets.com/2021/11/kdnuggets-history.html
-
Clustering in Crowdsourcing: Methodology and Applications
As a result of the efforts outlined in this article, we confirmed that clustering through crowdsourcing is indeed possible and works impressively well.https://www.kdnuggets.com/2021/11/clustering-crowdsourcing-methodology-applications.html
-
Building Massively Scalable Machine Learning Pipelines with Microsoft Synapse ML
The new platform provides a single API to abstract dozens of ML frameworks and databases.https://www.kdnuggets.com/2021/11/building-massively-scalable-machine-learning-pipelines-microsoft-synapse-ml.html
-
New Poll: What Percentage of Your Machine Learning Models Have Been Deployed?
Take a moment to participate in the latest KDnuggets poll and let the community know what percentage of your machine learning models have been deployed.https://www.kdnuggets.com/2021/11/percentage-machine-learning-models-deployed.html
-
Why Machine Learning Engineers are Replacing Data Scientists">Why Machine Learning Engineers are Replacing Data Scientists
The hiring run for data scientists continues along at a strong clip around the world. But, there are other emerging roles that are demonstrating key value to organizations that you should consider based on your existing or desired skill sets.https://www.kdnuggets.com/2021/11/why-machine-learning-engineers-are-replacing-data-scientists.html
-
Sentiment Analysis with KNIME
Check out this tutorial on how to approach sentiment classification with supervised machine learning algorithms.https://www.kdnuggets.com/2021/11/sentiment-analysis-knime.html
-
How to Build a Knowledge Graph with Neo4J and Transformers
Learn to use custom Named Entity Recognition and Relation Extraction models.https://www.kdnuggets.com/2021/11/build-knowledge-graph-neo4j-transformers.html
-
A Spreadsheet that Generates Python: The Mito JupyterLab Extension
You can call Mito into your Jupyter Environment and each edit you make will generate the equivalent Python in the code cell below.https://www.kdnuggets.com/2021/11/spreadsheet-generates-python-mito-jupyterlab-extension.html
-
Cartoon: Data Science for Thanksgiving
A classic KDnuggets Thanksgiving cartoon examines the predicament of one group of fowl Data Scientists.https://www.kdnuggets.com/2021/11/cartoon-data-science-thanksgiving.html
-
What’s the difference between a Data Scientist and a Data Analyst?
Find out the major differences between a Data Analyst and a Data Scientist, and read the author's pointers on what they would recommend you to do if you wish to make that transition from Data Analyst to Data Scientist.https://www.kdnuggets.com/2021/11/difference-data-scientist-data-analyst.html
-
Can You Become a Data Scientist Online?
Until November 29th, you can join over 1.5 million students around the globe and gain the skills of successful data science professionals with unlimited annual access to the 365 Data Science Program at 72% OFF. Read on to learn more!https://www.kdnuggets.com/2021/11/365datascience-become-data-scientist-online.html
-
Accelerating AI with MLOps
Companies are racing to use AI, but despite its vast potential, most AI projects fail. Examining and resolving operational issues upfront can help AI initiatives reach their full potential.https://www.kdnuggets.com/2021/11/accelerating-ai-mlops.html
-
Most Common SQL Mistakes on Data Science Interviews">Most Common SQL Mistakes on Data Science Interviews
Sure, we all make mistakes -- which can be a bit more painful when we are trying to get hired -- so check out these typical errors applicants make while answering SQL questions during data science interviews.https://www.kdnuggets.com/2021/11/common-sql-mistakes-data-science-interviews.html
-
5 Advanced Tips on Python Sequences
Notes from Fluent Python by Luciano Ramalho.https://www.kdnuggets.com/2021/11/5-advanced-tips-python-sequences.html
-
5 Tips to Get Your First Data Scientist Job
Read some of the key things the author has learned during the infamous job seeking stage.https://www.kdnuggets.com/2021/11/5-tips-first-data-scientist-job.html
-
On-Device Deep Learning: PyTorch Mobile and TensorFlow Lite
PyTorch and TensorFlow are the two leading AI/ML Frameworks. In this article, we take a look at their on-device counterparts PyTorch Mobile and TensorFlow Lite and examine them more deeply from the perspective of someone who wishes to develop and deploy models for use on mobile platforms.https://www.kdnuggets.com/2021/11/on-device-deep-learning-pytorch-mobile-tensorflow-lite.html
-
Dask DataFrame is not Pandas
This article is the second article of an ongoing series on using Dask in practice. Each article in this series will be simple enough for beginners, but provide useful tips for real work. The next article in the series is about parallelizing for loops, and other embarrassingly parallel operations with dask.delayed.https://www.kdnuggets.com/2021/11/dask-dataframe-not-pandas.html
-
3 Differences Between Coding in Data Science and Machine Learning
The terms ‘data science’ and ‘machine learning’ are often used interchangeably. But while they are related, there are some glaring differences, so let’s take a look at the differences between the two disciplines, specifically as it relates to programming.https://www.kdnuggets.com/2021/11/3-differences-coding-data-science-machine-learning.html
-
Stop Blaming Humans for Bias in AI
Can artificial intelligence be rid of bias? This is an important question, and it’s equally important that we look in the right place for the answer.https://www.kdnuggets.com/2021/11/stop-blaming-humans-bias-ai.html
-
Difference between distributed learning versus federated learning algorithms
Want to know the difference between distributed and federated learning? Read this article to find out.https://www.kdnuggets.com/2021/11/difference-distributed-learning-federated-learning-algorithms.html
-
eBook: 101 Ways to Use Third-Party Data to Make Smarter Decisions
To guide you in becoming a data-driven organization, AWS Data Exchange has created a new eBook, 101 Ways to Use Third-Party Data to Make Smarter Decisions. Learn how to transform the ‘currency’ of data into actionable business insights.https://www.kdnuggets.com/2021/11/roidna-ebook-101-ways-third-party-data-smarter-decisions.html
-
Build a Serverless News Data Pipeline using ML on AWS Cloud
This is the guide on how to build a serverless data pipeline on AWS with a Machine Learning model deployed as a Sagemaker endpoint.https://www.kdnuggets.com/2021/11/build-serverless-news-data-pipeline-ml-aws-cloud.html
-
Where NLP is heading">Where NLP is heading
Natural language processing research and applications are moving forward rapidly. Several trends have emerged on this progress, and point to a future of more exciting possibilities and interesting opportunities in the field.https://www.kdnuggets.com/2021/11/where-nlp-is-heading.html
-
Data Scientists: How to Sell Your Project and Yourself
Follow this formula for the perfect elevator pitch.https://www.kdnuggets.com/2021/11/data-scientists-sell-project.html
-
AI meets BI: Key capabilities to look for in a modern BI platform
With the customer at its heart, modern augmented BI platforms no longer require scripting/coding skills or the knowledge to build the back-end data models, empowering even laymen to harness the power of raw data. As a user, here are the top AI capabilities that you need to look for in BI software.https://www.kdnuggets.com/2021/11/zoho-ai-meets-bi-key-capabilities-platform.html
-
Easy Synthetic Data in Python with Faker
Faker is a Python library that generates fake data to supplement or take the place of real world data. See how it can be used for data science.https://www.kdnuggets.com/2021/11/easy-synthetic-data-python-faker.html
-
Inside recommendations: how a recommender system recommends
We describe types of recommender systems, more specifically, algorithms and methods for content-based systems, collaborative filtering, and hybrid systems.https://www.kdnuggets.com/2021/11/recommendations-recommender-system.html
-
How to fast-track machine translation projects
Data is the lifeblood of any successful machine learning model, and machine translation models are no exception. Without relevant and properly labelled data, even the most sophisticated model will be unable to achieve reliable results.https://www.kdnuggets.com/2021/11/defined-fast-track-machine-translation-projects.html
-
Virtual Presentation Tips for Data Scientists
Learn how to effectively communicate your work.https://www.kdnuggets.com/2021/11/virtual-presentation-tips-data-scientists.html
-
10 AI Project Ideas in Computer Vision
The field of computer vision has seen the development of very powerful applications leveraging machine learning. These projects will introduce you to these techniques and guide you to more advanced practice to gain a deeper appreciation for the sophistication now available.https://www.kdnuggets.com/2021/11/10-ai-project-ideas-computer-vision.html
-
Two Simple Things You Need to Steal from Agile for Data and Analytics Work
Peer Review and Definition of Done: small changes, BIG impact.https://www.kdnuggets.com/2021/11/simple-things-steal-agile-data-science-analytics.html
-
How I Redesigned over 100 ETL into ELT Data Pipelines">How I Redesigned over 100 ETL into ELT Data Pipelines
Learn how to level up your Data Pipelines!https://www.kdnuggets.com/2021/11/redesigned-over-100-etl-elt-data-pipelines.html
-
Anecdotes from 11 Role Models in Machine Learning
The skills needed to create good data are also the skills needed for good leadership.https://www.kdnuggets.com/2021/11/anecdotes-11-role-models-machine-learning.html
-
Deep Learning on your phone: PyTorch C++ API for use on Mobile Platforms
The PyTorch Deep Learning framework has a C++ API for use on mobile platforms. This article shows an end-to-end demo of how to write a simple C++ application with Deep Learning capabilities using the PyTorch C++ API such that the same code can be built for use on mobile platforms (both Android and iOS).https://www.kdnuggets.com/2021/11/deep-learning-mobile-phone-pytorch-c-api.html
-
25 Github Repositories Every Python Developer Should Know
Check out these repositories to help you improve your data science skills.https://www.kdnuggets.com/2021/11/25-github-repositories-python-developer.html
-
What’s missing from self-serve BI and what we can do about it
The notion of self-service BI tools caught an expectation that they could provide a magic formula for easily helping everyone understand all the data. But, such an end-result isn't occurring in practice. To identify a better approach, we need to take a step back and determine what problem is actually trying to be solved.https://www.kdnuggets.com/2021/11/missing-self-serve-bi.html
-
Dream Come True: Building websites by thinking about them
From the mind to the computer, make websites using your imagination!https://www.kdnuggets.com/2021/11/dream-come-true-allennlp-hacks-21.html
-
Don’t Waste Time Building Your Data Science Network">Don’t Waste Time Building Your Data Science Network
Instead, focus on what matters.https://www.kdnuggets.com/2021/11/waste-time-building-data-science-network.html
-
KDnuggets Top Blogs Rewards Program Resumes in December
After a pause, we will be resuming KDnuggets Top Blog Rewards Program, starting with blogs published on KDnuggets in December. The program will be bigger, with $3,000 (USD) divided among top 8 most viewed guest blogs. Original blogs rewarded at the rate of 3X of reposts. Submit your original blog to KDnuggets first !https://www.kdnuggets.com/2021/11/top-blogs-reward-program-resumes.html
-
SAS Analytics Pro – now available for on-site or containerized cloud-native deployment – providing your entry point into SAS Viya
Now, SAS Analytics Pro includes a new option for containerized cloud-native deployment. This makes SAS Analytics Pro a perfect entry point into SAS Viya.https://www.kdnuggets.com/2021/11/sas-analytics-pro-now-available.html
-
OpenAI’s Approach to Solve Math Word Problems
OpenAI's latest research aims to solve math word problems. Let's dive a bit deeper into the ideas behind this new research.https://www.kdnuggets.com/2021/11/open-ai-approach-solve-math-word-problems.html
-
The Common Misconceptions About Machine Learning
Beginners in the field can often have many misconceptions about machine learning that sometimes can be a make-it-or-break-it moment for the individual switching careers or starting fresh. This article clearly describes the ground truth realities about learning new ML skills and eventually working professionally as a machine learning engineer.https://www.kdnuggets.com/2021/11/common-misconception-about-machine-learning.html
-
What Comes After HDF5? Seeking a Data Storage Format for Deep Learning
In this article we are discussing that HDF5 is one of the most popular and reliable formats for non-tabular, numerical data. But this format is not optimized for deep learning work. This article suggests what kind of ML native data format should be to truly serve the needs of modern data scientists.https://www.kdnuggets.com/2021/11/after-hdf5-data-storage-format-deep-learning.html
-
7 Top Open Source Datasets to Train Natural Language Processing (NLP) & Text Models
With a lot of excitement and research around NLP, there are growing opportunities to apply these technologies to real-world scenarios. It's not trivial to become familiar with NLP and these open-source data sets can help you increase your skills.https://www.kdnuggets.com/2021/11/top-open-source-datasets-nlp.html
-
Federated Learning: Google’s Take
This blog will be focusing on the work Google has been doing in the Federated Learning space.https://www.kdnuggets.com/2021/11/federated-learning-googles-take.html
-
The Best Ways for Data Professionals to Market AWS Skills in 2022
Knowing your way around Amazon Web Services (AWS) is increasingly useful. Here are five ways to market your AWS skills in today’s job market.https://www.kdnuggets.com/2021/11/best-ways-data-professionals-market-aws-skills.html
-
Toloka 101 Live Demo: Learn how to get reliable training data for machine learning, Nov 11
Toloka is a crowdsourced data labeling platform that handles data collection and annotation projects for machine learning at any scale. In this Nov 11 Live Demo, Learn how to get reliable training data for machine learning.https://www.kdnuggets.com/2021/11/toloka-training-data-machine-learning.html
-
A First Principles Theory of Generalization
Some new research from University of California, Berkeley shades some new light into how to quantify neural networks knowledge.https://www.kdnuggets.com/2021/11/first-principles-theory-generalization.html
-
AI Infinite Training & Maintaining Loop
Productizing AI is an infrastructure orchestration problem. In planning your solution design, you should use continuous monitoring, retraining, and feedback to ensure stability and sustainability.https://www.kdnuggets.com/2021/11/ai-infinite-training-maintaining-loop.html
-
7 of The Coolest Machine Learning Topics of 2021 at ODSC West
At our upcoming event this November 16th-18th in San Francisco, ODSC West 2021 will feature a plethora of talks, workshops, and training sessions on machine learning topics, deep learning, NLP, MLOps, and so on. You can register now for 20% off all ticket types, or register for a free AI Expo Pass to see what some big names in AI are doing now.https://www.kdnuggets.com/2021/11/odsc-7-coolest-machine-learning-topics.html
-
Visual Scoring Techniques for Classification Models
Read this article assessing a model performance in a broader context.https://www.kdnuggets.com/2021/11/visual-scoring-techniques-classification-models.html
-
Data Scientist Career Path from Novice to First Job">Data Scientist Career Path from Novice to First Job
If you are beginning your data science journey, then you must be prepared to plan it out as a step-by-step process that will guide you from being a total newbie to getting your first job as a data scientist. These tips and educational resources should be useful for you and add confidence as you take that first big step.https://www.kdnuggets.com/2021/11/data-scientist-career-path-first-job.html
-
Neural Networks from a Bayesian Perspective
This article looks at neural networks from a Bayesian perspective.https://www.kdnuggets.com/2021/11/neural-networks-bayesian-perspective.html
-
Three reasons to self-host your product analytics
Want three reasons to avoid the cloud and host your own analytics platform? More data, more control, more secure.https://www.kdnuggets.com/2021/11/posthog-three-reasons-self-host-product-analysis.html
-
Design Patterns for Machine Learning Pipelines">Design Patterns for Machine Learning Pipelines
ML pipeline design has undergone several evolutions in the past decade with advances in memory and processor performance, storage systems, and the increasing scale of data sets. We describe how these design patterns changed, what processes they went through, and their future direction.https://www.kdnuggets.com/2021/11/design-patterns-machine-learning-pipelines.html
-
Salary Breakdown of the Top Data Science Jobs">Salary Breakdown of the Top Data Science Jobs
Machine Learning vs NLP vs Data Engineer vs Data Scientist, and what it means to be in each role.https://www.kdnuggets.com/2021/11/salary-breakdown-top-data-science-jobs.html
-
Advanced PyTorch Lightning with TorchMetrics and Lightning Flash
In this tutorial we will be diving deeper into two additional tools you should be using: TorchMetrics and Lightning Flash. TorchMetrics unsurprisingly provides a modular approach to define and track useful metrics across batches and devices, while Lightning Flash offers a suite of functionality facilitating more efficient transfer learning and data handling, and a recipe book of state-of-the-art approaches to typical deep learning problems.https://www.kdnuggets.com/2021/11/advanced-pytorch-lightning-torchmetrics-lightning-flash.html
-
Top 5 Time Series Methods
Data that varies in time can offer powerful applications and use cases for data scientists to analyze. This overview considers the top techniques you can learn to understand and gain insight from time-series data.https://www.kdnuggets.com/2021/11/top-5-time-series-methods.html
-
Is the Modern Data Stack Leaving You Behind?
The modern data stack narrative is largely dominated by analytics engineering. Where does that leave data engineers? Discover the difference between the MDS for data engineers & analytics engineers.https://www.kdnuggets.com/2021/11/modern-data-stack-leaving-behind.html
-
The Case for a Global Responsible AI Framework
Public and private organizations have come out with their own set of AI principles, focusing on AI-related risks from their perspective. However, it’s imperative d=to have a global consensus on Responsible AI – based on data governance, transparency and accountability – on how to utilize and benefit from AI in a way that is both consistent and ethical.https://www.kdnuggets.com/2021/10/responsible-ai-framework.html
-
Multivariate Time Series Analysis with an LSTM based RNN
Check out this codeless solution using the Keras integration.https://www.kdnuggets.com/2021/10/multivariate-time-series-analysis-lstm-based-rnn.html
-
ETL and ELT: A Guide and Market Analysis
ETL and related techniques remain a powerful and foundational tool in the data industry. We explain what ETL is and how ETL and ELT processes have evolved over the years, with a close eye toward how third-generation ETL tools are about to disrupt standard data processing practices.https://www.kdnuggets.com/2021/10/etl-elt-guide-market-analysis.html
-
Simple Text Scraping, Parsing, and Processing with this Python Library
Scraping, parsing, and processing text data from the web can be difficult. But it can also be easy, using Newspaper3k.https://www.kdnuggets.com/2021/10/simple-text-scraping-parsing-processing-python-library.html
-
Want to Join a Bank? Everything Data Scientists Need to Know About Working in Fintech
There is ample opportunity for data scientists in the financial services sector. The career experience can be very different, however, from similar roles at pure technology organizations. So, it's best to first consider if this industry is right for your interests, preferences for how you work, and long-term goals.https://www.kdnuggets.com/2021/10/bank-data-scientists-working-fintech.html
-
Analyze Python Code in Jupyter Notebooks
We present a new tool that integrates modern code analysis techniques with Jupyter notebooks and helps developers find bugs as they write code.https://www.kdnuggets.com/2021/10/analyze-python-code-jupyter-notebooks.html
-
How to Build Data Frameworks with Open Source Tools to Enhance Agility and Security
Let’s take a look at how to harness open source tools to build your data frameworks.https://www.kdnuggets.com/2021/10/build-data-frameworks-open-source-tools-agility-security.html
-
A Guide to 14 Different Data Science Jobs">A Guide to 14 Different Data Science Jobs
The field of data science is growing into one that features a variety of job titles This guide reviews different positions available for you to consider if you have a data science background.https://www.kdnuggets.com/2021/10/guide-14-different-data-science-jobs.html
-
Machine Learning Model Development and Model Operations: Principles and Practices">Machine Learning Model Development and Model Operations: Principles and Practices
The ML model management and the delivery of highly performing model is as important as the initial build of the model by choosing right dataset. The concepts around model retraining, model versioning, model deployment and model monitoring are the basis for machine learning operations (MLOps) that helps the data science teams deliver highly performing models.https://www.kdnuggets.com/2021/10/machine-learning-model-development-operations-principles-practice.html
-
Export Data from the Web Scraping Tool through Zapier Integration
Octoparse makes it easy to collect data from websites and automate workflows on the web. Zapier is an online platform that allows you to automate workflows by connecting the apps and services you use. Zapier connection, the new feature in Octoparse, makes it possible to connect the product with apps including Google Drive, Google Sheets, Dropbox, Trello, Slack, and load more apps in a second with NO CODE.https://www.kdnuggets.com/2021/10/octoparse-web-scraping-zapier-integration.html
-
Getting Started with PyTorch Lightning
As a library designed for production research, PyTorch Lightning streamlines hardware support and distributed training as well, and we’ll show how easy it is to move training to a GPU toward the end.https://www.kdnuggets.com/2021/10/getting-started-pytorch-lightning.html
-
How To Defeat The Machine Learning Engineer Impostor Syndrome
How many times have you taken yet another online course on machine learning or read yet another paper on a new emerging topic, to be up-to-date in this crazy fast-paced AI/ML world -- only to keep feeling like an ML engineer impostor? These three personal tips can help you overcome the classic (and common) impostor syndrome behind every emerging ML engineer who wants to be better at what you do.https://www.kdnuggets.com/2021/10/defeat-machine-learning-engineer-impostor-syndrome.html
-
Four Basic Steps in Data Preparation">Four Basic Steps in Data Preparation
What we would like to do here is introduce four very basic and very general steps in data preparation for machine learning algorithms. We will describe how and why to apply such transformations within a specific example.https://www.kdnuggets.com/2021/10/four-basic-steps-data-preparation.html
-
365 Data Science courses free until 18 November">365 Data Science courses free until 18 November
365 Data Science, an online educational platform providing beginner-to-advanced courses for data science and business analytics professionals, will unlock the entire library of courses, hands-on exercises, certificate exams, and resume builder for a full 30-day period from Oct. 18 to Nov. 18.https://www.kdnuggets.com/2021/10/365datascience-courses-free.html
-
Guide To Finding The Right Predictive Maintenance Machine Learning Techniques
What happens to a life so dependent on machines, when that particular machine breaks down? This is precisely why there’s a dire need for predictive maintenance with machine learning.https://www.kdnuggets.com/2021/10/guide-right-predictive-maintenance-machine-learning-techniques.html
-
Save Sarah Connor with Data Science
Data science and data privacy are deeply interwoven, and must be carefully considered by practitioners. In comparing the Safe Harbour and Expert Determination data obfuscation approaches, Safe Harbour has been very popular among data engineers but has fundamental limitations, where Expert Determination offers important advantages.https://www.kdnuggets.com/2021/10/save-sarah-connor-data-science.html
-
Learn To Reproduce Papers: Beginner’s Guide">Learn To Reproduce Papers: Beginner’s Guide
Step-by-step instructions on how to understand Deep Learning papers and implement the described approaches.https://www.kdnuggets.com/2021/10/learn-reproduce-papers-beginners-guide.html