Search results for
-
Sentiment Analysis API vs Custom Text Classification: Which one to choose?
In this article, we are going to compare the sentiment extraction performance between Sentiment Analysis engines and Custom Text classification engines. The idea is to show pros and cons of these two types of engines on a concrete dataset.https://www.kdnuggets.com/2021/11/sentiment-analysis-api-custom-text-classification.html
-
KDnuggets: Personal History and Nuggets of Experience
After 28+ years of publishing and editing KDnuggets, I am retiring and transitioning KDnuggets to Matthew Mayo, who will become the new editor-in-chief. I want to share with you my story of KDnuggets and highlight some of the useful nuggets of experience I learned along this amazing journey.https://www.kdnuggets.com/2021/11/kdnuggets-history.html
-
Clustering in Crowdsourcing: Methodology and Applications
As a result of the efforts outlined in this article, we confirmed that clustering through crowdsourcing is indeed possible and works impressively well.https://www.kdnuggets.com/2021/11/clustering-crowdsourcing-methodology-applications.html
-
Building Massively Scalable Machine Learning Pipelines with Microsoft Synapse ML
The new platform provides a single API to abstract dozens of ML frameworks and databases.https://www.kdnuggets.com/2021/11/building-massively-scalable-machine-learning-pipelines-microsoft-synapse-ml.html
-
New Poll: What Percentage of Your Machine Learning Models Have Been Deployed?
Take a moment to participate in the latest KDnuggets poll and let the community know what percentage of your machine learning models have been deployed.https://www.kdnuggets.com/2021/11/percentage-machine-learning-models-deployed.html
-
Why Machine Learning Engineers are Replacing Data Scientists">
The hiring run for data scientists continues along at a strong clip around the world. But, there are other emerging roles that are demonstrating key value to organizations that you should consider based on your existing or desired skill sets.Why Machine Learning Engineers are Replacing Data Scientists
https://www.kdnuggets.com/2021/11/why-machine-learning-engineers-are-replacing-data-scientists.html
-
Top Stories, Nov 22-28: Most Common SQL Mistakes on Data Science Interviews
Also: 19 Data Science Project Ideas for Beginners; How to Build a Knowledge Graph with Neo4J and Transformers; Data Scientists: How to Sell Your Project and Yourself; Where NLP is headinghttps://www.kdnuggets.com/2021/11/top-news-week-1122-1128.html
-
Sentiment Analysis with KNIME
Check out this tutorial on how to approach sentiment classification with supervised machine learning algorithms.https://www.kdnuggets.com/2021/11/sentiment-analysis-knime.html
-
How to Build a Knowledge Graph with Neo4J and Transformers
Learn to use custom Named Entity Recognition and Relation Extraction models.https://www.kdnuggets.com/2021/11/build-knowledge-graph-neo4j-transformers.html
-
PyCaret 2.3.5 Is Here! Learn What’s New
Read about the new functionalities added in PyCaret’s recent release.https://www.kdnuggets.com/2021/11/pycaret-here-learn-new.html
-
A Spreadsheet that Generates Python: The Mito JupyterLab Extension
You can call Mito into your Jupyter Environment and each edit you make will generate the equivalent Python in the code cell below.https://www.kdnuggets.com/2021/11/spreadsheet-generates-python-mito-jupyterlab-extension.html
-
Cartoon: Data Science for Thanksgiving
A classic KDnuggets Thanksgiving cartoon examines the predicament of one group of fowl Data Scientists.https://www.kdnuggets.com/2021/11/cartoon-data-science-thanksgiving.html
-
What’s the difference between a Data Scientist and a Data Analyst?
Find out the major differences between a Data Analyst and a Data Scientist, and read the author's pointers on what they would recommend you to do if you wish to make that transition from Data Analyst to Data Scientist.https://www.kdnuggets.com/2021/11/difference-data-scientist-data-analyst.html
-
Can You Become a Data Scientist Online?
Until November 29th, you can join over 1.5 million students around the globe and gain the skills of successful data science professionals with unlimited annual access to the 365 Data Science Program at 72% OFF. Read on to learn more!https://www.kdnuggets.com/2021/11/365datascience-become-data-scientist-online.html
-
Top 4 Data Integration Tools for Modern Enterprises
Maintaining a centralized data repository can simplify your business intelligence initiatives. Here are four data integration tools that can make data more valuable for modern enterprises.https://www.kdnuggets.com/2021/11/top-4-data-integration-tools-modern-enterprises.html
-
Accelerating AI with MLOps
Companies are racing to use AI, but despite its vast potential, most AI projects fail. Examining and resolving operational issues upfront can help AI initiatives reach their full potential.https://www.kdnuggets.com/2021/11/accelerating-ai-mlops.html
-
Common Misconceptions About Differential Privacy
This article will clarify some common misconceptions about differential privacy and what it guarantees.https://www.kdnuggets.com/2021/11/common-misconceptions-differential-privacy.html
-
Top Stories, Nov 15-21: 19 Data Science Project Ideas for Beginners
Also: How I Redesigned over 100 ETL into ELT Data Pipelines; Where NLP is heading; Don’t Waste Time Building Your Data Science Network; Data Scientists: How to Sell Your Project and Yourselfhttps://www.kdnuggets.com/2021/11/top-news-week-1115-1121.html
-
Most Common SQL Mistakes on Data Science Interviews">
Sure, we all make mistakes -- which can be a bit more painful when we are trying to get hired -- so check out these typical errors applicants make while answering SQL questions during data science interviews.Most Common SQL Mistakes on Data Science Interviews
https://www.kdnuggets.com/2021/11/common-sql-mistakes-data-science-interviews.html
-
5 Advanced Tips on Python Sequences
Notes from Fluent Python by Luciano Ramalho.https://www.kdnuggets.com/2021/11/5-advanced-tips-python-sequences.html
-
5 Tips to Get Your First Data Scientist Job
Read some of the key things the author has learned during the infamous job seeking stage.https://www.kdnuggets.com/2021/11/5-tips-first-data-scientist-job.html
-
On-Device Deep Learning: PyTorch Mobile and TensorFlow Lite
PyTorch and TensorFlow are the two leading AI/ML Frameworks. In this article, we take a look at their on-device counterparts PyTorch Mobile and TensorFlow Lite and examine them more deeply from the perspective of someone who wishes to develop and deploy models for use on mobile platforms.https://www.kdnuggets.com/2021/11/on-device-deep-learning-pytorch-mobile-tensorflow-lite.html
-
Dask DataFrame is not Pandas
This article is the second article of an ongoing series on using Dask in practice. Each article in this series will be simple enough for beginners, but provide useful tips for real work. The next article in the series is about parallelizing for loops, and other embarrassingly parallel operations with dask.delayed.https://www.kdnuggets.com/2021/11/dask-dataframe-not-pandas.html
-
3 Differences Between Coding in Data Science and Machine Learning
The terms ‘data science’ and ‘machine learning’ are often used interchangeably. But while they are related, there are some glaring differences, so let’s take a look at the differences between the two disciplines, specifically as it relates to programming.https://www.kdnuggets.com/2021/11/3-differences-coding-data-science-machine-learning.html
-
Stop Blaming Humans for Bias in AI
Can artificial intelligence be rid of bias? This is an important question, and it’s equally important that we look in the right place for the answer.https://www.kdnuggets.com/2021/11/stop-blaming-humans-bias-ai.html
-
Difference between distributed learning versus federated learning algorithms
Want to know the difference between distributed and federated learning? Read this article to find out.https://www.kdnuggets.com/2021/11/difference-distributed-learning-federated-learning-algorithms.html
-
eBook: 101 Ways to Use Third-Party Data to Make Smarter Decisions
To guide you in becoming a data-driven organization, AWS Data Exchange has created a new eBook, 101 Ways to Use Third-Party Data to Make Smarter Decisions. Learn how to transform the ‘currency’ of data into actionable business insights.https://www.kdnuggets.com/2021/11/roidna-ebook-101-ways-third-party-data-smarter-decisions.html
-
Build a Serverless News Data Pipeline using ML on AWS Cloud
This is the guide on how to build a serverless data pipeline on AWS with a Machine Learning model deployed as a Sagemaker endpoint.https://www.kdnuggets.com/2021/11/build-serverless-news-data-pipeline-ml-aws-cloud.html
-
Where NLP is heading">
Natural language processing research and applications are moving forward rapidly. Several trends have emerged on this progress, and point to a future of more exciting possibilities and interesting opportunities in the field.Where NLP is heading
https://www.kdnuggets.com/2021/11/where-nlp-is-heading.html
-
Data Scientists: How to Sell Your Project and Yourself
Follow this formula for the perfect elevator pitch.https://www.kdnuggets.com/2021/11/data-scientists-sell-project.html
-
AI meets BI: Key capabilities to look for in a modern BI platform
With the customer at its heart, modern augmented BI platforms no longer require scripting/coding skills or the knowledge to build the back-end data models, empowering even laymen to harness the power of raw data. As a user, here are the top AI capabilities that you need to look for in BI software.https://www.kdnuggets.com/2021/11/zoho-ai-meets-bi-key-capabilities-platform.html
-
Easy Synthetic Data in Python with Faker
Faker is a Python library that generates fake data to supplement or take the place of real world data. See how it can be used for data science.https://www.kdnuggets.com/2021/11/easy-synthetic-data-python-faker.html
-
Inside recommendations: how a recommender system recommends
We describe types of recommender systems, more specifically, algorithms and methods for content-based systems, collaborative filtering, and hybrid systems.https://www.kdnuggets.com/2021/11/recommendations-recommender-system.html
-
Book Metadata and Cover Retrieval Using OCR and Google Books API
With KNIME extracting critical pieces of information from images becomes as easy as ABC.https://www.kdnuggets.com/2021/11/book-metadata-cover-retrieval-ocr-google-books-api.html
-
KDnuggets™ News 21:n44, Nov 17: Don’t Waste Time Building Your Data Science Network; 19 Data Science Project Ideas for Beginners
Don’t Waste Time Building Your Data Science Network; 19 Data Science Project Ideas for Beginners; How I Redesigned over 100 ETL into ELT Data Pipelines; Anecdotes from 11 Role Models in Machine Learning; The Ultimate Guide To Different Word Embedding Techniques In NLPhttps://www.kdnuggets.com/2021/n44.html
-
How to fast-track machine translation projects
Data is the lifeblood of any successful machine learning model, and machine translation models are no exception. Without relevant and properly labelled data, even the most sophisticated model will be unable to achieve reliable results.https://www.kdnuggets.com/2021/11/defined-fast-track-machine-translation-projects.html
-
Virtual Presentation Tips for Data Scientists
Learn how to effectively communicate your work.https://www.kdnuggets.com/2021/11/virtual-presentation-tips-data-scientists.html
-
10 AI Project Ideas in Computer Vision
The field of computer vision has seen the development of very powerful applications leveraging machine learning. These projects will introduce you to these techniques and guide you to more advanced practice to gain a deeper appreciation for the sophistication now available.https://www.kdnuggets.com/2021/11/10-ai-project-ideas-computer-vision.html
-
Two Simple Things You Need to Steal from Agile for Data and Analytics Work
Peer Review and Definition of Done: small changes, BIG impact.https://www.kdnuggets.com/2021/11/simple-things-steal-agile-data-science-analytics.html
-
KDnuggets Top Blogs Rewards for October 2021
The October blogs that won KDnuggets Rewards include: How I Tripled My Income With Data Science in 18 Months; What Google Recommends You do Before Taking Their Machine Learning or Data Science Course; How to Build Strong Data Science Portfolio as a Beginner; Data Scientist vs Data Engineer Salary.https://www.kdnuggets.com/2021/11/top-blogs-rewards-oct.html
-
What Are NVIDIA NGC Containers & How to Get Started Using Them
NVIDIA, the pioneer in the GPU technologies and deep learning revolution, has come up with an excellent catalog of specialized containers that they call NGC Collections. In this article, we explore their basic usage and some variations.https://www.kdnuggets.com/2021/11/nvidia-ngc-containers-get-started.html
-
Top Stories, Nov 8-14: Don’t Waste Time Building Your Data Science Network
Also: Data Scientist Career Path from Novice to First Job; Design Patterns for Machine Learning Pipelines; What Google Recommends You do Before Taking Their Machine Learning or Data Science Course; Salary Breakdown of the Top Data Science Jobshttps://www.kdnuggets.com/2021/11/top-news-week-1108-1114.html
-
How I Redesigned over 100 ETL into ELT Data Pipelines">
Learn how to level up your Data Pipelines!How I Redesigned over 100 ETL into ELT Data Pipelines
https://www.kdnuggets.com/2021/11/redesigned-over-100-etl-elt-data-pipelines.html
-
Anecdotes from 11 Role Models in Machine Learning
The skills needed to create good data are also the skills needed for good leadership.https://www.kdnuggets.com/2021/11/anecdotes-11-role-models-machine-learning.html
-
Deep Learning on your phone: PyTorch C++ API for use on Mobile Platforms
The PyTorch Deep Learning framework has a C++ API for use on mobile platforms. This article shows an end-to-end demo of how to write a simple C++ application with Deep Learning capabilities using the PyTorch C++ API such that the same code can be built for use on mobile platforms (both Android and iOS).https://www.kdnuggets.com/2021/11/deep-learning-mobile-phone-pytorch-c-api.html
-
25 Github Repositories Every Python Developer Should Know
Check out these repositories to help you improve your data science skills.https://www.kdnuggets.com/2021/11/25-github-repositories-python-developer.html
-
Top October Stories: How I Tripled My Income With Data Science in 18 Months; What Google Recommends You do Before Taking Their ML or DS Course
Also: How to Build Strong Data Science Portfolio as a Beginner; Data Science Portfolio Project Ideas That Can Get You Hired (Or Not); Exclusive: OpenAI summarizes KDnuggetshttps://www.kdnuggets.com/2021/11/top-stories-2021-oct.html
-
Attend the Data Intelligence Summit to Learn from Data Thought Leaders
Join Caserta and fellow data and analytics leaders, Nov 17, as they help guide you on how, what and why you need to transform your data ecosystem to cloud-based modern analytics.https://www.kdnuggets.com/2021/11/caserta-data-intelligence-summit-data-thought-leaders.html
-
What’s missing from self-serve BI and what we can do about it
The notion of self-service BI tools caught an expectation that they could provide a magic formula for easily helping everyone understand all the data. But, such an end-result isn't occurring in practice. To identify a better approach, we need to take a step back and determine what problem is actually trying to be solved.https://www.kdnuggets.com/2021/11/missing-self-serve-bi.html
-
Dream Come True: Building websites by thinking about them
From the mind to the computer, make websites using your imagination!https://www.kdnuggets.com/2021/11/dream-come-true-allennlp-hacks-21.html
-
AWS Data Exchange Webinar: Maintain competitive edge with third-party financial services data
Join this webinar, Nov 11, to learn how leveraging third-party financial services data can facilitate faster, intelligence-based decision-making that propels your company's business outcomes and digital transformation.https://www.kdnuggets.com/2021/11/roidna-aws-data-exchange-webinar-third-party-financial-data.html
-
The Ultimate Guide To Different Word Embedding Techniques In NLP
A machine can only understand numbers. As a result, converting text to numbers, called embedding text, is an actively researched topic. In this article, we review different word embedding techniques for converting text into vectors.https://www.kdnuggets.com/2021/11/guide-word-embedding-techniques-nlp.html
-
Don’t Waste Time Building Your Data Science Network">
Instead, focus on what matters.Don’t Waste Time Building Your Data Science Network
https://www.kdnuggets.com/2021/11/waste-time-building-data-science-network.html
-
KDnuggets™ News 21:n43, Nov 10: Data Scientist Career Path from Novice to First Job; Neural Networks from a Bayesian Perspective
Data Scientist Career Path: from Novice to First Job; Understand Neural Networks from a Bayesian Perspective; The Best Ways for Data Professionals to Market AWS Skills; Build Your Own Automated Machine Learning App.https://www.kdnuggets.com/2021/n43.html
-
KDnuggets Top Blogs Rewards Program Resumes in December
After a pause, we will be resuming KDnuggets Top Blog Rewards Program, starting with blogs published on KDnuggets in December. The program will be bigger, with $3,000 (USD) divided among top 8 most viewed guest blogs. Original blogs rewarded at the rate of 3X of reposts. Submit your original blog to KDnuggets first !https://www.kdnuggets.com/2021/11/top-blogs-reward-program-resumes.html
-
SAS Analytics Pro – now available for on-site or containerized cloud-native deployment – providing your entry point into SAS Viya
Now, SAS Analytics Pro includes a new option for containerized cloud-native deployment. This makes SAS Analytics Pro a perfect entry point into SAS Viya.https://www.kdnuggets.com/2021/11/sas-analytics-pro-now-available.html
-
OpenAI’s Approach to Solve Math Word Problems
OpenAI's latest research aims to solve math word problems. Let's dive a bit deeper into the ideas behind this new research.https://www.kdnuggets.com/2021/11/open-ai-approach-solve-math-word-problems.html
-
The Common Misconceptions About Machine Learning
Beginners in the field can often have many misconceptions about machine learning that sometimes can be a make-it-or-break-it moment for the individual switching careers or starting fresh. This article clearly describes the ground truth realities about learning new ML skills and eventually working professionally as a machine learning engineer.https://www.kdnuggets.com/2021/11/common-misconception-about-machine-learning.html
-
What Comes After HDF5? Seeking a Data Storage Format for Deep Learning
In this article we are discussing that HDF5 is one of the most popular and reliable formats for non-tabular, numerical data. But this format is not optimized for deep learning work. This article suggests what kind of ML native data format should be to truly serve the needs of modern data scientists.https://www.kdnuggets.com/2021/11/after-hdf5-data-storage-format-deep-learning.html
-
SigOpt AI & HPC Summit, Nov 16 – Virtual and Free
Learn how PayPal, AWS, Intel, Accenture, MIT and Stanford apply experimentation to build better AI at the free SigOpt AI & HPC Summit.https://www.kdnuggets.com/2021/11/sigopt-ai-hpc-summit-virtual-free.html
-
POS Tagging, Explained
Learn about the strengths of part-of-speech tagging, and about how a strong POS tagger can contribute to natural language understanding.https://www.kdnuggets.com/2021/11/pos-tagging-explained.html
-
Top Stories, Nov 1-7: What Google Recommends You do Before Taking Their Machine Learning or Data Science Course
Also: Design Patterns for Machine Learning Pipelines; Data Scientist Career Path from Novice to First Job; Salary Breakdown of the Top Data Science Jobs; ORDAINED: The Python Project Templatehttps://www.kdnuggets.com/2021/11/top-news-week-1101-1107.html
-
7 Top Open Source Datasets to Train Natural Language Processing (NLP) & Text Models
With a lot of excitement and research around NLP, there are growing opportunities to apply these technologies to real-world scenarios. It's not trivial to become familiar with NLP and these open-source data sets can help you increase your skills.https://www.kdnuggets.com/2021/11/top-open-source-datasets-nlp.html
-
Federated Learning: Google’s Take
This blog will be focusing on the work Google has been doing in the Federated Learning space.https://www.kdnuggets.com/2021/11/federated-learning-googles-take.html
-
Machine Learning Safety: Unsolved Problems
There remain critical challenges in machine learning that, if left resolved, could lead to unintended consequences and unsafe use of AI in the future. As an important and active area of research, roadmaps are being developed to help guide continued ML research and use toward meaningful and robust applications.https://www.kdnuggets.com/2021/11/machine-learning-safety-unsolved-problems.html
-
The Best Ways for Data Professionals to Market AWS Skills in 2022
Knowing your way around Amazon Web Services (AWS) is increasingly useful. Here are five ways to market your AWS skills in today’s job market.https://www.kdnuggets.com/2021/11/best-ways-data-professionals-market-aws-skills.html
-
Toloka 101 Live Demo: Learn how to get reliable training data for machine learning, Nov 11
Toloka is a crowdsourced data labeling platform that handles data collection and annotation projects for machine learning at any scale. In this Nov 11 Live Demo, Learn how to get reliable training data for machine learning.https://www.kdnuggets.com/2021/11/toloka-training-data-machine-learning.html
-
A First Principles Theory of Generalization
Some new research from University of California, Berkeley shades some new light into how to quantify neural networks knowledge.https://www.kdnuggets.com/2021/11/first-principles-theory-generalization.html
-
AI Infinite Training & Maintaining Loop
Productizing AI is an infrastructure orchestration problem. In planning your solution design, you should use continuous monitoring, retraining, and feedback to ensure stability and sustainability.https://www.kdnuggets.com/2021/11/ai-infinite-training-maintaining-loop.html
-
NLP for Business in the Time of BERTera: Seven Misplaced Passions
This article is a brief summary of our observations on some common client misperceptions with respect to recent developments in NLP, especially the use of large-scale models and datasets.https://www.kdnuggets.com/2021/11/nlp-business-bertera-seven-misplaced-passions.html
-
7 of The Coolest Machine Learning Topics of 2021 at ODSC West
At our upcoming event this November 16th-18th in San Francisco, ODSC West 2021 will feature a plethora of talks, workshops, and training sessions on machine learning topics, deep learning, NLP, MLOps, and so on. You can register now for 20% off all ticket types, or register for a free AI Expo Pass to see what some big names in AI are doing now.https://www.kdnuggets.com/2021/11/odsc-7-coolest-machine-learning-topics.html
-
Visual Scoring Techniques for Classification Models
Read this article assessing a model performance in a broader context.https://www.kdnuggets.com/2021/11/visual-scoring-techniques-classification-models.html
-
Data Scientist Career Path from Novice to First Job">
If you are beginning your data science journey, then you must be prepared to plan it out as a step-by-step process that will guide you from being a total newbie to getting your first job as a data scientist. These tips and educational resources should be useful for you and add confidence as you take that first big step.Data Scientist Career Path from Novice to First Job
https://www.kdnuggets.com/2021/11/data-scientist-career-path-first-job.html
-
Neural Networks from a Bayesian Perspective
This article looks at neural networks from a Bayesian perspective.https://www.kdnuggets.com/2021/11/neural-networks-bayesian-perspective.html
-
KDnuggets™ News 21:n42, Nov 3: Google Recommendations Before Taking Their Machine Learning Course; Guide to Data Science Jobs
What Google Recommends You do Before Taking Their Machine Learning or Data Science Course; A Guide to 14 Different Data Science Jobs; Analyze Python Code in Jupyter Notebooks; Machine Learning Model Development and Model Operations: Principles and Practices; Want to Join a Bank? Everything Data Scientists Need to Know About Working in Fintechhttps://www.kdnuggets.com/2021/n42.html
-
Three reasons to self-host your product analytics
Want three reasons to avoid the cloud and host your own analytics platform? More data, more control, more secure.https://www.kdnuggets.com/2021/11/posthog-three-reasons-self-host-product-analysis.html
-
ORDAINED: The Python Project Template">
Recently I decided to take the time to better understand the Python packaging ecosystem and create a project boilerplate template as an improvement over copying a directory tree and doing find and replace.ORDAINED: The Python Project Template
https://www.kdnuggets.com/2021/11/ordained-python-project-template.html
-
Design Patterns for Machine Learning Pipelines">
ML pipeline design has undergone several evolutions in the past decade with advances in memory and processor performance, storage systems, and the increasing scale of data sets. We describe how these design patterns changed, what processes they went through, and their future direction.Design Patterns for Machine Learning Pipelines
https://www.kdnuggets.com/2021/11/design-patterns-machine-learning-pipelines.html
-
Salary Breakdown of the Top Data Science Jobs">
Machine Learning vs NLP vs Data Engineer vs Data Scientist, and what it means to be in each role.Salary Breakdown of the Top Data Science Jobs
https://www.kdnuggets.com/2021/11/salary-breakdown-top-data-science-jobs.html
-
Top Stories, Oct 25-31: How I Tripled My Income With Data Science in 18 Months; Machine Learning Model Development and Model Operations: Principles and Practices
Also: What Google Recommends You do Before Taking Their Machine Learning or Data Science Course; Learn To Reproduce Papers: Beginner’s Guide; 365 Data Science courses free until 18 November; A Guide to 14 Different Data Science Jobshttps://www.kdnuggets.com/2021/11/top-news-week-1025-1031.html
-
Advanced PyTorch Lightning with TorchMetrics and Lightning Flash
In this tutorial we will be diving deeper into two additional tools you should be using: TorchMetrics and Lightning Flash. TorchMetrics unsurprisingly provides a modular approach to define and track useful metrics across batches and devices, while Lightning Flash offers a suite of functionality facilitating more efficient transfer learning and data handling, and a recipe book of state-of-the-art approaches to typical deep learning problems.https://www.kdnuggets.com/2021/11/advanced-pytorch-lightning-torchmetrics-lightning-flash.html
-
Top 5 Time Series Methods
Data that varies in time can offer powerful applications and use cases for data scientists to analyze. This overview considers the top techniques you can learn to understand and gain insight from time-series data.https://www.kdnuggets.com/2021/11/top-5-time-series-methods.html
-
Is the Modern Data Stack Leaving You Behind?
The modern data stack narrative is largely dominated by analytics engineering. Where does that leave data engineers? Discover the difference between the MDS for data engineers & analytics engineers.https://www.kdnuggets.com/2021/11/modern-data-stack-leaving-behind.html
-
The Case for a Global Responsible AI Framework
Public and private organizations have come out with their own set of AI principles, focusing on AI-related risks from their perspective. However, it’s imperative d=to have a global consensus on Responsible AI – based on data governance, transparency and accountability – on how to utilize and benefit from AI in a way that is both consistent and ethical.https://www.kdnuggets.com/2021/10/responsible-ai-framework.html
-
Multivariate Time Series Analysis with an LSTM based RNN
Check out this codeless solution using the Keras integration.https://www.kdnuggets.com/2021/10/multivariate-time-series-analysis-lstm-based-rnn.html
-
ETL and ELT: A Guide and Market Analysis
ETL and related techniques remain a powerful and foundational tool in the data industry. We explain what ETL is and how ETL and ELT processes have evolved over the years, with a close eye toward how third-generation ETL tools are about to disrupt standard data processing practices.https://www.kdnuggets.com/2021/10/etl-elt-guide-market-analysis.html
-
Simple Text Scraping, Parsing, and Processing with this Python Library
Scraping, parsing, and processing text data from the web can be difficult. But it can also be easy, using Newspaper3k.https://www.kdnuggets.com/2021/10/simple-text-scraping-parsing-processing-python-library.html
-
First steps to learning data science & machine learning are the foundations.What Google Recommends You do Before Taking Their Machine Learning or Data Science Course">
What Google Recommends You do Before Taking Their Machine Learning or Data Science Course
https://www.kdnuggets.com/2021/10/google-recommends-before-machine-learning-data-science-course.html
-
Want to Join a Bank? Everything Data Scientists Need to Know About Working in Fintech
There is ample opportunity for data scientists in the financial services sector. The career experience can be very different, however, from similar roles at pure technology organizations. So, it's best to first consider if this industry is right for your interests, preferences for how you work, and long-term goals.https://www.kdnuggets.com/2021/10/bank-data-scientists-working-fintech.html
-
Analyze Python Code in Jupyter Notebooks
We present a new tool that integrates modern code analysis techniques with Jupyter notebooks and helps developers find bugs as they write code.https://www.kdnuggets.com/2021/10/analyze-python-code-jupyter-notebooks.html
-
How to Build Data Frameworks with Open Source Tools to Enhance Agility and Security
Let’s take a look at how to harness open source tools to build your data frameworks.https://www.kdnuggets.com/2021/10/build-data-frameworks-open-source-tools-agility-security.html
-
A Guide to 14 Different Data Science Jobs">
The field of data science is growing into one that features a variety of job titles This guide reviews different positions available for you to consider if you have a data science background.A Guide to 14 Different Data Science Jobs
https://www.kdnuggets.com/2021/10/guide-14-different-data-science-jobs.html
-
Machine Learning Model Development and Model Operations: Principles and Practices">
The ML model management and the delivery of highly performing model is as important as the initial build of the model by choosing right dataset. The concepts around model retraining, model versioning, model deployment and model monitoring are the basis for machine learning operations (MLOps) that helps the data science teams deliver highly performing models.Machine Learning Model Development and Model Operations: Principles and Practices
https://www.kdnuggets.com/2021/10/machine-learning-model-development-operations-principles-practice.html
-
KDnuggets™ News 21:n41, Oct 27: How I Tripled My Income With Data Science in 18 Months; Data Scientist vs Data Engineer Salary
Read "How I Tripled My Income With Data Science in 18 Months"; Compare Data Scientist vs Data Engineer Salary; Learn To Reproduce Research Papers; Exclusive: OpenAI summarizes KDnuggets; Data Science Portfolio Project Ideas That Can Get You Hired (Or Not); and more.https://www.kdnuggets.com/2021/n41.html
-
Export Data from the Web Scraping Tool through Zapier Integration
Octoparse makes it easy to collect data from websites and automate workflows on the web. Zapier is an online platform that allows you to automate workflows by connecting the apps and services you use. Zapier connection, the new feature in Octoparse, makes it possible to connect the product with apps including Google Drive, Google Sheets, Dropbox, Trello, Slack, and load more apps in a second with NO CODE.https://www.kdnuggets.com/2021/10/octoparse-web-scraping-zapier-integration.html
-
Getting Started with PyTorch Lightning
As a library designed for production research, PyTorch Lightning streamlines hardware support and distributed training as well, and we’ll show how easy it is to move training to a GPU toward the end.https://www.kdnuggets.com/2021/10/getting-started-pytorch-lightning.html
-
How To Defeat The Machine Learning Engineer Impostor Syndrome
How many times have you taken yet another online course on machine learning or read yet another paper on a new emerging topic, to be up-to-date in this crazy fast-paced AI/ML world -- only to keep feeling like an ML engineer impostor? These three personal tips can help you overcome the classic (and common) impostor syndrome behind every emerging ML engineer who wants to be better at what you do.https://www.kdnuggets.com/2021/10/defeat-machine-learning-engineer-impostor-syndrome.html
-
Four Basic Steps in Data Preparation">
What we would like to do here is introduce four very basic and very general steps in data preparation for machine learning algorithms. We will describe how and why to apply such transformations within a specific example.Four Basic Steps in Data Preparation
https://www.kdnuggets.com/2021/10/four-basic-steps-data-preparation.html
-
Top Stories, Oct 18-24: How I Tripled My Income With Data Science in 18 Months; Data Science Portfolio Project Ideas That Can Get You Hired (Or Not)
Also: Data Scientist vs Data Engineer Salary; The 20 Python Packages You Need For Machine Learning and Data Science; Exclusive: OpenAI summarizes KDnuggets; Real Time Image Segmentation Using 5 Lines of Codehttps://www.kdnuggets.com/2021/10/top-news-week-1018-1024.html
-
365 Data Science courses free until 18 November">
365 Data Science, an online educational platform providing beginner-to-advanced courses for data science and business analytics professionals, will unlock the entire library of courses, hands-on exercises, certificate exams, and resume builder for a full 30-day period from Oct. 18 to Nov. 18.365 Data Science courses free until 18 November
https://www.kdnuggets.com/2021/10/365datascience-courses-free.html
-
Guide To Finding The Right Predictive Maintenance Machine Learning Techniques
What happens to a life so dependent on machines, when that particular machine breaks down? This is precisely why there’s a dire need for predictive maintenance with machine learning.https://www.kdnuggets.com/2021/10/guide-right-predictive-maintenance-machine-learning-techniques.html
-
Save Sarah Connor with Data Science
Data science and data privacy are deeply interwoven, and must be carefully considered by practitioners. In comparing the Safe Harbour and Expert Determination data obfuscation approaches, Safe Harbour has been very popular among data engineers but has fundamental limitations, where Expert Determination offers important advantages.https://www.kdnuggets.com/2021/10/save-sarah-connor-data-science.html
-
Step-by-step instructions on how to understand Deep Learning papers and implement the described approaches.Learn To Reproduce Papers: Beginner’s Guide">
Learn To Reproduce Papers: Beginner’s Guide
https://www.kdnuggets.com/2021/10/learn-reproduce-papers-beginners-guide.html
-
Exclusive: OpenAI summarizes KDnuggets">
OpenAI has recently done amazing work summarizing full-length books. We have asked OpenAI to summarize two recent KDnuggets posts, and the results have a very human-like quality. Only the last line betrays the inhuman intelligence at work.Exclusive: OpenAI summarizes KDnuggets
https://www.kdnuggets.com/2021/10/exclusive-openai-summarizes-kdnuggets.html
-
How to Transform Your Data in Snowflake
Data transformation is the biggest bottleneck in the analytics workflow. The modern approach to data pipelines is ELT, or extract, transform, and load, with data transformation performed in your Snowflake data warehouse. A new breed of “no-/low-code” data transformation tools, such as Datameer, are emerging to allow the wider analytics community to transform data on their own, eliminating analytics bottlenecks.https://www.kdnuggets.com/2021/10/datameer-transform-data-snowflake.html
-
Deploying Serverless spaCy Transformer Model with AWS Lambda
A step-by-step guide on how to deploy NER transformer model serverless.https://www.kdnuggets.com/2021/10/deploying-serverless-spacy-transformer-model-aws-lambda.html
-
Introduction to AutoEncoder and Variational AutoEncoder (VAE)">
Autoencoders and their variants are interesting and powerful artificial neural networks used in unsupervised learning scenarios. Learn how autoencoders perform in their different approaches and how to implement with Keras on the instructional data set of the MNIST digits.Introduction to AutoEncoder and Variational AutoEncoder (VAE)
https://www.kdnuggets.com/2021/10/introduction-autoencoder-variational-autoencoder-vae.html
-
Find the Best-Matching Distribution for Your Data Effortlessly
How to find the best-matching statistical distributions for your data points — in an automated and easy way. And, then how to extend the utility further.https://www.kdnuggets.com/2021/10/best-matching-distribution-data-effortlessly.html
-
DATAnalyze 2021 Analytics Hackathon Sponsored by Microsoft and WorldData.AI, $125,000 in prizes!
Tech Tree Root is excited to introduce you to our DATAnalyze 2021 sponsors Microsoft, WorldData.AI, and HBCU Connect! Our online analytics hackathon is offering up to $125,000 USD in prizes!https://www.kdnuggets.com/2021/10/datanalyze-2021-analytics-hackathon.html
-
Training BPE, WordPiece, and Unigram Tokenizers from Scratch using Hugging Face
Comparing the tokens generated by SOTA tokenization algorithms using Hugging Face's tokenizers package.https://www.kdnuggets.com/2021/10/bpe-wordpiece-unigram-tokenizers-using-hugging-face.html
-
Level-Up This November with the ODSC West 2021 Keynotes and Training Sessions
At ODSC West 2021 this November 16th-18th, we’ll have 80+ training sessions and workshops on essential tools and languages led by some of the best and brightest minds in data science and AI.https://www.kdnuggets.com/2021/10/odsc-west-2021-keynotes-training-sessions.html
-
Data Preparation in R using dplyr, with Cheat Sheet!
Leverage the powerful data wrangling tools in R’s dplyr to clean and prepare your data.https://www.kdnuggets.com/2021/10/data-preparation-r-dplyr-cheat-sheet.html
-
Data Science Portfolio Project Ideas That Can Get You Hired (Or Not)">
Choosing what to include in your data science portfolio during the job search is the most important part of the process. Each project should be well-structured so that a hiring manager can assess your skills quickly. To help you get started, we highlight a few data science project ideas that you should consider for your portfolio.Data Science Portfolio Project Ideas That Can Get You Hired (Or Not)
https://www.kdnuggets.com/2021/10/data-science-portfolio-project-ideas.html
-
What are the differences between these two popular tech roles?Data Scientist vs Data Engineer Salary">
Data Scientist vs Data Engineer Salary
https://www.kdnuggets.com/2021/10/data-scientist-data-engineer-salary.html
-
KDnuggets™ News 21:n40, Oct 20: The 20 Python Packages You Need For Machine Learning and Data Science; Ace Data Science Interviews with Portfolio Projects
The 20 Python Packages You Need For Machine Learning and Data Science; How to Ace Data Science Interview by Working on Portfolio Projects; Deploying Your First Machine Learning API; Real Time Image Segmentation Using 5 Lines of Code; What is Clustering and How Does it Work?https://www.kdnuggets.com/2021/n40.html
-
2021 Data Engineer Salary Report Shares Insights on a Swiftly Evolving Market
Over the past few years, the data engineering market has seen tremendous growth. The acceleration of the data engineering market prompted us to create a new report specifically for data engineering professionals. You can download both the 2021 Data Engineering and 2021 Data Science & Analytics salary reports from our website for free.https://www.kdnuggets.com/2021/10/burtchworks-data-engineer-salary-report.html
-
How Data Professionals Can Impress Even When Busy
While there may be plenty of room for advancement even when busy, how to achieve that isn’t always clear. In that spirit, here are five ways you can impress your company leadership.https://www.kdnuggets.com/2021/10/data-professionals-impress-busy.html
-
11 Most Practical Data Science Skills for 2022
While the field of data science continues to evolve with exciting new progress in analytical approaches and machine learning, there remain a core set of skills that are foundational for all general practitioners and specialists, especially those who want to be employable with full-stack capabilities.https://www.kdnuggets.com/2021/10/11-most-practical-data-science-skills-2022.html
-
How to Create an Interactive Dashboard in Three Steps with KNIME Analytics Platform
In this blog post I will show you how to build a simple, but useful and good-looking dashboard to present your data - in three simple steps!https://www.kdnuggets.com/2021/10/interactive-dashboard-three-steps-knime-analytics-platform.html
-
Top Stories, Oct 11-17: Query Your Pandas DataFrames with SQL
Also: How to Ace Data Science Interview by Working on Portfolio Projects; AutoML: An Introduction Using Auto-Sklearn and Auto-PyTorch; How to Build Strong Data Science Portfolio as a Beginner; 8 Must-Have Git Commands for Data Scientistshttps://www.kdnuggets.com/2021/10/top-news-week-1011-1017.html
-
Knowledge Graph Forum: Technology Ecosystem and Business Applications
Ontotext is thrilled to invite you to the Ontotext & partners virtual Knowledge Graph Forum, Oct 26 & 27, 2021. This event is shaped by Ontotext’s vision that knowledge graphs serve as a hub for data, metadata and content. 35+ speakers from around the globe will share their experiences through real-life cases and platforms demonstrations. Save your spot now.https://www.kdnuggets.com/2021/10/ontotext-knowledge-graph-forum.html
-
Real Time Image Segmentation Using 5 Lines of Code
PixelLib Library is a library created to allow easy integration of object segmentation in images and videos using few lines of python code. PixelLib now provides support for PyTorch backend to perform faster, more accurate segmentation and extraction of objects in images and videos using PointRend segmentation architecture.https://www.kdnuggets.com/2021/10/real-time-image-segmentation-5-lines-code.html
-
Avoid These Five Behaviors That Make You Look Like A Data Novice">
If you are new to the Data Science industry or a well-versed veteran in all things data and analytics, there are always key pitfalls that each of us can easily slide into if we are not careful. These behaviors not only make us appear like novices, but they can risk our position as a trustworthy, likable data partner with stakeholder.Avoid These Five Behaviors That Make You Look Like A Data Novice
https://www.kdnuggets.com/2021/10/avoid-five-behaviors-data-novice.html
-
Serving ML Models in Production: Common Patterns
Over the past couple years, we've seen 4 common patterns of machine learning in production: pipeline, ensemble, business logic, and online learning. In the ML serving space, implementing these patterns typically involves a tradeoff between ease of development and production readiness. Ray Serve was built to support these patterns by being both easy to develop and production ready.https://www.kdnuggets.com/2021/10/serving-ml-models-production-common-patterns.html
-
KDnuggets Top Blogs Rewards for September 2021
The September blogs that earned KDnuggets Rewards include: Do You Read Excel Files with Python? There is a 1000x Faster Way; Data Scientists Without Data Engineering Skills Will Face the Harsh Truth; Path to Full Stack Data Science; Nine Tools I Wish I Mastered Before My PhD in Machine Learninghttps://www.kdnuggets.com/2021/10/top-blogs-rewards-sep.html
-
Learn from Northwestern Data Science experts
Build the essential technical, analytical, and leadership skills needed for careers in today's data-driven world in Northwestern’s Master of Science in Data Science program. Apply now.https://www.kdnuggets.com/2021/10/northwestern-learn-from-data-science-experts.html
-
How our Obsession with Algorithms Broke Computer Vision: And how Synthetic Computer Vision can fix it
Deep Learning radically improved Machine Learning as a whole. The Data-Centric revolution is about to do the same. In this post, we’ll take a look at the pitfalls of mainstream Computer Vision (CV) and discuss why Synthetic Computer Vision (SCV) is the future.https://www.kdnuggets.com/2021/10/obsession-algorithms-broke-computer-vision.html
-
New Computing Paradigm for AI: Processing-in-Memory (PIM) Architecture
As larger deep neural networks are trained on the latest and fastest chip technologies, an important challenge remains that bottlenecks performance -- and it is not compute power. You can try to calculate a DNN as fast as possible, but there is data -- and it has to move. Data pipelines on the chip are expensive and new solutions must be developed to advance capabilities.https://www.kdnuggets.com/2021/10/samsung-computing-paradigm-ai-in-memory.html
-
How to calculate confidence intervals for performance metrics in Machine Learning using an automatic bootstrap method
Are your model performance measurements very precise due to a “large” test set, or very uncertain due to a “small” or imbalanced test set?https://www.kdnuggets.com/2021/10/calculate-confidence-intervals-performance-metrics-machine-learning.html
-
Amazon Web Services Webinar: Leverage data sets to create a customer-centric strategy and improve business outcomes
Register now for this webinar, Oct 28, to learn how using third-party data enhances applications to better prioritize your target customer - helping you build a more customer-centric business.https://www.kdnuggets.com/2021/10/roidna-aws-webinar-customer-centric-strategy.html