
Search results for

    Found 12757 documents, 12483 searched:

  • Put Responsible AI into Practice: attend the digital event on December 7

    Learn best practice guidelines for building AI solutions responsibly. Join AI experts from Microsoft and BCG at Put Responsible AI into Practice—a free Azure digital event on December 7.

    https://www.kdnuggets.com/2021/11/microsoft-responsible-ai-practice-attend-digital-event.html

  • Sentiment Analysis API vs Custom Text Classification: Which one to choose?

    In this article, we are going to compare the sentiment extraction performance between Sentiment Analysis engines and Custom Text classification engines. The idea is to show pros and cons of these two types of engines on a concrete dataset.

    https://www.kdnuggets.com/2021/11/sentiment-analysis-api-custom-text-classification.html

  • KDnuggets: Personal History and Nuggets of Experience

    After 28+ years of publishing and editing KDnuggets, I am retiring and transitioning KDnuggets to Matthew Mayo, who will become the new editor-in-chief. I want to share with you my story of KDnuggets and highlight some of the useful nuggets of experience I learned along this amazing journey.

    https://www.kdnuggets.com/2021/11/kdnuggets-history.html

  • Clustering in Crowdsourcing: Methodology and Applications

    As a result of the efforts outlined in this article, we confirmed that clustering through crowdsourcing is indeed possible and works impressively well.

    https://www.kdnuggets.com/2021/11/clustering-crowdsourcing-methodology-applications.html

  • Building Massively Scalable Machine Learning Pipelines with Microsoft Synapse ML

    The new platform provides a single API to abstract dozens of ML frameworks and databases.

    https://www.kdnuggets.com/2021/11/building-massively-scalable-machine-learning-pipelines-microsoft-synapse-ml.html

  • New Poll: What Percentage of Your Machine Learning Models Have Been Deployed?

    Take a moment to participate in the latest KDnuggets poll and let the community know what percentage of your machine learning models have been deployed.

    https://www.kdnuggets.com/2021/11/percentage-machine-learning-models-deployed.html

  • Why Machine Learning Engineers are Replacing Data Scientists

    The hiring run for data scientists continues along at a strong clip around the world. But, there are other emerging roles that are demonstrating key value to organizations that you should consider based on your existing or desired skill sets.

    https://www.kdnuggets.com/2021/11/why-machine-learning-engineers-are-replacing-data-scientists.html

  • Top Stories, Nov 22-28: Most Common SQL Mistakes on Data Science Interviews

    Also: 19 Data Science Project Ideas for Beginners; How to Build a Knowledge Graph with Neo4J and Transformers; Data Scientists: How to Sell Your Project and Yourself; Where NLP is heading

    https://www.kdnuggets.com/2021/11/top-news-week-1122-1128.html

  • Sentiment Analysis with KNIME

    Check out this tutorial on how to approach sentiment classification with supervised machine learning algorithms.

    https://www.kdnuggets.com/2021/11/sentiment-analysis-knime.html

  • How to Build a Knowledge Graph with Neo4J and Transformers

    Learn to use custom Named Entity Recognition and Relation Extraction models.

    https://www.kdnuggets.com/2021/11/build-knowledge-graph-neo4j-transformers.html

  • PyCaret 2.3.5 Is Here! Learn What’s New

    Read about the new functionalities added in PyCaret’s recent release.

    https://www.kdnuggets.com/2021/11/pycaret-here-learn-new.html

  • A Spreadsheet that Generates Python: The Mito JupyterLab Extension

    You can call Mito into your Jupyter Environment and each edit you make will generate the equivalent Python in the code cell below.

    https://www.kdnuggets.com/2021/11/spreadsheet-generates-python-mito-jupyterlab-extension.html

  • Cartoon: Data Science for Thanksgiving

    A classic KDnuggets Thanksgiving cartoon examines the predicament of one group of fowl Data Scientists.

    https://www.kdnuggets.com/2021/11/cartoon-data-science-thanksgiving.html

  • What’s the difference between a Data Scientist and a Data Analyst?

    Find out the major differences between a Data Analyst and a Data Scientist, and read the author's pointers on what they would recommend you to do if you wish to make that transition from Data Analyst to Data Scientist.

    https://www.kdnuggets.com/2021/11/difference-data-scientist-data-analyst.html

  • Can You Become a Data Scientist Online?

    Until November 29th, you can join over 1.5 million students around the globe and gain the skills of successful data science professionals with unlimited annual access to the 365 Data Science Program at 72% OFF. Read on to learn more!

    https://www.kdnuggets.com/2021/11/365datascience-become-data-scientist-online.html

  • Top 4 Data Integration Tools for Modern Enterprises

    Maintaining a centralized data repository can simplify your business intelligence initiatives. Here are four data integration tools that can make data more valuable for modern enterprises.

    https://www.kdnuggets.com/2021/11/top-4-data-integration-tools-modern-enterprises.html

  • Accelerating AI with MLOps

    Companies are racing to use AI, but despite its vast potential, most AI projects fail. Examining and resolving operational issues upfront can help AI initiatives reach their full potential.

    https://www.kdnuggets.com/2021/11/accelerating-ai-mlops.html

  • Common Misconceptions About Differential Privacy

    This article will clarify some common misconceptions about differential privacy and what it guarantees.

    https://www.kdnuggets.com/2021/11/common-misconceptions-differential-privacy.html

  • Top Stories, Nov 15-21: 19 Data Science Project Ideas for Beginners

    Also: How I Redesigned over 100 ETL into ELT Data Pipelines; Where NLP is heading; Don’t Waste Time Building Your Data Science Network; Data Scientists: How to Sell Your Project and Yourself

    https://www.kdnuggets.com/2021/11/top-news-week-1115-1121.html

  • Gold Blog: Most Common SQL Mistakes on Data Science Interviews

    Sure, we all make mistakes -- which can be a bit more painful when we are trying to get hired -- so check out these typical errors applicants make while answering SQL questions during data science interviews.

    https://www.kdnuggets.com/2021/11/common-sql-mistakes-data-science-interviews.html

  • 5 Advanced Tips on Python Sequences

    Notes from Fluent Python by Luciano Ramalho.

    https://www.kdnuggets.com/2021/11/5-advanced-tips-python-sequences.html

  • 5 Tips to Get Your First Data Scientist Job

    Read some of the key things the author has learned during the infamous job seeking stage.

    https://www.kdnuggets.com/2021/11/5-tips-first-data-scientist-job.html

  • On-Device Deep Learning: PyTorch Mobile and TensorFlow Lite

    PyTorch and TensorFlow are the two leading AI/ML Frameworks. In this article, we take a look at their on-device counterparts PyTorch Mobile and TensorFlow Lite and examine them more deeply from the perspective of someone who wishes to develop and deploy models for use on mobile platforms.

    https://www.kdnuggets.com/2021/11/on-device-deep-learning-pytorch-mobile-tensorflow-lite.html

  • Dask DataFrame is not Pandas

    This article is the second article of an ongoing series on using Dask in practice. Each article in this series will be simple enough for beginners, but provide useful tips for real work. The next article in the series is about parallelizing for loops, and other embarrassingly parallel operations with dask.delayed.

    https://www.kdnuggets.com/2021/11/dask-dataframe-not-pandas.html

  • 3 Differences Between Coding in Data Science and Machine Learning

    The terms ‘data science’ and ‘machine learning’ are often used interchangeably. But while they are related, there are some glaring differences, so let’s take a look at the differences between the two disciplines, specifically as it relates to programming.

    https://www.kdnuggets.com/2021/11/3-differences-coding-data-science-machine-learning.html

  • Stop Blaming Humans for Bias in AI

    Can artificial intelligence be rid of bias? This is an important question, and it’s equally important that we look in the right place for the answer.

    https://www.kdnuggets.com/2021/11/stop-blaming-humans-bias-ai.html

  • Difference between distributed learning versus federated learning algorithms

    Want to know the difference between distributed and federated learning? Read this article to find out.

    https://www.kdnuggets.com/2021/11/difference-distributed-learning-federated-learning-algorithms.html

  • eBook: 101 Ways to Use Third-Party Data to Make Smarter Decisions

    To guide you in becoming a data-driven organization, AWS Data Exchange has created a new eBook, 101 Ways to Use Third-Party Data to Make Smarter Decisions. Learn how to transform the ‘currency’ of data into actionable business insights.

    https://www.kdnuggets.com/2021/11/roidna-ebook-101-ways-third-party-data-smarter-decisions.html

  • Build a Serverless News Data Pipeline using ML on AWS Cloud

    This is a guide on how to build a serverless data pipeline on AWS with a Machine Learning model deployed as a SageMaker endpoint.

    https://www.kdnuggets.com/2021/11/build-serverless-news-data-pipeline-ml-aws-cloud.html

  • Where NLP is heading

    Natural language processing research and applications are moving forward rapidly. Several trends have emerged from this progress and point to a future of more exciting possibilities and interesting opportunities in the field.

    https://www.kdnuggets.com/2021/11/where-nlp-is-heading.html

  • Data Scientists: How to Sell Your Project and Yourself

    Follow this formula for the perfect elevator pitch.

    https://www.kdnuggets.com/2021/11/data-scientists-sell-project.html

  • AI meets BI: Key capabilities to look for in a modern BI platform

    With the customer at its heart, modern augmented BI platforms no longer require scripting/coding skills or the knowledge to build the back-end data models, empowering even laymen to harness the power of raw data. As a user, here are the top AI capabilities that you need to look for in BI software.

    https://www.kdnuggets.com/2021/11/zoho-ai-meets-bi-key-capabilities-platform.html

  • Easy Synthetic Data in Python with Faker

    Faker is a Python library that generates fake data to supplement or take the place of real world data. See how it can be used for data science.

    https://www.kdnuggets.com/2021/11/easy-synthetic-data-python-faker.html

  • Inside recommendations: how a recommender system recommends

    We describe types of recommender systems, more specifically, algorithms and methods for content-based systems, collaborative filtering, and hybrid systems.

    https://www.kdnuggets.com/2021/11/recommendations-recommender-system.html

  • Book Metadata and Cover Retrieval Using OCR and Google Books API

    With KNIME, extracting critical pieces of information from images becomes as easy as ABC.

    https://www.kdnuggets.com/2021/11/book-metadata-cover-retrieval-ocr-google-books-api.html

  • KDnuggets™ News 21:n44, Nov 17: Don’t Waste Time Building Your Data Science Network; 19 Data Science Project Ideas for Beginners

    Don’t Waste Time Building Your Data Science Network; 19 Data Science Project Ideas for Beginners; How I Redesigned over 100 ETL into ELT Data Pipelines; Anecdotes from 11 Role Models in Machine Learning; The Ultimate Guide To Different Word Embedding Techniques In NLP

    https://www.kdnuggets.com/2021/n44.html

  • How to fast-track machine translation projects

    Data is the lifeblood of any successful machine learning model, and machine translation models are no exception. Without relevant and properly labelled data, even the most sophisticated model will be unable to achieve reliable results.

    https://www.kdnuggets.com/2021/11/defined-fast-track-machine-translation-projects.html

  • Virtual Presentation Tips for Data Scientists

    Learn how to effectively communicate your work.

    https://www.kdnuggets.com/2021/11/virtual-presentation-tips-data-scientists.html

  • 10 AI Project Ideas in Computer Vision

    The field of computer vision has seen the development of very powerful applications leveraging machine learning. These projects will introduce you to these techniques and guide you to more advanced practice to gain a deeper appreciation for the sophistication now available.

    https://www.kdnuggets.com/2021/11/10-ai-project-ideas-computer-vision.html

  • Two Simple Things You Need to Steal from Agile for Data and Analytics Work

    Peer Review and Definition of Done: small changes, BIG impact.

    https://www.kdnuggets.com/2021/11/simple-things-steal-agile-data-science-analytics.html

  • KDnuggets Top Blogs Rewards for October 2021

    The October blogs that won KDnuggets Rewards include: How I Tripled My Income With Data Science in 18 Months; What Google Recommends You do Before Taking Their Machine Learning or Data Science Course; How to Build Strong Data Science Portfolio as a Beginner; Data Scientist vs Data Engineer Salary.

    https://www.kdnuggets.com/2021/11/top-blogs-rewards-oct.html

  • What Are NVIDIA NGC Containers & How to Get Started Using Them

    NVIDIA, the pioneer in the GPU technologies and deep learning revolution, has come up with an excellent catalog of specialized containers that they call NGC Collections. In this article, we explore their basic usage and some variations.

    https://www.kdnuggets.com/2021/11/nvidia-ngc-containers-get-started.html

  • Gold Blog: 19 Data Science Project Ideas for Beginners

    This article features 19 data science projects for beginners, categorized into 7 full project tutorials, 5 places to come up with your own data science projects using data, and 7 skills-based data science projects.

    https://www.kdnuggets.com/2021/11/19-data-science-project-ideas-beginners.html

  • Top Stories, Nov 8-14: Don’t Waste Time Building Your Data Science Network

    Also: Data Scientist Career Path from Novice to First Job; Design Patterns for Machine Learning Pipelines; What Google Recommends You do Before Taking Their Machine Learning or Data Science Course; Salary Breakdown of the Top Data Science Jobs

    https://www.kdnuggets.com/2021/11/top-news-week-1108-1114.html

  • Silver Blog: How I Redesigned over 100 ETL into ELT Data Pipelines

    Learn how to level up your Data Pipelines!

    https://www.kdnuggets.com/2021/11/redesigned-over-100-etl-elt-data-pipelines.html

  • Anecdotes from 11 Role Models in Machine Learning

    The skills needed to create good data are also the skills needed for good leadership.

    https://www.kdnuggets.com/2021/11/anecdotes-11-role-models-machine-learning.html

  • Deep Learning on your phone: PyTorch C++ API for use on Mobile Platforms

    The PyTorch Deep Learning framework has a C++ API for use on mobile platforms. This article shows an end-to-end demo of how to write a simple C++ application with Deep Learning capabilities using the PyTorch C++ API such that the same code can be built for use on mobile platforms (both Android and iOS).

    https://www.kdnuggets.com/2021/11/deep-learning-mobile-phone-pytorch-c-api.html

  • 25 Github Repositories Every Python Developer Should Know

    Check out these repositories to help you improve your data science skills.

    https://www.kdnuggets.com/2021/11/25-github-repositories-python-developer.html

  • Top October Stories: How I Tripled My Income With Data Science in 18 Months; What Google Recommends You do Before Taking Their ML or DS Course

    Also: How to Build Strong Data Science Portfolio as a Beginner; Data Science Portfolio Project Ideas That Can Get You Hired (Or Not); Exclusive: OpenAI summarizes KDnuggets

    https://www.kdnuggets.com/2021/11/top-stories-2021-oct.html

  • Attend the Data Intelligence Summit to Learn from Data Thought Leaders

    Join Caserta and fellow data and analytics leaders, Nov 17, as they help guide you on how, what and why you need to transform your data ecosystem to cloud-based modern analytics.

    https://www.kdnuggets.com/2021/11/caserta-data-intelligence-summit-data-thought-leaders.html

  • What’s missing from self-serve BI and what we can do about it

    Self-service BI tools came with the expectation that they could provide a magic formula for easily helping everyone understand all the data. But such an end result isn't occurring in practice. To identify a better approach, we need to take a step back and determine what problem we are actually trying to solve.

    https://www.kdnuggets.com/2021/11/missing-self-serve-bi.html

  • Dream Come True: Building websites by thinking about them

    From the mind to the computer, make websites using your imagination!

    https://www.kdnuggets.com/2021/11/dream-come-true-allennlp-hacks-21.html

  • AWS Data Exchange Webinar: Maintain competitive edge with third-party financial services data

    Join this webinar, Nov 11, to learn how leveraging third-party financial services data can facilitate faster, intelligence-based decision-making that propels your company's business outcomes and digital transformation.

    https://www.kdnuggets.com/2021/11/roidna-aws-data-exchange-webinar-third-party-financial-data.html

  • 5 Things That Set a Data Scientist Apart From Other Professions

    Here are five things that help set the data scientist apart from other professions.

    https://www.kdnuggets.com/2021/11/5-things-set-data-scientist-apart-other-professions.html

  • The Ultimate Guide To Different Word Embedding Techniques In NLP

    A machine can only understand numbers. As a result, converting text to numbers, called embedding text, is an actively researched topic. In this article, we review different word embedding techniques for converting text into vectors.

    https://www.kdnuggets.com/2021/11/guide-word-embedding-techniques-nlp.html

  • Gold Blog: Don’t Waste Time Building Your Data Science Network

    Instead, focus on what matters.

    https://www.kdnuggets.com/2021/11/waste-time-building-data-science-network.html

  • KDnuggets™ News 21:n43, Nov 10: Data Scientist Career Path from Novice to First Job; Neural Networks from a Bayesian Perspective

    Data Scientist Career Path: from Novice to First Job; Understand Neural Networks from a Bayesian Perspective; The Best Ways for Data Professionals to Market AWS Skills; Build Your Own Automated Machine Learning App.

    https://www.kdnuggets.com/2021/n43.html

  • KDnuggets Top Blogs Rewards Program Resumes in December

    After a pause, we will be resuming the KDnuggets Top Blog Rewards Program, starting with blogs published on KDnuggets in December. The program will be bigger, with $3,000 (USD) divided among the top 8 most viewed guest blogs. Original blogs are rewarded at 3X the rate of reposts. Submit your original blog to KDnuggets first!

    https://www.kdnuggets.com/2021/11/top-blogs-reward-program-resumes.html

  • SAS Analytics Pro – now available for on-site or containerized cloud-native deployment – providing your entry point into SAS Viya

    Now, SAS Analytics Pro includes a new option for containerized cloud-native deployment. This makes SAS Analytics Pro a perfect entry point into SAS Viya.

    https://www.kdnuggets.com/2021/11/sas-analytics-pro-now-available.html

  • OpenAI’s Approach to Solve Math Word Problems

    OpenAI's latest research aims to solve math word problems. Let's dive a bit deeper into the ideas behind this new research.

    https://www.kdnuggets.com/2021/11/open-ai-approach-solve-math-word-problems.html

  • The Common Misconceptions About Machine Learning

    Beginners in the field can often have many misconceptions about machine learning that sometimes can be a make-it-or-break-it moment for the individual switching careers or starting fresh. This article clearly describes the ground truth realities about learning new ML skills and eventually working professionally as a machine learning engineer.

    https://www.kdnuggets.com/2021/11/common-misconception-about-machine-learning.html

  • What Comes After HDF5? Seeking a Data Storage Format for Deep Learning

    HDF5 is one of the most popular and reliable formats for non-tabular, numerical data, but it is not optimized for deep learning work. This article suggests what an ML-native data format should look like to truly serve the needs of modern data scientists.

    https://www.kdnuggets.com/2021/11/after-hdf5-data-storage-format-deep-learning.html

  • SigOpt AI & HPC Summit, Nov 16 – Virtual and Free

    Learn how PayPal, AWS, Intel, Accenture, MIT and Stanford apply experimentation to build better AI at the free SigOpt AI & HPC Summit.

    https://www.kdnuggets.com/2021/11/sigopt-ai-hpc-summit-virtual-free.html

  • POS Tagging, Explained

    Learn about the strengths of part-of-speech tagging, and about how a strong POS tagger can contribute to natural language understanding.

    https://www.kdnuggets.com/2021/11/pos-tagging-explained.html

  • Top Stories, Nov 1-7: What Google Recommends You do Before Taking Their Machine Learning or Data Science Course

    Also: Design Patterns for Machine Learning Pipelines; Data Scientist Career Path from Novice to First Job; Salary Breakdown of the Top Data Science Jobs; ORDAINED: The Python Project Template

    https://www.kdnuggets.com/2021/11/top-news-week-1101-1107.html

  • 7 Top Open Source Datasets to Train Natural Language Processing (NLP) & Text Models

    With a lot of excitement and research around NLP, there are growing opportunities to apply these technologies to real-world scenarios. It's not trivial to become familiar with NLP and these open-source data sets can help you increase your skills.

    https://www.kdnuggets.com/2021/11/top-open-source-datasets-nlp.html

  • Federated Learning: Google’s Take

    This blog will be focusing on the work Google has been doing in the Federated Learning space.

    https://www.kdnuggets.com/2021/11/federated-learning-googles-take.html

  • Build Your Own Automated Machine Learning App

    In this article, we will create an automated machine learning web app you can actually use.

    https://www.kdnuggets.com/2021/11/diy-automated-machine-learning-app.html

  • Machine Learning Safety: Unsolved Problems

    There remain critical challenges in machine learning that, if left unresolved, could lead to unintended consequences and unsafe use of AI in the future. As an important and active area of research, roadmaps are being developed to help guide continued ML research and use toward meaningful and robust applications.

    https://www.kdnuggets.com/2021/11/machine-learning-safety-unsolved-problems.html

  • The Best Ways for Data Professionals to Market AWS Skills in 2022

    Knowing your way around Amazon Web Services (AWS) is increasingly useful. Here are five ways to market your AWS skills in today’s job market.

    https://www.kdnuggets.com/2021/11/best-ways-data-professionals-market-aws-skills.html

  • Toloka 101 Live Demo: Learn how to get reliable training data for machine learning, Nov 11

    Toloka is a crowdsourced data labeling platform that handles data collection and annotation projects for machine learning at any scale. In this Nov 11 Live Demo, learn how to get reliable training data for machine learning.

    https://www.kdnuggets.com/2021/11/toloka-training-data-machine-learning.html

  • A First Principles Theory of Generalization

    New research from the University of California, Berkeley sheds new light on how to quantify the knowledge of neural networks.

    https://www.kdnuggets.com/2021/11/first-principles-theory-generalization.html

  • AI Infinite Training & Maintaining Loop

    Productizing AI is an infrastructure orchestration problem. In planning your solution design, you should use continuous monitoring, retraining, and feedback to ensure stability and sustainability.

    https://www.kdnuggets.com/2021/11/ai-infinite-training-maintaining-loop.html

  • NLP for Business in the Time of BERTera: Seven Misplaced Passions

    This article is a brief summary of our observations on some common client misperceptions with respect to recent developments in NLP, especially the use of large-scale models and datasets.

    https://www.kdnuggets.com/2021/11/nlp-business-bertera-seven-misplaced-passions.html

  • 7 of The Coolest Machine Learning Topics of 2021 at ODSC West

    At our upcoming event this November 16th-18th in San Francisco, ODSC West 2021 will feature a plethora of talks, workshops, and training sessions on machine learning topics, deep learning, NLP, MLOps, and so on. You can register now for 20% off all ticket types, or register for a free AI Expo Pass to see what some big names in AI are doing now.

    https://www.kdnuggets.com/2021/11/odsc-7-coolest-machine-learning-topics.html

  • Visual Scoring Techniques for Classification Models

    Read this article on assessing model performance in a broader context.

    https://www.kdnuggets.com/2021/11/visual-scoring-techniques-classification-models.html

  • Silver Blog: Data Scientist Career Path from Novice to First Job

    If you are beginning your data science journey, then you must be prepared to plan it out as a step-by-step process that will guide you from being a total newbie to getting your first job as a data scientist. These tips and educational resources should be useful for you and add confidence as you take that first big step.

    https://www.kdnuggets.com/2021/11/data-scientist-career-path-first-job.html

  • Neural Networks from a Bayesian Perspective

    This article looks at neural networks from a Bayesian perspective.

    https://www.kdnuggets.com/2021/11/neural-networks-bayesian-perspective.html

  • KDnuggets™ News 21:n42, Nov 3: Google Recommendations Before Taking Their Machine Learning Course; Guide to Data Science Jobs

    What Google Recommends You do Before Taking Their Machine Learning or Data Science Course; A Guide to 14 Different Data Science Jobs; Analyze Python Code in Jupyter Notebooks; Machine Learning Model Development and Model Operations: Principles and Practices; Want to Join a Bank? Everything Data Scientists Need to Know About Working in Fintech

    https://www.kdnuggets.com/2021/n42.html

  • Three reasons to self-host your product analytics

    Want three reasons to avoid the cloud and host your own analytics platform? More data, more control, more secure.

    https://www.kdnuggets.com/2021/11/posthog-three-reasons-self-host-product-analysis.html

  • ORDAINED: The Python Project Template

    Recently I decided to take the time to better understand the Python packaging ecosystem and create a project boilerplate template as an improvement over copying a directory tree and doing find and replace.

    https://www.kdnuggets.com/2021/11/ordained-python-project-template.html

  • Silver Blog: Design Patterns for Machine Learning Pipelines

    ML pipeline design has undergone several evolutions in the past decade with advances in memory and processor performance, storage systems, and the increasing scale of data sets. We describe how these design patterns changed, what processes they went through, and their future direction.

    https://www.kdnuggets.com/2021/11/design-patterns-machine-learning-pipelines.html

  • Salary Breakdown of the Top Data Science Jobs

    Machine Learning vs NLP vs Data Engineer vs Data Scientist, and what it means to be in each role.

    https://www.kdnuggets.com/2021/11/salary-breakdown-top-data-science-jobs.html

  • Top Stories, Oct 25-31: How I Tripled My Income With Data Science in 18 Months; Machine Learning Model Development and Model Operations: Principles and Practices

    Also: What Google Recommends You do Before Taking Their Machine Learning or Data Science Course; Learn To Reproduce Papers: Beginner’s Guide; 365 Data Science courses free until 18 November; A Guide to 14 Different Data Science Jobs

    https://www.kdnuggets.com/2021/11/top-news-week-1025-1031.html

  • Advanced PyTorch Lightning with TorchMetrics and Lightning Flash

    In this tutorial we will be diving deeper into two additional tools you should be using: TorchMetrics and Lightning Flash. TorchMetrics unsurprisingly provides a modular approach to define and track useful metrics across batches and devices, while Lightning Flash offers a suite of functionality facilitating more efficient transfer learning and data handling, and a recipe book of state-of-the-art approaches to typical deep learning problems.

    https://www.kdnuggets.com/2021/11/advanced-pytorch-lightning-torchmetrics-lightning-flash.html

  • Top 5 Time Series Methods

    Data that varies in time can offer powerful applications and use cases for data scientists to analyze. This overview considers the top techniques you can learn to understand and gain insight from time-series data.

    https://www.kdnuggets.com/2021/11/top-5-time-series-methods.html

  • Is the Modern Data Stack Leaving You Behind?

    The modern data stack narrative is largely dominated by analytics engineering. Where does that leave data engineers? Discover the difference between the MDS for data engineers & analytics engineers.

    https://www.kdnuggets.com/2021/11/modern-data-stack-leaving-behind.html

  • The Case for a Global Responsible AI Framework

    Public and private organizations have come out with their own sets of AI principles, focusing on AI-related risks from their perspective. However, it’s imperative to have a global consensus on Responsible AI – based on data governance, transparency and accountability – on how to utilize and benefit from AI in a way that is both consistent and ethical.

    https://www.kdnuggets.com/2021/10/responsible-ai-framework.html

  • Multivariate Time Series Analysis with an LSTM based RNN

    Check out this codeless solution using the Keras integration.

    https://www.kdnuggets.com/2021/10/multivariate-time-series-analysis-lstm-based-rnn.html

  • ETL and ELT: A Guide and Market Analysis

    ETL and related techniques remain a powerful and foundational tool in the data industry. We explain what ETL is and how ETL and ELT processes have evolved over the years, with a close eye toward how third-generation ETL tools are about to disrupt standard data processing practices.

    https://www.kdnuggets.com/2021/10/etl-elt-guide-market-analysis.html

  • Simple Text Scraping, Parsing, and Processing with this Python Library

    Scraping, parsing, and processing text data from the web can be difficult. But it can also be easy, using Newspaper3k.

    https://www.kdnuggets.com/2021/10/simple-text-scraping-parsing-processing-python-library.html

  • Platinum Blog, Rewards Blog: What Google Recommends You do Before Taking Their Machine Learning or Data Science Course

    First steps to learning data science & machine learning are the foundations.

    https://www.kdnuggets.com/2021/10/google-recommends-before-machine-learning-data-science-course.html

  • Want to Join a Bank? Everything Data Scientists Need to Know About Working in Fintech

    There is ample opportunity for data scientists in the financial services sector. The career experience can be very different, however, from similar roles at pure technology organizations. So, it's best to first consider if this industry is right for your interests, preferences for how you work, and long-term goals.

    https://www.kdnuggets.com/2021/10/bank-data-scientists-working-fintech.html

  • Analyze Python Code in Jupyter Notebooks

    We present a new tool that integrates modern code analysis techniques with Jupyter notebooks and helps developers find bugs as they write code.

    https://www.kdnuggets.com/2021/10/analyze-python-code-jupyter-notebooks.html

  • How to Build Data Frameworks with Open Source Tools to Enhance Agility and Security

    Let’s take a look at how to harness open source tools to build your data frameworks.

    https://www.kdnuggets.com/2021/10/build-data-frameworks-open-source-tools-agility-security.html

  • A Guide to 14 Different Data Science Jobs

    The field of data science is growing into one that features a variety of job titles. This guide reviews different positions available for you to consider if you have a data science background.

    https://www.kdnuggets.com/2021/10/guide-14-different-data-science-jobs.html

  • Machine Learning Model Development and Model Operations: Principles and Practices

    ML model management and the delivery of high-performing models are as important as the initial build of the model and the choice of the right dataset. The concepts of model retraining, model versioning, model deployment and model monitoring are the basis for machine learning operations (MLOps), which helps data science teams deliver high-performing models.

    https://www.kdnuggets.com/2021/10/machine-learning-model-development-operations-principles-practice.html

  • KDnuggets™ News 21:n41, Oct 27: How I Tripled My Income With Data Science in 18 Months; Data Scientist vs Data Engineer Salary

    Read "How I Tripled My Income With Data Science in 18 Months"; Compare Data Scientist vs Data Engineer Salary; Learn To Reproduce Research Papers; Exclusive: OpenAI summarizes KDnuggets; Data Science Portfolio Project Ideas That Can Get You Hired (Or Not); and more.

    https://www.kdnuggets.com/2021/n41.html

  • Export Data from the Web Scraping Tool through Zapier Integration

    Octoparse makes it easy to collect data from websites and automate workflows on the web. Zapier is an online platform that allows you to automate workflows by connecting the apps and services you use. Zapier connection, the new feature in Octoparse, makes it possible to connect the product with apps including Google Drive, Google Sheets, Dropbox, Trello, Slack, and load more apps in a second with NO CODE.

    https://www.kdnuggets.com/2021/10/octoparse-web-scraping-zapier-integration.html

  • Getting Started with PyTorch Lightning

    As a library designed for production research, PyTorch Lightning streamlines hardware support and distributed training as well, and we’ll show how easy it is to move training to a GPU toward the end.

    https://www.kdnuggets.com/2021/10/getting-started-pytorch-lightning.html

  • How To Defeat The Machine Learning Engineer Impostor Syndrome

    How many times have you taken yet another online course on machine learning or read yet another paper on a new emerging topic, to be up-to-date in this crazy fast-paced AI/ML world -- only to keep feeling like an ML engineer impostor? These three personal tips can help you overcome the classic (and common) impostor syndrome behind every emerging ML engineer who wants to be better at what they do.

    https://www.kdnuggets.com/2021/10/defeat-machine-learning-engineer-impostor-syndrome.html

  • Four Basic Steps in Data Preparation

    What we would like to do here is introduce four very basic and very general steps in data preparation for machine learning algorithms. We will describe how and why to apply such transformations within a specific example.

    https://www.kdnuggets.com/2021/10/four-basic-steps-data-preparation.html

  • Top Stories, Oct 18-24: How I Tripled My Income With Data Science in 18 Months; Data Science Portfolio Project Ideas That Can Get You Hired (Or Not)

    Also: Data Scientist vs Data Engineer Salary; The 20 Python Packages You Need For Machine Learning and Data Science; Exclusive: OpenAI summarizes KDnuggets; Real Time Image Segmentation Using 5 Lines of Code

    https://www.kdnuggets.com/2021/10/top-news-week-1018-1024.html

  • 365 Data Science courses free until 18 November

    365 Data Science, an online educational platform providing beginner-to-advanced courses for data science and business analytics professionals, will unlock the entire library of courses, hands-on exercises, certificate exams, and resume builder for a full 30-day period from Oct. 18 to Nov. 18.

    https://www.kdnuggets.com/2021/10/365datascience-courses-free.html

  • Guide To Finding The Right Predictive Maintenance Machine Learning Techniques

    What happens to a life so dependent on machines, when that particular machine breaks down? This is precisely why there’s a dire need for predictive maintenance with machine learning.

    https://www.kdnuggets.com/2021/10/guide-right-predictive-maintenance-machine-learning-techniques.html

  • Save Sarah Connor with Data Science

    Data science and data privacy are deeply interwoven, and must be carefully considered by practitioners. In comparing the Safe Harbour and Expert Determination data obfuscation approaches, Safe Harbour has been very popular among data engineers but has fundamental limitations, where Expert Determination offers important advantages.

    https://www.kdnuggets.com/2021/10/save-sarah-connor-data-science.html

  • Learn To Reproduce Papers: Beginner’s Guide

    Step-by-step instructions on how to understand Deep Learning papers and implement the described approaches.

    https://www.kdnuggets.com/2021/10/learn-reproduce-papers-beginners-guide.html

  • Exclusive: OpenAI summarizes KDnuggets

    OpenAI has recently done amazing work summarizing full-length books. We have asked OpenAI to summarize two recent KDnuggets posts, and the results have a very human-like quality. Only the last line betrays the inhuman intelligence at work.

    https://www.kdnuggets.com/2021/10/exclusive-openai-summarizes-kdnuggets.html

  • How to Transform Your Data in Snowflake

    Data transformation is the biggest bottleneck in the analytics workflow. The modern approach to data pipelines is ELT, or extract, load, and transform, with data transformation performed in your Snowflake data warehouse. A new breed of “no-/low-code” data transformation tools, such as Datameer, is emerging to allow the wider analytics community to transform data on their own, eliminating analytics bottlenecks.

    https://www.kdnuggets.com/2021/10/datameer-transform-data-snowflake.html

  • Deploying Serverless spaCy Transformer Model with AWS Lambda

    A step-by-step guide on how to deploy a serverless NER transformer model.

    https://www.kdnuggets.com/2021/10/deploying-serverless-spacy-transformer-model-aws-lambda.html

  • Introduction to AutoEncoder and Variational AutoEncoder (VAE)

    Autoencoders and their variants are interesting and powerful artificial neural networks used in unsupervised learning scenarios. Learn how autoencoders perform in their different approaches and how to implement them with Keras on the instructional data set of the MNIST digits.

    https://www.kdnuggets.com/2021/10/introduction-autoencoder-variational-autoencoder-vae.html

  • Find the Best-Matching Distribution for Your Data Effortlessly

    How to find the best-matching statistical distributions for your data points — in an automated and easy way. And, then how to extend the utility further.

    https://www.kdnuggets.com/2021/10/best-matching-distribution-data-effortlessly.html

  • DATAnalyze 2021 Analytics Hackathon Sponsored by Microsoft and WorldData.AI, $125,000 in prizes!

    Tech Tree Root is excited to introduce you to our DATAnalyze 2021 sponsors Microsoft, WorldData.AI, and HBCU Connect! Our online analytics hackathon is offering up to $125,000 USD in prizes!

    https://www.kdnuggets.com/2021/10/datanalyze-2021-analytics-hackathon.html

  • Training BPE, WordPiece, and Unigram Tokenizers from Scratch using Hugging Face

    Comparing the tokens generated by SOTA tokenization algorithms using Hugging Face's tokenizers package.

    https://www.kdnuggets.com/2021/10/bpe-wordpiece-unigram-tokenizers-using-hugging-face.html

  • How I Tripled My Income With Data Science in 18 Months

    Over a year ago, I lost my job due to the COVID-19 pandemic. During this time, I taught myself data science and tripled my income.

    https://www.kdnuggets.com/2021/10/tripled-my-income-data-science-18-months.html

  • Simple Question Answering Web App with HuggingFace Pipelines

    See how easy it can be to build a simple web app for question answering from text using Streamlit and HuggingFace pipelines.

    https://www.kdnuggets.com/2021/10/simple-question-answering-web-app-hugging-face-pipelines.html

  • Level-Up This November with the ODSC West 2021 Keynotes and Training Sessions

    At ODSC West 2021 this November 16th-18th, we’ll have 80+ training sessions and workshops on essential tools and languages led by some of the best and brightest minds in data science and AI.

    https://www.kdnuggets.com/2021/10/odsc-west-2021-keynotes-training-sessions.html

  • Data Preparation in R using dplyr, with Cheat Sheet!

    Leverage the powerful data wrangling tools in R’s dplyr to clean and prepare your data.

    https://www.kdnuggets.com/2021/10/data-preparation-r-dplyr-cheat-sheet.html

  • Data Science Portfolio Project Ideas That Can Get You Hired (Or Not)

    Choosing what to include in your data science portfolio during the job search is the most important part of the process. Each project should be well-structured so that a hiring manager can assess your skills quickly. To help you get started, we highlight a few data science project ideas that you should consider for your portfolio.

    https://www.kdnuggets.com/2021/10/data-science-portfolio-project-ideas.html

  • Data Scientist vs Data Engineer Salary

    What are the differences between these two popular tech roles?

    https://www.kdnuggets.com/2021/10/data-scientist-data-engineer-salary.html

  • KDnuggets™ News 21:n40, Oct 20: The 20 Python Packages You Need For Machine Learning and Data Science; Ace Data Science Interviews with Portfolio Projects

    The 20 Python Packages You Need For Machine Learning and Data Science; How to Ace Data Science Interview by Working on Portfolio Projects; Deploying Your First Machine Learning API; Real Time Image Segmentation Using 5 Lines of Code; What is Clustering and How Does it Work?

    https://www.kdnuggets.com/2021/n40.html

  • 2021 Data Engineer Salary Report Shares Insights on a Swiftly Evolving Market

    Over the past few years, the data engineering market has seen tremendous growth. The acceleration of the data engineering market prompted us to create a new report specifically for data engineering professionals. You can download both the 2021 Data Engineering and 2021 Data Science & Analytics salary reports from our website for free.

    https://www.kdnuggets.com/2021/10/burtchworks-data-engineer-salary-report.html

  • How Data Professionals Can Impress Even When Busy

    While there may be plenty of room for advancement even when busy, how to achieve that isn’t always clear. In that spirit, here are five ways you can impress your company leadership.

    https://www.kdnuggets.com/2021/10/data-professionals-impress-busy.html

  • 11 Most Practical Data Science Skills for 2022

    While the field of data science continues to evolve with exciting new progress in analytical approaches and machine learning, there remain a core set of skills that are foundational for all general practitioners and specialists, especially those who want to be employable with full-stack capabilities.

    https://www.kdnuggets.com/2021/10/11-most-practical-data-science-skills-2022.html

  • How to Create an Interactive Dashboard in Three Steps with KNIME Analytics Platform

    In this blog post I will show you how to build a simple, but useful and good-looking dashboard to present your data - in three simple steps!

    https://www.kdnuggets.com/2021/10/interactive-dashboard-three-steps-knime-analytics-platform.html

  • Top Stories, Oct 11-17: Query Your Pandas DataFrames with SQL

    Also: How to Ace Data Science Interview by Working on Portfolio Projects; AutoML: An Introduction Using Auto-Sklearn and Auto-PyTorch; How to Build Strong Data Science Portfolio as a Beginner; 8 Must-Have Git Commands for Data Scientists

    https://www.kdnuggets.com/2021/10/top-news-week-1011-1017.html

  • Knowledge Graph Forum: Technology Ecosystem and Business Applications

    Ontotext is thrilled to invite you to the Ontotext & partners virtual Knowledge Graph Forum, Oct 26 & 27, 2021. This event is shaped by Ontotext’s vision that knowledge graphs serve as a hub for data, metadata and content. 35+ speakers from around the globe will share their experiences through real-life cases and platforms demonstrations. Save your spot now.

    https://www.kdnuggets.com/2021/10/ontotext-knowledge-graph-forum.html

  • Real Time Image Segmentation Using 5 Lines of Code

    PixelLib is a library created to allow easy integration of object segmentation in images and videos using a few lines of Python code. PixelLib now provides support for a PyTorch backend to perform faster, more accurate segmentation and extraction of objects in images and videos using the PointRend segmentation architecture.

    https://www.kdnuggets.com/2021/10/real-time-image-segmentation-5-lines-code.html

  • Avoid These Five Behaviors That Make You Look Like A Data Novice

    Whether you are new to the Data Science industry or a well-versed veteran in all things data and analytics, there are always key pitfalls that each of us can easily slide into if we are not careful. These behaviors not only make us appear like novices, but they can risk our position as a trustworthy, likable data partner with stakeholders.

    https://www.kdnuggets.com/2021/10/avoid-five-behaviors-data-novice.html

  • Serving ML Models in Production: Common Patterns

    Over the past couple years, we've seen 4 common patterns of machine learning in production: pipeline, ensemble, business logic, and online learning. In the ML serving space, implementing these patterns typically involves a tradeoff between ease of development and production readiness. Ray Serve was built to support these patterns by being both easy to develop and production ready.

    https://www.kdnuggets.com/2021/10/serving-ml-models-production-common-patterns.html
