Search results for

    Found 6241 documents, 5970 searched:

  • Converting Text Documents to Token Counts with CountVectorizer

    The post explains the significance of CountVectorizer and demonstrates its implementation with Python code.

    https://www.kdnuggets.com/2022/10/converting-text-documents-token-counts-countvectorizer.html

  • KDnuggets News, October 19: 3 Valuable Skills That Have Doubled My Income as a Data Scientist • The Complete Free PyTorch Course for Deep Learning

    3 Valuable Skills That Have Doubled My Income as a Data Scientist • The Complete Free PyTorch Course for Deep Learning • 7 Free Platforms for Building a Strong Data Science Portfolio • Mathematics for Machine Learning: The Free eBook • 25 Advanced SQL Interview Questions for Data Scientists

    https://www.kdnuggets.com/2022/n41.html

  • Become Data-Driven Faster with DataCamp’s Analyst Takeover

    Looking for a clear (and fast) data upskilling path? Discover DataCamp’s hands-on analyst tracks and certification program—no prior analyst experience required.

    https://www.kdnuggets.com/2022/10/datacamp-data-driven-faster-analyst-takeover.html

  • 25 Advanced SQL Interview Questions for Data Scientists

    KDnuggets Top Blog: Check out this collection of advanced SQL interview questions with answers.

    https://www.kdnuggets.com/2022/10/25-advanced-sql-interview-questions-data-scientists.html

  • 5 Free Courses to Master Calculus

    Calculus is one of the foundational pillars of understanding the mathematics behind machine learning algorithms. The post shares five free courses to help you master calculus and learn its real-world applications.

    https://www.kdnuggets.com/2022/10/5-free-courses-master-calculus.html

  • Essential Books You Need to Become a Data Engineer

    KDnuggets Top Blog: In this article, I will go through the roadmap of books you need to become a Data Engineer.

    https://www.kdnuggets.com/2022/10/essential-books-need-become-data-engineer.html

  • Working With Sparse Features In Machine Learning Models

    Sparse features can cause problems like overfitting and suboptimal results in learning models, and understanding why this happens is crucial when developing models. Multiple methods, including dimensionality reduction, are available to overcome issues due to sparse features.

    https://www.kdnuggets.com/2021/01/sparse-features-machine-learning-models.html

  • Implementing Adaboost in Scikit-learn

    It is called Adaptive Boosting due to the fact that the weights are re-assigned to each instance, with higher weights being assigned to instances that are not correctly classified - therefore it ‘adapts’.

    https://www.kdnuggets.com/2022/10/implementing-adaboost-scikitlearn.html

  • 7 Free Platforms for Building a Strong Data Science Portfolio

    KDnuggets Top Blog: Outshine others and increase your odds of getting hired by maintaining a data science portfolio with projects, resumes, blogs, and reports.

    https://www.kdnuggets.com/2022/10/7-free-platforms-building-strong-data-science-portfolio.html

  • Mathematics for Machine Learning: The Free eBook

    Check out this free ebook covering the fundamentals of mathematics for machine learning, as well as its companion website of exercises and Jupyter notebooks.

    https://www.kdnuggets.com/2020/04/mathematics-machine-learning-book.html

  • Explaining Explainable AI for Conversations

    Something is missing in artificial intelligence – trust.

    https://www.kdnuggets.com/2022/10/explaining-explainable-ai-conversations.html

  • Sparse Matrix Representation in Python

    Leveraging sparse matrix representations for your data when appropriate can spare you memory storage. Have a look at the reasons why, see how to create sparse matrices in Python, and compare the memory requirements for standard and sparse representations of the same data.

    https://www.kdnuggets.com/2020/05/sparse-matrix-representation-python.html

  • Classification Metrics Walkthrough: Logistic Regression with Accuracy, Precision, Recall, and ROC

    In this article, I will be going through 4 common classification metrics: Accuracy, Precision, Recall, and ROC in relation to Logistic Regression.

    https://www.kdnuggets.com/2022/10/classification-metrics-walkthrough-logistic-regression-accuracy-precision-recall-roc.html

  • 5 Free Courses to Master Linear Algebra

    Linear Algebra is an important subfield of mathematics and forms a core foundation of machine learning algorithms. The post shares five free courses to master the concepts of linear algebra.

    https://www.kdnuggets.com/2022/10/5-free-courses-master-linear-algebra.html

  • Data Representation for Natural Language Processing Tasks

    In NLP we must find a way to represent our data (a series of texts) to our systems (e.g. a text classifier). As Yoav Goldberg asks, "How can we encode such categorical data in a way which is amenable for use by a statistical classifier?" Enter the word vector.

    https://www.kdnuggets.com/2018/11/data-representation-natural-language-processing.html

  • KDnuggets News, October 12: 10 Cheat Sheets You Need To Ace Data Science Interview • NLP Interview Questions

    10 Cheat Sheets You Need To Ace Data Science Interview • NLP Interview Questions • A Day in the Life of a Machine Learning Engineer • 11 Questions About Data Engineers: What's the profession about, and where's it heading? • The ABCs of NLP, From A to Z

    https://www.kdnuggets.com/2022/n40.html

  • What it takes to crack Machine Learning Engineer interviews

    Interview Kickstart’s Machine Learning Interview Course is the first-of-its-kind, ML-specific tech interview prep program designed and taught by FAANG+ instructors. Learn more about the program.

    https://www.kdnuggets.com/2022/10/interview-kickstart-crack-machine-learning-engineer-interviews.html

  • Statistical Functions in Python

    In this tutorial, we will cover some useful statistical functions that can be applied to pandas and series objects.

    https://www.kdnuggets.com/2022/10/statistical-functions-python.html

  • 3 Valuable Skills That Have Doubled My Income as a Data Scientist

    KDnuggets Top Blog: In a year, I have learned three essential skills that have opened a new world of possibilities.

    https://www.kdnuggets.com/2022/10/3-valuable-skills-doubled-income-data-scientist.html

  • The Complete Free PyTorch Course for Deep Learning

    KDnuggets Top Blog: Do you want to learn PyTorch for machine learning and deep learning? Check out this 24 hour long video course with accompanying notes and courseware for free. Did I mention it's free?

    https://www.kdnuggets.com/2022/10/complete-free-pytorch-course-deep-learning.html

  • Top Posts October 3-9: How to Select Rows and Columns in Pandas

    How to Select Rows and Columns in Pandas Using [ ], .loc, iloc, .at and .iat • Top Free Git GUI Clients for Beginners • Decision Tree Algorithm, Explained • 7 Techniques to Handle Imbalanced Data • Free Algorithms in Python Course

    https://www.kdnuggets.com/2022/10/top-posts-week-1003-1009.html

  • How to Build a Data Science Enablement Team: A Complete Guide

    A Data Science Enablement Team consists of people from various departments like marketing, sales, product development, etc. They are responsible for providing the necessary tools and resources to help the data scientists do their job more efficiently.

    https://www.kdnuggets.com/2022/10/build-data-science-enablement-team-complete-guide.html

  • 3 Simple Ways to Speed Up Your Python Code

    The post explains three popular frameworks, PySpark, Dask, and Ray, and discusses various factors to select the most appropriate one for your project.

    https://www.kdnuggets.com/2022/10/3-simple-ways-speed-python-code.html

  • A Beginner’s Guide to Web Scraping Using Python

    This article serves as a beginner’s guide to web scraping using Python and looks at the different frameworks and methods you can use, outlined in simple terms.

    https://www.kdnuggets.com/2022/10/beginner-guide-web-scraping-python.html

  • A Day in the Life of a Machine Learning Engineer

    What does a day in the life of a machine learning engineer look like for you?

    https://www.kdnuggets.com/2022/10/day-life-machine-learning-engineer.html

  • 10 Cheat Sheets You Need To Ace Data Science Interview

    KDnuggets Top Blog: The only cheat you need for a job interview and data professional life. It includes SQL, web scraping, statistics, data wrangling and visualization, business intelligence, machine learning, deep learning, NLP, and super cheat sheets.

    https://www.kdnuggets.com/2022/10/10-cheat-sheets-need-ace-data-science-interview.html

  • How to Get Up and Running with SQL – A List of Free Learning Resources

    We have compiled a list of the top free resources to help new data practitioners learn SQL. These include free online courses and resources to get the most out of your SQL skills.

    https://www.kdnuggets.com/2022/10/get-running-sql-list-free-learning-resources.html

  • Debunking the Myth of the Citizen Data Scientist

    While there are some benefits to having citizen data scientists, they are no silver bullet – and they certainly aren’t a replacement for true data scientists.

    https://www.kdnuggets.com/2022/10/debunking-myth-citizen-data-scientist.html

  • Feature Store Summit 2022: A free conference on Feature Engineering

    Next week, the Feature Store Summit 2022 will bring together leading innovators in cutting-edge technologies to discuss all things data and AI!

    https://www.kdnuggets.com/2022/10/hopsworks-feature-store-summit-2022-free-conference-feature-engineering.html

  • 11 Questions About Data Engineers: What’s the profession about, and where’s it heading?

    I hope my answers will be useful to novice data engineers and anyone interested in data engineering.

    https://www.kdnuggets.com/2022/10/11-questions-data-engineers-profession-heading.html

  • 3 Ways to Process CSV Files in Python

    This article is about 3 ways you can process a CSV file using Python.

    https://www.kdnuggets.com/2022/10/3-ways-process-csv-files-python.html

  • What makes a visualization good?

    Jeff Heer, co-collaborator on data visualization tools like D3.js, Vega, and Vega-Lite, recently addressed the question, "What makes a visualization good?"

    https://www.kdnuggets.com/2022/10/sphere-makes-visualization-good.html

  • AI in FinTech: Managing the Finance of the Future

    Digital transformation is evolving, and so is the fintech industry, which is adopting AI trends to gain benefits such as optimized productivity, increased ROI, and enhanced security.

    https://www.kdnuggets.com/2022/10/ai-fintech-managing-finance-future.html

  • Hyperparameter Tuning Using Grid Search and Random Search in Python

    A comprehensive guide on optimizing model hyperparameters with Scikit-Learn.

    https://www.kdnuggets.com/2022/10/hyperparameter-tuning-grid-search-random-search-python.html

  • KDnuggets News, October 5: Top Free Git GUI Clients for Beginners • A Day in the Life of a Data Scientist

    Top Free Git GUI Clients for Beginners • A Day in the Life of a Data Scientist: Expert vs. Beginner • Getting Started with Pandas Cheatsheet • Top 5 Machine Learning Practices Recommended by Experts • 7 Steps to Mastering Machine Learning with Python in 2022

    https://www.kdnuggets.com/2022/n39.html

  • Interview Kickstart Data Science Interview Course — What Makes It Different?

    Interview Kickstart’s Data Science Interview Course is built by Data Scientists from MAANG and other big tech companies. The course promises to get you interview-ready in 15 weeks.

    https://www.kdnuggets.com/2022/10/interview-kickstart-data-science-interview-course-makes-different.html

  • 6 Best Free Online Courses to Jumpstart Your Learning of SQL

    We scoured the internet for the best free courses for anyone looking to learn SQL. We’re excited to share the top 6 resources we found.

    https://www.kdnuggets.com/2022/10/corise-6-best-free-online-courses-jumpstart-learning-sql.html

  • Key-Value Databases, Explained

    Among the four big NoSQL database types, key-value stores are probably the most popular ones due to their simplicity and fast performance. Let’s further explore how key-value stores work and what their practical uses are.

    https://www.kdnuggets.com/2021/04/nosql-explained-understanding-key-value-databases.html

  • Key Takeaways from BigData London Conference and Exhibition

    Read some of the key takeaways from BigData LDN, one of the UK's free data & analytics conferences, which took place recently.

    https://www.kdnuggets.com/2022/10/key-takeaways-bigdata-london-conference-exhibition.html

  • Machine Learning for Everybody!

    Who is machine learning for? Everybody!

    https://www.kdnuggets.com/2022/10/machine-learning-everybody.html

  • Which Metric Should I Use? Accuracy vs. AUC

    Depending on the problem you’re trying to solve, one metric may be more insightful than another.

    https://www.kdnuggets.com/2022/10/metric-accuracy-auc.html

  • Top Posts September 26 – October 2: Free Algorithms in Python Course

    Free Algorithms in Python Course • How to Select Rows and Columns in Pandas • Lessons from a Senior Data Scientist • A Day in the Life of a Data Scientist: Expert vs. Beginner • 7 Machine Learning Portfolio Projects to Boost the Resume

    https://www.kdnuggets.com/2022/10/top-posts-week-0926-1002.html

  • Beginner Friendly Python Projects That Are Fun!

    Projects like this are not only beginner friendly, but they add a little bit of fun to your studies or career.

    https://www.kdnuggets.com/2022/10/beginner-friendly-python-projects-fun.html

  • Top Free Git GUI Clients for Beginners

    KDnuggets Top Blog: Learn about beginner-friendly Git GUI clients and perform Git-based tasks using an interactive user interface.

    https://www.kdnuggets.com/2022/10/top-free-git-gui-clients-beginners.html

  • Handling Missing Values in Time-series with SQL

    This article is about a specific use-case that comes up often when dealing with time-series data.

    https://www.kdnuggets.com/2022/09/handling-missing-values-timeseries-sql.html

  • Are the Efforts of People Analytics Worth the Outcome?

    Learn about the connection between people analytics and creating diversity, equity, and inclusion (DEI) accountability.

    https://www.kdnuggets.com/2022/09/efforts-people-analytics-worth-outcome.html

  • 7 Steps to Mastering Machine Learning with Python in 2022

    Are you trying to teach yourself machine learning from scratch, but aren’t sure where to start? I will attempt to condense all the resources I’ve used over the years into 7 steps that you can follow to teach yourself machine learning.

    https://www.kdnuggets.com/2022/02/7-steps-mastering-machine-learning-python.html

  • Master Transformers with This Free Stanford Course!

    If you want a deep dive on transformers, this Stanford course has made its courseware freely available, including lecture videos, readings, assignments, and more.

    https://www.kdnuggets.com/2022/09/master-transformers-free-stanford-course.html

  • 8 Ways to Improve Your Search Application this Week

    There are many places to start improving and optimizing and it’s easy to get bogged down. The good news is that there are several easy ways to improve your search application’s quality and performance.

    https://www.kdnuggets.com/2022/09/corise-8-ways-improve-search-application-week.html

  • Beyond Pipelines: Graphs as Scikit-Learn Metaestimators

    Create manageable and scalable machine learning workflows with skdag.

    https://www.kdnuggets.com/2022/09/graphs-scikitlearn-metaestimators.html

  • SQL LIKE Operator Examples

    Search for the specific patterns in the SQL table using LIKE and wildcards.

    https://www.kdnuggets.com/2022/09/sql-like-operator-examples.html

  • Getting Started with Pandas Cheatsheet

    The latest KDnuggets cheatsheet aims to get you up to speed with introductory Pandas operations, and provide a handy reference as you work with the library. Check it out if you're interested in a quick start.

    https://www.kdnuggets.com/2022/09/getting-started-pandas-cheatsheet.html

  • A Day in the Life of a Data Scientist: Expert vs. Beginner

    Let’s learn more about what a data scientist gets up to.

    https://www.kdnuggets.com/2022/09/day-life-data-scientist-expert-beginner.html

  • Top 5 Machine Learning Practices Recommended by Experts

    This article is intended to help beginners improve their model structure by listing the best practices recommended by machine learning experts.

    https://www.kdnuggets.com/2022/09/top-5-machine-learning-practices-recommended-experts.html

  • KDnuggets News, September 28: Free Algorithms in Python Course • Lessons from a Senior Data Scientist

    Free Algorithms in Python Course • Lessons from a Senior Data Scientist • The Absolute Basics of MLOps • Data Analyst Skills You Need for Your Next Promotion • Dimensionality Reduction Techniques in Data Science

    https://www.kdnuggets.com/2022/n38.html

  • Top Posts September 19-25: 7 Machine Learning Portfolio Projects to Boost the Resume

    7 Machine Learning Portfolio Projects to Boost the Resume • How to Select Rows and Columns in Pandas Using [ ], .loc, iloc, .at and .iat • Decision Tree Algorithm, Explained • Free SQL and Database Course • 5 Tricky SQL Queries Solved

    https://www.kdnuggets.com/2022/09/top-posts-week-0919-0925.html

  • How to Correctly Select a Sample From a Huge Dataset in Machine Learning

    We explain how choosing a small, representative dataset from a large population can improve model training reliability.

    https://www.kdnuggets.com/2019/05/sample-huge-dataset-machine-learning.html

  • Become an AI Artist Using Phraser and Stable Diffusion

    Generate the prompt using Phraser and create realistic art using the Diffusion model.

    https://www.kdnuggets.com/2022/09/become-ai-artist-phraser-stable-diffusion.html

  • Which is Best: Data Science Bootcamp vs Degree vs Online Course

    Let’s break down each of the three options: the pros, the cons, the cost, and what you can expect to get out of them in the end.

    https://www.kdnuggets.com/2022/09/best-data-science-bootcamp-degree-online-course.html

  • Lessons from a Senior Data Scientist

    The aim of this article was for me to gain a deeper insight into the life of a senior data scientist and how their experience can be used as lessons for up-and-coming data scientists.

    https://www.kdnuggets.com/2022/09/lessons-senior-data-scientist.html

  • 5 Python Interview Questions & Answers

    The Python coding questions challenge your problem-solving and programming skills.

    https://www.kdnuggets.com/2022/09/5-python-interview-questions-answers.html

  • Free Algorithms in Python Course

    KDnuggets Top Blog: Algorithms are an often misunderstood concept. Leverage Python to learn what algorithms really are, and how to implement an array of basic computational algorithms in the language.

    https://www.kdnuggets.com/2022/09/free-algorithms-python-course.html

  • Data Analyst Skills You Need for Your Next Promotion

    Get some advice from the “older” generation.

    https://www.kdnuggets.com/2022/09/data-analyst-skills-need-next-promotion.html

  • The Machine Learning Lifecycle

    Learn about the standard process for building sustainable machine learning applications.

    https://www.kdnuggets.com/2022/06/making-sense-crispmlq-machine-learning-lifecycle-process.html

  • Dimensionality Reduction Techniques in Data Science

    Dimensionality reduction techniques are basically a part of the data pre-processing step, performed before training the model.

    https://www.kdnuggets.com/2022/09/dimensionality-reduction-techniques-data-science.html

  • The Absolute Basics of MLOps

    This article is for people who don’t know a thing about MLOps or want to refresh their memory.

    https://www.kdnuggets.com/2022/09/absolute-basics-mlops.html

  • IMPACT 2022: The Data Observability Summit, on Oct. 25-26

    In just one month, IMPACT 2022 will bring together the most insightful minds in data. Register for The Data Observability Summit now!

    https://www.kdnuggets.com/2022/09/monte-carlo-impact-2022-data-observability-summit.html

  • The Mistake Every Data Scientist Has Made at Least Once

    How to increase your chances of avoiding the mistake.

    https://www.kdnuggets.com/2022/09/mistake-every-data-scientist-made-least.html

  • Free Microsoft Excel for Beginners Course

    Are you ready to learn Excel from the beginning? In this course, you will learn data entry, essential formulas, data visualization, pivot tables, and much more.

    https://www.kdnuggets.com/2022/09/free-microsoft-excel-beginners-course.html

  • Build a Text-to-Speech Converter with Python in 5 Minutes

    I have chosen to go through how to build a text-to-speech converter in Python. Not only is it simple, but it is also fun and interactive. I will show you two ways you can do it with Python.

    https://www.kdnuggets.com/2022/09/build-texttospeech-converter-python-5-minutes.html

  • KDnuggets News, September 21: 7 Machine Learning Portfolio Projects to Boost the Resume • Free SQL and Database Course

    7 Machine Learning Portfolio Projects to Boost the Resume • Free SQL and Database Course • Top 5 Bookmarks Every Data Analyst Should Have • 7 Steps to Mastering Python for Data Science • 5 Concepts You Should Know About Gradient Descent and Cost Function

    https://www.kdnuggets.com/2022/n37.html

  • Learn How Different Data Visualizations Work

    Data Literacy Month at DataCamp is in full swing. DataCamp’s three-part series on demystifying data visualizations explores how to capture trends, demonstrate relationships, and explore distributions. Start learning today.

    https://www.kdnuggets.com/2022/09/datacamp-learn-different-data-visualizations-work.html

  • Data-centric AI and Tabular Data

    DALL-E, LaMDA, and GPT-3 all had celebrity moments recently. So, where’s the glamorous, high-performance model that’s mastered tabular data?

    https://www.kdnuggets.com/2022/09/datacentric-ai-tabular-data.html

  • More Performance Evaluation Metrics for Classification Problems You Should Know

    When building and optimizing your classification model, measuring how accurately it predicts your expected outcome is crucial. However, this metric alone is never the entire story, as it can still offer misleading results. That's where these additional performance evaluations come into play to help tease out more meaning from your model.

    https://www.kdnuggets.com/2020/04/performance-evaluation-metrics-classification.html

  • How To Calculate Algorithm Efficiency

    In this article, we will discuss how to calculate algorithm efficiency, focusing on two main ways to measure it and providing an overview of the calculation process.

    https://www.kdnuggets.com/2022/09/calculate-algorithm-efficiency.html

  • Top Posts September 12-18: How to Select Rows and Columns in Pandas

    How to Select Rows and Columns in Pandas Using [ ], .loc, iloc, .at and .iat • Free Python for Data Science Course • 5 Data Science Skills That Pay & 5 That Don't • 7 Data Analytics Interview Questions & Answers • 5 Tricky SQL Queries Solved

    https://www.kdnuggets.com/2022/09/top-posts-week-0912-0918.html

  • AWS AI & ML Scholarship Program Overview

    This scholarship program aims to help people who are underserved and were underrepresented during high school and college learn the foundations and concepts of Machine Learning and build careers in AI and ML.

    https://www.kdnuggets.com/2022/09/aws-ai-ml-scholarship-program-overview.html

  • 7 Machine Learning Portfolio Projects to Boost the Resume

    KDnuggets Top Blog: Work on machine learning and deep learning portfolio projects to learn new skills and improve your chance of getting hired.

    https://www.kdnuggets.com/2022/09/7-machine-learning-portfolio-projects-boost-resume.html

  • 5 Concepts You Should Know About Gradient Descent and Cost Function

    Why is Gradient Descent so important in Machine Learning? Learn more about this iterative optimization algorithm and how it is used to minimize a loss function.

    https://www.kdnuggets.com/2020/05/5-concepts-gradient-descent-cost-function.html

  • Top 5 Bookmarks Every Data Analyst Should Have

    Check out these online tools to save you time & effort.

    https://www.kdnuggets.com/2022/09/top-5-bookmarks-every-data-analyst.html

  • Simplifying Decision Tree Interpretability with Python & Scikit-learn

    This post will look at a few different ways of attempting to simplify decision tree representation and, ultimately, interpretability. All code is in Python, with Scikit-learn being used for the decision tree modeling.

    https://www.kdnuggets.com/2017/05/simplifying-decision-tree-interpretation-decision-rules-python.html

  • How Data Science Fuels Fraud Prevention

    By themselves, these data points will probably not provide much insight into a single customer. However, a company that has some or all of this information is well-positioned to have a strong idea of how legitimate its visitors are.

    https://www.kdnuggets.com/2022/09/data-science-fuels-fraud-prevention.html

  • An Intuitive Explanation of Collaborative Filtering

    The post introduces one of the most popular recommendation algorithms, i.e., collaborative filtering. It focuses on building an intuitive understanding of the algorithm illustrated with the help of an example.

    https://www.kdnuggets.com/2022/09/intuitive-explanation-collaborative-filtering.html

  • Free SQL and Database Course

    KDnuggets Top Blog: Get up to speed on SQL and relational databases with this free video course.

    https://www.kdnuggets.com/2022/09/free-sql-database-course.html

  • Top Open Source Large Language Models

    In this article, we will discuss the importance of large language models and suggest some of the top open source models and the NLP tasks they can be used for.

    https://www.kdnuggets.com/2022/09/john-snow-top-open-source-large-language-models.html

  • 7 Steps to Mastering Python for Data Science

    Here’s how you can learn to code in Python from scratch in 7 easy steps.

    https://www.kdnuggets.com/2022/06/7-steps-mastering-python-data-science.html

  • Why Organizations Need Data Warehouses

    So where can you store, harness, and collect findings in your data, all in one place? What is the right tool for this? Data warehouses.

    https://www.kdnuggets.com/2022/09/organizations-need-data-warehouses.html

  • KDnuggets News, September 14: Free Python for Data Science Course • Everything You’ve Ever Wanted to Know About Machine Learning

    Free Python for Data Science Course • Everything You’ve Ever Wanted to Know About Machine Learning • Progress Bars in Python with tqdm for Fun and Profit • 7 Tips for Python Beginners • 7 Data Analytics Interview Questions & Answers

    https://www.kdnuggets.com/2022/n36.html

  • KDnuggets Applied Data Science Survey

    We want to know where you applied data science in the past 12 months.

    https://www.kdnuggets.com/2022/09/applied-data-science-survey.html

  • ModelOps: What you need to know to get certified

    Find out why ModelOps is in demand and how SAS can help propel you in this growing area.

    https://www.kdnuggets.com/2022/09/sas-modelops-need-know-get-certified.html

  • Find a Picture in an Image Without Marking it Up

    Let's take a closer look at our algorithm so that you can test it with a notebook in Google Colaboratory and even implement it in your project.

    https://www.kdnuggets.com/2022/09/find-picture-image-without-marking.html

  • Top Posts August 29 – September 11: Free Python for Data Science Course

    Free Python for Data Science Course • How to Select Rows and Columns in Pandas Using [ ], .loc, iloc, .at and .iat • Everything You've Ever Wanted to Know About Machine Learning • 7 Tips for Python Beginners • 5 Tricky SQL Queries Solved

    https://www.kdnuggets.com/2022/09/top-posts-week-0829-0911.html

  • Removing Outliers Using Standard Deviation in Python

    Standard Deviation is one of the most underrated statistical tools out there. It’s an extremely useful metric that most people know how to calculate but very few know how to use effectively.

    https://www.kdnuggets.com/2017/02/removing-outliers-standard-deviation-python.html

  • 7 Data Analytics Interview Questions & Answers

    KDnuggets Top Blog: Most asked non-technical, operational, and SQL interview questions for data analytics jobs.

    https://www.kdnuggets.com/2022/09/7-data-analytics-interview-questions-answers.html

  • Everything You’ve Ever Wanted to Know About Machine Learning

    KDnuggets Top Blog: Putting the fun in fundamentals! A collection of short videos to amuse beginners and experts alike.

    https://www.kdnuggets.com/2022/09/everything-youve-ever-wanted-to-know-about-machine-learning.html

  • All About Collections in Python

    In this tutorial, we will explore different types of containers implemented by the collections module.

    https://www.kdnuggets.com/2022/09/collections-python.html

  • How To Tackle 3 Common Machine Learning Challenges

    As you begin developing your ML models, here are the common challenges you might encounter during your project.

    https://www.kdnuggets.com/2022/09/comet-tackle-3-common-machine-learning-challenges.html

  • Everything You Need to Know About Data Lakehouses

    Learn everything you need to know about data lakehouses.

    https://www.kdnuggets.com/2022/09/everything-need-know-data-lakehouses.html

  • 7 Things You Didn’t Know You Could do with a Low Code Tool

    Surprisingly easy solutions for complex data problems.

    https://www.kdnuggets.com/2022/09/7-things-didnt-know-could-low-code-tool.html

  • Join Data Literacy Month 2022

    This September, DataCamp are dedicating an entire month to supporting individuals and organizations to drive data literacy as part of their mission to democratize data skills for everyone.

    https://www.kdnuggets.com/2022/09/datacamp-join-data-literacy-month-2022.html

  • Machine Learning Algorithms – What, Why, and How?

    This post explains why and when you need machine learning and concludes by listing the key considerations for choosing the correct machine learning algorithm.

    https://www.kdnuggets.com/2022/09/machine-learning-algorithms.html

  • Convert Text Documents to a TF-IDF Matrix with tfidfvectorizer

    Convert text documents to vectors using TF-IDF vectorizer for topic extraction, clustering, and classification.

    https://www.kdnuggets.com/2022/09/convert-text-documents-tfidf-matrix-tfidfvectorizer.html

  • Choosing the Right Clustering Algorithm for Your Dataset

    Applying a clustering algorithm is much easier than selecting the best one. Each type offers pros and cons that must be considered if you’re striving for a tidy cluster structure.

    https://www.kdnuggets.com/2019/10/right-clustering-algorithm.html

  • How to build a model to find the most impactful paths in user journeys

    In this how-to, we’ll build a model to uncover which paths in user journeys have the biggest impact on product goals (e.g. conversion). You can use it to improve products or optimize marketing campaigns, or as a base for deeper user behavior analyses.

    https://www.kdnuggets.com/2022/09/objectiv-build-model-impactful-paths-user-journeys.html

  • 8 Innovative BERT Knowledge Distillation Papers That Have Changed The Landscape of NLP

    Each of the papers presents a particular perspective on findings in BERT utilization.

    https://www.kdnuggets.com/2022/09/eight-innovative-bert-knowledge-distillation-papers-changed-nlp-landscape.html

  • 24 A/B Testing Interview Questions in Data Science Interviews and How to Crack Them

    Here’s everything you need to know about A/B testing interview questions in data science interviews.

    https://www.kdnuggets.com/2022/09/24-ab-testing-interview-questions-data-science-interviews-crack.html

  • Visualizing Your Confusion Matrix in Scikit-learn

    Defining model evaluation metrics is crucial to ensuring that the model performs as intended for its purpose. The confusion matrix is one of the most popular and effective tools for evaluating the performance of a trained ML model. In this post, you will learn how to visualize the confusion matrix and interpret its output.

    https://www.kdnuggets.com/2022/09/visualizing-confusion-matrix-scikitlearn.html

  • SQL vs NoSQL: 7 Key Takeaways

    People assume that NoSQL is a counterpart to SQL. Instead, it’s a different type of database designed for use-cases where SQL is not ideal. The differences between the two are many, although some are so crucial that they define both databases at their cores.

    https://www.kdnuggets.com/2020/12/sql-vs-nosql-7-key-takeaways.html

  • 7 Tips for Python Beginners

    Learn useful tips to start your career as a Python developer.

    https://www.kdnuggets.com/2022/09/7-tips-python-beginners.html

  • Free Python for Data Science Course

    KDnuggets Top Blog: Ready to learn how to use Python for data science? This free course has got you covered!

    https://www.kdnuggets.com/2022/09/free-python-data-science-course.html

  • What’s New On KDnuggets?

    KDnuggets has been up to some things over the past several months. Check in quick to make sure you haven't missed anything.

    https://www.kdnuggets.com/2022/09/whats-new-kdnuggets.html

  • Combining Pandas DataFrames Made Simple

    For this tutorial, we will work through examples to understand how different methods for combining Pandas DataFrames work.

    https://www.kdnuggets.com/2022/09/combining-pandas-dataframes-made-simple.html

  • Decision Tree Pruning: The Hows and Whys

    Decision trees are a machine learning algorithm that is susceptible to overfitting. One of the techniques you can use to reduce overfitting in decision trees is pruning.

    https://www.kdnuggets.com/2022/09/decision-tree-pruning-hows-whys.html

  • How to Select Rows and Columns in Pandas Using [ ], .loc, .iloc, .at and .iat

    KDnuggets Top Blog: Subset selection is one of the most frequently performed tasks while manipulating data. Pandas provides different ways to efficiently select subsets of data from your DataFrame.

    https://www.kdnuggets.com/2019/06/select-rows-columns-pandas.html

  • Progress Bars in Python with tqdm for Fun and Profit

    Add progress bars to Python functions, Jupyter Notebooks, and pandas DataFrames.

    https://www.kdnuggets.com/2022/09/progress-bars-python-tqdm-fun-profit.html

  • KDnuggets News, August 31: The Complete Data Science Study Roadmap • 7 Techniques to Handle Imbalanced Data

    The Complete Data Science Study Roadmap • 7 Techniques to Handle Imbalanced Data • 3 Ways to Append Rows to Pandas DataFrames • The Bias-Variance Trade-off • How to Package and Distribute Machine Learning Models with MLFlow

    https://www.kdnuggets.com/2022/n35.html

  • The Difference Between Training and Testing Data in Machine Learning

    When building a predictive model, the quality of the results depends on the data you use. To get good results, you need to understand the difference between training and testing data in machine learning.

    https://www.kdnuggets.com/2022/08/difference-training-testing-data-machine-learning.html

  • Machine Learning Metadata Store

    In this article, we will learn about metadata stores, the need for them, their components, and metadata store management.

    https://www.kdnuggets.com/2022/08/machine-learning-metadata-store.html

  • Machine Learning in the Enterprise: Use Cases & Challenges

    This article provides insights into how leading data scientists are embracing machine learning in their organizations and covers some of the major ML challenges and trends in the enterprise.

    https://www.kdnuggets.com/2022/08/dss-machine-learning-enterprise-cases-challenges.html

  • Data Governance and Observability, Explained

    Let’s dive in and understand the ins and outs of data observability and data governance - the two keys to a more robust data foundation.

    https://www.kdnuggets.com/2022/08/data-governance-observability-explained.html

  • The Benefits of Natural Language AI for Content Creators

    In this article, we will discuss the benefits of natural language AI for content creators, highlighting the key reasons why you should consider using it to improve your content output.

    https://www.kdnuggets.com/2022/08/benefits-natural-language-ai-content-creators.html

  • Top Posts August 22-28: Free Python Project Coding Course

    Free Python Project Coding Course • 5 Tricky SQL Queries Solved • Decision Tree Algorithm, Explained • Free AI for Beginners Course • The Complete Collection of Data Science Projects - Part 2

    https://www.kdnuggets.com/2022/08/top-posts-week-0822-0828.html

  • 3 Ways to Append Rows to Pandas DataFrames

    Learn a simple way to append rows in the form of arrays, dictionaries, series, and dataframes to another dataframe.

    https://www.kdnuggets.com/2022/08/3-ways-append-rows-pandas-dataframes.html

  • Build a Reproducible and Maintainable Data Science Project: A Free Online Book

    This free online book is a fantastic resource on how to structure, manage, and maintain your real-world data science projects.

    https://www.kdnuggets.com/2022/08/free-book-build-reproducible-maintainable-data-science-project.html

  • A Complete Guide To Decision Tree Software

    Decision tree models are used to classify information into meaningful sequential results. Find out everything else you need to know here.

    https://www.kdnuggets.com/2022/08/complete-guide-decision-tree-software.html

  • Machine Learning is Not Like Your Brain Part Seven: What Neurons are Good At

    Thus far, this series has focused on things that Machine Learning does or needs which biological neurons simply can’t do. This article turns the tables and discusses a few things that neurons are particularly good at.

    https://www.kdnuggets.com/2022/08/machine-learning-like-brain-part-seven-neurons-good.html

  • Put your deep learning skills with R into action!

    Sponsored Post: Deep learning has become essential knowledge for data scientists, researchers, and software developers. The R language APIs for Keras and TensorFlow …

    https://www.kdnuggets.com/2022/08/manning-deep-learning-skills-r-action.html

  • How to Better Leverage Data Science for Business Growth

    Is data science for you? And if it is, how can you use it to grow your business?

    https://www.kdnuggets.com/2022/08/better-leverage-data-science-business-growth.html

  • How to Package and Distribute Machine Learning Models with MLFlow

    MLFlow is a tool to manage the end-to-end lifecycle of a Machine Learning model. This post covers the installation and configuration of an MLFlow service and includes examples of how to generate and share projects with MLFlow.

    https://www.kdnuggets.com/2022/08/package-distribute-machine-learning-models-mlflow.html

  • 7 Techniques to Handle Imbalanced Data

    This blog post introduces seven techniques that are commonly applied in domains like intrusion detection or real-time bidding, because the datasets are often extremely imbalanced.

    https://www.kdnuggets.com/2017/06/7-techniques-handle-imbalanced-data.html
