Search results for
-
10 Most Common Data Quality Issues and How to Fix Them
Ensuring data quality guarantees more data-informed decisions. Hence, this article highlights the common data quality issues and ways to overcome them.https://www.kdnuggets.com/2022/11/10-common-data-quality-issues-fix.html
-
SHAP: Explain Any Machine Learning Model in Python
A Comprehensive Guide to SHAP and Shapley Valueshttps://www.kdnuggets.com/2022/11/shap-explain-machine-learning-model-python.html
-
Picking Examples to Understand Machine Learning Model
Understanding ML by combining explainability and sample picking.https://www.kdnuggets.com/2022/11/picking-examples-understand-machine-learning-model.html
-
Top Posts November 14-20: Git for Data Science Cheatsheet
Git for Data Science Cheatsheet • 6 Best Free Online Courses to Learn Python and Boost Your Career • How to Select Rows and Columns in Pandas Using [ ], .loc, iloc, .at and .iat • How LinkedIn Uses Machine Learning To Rank Your Feed • 7 SQL Concepts You Should Know For Data Sciencehttps://www.kdnuggets.com/2022/11/top-posts-week-1114-1120.html
-
Announcing a Blog Writing Contest, Winner Gets an NVIDIA GPU!
KDnuggets and NVIDIA are announcing a blog-writing contest with a GPU focus, with the winner receiving an RTX 3080 Ti GPU!https://www.kdnuggets.com/2022/11/blog-writing-contest-nvidia-gpu.html
-
How to Use Graph Theory to Scout Soccer
Take Soccer Analytics to the Next Level with Graph Theory: Here’s What to Know and How to Do It.https://www.kdnuggets.com/2022/11/graph-theory-scout-soccer.html
-
New From Anaconda! Data Science Training and Cloud Hosted Notebooks
Anaconda is incredibly excited to announce the release of a brand-new suite of products on the Anaconda Nucleus platform: Anaconda Notebooks and Anaconda Learning.https://www.kdnuggets.com/2022/11/anaconda-new-anaconda-data-science-training-cloud-hosted-notebooks.html
-
Introduction to Pandas for Data Science
The Pandas library is core to any Data Science work in Python. This introduction will walk you through the basics of data manipulating, and features many of Pandas important features.https://www.kdnuggets.com/2020/06/introduction-pandas-data-science.html
-
7 SQL Concepts You Should Know For Data Science
The post explains all the key elements of SQL that you must know as a data science practitioner.https://www.kdnuggets.com/2022/11/7-sql-concepts-needed-data-science.html
-
Research Papers for NLP Beginners
Read research papers on neural models, word embedding, language modeling, and attention & transformers.https://www.kdnuggets.com/2022/11/research-papers-nlp-beginners.html
-
Geocoding in Python: A Complete Guide
A step-by-step tutorial on geocoding with Pythonhttps://www.kdnuggets.com/2022/11/geocoding-python-complete-guide.html
-
9 Free Resources to Master Python
Python is the most popular general-purpose language and you can learn it for free.https://www.kdnuggets.com/2022/11/9-free-resources-master-python.html
-
How to Setup Julia on Jupyter Notebook
Learn three simple steps to install Julia for Jupyter Notebook and write your first data visualization code.https://www.kdnuggets.com/2022/11/setup-julia-jupyter-notebook.html
-
What To Expect for AI Quality Trends In 2023
Based on the recent discussions with dozens of Fortune 500 data science teams, we can expect to see a continued spotlight on AI model quality in 2023.https://www.kdnuggets.com/2022/11/expect-ai-quality-trends-2023.html
-
KDnuggets Top Posts for October 2022: 10 Cheat Sheets You Need To Ace Data Science Interview
10 Cheat Sheets You Need To Ace Data Science Interview • 7 Free Platforms for Building a Strong Data Science Portfolio • The Complete Free PyTorch Course for Deep Learning • 3 Valuable Skills That Have Doubled My Income as a Data Scientist • 25 Advanced SQL Interview Questions for Data Scientists • A Data Science Portfolio That Will Land You The Job in 2022 • Top Free Git GUI Clients for Beginners • Essential Books You Need to Become a Data Engineerhttps://www.kdnuggets.com/2022/11/top-posts-october-2022.html
-
If I Had To Start Learning Data Science Again, How Would I Do It?
While different ways to learn Data Science for the first time exist, the approach that works for you should be based on how you learn best. One powerful method is to evolve your learning from simple practice into complex foundations, as outlined in this learning path recommended by a physicist who turned into a Data Scientist.https://www.kdnuggets.com/2020/08/start-learning-data-science-again.html
-
KDnuggets News, November 16: How LinkedIn Uses Machine Learning • Confusion Matrix, Precision, and Recall Explained
How LinkedIn Uses Machine Learning To Rank Your Feed • Confusion Matrix, Precision, and Recall Explained • Matrix Multiplication for Data Science (or Machine Learning) • Machine Learning from scratch: Decision Trees • 7 Python Projects for Beginnershttps://www.kdnuggets.com/2022/n45.html
-
Git for Data Science Cheatsheet
Knowing git is no longer an option for data professionals. Grab this handy reference sheet now and make sure you know how to git the job done.https://www.kdnuggets.com/2022/11/git-data-science-cheatsheet.html
-
6 Best Free Online Courses to Learn Python and Boost Your Career
The demand for Data Scientists who are proficient in Python is at an all time high. Python has helped people boost their careers in finance, consulting, research, software tech, and robotics. Explore 6 courses designed to help you learn Python.https://www.kdnuggets.com/2022/11/corise-6-best-free-online-courses-python-boost-career.html
-
Efficiency Spells the Difference Between Biological Neurons and Their Artificial Counterparts
Part 8 of the series explores a single facet of biological neurons which, so far, have kept them way ahead of their artificial counterparts: their efficiency.https://www.kdnuggets.com/2022/11/efficiency-spells-difference-biological-neurons-artificial-counterparts.html
-
Top Data Analyst Certification Courses for 2022
Top certification courses by IBM, Edureka, DataCamp, Udacity, and Google.https://www.kdnuggets.com/2022/11/top-data-analyst-certification-courses-2022.html
-
5 Linguistics Courses for NLP Practitioners
This collection of 5 courses is intended to help NLP practitioners or hopefuls acquire some of their lacking linguistics knowledge.https://www.kdnuggets.com/2022/11/5-linguistics-courses-nlp-practitioners.html
-
How LinkedIn Uses Machine Learning To Rank Your Feed
In this post, you will learn to clarify business problems & constraints, understand problem statements, select evaluation metrics, overcome technical challenges, and design high-level systems.https://www.kdnuggets.com/2022/11/linkedin-uses-machine-learning-rank-feed.html
-
A Quick Overview of Voronoi Diagrams
If you've heard of Voronoi diagrams but don't klnow what they are, have a look at this quick and informative overview.https://www.kdnuggets.com/2022/11/quick-overview-voronoi-diagrams.html
-
5 Ways of Filtering Python Lists
Filter the list elements using for loop, list comprehension, regex, filter(), and lambda functions.https://www.kdnuggets.com/2022/11/5-ways-filtering-python-lists.html
-
Machine Learning from Scratch: Decision Trees
A simple explanation and implementation of DTs ID3 algorithm in Pythonhttps://www.kdnuggets.com/2022/11/machine-learning-scratch-decision-trees.html
-
Matrix Multiplication for Data Science (or Machine Learning)
Learn the math behind matrix multiplication for data science and machine learning with code examples.https://www.kdnuggets.com/2022/11/matrix-multiplication-data-science-machine-learning.html
-
How To Create An Effective AI Strategy
This post elaborates on various factors that go into consideration while prioritizing various AI initiatives.https://www.kdnuggets.com/2022/11/create-effective-ai-strategy.html
-
Understanding Bias-Variance Trade-Off in 3 Minutes
This article is the write-up of a Machine Learning Lighting Talk, intuitively explaining an important data science concept in 3 minutes.https://www.kdnuggets.com/2020/09/understanding-bias-variance-trade-off-3-minutes.html
-
Getting Started with PyCaret
An open-source low-code machine learning library for training and deploying the models in production.https://www.kdnuggets.com/2022/11/getting-started-pycaret.html
-
7 Python Projects for Beginners
Simple and fun Python projects to get experience and build a strong portfolio.https://www.kdnuggets.com/2022/11/7-python-projects-beginners.html
-
Analyzing Diversity & Inclusion with SQL
The most underrated SQL function for analyzing diversity.https://www.kdnuggets.com/2022/11/analyzing-diversity-inclusion-sql.html
-
Fake It Till You Make It: Generating Realistic Synthetic Customer Datasets
Finding the data you need is hard. So why not fake it?https://www.kdnuggets.com/2022/01/fake-realistic-synthetic-customer-datasets-projects.html
-
Finally a Book on Attention!
Finally a book on Attention. Learn how to build your own transformer model with Machine Learning Mastery's new book.https://www.kdnuggets.com/2022/11/mlm-finally-book-attention.html
-
Confusion Matrix, Precision, and Recall Explained
Learn these key machine learning performance metrics to ace data science interviews.https://www.kdnuggets.com/2022/11/confusion-matrix-precision-recall-explained.html
-
Map out your journey towards SAS Certification
Nearly 50% of certification holders said it was easier to find new jobs, enter new career fields and land job interviews. Read on to learn about every resource you’ll need from start to finish to receive your SAS certification.https://www.kdnuggets.com/2022/11/sas-map-journey-towards-sas-certification.html
-
The Most Comprehensive List of Kaggle Solutions and Ideas
Learn from top-performing teams in the competition to get better at understanding machine learning techniques.https://www.kdnuggets.com/2022/11/comprehensive-list-kaggle-solutions-ideas.html
-
3 Useful Python Automation Scripts
The post highlights three useful applications of using python to automate simple desktop tasks. Stay tuned till the end of the post to find the reference for a bonus resource.https://www.kdnuggets.com/2022/11/3-useful-python-automation-scripts.html
-
Approaches to Text Summarization: An Overview
This article will present the main approaches to text summarization currently employed, as well as discuss some of their characteristics.https://www.kdnuggets.com/2019/01/approaches-text-summarization-overview.html
-
15 More Free Machine Learning and Deep Learning Books
Check out this second list of 15 FREE ebooks for learning machine learning and deep learning.https://www.kdnuggets.com/2022/11/15-free-machine-learning-deep-learning-books.html
-
4 Ways to Rename Pandas Columns
A simple pandas tutorial for beginners with code examples.https://www.kdnuggets.com/2022/11/4-ways-rename-pandas-columns.html
-
How to Create a Sampling Plan for Your Data Project
When simple random sampling is not that simple.https://www.kdnuggets.com/2022/11/create-sampling-plan-data-project.html
-
The Ultimate Guide To Different Word Embedding Techniques In NLP
A machine can only understand numbers. As a result, converting text to numbers, called embedding text, is an actively researched topic. In this article, we review different word embedding techniques for converting text into vectors.https://www.kdnuggets.com/2021/11/guide-word-embedding-techniques-nlp.html
-
The AI Education Gap and How to Close It
AI education is broken, how do we solve it? Individuals end up learning a specific tool or tactic in a vacuum. They are missing the real-world applicability and collaboration that is critical to building impactful AI solutions in line with the organization’s strategy.https://www.kdnuggets.com/2022/11/ai-education-gap-close.html
-
What is Statistical Skew?
Read this overview of what is skewness, and how to calculate it.https://www.kdnuggets.com/2022/11/statistical-skew.html
-
Simple and Fast Data Streaming for Machine Learning Projects
Learn about the cutting-edge DagsHub's Direct Data Access for simple and faster data loading and model training.https://www.kdnuggets.com/2022/11/simple-fast-data-streaming-machine-learning-projects.html
-
Getting Deep Learning working in the wild: A Data-Centric Course
Data-centric learning resources are somewhat scattered today, and that’s why we developed a new Data Centric Deep Learning course on the co:rise education platform. It is an introduction to a set of approaches and best practices, for people who are trying to do deep learning in the wild.https://www.kdnuggets.com/2022/11/corise-deep-learning-wild-data-centric-course.html
-
9 Skills You Need to Become a Data Engineer
A data engineer is a fast-growing profession with amazing challenges and rewards. Which skills do you need to become a data engineer? In this post, we’ll take a look at both hard and soft skills.https://www.kdnuggets.com/2021/03/9-skills-become-data-engineer.html
-
KDnuggets News, November 2: The Current State of Data Science Careers • 15 Free Machine Learning and Deep Learning Books
The Current State of Data Science Careers • 15 Free Machine Learning and Deep Learning Books • How to Make Python Code Run Incredibly Fast • Machine Learning on the Edge • Don't Become a Commoditized Data Scientisthttps://www.kdnuggets.com/2022/n43.html
-
30 Resources for Mastering Data Visualization
Want to master data visualization? This list of 30 resources and tools will help you get started on your path toward mastering data visualization.https://www.kdnuggets.com/2022/11/30-resources-mastering-data-visualization.html
-
7 Tips To Produce Readable Data Science Code
In this article, we will go over a few steps that you can take to produce readable, high-quality code.https://www.kdnuggets.com/2022/11/7-tips-produce-readable-data-science-code.html
-
365 Data Science courses free until November 21
The unlimited access initiative provides a risk-free way to break into data science.https://www.kdnuggets.com/2022/11/365-data-science-365-data-science-courses-free-november-21.html
-
Random Forest vs Decision Tree: Key Differences
Check out this reasoned comparison of 2 critical machine learning algorithms to help you better make an informed decision.https://www.kdnuggets.com/2022/02/random-forest-decision-tree-key-differences.html
-
Getting Started with spaCy for NLP
In this blog, we will explore how to get started with spaCy right from the installation to explore the various functionalities it provides.https://www.kdnuggets.com/2022/11/getting-started-spacy-nlp.html
-
Should I Learn Julia?
Do you think learning Julia is better for your data science career? Let’s find out.https://www.kdnuggets.com/2022/11/learn-julia.html
-
Top Posts October 24-30: How to Select Rows and Columns in Pandas
How to Select Rows and Columns in Pandas Using [ ], .loc, iloc, .at and .iat • Decision Tree Algorithm, Explained • Graphs: The natural way to understand data • 7 Techniques to Handle Imbalanced Data • A Data Science Portfolio That Will Land You The Job in 2022https://www.kdnuggets.com/2022/10/top-posts-week-1024-1030.html
-
The Gap Between Deep Learning and Human Cognitive Abilities
How do we bridge this gap between deep learning and human cognitive ability?https://www.kdnuggets.com/2022/10/gap-deep-learning-human-cognitive-abilities.html
-
15 Free Machine Learning and Deep Learning Books
Check out this list of 15 FREE ebooks for learning machine learning and deep learning.https://www.kdnuggets.com/2022/10/15-free-machine-learning-deep-learning-books.html
-
Don’t Become a Commoditized Data Scientist
Unicorns don't exist. Aim instead to be an endangered species.https://www.kdnuggets.com/2022/10/commoditized-data-scientist.html
-
How to Make Python Code Run Incredibly Fast
In this article, I have explained some tips and tricks to optimize and speed up Python code.https://www.kdnuggets.com/2021/06/make-python-code-run-incredibly-fast.html
-
The Current State of Data Science Careers
If you’re someone in data science or aiming to get into a data science career, this article will give you a comprehensive analysis of the state of the field.https://www.kdnuggets.com/2022/10/current-state-data-science-careers.html
-
Graphs: The natural way to understand data
Graph Algorithms for Data Science is a hands-on guide to working with graph-based data in applications like machine learning, fraud detection, and business data analysis. Filled with fascinating and fun projects, demonstrating the ins-and-outs of graphs.https://www.kdnuggets.com/2022/10/manning-graphs-natural-way-understand-data.html
-
Machine Learning on the Edge
Edge ML involves putting ML models on consumer devices where they can independently run inferences without an internet connection, in real-time, and at no cost.https://www.kdnuggets.com/2022/10/machine-learning-edge.html
-
In Data We Trust: Data Centric AI
Learn how data-centric AI can improve your model's overall performance.https://www.kdnuggets.com/2022/10/data-trust-data-centric-ai.html
-
KDnuggets News, October 26: A Data Science Portfolio That Will Land You The Job in 2022 • Is OLAP Dead?
A Data Science Portfolio That Will Land You The Job in 2022 • Is OLAP Dead? • 10 Essential SQL Commands for Data Science • Why TinyML Cases Are Becoming More Popular • Ensemble Learning with Exampleshttps://www.kdnuggets.com/2022/n42.html
-
Codeless Time Series Analysis with KNIME
Bringing powerful Python functionality to the masses!https://www.kdnuggets.com/2022/10/packt-codeless-time-series-analysis-knime.html
-
TF-IDF Defined
Check out this breakdown of TF-IDF by defining its constituent parts.https://www.kdnuggets.com/2022/10/tfidf-defined.html
-
Getting Started with Automated Text Summarization
This article will walk through an extractive text summarization process, using a simple word frequency approach, implemented in Python.https://www.kdnuggets.com/2019/11/getting-started-automated-text-summarization.html
-
Top 7 Diffusion-Based Applications with Demos
Learn about various Diffusion-based applications to get inspiration for a final-year project, research, and product.https://www.kdnuggets.com/2022/10/top-7-diffusionbased-applications-demos.html
-
Join UC’s Information Session for the Master’s in Business Analytics Program
Join UC's Online Master's in Business Analytics Program information session, November 8, 2022, at 6:30 pm est, to learn more about the program and what it can do for your career.https://www.kdnuggets.com/2022/10/ucincinnati-join-ucs-information-session-masters-business-analytics-program.html
-
The ABCs of NLP, From A to Z
There is no shortage of tools today that can help you through the steps of natural language processing, but if you want to get a handle on the basics this is a good place to start. Read about the ABCs of NLP, all the way from A to Z.https://www.kdnuggets.com/2022/10/abcs-nlp-a-to-z.html
-
Top 10 MLOps Tools to Optimize & Manage Machine Learning Lifecycle
As more businesses experiment with data, they realize that developing a machine learning (ML) model is only one of many steps in the ML lifecycle.https://www.kdnuggets.com/2022/10/top-10-mlops-tools-optimize-manage-machine-learning-lifecycle.html
-
Easy Guide To Data Preprocessing In Python
Preprocessing data for machine learning models is a core general skill for any Data Scientist or Machine Learning Engineer. Follow this guide using Pandas and Scikit-learn to improve your techniques and make sure your data leads to the best possible outcome.https://www.kdnuggets.com/2020/07/easy-guide-data-preprocessing-python.html
-
The First ML Value Chain Landscape
TheSequence recently released the first ever ML Chain Landscape shaped by data scientists, a new landscape that would be able to address the entire ML value chain.https://www.kdnuggets.com/2022/10/first-ml-value-chain-landscape-sequence.html
-
Ensemble Learning with Examples
Learn various algorithms to improve the robustness and performance of machine learning applications. Furthermore, it will help you build a more generalized and stable model.https://www.kdnuggets.com/2022/10/ensemble-learning-examples.html
-
Is OLAP Dead?
OLAP enables citizen analysts to quickly, efficiently, and cost-effectively uncover new business insights at a reduced time-to-value.https://www.kdnuggets.com/2022/10/olap-dead.html
-
A Data Science Portfolio That Will Land You The Job in 2022
Check out this article on crafting a data science portfolio that will get you that job. And learn 4 resume mistakes to avoid at any cost.https://www.kdnuggets.com/2022/10/data-science-portfolio-land-job-2022.html
-
KDnuggets Top Posts for September 2022: Free Python for Data Science Course
Free Python for Data Science Course • 7 Machine Learning Portfolio Projects to Boost the Resume • Free Algorithms in Python Course • How to Select Rows and Columns in Pandas • 5 Data Science Skills That Pay & 5 That Don't • Everything You’ve Ever Wanted to Know About Machine Learning • Free SQL and Database Course • 7 Data Analytics Interview Questions & Answershttps://www.kdnuggets.com/2022/09/top-posts-september-2022.html
-
The Evolution of Speech Recognition Metrics
As Speech Recognition has become more accurate than ever, scenarios like dictation and meeting transcription are gaining popularity. Metrics need to evolve with the times and guide the research focus.https://www.kdnuggets.com/2022/10/evolution-speech-recognition-metrics.html
-
Why TinyML Cases Are Becoming Popular?
This article will provide an overview of what TinyML is, its use cases, and why it is becoming more popular.https://www.kdnuggets.com/2022/10/tinyml-cases-becoming-popular.html
-
10 Essential SQL Commands for Data Science
Learn SQL commands for filtering, string operations, alias, joining tables, if-else statements, and grouping.https://www.kdnuggets.com/2022/10/10-essential-sql-commands-data-science.html
-
Frameworks for Approaching the Machine Learning Process
This post is a summary of 2 distinct frameworks for approaching machine learning tasks, followed by a distilled third. Do they differ considerably (or at all) from each other, or from other such processes available?https://www.kdnuggets.com/2018/05/general-approaches-machine-learning-process.html
-
Converting Text Documents to Token Counts with CountVectorizer
The post explains the significance of CountVectorizer and demonstrates its implementation with Python code.https://www.kdnuggets.com/2022/10/converting-text-documents-token-counts-countvectorizer.html
-
KDnuggets News, October 19: 3 Valuable Skills That Have Doubled My Income as a Data Scientist • The Complete Free PyTorch Course for Deep Learning
3 Valuable Skills That Have Doubled My Income as a Data Scientist • The Complete Free PyTorch Course for Deep Learning • 7 Free Platforms for Building a Strong Data Science Portfolio • Mathematics for Machine Learning: The Free eBook • 25 Advanced SQL Interview Questions for Data Scientistshttps://www.kdnuggets.com/2022/n41.html
-
Become Data-Driven Faster with DataCamp’s Analyst Takeover
Looking for a clear (and fast) data upskilling path? Discover DataCamp’s hands-on analyst tracks and certification program—no prior analyst experience required.https://www.kdnuggets.com/2022/10/datacamp-data-driven-faster-analyst-takeover.html
-
25 Advanced SQL Interview Questions for Data Scientists
Check out this collection of advanced SQL interview questions with answers.https://www.kdnuggets.com/2022/10/25-advanced-sql-interview-questions-data-scientists.html
-
5 Free Courses to Master Calculus
Calculus is one of the foundational pillars of understanding the mathematics behind machine learning algorithms. The post shares five free courses to help you master calculus and learn its real-world applications.https://www.kdnuggets.com/2022/10/5-free-courses-master-calculus.html
-
Essential Books You Need to Become a Data Engineer
In this article, I will go through the roadmap of books you need to become a Data Engineer.https://www.kdnuggets.com/2022/10/essential-books-need-become-data-engineer.html
-
Working With Sparse Features In Machine Learning Models
Sparse features can cause problems like overfitting and suboptimal results in learning models, and understanding why this happens is crucial when developing models. Multiple methods, including dimensionality reduction, are available to overcome issues due to sparse features.https://www.kdnuggets.com/2021/01/sparse-features-machine-learning-models.html
-
Implementing Adaboost in Scikit-learn
It is called Adaptive Boosting due to the fact that the weights are re-assigned to each instance, with higher weights being assigned to instances that are not correctly classified - therefore it ‘adapts’.https://www.kdnuggets.com/2022/10/implementing-adaboost-scikitlearn.html
-
7 Free Platforms for Building a Strong Data Science Portfolio
Outshine others and increase your odds of getting hired by maintaining a data science portfolio with projects, resumes, blogs, and reports.https://www.kdnuggets.com/2022/10/7-free-platforms-building-strong-data-science-portfolio.html
-
Mathematics for Machine Learning: The Free eBook
Check out this free ebook covering the fundamentals of mathematics for machine learning, as well as its companion website of exercises and Jupyter notebooks.https://www.kdnuggets.com/2020/04/mathematics-machine-learning-book.html
-
Explaining Explainable AI for Conversations
Something is missing in artificial intelligence – trust.https://www.kdnuggets.com/2022/10/explaining-explainable-ai-conversations.html
-
Sparse Matrix Representation in Python
Leveraging sparse matrix representations for your data when appropriate can spare you memory storage. Have a look at the reasons why, see how to create sparse matrices in with Python, and compare the memory requirements for standard and sparse representations of the same data.https://www.kdnuggets.com/2020/05/sparse-matrix-representation-python.html
-
Classification Metrics Walkthrough: Logistic Regression with Accuracy, Precision, Recall, and ROC
In this article, I will be going through 4 common classification metrics: Accuracy, Precision, Recall, and ROC in relation to Logistic Regression.https://www.kdnuggets.com/2022/10/classification-metrics-walkthrough-logistic-regression-accuracy-precision-recall-roc.html
-
5 Free Courses to Master Linear Algebra
Linear Algebra is an important subfield of mathematics and forms a core foundation of machine learning algorithms. The post shares five free courses to master the concepts of linear algebra.https://www.kdnuggets.com/2022/10/5-free-courses-master-linear-algebra.html
-
Data Representation for Natural Language Processing Tasks
In NLP we must find a way to represent our data (a series of texts) to our systems (e.g. a text classifier). As Yoav Goldberg asks, "How can we encode such categorical data in a way which is amenable for us by a statistical classifier?" Enter the word vector.https://www.kdnuggets.com/2018/11/data-representation-natural-language-processing.html
-
KDnuggets News, October 12: 10 Cheat Sheets You Need To Ace Data Science Interview • NLP Interview Questions
10 Cheat Sheets You Need To Ace Data Science Interview • NLP Interview Questions • A Day in the Life of a Machine Learning Engineer • 11 Questions About Data Engineers: What's the profession about, and where's it heading? • The ABCs of NLP, From A to Zhttps://www.kdnuggets.com/2022/n40.html
-
What it takes to crack Machine Learning Engineer interviews
Interview Kickstart’s Machine Learning Interview Course is the first-of-its-kind, ML-specific tech interview prep program designed and taught by FAANG+ instructors. Learn more about the program.https://www.kdnuggets.com/2022/10/interview-kickstart-crack-machine-learning-engineer-interviews.html
-
Statistical Functions in Python
In this tutorial, we would be covering some useful statistical functions which can be applied to pandas and series objects.https://www.kdnuggets.com/2022/10/statistical-functions-python.html
-
3 Valuable Skills That Have Doubled My Income as a Data Scientist
In a year, I have learned three essential skills that have opened a new world of possibilities.https://www.kdnuggets.com/2022/10/3-valuable-skills-doubled-income-data-scientist.html
-
The Complete Free PyTorch Course for Deep Learning
Do you want to learn PyTorch for machine learning and deep learning? Check out this 24 hour long video course with accompanying notes and courseware for free. Did I mention it's free?https://www.kdnuggets.com/2022/10/complete-free-pytorch-course-deep-learning.html
-
Top Posts October 3-9: How to Select Rows and Columns in Pandas
How to Select Rows and Columns in Pandas Using [ ], .loc, iloc, .at and .iat • Top Free Git GUI Clients for Beginners • Decision Tree Algorithm, Explained • 7 Techniques to Handle Imbalanced Data • Free Algorithms in Python Coursehttps://www.kdnuggets.com/2022/10/top-posts-week-1003-1009.html
-
How to Build a Data Science Enablement Team: A Complete Guide
A Data Science Enablement Team consists of people from various departments like marketing, sales, product development, etc. They are responsible for providing the necessary tools and resources to help the data scientists do their job more efficiently.https://www.kdnuggets.com/2022/10/build-data-science-enablement-team-complete-guide.html
-
3 Simple Ways to Speed Up Your Python Code
The post explains three popular frameworks, PySpark, Dask, and Ray, and discusses various factors to select the most appropriate one for your project.https://www.kdnuggets.com/2022/10/3-simple-ways-speed-python-code.html
-
A Beginner’s Guide to Web Scraping Using Python
This article serves as a beginner’s guide to web scraping using Python and looks at the different frameworks and methods you can use, outlined in simple terms.https://www.kdnuggets.com/2022/10/beginner-guide-web-scraping-python.html
-
A Day in the Life of a Machine Learning Engineer
What does a day in the life as a machine learning engineer look like for you?https://www.kdnuggets.com/2022/10/day-life-machine-learning-engineer.html
-
10 Cheat Sheets You Need To Ace Data Science Interview
The only cheat you need for a job interview and data professional life. It includes SQL, web scraping, statistics, data wrangling and visualization, business intelligence, machine learning, deep learning, NLP, and super cheat sheets.https://www.kdnuggets.com/2022/10/10-cheat-sheets-need-ace-data-science-interview.html
-
How to Get Up and Running with SQL – A List of Free Learning Resources
We have compiled a list of the top free resources to help new data practitioners learn SQL. These include free online courses and resources to get the most out of your SQL skills.https://www.kdnuggets.com/2022/10/get-running-sql-list-free-learning-resources.html
-
Debunking the Myth of the Citizen Data Scientist
While there are some benefits to having citizen data scientists, they are no silver bullet – and they certainly aren’t a replacement for true data scientists.https://www.kdnuggets.com/2022/10/debunking-myth-citizen-data-scientist.html
-
Feature Store Summit 2022: A free conference on Feature Engineering
Next week, the Feature Store Summit 2022 will bring together leading innovators in cutting-edge technologies and discuss all things on data and AI!https://www.kdnuggets.com/2022/10/hopsworks-feature-store-summit-2022-free-conference-feature-engineering.html
-
11 Questions About Data Engineers: What’s the profession about, and where’s it heading?
I hope my answers will be useful to novice data engineers and anyone interested in data engineering.https://www.kdnuggets.com/2022/10/11-questions-data-engineers-profession-heading.html
-
3 Ways to Process CSV Files in Python
This article is about 3 ways you can process a CSV file using Python.https://www.kdnuggets.com/2022/10/3-ways-process-csv-files-python.html
-
What makes a visualization good?
Jeff Heer, co-collaborator on data visualization tools like D3.js, Vega, and Vega-Lite, recently addressed the question, "What makes a visualization good?"https://www.kdnuggets.com/2022/10/sphere-makes-visualization-good.html
-
AI in FinTech: Managing the Finance of the Future
Digital transformation is evolving, and so is the fintech industry by implementing AI trends and leveraging several benefits, such as optimizing productivity, increasing ROI, and enhancing security.https://www.kdnuggets.com/2022/10/ai-fintech-managing-finance-future.html
-
Hyperparameter Tuning Using Grid Search and Random Search in Python
A comprehensive guide on optimizing model hyperparameters with Scikit-Learn.https://www.kdnuggets.com/2022/10/hyperparameter-tuning-grid-search-random-search-python.html
-
KDnuggets News, October 5: Top Free Git GUI Clients for Beginners • A Day in the Life of a Data Scientist
Top Free Git GUI Clients for Beginners • A Day in the Life of a Data Scientist: Expert vs. Beginner • Getting Started with Pandas Cheatsheet • Top 5 Machine Learning Practices Recommended by Experts • 7 Steps to Mastering Machine Learning with Python in 2022https://www.kdnuggets.com/2022/n39.html
-
Interview Kickstart Data Science Interview Course — What Makes It Different?
Interview Kickstart’s Data Science Interview Course is built by Data Scientists from MAANG and other big tech companies, the course promises to get you interview-ready in 15 weeks.https://www.kdnuggets.com/2022/10/interview-kickstart-data-science-interview-course-makes-different.html
-
6 Best Free Online Courses to Jumpstart Your Learning of SQL
We scoured the internet for the best free courses for anyone looking to learn SQL. We’re excited to share the top 6 resources we found.https://www.kdnuggets.com/2022/10/corise-6-best-free-online-courses-jumpstart-learning-sql.html
-
Key-Value Databases, Explained
Among the four big NoSQL database types, key-value stores are probably the most popular ones due to their simplicity and fast performance. Let’s further explore how key-value stores work and what are their practical uses.https://www.kdnuggets.com/2021/04/nosql-explained-understanding-key-value-databases.html
-
Key Takeaways from BigData London Conference and Exhibition
Read some of the key takeaways from BigData LDN, one of the UK's free data & analytics conferences, which took place recently.https://www.kdnuggets.com/2022/10/key-takeaways-bigdata-london-conference-exhibition.html
-
Machine Learning for Everybody!
Who is machine learning for? Everybody!https://www.kdnuggets.com/2022/10/machine-learning-everybody.html
-
Which Metric Should I Use? Accuracy vs. AUC
Depending on the problem you’re trying to solve, one metric may be more insightful than another.https://www.kdnuggets.com/2022/10/metric-accuracy-auc.html
-
Top Posts September 26 – October 2: Free Algorithms in Python Course
Free Algorithms in Python Course • How to Select Rows and Columns in Pandas • Lessons from a Senior Data Scientist • A Day in the Life of a Data Scientist: Expert vs. Beginner • 7 Machine Learning Portfolio Projects to Boost the Resumehttps://www.kdnuggets.com/2022/10/top-posts-week-0926-1002.html
-
Beginner Friendly Python Projects That Are Fun!
Projects like this are not only beginner friendly, but they add a little bit of fun to your studies or career.https://www.kdnuggets.com/2022/10/beginner-friendly-python-projects-fun.html
-
Top Free Git GUI Clients for Beginners
Learn about beginner-friendly Git GUI clients and perform Git-based tasks using an interactive user interface.https://www.kdnuggets.com/2022/10/top-free-git-gui-clients-beginners.html
-
Handling Missing Values in Time-series with SQL
This article is about a specific use-case that comes up often when dealing with time-series data.https://www.kdnuggets.com/2022/09/handling-missing-values-timeseries-sql.html
-
Are the Efforts of People Analytics Worth the Outcome?
Learn about the connection between people analytics and creating diversity, equity, and inclusion (DEI) accountability.https://www.kdnuggets.com/2022/09/efforts-people-analytics-worth-outcome.html
-
7 Steps to Mastering Machine Learning with Python in 2022
Are you trying to teach yourself machine learning from scratch, but aren’t sure where to start? I will attempt to condense all the resources I’ve used over the years into 7 steps that you can follow to teach yourself machine learning.https://www.kdnuggets.com/2022/02/7-steps-mastering-machine-learning-python.html
-
Master Transformers with This Free Stanford Course!
If you want a deep dive on transformers, this Stanford course has made its courseware freely available, including lecture videos, readings, assignments, and more.https://www.kdnuggets.com/2022/09/master-transformers-free-stanford-course.html