Search results for
-
Deploying Your First Machine Learning API">
Effortless way to develop and deploy your machine learning API using FastAPI and Deta.Deploying Your First Machine Learning API
https://www.kdnuggets.com/2021/10/deploying-first-machine-learning-api.html
-
Do you do Python? Do you do data science and machine learning? Then, you need to do these crucial Python libraries that enable nearly all you will want to do.The 20 Python Packages You Need For Machine Learning and Data Science">
The 20 Python Packages You Need For Machine Learning and Data Science
https://www.kdnuggets.com/2021/10/20-python-packages.html
-
What is Clustering and How Does it Work?
Let us examine how clusters with different properties are produced by different clustering algorithms. In particular, we give an overview of three clustering methods: k-Means clustering, hierarchical clustering, and DBSCAN.https://www.kdnuggets.com/2021/10/clustering-what-is-how-works.html
-
How Hasura Improved Conversion Rates By 20% With PostHog
Find out how Hasura increased conversion rates by 10-20% by using PostHog for self-hosted product analytics!https://www.kdnuggets.com/2021/10/posthog-hasura-improved-conversion-rates.html
-
Will Your Job be Replaced by a Machine?
Yes! It will happen. However, you can pivot and thrive in this disruptive time by becoming a Citizen Developer!https://www.kdnuggets.com/2021/10/job-replaced-machine.html
-
How to Ace Data Science Interview by Working on Portfolio Projects">
Recruiters of Data Science professionals around the world focus on portfolio projects rather than resumes and LinkedIn profiles. So, learning early how to contribute and share your work on GitHub, Deepnote, and Kaggle can help you perform your best during data science interviews.How to Ace Data Science Interview by Working on Portfolio Projects
https://www.kdnuggets.com/2021/10/ace-data-science-interview-portfolio-projects.html
-
Building Multimodal Models: Using the widedeep Pytorch package
This article gets you started on the open-source widedeep PyTorch framework developed by Javier Rodriguez Zaurin.https://www.kdnuggets.com/2021/10/building-multimodal-models-widedeep-pytorch-package.html
-
KDnuggets™ News 21:n39, Oct 13: 8 Must-Have Git Commands for Data Scientists; 38 Free Courses on Coursera for Data Science
The 8 Git commands Data Scientists should know; 38 free courses on Coursera for Data Science; How to query your Pandas DataFrames with SQL; Why You Need Python Skills as a Machine Learning Engineer; and more.https://www.kdnuggets.com/2021/n39.html
-
Top September Stories: Do You Read Excel Files with Python? There is a 1000x Faster Way
Also: Data Scientists Without Data Engineering Skills Will Face the Harsh Truth; Nine Tools I Wish I Mastered Before My PhD in ML; A Data Science Portfolio That Will Land You The Jobhttps://www.kdnuggets.com/2021/10/top-stories-2021-sep.html
-
Transforming your business with SAS® Viya® on Microsoft Azure
Faster, trusted decisions are in the cloud. See how you can use the flexibility, scalability and agility of modern technologies to advance your organization’s goals. Read our blog with 3-part video demo.https://www.kdnuggets.com/2021/10/sas-viya-microsoft-azure.html
-
Create Synthetic Time-series with Anomaly Signatures in Python
A simple and intuitive way to create synthetic (artificial) time-series data with customized anomalies — particularly suited to industrial applications.https://www.kdnuggets.com/2021/10/synthetic-time-series-anomaly-signatures-python.html
-
How I Built A Perfect Model And Got Into Trouble
Data-driven decisions, actionable insights, business impact—you've seen these buzzwords in data science jobs descriptions. But, just focusing on these terms doesn't automatically lead to the best results. Learn from this real-world scenario that followed data-driven indecisiveness, found misleading insights, and initially created a negative business impact.https://www.kdnuggets.com/2021/10/perfect-model-trouble.html
-
Step by Step Building a Vacancy Tracker Using Tableau
Step-by-step explanations of vacancies valued in tens of millions of dollars in the small town of Fitchburg, Massachusetts.https://www.kdnuggets.com/2021/10/building-vacancy-tracker-using-tableau.html
-
PASS Data Community Summit – Free Online Conference for Data Professionals
PASS Data Community Summit 2021 is the year’s largest gathering of Microsoft data platform professionals. This FREE online conference (taking place November 8 – 12, 2021) features 200+ world-class speakers and sessions, and gives you the opportunity to connect, share, and learn with thousands of your peers from the global data platform community.https://www.kdnuggets.com/2021/10/pass-data-community-summit.html
-
Top Stories, Oct 4-10: How to Build Strong Data Science Portfolio as a Beginner; 38 Free Courses on Coursera for Data Science
Also: Data science SQL interview questions from top tech firms; Here’s Why You Need Python Skills as a Machine Learning Engineer; 8 Must-Have Git Commands for Data Scientists; Introduction to PyTorch Lightninghttps://www.kdnuggets.com/2021/10/top-news-week-1004-1010.html
-
AutoML: An Introduction Using Auto-Sklearn and Auto-PyTorch
AutoML is a broad category of techniques and tools for applying automated search to your automated search and learning to your learning. In addition to Auto-Sklearn, the Freiburg-Hannover AutoML group has also developed an Auto-PyTorch library. We’ll use both of these as our entry point into AutoML in the following simple tutorial.https://www.kdnuggets.com/2021/10/automl-introduction-auto-sklearn-auto-pytorch.html
-
Scaling human oversight of AI systems for difficult tasks – OpenAI approach
The foundational idea of Artificial Intelligence is that it should demonstrate human-level intelligence. So, unless a model can perform as a human might do, its intended purpose is missed. Here, recent OpenAI research into full-length book summarization focuses on generating results that make sense to humans with state-of-the-art results that leverage scalable AI-enhanced human-in-the-loop feedback.https://www.kdnuggets.com/2021/10/openai-scaling-human-oversight-ai-systems.html
-
Choose The Right Job in Data: 5 Signs To Look For In An Engineering Culture
Software engineers seeking jobs at data companies face a new problem: choosing the right job out of all the options. Learn the 5 signs that signal an agile and innovative engineering culture.https://www.kdnuggets.com/2021/10/choose-right-job-data-signs-engineering-culture.html
-
Are you familiar with data labeling?
Are you familiar with common data labeling approaches and tools? Take a simple 2-minute survey.https://www.kdnuggets.com/2021/10/toloka-survey-data-labeling.html
-
8 Must-Have Git Commands for Data Scientists">
Git is a must-have skill for data scientists. Maintaining your development work within a version control system is absolutely necessary to have a collaborative and productive working environment with your colleagues. This guide will quickly start you off in the right direction for contributing to an existing project at your organization.8 Must-Have Git Commands for Data Scientists
https://www.kdnuggets.com/2021/10/8-git-commands-data-scientists.html
-
Dealing with Data Leakage
Target leakage and data leakage represent challenging problems in machine learning. Be prepared to recognize and avoid these potentially messy problems.https://www.kdnuggets.com/2021/10/dealing-data-leakage.html
-
Transforming the Shop Floor: A No-BS Look at Data Science in Manufacturing
Join RapidMiner live on LinkedIn, Oct 28, to learn how you can lead a digital transformation—not by starting from scratch, but by getting more from what you already have. We’ll walk through a series of real-world examples to demonstrate how your data, when paired with machine learning, can be used to make smarter process decisions.https://www.kdnuggets.com/2021/10/rapidminer-data-science-manufacturing.html
-
The Evolution of Tokenization – Byte Pair Encoding in NLP
Though we have SOTA algorithms for tokenization, it's always a good practice to understand the evolution trail and learning how have we reached here. Read this introduction to Byte Pair Encoding.https://www.kdnuggets.com/2021/10/evolution-tokenization-byte-pair-encoding-nlp.html
-
Building and Operationalizing Machine Learning Models: Three tips for success
With more enterprises implementing machine learning to improve revenue and operations, properly operationalizing the ML lifecycle in a holistic way is crucial for data teams to make their projects efficient and effective.https://www.kdnuggets.com/2021/10/building-operationalizing-machine-learning-models.html
-
How to do “Limitless” Math in Python
How to perform arbitrary-precision computation and much more math (and fast too) than what is possible with the built-in math library in Python.https://www.kdnuggets.com/2021/10/limitless-math-python.html
-
Here’s Why You Need Python Skills as a Machine Learning Engineer">
If you want to learn how to apply Python programming skills in the context of AI applications, the UC San Diego Extension Machine Learning Engineering Bootcamp can help. Read on to find out more about how machine learning engineers use Python, and why the language dominates today’s machine learning landscape.Here’s Why You Need Python Skills as a Machine Learning Engineer
https://www.kdnuggets.com/2021/10/bootcamp-python-skills-machine-learning-engineer.html
-
Four Different Pipes for R with magrittr
The magrittr package supplies the pipe operator (%>%), but it turns out that the package actually contains four pipe operators in total. Let's go into them a bit.https://www.kdnuggets.com/2021/10/four-different-pipes-r-magrittr.html
-
38 Free Courses on Coursera for Data Science">
There are so many online resources for learning data science, and a great deal of it can be used at no cost. This collection of free courses hosted by Coursera will help you enhance your data science and machine learning skills, no matter your current level of expertise.38 Free Courses on Coursera for Data Science
https://www.kdnuggets.com/2021/10/38-free-courses-coursera-datascience.html
-
My AI Plays Piano for Me
Training an RNN with a Combined Loss Function.https://www.kdnuggets.com/2021/10/ai-plays-piano.html
-
KDnuggets™ News 21:n38, Oct 6: Build a Strong Data Science Portfolio; Surpassing Trillion Parameters with Switch Transformers — a path to AGI?
How to Build Strong Data Science Portfolio as a Beginner; Surpassing Trillion Parameters and GPT-3 with Switch Transformers — a path to AGI?; How Deep Is That Data Lake?; Data Science Process Lifecycle; Use These Unique Data Sets to Sharpen Your Data Science Skills; How to Auto-Detect the Date/Datetime Columns and Set Their Datatype When Reading a CSV File in Pandashttps://www.kdnuggets.com/2021/n38.html
-
Eight Data Science Specializations, and Why You Should Pick One
With so many Data Science specializations, where should you focus? The Pace University online Master of Science in Data Science features elective courses which allow you to focus on topics that suit your career path so that you can begin to develop a unique specialization.https://www.kdnuggets.com/2021/10/pace-eight-data-science-specializations.html
-
Will Data Analysts be Replaced by AI?
It's the question so many are asking: will data analysts be replaced by AI? Read this well-reasoned and concise opinion by someone with insight into the matter.https://www.kdnuggets.com/2021/10/data-analysts-replaced-ai.html
-
Data science SQL interview questions from top tech firms">
As a data scientist, there is one thing you really need to understand and know how to handle: data. With SQL being a foundational technical approach for working with data, it should not be surprising that the top tech companies will ask about your SQL skills during an interview. Here, we cover the key concepts tested so you can best prepare for your next data science interview.Data science SQL interview questions from top tech firms
https://www.kdnuggets.com/2021/10/data-science-sql-interview-questions.html
-
The Architecture Behind DeepMind’s Model for Near Real Time Weather Forecasts
Deep Generative Model of Rain (DGMR) is the newest creation from DeepMind which can predict precipitation in short term intervals.https://www.kdnuggets.com/2021/10/architecture-deepmind-model-near-real-time-weather-forecasts.html
-
Top Stories, Sep 27 – Oct 3: Path to Full Stack Data Science
Also: How To Build A Database Using Python; Surpassing Trillion Parameters and GPT-3 with Switch Transformers – a path to AGI?; Nine Tools I Wish I Mastered Before My PhD in Machine Learning; 20 Machine Learning Projects That Will Get You Hiredhttps://www.kdnuggets.com/2021/10/top-news-week-0927-1003.html
-
Parallelizing Python Code
This article reviews some common options for parallelizing Python code, including process-based parallelism, specialized libraries, ipython parallel, and Ray.https://www.kdnuggets.com/2021/10/parallelizing-python-code.html
-
Introduction to PyTorch Lightning">
PyTorch Lightning is a high-level programming layer built on top of PyTorch. It makes building and training models faster, easier, and more reliable.Introduction to PyTorch Lightning
https://www.kdnuggets.com/2021/10/introduction-pytorch-lightning.html
-
Cartoon: How Deep Is That Data Lake?
New KDnuggets Cartoon looks at some of the problems data engineers may encounter when trying to measure data lakes.https://www.kdnuggets.com/2021/10/cartoon-data-lake.html
-
Teaching AI to Classify Time-series Patterns with Synthetic Data">
How to build and train an AI model to identify various common anomaly patterns in time-series data.Teaching AI to Classify Time-series Patterns with Synthetic Data
https://www.kdnuggets.com/2021/10/teaching-ai-classify-time-series-patterns-synthetic-data.html
-
Surpassing Trillion Parameters and GPT-3 with Switch Transformers – a path to AGI?">
Ever larger models churning on increasingly faster machines suggest a potential path toward smarter AI, such as with the massive GPT-3 language model. However, new, more lean, approaches are being conceived and explored that may rival these super-models, which could lead to a future with more efficient implementations of advanced AI-driven systems.Surpassing Trillion Parameters and GPT-3 with Switch Transformers – a path to AGI?
https://www.kdnuggets.com/2021/10/trillion-parameters-gpt-3-switch-transformers-path-agi.html
-
How to Auto-Detect the Date/Datetime Columns and Set Their Datatype When Reading a CSV File in Pandas
When read_csv( ) reads e.g. “2021-03-04” and “2021-03-04 21:37:01.123” as mere “object” datatypes, often you can simply auto-convert them all at once to true datetime datatypes.https://www.kdnuggets.com/2021/10/auto-detect-date-datetime-columns-and-set-their-datatype-when-reading-a-csv-file-in-pandas.html
-
Scale and Govern AI Initiatives with ModelOps
AI/ML model life cycle automation and orchestration ensures reliable model operations and governance at scale. The path to production for each enterprise model can vary, along with different monitoring, continuous improvement, retirement needs. Organizations must now consider ModelOps as a fundamental capability towards operational excellence and immediate ROIs.https://www.kdnuggets.com/2021/09/scale-govern-ai-modelops.html
-
Advanced Statistical Concepts in Data Science
The article contains some of the most commonly used advanced statistical concepts along with their Python implementation.https://www.kdnuggets.com/2021/09/advanced-statistical-concepts-data-science.html
-
Use These Unique Data Sets to Sharpen Your Data Science Skills
Want to get your hands on some real-world data sets right now? Kick off your bootcamp prep with this list of hot-button data sets curated to help you hone different data science skills.https://www.kdnuggets.com/2021/09/springboard-unique-data-sets-data-science-skills.html
-
GitHub Desktop for Data Scientists
Less scary than version control in the command line.https://www.kdnuggets.com/2021/09/github-desktop-data-scientists.html
-
Important Statistics Data Scientists Need to Know
Several fundamental statistical concepts must be well appreciated by every data scientist -- from the enthusiast to the professional. Here, we provide code snippets in Python to increase understanding to bring you key tools that bring early insight into your data.https://www.kdnuggets.com/2021/09/important-statistics-data-scientists.html
-
Data Science Process Lifecycle
How would it feel to know that without a doubt, the data projects you were working on would create TRUE ROI for your organization? Stick around until the end to get my data science process lifecycle framework so that each data project you run is a smashing success.https://www.kdnuggets.com/2021/09/data-science-process-lifecycle.html
-
KDnuggets™ News 21:n37, Sep 29: Nine Tools I Wish I Mastered Before My PhD in Machine Learning; Path to Full Stack Data Science
Whether you have a PhD or not, learn these very useful 9 tools to increase your mastery of Machine Learning; Check this detailed path to becoming a full stack Data Scientist; Then do one of these 20 Machine Learning Projects that will help you get a job; See a Breakdown of Deep Learning Frameworks; and more.https://www.kdnuggets.com/2021/n37.html
-
Transform speech into knowledge with Huggingface/Facebook AI and expert.ai
Speech2Data is a blend of open source and free-to-use AI models and technologies powered by Huggingface, Facebook AI and expert.ai. Learn more here.https://www.kdnuggets.com/2021/09/expert-ai-speech-huggingface-facebook.html
-
How To Build A Database Using Python">
Implement your database without handling the SQL using the Flask-SQLAlchemy library.How To Build A Database Using Python
https://www.kdnuggets.com/2021/09/build-database-using-python.html
-
MLOps and ModelOps: What’s the Difference and Why it Matters
These two terms are often used interchangeably. However, there are key distinctions between the functionality and features each provide, and the AI value and scalability at your organization depend on them.https://www.kdnuggets.com/2021/09/mlops-modelops-difference.html
-
Building a Structured Financial Newsfeed Using Python, SpaCy and Streamlit
Getting started with NLP by building a Named Entity Recognition(NER) application.https://www.kdnuggets.com/2021/09/-structured-financial-newsfeed-using-python-spacy-and-streamlit.html
-
Top Stories, Sep 20-26: Nine Tools I Wish I Mastered Before My PhD in Machine Learning; How to Find Weaknesses in your Machine Learning Models
Also: How to be a Data Scientist without a STEM degree; Data Scientists Without Data Engineering Skills Will Face the Harsh Truth; 20 Machine Learning Projects That Will Get You Hired; How to Find Weaknesses in your Machine Learning Modelshttps://www.kdnuggets.com/2021/09/top-news-week-0920-0926.html
-
Computer Vision in Agriculture
Deep learning isn’t just for placing ads or identifying cats anymore. Instead, a slew of young startups have started to incorporate the advances in computer vision made possible through larger and larger neural networks to real working robots in the fields.https://www.kdnuggets.com/2021/09/computer-vision-agriculture.html
-
Start your journey toward mastering all aspects of the field of Data Science with this focused list of in-depth self-learning resources. Curated with the beginner in mind, these recommendations will help you learn efficiently, and can also offer existing professionals useful highlights for review or help filling in any gaps in skills.Path to Full Stack Data Science">
Path to Full Stack Data Science
https://www.kdnuggets.com/2021/09/path-full-stack-data-science.html
-
Zero to RAPIDS in Minutes with NVIDIA GPUs + Saturn Cloud
Managing large-scale data science infrastructure presents significant challenges. With Saturn Cloud, managing GPU-based infrastructure is made easier, allowing practitioners and enterprises to focus on solving their business challenges.https://www.kdnuggets.com/2021/09/zero-rapids-minutes-nvidia-gpus-saturn-cloud.html
-
Data Analysis Using Scala
It is very important to choose the right tool for data analysis. On the Kaggle forums, where international Data Science competitions are held, people often ask which tool is better. R and Python are at the top of the list. In this article we will tell you about an alternative stack of data analysis technologies, based on Scala.https://www.kdnuggets.com/2021/09/data-analysis-scala.html
-
Real-Time Histogram Plots on Unbounded Data
Using histograms on real-time data is not possible in most of the popular data science libraries. In this article you will learn how dynamically compute and display a histogram within a Python notebook.https://www.kdnuggets.com/2021/09/real-time-histogram-plots-unbounded-data.html
-
How Data Scientists Can Compete in the Global Job Market
Data scientists wanting to stay competitive or break into the field will need the right approach. These techniques will help them search for and secure a new position.https://www.kdnuggets.com/2021/09/data-scientists-compete-global-job-market.html
-
Introducing PostHog: An open-source product analytics platform
PostHog is an open-source product analytics platform that helps you and your product team capture, analyze, and make informed decisions based on user behaviour.https://www.kdnuggets.com/2021/09/posthog-open-source-product-analytics-platform.html
-
How To Deal With Imbalanced Classification, Without Re-balancing the Data
Before considering oversampling your skewed data, try adjusting your classification decision threshold, in Python.https://www.kdnuggets.com/2021/09/imbalanced-classification-without-re-balancing-data.html
-
A Breakdown of Deep Learning Frameworks
Deep Learning continues to evolve as one of the most powerful techniques in the AI toolbox. Many software packages exist today to support the development of models, and we highlight important options available with key qualities and differentiators to help you select the most appropriate for your needs.https://www.kdnuggets.com/2021/09/a-breakdown-deep-learning-frameworks.html
-
9 Outstanding Reasons to Learn Python for Finance
Is Python good for learning finance and working in the financial world? The answer is not only a resounding YES, but yes for nine very good reasons. This article gets into the details behind why Python is a must-know programming language for anyone who wants to work in the financial sector.https://www.kdnuggets.com/2021/09/9-outstanding-reasons-learn-python-finance.html
-
Messy Data is Beautiful
Once these types of data have been cleaned, they do more than show organized data sets. They reveal unlimited possibilities, and AI analytics can reveal these possibilities faster and more efficiently than ever before.https://www.kdnuggets.com/2021/09/sparkbeyond-messy-data-is-beautiful.html
-
GitHub Copilot and the Rise of AI Language Models in Programming Automation
Read on to learn more about what makes Copilot different from previous autocomplete tools (including TabNine), and why this particular tool has been generating so much controversy.https://www.kdnuggets.com/2021/09/github-copilot-rise-ai-language-models-programming-automation.html
-
20 Machine Learning Projects That Will Get You Hired">
If you want to break into the machine learning and data science job market, then you will need to demonstrate the proficiency of your skills, especially if you are self-taught through online courses and bootcamps. A project portfolio is a great way to practice your new craft and offer convincing evidence that an employee should hire you over the competition.20 Machine Learning Projects That Will Get You Hired
https://www.kdnuggets.com/2021/09/20-machine-learning-projects-hired.html
-
Whether you are building a start up or making scientific breakthroughs these tools will bring your ML pipeline to the next level.Nine Tools I Wish I Mastered Before My PhD in Machine Learning">
Nine Tools I Wish I Mastered Before My PhD in Machine Learning
https://www.kdnuggets.com/2021/09/nine-tools-mastered-before-phd-machine-learning.html
-
KDnuggets™ News 21:n36, Sep 22: The Machine & Deep Learning Compendium Open Book; Easy SQL in Native Python
The Machine & Deep Learning Compendium Open Book; Easy SQL in Native Python; Introduction to Automated Machine Learning; How to be a Data Scientist without a STEM degree; What Is The Real Difference Between Data Engineers and Data Scientists?https://www.kdnuggets.com/2021/n36.html
-
Free virtual event: Big Data and AI Toronto
This year’s Big Data and AI Toronto conference and expo, held virtually Oct 13-14, will provide attendees with a 360° view of the industry through a unique 4-in-1 experience: Artificial intelligence, big data, cloud, and cybersecurity.https://www.kdnuggets.com/2021/09/corp-agency-virtual-event-big-data-ai-toronto.html
-
15 Must-Know Python String Methods
It is not always about numbers.https://www.kdnuggets.com/2021/09/15-must-know-python-string-methods.html
-
Data Engineering Technologies 2021
Emerging technologies supporting the field of data engineering are growing at a rapid clip. This curated list includes the most important offerings available in 2021.https://www.kdnuggets.com/2021/09/data-engineering-technologies-2021.html
-
If You Can Write Functions, You Can Use Dask
This article is the second article of an ongoing series on using Dask in practice. Each article in this series will be simple enough for beginners, but provide useful tips for real work. The first article in the series is about using LocalCluster.https://www.kdnuggets.com/2021/09/write-functions-use-dask.html
-
Top Stories, Sep 13-19: Data Scientists Without Data Engineering Skills Will Face the Harsh Truth; The Machine & Deep Learning Compendium Open Book
Also: The Machine & Deep Learning Compendium Open Book; Easy SQL in Native Python; The Prefect Way to Automate & Orchestrate Data Pipelines; A Data Science Portfolio That Will Land You The Jobhttps://www.kdnuggets.com/2021/09/top-news-week-0913-0919.html
-
How to label time series efficiently – and boost your AI
Data labeling is a critical step in building high-quality AI models. This blog explains how to speed up the labeling process of time series data from sensors and IoT devices.https://www.kdnuggets.com/2021/09/visplore-label-time-series-efficiently.html
-
Don’t Touch a Dataset Without Asking These 10 Questions
Selecting the right dataset is critical for the success of your AI project.https://www.kdnuggets.com/2021/09/dataset-asking-10-questions.html
-
How to be a Data Scientist without a STEM degree">
Breaking into data science as a professional does require technical skills, a well-honed knack for problem-solving, and a willingness to swim in oceans of data. Maybe you are coming in as a career change or ready to take a new learning path in life--without having previously earned an advanced degree in a STEM field. Follow these tips to find your way into this high-demand and interesting field.How to be a Data Scientist without a STEM degree
https://www.kdnuggets.com/2021/09/data-scientist-without-stem-degree.html
-
How to Find Weaknesses in your Machine Learning Models">
FreaAI: a new method from researchers at IBM.How to Find Weaknesses in your Machine Learning Models
https://www.kdnuggets.com/2021/09/weaknesses-machine-learning-models.html
-
Paradoxes in Data Science
Have a look into some of the main paradoxes associate with Data Science and it’s statistical foundations.https://www.kdnuggets.com/2021/09/paradoxes-data-science.html
-
What 2 years of self-teaching data science taught me
Many of us self-learn data science from the very beginning. While continuing to self-learn on demand is crucial, especially after you become a professional, there can be many pitfalls early on for learning the wrong way or missing out on key ideas that are important for the real-world application of data science.https://www.kdnuggets.com/2021/09/2-years-self-teaching-data-science.html
-
Introducing TensorFlow Similarity
TensorFlow Similarity is a newly-released library from Google that facilitates the training, indexing and querying of similarity models. Check out more here.https://www.kdnuggets.com/2021/09/introducing-tensorflow-similarity.html
-
What Is The Real Difference Between Data Engineers and Data Scientists?
To launch your data career, you’ll need both theoretical knowledge and applied skills. Bootcamp programs like Springboard’s Data Science Career Track and Data Engineering Career Track can help make you job-ready through hands-on, project-based learning and one-on-one mentorship. Wondering which data career path is right for you? Read on to find out.https://www.kdnuggets.com/2021/09/springboard-difference-data-engineers-data-scientists.html
-
Adventures in MLOps with Github Actions, Iterative.ai, Label Studio and NBDEV
This article documents the authors' experience building their custom MLOps approach.https://www.kdnuggets.com/2021/09/adventures-mlops-github-actions-iterative-ai-label-studio-and-nbdev.html
-
The Machine & Deep Learning Compendium Open Book">
After years in the making, this extensive and comprehensive ebook resource is now available and open for data scientists and ML engineers. Learn from and contribute to this tome of valuable information to support all your work in data science from engineering to strategy to management.The Machine & Deep Learning Compendium Open Book
https://www.kdnuggets.com/2021/09/machine-deep-learning-open-book.html
-
KDnuggets Top Blogs Rewards for August 2021
These top blogs were winners of KDnuggets Top Blog Rewards Program for August: Automate Microsoft Excel and Word Using Python,https://www.kdnuggets.com/2021/09/top-blogs-rewards-aug.html
-
DATAcated Expo, Oct 5, Live-streamed,
The DATAcated Expo, hosted by DATAcated founder Kate Strachnyi, is coming up on October 5, 2021 from 11am - 6pm ET. Live-streamed on LinkedIn, the free event provides the community with an opportunity to explore and discover innovative technologies in data science & analytics.
Explore new AI / Data Science Techhttps://www.kdnuggets.com/2021/09/datacated-expo-oct-5.html
-
Introduction to Automated Machine Learning
AutoML enables developers with limited ML expertise (and coding experience) to train high-quality models specific to their business needs. For this article, we will focus on AutoML systems which cater to everyday business and technology applications.https://www.kdnuggets.com/2021/09/introduction-automated-machine-learning.html
-
How to get Python PCAP Certification: Roadmap, Resources, Tips For Success, Based On My Experience
Follow this journey of personal experience -- with useful tips and learning resources -- to help you achieve the PCAP Certification, one of the most reputed Python Certifications, to validate your knowledge against International Standards.https://www.kdnuggets.com/2021/09/python-pcap-certification-roadmap-resources.html
-
5 Must Try Awesome Python Data Visualization Libraries
The goal of data visualization is to communicate data or information clearly and effectively to readers. Here are 5 must try awesome Python libraries for helping you do so, with overviews and links to quick start guides for each.https://www.kdnuggets.com/2021/09/5-awesome-data-visualization-libraries-python.html
-
KDnuggets™ News 21:n35, Sep 15: A Data Science Portfolio That Will Land You The Job; Top 18 Low-Code and No-Code Machine Learning Platforms
Here is a Data Science Portfolio that will land you the job; Review the top 18 Low-Code and No-Code Machine Learning platforms; Try these 8 Deep Learning Project Ideas for Beginners; Very useful - working with Python APIs for data science project.https://www.kdnuggets.com/2021/n35.html
-
Top August Stories: Automate Microsoft Excel and Word Using Python; The Difference Between Data Scientists and ML Engineers
Automate Microsoft Excel and Word Using Python; The Difference Between Data Scientists and ML Engineers; Most Common Data Science Interview Questions and Answers; 3 Reasons Why You Should Use Linear Regression Models Instead of Neural Networkshttps://www.kdnuggets.com/2021/09/top-stories-2021-aug.html
-
Amazon Web Services Webinar: Boost customer satisfaction and sales with consumer insights data
Join this webinar, Sep 27, to learn how to leverage external data to understand market needs and consumer behavior – helping you build a more customer-centric business.https://www.kdnuggets.com/2021/09/roidna-aws-webinar-consumer-insights-data.html
-
Speeding up Neural Network Training With Multiple GPUs and Dask
A common moment when training a neural network is when you realize the model isn’t training quickly enough on a CPU and you need to switch to using a GPU. It turns out multi-GPU model training across multiple machines is pretty easy with Dask. This blog post is about my first experiment in using multiple GPUs with Dask and the results.https://www.kdnuggets.com/2021/09/speeding-neural-network-training-multiple-gpus-dask.html
-
Although the role of the data scientist is still evolving, data remains at its core. Setting the right expectations for what you will do as a data scientist is important, and, to be sure, knowing the tools of data engineering will get yourself ready for the real world.Data Scientists Without Data Engineering Skills Will Face the Harsh Truth">
Data Scientists Without Data Engineering Skills Will Face the Harsh Truth
https://www.kdnuggets.com/2021/09/data-scientists-data-engineering-skills.html
-
An Introduction to Reinforcement Learning with OpenAI Gym, RLlib, and Google Colab
Get an Introduction to Reinforcement Learning by attempting to balance a virtual CartPole with OpenAI Gym, RLlib, and Google Colab.https://www.kdnuggets.com/2021/09/intro-reinforcement-learning-openai-gym-rllib-colab.html
-
Top Stories, Sep 6-12: Do You Read Excel Files with Python? There is a 1000x Faster Way; 8 Deep Learning Project Ideas for Beginners
Also: How to Create Stunning Web Apps for your Data Science Projects; A Data Science Portfolio That Will Land You The Job; Top 18 Low-Code and No-Code Machine Learning Platforms; 8 Deep Learning Project Ideas for Beginnershttps://www.kdnuggets.com/2021/09/top-news-week-0906-0912.html
-
85% of data science projects fail – here’s how to avoid it
Here are a few common traps that data scientists can avoid to NOT be one of the 85% of data science projects that fail.https://www.kdnuggets.com/2021/09/sparkbeyond-avoid-data-science-projects-fail.html
-
The Prefect Way to Automate & Orchestrate Data Pipelines
I am migrating all my ETL work from Airflow to this super-cool framework.https://www.kdnuggets.com/2021/09/prefect-way-automate-orchestrate-data-pipelines.html
-
3 Most Important Lessons I’ve Learned 3 Years Into My Data Science Career
After only 3 years of working as a data professional, many tried-and-true lessons can be learned. Here are 3 of the most important lessons learned with key takeaways and reflections shared.https://www.kdnuggets.com/2021/09/3-important-lessons-data-science-career.html
-
How Many AI Neurons Does It Take to Simulate a Brain Neuron?
A new research shows some shocking answers to that question.https://www.kdnuggets.com/2021/09/ai-neurons-simulate-brain-neuron.html
-
Working with Python APIs For Data Science Project
In this article, we will work with YouTube Python API to collect video statistics from our channel using the requests python library to make an API call and save it as a Pandas DataFrame.https://www.kdnuggets.com/2021/09/python-apis-data-science-project.html
-
Landing a data science job is no easy feat, especially during the COVID-19 pandemic. This article provides aspiring data scientists with advice on building a data science portfolio that stands out.A Data Science Portfolio That Will Land You The Job">
A Data Science Portfolio That Will Land You The Job
https://www.kdnuggets.com/2021/09/data-science-portfolio-job.html
-
Text Preprocessing Methods for Deep Learning
While the preprocessing pipeline we are focusing on in this post is mainly centered around Deep Learning, most of it will also be applicable to conventional machine learning models too.https://www.kdnuggets.com/2021/09/text-preprocessing-methods-deep-learning.html
-
How to Create an AutoML Pipeline Optimization Sandbox
In this article, we will implement an automated machine learning pipeline optimization sandbox web app using Streamlit and TPOT.https://www.kdnuggets.com/2021/09/automl-pipeline-optimization-sandbox.html
-
8 Deep Learning Project Ideas for Beginners">
Have you studied Deep Learning techniques, but never worked on a useful project? Here, we highlight eight deep learning project ideas for beginners that will help you sharpen your skills and boost your resume.8 Deep Learning Project Ideas for Beginners
https://www.kdnuggets.com/2021/09/8-deep-learning-project-ideas-beginners.html
-
7 Differences Between a Data Analyst and a Data Scientist">
This article discusses the 7 key differences between data analysts and data scientists with an aim to help potential data analysts/scientists determine which is the right one for them. I touch on day-to-day tasks, skill requirements, typical career progression, and salary and career prospects for both.7 Differences Between a Data Analyst and a Data Scientist
https://www.kdnuggets.com/2021/09/7-differences-between-data-analyst-data-scientist.html
-
300 Data Science Leaders Share What’s Holding Their Teams Back
Flawed investments in people, processes, and tools are crushing potential business impact.https://www.kdnuggets.com/2021/09/domino-300-data-science-leaders.html
-
Smart Ingestion: Using ontology-driven AI
Imagine data that organizes itself to power your decision-making.https://www.kdnuggets.com/2021/09/smart-ingestion-ontology-driven-ai.html
-
Top 18 Low-Code and No-Code Machine Learning Platforms">
Machine learning becomes more accessible to companies and individuals when there is less coding involved. Especially if you are just starting your path in ML, then check out these low-code and no-code platforms to help expedite your capabilities in learning and applying AI.Top 18 Low-Code and No-Code Machine Learning Platforms
https://www.kdnuggets.com/2021/09/top-18-low-code-no-code-machine-learning-platforms.html
-
Math 2.0: The Fundamental Importance of Machine Learning
Machine learning is not just another way to program computers; it represents a fundamental shift in the way we understand the world. It is Math 2.0.https://www.kdnuggets.com/2021/09/math-fundamental-importance-machine-learning.html
-
KDnuggets™ News 21:n34, Sep 8: Do You Read Excel Files with Python? There is a 1000x Faster Way; Hypothesis Testing Explained
Do You Read Excel Files with Python? There is a 1000x Faster Way; Hypothesis Testing Explained; Data Science Cheat Sheet 2.0; 6 Cool Python Libraries That I Came Across Recently; Best Resources to Learn Natural Language Processing in 2021https://www.kdnuggets.com/2021/n34.html
-
Popular Certifications to validate your data and analytics skills
Check out the most popular certifications from SAS to see what certification you want to pursue next. Now through the end of 2021, you can save 55% on your exam!https://www.kdnuggets.com/2021/09/sas-popular-certifications-data-analytics-skills.html
-
How Machine Learning Leverages Linear Algebra to Solve Data Problems
Why you should learn the fundamentals of linear algebra.https://www.kdnuggets.com/2021/09/machine-learning-leverages-linear-algebra-solve-data-problems.html
-
ebook: Learn Data Science with R – free download
Check out this new book for data science beginners with many practical examples that covers statistics, R, graphing, and machine learning. As a source to learn the full breadth of data science foundations, "Learn Data Science with R" starts at the beginner level and gradually progresses into expert content.https://www.kdnuggets.com/2021/09/ebook-learn-data-science-r.html
-
Data scientists do not have to learn HTML, CSS, and JavaScript to build web pages.How to Create Stunning Web Apps for your Data Science Projects">
How to Create Stunning Web Apps for your Data Science Projects
https://www.kdnuggets.com/2021/09/create-stunning-web-apps-data-science-projects.html
-
Top Stories, Aug 30 – Sep 5: Do You Read Excel Files with Python? There is a 1000x Faster Way; Hypothesis Testing Explained
Also: The Top Industries Hiring Data Scientists in 2021; Hypothesis Testing Explained; Automate Microsoft Excel and Word Using Python; Data Science Cheat Sheet 2.0https://www.kdnuggets.com/2021/09/top-news-week-0830-0905.html
-
Fast AutoML with FLAML + Ray Tune
Microsoft Researchers have developed FLAML (Fast Lightweight AutoML) which can now utilize Ray Tune for distributed hyperparameter tuning to scale up FLAML’s resource-efficient & easily parallelizable algorithms across a cluster.https://www.kdnuggets.com/2021/09/fast-automl-flaml-ray-tune.html
-
Antifragility and Machine Learning
Our intuition for most products, processes, and even some models might be that they either will get worse over time, or if they fail, they will experience an cascade of more failure. But, what if we could intentionally design systems and models to only get better, even as the world around them gets worse?https://www.kdnuggets.com/2021/09/antifragility-machine-learning.html
-
Five Key Facts About Wu Dao 2.0: The Largest Transformer Model Ever Built
The record-setting model combines some clever research and engineering methods.https://www.kdnuggets.com/2021/09/five-key-facts-wu-dao-largest-transformer-model.html
-
Behind OpenAI Codex: 5 Fascinating Challenges About Building Codex You Didn’t Know About
Some ML engineering and modeling challenges encountering during the construction of Codex.https://www.kdnuggets.com/2021/09/openai-codex-challenges.html
-
6 Cool Python Libraries That I Came Across Recently
Check out these awesome Python libraries for Machine Learning.https://www.kdnuggets.com/2021/09/6-cool-python-libraries-recently.html
-
eBook: A Practical Guide to Using Third-Party Data in the Cloud
Download this eBook to learn how innovative teams are shifting their focus from data-driven business intelligence to accelerating insight-driven decision-making and now are turning to third-party datasets as a differentiator.https://www.kdnuggets.com/2021/09/roidna-ebook-guide-third-party-data-cloud.html
-
Build a synthetic data pipeline using Gretel and Apache Airflow
In this blog post, we build an ETL pipeline that generates synthetic data from a PostgreSQL database using Gretel’s Synthetic Data APIs and Apache Airflow.https://www.kdnuggets.com/2021/09/build-synthetic-data-pipeline-gretel-apache-airflow.html
-
How to solve machine learning problems in the real world
Becoming a machine learning engineer pro is your goal? Sure, online ML courses and Kaggle-style competitions are great resources to learn the basics. However, the daily job of a ML engineer requires an additional layer of skills that you won’t master through these approaches.https://www.kdnuggets.com/2021/09/solve-machine-learning-problems-real-world.html
-
Best Resources to Learn Natural Language Processing in 2021
In this article, the author has listed listed all the best resources to learn natural language processing including Online Courses, Tutorials, Books, and YouTube Videos.https://www.kdnuggets.com/2021/09/best-resources-learn-natural-language-processing-2021.html
-
Future Says Series | Discover the Future of AI
This innovative project brings together industry thought leaders from top tech companies such as Google, PwC, King, DNB, Piab, Scania, Telefonica, and more to discuss what the future holds for data and AI. Watch Future Says Series as industry experts discuss real-life examples how they are scaling AI successfully within their organizations.https://www.kdnuggets.com/2021/09/altair-future-says-series.html
-
In this article, I’ll show you five ways to load data in Python. Achieving a speedup of 3 orders of magnitude.Do You Read Excel Files with Python? There is a 1000x Faster Way">
Do You Read Excel Files with Python? There is a 1000x Faster Way
https://www.kdnuggets.com/2021/09/excel-files-python-1000x-faster-way.html
-
Data Science Cheat Sheet 2.0">
Check out this helpful, 5-page data science cheat sheet to assist with your exam reviews, interview prep, and anything in-between.Data Science Cheat Sheet 2.0
https://www.kdnuggets.com/2021/09/data-science-cheat-sheet.html
-
How is Machine Learning Beneficial in Mobile App Development?
Mobile app developers have a lot to gain by implementing AI & Machine Learning from the revolutionary changes that these disruptive technologies can offer. This is due to AI and ML's potential to strengthen mobile applications, providing for smoother user experiences capable of leveraging powerful features.https://www.kdnuggets.com/2021/09/machine-learning-beneficial-mobile-app-development.html
-
KDnuggets™ News 21:n33, Sep 1: Top Industries Hiring Data Scientists; The Most Important Tool for Data Engineers
The top industries hiring Data Scientists; The most important tool for data engineers (hint - it is not technical); How to Engineer Date Features in Python; 15 Python Snippets to Optimize your Data Science Pipelinehttps://www.kdnuggets.com/2021/n33.html
-
NLP Insights for the Penguin Café Orchestra
We give an example of how to use Expert.ai and Python to investigate favorite music albums.https://www.kdnuggets.com/2021/08/expert-nlp-insights-music.html