Search results for "NLP"

1027 documents found out of 7080 total.

  • Practical Deep Learning from fast.ai is Back!

    Looking for a great course to go from machine learning zero to hero quickly? fast.ai has released the latest version of Practical Deep Learning For Coders. And it won't cost you a thing.

    https://www.kdnuggets.com/2022/07/practical-deep-learning-fastai-2022.html

  • The Difficulty of Estimating the Carbon Footprint of Machine Learning

    Is machine learning killing the planet? Probably not, but let's make sure it doesn't.

    https://www.kdnuggets.com/2022/07/difficulty-estimating-carbon-footprint-machine-learning.html

  • 12 Most Challenging Data Science Interview Questions

    The simple but tricky data science questions that most people struggle to answer.

    https://www.kdnuggets.com/2022/07/12-challenging-data-science-interview-questions.html

  • N-gram Language Modeling in Natural Language Processing

    N-gram is a sequence of n words in the modeling of NLP. How can this technique be useful in language modeling?

    https://www.kdnuggets.com/2022/06/ngram-language-modeling-natural-language-processing.html

  • Top Posts June 27 – July 3: Statistics and Probability for Data Science

    Also: Decision Tree Algorithm, Explained; 20 Basic Linux Commands for Data Science Beginners; 15 Python Coding Interview Questions You Must Know For Data Science; Naïve Bayes Algorithm: Everything You Need to Know

    https://www.kdnuggets.com/2022/07/top-posts-week-0627-0703.html

  • The Complete Collection of Data Science Interviews – Part 2

    The second part covers the list of Data Management, Data Engineering, Machine Learning, Deep Learning, Natural Language Processing, MLOps, Cloud Computing, and AI Manager interview questions.

    https://www.kdnuggets.com/2022/06/complete-collection-data-science-interviews-part-2.html

  • Market Data and News: A Time Series Analysis

    In this article we introduce a few tools and techniques for studying relationships between the stock market and the news. We explore time series processing, anomaly detection, and an event-based view of the news. We also generate intuitive charts to demonstrate some of these concepts, and share the code behind all of this in a notebook.

    https://www.kdnuggets.com/2022/06/market-data-news-time-series-analysis.html

  • The Complete Collection of Data Science Interviews – Part 1

    The first part covers the list of Behavioral, Situational, Statistics, Python, R, SQL, Data Analytics, and Business Intelligence interview questions.

    https://www.kdnuggets.com/2022/06/complete-collection-data-science-interviews-part-1.html

  • A Gentle Introduction to Natural Language Processing

    This gentle introduction to NLP covers the basics, and will help you move along to more advanced topics ASAP.

    https://www.kdnuggets.com/2022/06/gentle-introduction-natural-language-processing.html

  • KDnuggets News, June 15: 14 Essential Git Commands for Data Scientists; A Structured Approach To Building a Machine Learning Model

    14 Essential Git Commands for Data Scientists; A Structured Approach To Building a Machine Learning Model; How is Data Mining Different from Machine Learning?; Understanding Functions for Data Science; Top 18 Data Science Facebook Groups

    https://www.kdnuggets.com/2022/n24.html

  • Deep Learning Key Terms, Explained

    Gain a beginner's perspective on artificial neural networks and deep learning with this set of 14 straight-to-the-point related key concept definitions.

    https://www.kdnuggets.com/2016/10/deep-learning-key-terms-explained.html

  • The Complete Collection of Data Science Books – Part 2

    KDnuggets Top Blog Read the best books on Machine Learning, Deep Learning, Computer Vision, Natural Language Processing, MLOps, Robotics, IoT, AI Products Management, and Data Science for Executives.

    https://www.kdnuggets.com/2022/05/complete-collection-data-science-books-part-2.html

  • The Complete Collection of Data Science Books – Part 1

    KDnuggets Top Blog Read the best books on Programming, Statistics, Data Engineering, Web Scraping, Data Analytics, Business Intelligence, Data Applications, Data Management, Big Data, and Cloud Architecture.

    https://www.kdnuggets.com/2022/05/complete-collection-data-science-books-part-1.html

  • KDnuggets News, May 18: 5 Free Hosting Platform For Machine Learning Applications; Data Mesh Architecture: Reimagining Data Management

    5 Free Hosting Platform For Machine Learning Applications; Data Mesh Architecture: Reimagining Data Management; Popular Machine Learning Algorithms; Reinforcement Learning for Newbies ; Deep Learning For Compliance Checks: What's New?

    https://www.kdnuggets.com/2022/n20.html

  • Natural Language Processing Key Terms, Explained

    This post provides a concise overview of 18 natural language processing terms, intended as an entry point for the beginner looking for some orientation on the topic.

    https://www.kdnuggets.com/2017/02/natural-language-processing-key-terms-explained.html

  • Reinforcement Learning for Newbies

    A simple guide to reinforcement learning for a complete beginner. The blog includes definitions with examples, real-life applications, key concepts, and various types of learning resources.

    https://www.kdnuggets.com/2022/05/reinforcement-learning-newbies.html

  • Deep Learning For Compliance Checks: What’s New?

    By implementing the different NLP techniques into the production processes, compliance departments can maintain detailed checks and keep up with regulator demands.

    https://www.kdnuggets.com/2022/05/deep-learning-compliance-checks-new.html

  • Can We Query a Table with T5?

    Learn how to tune a large language model.

    https://www.kdnuggets.com/2022/05/query-table-t5.html

  • 5 Free Hosting Platform For Machine Learning Applications

    Learn about the free and easy-to-deploy hosting platform for your machine learning projects.

    https://www.kdnuggets.com/2022/05/5-free-hosting-platform-machine-learning-applications.html

  • Top 4 tricks for competing on Kaggle and why you should start

    If you aren't familiar with Kaggle, you should be. Hear why from two expert Kagglers in this article.

    https://www.kdnuggets.com/2022/05/packt-top-4-tricks-competing-kaggle-start.html

  • KDnuggets News, May 11: SQL Notes for Professionals; How To Structure a Data Science Project

    SQL Notes for Professionals: The Free eBook Review; How To Structure a Data Science Project: A Step-by-Step Guide; Everything You Need to Know About Tensors; Free University Data Science Resources; Image Classification with Convolutional Neural Networks (CNNs)

    https://www.kdnuggets.com/2022/n19.html

  • 6 Highest Paying Companies for Data Scientists

    These are the six top paying companies for data scientists. I’ve looked at absolute salary, but I’ll fill you in on other factors you should consider as well when it comes to picking a data science job for money.

    https://www.kdnuggets.com/2022/05/6-highest-paying-companies-data-scientists.html

  • Best Data Science Career Tracks of 2022

    Top-rated data science tracks consist of multiple project-based courses covering all aspects of data. It includes an introduction to Python/R, data ingestion & manipulation, data visualization, machine learning, and reporting.

    https://www.kdnuggets.com/2022/04/best-data-science-career-tracks-2022.html

  • How Fast Can BERT Go With Sparsity?

    How much impact does sparsity have on model performance?

    https://www.kdnuggets.com/2022/04/fast-bert-go-sparsity.html

  • How Metadata Improves Security, Quality, and Transparency

    Metadata is the data providing context about the data, more than what you see in the rows and columns. By managing your metadata, you're effectively creating an encyclopedia of your data assets.

    https://www.kdnuggets.com/2022/04/metadata-improves-security-quality-transparency.html

  • Answering Questions with HuggingFace Pipelines and Streamlit

    See how easy it can be to build a simple web app for question answering from text using Streamlit and HuggingFace pipelines.

    https://www.kdnuggets.com/2021/10/simple-question-answering-web-app-hugging-face-pipelines.html

  • How to Start Using Natural Language Processing With PyTorch

    In this guide, we will address some of the obvious questions that may arise when starting to dive into natural language processing, but we will also engage with deeper questions and give you the right steps to get started working on your own NLP programs.

    https://www.kdnuggets.com/2022/04/start-natural-language-processing-pytorch.html

  • Summarization with GPT-3

    GPT-3 models are quite convincing and represent the rising power of Cloud AI. Read this excerpt from the book Transformers for Natural Language Processing, Second Edition to see how easy getting started with summarization with GPT-3 can be.

    https://www.kdnuggets.com/2022/04/packt-summarization-gpt3.html

  • The Complete Collection Of Data Repositories – Part 2

    Check out the collection of the best data repositories on healthcare, natural language, neuroscience, physics, social network, sports, time series, transportation, miscellaneous, and super data repositories.

    https://www.kdnuggets.com/2022/04/complete-collection-data-repositories-part-2.html

  • Naïve Bayes Algorithm: Everything You Need to Know

    Naïve Bayes is a probabilistic machine learning algorithm based on the Bayes Theorem, used in a wide variety of classification tasks. In this article, we will understand the Naïve Bayes algorithm and all essential concepts so that there is no room for doubts in understanding.

    https://www.kdnuggets.com/2020/06/naive-bayes-algorithm-everything.html

  • 15 Trending MLOps Talks You can Access for Free at ODSC East 2022

    Covering topics like workflows and full-stack machine learning, these are 15 free MLOps talks coming to #ODSCEast 2022 that you can see with a free Bronze Pass.

    https://www.kdnuggets.com/2022/04/odsc-15-trending-mlops-talks-access-free-odsc-east-2022.html

  • KDnuggets News March 30: The Most Popular Intro to Programming Course From Harvard is Free!; Top 13 Skills That Every Data Scientist Should Have

    The Most Popular Intro to Programming Course From Harvard is Free!; Top 13 Skills That Every Data Scientist Should Have; Junior vs Senior Data Scientist Salary: What’s the Difference?; MLOps Is a Mess But That's to be Expected; Data Science at the Command Line: The Free eBook

    https://www.kdnuggets.com/2022/n13.html

  • Junior vs Senior Data Scientist Salary: What’s the Difference?

    Check out this US salary deep dive for 2022 career decisions, work, & interests.

    https://www.kdnuggets.com/2022/03/junior-senior-data-scientist-salary-difference.html

  • Data-Centric AI: Is it Real? For Everyone? Are We Ready?

    Check out this deep dive into Data-Centric AI.

    https://www.kdnuggets.com/2022/03/data-centric-ai-real-everyone-ready.html

  • From Google Colab to a Ploomber Pipeline: ML at Scale with GPUs

    In this short blog, we’ll review the process of taking a POC data science pipeline (ML/Deep learning/NLP) that was conducted on Google Colab, and transforming it into a pipeline that can run parallel at scale and works with Git so the team can collaborate on.

    https://www.kdnuggets.com/2022/03/google-colab-ploomber-pipeline-ml-scale-gpus.html

  • What is Adversarial Machine Learning?

    In the Cybersecurity sector Adversarial machine learning attempts to deceive and trick models by creating unique deceptive inputs, to confuse the model resulting in a malfunction in the model. 

    https://www.kdnuggets.com/2022/03/adversarial-machine-learning.html

  • Hybrid AI Will Go Mainstream in 2022

    Analysts predict an AI boom, driven by possibilities and record funding. While challenges remain, a hybrid approach combining the best of the realm may finally send it sailing into the mainstream.

    https://www.kdnuggets.com/2022/03/hybrid-ai-go-mainstream-2022.html

  • How to Create a Dataset for Machine Learning

    Datasets - properly curated and labeled - remain a scarce resource. What can be done about this?

    https://www.kdnuggets.com/2022/02/create-dataset-machine-learning.html

  • Data-Centric AI: The Latest Research You Need to Know

    While a vast majority of research efforts today are preoccupied solely with ML models and algorithms, the data itself tends to be secondary and is treated as fixed. This claim is potentially detrimental.

    https://www.kdnuggets.com/2022/02/datacentric-ai-latest-research-need-know.html

  • The Complete Collection of Data Science Cheat Sheets – Part 2

    KDnuggets Top Blog A collection of cheat sheets that will help you prepare for a technical interview on Data Structures & Algorithms, Machine learning, Deep Learning, Natural Language Processing, Data Engineering, Web Frameworks.

    https://www.kdnuggets.com/2022/02/complete-collection-data-science-cheat-sheets-part-2.html

  • 4 Ways Hackers Are Using Data Science to Steal Billions

    The best way to stop your enemy is to know your enemy. Here are four ways hackers are using data science - and how they can be stopped.

    https://www.kdnuggets.com/2022/02/4-ways-hackers-data-science-steal-billions.html

  • How You Can Use Machine Learning to Automatically Label Data

    AI and machine learning can provide us with these tools. This guide will explore how we can use machine learning to label data.

    https://www.kdnuggets.com/2022/02/machine-learning-automatically-label-data.html

  • Managing Your Reusable Python Code as a Data Scientist

    Here are a few approaches that I have settled on for managing my own reusable Python code as a data scientist, presented from most to least general code use, and aimed at beginners.

    https://www.kdnuggets.com/2021/06/managing-reusable-python-code-data-scientist.html

  • 19 Data Science Project Ideas for Beginners

    This article features 19 data science projects for beginners, categorized into 7 full project tutorials, 5 places to come up with your own data science projects using data, and 7 skills-based data science projects.

    https://www.kdnuggets.com/2021/11/19-data-science-project-ideas-beginners.html

  • Classifying Long Text Documents Using BERT

    Transformer based language models such as BERT are really good at understanding the semantic context because they were designed specifically for that purpose. BERT outperforms all NLP baselines, but as we say in the scientific community, “no free lunch”. How can we use BERT to classify long text documents?

    https://www.kdnuggets.com/2022/02/classifying-long-text-documents-bert.html

  • Fine-Tuning BERT for Tweets Classification with HuggingFace

    In this blog, we used the Hugging Face library to fine-tune BERT on the classification task. We classified tweets related to COVID.

    https://www.kdnuggets.com/2022/01/finetuning-bert-tweets-classification-ft-hugging-face.html

  • KDnuggets™ News 22:n04, Jan 26: The High Paying Side Hustles for Data Scientists; Top Programming Languages and Their Uses

    The High Paying Side Hustles for Data Scientists; Top Programming Languages and Their Uses; Artificial Intelligence Project Ideas for 2022; The Best Python Courses: An Analysis Summary; Top Stories, Jan 17-23: The High Paying Side Hustles for Data Scientists

    https://www.kdnuggets.com/2022/n04.html

  • Learn Machine Learning 4X Faster by Participating in Competitions

    Participating in competitions has taught me everything about machine learning and how It can help you learn multiple domains faster than online courses.

    https://www.kdnuggets.com/2022/01/learn-machine-learning-4x-faster-participating-competitions.html

  • Transfer Learning for Image Recognition and Natural Language Processing

    Read the second article in this series on Transfer Learning, and learn how to apply it to Image Recognition and Natural Language Processing.

    https://www.kdnuggets.com/2022/01/transfer-learning-image-recognition-natural-language-processing.html

  • Learn Deep Learning by Building 15 Neural Network Projects in 2022

    Here are 15 neural network projects you can take on in 2022 to build your skills, your know-how, and your portfolio.

    https://www.kdnuggets.com/2022/01/15-neural-network-projects-build-2022.html

  • Explainable Forecasting and Nowcasting with State-of-the-art Deep Neural Networks and Dynamic Factor Model

    Review this detailed tutorial with code and revisit the decades-long old problem using a democratized and interpretable AI framework of how precisely can we anticipate the future and understand its causal factors?

    https://www.kdnuggets.com/2021/12/sota-explainable-forecasting-and-nowcasting.html

  • The Chatbot Transformation: From Failure to the Future

    The all-knowing chatbots we once thought to be the future have been replaced by specialized bots, and the results are outstanding.

    https://www.kdnuggets.com/2021/12/chatbot-transformation-failure-future.html

  • Data Labeling for Machine Learning: Market Overview, Approaches, and Tools

    So much of data science and machine learning is founded on having clean and well-understood data sources that it is unsurprising that the data labeling market is growing faster than ever. Here, we highlight many of the top players in this industry and the techniques they use to help you consider which might make a good partner for your needs.

    https://www.kdnuggets.com/2021/12/data-labeling-ml-overview-and-tools.html

  • My First Six Months as a Data Scientist

    The technical and non-technical lessons I’ve learned.

    https://www.kdnuggets.com/2021/12/first-six-months-data-scientist.html

  • Analyzing Scientific Articles with fine-tuned SciBERT NER Model and Neo4j

    In this article, we will be analyzing a dataset of scientific abstracts using the Neo4j Graph database and a fine-tuned SciBERT model.

    https://www.kdnuggets.com/2021/12/analyzing-scientific-articles-finetuned-scibert-ner-model-neo4j.html

  • Should You Become a Freelance Artificial Intelligence Engineer?

    Take the first step towards your machine learning engineering career and explore the UC San Diego Extension Machine Learning Engineering Bootcamp today. Those with prior software engineering or data science experience are encouraged to apply.

    https://www.kdnuggets.com/2021/12/ucsd-become-freelance-artificial-intelligence-engineer.html

  • AI, Analytics, Machine Learning, Data Science, Deep Learning Research Main Developments in 2021 and Key Trends for 2022

    2021 has almost come and gone. We saw some standout advancements in AI, Analytics, Machine Learning, Data Science, Deep Learning Research this past year, and the future, starting with 2022, looks bright. As per KDnuggets tradition, our collection of experts have contributed their insights on the matter. Read on to find out more.

    https://www.kdnuggets.com/2021/12/developments-predictions-ai-machine-learning-data-science-research.html

  • KDnuggets™ News 21:n45, Dec 1: Most Common SQL Mistakes on Data Science Interviews; Why Machine Learning Engineers are Replacing Data Scientists

    Most Common SQL Mistakes on Data Science Interviews; Why Machine Learning Engineers are Replacing Data Scientists; Vote in new KDnuggets Poll: What Percentage of Your Machine Learning Models Have Been Deployed? KDnuggets: Personal History and Nuggets of Experience.

    https://www.kdnuggets.com/2021/n45.html

  • Sentiment Analysis API vs Custom Text Classification: Which one to choose?

    In this article, we are going to compare the sentiment extraction performance between Sentiment Analysis engines and Custom Text classification engines. The idea is to show pros and cons of these two types of engines on a concrete dataset.

    https://www.kdnuggets.com/2021/11/sentiment-analysis-api-custom-text-classification.html

  • How to Build a Knowledge Graph with Neo4J and Transformers

    Learn to use custom Named Entity Recognition and Relation Extraction models.

    https://www.kdnuggets.com/2021/11/build-knowledge-graph-neo4j-transformers.html

  • 3 Differences Between Coding in Data Science and Machine Learning

    The terms ‘data science’ and ‘machine learning’ are often used interchangeably. But while they are related, there are some glaring differences, so let’s take a look at the differences between the two disciplines, specifically as it relates to programming.

    https://www.kdnuggets.com/2021/11/3-differences-coding-data-science-machine-learning.html

  • Anecdotes from 11 Role Models in Machine Learning

    The skills needed to create good data are also the skills needed for good leadership.

    https://www.kdnuggets.com/2021/11/anecdotes-11-role-models-machine-learning.html

  • Dream Come True: Building websites by thinking about them

    From the mind to the computer, make websites using your imagination!

    https://www.kdnuggets.com/2021/11/dream-come-true-allennlp-hacks-21.html

  • OpenAI’s Approach to Solve Math Word Problems

    OpenAI's latest research aims to solve math word problems. Let's dive a bit deeper into the ideas behind this new research.

    https://www.kdnuggets.com/2021/11/open-ai-approach-solve-math-word-problems.html

  • What Comes After HDF5? Seeking a Data Storage Format for Deep Learning

    In this article we are discussing that HDF5 is one of the most popular and reliable formats for non-tabular, numerical data. But this format is not optimized for deep learning work. This article suggests what kind of ML native data format should be to truly serve the needs of modern data scientists.

    https://www.kdnuggets.com/2021/11/after-hdf5-data-storage-format-deep-learning.html

  • 7 of The Coolest Machine Learning Topics of 2021 at ODSC West

    At our upcoming event this November 16th-18th in San Francisco, ODSC West 2021 will feature a plethora of talks, workshops, and training sessions on machine learning topics, deep learning, NLP, MLOps, and so on. You can register now for 20% off all ticket types, or register for a free AI Expo Pass to see what some big names in AI are doing now.

    https://www.kdnuggets.com/2021/11/odsc-7-coolest-machine-learning-topics.html

  • Salary Breakdown of the Top Data Science Jobs">Gold BlogSalary Breakdown of the Top Data Science Jobs

    Machine Learning vs NLP vs Data Engineer vs Data Scientist, and what it means to be in each role.

    https://www.kdnuggets.com/2021/11/salary-breakdown-top-data-science-jobs.html

  • Simple Text Scraping, Parsing, and Processing with this Python Library

    Scraping, parsing, and processing text data from the web can be difficult. But it can also be easy, using Newspaper3k.

    https://www.kdnuggets.com/2021/10/simple-text-scraping-parsing-processing-python-library.html

  • Machine Learning Model Development and Model Operations: Principles and Practices">Gold BlogMachine Learning Model Development and Model Operations: Principles and Practices

    The ML model management and the delivery of highly performing model is as important as the initial build of the model by choosing right dataset. The concepts around model retraining, model versioning, model deployment and model monitoring are the basis for machine learning operations (MLOps) that helps the data science teams deliver highly performing models.

    https://www.kdnuggets.com/2021/10/machine-learning-model-development-operations-principles-practice.html

  • Deploying Serverless spaCy Transformer Model with AWS Lambda

    A step-by-step guide on how to deploy NER transformer model serverless.

    https://www.kdnuggets.com/2021/10/deploying-serverless-spacy-transformer-model-aws-lambda.html

  • Training BPE, WordPiece, and Unigram Tokenizers from Scratch using Hugging Face

    Comparing the tokens generated by SOTA tokenization algorithms using Hugging Face's tokenizers package.

    https://www.kdnuggets.com/2021/10/bpe-wordpiece-unigram-tokenizers-using-hugging-face.html

  • Gold BlogData Scientist vs Data Engineer Salary">Rewards BlogGold BlogData Scientist vs Data Engineer Salary

    What are the differences between these two popular tech roles?

    https://www.kdnuggets.com/2021/10/data-scientist-data-engineer-salary.html

  • 11 Most Practical Data Science Skills for 2022

    While the field of data science continues to evolve with exciting new progress in analytical approaches and machine learning, there remain a core set of skills that are foundational for all general practitioners and specialists, especially those who want to be employable with full-stack capabilities.

    https://www.kdnuggets.com/2021/10/11-most-practical-data-science-skills-2022.html

  • Serving ML Models in Production: Common Patterns

    Over the past couple years, we've seen 4 common patterns of machine learning in production: pipeline, ensemble, business logic, and online learning. In the ML serving space, implementing these patterns typically involves a tradeoff between ease of development and production readiness. Ray Serve was built to support these patterns by being both easy to develop and production ready.

    https://www.kdnuggets.com/2021/10/serving-ml-models-production-common-patterns.html

  • New Computing Paradigm for AI: Processing-in-Memory (PIM) Architecture

    As larger deep neural networks are trained on the latest and fastest chip technologies, an important challenge remains that bottlenecks performance -- and it is not compute power. You can try to calculate a DNN as fast as possible, but there is data -- and it has to move. Data pipelines on the chip are expensive and new solutions must be developed to advance capabilities.

    https://www.kdnuggets.com/2021/10/samsung-computing-paradigm-ai-in-memory.html

  • Deploying Your First Machine Learning API">Silver BlogDeploying Your First Machine Learning API

    Effortless way to develop and deploy your machine learning API using FastAPI and Deta.

    https://www.kdnuggets.com/2021/10/deploying-first-machine-learning-api.html

  • How to Ace Data Science Interview by Working on Portfolio Projects">Silver BlogHow to Ace Data Science Interview by Working on Portfolio Projects

    Recruiters of Data Science professionals around the world focus on portfolio projects rather than resumes and LinkedIn profiles. So, learning early how to contribute and share your work on GitHub, Deepnote, and Kaggle can help you perform your best during data science interviews.

    https://www.kdnuggets.com/2021/10/ace-data-science-interview-portfolio-projects.html

  • Introduction to PyTorch Lightning">Silver BlogIntroduction to PyTorch Lightning

    PyTorch Lightning is a high-level programming layer built on top of PyTorch. It makes building and training models faster, easier, and more reliable.

    https://www.kdnuggets.com/2021/10/introduction-pytorch-lightning.html

  • Surpassing Trillion Parameters and GPT-3 with Switch Transformers – a path to AGI?">Silver BlogSurpassing Trillion Parameters and GPT-3 with Switch Transformers – a path to AGI?

    Ever larger models churning on increasingly faster machines suggest a potential path toward smarter AI, such as with the massive GPT-3 language model. However, new, more lean, approaches are being conceived and explored that may rival these super-models, which could lead to a future with more efficient implementations of advanced AI-driven systems.

    https://www.kdnuggets.com/2021/10/trillion-parameters-gpt-3-switch-transformers-path-agi.html

  • Transform speech into knowledge with Huggingface/Facebook AI and expert.ai

    Speech2Data is a blend of open source and free-to-use AI models and technologies powered by Huggingface, Facebook AI and expert.ai. Learn more here.

    https://www.kdnuggets.com/2021/09/expert-ai-speech-huggingface-facebook.html

  • Building a Structured Financial Newsfeed Using Python, SpaCy and Streamlit

    Getting started with NLP by building a Named Entity Recognition(NER) application.

    https://www.kdnuggets.com/2021/09/-structured-financial-newsfeed-using-python-spacy-and-streamlit.html

  • Gold BlogPath to Full Stack Data Science">Rewards BlogGold BlogPath to Full Stack Data Science

    Start your journey toward mastering all aspects of the field of Data Science with this focused list of in-depth self-learning resources. Curated with the beginner in mind, these recommendations will help you learn efficiently, and can also offer existing professionals useful highlights for review or help filling in any gaps in skills.

    https://www.kdnuggets.com/2021/09/path-full-stack-data-science.html

  • A Breakdown of Deep Learning Frameworks

    Deep Learning continues to evolve as one of the most powerful techniques in the AI toolbox. Many software packages exist today to support the development of models, and we highlight important options available with key qualities and differentiators to help you select the most appropriate for your needs.

    https://www.kdnuggets.com/2021/09/a-breakdown-deep-learning-frameworks.html

  • Messy Data is Beautiful

    Once these types of data have been cleaned, they do more than show organized data sets. They reveal unlimited possibilities, and AI analytics can reveal these possibilities faster and more efficiently than ever before.

    https://www.kdnuggets.com/2021/09/sparkbeyond-messy-data-is-beautiful.html

  • 20 Machine Learning Projects That Will Get You Hired">Silver Blog20 Machine Learning Projects That Will Get You Hired

    If you want to break into the machine learning and data science job market, then you will need to demonstrate the proficiency of your skills, especially if you are self-taught through online courses and bootcamps. A project portfolio is a great way to practice your new craft and offer convincing evidence that an employee should hire you over the competition.

    https://www.kdnuggets.com/2021/09/20-machine-learning-projects-hired.html

  • The Machine & Deep Learning Compendium Open Book">Gold BlogThe Machine & Deep Learning Compendium Open Book

    After years in the making, this extensive and comprehensive ebook resource is now available and open for data scientists and ML engineers. Learn from and contribute to this tome of valuable information to support all your work in data science from engineering to strategy to management.

    https://www.kdnuggets.com/2021/09/machine-deep-learning-open-book.html

  • Working with Python APIs For Data Science Project

    In this article, we will work with YouTube Python API to collect video statistics from our channel using the requests python library to make an API call and save it as a Pandas DataFrame.

    https://www.kdnuggets.com/2021/09/python-apis-data-science-project.html

  • Text Preprocessing Methods for Deep Learning

    While the preprocessing pipeline we are focusing on in this post is mainly centered around Deep Learning, most of it will also be applicable to conventional machine learning models too.

    https://www.kdnuggets.com/2021/09/text-preprocessing-methods-deep-learning.html

  • 8 Deep Learning Project Ideas for Beginners">Gold Blog8 Deep Learning Project Ideas for Beginners

    Have you studied Deep Learning techniques, but never worked on a useful project? Here, we highlight eight deep learning project ideas for beginners that will help you sharpen your skills and boost your resume.

    https://www.kdnuggets.com/2021/09/8-deep-learning-project-ideas-beginners.html

  • How Machine Learning Leverages Linear Algebra to Solve Data Problems

    Why you should learn the fundamentals of linear algebra.

    https://www.kdnuggets.com/2021/09/machine-learning-leverages-linear-algebra-solve-data-problems.html

  • Fast AutoML with FLAML + Ray Tune

    Microsoft Researchers have developed FLAML (Fast Lightweight AutoML) which can now utilize Ray Tune for distributed hyperparameter tuning to scale up FLAML’s resource-efficient & easily parallelizable algorithms across a cluster.

    https://www.kdnuggets.com/2021/09/fast-automl-flaml-ray-tune.html

  • 6 Cool Python Libraries That I Came Across Recently

    Check out these awesome Python libraries for Machine Learning.

    https://www.kdnuggets.com/2021/09/6-cool-python-libraries-recently.html

  • Best Resources to Learn Natural Language Processing in 2021

    In this article, the author has listed listed all the best resources to learn natural language processing including Online Courses, Tutorials, Books, and YouTube Videos.

    https://www.kdnuggets.com/2021/09/best-resources-learn-natural-language-processing-2021.html

  • Multilabel Document Categorization, step by step example

    This detailed guide explores an unsupervised and supervised learning two-stage approach with LDA and BERT to develop a domain-specific document categorizer on unlabeled documents.

    https://www.kdnuggets.com/2021/08/multilabel-document-categorization.html

  • 3 Data Acquisition, Annotation, and Augmentation Tools

    Check out these 3 projects found around GitHub that can help with your data acquisition, annotation, and augmentation tasks.

    https://www.kdnuggets.com/2021/08/3-data-labeling-synthesizing-augmentation-tools.html

  • Learning Data Science and Machine Learning: First Steps After The Roadmap">Silver BlogLearning Data Science and Machine Learning: First Steps After The Roadmap

    Just getting into learning data science may seem as daunting as (if not more than) trying to land your first job in the field. With so many options and resources online and in traditional academia to consider, these pre-requisites and pre-work are recommended before diving deep into data science and AI/ML.

    https://www.kdnuggets.com/2021/08/learn-data-science-machine-learning.html

  • Amazon Web Services Webinar: Accelerating clinical trial and biomedical development processes with healthcare data

    Join this webinar on August 27 to learn how to leverage external healthcare datasets to make faster decisions with greater accuracy – accelerating biomedical development and improving patient welfare.

    https://www.kdnuggets.com/2021/08/aws-webinar-clinical-trial-biomedical-development-healthcare.html

  • Open Source Datasets for Computer Vision">Silver BlogOpen Source Datasets for Computer Vision

    Access to high-quality, noise-free, large-scale datasets is crucial for training complex deep neural network models for computer vision applications. Many open-source datasets are developed for use in image classification, pose estimation, image captioning, autonomous driving, and object segmentation. These datasets must be paired with the appropriate hardware and benchmarking strategies to optimize performance.

    https://www.kdnuggets.com/2021/08/open-source-datasets-computer-vision.html

  • Linear Algebra for Natural Language Processing

    Learn about representing word semantics in vector space.

    https://www.kdnuggets.com/2021/08/linear-algebra-natural-language-processing.html

  • How to Train a BERT Model From Scratch

    Meet BERT’s Italian cousin, FiliBERTo.

    https://www.kdnuggets.com/2021/08/train-bert-model-scratch.html

  • MLOps And Machine Learning Roadmap

    A 16–20 week roadmap to review machine learning and learn MLOps.

    https://www.kdnuggets.com/2021/08/mlops-machine-learning-roadmap.html

  • Using Twitter to Understand Pizza Delivery Apprehension During COVID

    Analyzing customer sentiments and capturing any specific difference in emotion to order Dominos pizza in India during lockdown.

    https://www.kdnuggets.com/2021/08/twitter-understand-pizza-delivery-covid.html

  • 30 Most Asked Machine Learning Questions Answered

    There is always a lot to learn in machine learning. Whether you are new to the field or a seasoned practitioner and ready for a refresher, understanding these key concepts will keep your skills honed in the right direction.

    https://www.kdnuggets.com/2021/08/30-machine-learning-questions-answered.html

  • An AI-Based Framework Solution to Address Email Management Challenges

    Expert.ai’s Edge NL API is an on-premise API that can perform NLU tasks with no required training or extra work, offering advanced, out-of-the-box capabilities that address common use cases and can be easily customized to your specific needs.

    https://www.kdnuggets.com/2021/07/expertai-ai-based-framework-solution-email-management.html

  • Machine Learning Skills – Update Yours This Summer

    The process of mastering new knowledge often requires multiple passes to ensure the information is deeply understood. If you already began your journey into machine learning and data science, then you are likely ready for a refresher on topics you previously covered. This eight-week self-learning path will help you recapture the foundations and prepare you for future success in applying these skills.

    https://www.kdnuggets.com/2021/07/update-your-machine-learning-skills.html

  • Facebook Open Sources a Chatbot That Can Discuss Any Topic

    The new version expands the capabilities of its predecessor building a much more natural conversational experience.

    https://www.kdnuggets.com/2021/07/facebook-open-sources-chatbot-discuss-any-topic.html

  • Understanding BERT with Hugging Face

    We don’t really understand something before we implement it ourselves. So in this post, we will implement a Question Answering Neural Network using BERT and a Hugging Face Library.

    https://www.kdnuggets.com/2021/07/understanding-bert-hugging-face.html

  • How to Create Unbiased Machine Learning Models

    In this post we discuss the concepts of bias and fairness in the Machine Learning world, and show how ML biases often reflect existing biases in society. Additionally, We discuss various methods for testing and enforcing fairness in ML models.

    https://www.kdnuggets.com/2021/07/create-unbiased-machine-learning-models.html

  • 7 Open Source Libraries for Deep Learning Graphs

    In this article we’ll go through 7 up-and-coming open source libraries for graph deep learning, ranked in order of increasing popularity.

    https://www.kdnuggets.com/2021/07/7-open-source-libraries-deep-learning-graphs.html

  • Gold BlogA Learning Path To Becoming a Data Scientist">Rewards BlogGold BlogA Learning Path To Becoming a Data Scientist

    Becoming a professional data scientist may not be as easy as "1... 2... 3...", but these 10 steps can be your self-learning roadmap to kickstarting your future in the exciting and ever-expanding field of data science.

    https://www.kdnuggets.com/2021/07/learning-path-data-scientist.html

  • Semantic Search: Measuring Meaning From Jaccard to Bert

    In this article, we’ll cover a few of the most interesting — and powerful — of these techniques — focusing specifically on semantic search. We’ll learn how they work, what they’re good at, and how we can implement them ourselves.

    https://www.kdnuggets.com/2021/07/semantic-search-measuring-meaning-jaccard-bert.html

  • High-Performance Deep Learning: How to train smaller, faster, and better models – Part 3

    Now that you are ready to efficiently build advanced deep learning models with the right software and hardware tools, the techniques involved in implementing such efforts must be explored to improve model quality and obtain the performance that your organization desires.

    https://www.kdnuggets.com/2021/07/high-performance-deep-learning-part3.html

  • Computational Complexity of Deep Learning: Solution Approaches

    Why has deep learning been so successful? What is the fundamental reason that deep learning can learn from big data? Why cannot traditional ML learn from the large data sets that are now available for different tasks as efficiently as deep learning can?

    https://www.kdnuggets.com/2021/06/computational-complexity-deep-learning-solution-approaches.html

  • How to Train a Joint Entities and Relation Extraction Classifier using BERT Transformer with spaCy 3

    A step-by-step guide on how to train a relation extraction classifier using Transformer and spaCy3.

    https://www.kdnuggets.com/2021/06/train-joint-entities-relation-extraction-classifier-bert-spacy.html

  • Create and Deploy Dashboards using Voila and Saturn Cloud

    Working with and training large datasets, maintaining them all in one place, and deploying them to production is a challenging job. In this article, we covered what Saturn Cloud is and how it can speed up your end-to-end pipeline, how to create dashboards using Voila and Python and publish them to production in just a few easy steps.

    https://www.kdnuggets.com/2021/06/create-deploy-dashboards-voila-saturn-cloud.html

  • Fine-Tuning Transformer Model for Invoice Recognition

    The author presents a step-by-step guide from annotation to training.

    https://www.kdnuggets.com/2021/06/fine-tuning-transformer-model-invoice-recognition.html

  • The Word “WORD” Has 13 Meanings

    Thoughts around Knowledge Graphs, the semantic nature of language, and the two main types of word ambiguity.

    https://www.kdnuggets.com/2021/06/expert-word-has-13-meanings.html

  • High Performance Deep Learning, Part 1

    Advancing deep learning techniques continue to demonstrate incredible potential to deliver exciting new AI-enhanced software and systems. But, training the most powerful models is expensive--financially, computationally, and environmentally. Increasing the efficiency of such models will have profound impacts in many ways, so developing future models with this intension in mind will only help to further expand the reach, applicability, and value of what deep learning has to offer.

    https://www.kdnuggets.com/2021/06/efficiency-deep-learning-part1.html

  • Get Interactive Plots Directly With Pandas">Silver BlogGet Interactive Plots Directly With Pandas

    Telling a story with data is a core function for any Data Scientist, and creating data visualizations that are simultaneously illuminating and appealing can be challenging. This tutorial reviews how to create Plotly and Bokeh plots directly through Pandas plotting syntax, which will help you convert static visualizations into interactive counterparts -- and take your analysis to the next level.

    https://www.kdnuggets.com/2021/06/interactive-plots-directly-pandas.html

  • Building a Knowledge Graph for Job Search Using BERT

    A guide on how to create knowledge graphs using NER and Relation Extraction.

    https://www.kdnuggets.com/2021/06/knowledge-graph-job-search-bert.html

  • The Essential Guide to Transformers, the Key to Modern SOTA AI

    You likely know Transformers from their recent spate of success stories in natural language processing, computer vision, and other areas of artificial intelligence, but are familiar with all of the X-formers? More importantly, do you know the differences, and why you might use one over another?

    https://www.kdnuggets.com/2021/06/essential-guide-transformers-key-modern-sota-ai.html

  • How to speed up a Deep Learning Language model by almost 50X at half the cost

    In this blog post, we show how to accelerate fine-tuning the ALBERT language model while also reducing costs by using Determined’s built-in support for distributed training with AWS spot instances.

    https://www.kdnuggets.com/2021/06/determined-ai-speed-up-deep-learning-language-model.html

  • How to Fine-Tune BERT Transformer with spaCy 3

    A step-by-step guide on how to create a knowledge graph using NER and Relation Extraction.

    https://www.kdnuggets.com/2021/06/fine-tune-bert-transformer-spacy.html

  • PyCaret 101: An introduction for beginners

    This article is a great overview of how to get started with PyCaret for all your machine learning projects.

    https://www.kdnuggets.com/2021/06/pycaret-101-introduction-beginners.html

  • How to Create and Deploy a Simple Sentiment Analysis App via API

    In this article we will create a simple sentiment analysis app using the HuggingFace Transformers library, and deploy it using FastAPI.

    https://www.kdnuggets.com/2021/06/create-deploy-sentiment-analysis-app-api.html

  • Supercharge Your Machine Learning Experiments with PyCaret and Gradio

    A step-by-step tutorial to develop and interact with machine learning pipelines rapidly.

    https://www.kdnuggets.com/2021/05/supercharge-machine-learning-experiments-pycaret-gradio.html

  • Great New Resource for Natural Language Processing Research and Applications

    The NLP Index is a brand new resource for NLP code discovery, combining and indexing more than 3,000 paper and code pairs at launch. If you are interested in NLP research and locating the code and papers needed to understand an implement the latest research, you should check it out.

    https://www.kdnuggets.com/2021/05/great-new-resource-natural-language-processing-research-applications.html

  • Topic Modeling with Streamlit

    What does it take to create and deploy a topic modeling web application quickly? Read this post to see how the author uses Python NLP packages for topic modeling, Streamlit for the web application framework, and Streamlit Sharing for deployment.

    https://www.kdnuggets.com/2021/05/topic-modeling-streamlit.html

  • Write and train your own custom machine learning models using PyCaret

    A step-by-step, beginner-friendly tutorial on how to write and train custom machine learning models in PyCaret.

    https://www.kdnuggets.com/2021/05/pycaret-write-train-custom-machine-learning-models.html

  • Awesome list of datasets in 100+ categories

    With an estimated 44 zettabytes of data in existence in our digital world today and approximately 2.5 quintillion bytes of new data generated daily, there is a lot of data out there you could tap into for your data science projects. It's pretty hard to curate through such a massive universe of data, but this collection is a great start. Here, you can find data from cancer genomes to UFO reports, as well as years of air quality data to 200,000 jokes. Dive into this ocean of data to explore as you learn how to apply data science techniques or leverage your expertise to discover something new.

    https://www.kdnuggets.com/2021/05/awesome-list-datasets.html

4
Refine your search here:

Get the FREE ebook 'KDnuggets Artificial Intelligence Pocket Dictionary' along with the leading newsletter on Data Science, Machine Learning, AI & Analytics straight to your inbox.

By subscribing you accept KDnuggets Privacy Policy

Get the FREE ebook 'KDnuggets Artificial Intelligence Pocket Dictionary' along with the leading newsletter on Data Science, Machine Learning, AI & Analytics straight to your inbox.

By subscribing you accept KDnuggets Privacy Policy

No, thanks!