Search results for aws

    Found 719 documents, 5920 searched:

  • 11 Best Practices of Cloud and Data Migration to AWS Cloud

    list of Best Practices compiled from our learnings during our migration journey to the AWS cloud.

    https://www.kdnuggets.com/2023/04/11-best-practices-cloud-data-migration-aws-cloud.html

  • Setup and use JupyterHub (TLJH) on AWS EC2

    JupyterHub is a multi-user, container-friendly version of the Jupyter Notebook. However, it can be difficult to setup. This blog post will make you less likely to run into issues in this 15+ step process.

    https://www.kdnuggets.com/2023/01/setup-jupyterhub-tljh-aws-ec2.html

  • AWS AI & ML Scholarship Program Overview

    This scholarship program aims to help people who are underserved and that were underrepresented during high school and college - to then help them learn the foundations and concepts of Machine Learning and build a careers in AI and ML.

    https://www.kdnuggets.com/2022/09/aws-ai-ml-scholarship-program-overview.html

  • Using Datawig, an AWS Deep Learning Library for Missing Value Imputation

    A lot of missing values in the dataset can affect the quality of prediction in the long run. Several methods can be used to fill the missing values and Datawig is one of the most efficient ones.

    https://www.kdnuggets.com/2021/12/datawig-aws-deep-learning-library-missing-value-imputation.html

  • Build a Serverless News Data Pipeline using ML on AWS Cloud

    This is the guide on how to build a serverless data pipeline on AWS with a Machine Learning model deployed as a Sagemaker endpoint.

    https://www.kdnuggets.com/2021/11/build-serverless-news-data-pipeline-ml-aws-cloud.html

  • The Best Ways for Data Professionals to Market AWS Skills in 2022

    Knowing your way around Amazon Web Services (AWS) is increasingly useful. Here are five ways to market your AWS skills in today’s job market.

    https://www.kdnuggets.com/2021/11/best-ways-data-professionals-market-aws-skills.html

  • Deploying Serverless spaCy Transformer Model with AWS Lambda

    A step-by-step guide on how to deploy NER transformer model serverless.

    https://www.kdnuggets.com/2021/10/deploying-serverless-spacy-transformer-model-aws-lambda.html

  • Development & Testing of ETL Pipelines for AWS Locally

    Typically, development and testing ETL pipelines is done on real environment/clusters which is time consuming to setup & requires maintenance. This article focuses on the development and testing of ETL pipelines locally with the help of Docker & LocalStack. The solution gives flexibility to test in a local environment without setting up any services on the cloud.

    https://www.kdnuggets.com/2021/08/development-testing-etl-pipelines-aws-locally.html

  • AWS Webinar: How are data-driven companies using ESG and sustainability data to make actionable decisions?

    In this virtual session, on Jul 29 @ 11AM PT, 2PM ET, our panel of experts will uncover how companies across several verticals use ESG data to move beyond the reporting benchmark, deepen business insights, and create competitive differentiation.

    https://www.kdnuggets.com/2021/07/roidna-aws-webinar-data-driven-esg-sustainability-decisions.html

  • 3 Mathematical Laws Data Scientists Need To Know">Gold Blog3 Mathematical Laws Data Scientists Need To Know

    Machine learning and data science are founded on important mathematics in statistics and probability. A few interesting mathematical laws you should understand will especially help you perform better as a Data Scientist, including Benford's Law, the Law of Large Numbers, and Zipf's Law.

    https://www.kdnuggets.com/2021/03/3-mathematical-laws.html

  • Deploying Secure and Scalable Streamlit Apps on AWS with Docker Swarm, Traefik and Keycloak

    If you are a data scientist who just wants to get the work done but doesn’t necessarily want to go down the DevOps rabbit hole, this tutorial offers a relatively straightforward deployment solution leveraging Docker Swarm and Traefik, with an option of adding user authentication with Keycloak.

    https://www.kdnuggets.com/2020/10/deploying-secure-scalable-streamlit-apps-aws-docker-swarm-traefik-keycloak.html

  • 10 Steps for Tackling Data Privacy and Security Laws in 2020

    Data privacy laws, such as the CCPA, GDPR, and HIPAA, are here to stay and significantly impact everyone in the digital era. These steps will guide organizations to prepare for compliance and ensure they support the fundamental privacy rights of their customers and users.

    https://www.kdnuggets.com/2020/07/10-steps-data-privacy-security-laws.html

  • Deploy Machine Learning Pipeline on AWS Fargate">Gold BlogDeploy Machine Learning Pipeline on AWS Fargate

    A step-by-step beginner’s guide to containerize and deploy ML pipeline serverless on AWS Fargate.

    https://www.kdnuggets.com/2020/07/deploy-machine-learning-pipeline-aws-fargate.html

  • Build Dog Breeds Classifier Step By Step with AWS Sagemaker

    This post takes you through the basic steps for creating a cloud-based deep learning dog classifier, with everything accomplished from the AWS Management Console.

    https://www.kdnuggets.com/2020/06/build-dog-breeds-classifier-aws-sagemaker.html

  • Deploying a pretrained GPT-2 model on AWS

    This post attempts to summarize my recent detour into NLP, describing how I exposed a Huggingface pre-trained Language Model (LM) on an AWS-based web application.

    https://www.kdnuggets.com/2019/12/deploying-pretrained-gpt-2-model-aws.html

  • Power Laws in Deep Learning 2: Universality

    It is amazing that Deep Neural Networks display this Universality in their weight matrices, and this suggests some deeper reason for Why Deep Learning Works.

    https://www.kdnuggets.com/2018/09/power-laws-deep-learning-2-universality.html

  • Power Laws in Deep Learning

    In pretrained, production quality DNNs,  the weight matrices for the Fully Connected (FC ) layers display Fat Tailed Power Law behavior.

    https://www.kdnuggets.com/2018/09/power-laws-deep-learning.html

  • IoT on AWS: Machine Learning Models and Dashboards from Sensor Data

    I developed my first IoT project using my notebook as an IoT device and AWS IoT as infrastructure, with this "simple" idea: collect CPU Temperature from my Notebook running on Ubuntu, send to Amazon AWS IoT, save data, make it available for Machine Learning models and dashboards.

    https://www.kdnuggets.com/2018/06/zimbres-iot-aws-machine-learning-dashboard.html

  • Rethinking 3 Laws of Machine Learning

    We rethink Asimov’s 3 law of robotics to help companies moving to unsupervised machine learning and realize 100% automated predictive information governance (PIG).

    https://www.kdnuggets.com/2017/10/3-laws-machine-learning.html

  • Data Science Basics: Power Laws and Distributions

    Power laws and other relationships between observable phenomena may not seem like they are of any interest to data science, at least not to newcomers to the field, but this post provides an overview and suggests how they may be.

    https://www.kdnuggets.com/2016/12/data-science-basics-power-laws-distributions.html

  • Top stories for Jun 28 – Jul 4: Top 20 R packages by popularity; Nine Laws of Data Mining

    Top 20 R packages by popularity; Top 20 R Machine Learning and Data Science packages; Nine Laws of Data Mining; The missing D in Data Science.

    https://www.kdnuggets.com/2015/07/top-news-week-jun-28.html

  • (Deep Learning’s Deep Flaws)’s Deep Flaws

    Recent press has challenged the hype surrounding deep learning, trumpeting several findings which expose shortcomings of current algorithms. However, many of deep learning's reported flaws are universal, affecting nearly all machine learning algorithms.

    https://www.kdnuggets.com/2015/01/deep-learning-flaws-universal-machine-learning.html

  • Does Deep Learning Have Deep Flaws?

    A recent study of neural networks found that for every correctly classified image, one can generate an "adversarial", visually indistinguishable image that will be misclassified. This suggests potential deep flaws in all neural networks, including possibly a human brain.

    https://www.kdnuggets.com/2014/06/deep-learning-deep-flaws.html

  • 7 Steps to Mastering MLOPs

    Join us on a journey of becoming a professional MLOps engineer by mastering essential tools, frameworks, key concepts, and processes in the field.

    https://www.kdnuggets.com/7-steps-to-mastering-mlops

  • Exploring the OpenAI API with Python

    Let’s learn all the useful services from the OpenAI.

    https://www.kdnuggets.com/exploring-the-openai-api-with-python

  • Mistral 7B-V0.2: Fine-Tuning Mistral’s New Open-Source LLM with Hugging Face

    Access Mistral’s latest open-source model and fine-tune it on a custom dataset.

    https://www.kdnuggets.com/mistral-7b-v02-fine-tuning-mistral-new-open-source-llm-with-hugging-face

  • Top 7 Model Deployment and Serving Tools

    Learn about the top tools and frameworks that can simplify deploying large machine learning models in production and generate business value.

    https://www.kdnuggets.com/top-7-model-deployment-and-serving-tools

  • A Beginner’s Guide to the Top 10 Machine Learning Algorithms

    Data science’s essence lies in machine learning algorithms. Here are ten algorithms that are a great introduction to machine learning for any beginner!

    https://www.kdnuggets.com/a-beginner-guide-to-the-top-10-machine-learning-algorithms

  • 10 GitHub Repositories to Master MLOps

    Begin your MLOps journey with these comprehensive free resources available on GitHub.

    https://www.kdnuggets.com/10-github-repositories-to-master-mlops

  • Build An AI Application with Python in 10 Easy Steps

    Explore the fundamental steps for creating a successful AI Application with Python and other tools.

    https://www.kdnuggets.com/build-an-ai-application-with-python-in-10-easy-steps

  • 5 Essential Skills Every Data Scientist Needs in 2024

    Want to move into the data science field? Or advance your career in the data? Don’t miss these must-have skills.

    https://www.kdnuggets.com/5-essential-skills-every-data-scientist-needs-in-2024

  • Best Free Resources to Learn Data Analysis and Data Science

    This article introduces six top-notch, free data science resources ideal for aspiring data analysts, data scientists, or anyone aiming to enhance their analytical skills.

    https://www.kdnuggets.com/2024/03/365datascience-best-free-resources-learn-data-analysis-data-science

  • Data Science and the Go Programming Language

    Northwestern’s School of Professional Studies uses Go in Its Master of Science in Data Science Program.

    https://www.kdnuggets.com/2024/03/nwu-data-science-go-programming-language

  • How to Learn Python Basics With ChatGPT

    Your Ultimate Learning Companion.

    https://www.kdnuggets.com/how-to-learn-python-basics-with-chatgpt

  • Free Amazon Courses to Learn Generative AI: For All Levels

    Upskill with these free courses to master generative AI, regardless of your job title.

    https://www.kdnuggets.com/free-amazon-courses-to-learn-generative-ai-for-all-levels

  • Books, Courses, and Live Events to Learn Generative AI with O’Reilly

    If you are new to generative AI or an expert who wants to learn more, O’Reilly offers a range of resources to kickstart your generative AI journey.

    https://www.kdnuggets.com/books-courses-and-live-events-to-learn-generative-ai-with-oreilly

  • Data Maturity: The Cornerstone of AI-Enabled Innovation

    This article outlines strategies for overcoming data maturity challenges and accelerating AI adoption.

    https://www.kdnuggets.com/data-maturity-the-cornerstone-of-ai-enabled-innovation

  • What I Learned From Using ChatGPT for Data Science

    ChatGPT can be a great tool for data scientists. Here’s what I learned about where it excels and where it is less so.

    https://www.kdnuggets.com/what-i-learned-from-using-chatgpt-for-data-science

  • Top 16 Technical Data Sources for Advanced Data Science Projects

    Here are data repositories that will up your data science game and improve your data projects.

    https://www.kdnuggets.com/top-16-technical-data-sources-for-advanced-data-science-projects

  • 5 Coding Tasks ChatGPT Can’t Do

    This is a pretty good list of what ChatGPT can't do. But it's not exhaustive. ChatGPT can generate pretty good code from scratch, but it can't do anything that would take your job.

    https://www.kdnuggets.com/5-coding-tasks-chatgpt-cant-do

  • Level 50 Data Scientist: Python Libraries to Know

    This article will help you understand the different tools of Data Science used by experts for Data Visualization, Model Building, and Data Manipulation.

    https://www.kdnuggets.com/level-50-data-scientist-python-libraries-to-know

  • 2023: The Crazy AI Year

    The year of Generative AI - let’s go through what happened in the past 12 months.

    https://www.kdnuggets.com/2023-the-crazy-ai-year

  • 25 Free Courses to Master Data Science, Data Engineering, Machine Learning, MLOps, and Generative AI

    Discover a collection of top courses to launch your dream career or master a new skill, all for free!

    https://www.kdnuggets.com/25-free-courses-to-master-data-science-data-engineering-machine-learning-mlops-and-generative-ai

  • Back to Basics Pathway

    Kickstart your 2024 with KDnuggets Back to Basics Data Science pathway!

    https://www.kdnuggets.com/back-to-basics-pathway

  • Strategies for Optimizing Performance and Costs When Using Large Language Models in the Cloud

    There are many cases where your LLM underperforms and costs you too much in the cloud platform. Simple strategies help you avoid that.

    https://www.kdnuggets.com/strategies-for-optimizing-performance-and-costs-when-using-large-language-models-in-the-cloud

  • Back to Basics Bonus Week: Deploying to the Cloud

    Welcome back to the KDnuggets’ "Back to Basics" series. This is the BONUS week and we will dive into learning about deploying to the cloud.

    https://www.kdnuggets.com/back-to-basics-bonus-week-deploying-to-the-cloud

  • 5 Free Courses to Master MLOps

    Have you finished learning the basics of machine learning and now wondering what's next? You're in the right place!

    https://www.kdnuggets.com/5-free-courses-to-master-mlops

  • Building a GPU Machine vs. Using the GPU Cloud

    The article examines the pros and cons of building an on-premise GPU machine versus using a GPU cloud service for projects involving deep learning and artificial intelligence, analyzing factors like cost, performance, operations, and scalability.

    https://www.kdnuggets.com/building-a-gpu-machine-vs-using-the-gpu-cloud

  • Beyond Human Boundaries: The Rise of SuperIntelligence

    From ANI to AGI and Beyond: Deciphering AI's Evolutionary Path.

    https://www.kdnuggets.com/beyond-human-boundaries-the-rise-of-superintelligence

  • Remote Work in Data Science: Pros and Cons

    In this post we explored the potential challenges and pitfalls of remote work in data science.

    https://www.kdnuggets.com/remote-work-in-data-science-pros-and-cons

  • Job Trends in Data Analytics: Part 2

    Check out these skillsets in demand for the data analytics job market.

    https://www.kdnuggets.com/job-trends-in-data-analytics-part-2

  • Enhance Your Python Coding Style with Ruff

    Ruff's 700+ built-in lint rules, reimplemented in Rust for speed, provide comprehensive linting and formatting to enforce clean and consistent Python code.

    https://www.kdnuggets.com/enhance-your-python-coding-style-with-ruff

  • 5 Free Courses to Master Generative AI

    Generative AI is an exciting and fast-moving area of research and application. Check out these 5 courses to get up to speed and stay ahead of the curve.

    https://www.kdnuggets.com/5-free-courses-to-master-generative-ai

  • A Microsoft Engineer’s Guide to AI Innovation and Leadership

    Dive into the insights of AI innovation with Microsoft's Senior Software Engineer, Manas Joshi: A journey of technology, triumph, and teachings for the next generation.

    https://www.kdnuggets.com/a-microsoft-engineer-guide-to-ai-innovation-and-leadership

  • The New Ethical Implications of Generative Artificial Intelligence

    Generative AI's rapid progress necessitates urgent ethical safeguards against data, scale, accountability, copyright, and misinformation risks.

    https://www.kdnuggets.com/the-new-ethical-implications-of-generative-artificial-intelligence

  • Introduction to Giskard: Open-Source Quality Management for AI Models

    To solve the conundrum of ensuring the quality of AI models in production — especially given the emergence of LLMs — we are thrilled to announce the official launch of Giskard, the premier open-source AI quality management system.

    https://www.kdnuggets.com/2023/11/giskard-introduction-giskard-opensource-quality-management-ai-models

  • Generative AI: The First Draft, Not Final

    This article gives a high-level overview of how LLMs work and their attendant limitations with accessible explanations and anecdotes throughout the piece. We also present advice on how people can introduce them into their workflows.

    https://www.kdnuggets.com/generative-ai-the-first-draft-not-final

  • The Top 5 Cloud Machine Learning Platforms & Tools

    What are the top 5 cloud machine learning platforms in the market today. Our list will help provide some vital insights into which platform might best cater to your specific machine learning needs. See what KDnuggets recommends.

    https://www.kdnuggets.com/the-top-5-cloud-machine-learning-platforms-tools

  • 7 Platforms for Getting High Paying Data Science Jobs

    Job hunting in data science got you down? Check out these 7 awesome platforms to score your next high-paying data science gig!

    https://www.kdnuggets.com/7-platforms-for-getting-high-paying-data-science-jobs

  • Greening AI: 7 Strategies to Make Applications More Sustainable

    The article delves into a comprehensive methodology that sheds light on how to accurately estimate the carbon footprint associated with AI applications. It explains the environmental impact of AI, a crucial consideration in today's world.

    https://www.kdnuggets.com/greening-ai-7-strategies-to-make-applications-more-sustainable

  • A Brief History of the Neural Networks

    From the biological neuron to LLMs: How AI became smart.

    https://www.kdnuggets.com/a-brief-history-of-the-neural-networks

  • Top Companies in India to Consider for Employment

    If you’re looking for a job, want to shift careers, or start a new chapter and currently reside in India. Check out these top 7 companies to consider for employment in India for 2023/24.

    https://www.kdnuggets.com/top-companies-in-india-to-consider-for-employment

  • 7 Best Cloud Database Platforms

    Cloud databases have made it easier and cheaper to develop enterprise-level applications, offering flexibility, convenience, and standard database functionality. See what KDnuggets recommends.

    https://www.kdnuggets.com/7-best-cloud-database-platforms

  • 7 Steps to Mastering Large Language Models (LLMs)

    Large Language Models (LLMs) have unlocked a new era in natural language processing. So why not learn more about them? Go from learning what large language models are to building and deploying LLM apps in 7 easy steps with this guide.

    https://www.kdnuggets.com/7-steps-to-mastering-large-language-models-llms

  • Best Practices for Building ETLs for ML

    This article talks about several best practices for writing ETLs for building training datasets. It delves into several software engineering techniques and patterns applied to ML.

    https://www.kdnuggets.com/best-practices-for-building-etls-for-ml

  • AI and Open Source Software: Separated at Birth?

    In this article, Luis shares with readers his thoughts on the intersection of open source software and machine learning and what the future might bring. Many articles cover how open source software is used by the machine learning community but this post focuses on the similarities between the two areas of practice and what machine learning can and can’t learn from open source software.

    https://www.kdnuggets.com/ai-and-open-source-software-separated-at-birth

  • The Top 5 Data Management Tools For Your Projects

    See what KDnuggets is recommending for the top 5 cutting-edge tools for cloud, ETL, transformation, master data management, and visualization.

    https://www.kdnuggets.com/top-5-data-management-tools-for-your-projects

  • Getting Started with Google Cloud Platform in 5 Steps

    Explore the essentials of Google Cloud Platform for data science and ML, from account setup to model deployment, with hands-on project examples.

    https://www.kdnuggets.com/5-steps-google-cloud-platform

  • Deploying Your Machine Learning Model to Production in the Cloud

    Learn a simple way to have a live model hosted on AWS.

    https://www.kdnuggets.com/deploying-your-ml-model-to-production-in-the-cloud

  • Effective Small Language Models: Microsoft’s 1.3 Billion Parameter phi-1.5

    Learn about Microsoft’s 1.3 billion parameter model that has outperformed Llama 2’s 7-billion parameters model on several benchmarks.

    https://www.kdnuggets.com/effective-small-language-models-microsoft-phi-15

  • Fine Tuning LLAMAv2 with QLora on Google Colab for Free

    Learn how to fine-tune one of the most influential open-source models for free on Google Colab.

    https://www.kdnuggets.com/fine-tuning-llamav2-with-qlora-on-google-colab-for-free

  • 10 Math Concepts for Programmers

    The not so secret behind becoming a proficient programmer - Math & it’s top 10 concepts.

    https://www.kdnuggets.com/10-math-concepts-for-programmers

  • Working with Big Data: Tools and Techniques

    Where do you start in a field as vast as big data? Which tools and techniques to use? We explore this and talk about the most common tools in big data.

    https://www.kdnuggets.com/working-with-big-data-tools-and-techniques

  • Building Microservice for Multi-Chat Backends Using Llama and ChatGPT

    As LLMs continue to evolve, integrating multiple models or switching between them has become increasingly challenging. This article suggests a Microservice approach to separate model integration from business applications and simplify the process.

    https://www.kdnuggets.com/building-microservice-for-multichat-backends-using-llama-and-chatgpt

  • Building a Formula 1 Streaming Data Pipeline With Kafka and Risingwave

    Build a streaming data pipeline using Formula 1 data, Python, Kafka, RisingWave as the streaming database, and visualize all the real-time data in Grafana.

    https://www.kdnuggets.com/building-a-formula-1-streaming-data-pipeline-with-kafka-and-risingwave

  • Want to Become a Data Scientist? Part 1: 10 Hard Skills You Need

    A quick 10-step hard skill guide on what you need to become a Data Scientist.

    https://www.kdnuggets.com/want-to-become-a-data-scientist-part-1-10-hard-skills-you-need

  • 5 Skills All Marketing Analytics and Data Science Pros Need Today

    Join us at the MADS conference in Washington, D.C., from Sept. 26 to 28, 2023. Learn more below and register with code KDN100 for $100 off your conference pass.

    https://www.kdnuggets.com/2023/08/mads-5-skills-marketing-analytics-data-science-pros-need-today.html

  • How to Ace Data Scientist Professional Certificate Exam

    Gain insights into the certification process and expert tips for passing the certificate exam.

    https://www.kdnuggets.com/2023/08/ace-data-scientist-professional-certificate.html

  • Things You Should Know When Scaling Your Web Data-Driven Product

    Scaling your data-driven product helps grow your business, but it requires certain expertise. In this article, you will learn how scaling works and what to keep in mind while doing it.

    https://www.kdnuggets.com/2023/08/things-know-scaling-web-datadriven-product.html

  • A Comprehensive Guide to MLOps

    Machine Learning Operations (MLOps) is a relatively new discipline that provides the structure and support necessary for machine learning (ML) models to thrive in production environments.

    https://www.kdnuggets.com/2023/08/comprehensive-guide-mlops.html

  • This Week in AI, August 7: Generative AI Comes to Jupyter & Stack Overflow • ChatGPT Updates

    "This Week in AI" on KDnuggets provides a weekly roundup of the latest happenings in the world of Artificial Intelligence. Covering a wide range of topics from recent headlines, scholarly articles, educational resources, to spotlight research, the post is designed to keep readers up-to-date and informed about the ever-evolving field of AI.

    https://www.kdnuggets.com/2023/mm/this-week-ai-2023-08-07.html

  • Fundamentals Of Statistics For Data Scientists and Analysts

    Key statistical concepts for your data science or data analysis journey.

    https://www.kdnuggets.com/2023/08/fundamentals-statistics-data-scientists-analysts.html

  • CDC Data Replication: Techniques, Tradeoffs, Insights

    The author discusses common use cases for CDC data replication, implementation techniques and their tradeoffs, and firsthand insights.

    https://www.kdnuggets.com/2023/08/cdc-data-replication-techniques-tradeoffs-insights.html

  • The Importance of Data Cleaning in Data Science

    This article provides an overview of the importance of data cleaning in data science. It explains what data cleaning is, the benefits of using it, and the commonly used tools.

    https://www.kdnuggets.com/2023/08/importance-data-cleaning-data-science.html

  • Introduction to Data Science: A Beginner’s Guide

    This article is a guide for new data scientists, and it's designed to help you get started quickly. It's meant to be a starting point, but if you're already in the market for a new job, you may want to read this article more.

    https://www.kdnuggets.com/2023/07/introduction-data-science-beginner-guide.html

  • Mastering GPUs: A Beginner’s Guide to GPU-Accelerated DataFrames in Python

    RAPIDS cuDF, with its pandas-like API, enables data scientists and engineers to quickly tap into the immense potential of parallel computing on GPUs–with just a few code line changes. Read on for more.

    https://www.kdnuggets.com/2023/07/mastering-gpus-beginners-guide-gpu-accelerated-dataframes-python.html

  • Everything You Need About the LLM University by Cohere

    Want to kickstart a new career with LLMs? Or want to transfer to the next big thing in tech? You can do so now with the LLM University by Cohere.

    https://www.kdnuggets.com/2023/07/everything-need-llm-university-cohere.html

  • A Beginner’s Guide to Data Engineering

    So you want to break into data engineering? Start today by learning more about data engineering and the fundamental concepts.

    https://www.kdnuggets.com/2023/07/beginner-guide-data-engineering.html

  • Generative AI with Large Language Models: Hands-On Training

    This 2-hour training covers LLMs, their capabilities, and how to develop and deploy them. It uses hands-on code demos in Hugging Face and PyTorch Lightning.

    https://www.kdnuggets.com/2023/07/generative-ai-large-language-models-handson-training.html

  • GPT-4 Details Have Been Leaked!

    What has OpenAI been keeping in the woodwork about GPT-4?

    https://www.kdnuggets.com/2023/07/gpt4-details-leaked.html

  • Unveiling the Power of Meta’s Llama 2: A Leap Forward in Generative AI?

    This article explores the technical details and implications of Meta's newly released Llama 2, a large language model that promises to revolutionize the field of generative AI. We delve into its capabilities, performance, and potential applications, while also discussing its open-source nature and the company's commitment to safety and transparency.

    https://www.kdnuggets.com/2023/07/unveiling-power-metas-llama-2-leap-forward-generative-ai.html

  • The First Half of 2023: Data Science and AI Developments

    6 months of 2023 has gone by like that. Here’s a recap of what the major data science and AI advancements have been in the first half of 2023.

    https://www.kdnuggets.com/2023/07/first-half-2023-data-science-ai-developments.html

  • Synthetic Data Platforms: Unlocking the Power of Generative AI for Structured Data

    The article highlights various use cases of synthetic data, including generating confidential data, rebalancing imbalanced data, and imputing missing data points. It also provides information on popular synthetic data generation tools such as MOSTLY AI, SDV, and YData.

    https://www.kdnuggets.com/2023/07/synthetic-data-platforms-unlocking-power-generative-ai-structured-data.html

  • How to Build a Streaming Semi-structured Analytics Platform on Snowflake

    Building a datalake for semi-structured data or json has always been challenging. Imagine if the json documents are streaming or continuously flowing from healthcare vendors then we need a robust modern architecture that can deal with such a high volume. At the same time analytics layer also needs to be created so as to generate value from it.

    https://www.kdnuggets.com/2023/07/build-streaming-semistructured-analytics-platform-snowflake.html

  • ChatGPT Plugins: Everything You Need To Know

    Learn more about the third-party plugins that OpenAI have rolled out to understand ChatGPTs in real-world use.

    https://www.kdnuggets.com/2023/06/chatgpt-plugins-everything-need-know.html

  • Closing the Gap Between Human Understanding and Machine Learning: Explainable AI as a Solution

    This article elaborates on the importance of Explainable AI (XAI), what the challenges in building interpretable AI models are, and some practical guidelines for companies to build XAI models.

    https://www.kdnuggets.com/2023/06/closing-gap-human-understanding-machine-learning-explainable-ai-solution.html

  • Using RAPIDS cuDF to Leverage GPU in Feature Engineering

    Improving Performance by Replacing Pandas with cuDF in Creating Data Frames and Engineering Features and Integrating with Google Colab.

    https://www.kdnuggets.com/2023/06/rapids-cudf-leverage-gpu-feature-engineering.html

  • Falcon LLM: The New King of Open-Source LLMs

    Falcon LLM, is the new large language model that has taken the crown from LLaMA.

    https://www.kdnuggets.com/2023/06/falcon-llm-new-king-llms.html

  • The Top AutoML Frameworks You Should Consider in 2023

    AutoML frameworks are powerful tool for data analysts and machine learning specialists that can automate data preprocessing, model selection, hyperparameter tuning, and even perform complex tasks like feature engineering.

    https://www.kdnuggets.com/2023/05/best-automl-frameworks-2023.html

  • How to Efficiently Scale Data Science Projects with Cloud Computing

    This article discusses the key components that contribute to the successful scaling of data science projects. It covers how to collect data using APIs, how to store data in the cloud, how to clean and process data, how to visualize data, and how to harness the power of data visualization through interactive dashboards.

    https://www.kdnuggets.com/2023/05/efficiently-scale-data-science-projects-cloud-computing.html

  • Data Masking: The Core of Ensuring GDPR and other Regulatory Compliance Strategies

    This article has provided an overview of data masking and its importance in ensuring compliance with GDPR and other global regulations.

    https://www.kdnuggets.com/2023/05/data-masking-core-ensuring-gdpr-regulatory-compliance-strategies.html

  • Stop Doing this on ChatGPT and Get Ahead of the 99% of its Users

    KDnuggets Top Blog Unleash the power of AI writing with effective prompts.

    https://www.kdnuggets.com/2023/05/stop-chatgpt-get-ahead-99-users.html

  • Schedule & Run ETLs with Jupysql and GitHub Actions

    This blog provided you with a comprehensive overview of ETL and JupySQL, including a brief introduction to ETLs and JupySQL. We also demonstrated how to schedule an example ETL notebook via GitHub actions, which allows you to automate the process of executing ETLs and JupySQL from Jupyter.

    https://www.kdnuggets.com/2023/05/schedule-run-etls-jupysql-github-actions.html

  • Fine-Tuning OpenAI Language Models with Noisily Labeled Data

    Reduce LLM prediction error by 37% via data-centric AI.

    https://www.kdnuggets.com/2023/04/finetuning-openai-language-models-noisily-labeled-data.html

  • MLOps Best Practices You Should Know

    Implement these tips to improve your MLOps skills and workflows.

    https://www.kdnuggets.com/2023/04/mlops-best-practices-know.html

  • KDnuggets News, April 19: AutoGPT: Everything You Need To Know • 10 Websites to Get Amazing Data for Data Science Projects

    AutoGPT: Everything You Need To Know • 10 Websites to Get Amazing Data for Data Science Projects • 6 ChatGPT mind-blowing extensions to use it anywhere • Mastering Generative AI and Prompt Engineering: A Free eBook • Baby AGI: The Birth of a Fully Autonomous AI

    https://www.kdnuggets.com/2023/n14.html

  • How to Get Hired as Data Scientist in the GPT-4 Era

    We will be focusing on statistics, core data science concepts, NLP, prompt engineering, data science portfolio, interview preparation, and AIOps.

    https://www.kdnuggets.com/2023/04/get-hired-data-scientist-gpt4-era.html

  • What Is ChatGPT Doing and Why Does It Work?

    In this article, we will explain how ChatGPT works and why it is able to produce coherent and diverse conversations.

    https://www.kdnuggets.com/2023/04/chatgpt-work.html

  • The Role of the MLOps Engineer in an Organization

    Interested in becoming an MLOps engineer? Start today by learning more about the MLOps engineer role.

    https://www.kdnuggets.com/2023/04/role-mlops-engineer-organization.html

  • Chatting with the Future: Predictions for AI in the Next Decade

    What else should we expect from the rest of 2023 and the next decade in the world of Artificial Intelligence? Let's talk it out.

    https://www.kdnuggets.com/2023/04/chatting-future-predictions-ai-next-decade.html

  • Post GPT-4: Answering Most Asked Questions About AI

    Is AI overhyped, or is there a valid reason to be afraid?

    https://www.kdnuggets.com/2023/04/post-gpt4-answering-asked-questions-ai.html

  • Introducing TPU v4: Googles Cutting Edge Supercomputer for Large Language Models

    TPU v4: Google's fifth domain-specific architecture and third supercomputer for machine learning models.

    https://www.kdnuggets.com/2023/04/introducing-tpu-v4-googles-cutting-edge-supercomputer-large-language-models.html

  • 8 Open-Source Alternative to ChatGPT and Bard

    Discover the widely-used open-source frameworks and models for creating your ChatGPT like chatbots, integrating LLMs, or launching your AI product.

    https://www.kdnuggets.com/2023/04/8-opensource-alternative-chatgpt-bard.html

  • The Future of Work: How AI is Changing the Job Landscape

    With more and more companies integrating artificial intelligence into the workplace, what does this mean for employees' futures and careers?

    https://www.kdnuggets.com/2023/04/future-work-ai-changing-job-landscape.html

  • Top 19 Skills You Need to Know in 2023 to Be a Data Scientist

    Skills like the ability to clean, transform, statistically analyze, visualize, communicate, and predict data.

    https://www.kdnuggets.com/2023/04/top-19-skills-need-know-2023-data-scientist.html

  • 5 Data Management Challenges with Solutions

    This report provides an overview of the challenges that arise in data management and the solutions that can help overcome these challenges.

    https://www.kdnuggets.com/2023/04/5-data-management-challenges-solutions.html

  • How Data Science Can Transform Mobile App Development?

    Data science is an intelligent and powerful technology. By knowing how to use data science in mobile app development you can achieve great results.

    https://www.kdnuggets.com/2023/03/data-science-transform-mobile-app-development.html

  • Data Quality Dimensions: Assuring Your Data Quality with Great Expectations

    This article highlights the significance of ensuring high-quality data and presents six key dimensions for measuring it. These dimensions include Completeness, Consistency, Integrity, Timelessness, Uniqueness, and Validity.

    https://www.kdnuggets.com/2023/03/data-quality-dimensions-assuring-data-quality-great-expectations.html

  • 3 Mistakes That Could Be Affecting the Accuracy of Your Data Analytics

    As more companies are starting to rely on big data, more companies are also misanalyzing the data that they receive. Is your company one of them? These are the top three mistakes that companies commonly make that affect the accuracy of their data analytics.

    https://www.kdnuggets.com/2023/03/3-mistakes-could-affecting-accuracy-data-analytics.html

  • Learn About Large Language Models

    An introduction to Large Language Models, what they are, how they work, and use cases.

    https://www.kdnuggets.com/2023/03/learn-large-language-models.html

  • What Are The Downsides of AI Advancement?

    While AI has certainly several positive uses to offer the world, it’s also displaying harm when it comes to academics, cybersecurity, the environment, jobs, and privacy.

    https://www.kdnuggets.com/2023/03/downsides-ai-advancement.html

  • Simpson’s Paradox and its Implications in Data Science

    KDnuggets Top Blog The importance of Simpson’s Paradox and why you need to consider it when working with data.

    https://www.kdnuggets.com/2023/03/simpson-paradox-implications-data-science.html

  • ChatGPT vs Google Bard: A Comparison of the Technical Differences

    KDnuggets Top Blog The Biggest Rivalry: ChatGPT vs Google Bard! Here's a comparison of the technical differences between the two AI engines.

    https://www.kdnuggets.com/2023/03/chatgpt-google-bard-comparison-technical-differences.html

  • Must Read NLP Papers from the Last 12 Months

    The era of large language models is here now.

    https://www.kdnuggets.com/2023/03/must-read-nlp-papers-last-12-months.html

  • Top 5 Advantages That CatBoost ML Brings to Your Data to Make it Purr

    This article outlines the advantages of CatBoost as a GBDTs for interpreting data sources that are highly categorical or contain missing data points.

    https://www.kdnuggets.com/2023/02/top-5-advantages-catboost-ml-brings-data-make-purr.html

  • Make Quantum Leaps in Your Data Science Journey

    Learn about three levels of data science to make the quantum leap to the next level.

    https://www.kdnuggets.com/2023/02/make-quantum-leaps-data-science-journey.html

  • ChatGPT, GPT-4, and More Generative AI News

    A short review of developments in the AI world.

    https://www.kdnuggets.com/2023/02/chatgpt-gpt4-generative-ai-news.html

  • Learning Python in Four Weeks: A Roadmap

    Here is a roadmap for learning Python in four weeks, a combination of curated resources and ChatGPT prompts to master the language.

    https://www.kdnuggets.com/2023/02/learning-python-four-weeks-roadmap.html

  • Why Data Scientists Expect Flawed Advice From Google Bard

    First reported by Reuters, Bard returned an inaccurate response, leading to a drop in Alphabet’s (GOOGL) stock price by as much as 9% on the day of the demonstration. For many in the data community, this did not come as a surprise; here’s why.

    https://www.kdnuggets.com/2023/02/data-scientists-expect-flawed-advice-google-bard.html

  • Making Intelligent Document Processing Smarter: Part 1

    This article attempts to measure the effect of various noises present in scanned documents on the performance of various APIs in the OCR segment.

    https://www.kdnuggets.com/2023/02/making-intelligent-document-processing-smarter-part-1.html

Refine your search here: