Publications on Data Science, Machine Learning, AI & Analytics

Featured Publications


  • The Machine Learning Mastery EBook Catalog
    Machine Learning Mastery

    Frustrated with one-off articles and too much math? Take the next step and get tutorial-based playbooks that guide you to a specific result. Welcome to the Machine Learning Mastery EBook Catalog.

Partner Publications


  • Developing Modern Applications with a Converged Database
    O'Reilly Media

    Trying to accommodate multiple datatypes or workloads can create data fragmentation that spills over into application development, IT operations, data security, system scalability, and availability. This report explains cloud native application development techniques for working with both structured and unstructured data so you can run transactional and analytical workloads on a single, unified data platform.

  • Smart Retail Operations with AI, Robotics, and Automation
    CloudFactory

    In this eBook, dive into examples of robotics and AI automation in retail warehouses, brick-and-mortar stores, and parcel delivery. Gain insight into the people, processes, and tools you’ll need to bring high-quality retail AI solutions to life.

  • Enterprise AIOps
    O'Reilly Media

    Artificial intelligence operations (AIOps) allows infrastructure and ops teams to reduce operational workloads, support rapid DevOps initiatives, and improve incident management cycles. This O’Reilly report reviews AIOps components and provides an engineering framework for enterprise-wide scalable and sustainable AI solutions.

  • A Modern Data Architecture for Financial Services Firms
    Dremio

    Addressing the customer experience consistently ranks among the top initiatives for financial leaders across the globe. Data teams must deliver access to data while managing a complex, sprawling data footprint that consists of on-premises and cloud data lakes and data warehouses, organizational silos, and legacy platforms that were never designed to store today’s data volumes or meet modern query performance requirements.

  • Introducing Python, Chapter 2
    O'Reilly Media

    Introducing Python, second edition, takes you step-by-step through one of the world’s most popular programming languages. And today you can get chapter 2, covering data types, values, variables and names, free.

  • Turn Data into Action with Dremio and Amazon Web Services
    Dremio

    Across nearly every industry, organizations of all sizes are experiencing growth in the volume, variety, and velocity of data. At the same time, there is greater demand and need to derive actionable insights from that data.

  • How to Scale Precision Agriculture Insights with High-Quality Data
    CloudFactory

    What happens when you combine one of the oldest professions with one of the newest innovations? This ebook explores how AI and new precision agriculture technology, backed by high-quality data, help today’s farmers overcome agricultural challenges, lower costs, and scale their operations.

  • Make Insights Actionable with AI & BI
    AtScale

    Each chapter contains practical advice on designing and executing winning Data & Analytics strategies with AI & BI across your organization.

  • Gartner Report
    Dataiku

    Dataiku, a 2x Leader (2020 and 2021) in the Magic Quadrant™ for Data Science and ML Platforms, first coined the term Everyday AI in 2021. To us, Everyday AI is all about making the use of data almost pedestrian: AI that is so ingrained and intertwined with the workings of the day-to-day that it’s just part of the business.

  • Automating Analytics
    O'Reilly Media

    Thousands of organizations across nearly every business and industry use analytic process automation (APA) to accelerate data-driven business outcomes. This report uses real-world examples to examine the power of APA. You'll learn how to use APA to tackle complex problems, increase productivity, and improve efficiency.

  • DZone Refcard: Getting started with Apache Iceberg
    Dremio

    Cloud data lakes represent the first destination for a growing volume and variety of data. Apache Iceberg, an open table format for data lake storage, enables the data management and data governance capabilities typically associated with the data warehouse directly on the data lake.

  • Kafka: The Definitive Guide, Chapter 5
    O'Reilly Media

    In chapter 5, “Managing Apache Kafka Programmatically,” engineers from Confluent and LinkedIn who are responsible for developing Kafka explain the functionality of the AdminClient and how to use it in your applications to manage topics, consumer groups, and entity configuration.

  • A Beginner’s Guide to Network Analysis
    Virtualitics

    Have you ever wondered if there was a better way to discover relationships in complex data sets? Have you seen network graphs and wondered how you could be using them if only you could overcome the hurdles of creating them?

  • Computer Vision & Smart Retail: Accelerating AI Innovation at Scale
    CloudFactory

    This ebook dives into the opportunity for AI innovation in retail today and the role data labeling and annotation workforces play in producing and refining computer vision solutions.

  • The Business Impact of Using a Semantic Layer for AI and BI
    AtScale

    The report includes data from more than 100 enterprise data leaders about their experience using a Semantic Layer. It explores: The roadblocks to delivering AI and BI at scale, including cost, time, and ease of use; How a Semantic Layer addresses these roadblocks; Quantitative and qualitative measures of the business impact of a Semantic Layer.

  • 2022 Gartner Research: How Graph Techniques Deliver Business Value
    TigerGraph

    In this report, Gartner analysts provide the foundation for understanding how you can use graph techniques to deliver new business value. Key takeaways: What are graph techniques? What are the benefits of graph techniques? How are enterprises using graph analytics today?

  • Build, Automate, and Query your Lakehouse
    Dremio

    Dremio Cloud combines the benefits of the data warehouse and data lake in a fully managed lakehouse platform, one that’s built for SQL, provides a Git-like experience, and rests on an open foundation. With Dremio Cloud, you can deliver mission-critical BI dashboards and interactive analytics directly on data lake storage.

  • Machine Learning for High-Risk Applications: Techniques for Responsible AI, Chapters 1-5
    Dataiku

    In this ebook, we explore practices to identify cutting-edge and responsible strategies for managing high-impact AI systems and work to understand the concepts and techniques of model interpretability and explainability.

  • Optimizing Data Pipelines Using Multiple Workforces
    CloudFactory

    This guide will show you how to overcome common hurdles of working with multiple workforces and optimize your data pipeline to execute safe, reliable AV models.

  • Blockchain Success Stories, Chapter 3
    O'Reilly Media

    There’s a lot of hype about blockchain, but sometimes it’s hard to tell who’s using it—and who’s actually making it work. This fun-to-read book offers a look at ten case studies of companies that have successfully integrated blockchain into their businesses. Chapter 3 explains how to build a successful blockchain by describing how Changpeng Zhao became a blockchain billionaire in less than a year.

  • Death of A Star Schema (Redux): Moving Beyond Inmon & Kimball
    Incorta

    3 ways a new approach to data modeling can transform your organization.

  • Product Management for AI
    O'Reilly Media

    This insightful report will help you anticipate and solve the problems you face as you develop an AI project and shepherd it into production.

  • End-to-End Management of the Full Data Lifecycle
    Incorta

    Incorta Named in Two 2022 Analytics Reports from Gartner®. The only vendor to be recognized in both reports.

  • The Big Book of Machine Learning Use Cases
    Databricks

    A collection of technical blogs, including code samples and notebooks.

  • The Big Book of MLOps
    Databricks

    A new data-centric approach to building robust MLOps practices.

  • The Value of AI‑Powered Business Intelligence
    O'Reilly Media

    To stay competitive, you need to know how to use AI strategically. The Value of AI-Powered Business Intelligence explains how AI-infused business intelligence can help your business users discover actionable, easy-to-understand insights independently from IT—even while remaining within the organization’s secure and governed IT architecture.

  • Compilation of Semantic Layer White Papers by Best Selling Authors & Experts
    AtScale

    Read this bundle of Semantic Layer white papers to learn the key value propositions of implementing a semantic layer and best practices for analytics success with one.

  • Practical Machine Learning for Computer Vision: Chapter 3
    O'Reilly Media

    Using machine learning models to extract information from images is one of the trickiest ML tasks—but it often yields invaluable insights. What’s more, image classification is the “Hello World” of deep learning: It’s a stepping stone to other deep learning domains, such as natural language processing.

  • Creating a Production Launch Plan
    O'Reilly Media

    This practical report demonstrates how Google devised its production launch plan and provides actionable advice to help your company develop its own.

  • Intel and Aible Performance Benchmark and Case Studies Report
    Aible

    Download the report for details, additional case studies, and the initial results from the performance benchmark study.

  • 5 Critical Considerations for Building an Agile Data Pipeline
    Incorta

    Traditional data collection, curation, and analysis methods are anything but “agile.” Here’s what to do instead.

  • Building Effective Machine Learning Teams
    Incorta

    eBook: Why Visibility, Reproducibility, and Collaboration are Required for ML & AI Success.