Sign In Start Free Trial
Account

Add to playlist

Create a Playlist

Modal Close icon
You need to login to use this feature.
  • Book Overview & Buying Getting Started with DuckDB
  • Table Of Contents Toc
Getting Started with DuckDB

Getting Started with DuckDB

By : Simon Aubury, Ned Letcher
5 (1)
close
close
Getting Started with DuckDB

Getting Started with DuckDB

5 (1)
By: Simon Aubury, Ned Letcher

Overview of this book

DuckDB is a fast in-process analytical database. Getting Started with DuckDB offers a practical overview of its usage. You'll learn to load, transform, and query various data formats, including CSV, JSON, and Parquet. The book covers DuckDB's optimizations, SQL enhancements, and extensions for specialized applications. Working with examples in SQL, Python, and R, you'll explore analyzing public datasets and discover tools enhancing DuckDB workflows. This guide suits both experienced and new data practitioners, quickly equipping you to apply DuckDB's capabilities in analytical projects. You'll gain proficiency in using DuckDB for diverse tasks, enabling effective integration into your data workflows.
Table of Contents (15 chapters)
close
close

Preface

There is no shortage of data being produced by humanity, in myriad formats, shapes, and ever-growing quantities. As it grows, so do the opportunities for leveraging data to benefit our world: improving decision making for governments, companies, and public organizations; supporting scientific research and technological advancements; and enabling the development of consumer products and important public services. To realize these opportunities, we are faced with an imperative: if we want to perform effective data analysis and develop products and services infused with machine learning, we must be able to manage, understand, and effectively work with the data that makes it possible.

Whether you are a data analyst, data scientist, research scientist, data engineer, software engineer, or data hobbyist, you are likely to face many of the same challenges when it comes to working with data. Analytical data workflows and applications require that data be loaded, cleaned, transformed, organized, exported, and crunched into summarized forms. A running joke amongst data practitioners is that they spend more time preparing and wrangling their data, as well as fighting with the tools that support their work than they do on the value-producing activities that are likely to be in their job descriptions. As data grows in volume and variety, these activities become both more difficult and more pressing to solve.

DuckDB is an analytical database that handles many of these challenges with ease. It enables data practitioners to streamline and improve the effectiveness of activities across the entire life cycle of data analysis and the development of analytical data infrastructure. It is simple to install and use on virtually any machine, running entirely in-process—without the overheads of connecting to and maintaining a dedicated server. At the same time, it offers blazing-fast performance for analytical operations, as well as powerful data management capabilities — features that are normally associated with distributed data processing engines and dedicated SQL database management systems. DuckDB’s rich feature set makes it an incredibly versatile tool, being well suited to a range of different use cases, such as performing interactive data analysis and ad hoc data wrangling, efficiently querying data lakes, developing lean pipelines for transforming data, functioning as an operational data warehouse, and forming a low-latency query engine for powering responsive data apps. This versatility can also be a bit overwhelming at first, as it’s hard to compare DuckDB with any one existing tool that you might be familiar with.

In this book, we’ll dive into many of DuckDB’s powerful and flexible capabilities. We’ll give you a clear framework for how to think about what kind of a data tool DuckDB is and the types of applications it excels at. Through a range of hands-on examples, you’ll learn how to make the most of this exciting tool and discover the many ways that you can incorporate it into your own analytical workflows and projects.

CONTINUE READING
83
Tech Concepts
36
Programming languages
73
Tech Tools
Icon Unlimited access to the largest independent learning library in tech of over 8,000 expert-authored tech books and videos.
Icon Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.
Icon 50+ new titles added per month and exclusive early access to books as they are being written.
Getting Started with DuckDB
notes
bookmark Notes and Bookmarks search Search in title playlist Add to playlist download Download options font-size Font size

Change the font size

margin-width Margin width

Change margin width

day-mode Day/Sepia/Night Modes

Change background colour

Close icon Search
Country selected

Close icon Your notes and bookmarks

Confirmation

Modal Close icon
claim successful

Buy this book with your credits?

Modal Close icon
Are you sure you want to buy this book with one of your credits?
Close
YES, BUY

Submit Your Feedback

Modal Close icon
Modal Close icon
Modal Close icon