Databricks
flow-image

Delta Lake: Up & Running by O’Reilly

Published by Databricks

Step-by-step guidance on the basics of Delta Lake

Get started with Delta Lake

Get this O’Reilly eBook to learn the basics of Delta Lake, the open storage format at the heart of the lakehouse architecture. You’ll get guidance on taking your first steps with Delta Lake, including running Delta Lake on a Spark cluster using PySpark or Scala, creating a table, and reading data from and writing data to a Delta table. In addition, you’ll learn a little bit about the inner workings of Delta Lake. After reading this ebook, you’ll come away with a better understanding of the kind of challenges Delta Lake helps you solve and how you can get started with leveraging Delta Lake for your next project.

This eBook includes:

  • An introduction to data warehouses, data lakes and the lakehouse architecture
  • An overview of how Delta Lake solves common problems and powers the lakehouse architecture
  • Hands-on steps for creating your first Delta table
  • A glimpse of Delta Lake under the hood, including the transaction log and checkpoint files

Download Now

box-icon-download

Required fields*

By registering, I agree to the processing of my personal data by Databricks in accordance with their Privacy Policy. I can update my preferences at any time. Please agree to the conditions

By requesting this resource you agree to our terms of use. All data is protected by our Privacy Notice. If you have any further questions please email dataprotection@headleymedia.com.

Related Categories Hardware, Server, Data Warehousing, Applications, CRM, Databases, Open Source, Big Data, Business Intelligence (BI), Data Warehousing, Compliance, Data management, Enterprise Resource Planning (ERP), Cloud Platforms, Cloud Storage, CRM, ERP, Big Data, Databases, Server, Storage, Data Storage, NAS, SAN, Storage Systems, Collaboration, Cloud, Analytics, DevOps, Machine Learning, AIM, Enterprise Applications, IOT, Workflow, Business Process, Server, Software, Goals, Distribution, Budget, Oracle, SAP, Healthcare, Generative AI, Machine Learning Models, Object Storage

More resources from Databricks