Christian Himpe

My Data Engineering Library

This is neither an "Ultimate List of Data Engineering Books" nor a "Data Engineering Must-Read List", but my subjective recommendation for a minimal data engineering library, aimed particularly at data engineers coming from software development, as I did.

General

Databases

  • Forta: "SQL in 10 Minutes"; Pearson, 2010.
    • The best minimal SQL guide I have found so far.
  • Kaufmann, Meier: "SQL and NoSQL Databases"; Springer, 2019.
    • Beyond covering all the database basics, it repeatedly contrasts the relational (SQL), document (MongoDB) and graph (Cypher) models.

Containerization

Extra

  • Densmore: "Data Pipelines Pocket Reference"; O'Reilly Media, 2021.
    • Good example-driven overview; original source of EtLT (extract, partial transform, load, transform).
  • Chromatic: "Extreme Programming Pocket Guide"; O'Reilly Media, 2003.
    • Extreme programming (XP) is the sanest agile method, with some good sustainability practices such as the 40-hour week.
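
To make the EtLT pattern mentioned above a bit more concrete, here is a minimal sketch in Python. It is not taken from the book: the sample rows, the table names, and the function names are all hypothetical, and an in-memory SQLite database merely stands in for a data warehouse. The small "t" does light cleanup before loading (masking an email address, casting an amount), while the big "T" does the business-level aggregation in SQL after loading.

```python
import hashlib
import sqlite3


def extract():
    # Hypothetical source rows; a real pipeline would read from an API, file, or database.
    return [
        {"order_id": 1, "email": "ann@example.com", "amount": "19.90"},
        {"order_id": 2, "email": "bob@example.com", "amount": "5.00"},
        {"order_id": 3, "email": "ann@example.com", "amount": "7.10"},
    ]


def partial_transform(rows):
    # The small "t": light cleanup without business logic, done before loading.
    # Here: hash the email address and cast the amount string to integer cents.
    return [
        (
            row["order_id"],
            hashlib.sha256(row["email"].encode()).hexdigest(),
            round(float(row["amount"]) * 100),
        )
        for row in rows
    ]


def load(conn, rows):
    # Load the lightly cleaned rows into a raw table (hypothetical schema).
    conn.execute(
        "CREATE TABLE orders_raw ("
        "order_id INTEGER PRIMARY KEY, customer_hash TEXT, amount_cents INTEGER)"
    )
    conn.executemany("INSERT INTO orders_raw VALUES (?, ?, ?)", rows)


def transform(conn):
    # The big "T": business-level modeling, expressed in SQL inside the database.
    conn.execute(
        """
        CREATE TABLE customer_totals AS
        SELECT customer_hash,
               COUNT(*) AS orders,
               SUM(amount_cents) AS total_cents
        FROM orders_raw
        GROUP BY customer_hash
        """
    )


def run_pipeline():
    conn = sqlite3.connect(":memory:")  # stand-in for a warehouse
    load(conn, partial_transform(extract()))
    transform(conn)
    return conn
```

Calling `run_pipeline()` returns a connection whose `customer_totals` table holds one row per hashed customer. The point of the split is that sensitive or malformed values never reach the raw table, while the modeling step stays in SQL where it can be changed without re-extracting.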
