DEV Community

# dataengineering

Posts

A Comprehensive Dive into the New Time-Series Storage Engine - Mito

2
Comments
5 min read
Dummy Variable Trap in Machine Learning

19
Comments
2 min read
Data Engineering For Beginners.

Comments
3 min read
Data Engineering for Beginners: A Step-by-Step Guide

Comments 1
3 min read
[Free E-Book] The World of Vector Databases & AI Applications!

35
Comments
3 min read
Why Python is best tool for data processing

Comments
5 min read
Amazon Kinesis Data Streams (What is?, Benefits, Terminologies)

4
Comments
2 min read
Data teams can deliver 10x better to the rest of us

5
Comments
3 min read
6 judgment mistakes companies make when refusing data freelancers

1
Comments
10 min read
Transfer SQL-> analytics 30x faster with ConnectorX + arrow + dlt

3
Comments
1 min read
Time Series Models for beginners complete guide

1
Comments
3 min read
TIME SERIES ANALYSIS.

Comments
4 min read
Meet Bob 🙂 and his Databases exploration journey 🗺️! [Relational Databases - Basics - Theory]

1
Comments
9 min read
Data Science for Beginners: 2023 - 2024 Complete Roadmap

Comments
4 min read
Redefining ETL: Data Flows Powered by C# (Part III)

Comments
12 min read
AWS Bedrock on Snowflake (Talk to Claude, LLAMA)

2
Comments 1
3 min read
How best to learn RDBMS Part 1

2
Comments
2 min read
Log Analysis: Elasticsearch VS Apache Doris

2
Comments
11 min read
Scraping AliExpress with Python

2
Comments
21 min read
Basics of data modelling and data models

1
Comments
3 min read
Exploratory Data Analysis Using Data Visualization Techniques 📊.

6
Comments
5 min read
Data Engineering Lifecycle - Basics - Theory

2
Comments
5 min read
Avoiding the DBT Monolith

Comments 1
3 min read
Running Jobs on Athena Spark

3
Comments
2 min read
Data Product Possibilities and Opportunities!

Comments
2 min read
Redefining ETL: Data Flows Powered by C# (Part I)

12
Comments
11 min read
Data Analysis of the Titanic with Python!

56
Comments 1
6 min read
Your Free and Offline SQL console in a few steps

1
Comments
5 min read
Event-Driven Architecture with Serverless Functions – Part 1

2
Comments 1
5 min read
Exploratory data Analysis using Visualization Techniques

1
Comments 1
3 min read
Data Science for Beginners: 2023 - 2024 Complete Road Map

Comments
3 min read
Data Science for Beginners: 2023-2024 Edition

Comments
7 min read
Deploying ByConity with Kubernetes: A Step-by-Step Guide

Comments
5 min read
Data Science for Beginners: 2023 Complete Road Map

1
Comments
3 min read
Data Science for Beginners: 2023 - 2024 Complete Road Map

Comments
2 min read
The Data Engineering Docker-Compose Starter Kit

13
Comments
13 min read
Sentiment Analysis Using Python: A Beginner-Friendly Tutorial!

23
Comments
4 min read
Let’s Create an End-to-End Web Scraping Pipeline With Scrapy!

8
Comments
10 min read
Data Science for beginners 2023 - 2024 Complete Roadmap

Comments 2
3 min read
SQL + Docker: The combo for Quick and Safe Query Testing

4
Comments
5 min read
Ultimate Guide: Best Books for Data Science with Ratings for All Levels

8
Comments
8 min read
10 NoSQL databases available as alternatives to MongoDB

2
Comments
4 min read
A Beginner’s Guide to Building LLM-Powered Applications with LangChain!

94
Comments 8
7 min read
Transactions and the ACID principle, going a little deeper.

3
Comments
11 min read
Python Cheat Sheet for Data Engineers and Data Scientists!

69
Comments
3 min read
A Step-by-Step Guide to Implementing Data Version Control

6
Comments
4 min read
What's new and noteworthy on AWS - Summer 2023 edition

5
Comments
24 min read
KNIME Analytics Platform for Data Science-1

4
Comments
4 min read
Unlearning what you know about relational databases to unlock the power of Redshift

Comments
4 min read
The Wrath of Unicron - When Airflow Gets Scary

4
Comments
4 min read
End to End Netflix data analytics and recommendation system project using Microsoft Azure tools

8
Comments
5 min read
Navigating the Data Engineering Landscape: From Raw Data to Insights

5
Comments 1
7 min read
Machine learning 101

85
Comments
8 min read
Building ETL/ELT Pipelines For Data Engineers.

5
Comments 2
2 min read
Automating Talend Jobs Using Apache Airflow .

6
Comments
3 min read
Data-aware Scheduling in Airflow: A Practical Guide with DAG Factory

6
Comments
6 min read
Automating Data Pipeline Deployment on AWS with Terraform: Utilizing Lambda, Glue, Crawler, Redshift, and S3

Comments 1
8 min read
Push dbt beyond boundaries: Exploring a Fresh Approach to dbt Integration

1
Comments
1 min read
There is no Data Engineering roadmap

2
Comments
5 min read
A mage on the Hero’s Journey: a fantasy epic on how a startup rose from the ashes

6
Comments
9 min read