DEV Community

# dataengineering

Posts

More data beats a more efficient algorithm

6
Comments
3 min read
Amazon Kinesis Firehose

2
Comments
2 min read
How Vector Databases Work: A Hands-On Tutorial!

10
Comments
9 min read
Transfer SQL-> analytics 30x faster with ConnectorX + arrow + dlt

2
Comments
1 min read
The easiest way to navigate through MongoDB, PySpark, and Jupyter Notebook

7
Comments
3 min read
Big data models 📊 vs. Computer memory 💾

186
Comments 3
11 min read
How to Import Existing Resources in your CloudFormation Stacks

6
Comments
4 min read
Data Engineering with Scala: learn to web-scrape the most-watched Netflix movies in each country

6
Comments 2
22 min read
Unveiling the Robust Architecture of SQL Server A Comprehensive Overview

Comments
2 min read
Maximizing Database Efficiency Mastering Query Optimization in SQL

Comments
2 min read
Data Engineering For Beginners: A Step-By-Step Guide

5
Comments
8 min read
8 Rusty open source data projects to watch in 2024 🤩

14
Comments 11
7 min read
Data Engineer vs. Business Intelligence Data Analyst

9
Comments
4 min read
Unveiling the Azure Data Lake for Bike Share Data Analytics

Comments
1 min read
Data Quality

Comments
15 min read
A Comprehensive Dive into the New Time-Series Storage Engine - Mito

8
Comments
5 min read
Batch Processing using PySpark on AWS EMR

Comments
4 min read
Installing Python Packages in AWS Glue using AWS CodeArtifact

9
Comments
6 min read
Data teams can deliver 10x better to the rest of us

5
Comments
3 min read
Optimizing Data Analysis: A Guide to Handling Missing Data Effectively

6
Comments
3 min read
Data Engineering Roadmap 2023

2
Comments
4 min read
Data Modeling

6
Comments
5 min read
AWS Data Engineering Certification

1
Comments 2
9 min read
Dummy Variable Trap in Machine Learning

19
Comments
2 min read
[Free E-Book] The World of Vector Databases & AI Applications!

14
Comments
3 min read
Why Python is best tool for data processing

Comments
5 min read
Amazon Kinesis Data Streams (What is?, Benefits, Terminologies)

4
Comments
2 min read
Meet Bob 🙂 and his Databases exploration journey 🗺️! [Relational Databases - Basics - Theory]

3
Comments
9 min read
Redefining ETL: Data Flows Powered by C# (Part III)

Comments
12 min read
Redefining ETL: Data Flows Powered by C# (Part I)

Comments
11 min read
AWS Bedrock on Snowflake (Talk to Claude, LLAMA)

1
Comments 1
3 min read
How best to learn RDBMS Part 1

2
Comments
2 min read
Log Analysis: Elasticsearch VS Apache Doris

Comments
11 min read
Basics of data modelling and data models

1
Comments
3 min read
Exploratory Data Analysis Using Data Visualization Techniques 📊.

6
Comments
5 min read
Data Engineering Lifecycle - Basics - Theory

Comments
5 min read
Avoiding the DBT Monolith

Comments 1
3 min read
Running Jobs on Athena Spark

2
Comments
2 min read
Data Analysis of the Titanic with Python!

17
Comments 1
6 min read
Your Free and Offline SQL console in a few steps

1
Comments
5 min read
Event-Driven Architecture with Serverless Functions – Part 1

2
Comments 1
5 min read
Exploratory data Analysis using Visualization Techniques

1
Comments
3 min read
10 NoSQL databases available as alternatives to MongoDB

2
Comments
4 min read
Data Science for Beginners: 2023 - 2024 Complete Road Map

Comments
2 min read
The Data Engineering Docker-Compose Starter Kit

10
Comments
13 min read
Sentiment Analysis Using Python: A Beginner-Friendly Tutorial!

10
Comments
4 min read
Let’s Create an End-to-End Web Scraping Pipeline With Scrapy!

3
Comments
10 min read
SQL + Docker: The combo for Quick and Safe Query Testing

4
Comments
5 min read
Ultimate Guide: Best Books for Data Science with Ratings for All Levels

7
Comments
8 min read
A Beginner’s Guide to Building LLM-Powered Applications with LangChain!

54
Comments 6
7 min read
Transactions and the ACID principle, going a little deeper.

3
Comments
11 min read
Python Cheat Sheet for Data Engineers and Data Scientists!

52
Comments
3 min read
A Step-by-Step Guide to Implementing Data Version Control

5
Comments
4 min read
What's new and noteworthy on AWS - Summer 2023 edition

5
Comments
24 min read
KNIME Analytics Platform for Data Science-1

3
Comments
4 min read
The Wrath of Unicron - When Airflow Gets Scary

6
Comments
4 min read
End to End Netflix data analytics and recommendation system project using Microsoft Azure tools

2
Comments
5 min read
Navigating the Data Engineering Landscape: From Raw Data to Insights

5
Comments 1
7 min read
Machine learning 101

85
Comments
8 min read
Building ETL/ELT Pipelines For Data Engineers.

5
Comments 2
2 min read