DEV Community
#bigdata
Vitess: Easy database deployment, clustering, and scaling!
Josue Luzardo Gebrim, Dec 22 '21
#database #nosql #dataops #bigdata
5 reactions, 5 min read
Zero to Deployment and Evolution Data Catalog!
Josue Luzardo Gebrim, Dec 22 '21
#datacatalog #dataops #bigdata #database
4 reactions, 6 min read
Build an analytics app with React and Cube.js
Matt Angelosanto for LogRocket, Dec 22 '21
#bigdata #react #analytics #tutorial
8 reactions, 9 min read
Cardinality Counting in Redis
ChunTing Wu, Dec 22 '21
#redis #architecture #bigdata #javascript
2 reactions, 4 min read
Cube Cloud Deep Dive: Mastering Pre-Aggregations
Adnan Rahić for Cube, Dec 20 '21
#opensource #database #bigdata #analytics
6 reactions, 11 min read
BigQuery SQL Tip: QUALIFY clause
Hui Zheng (I/Trust/You), Dec 10 '21
#bigdata
5 reactions, 1 min read
What Is Trino And Why Is It Great At Processing Big Data
SeattleDataGuy, Dec 9 '21
#database #bigdata #datascience #sql
15 reactions, 7 min read
Using PySpark and AWS Glue to analyze multi-line log files
Maurice Borgmeier for AWS Community Builders, Dec 3 '21
#aws #python #bigdata #pyspark
12 reactions, 1 comment, 5 min read
ETLs vs ELTs: Why are ELTs Disrupting the Data Market?
SeattleDataGuy, Nov 30 '21
#datascience #database #bigdata #startup
15 reactions, 8 min read
How IoT integration with ERP system can bring business benefits
HyperNym, Nov 29 '21
#erp #iot #bigdata #cloudcomputing
2 reactions, 4 min read
Bigdata: A problem and a solution
Radha, Oct 26 '21
#discuss #bigdata #industry #challenge
1 reaction, 4 min read
Cube Cloud Deep Dive: Starting a New Cube App
Adnan Rahić for Cube, Nov 23 '21
#tutorial #opensource #analytics #bigdata
16 reactions, 9 min read
How Zero-Code Data Preparations Tools Enable Better, Faster IT Performance in the Age of Big Data
Javeria Gauhar, Nov 21 '21
#bigdata #datacleansing #datapreparation #itperformance
2 reactions, 6 min read
Build your own data quality rules with AWS Glue DataBrew
Chatchai Komrangded (Bas) for AWS Community ASEAN, Nov 21 '21
#aws #awsthai #bigdata #datascience
10 reactions, 6 min read
Identifying and handling personally identifiable information (PII) with AWS Glue DataBrew
Chatchai Komrangded (Bas) for AWS Community ASEAN, Nov 20 '21
#aws #awsthai #bigdata #datascience
6 reactions, 4 min read
Understanding Apache Hive LLAP
Sunny Srinidhi, Nov 19 '21
#bigdata #datascience #database #apachehive
3 reactions, 7 min read
What Is Crypto and How Does It Work ?
Bek Brace, Nov 17 '21
#bigdata #blockchain #cryptocurrency #machinelearning
8 reactions, 7 comments, 3 min read
Data lakes: building a serverless data pipeline
Luca Silvestri, Nov 8 '21
#serverless #bigdata #aws #datapipelines
3 reactions, 6 min read
Installing Hadoop on the new M1 Pro and M1 Max MacBook Pro
Sunny Srinidhi, Nov 6 '21
#hadoop #bigdata #macbook #programming
8 reactions, 8 min read
Meet the Innovators with Krzysztof Nowocin
ITMAGINATION, Nov 5 '21
#datascience #bigdata #career
2 reactions, 6 min read, 12:14 video
The Important SQL Queries for Beginners
Mahmoud EL-kariouny, Nov 2 '21
#tutorial #bigdata #sql #database
12 reactions, 8 min read
A first update on our AI/ML/Big Data salary survey
ai-jobs.net, Sep 28 '21
#machinelearning #bigdata #salaries #career
2 reactions, 2 min read
Data Engineering Introduction
Nicholas, Oct 29 '21
#database #bigdata
6 reactions, 2 min read
Performance capabilities of data warehouses and how Cube can help
Adnan Rahić for Cube, Oct 28 '21
#bigdata #database #node #tutorial
18 reactions, 18 min read
Scramjet Transform Hub — Quick Start introduction
Łukasz Kamieniecki-Mruk for Scramjet, Oct 28 '21
#bigdata #javascript #typescript #serverless
12 reactions, 7 min read
Introduction to Scramjet Data Processing Platform
Łukasz Kamieniecki-Mruk for Scramjet, Oct 28 '21
#javascript #typescript #bigdata #serverless
9 reactions, 3 min read
Find The Best Way To Load Data In A Data Warehouse
Team RudderStack for RudderStack, Oct 18 '21
#datawarehouse #bi #bigdata #etl
2 reactions, 4 min read
Snowflake Vs BigQuery — Two Cloud Data Warehouses Of Many
SeattleDataGuy, Oct 11 '21
#bigdata #database #datascience #cloud
12 reactions, 2 comments, 6 min read
Getting Started With Apache Airflow
Sunny Srinidhi, Oct 11 '21
#airflow #bigdata #datascience #python
6 reactions, 11 min read
Big Data & Analytics : Driving Value To Advanced Business Growth Initiatives
Kanika Vatsyayan, Sep 30 '21
#testing #bigdata #productivity #startup
6 reactions, 1 comment, 6 min read
Testing Machine Learning to predict customer churn using Amazon SageMaker with Snowflake!
Chatchai Komrangded (Bas) for AWS Community ASEAN, Sep 17 '21
#aws #awsthai #bigdata #datascience
8 reactions, 1 comment, 6 min read
Computing the Pearson correlation matrix on huge datasets in Python
Linus Kohl, Sep 6 '21
#python #bigdata
6 reactions, 2 comments, 5 min read
How to scrape twitter data with Headless Chrome and Puppeteer
Paymon Wang Lotfi, Sep 2 '21
#webdev #node #bigdata #javascript
6 reactions, 2 comments, 5 min read
Reliable ingestion from AWS S3 using Hudi
vinoth chandar, Sep 2 '21
#datascience #bigdata #analytics #aws
3 reactions, 6 min read
Testing how to anonymize data in your data lake with Amazon Athena
Chatchai Komrangded (Bas) for AWS Community ASEAN, Aug 31 '21
#aws #awsthai #bigdata #datascience
9 reactions, 1 comment, 2 min read
Getting started with SQL-based INSERTS, DELETES and UPSERTS in S3 using AWS Glue 3.0 and Delta Lake
Chatchai Komrangded (Bas) for AWS Community ASEAN, Aug 30 '21
#aws #awsthai #bigdata #datascience
10 reactions, 6 min read
How to deal with Big data challenges
Adamo Software, Aug 30 '21
#challenge #bigdata #news
6 reactions, 5 min read
SQL-based INSERTS, DELETES and UPSERTS in S3 using AWS Glue 3.0 and Delta Lake
Kyle Escosia for AWS Community ASEAN, Aug 23 '21
#aws #tutorial #bigdata #datascience
21 reactions, 8 comments, 8 min read
To let the beginners know their career goals who have opted data science.
Devansh Tayal, Aug 18 '21
#datascience #machinelearning #bigdata #cloud
2 reactions, 2 min read
Updating data files, commits vs. pull requests
Nicolas Frankel, Aug 15 '21
#github #bigdata #automation #git
6 reactions, 4 comments, 3 min read
Unboxing a Database-How Databases Work Internally
Elegberun Olugbenga, Jul 30 '21
#database #bigdata #distributedsystems
18 reactions, 4 comments, 11 min read
Data Optimization for Compacted Partitions
Dustin Smith, Jul 28 '21
#bigdata #datascience #spark #dataplatforms
3 reactions, 8 min read
Apache Hudi - The Streaming Data Lake Platform
vinoth chandar, Jul 27 '21
#datascience #analytics #bigdata #database
2 reactions, 25 min read
UPSERTS and DELETES using AWS Glue and Delta Lake
Kyle Escosia for AWS Community ASEAN, Jul 21 '21
#aws #tutorial #bigdata #analytics
25 reactions, 4 comments, 10 min read
Exploratory Data Analysis Using Python
Mwenda Harun Mbaabu, Jul 17 '21
#python #datascience #machinelearning #bigdata
44 reactions, 1 comment, 5 min read
Getting started with Azure Data Explorer and Azure Synapse Analytics for Big Data processing
Abhishek Gupta for Microsoft Azure, Jul 17 '21
#bigdata #analytics #python #programming
3 reactions, 9 min read
E-commerce Security Basics: How to Start with E-commerce Security
Jhon Wilson, Jun 30 '21
#security #beginners #bigdata
2 reactions, 6 min read
Creating a Spark Standalone Cluster with Docker and docker-compose(2021 update)
Marco Villarreal, Jun 27 '21
#docker #spark #bigdata
35 reactions, 4 comments, 7 min read
Understanding Open Telemetry and Observability w/ Splunk's Spiros Xanthos
Conor Bronsdon for LinearB, Jun 23 '21
#opentelemetry #observability #bigdata #datascience
12 reactions, 1 min read
How to Use Consistent Hashing in a System Design Interview?
Arslan Ahmad, Jun 19 '21
#distributedsystems #bigdata #career #architecture
7 reactions, 3 comments, 7 min read
The Complete Guide to Data Science, Big Data, and Data Analytics
Le Truong, Jun 11 '21
#datascience #bigdata
11 reactions, 3 min read
How to easily install kafka without zookeeper
Aditya Sridhar, Jun 7 '21
#kafka #tutorial #beginners #bigdata
5 reactions, 7 min read
AWS Data Lake with Terraform - Part 2 of 6
Augusto Valdivia for AWS Community Builders, Jun 3 '21
#aws #terraform #awsdatalake #bigdata
17 reactions, 2 min read
AWS Data Lake with Terraform - Part 1 of 6
Augusto Valdivia for AWS Community Builders, Jun 1 '21
#aws #terraform #awsdatalake #bigdata
28 reactions, 4 min read
Assess how many Kafka servers are needed to face a scenario of 1 billion requests.
fante-sun, May 27 '21
#kafka #architecture #bigdata
6 reactions, 6 min read
Big Data + MySQL = Mission InnoPossible?
Arctype Team for Arctype, May 25 '21
#database #mysql #bigdata #innodb
4 reactions, 9 min read
A Visual Guide To: Azure Data Factory
Nitya Narasimhan, Ph.D for Microsoft Azure, May 24 '21
#azure #sketchnote #bigdata #beginners
10 reactions, 4 min read
AzureFunBytes Reminder - Intro to @Azure Data Factory with @KromerBigData - 5/13/2021
Jay Gordon for Microsoft Azure, May 12 '21
#bigdata #azure #tutorial #beginners
7 reactions, 3 min read
5 Ways Big Data & Analytics Can Pay Off To Your Marketing & Sales in 2021
Shifa Martin, May 18 '21
#bigdata #analytics #development #solutions
5 reactions, 2 comments, 6 min read
AzureFunBytes Episode 43 - Intro to @Azure Data Factory with @KromerBigData
Jay Gordon for Microsoft Azure, May 13 '21
#azure #bigdata #etl #beginners
6 reactions, 3 min read