DEV Community
#bigdata
Posts
🏆How to master 📊 Big Data pipelines with Taipy and PySpark 🐍
Marine for Taipy · Nov 29 '23
#python #opensource #bigdata #tutorial
218 reactions · 8 comments · 9 min read
Big data models 📊 vs. Computer memory 💾
Marine for Taipy · Nov 23 '23
#bigdata #pipeline #dataengineering #dask
186 reactions · 3 comments · 11 min read
Data-Powered Accessibility: How to Build Inclusive Product for Any User Need
Natalia · Oct 24 '23
#datapowered #bigdata #inclusiveproduct #a11y
48 reactions · 7 min read
Simplifying ETL Pipelines with SQL: Three Tips for Data Processing
gupta · Dec 10 '23
#database #sql #bigdata #programming
18 reactions · 3 min read
Why Python and SQL are Must-Have Skills for Marketing Analysts in the Age of Big Data
Scofield Idehen · Feb 23
#bigdata #python #datascience #sql
10 reactions · 6 min read
Most common errors when setting up Amazon EMR
Nowsath for AWS Community Builders · Nov 14 '23
#emr #dynamodb #hive #bigdata
8 reactions · 2 min read
Here comes big data technology that rivals clusters on a single machine
jbx1279 · Dec 23 '23
#bigdata #database #performance #sql
6 reactions · 6 min read
How to store and calculate historical big data with lower usage frequency
jbx1279 · Dec 9 '23
#database #bigdata #programming #sql
6 reactions · 4 min read
S3 Multi-Part Upload: Part 2 Conclusion
Mitansh Gor for Distinction Dev · Nov 18 '23
#aws #bigdata #s3 #multipart
6 reactions · 11 min read
SQL is consuming the lives of data scientists
jbx1279 · Aug 20 '23
#sql #datascience #database #bigdata
6 reactions · 3 comments · 20 min read
SPL computing performance test series: in-group accumulation
jbx1279 · Oct 22 '23
#performance #bigdata #database
5 reactions · 12 min read
Choosing the right AWS Database
Gaurav Raje for AWS Community Builders · Jan 17
#bigdata #beginners #architecture #database
5 reactions · 4 min read
Why does wide table prevail?
jbx1279 · Jun 18 '23
#database #bigdata #sql #programming
5 reactions · 13 min read
Which Scenarios Does ClickHouse Applies to?
jbx1279 · Oct 28 '23
#bigdata #performance #database #sql
5 reactions · 1 comment · 9 min read
SPL computing performance test series: funnel analysis
jbx1279 · Oct 14 '23
#performance #bigdata #database
5 reactions · 16 min read
⛏ Get Mining into Data with These Top 5 Resources
Cherlock Code 🔎 · Aug 15 '23
#bigdata #datascience #learning #beginners
5 reactions · 2 comments · 6 min read
Why Are There So Many Snapshot Tables in BI Systems?
jbx1279 · Jun 24 '23
#database #bigdata #sql #programming
5 reactions · 9 min read
A major culprit in the slow running and collapse of a database
jbx1279 · Jan 13
#bigdata #database #datawarehouse #performance
5 reactions · 10 min read
Is Your Latest Data Really the Latest? Check the Data Update Mechanism of Your Database
Apache Doris · Aug 3 '23
#datascience #database #sql #bigdata
4 reactions · 1 comment · 6 min read
Connecting Multiple Kafka Clusters in ClickHouse Using Named Collections
Shahab Ranjbary · Sep 25 '23
#clickhouse #kafka #dataintegration #bigdata
4 reactions · 3 min read
Data Streaming Architecture
Jose Luis Sastoque Rey for AWS Community Builders · Mar 27
#aws #bigdata #architecture
4 reactions · 4 min read
AWS Lake Formation Summarization
عبدالله عياد | Abdullah Ayad for AWS Community Builders · Dec 24 '23
#aws #beginners #cloud #bigdata
3 reactions · 3 min read
Snowflake: Revolutionizing data warehousing
Mage · Jul 21 '23
#datawarehouse #snowflake #cloudcomputing #bigdata
3 reactions · 1 comment · 6 min read
SQL Pro Tips : industrial GCP BigQuery SQL using WITH
hexfloor · Mar 28
#gcp #sql #database #bigdata
3 reactions · 5 min read
SQL Pro Tips : industrial Oracle SQL using WITH
hexfloor · Mar 28
#sql #oracle #bigdata #database
3 reactions · 4 min read
SQL Pro Tips : industrial AWS Athena SQL using WITH
hexfloor · Mar 28
#aws #database #bigdata #sql
3 reactions · 4 min read
BigQuery Machine Learning
Cris Crawford · Feb 10
#bigdata #machinelearning #googlecloud #sql
2 reactions · 5 min read
MWAA Plugins and Dependency Survival Guide
elliott cordo for AWS Heroes · Apr 5
#airflow #bigdata #dataengineering #aws
2 reactions · 3 min read
How to clone tables in BigQuery
Marcelo Costa · May 18 '23
#bigquery #bigdata #dataengineering
2 reactions · 1 min read
Business Intelligence Data Analyst vs. BI Developer
ai-jobs.net · Nov 22 '23
#bigdata #analyst #career #programming
2 reactions · 3 min read
Introduction to Big-data
Razi Shaikh · Aug 1 '23
#bigdata #shortoverview #shortnotes
2 reactions · 2 comments · 3 min read
HyperLogLog | Un algoritmo para contarlos (aproximadamente) a todos [HyperLogLog | An algorithm to count them all (approximately)]
Javi AS · Oct 4 '23
#algorithms #computerscience #bigdata #spanish
2 reactions · 6 min read
5 Common Mistakes with Apache Flink and How to Avoid Them
Umair-khurshid · Jul 13 '23
#bigdata #datascience #development #beginners
2 reactions · 3 min read
Bulk load to Elastic Search with PySpark
Valery C. Briz · Jun 1 '23
#elasticsearch #spark #pyspark #bigdata
2 reactions · 2 min read
How come there are tens of thousands of tables in a database
jbx1279 · Mar 23
#database #bigdata #sql
2 reactions · 1 comment · 5 min read
Integrating Apache Age with Other Big Data Tools and Frameworks
Mohanad Toaima · May 2 '23
#apacheage #apache #postgres #bigdata
2 reactions · 1 comment · 2 min read
BigQuery best practices
Cris Crawford · Feb 10
#dataengineering #bigdata
1 reaction · 2 min read
Supercharge Your S3 Data with AWS S3 Transfer Acceleration
Nils Whitmont · Jan 24
#s3 #aws #performance #bigdata
1 reaction · 3 min read
How to implement an efficient logical data warehouse? Try SPL!
jbx1279 · Sep 10 '23
#database #bigdata #programming #sql
1 reaction · 12 min read
Exploring Connections: How Meeting People Enriched My Master's Journey
Tawanda Nyahuye · Aug 30 '23
#programming #machinelearning #bigdata #datascience
1 reaction · 3 min read
Getting Started with Flink SQL, Apache Iceberg and DynamoDB Catalog
ChunTing Wu · Dec 18 '23
#datascience #bigdata #architecture #tutorial
1 reaction · 4 min read
Understanding Elasticsearch. A Guide for Beginners
nivelepsilon · Feb 10
#elasticsearch #devops #bigdata #beginners
1 reaction · 4 min read
Understanding Concurrency Through Amdahl's Law
luminousmen · Dec 4 '23
#bigdata #data
1 reaction · 3 min read
Must-Know Tech Terms Explained
Mohammad Daniyal · Aug 17 '23
#webdev #cloud #bigdata #cybersecurity
1 reaction · 2 min read
Data warehouse with “no house” performs better than the one with “the house”
jbx1279 · Aug 13 '23
#database #bigdata #sql #datascience
1 reaction · 11 min read
Lightweight big data processing technology
jbx1279 · Aug 5 '23
#database #bigdata #sql #datascience
1 reaction · 9 min read
Working with Parquet files in Java using Avro
Jerónimo López · Nov 26 '23
#parquet #java #avro #bigdata
1 reaction · 10 min read
Big data with Software Systems
Ravikanth Kowdeed · Feb 14
#softwareengineering #bigdata
1 reaction · 1 min read
From Big Data to Graph Computing - Graph On BigData
tugraph-analytics · Aug 1 '23
#bigdata #graphql #gql #github
1 reaction · 6 min read
Install Hadoop on Ubuntu
Atul Vishwakarma · Nov 4 '23
#bigdata #hadoop #ubuntu #learning
1 reaction · 6 min read
Unveiling the visualization capabilities of the DataWind product in Volcano Engine
玄魂 · Nov 9 '23
#bigdata #opensource #bigscreen #bi
1 reaction · 16 min read
ELT is dead, and EtLT becomes the ultimate destination of modern data processing architecture
Apache SeaTunnel · Jul 31 '23
#bigdata #database #datascience #apacheseatunnel
1 reaction · 10 min read
Healthcare & IT: Medical standards in IT based on HIPAA
*instinctools · Jul 27 '23
#cybersecurity #api #bigdata #healthcare
1 reaction · 9 min read
SQL Pro tips : GCP BigQuery SQL CROSS JOIN with UNPIVOT UNNEST
hexfloor · Oct 6 '23
#gcp #sql #database #bigdata
1 reaction · 4 min read
SQL Pro tips : AWS Athena SQL UNPIVOT : CROSS JOIN UNNEST
hexfloor · Oct 6 '23
#aws #database #bigdata #sql
1 reaction · 3 min read
HTAP: Learning from Xiaohongshu
ChunTing Wu · May 8 '23
#architecture #bigdata #database #datascience
1 reaction · 5 min read
Meet Apache SeaTunnel, a new Apache Top-Level Project!
Apache SeaTunnel · Jun 2 '23
#seatun #opensource #datascience #bigdata
1 reaction · 4 min read
Test Driving Redshift AI-Driven Scaling
elliott cordo for AWS Heroes · Dec 21 '23
#aws #bigdata #dataengineering #analytics
1 reaction · 3 min read
SPL computing performance test series: multi-index aggregating
jbx1279 · Sep 30 '23
#bigdata #database #performance #sql
1 reaction · 6 min read
SQL Pro tips : CROSS JOIN UNPIVOT summary for beginners
hexfloor · Nov 22 '23
#sql #database #bigdata #beginners
1 reaction · 3 min read