DEV Community
#bigdata
Posts
A new unified streaming and batch table storage solution similar to iceberg/hudi/delta lake but with several new functions
DMetaSoul
Mar 17 '22
#dataengineering #opensource #bigdata #programming
8 reactions
2 min read

[OPINION] Building a Career as a Data Engineer
Lis R. Barreto
Mar 9 '22
#bigdata #dataengineering #tips
2 reactions
2 min read

Characteristics of Big Data
Aarti Yadav
Mar 3 '22
#bigdata
4 reactions
8 min read

Apache Spark Unit Testing Strategies
Sukumaar Mane
Feb 28 '22
#scala #programming #apachespark #bigdata
9 reactions
1 min read

NodeJS - Get data from Redash v6 API
IRPAN KUSUMA W
Feb 26 '22
#redash #node #bigdata #analytics
6 reactions
2 min read

Building an Apache ECharts dashboard with React and Cube
Adnan Rahić for Cube
Feb 24 '22
#react #javascript #apacheecharts #bigdata
14 reactions
11 min read

[TIP] Enter the world of Data Engineering with Brazilian professionals who have become leading names in the field!
Lis R. Barreto
Feb 24 '22
#bigdata #dados #dataengineering #career
6 reactions
2 min read

What are the best practices while using BigQuery?
Kedar Kodgire
Feb 19 '22
#bigdata #cloud #googlecloud
11 reactions
2 min read

Building a Bubble Dashboard with Cube
Adnan Rahić for Cube
Feb 18 '22
#bigdata #businessintelligence #webdev #tutorial
9 reactions
14 min read

[ARTICLE] Data Warehouse, Data Lake, and Data Lakehouse: Concepts and Differences
Lis R. Barreto
Feb 18 '22
#bigdata #data #dataengineering
6 reactions
3 min read

Fast Multivalue Look-ups For Huge Data Sets
Oleksandr
Feb 7 '22
#bigdata #python #numpy #csv
6 reactions
6 min read

Dagster: The Best Free and Open-Source Alternative to Airflow With Python!
Josue Luzardo Gebrim
Feb 5 '22
#python #programming #bigdata #tooling
4 reactions
1 min read

What is the SingleStore and why should we use it?
winner
Feb 4 '22
#postgres #machinelearning #bigdata #programming
11 reactions
2 comments
3 min read

How to handle nested JSON with Apache Spark
JayReddy
Feb 3 '22
#database #bigdata #spark #scala
3 reactions
3 min read

Machine Learning Lifecycle Process
Adit Modi for Cloud Tech
Jan 31 '22
#machinelearning #datascience #bigdata #beginners
45 reactions
4 min read

Quill- Most efficient Scala driver for Apache Cassandra and Spark
JayReddy
Jan 31 '22
#bigdata #spark #sql #database
2 reactions
4 min read

Presenting ML-based COVID-19 Risk Assessment App Pandemonium
andreykh
Jan 28 '22
#showdev #javascript #webdev #bigdata
4 reactions
3 min read

Cleaning And Normalizing Data Using AWS Glue DataBrew
Sunny Srinidhi for AWS Community Builders
Jan 18 '22
#aws #datascience #bigdata #tutorial
14 reactions
3 comments
9 min read

Introduction to Apache Spark, SparkQL, and Spark MLib.
hridyesh bisht for AWS Community Builders
Jan 15 '22
#database #bigdata
12 reactions
15 min read

Data Lake explained
Barbara
Jan 11 '22
#bigdata #spark #analytics #schemaonread
6 reactions
4 min read

Introduction to Hive(A SQL layer above Hadoop)
hridyesh bisht for AWS Community Builders
Jan 10 '22
#database #bigdata
8 reactions
9 min read

Build a small TA-Lib container image
ChunTing Wu
Jan 6 '22
#docker #python #tutorial #bigdata
3 reactions
2 min read

SPOTLIGHT: A GENTLE INTRODUCTION TO MACHINE LEARNING CONCEPTS IN PYTHON
Ukeje Chukwuemeriwo Goodness
Jan 5 '22
#machinelearning #python #datascience #bigdata
5 reactions
5 min read

How to choose a MongoDB shard key
ChunTing Wu
Dec 29 '21
#mongodb #database #distributedsystems #bigdata
8 reactions
1 comment
3 min read

Big Data Open Source Frameworks
Siddharth Chandra
Dec 29 '21
#scala #python #bigdata #opensource
3 reactions
5 min read

Scala Vs Python Syntax Cheat Sheet
Siddharth Chandra
Dec 29 '21
#scala #bigdata #opensource #beginners
4 reactions
5 min read

Scala For Beginners - Crash Course - Part 2
Siddharth Chandra
Dec 29 '21
#scala #opensource #bigdata #beginners
3 reactions
6 min read

Scala For Beginners - Crash Course - Part 5
Siddharth Chandra
Dec 29 '21
#scala #bigdata #opensource #beginners
4 reactions
6 min read

Scala For Beginners - Crash Course - Part 3
Siddharth Chandra
Dec 29 '21
#scala #bigdata #opensource #beginners
3 reactions
6 min read

Scala For Beginners - Crash Course - Part 4
Siddharth Chandra
Dec 29 '21
#scala #bigdata #opensource #beginners
3 reactions
4 min read

Django + Mongodb works slowly
Jasmine Beeh
Dec 24 '21
#help #django #mongodb #bigdata
1 reaction
1 min read

Creating and running Spark Jobs in Scala on Cloud Dataproc !!!
Josue Luzardo Gebrim
Dec 22 '21
#scala #googlecloud #spark #bigdata
7 reactions
3 min read

Getting started with Spark
Barbara
Dec 22 '21
#bigdata #beginners #distributedsystems #programming
12 reactions
2 comments
6 min read

The World Beyond the Docker! $$ :)
Josue Luzardo Gebrim
Dec 22 '21
#docker #devops #kubernetes #bigdata
5 reactions
2 min read

Airbyte: Data Integration / CDC Solution for Modern Data Teams!
Josue Luzardo Gebrim
Dec 22 '21
#bigdata #database #opensource #sql
6 reactions
12 min read

Best extensions for JupyterLab!!
Josue Luzardo Gebrim
Dec 22 '21
#jupyter #python #bigdata #datascience
6 reactions
3 min read

Vitess: Easy database deployment, clustering, and scaling!
Josue Luzardo Gebrim
Dec 22 '21
#database #nosql #dataops #bigdata
5 reactions
5 min read

Zero to Deployment and Evolution Data Catalog!
Josue Luzardo Gebrim
Dec 22 '21
#datacatalog #dataops #bigdata #database
4 reactions
6 min read

Build an analytics app with React and Cube.js
Matt Angelosanto for LogRocket
Dec 22 '21
#bigdata #react #analytics #tutorial
8 reactions
9 min read

Cardinality Counting in Redis
ChunTing Wu
Dec 22 '21
#redis #architecture #bigdata #javascript
2 reactions
4 min read

Cube Cloud Deep Dive: Mastering Pre-Aggregations
Adnan Rahić for Cube
Dec 20 '21
#opensource #database #bigdata #analytics
6 reactions
11 min read

BigQuery SQL Tip: QUALIFY clause
Hui Zheng (I/Trust/You)
Dec 10 '21
#bigdata
5 reactions
1 min read

What Is Trino And Why Is It Great At Processing Big Data
SeattleDataGuy
Dec 9 '21
#database #bigdata #datascience #sql
19 reactions
7 min read

Using PySpark and AWS Glue to analyze multi-line log files
Maurice Borgmeier for AWS Community Builders
Dec 3 '21
#aws #python #bigdata #pyspark
12 reactions
1 comment
5 min read

ETLs vs ELTs: Why are ELTs Disrupting the Data Market?
SeattleDataGuy
Nov 30 '21
#datascience #database #bigdata #startup
15 reactions
8 min read

How IoT integration with ERP system can bring business benefits
HyperNym
Nov 29 '21
#erp #iot #bigdata #cloudcomputing
2 reactions
4 min read

Bigdata: A problem and a solution
Radha
Oct 26 '21
#discuss #bigdata #industry #challenge
1 reaction
4 min read

Cube Cloud Deep Dive: Starting a New Cube App
Adnan Rahić for Cube
Nov 23 '21
#tutorial #opensource #analytics #bigdata
16 reactions
9 min read

How Zero-Code Data Preparations Tools Enable Better, Faster IT Performance in the Age of Big Data
Javeria Gauhar
Nov 21 '21
#bigdata #datacleansing #datapreparation #itperformance
2 reactions
6 min read

Build your own data quality rules with AWS Glue DataBrew
Chatchai Komrangded (Bas) for AWS Community ASEAN
Nov 21 '21
#aws #awsthai #bigdata #datascience
12 reactions
6 min read

Identifying and handling personally identifiable information (PII) with AWS Glue DataBrew
Chatchai Komrangded (Bas) for AWS Community ASEAN
Nov 20 '21
#aws #awsthai #bigdata #datascience
6 reactions
4 min read

Understanding Apache Hive LLAP
Sunny Srinidhi
Nov 19 '21
#bigdata #datascience #database #apachehive
3 reactions
7 min read

What Is Crypto and How Does It Work ?
Bek Brace
Nov 17 '21
#bigdata #blockchain #cryptocurrency #machinelearning
9 reactions
7 comments
3 min read

Data lakes: building a serverless data pipeline
Luca Silvestri
Nov 8 '21
#serverless #bigdata #aws #datapipelines
3 reactions
6 min read

Installing Hadoop on the new M1 Pro and M1 Max MacBook Pro
Sunny Srinidhi
Nov 6 '21
#hadoop #bigdata #macbook #programming
9 reactions
8 min read

Meet the Innovators with Krzysztof Nowocin
12:14
ITMAGINATION
Nov 5 '21
#datascience #bigdata #career
2 reactions
6 min read

The Important SQL Queries for Beginners
Mahmoud EL-kariouny
Nov 2 '21
#tutorial #bigdata #sql #database
13 reactions
8 min read

A first update on our AI/ML/Big Data salary survey
ai-jobs.net
Sep 28 '21
#machinelearning #bigdata #salaries #career
2 reactions
2 min read

Data Engineering Introduction
Nicholas
Oct 29 '21
#database #bigdata
7 reactions
2 min read

Performance capabilities of data warehouses and how Cube can help
Adnan Rahić for Cube
Oct 28 '21
#bigdata #database #node #tutorial
18 reactions
18 min read