DEV Community

# bigdata

Cloud Data Fusion, a game-changer for GCP

Reactions 11 Comments 7
4 min read
6 big data trends and forecasts worthy of attention in 2020

Reactions 5 Comments
3 min read
Database is not always the answer

Reactions 22 Comments 12
2 min read
AWS: Redshift – quick start and SQL-workbench connection configuration

Reactions 13 Comments
4 min read
Data Lake vs Data Warehouse

Reactions 9 Comments
2 min read
Life Beyond Kafka with Apache Pulsar

Reactions 16 Comments
4 min read
10 Apache Hadoop tutorials, books, and courses for Java and Web developers

Reactions 46 Comments
6 min read
Azure Blob Storage with Pyspark

Reactions 10 Comments 1
2 min read
Building simple data pipelines in Azure using Cosmos DB, Databricks and Blob Storage

Reactions 7 Comments
15 min read
Big Data file formats explained

Reactions 10 Comments
7 min read
Spark. Anatomy of Spark application

Reactions 15 Comments
6 min read
Categorical Variables and Cardinality

Reactions 5 Comments
1 min read
Event Tracking and Analytics via Ruby on Rails, DynamoDB (with Streams), Kinesis Firehose and Athena and CloudWatch Dashboard!

Reactions 81 Comments
13 min read
Data Engineering — Complete Reference Guide From A-Z [2019]

Reactions 21 Comments
16 min read
PySpark and Parquet - Analysis

Reactions 10 Comments
3 min read
Creating a proof of concept for Spatial Joins

Reactions 4 Comments
4 min read
Understanding Partitioning in Azure Cosmos DB

Reactions 5 Comments 4
5 min read
Extending Business Intelligence Features of Kibana

Reactions 22 Comments 1
4 min read
Top 5 Online Courses to Learn Big Data and Hadoop for Beginners

Reactions 67 Comments
10 min read
Basic introduction to Big data

Reactions 14 Comments
3 min read
5 Best Practices for Setting Up Your Data Warehouse in the Cloud

Reactions 5 Comments
6 min read
Data lakes are hard

Reactions 17 Comments
4 min read
Kafka Getting Started - Kafka Series - Part 2

Reactions 14 Comments
4 min read
How Apache Kafka works? Kafka Series - Part 1

Reactions 15 Comments 4
3 min read
[Cheat sheet] Apache Spark: structure of a Spark application

Reactions 6 Comments
2 min read
Installing, Configuring and Using the Azure Databricks CLI

Reactions 8 Comments
3 min read
Different ways to word count in apache spark

Reactions 9 Comments
2 min read
What is the Future of Big Data Analytics and Hadoop?

Reactions 8 Comments
2 min read
How to Process Epic Amounts of Data in NodeJS

Reactions 90 Comments 1
6 min read
Wielding the power of web transparency

Reactions 15 Comments 1
9 min read
[Video] Visualizing data at scale with Google Data Studio

Reactions 6 Comments
1 min read
Apache Hadoop - TLS and SSL Notes

Reactions 9 Comments
4 min read
Big Data Analysis with Hadoop, Spark, and R Shiny

Reactions 28 Comments 1
12 min read
Processing Streaming Twitter Data using Kafka and Spark - Part 2: Creating Kafka Twitter producer

Reactions 21 Comments 5
7 min read
Processing Streaming Twitter Data using Kafka and Spark — Part 1: Setting Up Kafka Cluster

Reactions 18 Comments
4 min read
Processing Streaming Twitter Data using Kafka and Spark — The Plan

Reactions 10 Comments
2 min read
Window Functions in Stream Analytics

Reactions 25 Comments 5
9 min read
Tips and tools for analysing big data

Reactions 9 Comments
5 min read
Amazon Athena vs AWS Lambda: Comparing two solutions for Big Data Analysis

Reactions 21 Comments 5
8 min read
Super simple and fast delimited CSV data normalization with AWK

Reactions 10 Comments
2 min read
Streaming Data in Databricks Delta Tables

Reactions 14 Comments 3
3 min read
Managing and Configuring Clusters within Azure Databricks

Reactions 9 Comments
9 min read
Databases and Tables in Azure Databricks

Reactions 13 Comments
5 min read
What Is MapReduce?

Reactions 47 Comments 3
7 min read
Biomedical big data: from centralized governance to public participation

Reactions 18 Comments
2 min read
Expertise and context-based answer rating system for Q&A websites.

Reactions 7 Comments
1 min read
Local hadoop on laptop for practice

Reactions 20 Comments
4 min read
Apache Livy - Apache Spark, HDFS, and Kerberos

Reactions 14 Comments
2 min read
Using Hadoop in Azure HDInsight to process Big Data

Reactions 13 Comments
6 min read
Apache HBase - REST API - Atomic Operations

Reactions 9 Comments
6 min read
Apache Storm - Topology Permissions

Reactions 6 Comments
2 min read
Apache Hadoop S3A With Hitachi Content Platform (HCP)

Reactions 7 Comments
4 min read
Apache Livy - Simplified Apache Spark Integration

Reactions 11 Comments
2 min read
Apache Ranger - Hive over HDFS Audit Logs

Reactions 8 Comments
3 min read
Apache Ambari - Custom Alert Dispatch Script

Reactions 8 Comments
2 min read
Oracle JDK - Missing Ciphers - libsunec.so

Reactions 6 Comments
3 min read
Apache Knox - Improved Group Support

Reactions 8 Comments
3 min read
Apache Knox - Proxying Apache NiFi

Reactions 6 Comments
13 min read
HDF - Apache NiFi - Kerberos Errors and useSubjectCredsOnly

Reactions 2 Comments
3 min read
Learning about the Druid Architecture

Reactions 8 Comments
6 min read