DEV Community

# bigdata

Posts

Processing Streaming Twitter Data using Kafka and Spark - Part 2: Creating Kafka Twitter producer

21
Comments 5
7 min read
Processing Streaming Twitter Data using Kafka and Spark — The Plan

11
Comments
2 min read
Processing Streaming Twitter Data using Kafka and Spark — Part 1: Setting Up Kafka Cluster

18
Comments
4 min read
Streams For the Win: A Performance Comparison of Node.js Methods for Reading Large Datasets (Pt 2)

5
Comments
9 min read
Window Functions in Stream Analytics

31
Comments 5
9 min read
What makes code slow to execute

14
Comments
1 min read
Blockchain: What Is It, How It Works, And What It Means For Big Data

8
Comments
4 min read
Real world data processing with Google Cloud Platform

12
Comments 1
1 min read
Amazon Athena vs AWS Lambda: Comparing two solutions for Big Data Analysis

22
Comments 5
8 min read
Streaming Data in Databricks Delta Tables

15
Comments 3
3 min read
Super simple and fast delimited CSV data normalization with AWK

10
Comments
2 min read
Managing and Configuring Clusters within Azure Databricks

11
Comments
9 min read
Databases and Tables in Azure Databricks

13
Comments
5 min read
What Is MapReduce?

47
Comments 3
7 min read
Biomedical Big Data: From Centralized Governance to Public Participation

18
Comments 2
2 min read
Expertise and context-based answer rating system for Q&A websites.

7
Comments
1 min read
Local hadoop on laptop for practice

20
Comments
4 min read
Apache Livy - Apache Spark, HDFS, and Kerberos

14
Comments
2 min read
Conferences, Meetups, Hackathons: A learning rollercoaster in the past two months. Part II - The Hackathons

9
Comments
4 min read
Using Hadoop in Azure HDInsight to process Big Data

13
Comments
6 min read
Apache HBase - REST API - Atomic Operations

9
Comments
6 min read
Apache Storm - Topology Permissions

6
Comments
2 min read
Apache Hadoop S3A With Hitachi Content Platform (HCP)

7
Comments
4 min read
Apache Livy - Simplified Apache Spark Integration

11
Comments
2 min read
Apache Ranger - Hive over HDFS Audit Logs

8
Comments
3 min read