What is RDD?

RDD stands for Resilient Distributed Dataset. It is a fundamental data structure of Apache Spark that represents an immutable, partitioned collection of elements that can be operated on in parallel. RDDs are fault-tolerant, meaning that if a partition of an RDD is lost, it can be recomputed from the lineage of transformations that created it. They are commonly used for large-scale data processing and analysis tasks, such as data cleaning, transformation, and machine learning. RDDs can be created from various data sources like Hadoop Distributed File System (HDFS), local files, databases, and other RDDs.

Find 165 organizations using RDD on Sumble →

What other technologies are related to RDD?

RDD Complementary Technologies

HDFS

HDFS is a distributed file system often used as a storage layer for Spark RDDs.

Hive

Hive can be used with Spark to query data stored in HDFS or other data sources used by RDDs.

HBase

HBase can be a data source for RDDs in Spark.

Number of organizations that mention technology

ⓘ Tap on a tech to explore matching organizations

Which job functions commonly mention RDD?

Data Engineer

161 Data Engineer jobs mention RDD

View 161 jobs on Sumble

Data Architect

10 Data Architect jobs mention RDD

View 10 jobs on Sumble

Software Engineer

286 Software Engineer jobs mention RDD

View 286 jobs on Sumble

Engineer

289 Engineer jobs mention RDD

View 289 jobs on Sumble

Data, Analytics & Machine Learning

51 Data, Analytics & Machine Learning jobs mention RDD

View 51 jobs on Sumble

See more or filter by date, location, industry, etc →

Which organizations are mentioning RDD?

Gemeente Rotterdam

Public Administration

Citi

Finance and Insurance

EXL

Professional Services

Barclays

Finance and Insurance

Cognizant Technology Solutions

Professional Services

See more or filter by date, location, industry, etc →

Summary powered by

Sumble

Find the right accounts, contact, message, and time to sell

Whether you're looking to get your foot in the door, find the right person to talk to, or close the deal — accurate, detailed, trustworthy, and timely information about the organization you're selling to is invaluable.

Use Sumble to:

Sign in to continue exploring

or

Book a call to discuss your needs

**RDD**

What is RDD?

What other technologies are related to RDD?

RDD Complementary Technologies

Which job functions commonly mention RDD?

Which organizations are mentioning RDD?

Find the right accounts, contact, message, and time to sell

RDD