Tech Insights
ORC

ORC

Last updated , generated by Sumble
Explore more →

What is ORC?

ORC (Optimized Row Columnar) is a self-describing, type-aware columnar file format designed for Hadoop workloads. It is optimized for large-scale data storage and processing, providing features like efficient data compression, predicate pushdown, and schema evolution. ORC files are commonly used in big data environments to improve query performance and reduce storage costs compared to traditional row-based formats like CSV or SequenceFile.

What other technologies are related to ORC?

ORC Competitor Technologies

Parquet is a columnar storage format like ORC, used for efficient data storage and retrieval in Hadoop ecosystems. They compete as choices for storing big data.
mentioned alongside ORC in 17% (2k) of relevant job posts
Avro is a row-based storage format, commonly used for data serialization. While the storage paradigm is different, it is used for data storage and is thus a competitor.
mentioned alongside ORC in 18% (1.4k) of relevant job posts
Iceberg is a table format for large analytic datasets. It competes with ORC as an alternative storage and management layer.
mentioned alongside ORC in 5% (382) of relevant job posts
Hudi is a data lake platform that brings database and data warehouse capabilities to data lakes. It competes with ORC as a data management solution.
mentioned alongside ORC in 7% (187) of relevant job posts
Delta Lake is an open-source storage layer that brings ACID transactions to Apache Spark and big data workloads. It competes with ORC as a storage and management layer.
mentioned alongside ORC in 2% (283) of relevant job posts

ORC Complementary Technologies

Spark is a unified analytics engine often used to process data stored in ORC format. It complements ORC by providing processing capabilities.
mentioned alongside ORC in 0% (1.5k) of relevant job posts
Hive is a data warehouse system built on top of Hadoop, often used to query data stored in ORC format. It complements ORC by providing querying capabilities.
mentioned alongside ORC in 1% (740) of relevant job posts
Flink is a stream processing framework that can read and write data in ORC format. It complements ORC by providing real-time processing capabilities.
mentioned alongside ORC in 1% (405) of relevant job posts

Which organizations are mentioning ORC?

Organization
Industry
Matching Teams
Matching People

This tech insight summary was produced by Sumble. We provide rich account intelligence data.

On our web app, we make a lot of our data available for browsing at no cost.

We have two paid products, Sumble Signals and Sumble Enrich, that integrate with your internal sales systems.