Apache HDFS Review - why 3 stars?

Hadoop HDFS Review
ITQlick Score: 89/100
ITQlick Rating: (3/5)
Pricing: 6.4/10 - high cost
Category: Business Intelligence -> Apache HDFS
Ranking:Ranked 18 out of 532 Business Intelligence systems
Company: Apache Hadoop
Pricing: starts at $1,000 per license
Typical customers: SMBs
Platforms: Desktop, Cloud
Links: Apache HDFS pricing, Apache HDFS alternatives

Shlomi LaviShlomi Lavi / updated: Jun 01, 2022

We publish unbiased reviews. Our opinions are our own and are not influenced by payments from advertisers. This content is reader-supported, which means if you leave your details with us we may earn a commission. Learn more by reading our advertiser disclosure.

What is Apache HDFS and its features?

Apache HDFS is a flexible open-source software module part of the Apache Hadoop project. HDFS provides high-throughput access to application data, handles large data sets, and runs on commodity hardware. The distributed file system has several features, and some are discussed below:
  • Large Data Set Processing: With HDFS, users can store, move, delete, and organize large data sets, usually saved in 128MB blocks across several machines. The software can support millions of files at once and provides hundreds of nodes in a single cluster with high aggregate data bandwidth and scale.
  • Data Replication: Users can store large files, in gigabytes, terabytes, and petabytes, across several machines in large clusters with HDFS. By storing these files as a sequence of blocks, HDFS replicates these blocks for fault tolerance in a write-once mode.
  • File System Namespace: With HDFS, users can create traditional file organizations and create file directories for saving files. Users or applications can also create, move, remove, and rename files.
  • NameNode and DataNode: HDFS features a NameNode, a controller server that regulates the file system namespace, grants clients access to files, and manages commands like opening, closing, and renaming file directories. The software also has DataNodes that serve read and write requests from clients.
The Apache HDFS was launched in 2009. The company has between 11 to 50 employees and has its headquarters in Baltimore, Maryland, USA.

What is Apache HDFS rating?

The rating of Apache HDFS is 3 stars out of 5 and the total score is 89 out of 100. The rating score is based on our unbiased data-based algo. Learn more - rating methodology page.

Who uses Apache HDFS?

The typical customers include the following business size: SMBs.

Is it easy to use?

Yes, Apache HDFS makes it fairly easy to use. This product is user friendly and intuitive.

Is Apache HDFS cloud-based?

Yes, they offer cloud based hosting, which means IT resources are retrieved through web-based tools and applications and you don't need to buy servers and additional hardware equipment.

Can it be used on mobile?

No, currently Apache HDFS can't be accessed on mobile, you would need to access it from your computer.

Is Apache HDFS the best Business Intelligence system?

Apache HDFS is ranked 18 out of 532 software in their category (Business Intelligence systems).

What are Apache HDFS's pros? (4 Pros)

  • Apache HDFS is open source and free to use
  • The software can be deployed as a cloud solution and on-premise software and runs on commodity hardware
  • Apache HDFS supports storing files in several formats
  • It is fast and not prone to failure

What are Apache HDFS's cons? (3 cons)

  • Occasionally, file replicas created for fault tolerance usually consume more space
  • HDFS has a steep learning curve, especially for users not proficient in coding
  • Sometimes, the system lags when nodes are shared

Who are their main alternatives?

HDFS faces competition from leading software brands in the distributed file system and object storage software category. Some of its top alternatives include Red Hat Ceph Storage, Qumulo Core File System, Pure Storage FlashBlade, and NetApp StorageGRID. HDFS is compared to its top alternatives in the following aspects:
  • Cloud/On-premise: Apache HDFS is an open-source software module that can be deployed on the cloud or as an on-premise solution. In comparison, all top alternatives of Apache HDFS are available as cloud-based software, on-premise software, and hybrid-cloud solutions.
  • Cost: HDFS is open source, meaning it is free to use. In comparison, Qumulo Core pricing starts at $3411 per month, and a free, open-source version of Red Hat Ceph Storage is available. Other top alternatives of HDFS do not disclose their pricing details. Also, a free trial offer is available for StorageGRID and FlashBlade.
  • Customers/Industries: Many companies from several industries use the Apache HDFS, including Twitter, LinkedIn, and eBay. In comparison, HDFS and its top competitors serve similar customers and industries. Also, companies in media & entertainment use Qumulo Core, Red Hat Ceph Storage is used by financial services, NetApp StorageGRID is used in the oil & gas industry, while governments use Pure Storage FlashBlade.
  • Functionality: Apache HDFS has several features like file system namespace, data replication, large data processing, etc. Compared to its top alternatives, HDFS does not have native machine learning and artificial intelligence functionalities like its top competitors.
  • Popularity: HDFS is a famous open-source software that several enterprises from all over the world use. The Apache Software Foundation, the community that offers HDFS, valued the total software releases, downloads, and usage at $22 billion. In comparison, Pure Storage has over 10,000 customers globally, NetApp serves notable brands like Dow Jones and DreamWorks Animation, and Qumulo stores over 200 billion files.
  • Scalability: Apache HDFS handles big data, primarily suited to multinational enterprises and global conglomerates store data in terabytes and petabytes. In comparison, all top alternatives of HDFS are equally scalable software options.
  • Support: The Apache HDFS users can get support via a knowledge base, the community, and FAQ channels. Compared to its top alternatives, Apache HDFS does not offer support via live chat, which is available on NetApp and Qumulo. Also, all top alternatives of HDFS offer support through email and phone channels, which are not available for clients that use HDFS.

What is the cost of Apache HDFS?

The cost of license starts at $1,000 per license. On top of the total cost of licenses which is easy to calculate - what is the REAL total cost of implementation? You need to add the cost of training, customization, data migration, and other "hidden cost". ITQlick pricing score is 6.4 out of 10 (10 is most expensive). You can find here the REAL total cost breakdown for Apache HDFS.

What should you ask Apache Hadoop during a demo session? (65 questions)

Schedule a demo with Apache HDFS and access 65 must ask questions for the vendor while watching Apache HDFS in action. Visit the vendor's website here.

Where Apache HDFS is used?

The Apache HDFS is an open-source distributed file system that can be deployed as a cloud solution and on-premise software. HDFS stores and processes big data and is suitable for large-scale companies across several industries. Companies like Twitter, eBay, and LinkedIn have used the Apache HDFS.

List of categories

BI & Analytics
Business Intelligence

Apache HDFS Vs. Alternatives

ITQlick rating
License pricing
Apache H...
ITQlick rating
License pricing
$1,000 per license
ITQlick rating
License pricing
$30 per user/month
ITQlick rating
License pricing
$499 per month + $99 per user
ITQlick rating
License pricing
$83 per employee/month
Infor Bi...
ITQlick rating
License pricing
Pricing not available

Shlomi Lavi

Website Linkedin Facebook Twitter

Shlomi holds a Bachelor of Science (B.Sc.) in Information System Engineering from Ben Gurion University in Israel. Shlomi brings 15 years of global IT and IS management experience as a consultant, and implementation expert for small, medium and large size (global) companies.