Key points are not available for this paper at this time.
The Hadoop Distributed File System (HDFS) is designed to store very large data sets reliably, and to stream those data sets at high bandwidth to user applications. In a large cluster, thousands of servers both host directly attached storage and execute user application tasks. By distributing storage and computation across many servers, the resource can grow with demand while remaining economical at every size. We describe the architecture of HDFS and report on experience using HDFS to manage 25 petabytes of enterprise data at Yahoo!.
Building similarity graph...
Analyzing shared references across papers
Loading...
Shvachko et al. (Sat,) studied this question.
synapsesocial.com/papers/69df46cf35659245ec6148ea — DOI: https://doi.org/10.1109/msst.2010.5496972
Konstantin V. Shvachko
LinkedIn (United States)
Hairong Kuang
Meta (United States)
Sanjay Radia
Yahoo (United Kingdom)
Yahoo (United States)
Yahoo (United Kingdom)
Building similarity graph...
Analyzing shared references across papers
Loading...