Back To Schedule
Monday, September 21 • 4:35pm - 5:25pm
Big Data Analytics on Object Stoage - Hadoop Over Ceph Object Storage with SSD Cache

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Cloud object store provides the ability to store objects across multiple datacenters over a straightforward HTTPS REST API. The namespace is hierarchical and can be searched. Objects can be arbitraiy large and numerous. The deployment can also be done on a commodity-harware based. This makes them an attractive option for archiving large amounts of data that are produced in science and industry. To analyze the data, advanced analytics such as MapReduce can be used. However, copying the data from the object store into distributed file system that the analytics system requires directly on object stores greatly improves usability and performance. In this work, we study the possibility of running Hadoop over Ceph Object Storage and identify common problems.


Yuan Zhou

Software Engineer, Intel Asia R&D
Yuan Zhou is a Senior Software Development Engineer in the Software and Service Group for Intel Corporation, working in the Big Data Technology team primarily focused on Cloud Storage Software. He has been working in Databases, Virtualization and Cloud computing for most of his 5... Read More →

Monday September 21, 2015 4:35pm - 5:25pm PDT
Stevens Creek Room

Attendees (0)