Loading…
New Thinking [clear filter]
Wednesday, September 23
 

1:00pm PDT

f4: Facebook’s Warm BLOB Storage System
Facebook’s corpus of photos, videos, and other Binary Large OBjects (BLOBs) that need to be reliably stored and quickly accessible is massive and continues to grow. As the footprint of BLOBs increases, storing them in our traditional storage system, Haystack, is becoming increasingly inefficient. To increase our storage efficiency, measured in the effective-replication-factor of BLOBs, we examine the underlying access patterns of BLOBs and identify temperature zones that include hot BLOBs that are accessed frequently and warm BLOBs that are accessed far less often. Our overall BLOB storage system is designed to isolate warm BLOBs and enable us to use a specialized warm BLOB storage system, f4. f4 is a new system that lowers the effective-replication-factor of warm BLOBs while remaining fault tolerant and able to support the lower throughput demands. f4 currently stores over 65PBs of logical BLOBs and reduces their effective-replication-factor from 3.6 to either 2.8 or 2.1. f4 provides low latency; is resilient to disk, host, rack, and datacenter failures; and provides sufficient throughput for warm BLOBs.

Speakers
avatar for Satadru Pan

Satadru Pan

Software Engineer, Facebook
Satadru Pan works at Facebook and He has a passion for building web scale distributed infrastructure. Currently he is working on building and scaling Facebook’s blob storage system, which stores and serves all the photos/videos of of the site. In past he has worked on search engine... Read More →


Wednesday September 23, 2015 1:00pm - 1:50pm PDT
Winchester Room

2:00pm PDT

Pelican: A Building Block for Exascale Cold Data Storage
Pelican is a rack-scale design for cheap storage of data which is rarely accessed: cold data. It uses spun-down hard drives to maximise density and reduce costs. A Pelican rack supplies only enough resources (power, cooling, bandwidth) to support the cold data workloads we target, significantly reducing Pelican's total cost of ownership compared to traditional disk-based systems provisioned for peak performance.

The Pelican storage stack manages the limited resources, and their constraints. We describe the data layout and IO scheduling algorithms which ensures these constraints are not violated, while making best use of the available resources. We evaluate Pelican both in simulation and with a full rack, and show that Pelican performs well: delivering both high throughput and acceptable latency.

Speakers
avatar for Austin Donnelly

Austin Donnelly

Principal Research Software Development Engineer, Microsoft
Austin Donnelly is a Principal Research Software Development Engineer (RSDE) in the Systems group at Microsoft Research, Cambridge. He obtained his B.A. in Computer Science from the University of Cambridge in 1996, and went on to complete his Ph.D. there in 2002. He has worked on... Read More →


Wednesday September 23, 2015 2:00pm - 2:50pm PDT
Winchester Room

3:05pm PDT

Torturing Databases for Fun and Profit
Programmers use databases when they want a high level of reliability. Specifically, they want the sophisticated ACID (atomicity, consistency, isolation, and durability) protection modern databases provide. However, the ACID properties are far from trivial to provide, particularly when high performance must be achieved. This leads to complex and error-prone code—even at a low defect rate of one bug per thousand lines, the millions of lines of code in a commercial OLTP database can harbor thousands of bugs.

Here we propose a method to expose and diagnose violations of the ACID properties. We focus on an ostensibly easy case: power faults. Our framework includes workloads to exercise the ACID guarantees, a record/replay subsystem to allow the controlled injection of simulated power faults, a ranking algorithm to prioritize where to fault based on our experience, and a multi-layer tracer to diagnose root causes. Using our framework, we study 8 widely-used databases, ranging from open-source key-value stores to high-end commercial OLTP servers. Surprisingly, all 8 databases exhibit erroneous behavior. For the open-source databases, we are able to diagnose the root causes using our tracer, and for the proprietary commercial databases we can reproducibly induce data loss.

Speakers
avatar for Mai Zheng

Mai Zheng

Assistant Professor Computer Science Department - College of Arts and Sciences, New Mexico State University
Mai Zheng is an Assistant Professor at New Mexico State University. He received his Ph.D. in Computer Science from The Ohio State University in 2015. He was a visiting scholar at HP Labs in summer 2013. His research interests are in the broad areas of computer systems, including storage... Read More →


Wednesday September 23, 2015 3:05pm - 3:55pm PDT
Winchester Room

4:05pm PDT

Skylight — A Window on Shingled Disk Operation
We introduce Skylight, a novel methodology that combines software and hardware techniques to reverse engineer key properties of drive-managed ShingledMagnetic Recording (SMR) drives. The software part of Skylight measures the latency of controlled I/O operations to infer important properties of drive-managed SMR, including type, structure, and size of the persistent cache; type of cleaning algorithm; type of block mapping; and size of bands. The hardware part of Skylight tracks drive head movements during these tests, using a high-speed camera through an observation window drilled through the cover of the drive. These observations not only confirm inferences from measurements, but resolve ambiguities that arise from the use of latency measurements alone.We show the generality and efficacy of our techniques by running them on top of three emulated and two real SMR drives, discovering valuable performance-relevant details of the behavior of the real SMR drives.

Speakers
avatar for Peter Desnoyers

Peter Desnoyers

Professor College of Computer and Information Science, Northeastern University
Peter received his PhD in Computer Science from the University of Massachusetts, Amherst in 2008. Prior to that he spent fifteen years as an engineer in industry, at Apple, Motorola, and a number of start-ups, after receiving the BS and MS degrees in EECS from MIT in 1988. Peter's... Read More →


Wednesday September 23, 2015 4:05pm - 4:55pm PDT
Winchester Room
 
Filter sessions
Apply filters to sessions.