EECS 591 - Winter 2017

Week Monday Wednesday Slides Related papers
1/2, 1/4 Two generals' problem, common knowledge Wednesday's slides J. Halpern and Y. Moses
Knowledge and Common Knowledge in a Distributed Environment
1/9, 1/11 Event ordering, Lamport clocks, vector clocks Clock synchronization Monday's slides
Wednesday's slides
L. Lamport
Time, Clocks, and the Ordering of Events in a Distributed System
O. Babaoglu, K. Marzullo
Consistent Global States of Distributed Systems: Fundamental Concepts and Mechanisms
F. Cristian
Probabilistic Clock Synchronization
1/16, 1/18 MLK day Atomic Commit, 2PC Wednesday's slides Bernstein, Goodman and Hadzilacos
Distributed Recovery
1/23, 1/25 3PC State Machine Replication, Primary-Backup Monday's slides
Wednesday's slides
D. Skeen
Non-blocking commit protocols
D. Skeen
Determining the Last Process to Fail
F. B. Scheider
The State Machine Approach
N. Budhiraja, K. Marzullo, F. B. Schneider, S. Toueg
The Primary-Backup Approach
1/30, 2/1 Consistency, TRB, Consensus Paxos Monday's slides
Wednesday's slides
M.J. Fisher, N.A. Lynch, and M.S. Paterson
Impossibility of Consensus in Asynchronous Systems
L. Lamport
Paxos made simple
The original Paxos paper, for brave souls only:
L. Lamport
The Part-time Parliament
2/6, 2/8 Byzantine Generals, PBFT Eve, Review of part 1 Monday's slides
Wednesday's slides
L. Lamport, R. Shostak, and M. Pease
The Byzantine Generals Problem
M. Castro and B. Liskov
Practical Byzantine Fault Tolerance
M. Kapritsos, Y. Wang, V. Quema, A. Clement, L. Alvisi, and M. Dahlin
All about Eve: Execute-Verify Replication for Multi-core Servers
2/13, 2/15 SpecPaxos, NOPaxos Zyzzyva, XFT SpecPaxos slides
NOPaxos slides
Zyzzyva slides
XFT slides
D. Ports, J. Li, V. Liu. N. Sharma, A. Krishnamurthy
Designing Distributed Systems Using Approximate Synchrony in Data Center Networks
J. Li, E. Michael, N. Sharma, A Szekeres, D. Ports
Just Say NO to Paxos Overhead: Replacing Consensus with Network Ordering
R. Kotla, L. Alvisi, M. Dahlin, A. Clement, and E. Wong
Zyzzyva: Speculative Byzantine Fault Tolerance
S. Liu, P. Viotti, C. Cachin, V. Quema, M. Vukolic
XFT: Practical Fault Tolerance beyond Crashes
2/20, 2/22 Traveling, no class Chubby, ZooKeeper Chubby slides
ZooKeeper slides
M. Burrows
The Chubby lock service for loosely-coupled distributed systems
P. Hunt, M. Konar, F. Junqueira, and B. Reed
ZooKeeper: Wait-free coordination for Internet-scale systems
2/27, 2/29 Spring break Spring break
3/6, 3/8 Locks with intent, CORFU, Tango Bayou, Depot Locks with intent slides
CORFU slides
Tango slides
Bayou slides
Depot slides
S. Setty, C. Su, J. Lorch, L. Zhou, H. Chen, P. Patel, J. Ren
Realizing the Fault-Tolerance Promise of Cloud Storage using Locks with Intent
M. Balakrishnan, D. Malkhi, V. Prabhakaran, T. Wobber, M. Wei, J. Davis
CORFU: A shared log design for flash clusters
M. Balakrishnan, D. Malkhi, T. Wobber, M. Wu, V. Prabhakaran, M. Wei, J. Davis, S. Rao, T. Zou, A. Zuck
Tango: distributed data structures over a shared log
D. Terry, M. Theimer, K. Petersen, A. Demers, M. Spreitzer, and C. Hauser
Managing Update Conflicts in Bayou, a Weakly Connected Replicated Storage System
P. Mahajan, S. Setty, S.Lee, A. Clement, L. Alvisi, M. Dahlin, and M. Walfish
Depot: Cloud storage with minimal trust
3/13, 3/15 Falcon, TAPIR Mencius - IronFleet Falcon slides
TAPIR slides Mencius slides IronFleet slides
J. Leners, H. Wu, W. Hung, M. Aguilera, M. Walfish
Detecting failures in distributed systems with the FALCON spy network
I. Zhang, N. Sharma, A. Szekeres, A. Krishnamurthy, D. Ports
Building Consistent Transactions with Inconsistent Replication
Y. Mao, F. Junqueira, K. Marzullo
Mencius: Building efficient replicated state machines for WANs
C. Hawblitzel, J. Howell, M. Kapritsos, J. Lorch, B. Parno, M. Roberts, S. Setty, B. Zill
IronFleet: proving practical distributed systems correct
3/20, 3/22 Dynamo, Gemini COPS, Eiger Dynamo slides
Gemini slides
COPS slides
Eiger slides
G. DeCandia et al.
Dynamo: Amazon's highly available key-value store
C. Li, D. Porto, A. Clement, J. Gehrke, N. Preguica, and R. Rodrigues
Making geo-replicated systems fast as possible, consistent when necessary
W. Lloyd, M. Freedman, M. Kaminsky, and D. Andersen
Don’t Settle for Eventual: Scalable Causal Consistency for Wide-Area Storage with COPS
W. Lloyd, M. Freedman, M. Kaminsky, and D. Andersen
Stronger Semantics for Low-Latency Geo-Replicated Storage
3/27, 3/29 PNUTS, Cassandra GFS, Megastore PNUTS slides
Cassandra slides
GFS slides
Megastore slides
B. Cooper et al.
PNUTS: Yahoo!’s Hosted Data Serving Platform
A. Lakshman, and P. Malik
Cassandra - A decentralized structured storage system
S. Ghemawat, H. Gobioff, and S. Leung
The Google file system
J. Baker et al.
Megastore: Providing Scalable, Highly Available Storage for Interactive Services
4/3, 4/5 Bigtable, RAMCloud MapReduce, Dryad Bigtable slides
RAMCloud slides
MapReduce slides
Dryad slides
F. Chang et al.
Bigtable: a distributed storage system for structured data
D. Ongaro, S. Rumble, R. Stutsman, J. Ousterhour, and M. Rosenblum
Fast crash recovery in RAMCloud
J. Dean, and S. Ghemawat
MapReduce: simplified data processing in large clusters
M. Isard, M. Budiu, Y. Yu, A. Birrel, and D. Fetterly
Dryad: Distributed Data-Parallel Programs from Sequential Building Blocks
4/10, 4/12 Spark, Chord Spanner, BitCoin Spark slides
Chord slides
Spanner slides
Bitcoin slides
M. Zaharia et al.
Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing
I. Stoica, R. Morris, D. Karger, M. Kaashoek, and H. Balakrishnan
Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications
J. Corbett et al.
Spanner: Google’s Globally-Distributed Database
S. Nakamoto
Bitcoin: A Peer-to-Peer Electronic Cash System
4/17 Review Review slides