EECS 591 - Winter 2018

Week Monday Wednesday Slides Related papers
1/1, 1/3 Two generals' problem, common knowledge Wednesday's slides J. Halpern and Y. Moses
Knowledge and Common Knowledge in a Distributed Environment
1/8, 1/10 Event ordering, Lamport clocks, vector clocks Clock synchronization Monday's slides
Wednesday's slides
L. Lamport
Time, Clocks, and the Ordering of Events in a Distributed System
O. Babaoglu, K. Marzullo
Consistent Global States of Distributed Systems: Fundamental Concepts and Mechanisms
F. Cristian
Probabilistic Clock Synchronization
1/15, 1/17 MLK day NTP, Atomic Commit Wednesday's slides Bernstein, Goodman and Hadzilacos
Distributed Recovery
1/22, 1/24 2PC, 3PC TRB, Consensus Monday's slides
Wednesday's slides
D. Skeen
Non-blocking commit protocols
D. Skeen
Determining the Last Process to Fail
M.J. Fischer, N.A. Lynch, and M.S. Paterson
Impossibility of Distributed Consensus with One Faulty Process
1/29, 1/31 State Machine Replication, Primary-Backup Consistency, Paxos Monday's slides
Wednesday's slides
F. B. Schneider
The State Machine Approach
N. Budhiraja, K. Marzullo, F. B. Schneider, S. Toueg
The Primary-Backup Approach
L. Lamport
Paxos made simple
The original Paxos paper, for brave souls only:
L. Lamport
The Part-time Parliament
2/5, 2/7 Paxos (cont.) Byzantine Generals, PBFT Monday's slides
Wednesday's slides
L. Lamport, R. Shostak, and M. Pease
The Byzantine Generals Problem
M. Castro and B. Liskov
Practical Byzantine Fault Tolerance
2/12, 2/14 Eve FastPaxos, Flexible Paxos Monday's slides
Fast Paxos slides
Flexible Paxos slides
M. Kapritsos, Y. Wang, V. Quema, A. Clement, L. Alvisi, and M. Dahlin
All about Eve: Execute-Verify Replication for Multi-core Servers
L. Lamport
Fast Paxos
H. Howard, D. Malkhi, A. Spiegelman
Flexible Paxos: Quorum Intersection Revisited
2/19, 2/21 SpecPaxos, NOPaxos ZooKeeper, CORFU SpecPaxos slides
NOPaxos slides
ZooKeeper slides
CORFU slides
D. Ports, J. Li, V. Liu, N. Sharma, A. Krishnamurthy
Designing Distributed Systems Using Approximate Synchrony in Data Center Networks
J. Li, E. Michael, N. Sharma, A. Szekeres, D. Ports
Just Say NO to Paxos Overhead: Replacing Consensus with Network Ordering
P. Hunt, M. Konar, F. Junqueira, and B. Reed
ZooKeeper: Wait-free coordination for Internet-scale systems
M. Balakrishnan, D. Malkhi, V. Prabhakaran, T. Wobber, M. Wei, J. Davis
CORFU: A shared log design for flash clusters
2/26, 2/28 Spring break Spring break
3/5, 3/7 Zyzzyva, XFT Falcon, Mencius Zyzzyva slides
XFT slides
R. Kotla, L. Alvisi, M. Dahlin, A. Clement, and E. Wong
Zyzzyva: Speculative Byzantine Fault Tolerance
S. Liu, P. Viotti, C. Cachin, V. Quema, M. Vukolic
XFT: Practical Fault Tolerance beyond Crashes
J. Leners, H. Wu, W. Hung, M. Aguilera, M. Walfish
Detecting failures in distributed systems with the FALCON spy network
Y. Mao, F. Junqueira, K. Marzullo
Mencius: Building efficient replicated state machines for WANs
3/12, 3/14 TAPIR, IronFleet Midterm I. Zhang, N. Sharma, A. Szekeres, A. Krishnamurthy, D. Ports
Building Consistent Transactions with Inconsistent Replication
C. Hawblitzel, J. Howell, M. Kapritsos, J. Lorch, B. Parno, M. Roberts, S. Setty, B. Zill
IronFleet: proving practical distributed systems correct
3/19, 3/21 Bayou, Dynamo COPS, RAMCloud D. Terry, M. Theimer, K. Petersen, A. Demers, M. Spreitzer, and C. Hauser
Managing Update Conflicts in Bayou, a Weakly Connected Replicated Storage System
G. DeCandia et al.
Dynamo: Amazon's highly available key-value store
W. Lloyd, M. Freedman, M. Kaminsky, and D. Andersen
Don’t Settle for Eventual: Scalable Causal Consistency for Wide-Area Storage with COPS
D. Ongaro, S. Rumble, R. Stutsman, J. Ousterhout, and M. Rosenblum
Fast crash recovery in RAMCloud
3/26, 3/28 GFS, Bigtable Megastore, Spanner S. Ghemawat, H. Gobioff, and S. Leung
The Google file system
F. Chang et al.
Bigtable: a distributed storage system for structured data
J. Baker et al.
Megastore: Providing Scalable, Highly Available Storage for Interactive Services
J. Corbett et al.
Spanner: Google’s Globally-Distributed Database
4/2, 4/4 MapReduce, Spark Bitcoin, Algorand J. Dean and S. Ghemawat
MapReduce: simplified data processing in large clusters
M. Zaharia et al.
Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing
S. Nakamoto
Bitcoin: A Peer-to-Peer Electronic Cash System
Y. Gilad, R. Hemo, S. Micali, G. Vlachos, N. Zeldovich
Algorand: Scaling Byzantine Agreements for Cryptocurrencies
4/9, 4/11 TBD TBD
4/16 TBD