CSCI 5570 Large-Scale Data Processing Systems

Paper list

Mandatory, covered in lectures

Data analytics systems [W2-3]

NoSQL and Distributed storage [W4]
- Bigtable: A Distributed Storage System for Structured Data
- Dynamo: Amazon’s Highly Available Key-value Store

Cluster management [W5-6]

Networking [W7-8]

Machine learning systems [W9-12]

Optional, for paper critics

Data analytics systems

Distributed storage

Cluster management

Networking I: Architecture

Networking II: Performance

Machine learning systems