├── distributed-system ├── TLA + Proofs.pdf ├── Paxos Made Live.pdf ├── Paxos_Made_Simple.pdf ├── formal-methods-amazon.pdf ├── Flat Datacenter Storage.pdf ├── The_Google_File_System.pdf ├── millions-of-tiny-databases.pdf ├── Scaling Memcache at Facebook.pdf ├── GFS: Evolution on Fast-Forward.pdf ├── The_Hadoop_Distributed_File_System.pdf ├── On the Locality of Codeword Symbols.pdf ├── Autopilot:Automatc-DataCenter-Management.pdf ├── Kudu_Storage_for_Fast_Analytics_on_Fast_Data.pdf ├── PNUTS: Yahoo!’s Hosted Data Serving Platform.pdf ├── Spark_SQL_Relational_Data_Processing_in_Spark.pdf ├── The_Design_and_Implementation_of_Open vSwitch.pdf ├── Vertical Paxos and Primary-Backup Replication.pdf ├── Inside the Social Network’s (Datacenter) Network.pdf ├── Pregel a system for large-scale graph processing.pdf ├── Spanner: Google’s Globally-Distributed Database.pdf ├── Apache_Calcite_A_Foundational_Framework_for_Optimi.pdf ├── Apache_Hadoop_YARN_Yet_Another_Resource_Negotiator.pdf ├── Dremel: Interactive Analysis of Web-Scale Datasets.pdf ├── Dynamo: Amazon’s Highly Available Key-value Store .pdf ├── Impala: A Modern, Open-Source SQL Engine for Hadoop.pdf ├── In_Search_of_an_Understandable_Consensus_Algorithm.pdf ├── Large-scale_cluster_management_at_Google_with_Borg.pdf ├── Cassandra - A Decentralized Structured Storage System.pdf ├── Finding a needle in Haystack: Facebook’s photo storage.pdf ├── MapReduce_Simplied_Data_Processing_on_Large_Clusters.pdf ├── Tango: Distributed Data Structures over a Shared Log.pdf ├── Hive_A_Warehousing_Solution_Over_a_Map-Reduce_Framework.pdf ├── Raft:In Search of an Understandable Consensus Algorithm.pdf ├── Tenzing A SQL Implementation On The MapReduce Framework.pdf ├── Bigtable_A_Distributed_Storage_System_for_Structured_Data.pdf ├── Zab: High-performance broadcast for primary-backup systems.pdf ├── Analysis of HDFS Under HBase: A Facebook Messages Case Study.pdf ├── Mesa_Geo-Replicated_Near_Real-Time_Scalable_Data_Warehousing.pdf ├── MillWheel: Fault-Tolerant Stream Processing at Internet Scale.pdf ├── PacificA_Replication_in_Log-Based_Distributed_Storage_System.pdf ├── ZooKeeper - Wait-free coordination for Internet-scale systems.pdf ├── Dapper, a Large-Scale Distributed Systems Tracing Infrastructure.pdf ├── The_Chubby_lock_service_for_loosely-coupled_distributed_systems.pdf ├── Apollo: Scalable and Coordinated Scheduling for Cloud-Scale Computing.pdf ├── Photon: Fault-tolerant and Scalable Joining of Continuous Data Streams.pdf ├── SEDA: An Architecture for Well-Conditioned, Scalable Internet Services.pdf ├── Optimizing_Queries_Using_Materialize_Views_A_Practical_Scalable_Solution.pdf ├── Using Paxos to Build a Scalable, Consistent, and Highly Available Datastore.pdf ├── Megastore: Providing Scalable, Highly Available Storage for Interactive Services.pdf ├── Large-scale Incremental Processing Using Distributed Transactions and Notifications.pdf ├── Fuxi: a Fault-Tolerant Resource Management and Job Scheduling System at Internet Scale.pdf ├── Resilient_Distributed_Datasets_A_Fault-Tolerant_Abstraction_for_In-Memory_Cluster_Computing.pdf └── README.md ├── operation-system └── ProgrammingGroundUp.pdf └── README.md /distributed-system/TLA + Proofs.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/TLA + Proofs.pdf -------------------------------------------------------------------------------- /distributed-system/Paxos Made Live.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Paxos Made Live.pdf -------------------------------------------------------------------------------- /distributed-system/Paxos_Made_Simple.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Paxos_Made_Simple.pdf -------------------------------------------------------------------------------- /operation-system/ProgrammingGroundUp.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/operation-system/ProgrammingGroundUp.pdf -------------------------------------------------------------------------------- /distributed-system/formal-methods-amazon.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/formal-methods-amazon.pdf -------------------------------------------------------------------------------- /distributed-system/Flat Datacenter Storage.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Flat Datacenter Storage.pdf -------------------------------------------------------------------------------- /distributed-system/The_Google_File_System.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/The_Google_File_System.pdf -------------------------------------------------------------------------------- /distributed-system/millions-of-tiny-databases.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/millions-of-tiny-databases.pdf -------------------------------------------------------------------------------- /distributed-system/Scaling Memcache at Facebook.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Scaling Memcache at Facebook.pdf -------------------------------------------------------------------------------- /distributed-system/GFS: Evolution on Fast-Forward.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/GFS: Evolution on Fast-Forward.pdf -------------------------------------------------------------------------------- /distributed-system/The_Hadoop_Distributed_File_System.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/The_Hadoop_Distributed_File_System.pdf -------------------------------------------------------------------------------- /distributed-system/On the Locality of Codeword Symbols.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/On the Locality of Codeword Symbols.pdf -------------------------------------------------------------------------------- /distributed-system/Autopilot:Automatc-DataCenter-Management.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Autopilot:Automatc-DataCenter-Management.pdf -------------------------------------------------------------------------------- /distributed-system/Kudu_Storage_for_Fast_Analytics_on_Fast_Data.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Kudu_Storage_for_Fast_Analytics_on_Fast_Data.pdf -------------------------------------------------------------------------------- /distributed-system/PNUTS: Yahoo!’s Hosted Data Serving Platform.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/PNUTS: Yahoo!’s Hosted Data Serving Platform.pdf -------------------------------------------------------------------------------- /distributed-system/Spark_SQL_Relational_Data_Processing_in_Spark.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Spark_SQL_Relational_Data_Processing_in_Spark.pdf -------------------------------------------------------------------------------- /distributed-system/The_Design_and_Implementation_of_Open vSwitch.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/The_Design_and_Implementation_of_Open vSwitch.pdf -------------------------------------------------------------------------------- /distributed-system/Vertical Paxos and Primary-Backup Replication.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Vertical Paxos and Primary-Backup Replication.pdf -------------------------------------------------------------------------------- /distributed-system/Inside the Social Network’s (Datacenter) Network.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Inside the Social Network’s (Datacenter) Network.pdf -------------------------------------------------------------------------------- /distributed-system/Pregel a system for large-scale graph processing.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Pregel a system for large-scale graph processing.pdf -------------------------------------------------------------------------------- /distributed-system/Spanner: Google’s Globally-Distributed Database.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Spanner: Google’s Globally-Distributed Database.pdf -------------------------------------------------------------------------------- /distributed-system/Apache_Calcite_A_Foundational_Framework_for_Optimi.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Apache_Calcite_A_Foundational_Framework_for_Optimi.pdf -------------------------------------------------------------------------------- /distributed-system/Apache_Hadoop_YARN_Yet_Another_Resource_Negotiator.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Apache_Hadoop_YARN_Yet_Another_Resource_Negotiator.pdf -------------------------------------------------------------------------------- /distributed-system/Dremel: Interactive Analysis of Web-Scale Datasets.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Dremel: Interactive Analysis of Web-Scale Datasets.pdf -------------------------------------------------------------------------------- /distributed-system/Dynamo: Amazon’s Highly Available Key-value Store .pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Dynamo: Amazon’s Highly Available Key-value Store .pdf -------------------------------------------------------------------------------- /distributed-system/Impala: A Modern, Open-Source SQL Engine for Hadoop.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Impala: A Modern, Open-Source SQL Engine for Hadoop.pdf -------------------------------------------------------------------------------- /distributed-system/In_Search_of_an_Understandable_Consensus_Algorithm.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/In_Search_of_an_Understandable_Consensus_Algorithm.pdf -------------------------------------------------------------------------------- /distributed-system/Large-scale_cluster_management_at_Google_with_Borg.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Large-scale_cluster_management_at_Google_with_Borg.pdf -------------------------------------------------------------------------------- /distributed-system/Cassandra - A Decentralized Structured Storage System.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Cassandra - A Decentralized Structured Storage System.pdf -------------------------------------------------------------------------------- /distributed-system/Finding a needle in Haystack: Facebook’s photo storage.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Finding a needle in Haystack: Facebook’s photo storage.pdf -------------------------------------------------------------------------------- /distributed-system/MapReduce_Simplied_Data_Processing_on_Large_Clusters.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/MapReduce_Simplied_Data_Processing_on_Large_Clusters.pdf -------------------------------------------------------------------------------- /distributed-system/Tango: Distributed Data Structures over a Shared Log.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Tango: Distributed Data Structures over a Shared Log.pdf -------------------------------------------------------------------------------- /distributed-system/Hive_A_Warehousing_Solution_Over_a_Map-Reduce_Framework.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Hive_A_Warehousing_Solution_Over_a_Map-Reduce_Framework.pdf -------------------------------------------------------------------------------- /distributed-system/Raft:In Search of an Understandable Consensus Algorithm.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Raft:In Search of an Understandable Consensus Algorithm.pdf -------------------------------------------------------------------------------- /distributed-system/Tenzing A SQL Implementation On The MapReduce Framework.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Tenzing A SQL Implementation On The MapReduce Framework.pdf -------------------------------------------------------------------------------- /distributed-system/Bigtable_A_Distributed_Storage_System_for_Structured_Data.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Bigtable_A_Distributed_Storage_System_for_Structured_Data.pdf -------------------------------------------------------------------------------- /distributed-system/Zab: High-performance broadcast for primary-backup systems.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Zab: High-performance broadcast for primary-backup systems.pdf -------------------------------------------------------------------------------- /distributed-system/Analysis of HDFS Under HBase: A Facebook Messages Case Study.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Analysis of HDFS Under HBase: A Facebook Messages Case Study.pdf -------------------------------------------------------------------------------- /distributed-system/Mesa_Geo-Replicated_Near_Real-Time_Scalable_Data_Warehousing.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Mesa_Geo-Replicated_Near_Real-Time_Scalable_Data_Warehousing.pdf -------------------------------------------------------------------------------- /distributed-system/MillWheel: Fault-Tolerant Stream Processing at Internet Scale.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/MillWheel: Fault-Tolerant Stream Processing at Internet Scale.pdf -------------------------------------------------------------------------------- /distributed-system/PacificA_Replication_in_Log-Based_Distributed_Storage_System.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/PacificA_Replication_in_Log-Based_Distributed_Storage_System.pdf -------------------------------------------------------------------------------- /distributed-system/ZooKeeper - Wait-free coordination for Internet-scale systems.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/ZooKeeper - Wait-free coordination for Internet-scale systems.pdf -------------------------------------------------------------------------------- /distributed-system/Dapper, a Large-Scale Distributed Systems Tracing Infrastructure.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Dapper, a Large-Scale Distributed Systems Tracing Infrastructure.pdf -------------------------------------------------------------------------------- /distributed-system/The_Chubby_lock_service_for_loosely-coupled_distributed_systems.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/The_Chubby_lock_service_for_loosely-coupled_distributed_systems.pdf -------------------------------------------------------------------------------- /distributed-system/Apollo: Scalable and Coordinated Scheduling for Cloud-Scale Computing.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Apollo: Scalable and Coordinated Scheduling for Cloud-Scale Computing.pdf -------------------------------------------------------------------------------- /distributed-system/Photon: Fault-tolerant and Scalable Joining of Continuous Data Streams.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Photon: Fault-tolerant and Scalable Joining of Continuous Data Streams.pdf -------------------------------------------------------------------------------- /distributed-system/SEDA: An Architecture for Well-Conditioned, Scalable Internet Services.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/SEDA: An Architecture for Well-Conditioned, Scalable Internet Services.pdf -------------------------------------------------------------------------------- /distributed-system/Optimizing_Queries_Using_Materialize_Views_A_Practical_Scalable_Solution.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Optimizing_Queries_Using_Materialize_Views_A_Practical_Scalable_Solution.pdf -------------------------------------------------------------------------------- /distributed-system/Using Paxos to Build a Scalable, Consistent, and Highly Available Datastore.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Using Paxos to Build a Scalable, Consistent, and Highly Available Datastore.pdf -------------------------------------------------------------------------------- /distributed-system/Megastore: Providing Scalable, Highly Available Storage for Interactive Services.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Megastore: Providing Scalable, Highly Available Storage for Interactive Services.pdf -------------------------------------------------------------------------------- /distributed-system/Large-scale Incremental Processing Using Distributed Transactions and Notifications.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Large-scale Incremental Processing Using Distributed Transactions and Notifications.pdf -------------------------------------------------------------------------------- /distributed-system/Fuxi: a Fault-Tolerant Resource Management and Job Scheduling System at Internet Scale.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Fuxi: a Fault-Tolerant Resource Management and Job Scheduling System at Internet Scale.pdf -------------------------------------------------------------------------------- /distributed-system/Resilient_Distributed_Datasets_A_Fault-Tolerant_Abstraction_for_In-Memory_Cluster_Computing.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/lshmouse/reading-papers/HEAD/distributed-system/Resilient_Distributed_Datasets_A_Fault-Tolerant_Abstraction_for_In-Memory_Cluster_Computing.pdf -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- 1 | Add the papers for reading 2 | 3 | ### Reference 4 | - http://the-paper-trail.org/blog/distributed-systems-theory-for-the-distributed-systems-engineer/ 5 | - http://dancres.github.io/Pages/ 6 | - http://www.quora.com/What-are-the-seminal-papers-in-distributed-systems-Why 7 | - http://christophermeiklejohn.com/distributed/systems/2013/07/12/readings-in-distributed-systems.html 8 | - https://courses.engr.illinois.edu/cs525/sp2011/sched.htm 9 | -------------------------------------------------------------------------------- /distributed-system/README.md: -------------------------------------------------------------------------------- 1 | ## Papers for Computing Platform 2 | 3 | ### Distributed System 4 | - The Google File System(Google 5星推荐) 5 | - MapReduce:Simplied Data Processing on Large Clusters (Google 5星推荐) 6 | - Bigtable:A Distributed Storage System for Structured Data (Google 5星推荐) 7 | - The Chubby lock service for loosely-coupled distributed system 8 | - Large-scale cluster management at Google with Borg (Google 5星推荐) 9 | 10 | - The Hadoop Distributed File System (Hadoop 4星推荐) 11 | - Apache Hadoop YARN: Yet Another Resource Negotiator (Hadoop 4星推荐) 12 | 13 | ### OLAP 14 | - Apache Calcite: A Foundational Framework for Optimized Query Processing Over Heterogeneous Data Sources 15 | - Optimizing Queries Using Materialized Views:A Practical, Scalable Solution 16 | - Mesa: Geo-Replicated, Near Real-Time, Scalable Data Warehousing (Google 4星推荐) 17 | - Hive: A Warehousing Solution Over a Map-Reduce Framework 18 | - Kudu: Storage for Fast Analytics on Fast Data 19 | 20 | ### Consistent 21 | - In Search of an Understandable Consensus Algorithm (raft) 22 | - Paxos Made Simple 23 | - PacificA: Replication in Log-Based Distributed Storage System 24 | 25 | ### Computing Engine 26 | - Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing (Spark 4星推荐) 27 | - Spark SQL: Relational Data Processing in Spark (Spark 4星推荐) 28 | 29 | 注: 上面都是一些非常经典的论文,需要反复读(10遍+),互相讨论才有收获。隔一段时间在读,也会有新的收获。 30 | 31 | --------------------------------------------------------------------------------