└── README.md

/README.md:
--------------------------------------------------------------------------------
# Paper Reading

A list of papers I have read.

## Data Lake, Data Warehouse and Lakehouse

- [Delta Lake: High-Performance ACID Table Storage over Cloud Object Stores](https://www.vldb.org/pvldb/vol13/p3411-armbrust.pdf)
- [Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics](https://www.cidrdb.org/cidr2021/papers/cidr2021_paper17.pdf)

## KV Store

- [WiscKey: Separating Keys from Values in SSD-conscious Storage](https://www.usenix.org/system/files/conference/fast16/fast16-papers-lu.pdf)
- [MyRocks: LSM-Tree Database Storage Engine Serving Facebook's Social Graph](https://vldb.org/pvldb/vol13/p3217-matsunobu.pdf)
- [PebblesDB: Building Key-Value Stores using Fragmented Log-Structured Merge Trees](https://www.cs.utexas.edu/~rak/papers/sosp17-pebblesdb.pdf)
- [Nova-LSM: A Distributed, Component-based LSM-tree Key-value Store](https://dl.acm.org/doi/10.1145/3448016.3457297)

## Cloud Database

- [The Snowflake Elastic Data Warehouse](https://event.cwi.nl/lsde/papers/p215-dageville-snowflake.pdf)
- [CloudJump: optimizing cloud databases for cloud storages](https://dl.acm.org/doi/abs/10.14778/3554821.3554834)

## NewSQL

- [What’s Really New with NewSQL?](https://db.cs.cmu.edu/papers/2016/pavlo-newsql-sigmodrec2016.pdf)
- [Spanner: Google’s Globally-Distributed Database](https://static.googleusercontent.com/media/research.google.com/zh-CN//archive/spanner-osdi2012.pdf)
- [Amazon Aurora: Design Considerations for High Throughput Cloud-Native Relational Databases](https://pdos.csail.mit.edu/6.824/papers/aurora.pdf)

## Distributed System

- [Time, Clocks, and the Ordering of Events in a Distributed System](http://lamport.azurewebsites.net/pubs/time-clocks.pdf)
- [The Google File System](https://static.googleusercontent.com/media/research.google.com/zh-CN//archive/gfs-sosp2003.pdf)
- [MapReduce: Simplified Data Processing on Large Clusters](https://static.googleusercontent.com/media/research.google.com/zh-CN//archive/mapreduce-osdi04.pdf)
- [Bigtable: A Distributed Storage System for Structured Data](https://static.googleusercontent.com/media/research.google.com/zh-CN//archive/bigtable-osdi06.pdf)
- [The Design of a Practical System for Fault-Tolerant Virtual Machines](https://pdos.csail.mit.edu/6.824/papers/vm-ft.pdf)
- [In Search of an Understandable Consensus Algorithm (Extended Version)](https://raft.github.io/raft.pdf)
- [ZooKeeper: Wait-free coordination for Internet-scale systems](https://www.usenix.org/legacy/event/atc10/tech/full_papers/Hunt.pdf)
- [Strong and Efficient Consistency with Consistency-Aware Durability](https://www.usenix.org/system/files/fast20-ganesan.pdf)

## Serverless

- [Benchmarking, Analysis, and Optimization of Serverless Function Snapshots](https://ustiugov.github.io/assets/files/REAP_ASPLOS21.pdf)

## Others

- [Are You Sure You Want to Use MMAP in Your Database Management System?](https://db.cs.cmu.edu/papers/2022/cidr2022-p13-crotty.pdf)
- [Optimizing Bloom Filter: Challenges, Solutions, and Comparisons](https://arxiv.org/pdf/1804.04777.pdf)
--------------------------------------------------------------------------------