Computing Platforms
Content
The seminar will cover core concepts and ideas in the general area of computer systems, ranging from software and hardware architectures to system design for operating systems, data processing systems, and distributed systems. The focus will be on fundamental ideas that apply across systems and application areas but with an emphasis on those ideas that apply to cloud platforms and hardware accelerators.
Format
The seminar will consist on student presentations based on a list of papers that will be provided at the beginning of the course. Presentations will be arranged in slots of 20 minutes talk plus 10 minutes questions. Grades will be assigned based on quality of the presentation, coverage of the topic including material not in the original papers, participation during the seminar, and ability to understand, present, and criticize the underlying technology.
Seminar Hours
Mondays, 4-6pm, at CHN D 44. The first seminar will be on February 19th.
Lecturers
- Prof. Gustavo Alonso
- Dr. Michael Giardino
Teaching Assistant
- Michael Wawrzoniak ()
Schedule
Papers
You may need to click on the links from within the ETH network (via VPN) to get the full-text papers.
1. Michael Armbrust, Armando Fox, Rean Griffith, Anthony D. Joseph, Randy Katz, Andy Konwinski, Gunho Lee, David Patterson, Ariel Rabkin, Ion Stoica, Matei Zaharia (2010) A View of Cloud Computing. In: CACM. [external pageLinkcall_made]
2. Barroso, L., Marty, M., Patterson, D., & Ranganathan, P. (2017). Attack of the killer microseconds. In: CACM. [external pageLinkcall_made]
3. Primorac, M., Bugnion, E., & Argyraki, K. (2017). How to measure the killer microsecond. In: CCR. [external pageLinkcall_made]
4. Delimitrou, C., & Kozyrakis, C. (2018). Amdahl’s law for tail latency: Queueing theoretic models can guide design trade-offs in systems targeting tail latency, not just average performance. In: CACM. [external pageLinkcall_made]
5. Huaicheng Li , Daniel S. Berger, Stanko Novakovic , Lisa Hsu , Dan Ernst , Pantea Zardoshti, Monish Shah , Samir Rajadnya , Scott Lee , Ishwar Agarwal , Mark D. Hill , Marcus Fontoura , Ricardo Bianchini. (2023) Pond: CXL-Based Memory Pooling Systems for Cloud Platforms. In: ASPLOS. [external pageLinkcall_made]
6. Hasan Al Maruf, Hao Wang, Abhishek Dhanotia, Johannes Weiner, Niket Agarwal, Pallab Bhattacharya, Chris Petersen, Mosharaf Chowdhury, Shobhit Kanaujia, Prakash Chauhan. (2023) TPP: Transparent Page Placement for CXL-Enabled Tiered-Memory. In: ASPLOS. [external pageLinkcall_made]
7. Z Guo, Y Shan, X Luo, Y Huang, Y Zhang. Clio: A Hardware-Software Co-Designed Disaggregated Memory System. ASPLOS, 2022 [external pageLinkcall_made]
8. Q Zhang, P Bernstein, D Berger, B Chandramouli. Redy: Remote Dynamic Memory Cache. VLDB, 2022. [external pageLinkcall_made]
9. Shafer, J., Rixner, S., & Cox, A. L. (2010). The Hadoop distributed file system: Balancing portability and performance. In: ISPASS. [external pageLink1call_made][external pageLink2call_made]
10. Burrows, M. (2006). The Chubby lock service for loosely-coupled distributed systems. In: OSDI. [external pageLinkcall_made]
11. Hunt, P., Konar, M., Junqueira, F. P., & Reed, B. (2010). ZooKeeper: Wait-free coordination for internet-scale systems. In: USENIX ATC. [external pageLinkcall_made]
12. DeCandia, G., Hastorun, D., Jampani, M., Kakulapati, et al. (2007). Dynamo: Amazon’s Highly Available Key-value Store. In: SIGOPS. [external pageLinkcall_made]
13. Corbett, J. C., Dean, J., Epstein, M., et al. (2012). Spanner: Google’s Globally-Distributed Database. In: OSDI. [external pageLinkcall_made]
14. Bacon, D. F., Bales, N., Bruno, N., et al. (2017). Spanner: Becoming a SQL system. In: SIGMOD. [external pageLinkcall_made]
15. Armbrust, M., Ghodsi, A., Zaharia, M., et al. (2015). Spark SQL: Relational Data Processing in Spark. In: SIGMOD. [external pageLinkcall_made]
16. Chen, G. J., Wiener, J. L., Iyer, S., Jaiswa, et al. (2016). Realtime Data Processing at Facebook. In: SIGMOD. [external pageLinkcall_made]
17. Verbitski et al (2017) Amazon Aurora: Design Considerations for High Throughput Cloud-Native Relational Databases. In: SIGMOD '17. [external pageLinkcall_made]
18. Wang et al. (2023) PolarDB-IMCI: A Cloud-Native HTAP Database System at Alibaba. In: SIGMOD'23. [external pageLinkcall_made]
19. Lasse Thostrup, Gloria Doci, Nils Boeschen, Manisha Luthra, Carsten Binnig. Distributed GPU Joins on Fast RDMA-capable Networks. SIGMOD 2023 [external pageLinkcall_made]
20. Fang, Yuanwei and Zou, Chen and Chien, Andrew A. Accelerating raw data analysis with the ACCORDA software and hardware architecture. VLDB 2019 [external pageLinkcall_made]
21. Harshit Saokar, Soteris Demetriou, Nick Magerko, Max Kontorovich, Josh Kirstein, and Margot Leibold, Dimitrios Skarlatos, Hitesh Khandelwal and Chunqiang Tang. (2023) ServiceRouter: Hyperscale and Minimal Cost Service Mesh at Meta. In OSDI'23. [external pageLinkcall_made]
22. Johann Schleier-Smith, Vikram Sreekanti, Anurag Khandelwal, Joao Carreira, Neeraja J. Yadwadkar, Raluca Ada Popa, Joseph E. Gonzalez, Ion Stoica, David A. Patterson. What Serverless Computing Is and Should Become: The Next Phase of Cloud Computing. In: CACM. [external pageLinkcall_made]
23. Saxena et al. (2023). The Story of AWS Glue. In: VLDB'23. [external pageLinkcall_made]
24. Shankar, V., Krauth, K., Vodrahalli, K., Pu, Q., et al. (2020). Serverless linear algebra. In: SoCC. [external pageLinkcall_made]
25. Klimovic, A., Wang, Y., Stuedi, P., et al. (2018). Pocket: Elastic Ephemeral Storage for Serverless Analytics. In: OSDI. [external pageLinkcall_made]
26. S Fouladi, F Romero et al. From Laptop to Lambda: Outsourcing Everyday Jobs to Thousands of Transient Functional Containers. ATC, 2019. [external pageLinkcall_made]
27. Ao Wang, Jingyuan Zhang, Xiaolong Ma, Ali Anwar, Lukas Rupprecht, Dimitrios Skourtis, Vasily Tarasov, Feng Yan, Yue Cheng (2020). INFINICACHE: exploiting ephemeral serverless functions to build a cost-effective memory cache. In FAST'20. [external pageLinkcall_made]
Invited Talks
TBD
Presentations Tips