Reading List

These are the papers that we will cover this semester. Please sign up for a presentation by sending me an email with your preferences indicating the number of the paper. You can list three papers in order of preference so that we will be able to resolve conflicts faster.
NOTE: Some of you may be interested in other topics, e.g., graph related work, than the ones in the list below. If you prefer to present a paper on a different topic, please look over the accepted papers in the leading DB conferences in the last 2 years. Choose up to 3 papers and rank them in your order of preference. We will choose one that will best fit the scope of the class and your interest.
Presentation schedule is here in pdf.
Presentation guidelines are here.

  1. A Prolegomenon on OLTP Database Systems for NonVolatile Memory. DeBrabant et al. [read it]
    Presenter: TBD
  2. Actively Soliciting Feedback for Query Answers in Keyword Search-Based Data Integration. Zhepeng Yan, Nan Zheng, Zachary G. Ives, Partha Pratim Talukdar, Cong Yu. [read it]
    Presenter: TBD
  3. Data Management in Non-Volatile Memory. Stratis D. Viglas. [read it]
    Presenter: Radheshyam Devkinandan Karnani.
  4. REWIND: Recovery Write-Ahead System for In-Memory Non-Volatile Data-Structures. Andreas Chatzistergiou, Marcelo Cintra, Stratis D. Viglas: [read it]
    Presenter: Jimmy Seng
  5. Highly Available Transactions: Virtues and Limitations. Peter Bailis, Aaron Davidson, Alan Fekete, Ali Ghodsi, Joseph M. Hellerstein, Ion Stoica. [slides]
    Presenter: TBD
  6. Write-limited sorts and joins for persistent memory. Stratis D. Viglas. [slides]
    Presenter: TBD
  7. Let's Talk About Storage & Recovery Methods for Non-Volatile Memory Database Systems. J. Arulraj, A. Pavlo, and S. R. Dulloor. [read it]
    Presenter: Hongzheng Wang
  8. NVRAMaware Logging in Transaction Systems. Jian Huang, Karsten Schwan, Moinuddin K. Qureshi. [read it]
    Presenter: TBD
  9. Incremental Record Linkage. Anja Gruenheid, Xin Luna Dong, Divesh Srivastava. [read it]
    Presenter: TBD
  10. SQLite Optimization with Phase Change Memory for Mobile Applications. Gihwan Oh, Sangchul Kim, Sang-Won Lee, Bongki Moon. [read it]
    Presenter: Jimmy Seng
  11. Storage Management in the NVRAM Era. Steven Pelley, Thomas F. Wenisch, Brian T. Gold, and Bill Bridge [read it]
    Presenter: Stephen Kofi Osei-Akoto
  12. Durable Write Cache in Flash Memory SSD for Relational and NoSQL Database. Woon-Hak Kang, Sang-Won Lee, Bongki Moon, Yang-Suk Kee and Moonwook Oh [read it]
    Presenter: Arif Aziz
  13. NoSE: Schema Design for NoSQL Applications. Michael Mior, Kenneth Salem, Ashraf Aboulnaga and Rui Liu [read it]
    Presenter: Apurupa Amperayani
  14. Fuzzy Joins in MapReduce: An Experimental Study. Ben Kimmett (University of Victoria), Venkatesh Srinivasan (University of Victoria), Alex Thomo (University of Victoria) [read it]
    Presenter: Alireza Jafarian
  15. BF-Tree: Approximate Tree Indexing. Manos Athanassoulis (EPFL), Anastasia Ailamaki (EPFL) [read it]
    Presenter: Qizhong Mao
  16. Understanding the Causes of Consistency Anomalies in Apache Cassandr. Hua Fan (University of Waterloo), Aditya Ramaraju (University of Waterloo), Marlon McKenzie (University of Waterloo), Wojciech Golab (University of Waterloo), Bernard Wong (University of Waterloo). [read it]
    Presenter: TBD
  17. Gobblin: Unifying Data Ingestion for Hadoop. Lin Qiao (LinkedIn), Yinan Li (LinkedIn), Sahil Takiar (LinkedIn), Ziyang Liu (LinkedIn), Narasimha Veeramreddy (LinkedIn), Min Tu (LinkedIn), Ying Dai (LinkedIn), Issac Buenrostro (LinkedIn), Kapil Surlaker (LinkedIn), Shirshanka Das (LinkedIn), Chavdar Botev (LinkedIn). [read it]
    Presenter: Yifan Wu
  18. Scaling Spark in the Real World: Performance and Usability. Michael Armbrust, Tathagata Das, Aaron Davidson, Ali Ghodsi, Andrew Or, Josh Rosen, Ion Stoica, Patrick Wendell, Reynold Xin, Matei Zaharia. [read it]
    Presenter: Maxim V. Shapovalov
  19. Constructing an Interactive Natural Language Interface for Relational Databases. Fei Li (University of Michigan), H V Jagadish (University of Michigan). [read it]
    Presenter: Yagni Urvishbhai Patel
  20. Possible and Certain SQL Keys. Henning Köhler (Massey University), Sebastian Link (The University of Auckland), Xiaofang Zhou (The University of Queensland). [read it]
    Presenter: Abdulrahman Mohammed Aldkheel
  21. Schema-free SQL. Fei Li, Tianyin Pan, H. V. Jagadish. [read it]
    Presenter: Mitra Neshatfar
  22. Schema Management for Document Stores. Lanjun Wang, Shuo Zhang, Juwei Shi, Limei Jiao, Oktie Hassanzadeh, Jia Zou, and Chen Wangz. [read it]
    Presenter: TBD
  23. Coordination Avoidance in Database Systems.Peter Bailis, Alan Fekete, Michael J. Franklin, Ali Ghodsi, Joseph M. Hellerstein, Ion Stoica. [read it]
    Presenter: TBD
  24. Memory-Efficient Hash Joins. Gopi Attaluri (IBM Software Group), Ronald Barber (IBM Research-Almaden), Naresh Chainani (IBM Software Group), Sam Lightstone (IBM Software Group), Guy Lohman (IBM Research-Almaden), Ippokratis Pandis (IBM Research-Almaden), Vijayshankar Raman (IBM Research-Almaden), Dave Sharpe (IBM Software Group), Richard Sidle (IBM Research-Almaden). [read it]
    Presenter: Benjamin North
  25. Deployment of Query Plans on Multicoresn. Jana Giceva (ETH Zurich), Gustavo Alonso (ETH Zurich), Timothy Roscoe (ETH Zurich), Tim Harris (Oracle labs). [read it]
    Presenter: Chao Han
  26. From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System. Shumo Chu, Magdalena Balazinska, Dan Suciu . [read it]
    Presenter: Baiyi Tao