Publications

  • epiC: an Extensible and Scalable System for Processing Big Data
  • D. Jiang, G. Chen, B. C. Ooi, K. L. Tan, S. Wu. Int'l Conference on Very Large Data Bases (VLDB), 2014.

  • Data Sensitive Hashing for High-dimensional KNN Search
  • J. Gao, H. V. Jagadish, W. Lu, B. C. Ooi. ACM SIGMOD International Conference on Management of Data (SIGMOD), 2014.

  • R-Store: A Scalable Distributed System for Supporting Real-time Analytics
  • F. Li, T. Ozsu, G. Chen, B. C. Ooi. 30th IEEE International Conference on Data Engineering (ICDE), 2014.

  • A Hybrid Machine-Crowdsourcing System for Matching Web Tables
  • J. Fan, M. Lu, B. C. Ooi, W. C. Tan. International Conference on Data Engineering (ICDE), 2014.

  • Distributed Data Management Using MapReduce
  • F. Li, B. C. Ooi, T. Ozsu, S. Wu. ACM Computing Surveys 46(3), 2014.

  • Efficiently Supporting Edit Distance based String Similarity Search Using B+-trees
  • W. Lu, X. Du, M. Hadjieleftheriou, B. C. Ooi. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2014 (to appear).

  • BestPeer++: A Peer-to-Peer based Large-scale Data Processing Platform
  • G. Chen, T. Hu, D. Jiang, P. Lu, K. L. Tan, H. T. Vo, S. Wu. IEEE Transactions on Knowledge and Data Engineering (TKDE) 26(6), 2014.

  • Efficiently Extracting Frequent Subgraphs Using MapReduce
  • W. Lu, G. Chen, A. K. Tung, F. Zhao. IEEE International Conference on Big Data, 2013.

  • BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform
  • G. Chen, T. Hu, D. Jiang, P. Lu, K. L. Tan, H. T. Vo, S. Wu. 28th International Conference on Data Engineering (ICDE), 2012.

  • Efficient Processing of K Nearest Neighbor Joins using MapReduce
  • W. Lu, Y. Shen, S. Chen, B. C. Ooi. Int'l Conference on Very Large Data Bases (VLDB), PVLDB 5(10):1016-1027, 2012.

  • Efficient and Scalable Processing of String Similarity Join
  • C. Rong, W. Lu, X. Wang, X. Du, Y. Chen, A. K. H. Tung. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2012.

  • E3: an Elastic Execution Engine for Scalable Data Processing
  • G. Chen, K. Chen, D. Jiang, B. C. Ooi, L. Shi, H. T. Vo, S. Wu. Journal of Information Processing, Vol.20, No.1, 2012.

  • Query Optimization for Massively Parallel Data Processing
  • S. Wu, F. Li, S. Mehrotra, B. C. Ooi. ACM Symposium on Cloud Computing (SoCC), 2011.

  • A Framework for Supporting DBMS-like Indexes in the Cloud
  • G. Chen, H. T. Vo, S. Wu, B. C. Ooi, M. T. Ozsu. Int'l Conference on Very Large Data Bases (VLDB), 2011.

  • Llama: Leveraging Columnar Storage for Scalable Join Processing in the MapReduce Framework
  • Y. Lin, D. Agrawal, C. Chen, B. C. Ooi, S. Wu. ACM Int'l. Conference on Management of Data (SIGMOD), 2011.

  • ES2: A Cloud Data Storage System for Supporting Both OLTP and OLAP
  • Y. Cao, C. Chen, F. Guo, D. Jiang, Y. Lin, B. C. Ooi, H. T. Vo, S. Wu, Q. Xu. International Conference on Data Engineering (ICDE), 2011.

  • Providing Scalable Database Services on the Cloud
  • C. Chen, G. Chen, D. Jiang, B. C. Ooi, H. T. Vo, S. Wu, Q. Xu. WISE 2010 (Keynote).

  • MAP-JOIN-REDUCE: Towards Scalable and Efficient Data Analysis on Large Clusters
  • D. Jiang, A. K. H. Tung, G. Chen. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2010.

  • The Performance of MapReduce: An In-depth Study
  • D. Jiang, B. C. Ooi, L. Shi, S. Wu. Int'l Conference on Very Large Data Bases (VLDB), 2010.
    Source Code

  • Towards Elastic Transactional Cloud Storage with Range Query Support
  • H. T. Vo, C. Chen, B. C. Ooi. Int'l Conference on Very Large Data Bases (VLDB), 2010.

  • Efficient B+-tree Based Indexing for Cloud Data Processing
  • S. Wu, D. Jiang, B. C. Ooi, K. L. Wu. Int'l Conference on Very Large Data Bases (VLDB), 2010.

  • Indexing Multi-dimensional Data in a Cloud System
  • J. Wang, S. Wu, H. Gao, J. Li, B. C. Ooi. ACM Int'l. Conference on Management of Data (SIGMOD), 2010.

  • An Indexing Framework for Efficient Retrieval on the Cloud
  • S. Wu, K. L. Wu. IEEE Data Eng. Bull. 32(1): 75-82 (2009).