General Info

I am currently a Research Fellow in the School of Computing, National University of Singapore. I work in the Database System Research Group led by Professor Beng Chin Ooi. I obtained my BSc in Computer Science from Harbin Institute of Technology in 2011, and received my Ph.D. in Computer Science from National University of Singapore in 2016.

Research

As a member of the Database Research Group, my research interests are mainly in distributed data management and processing, including database storage, data processing engines, machine/deep learning platforms, and blockchain techniques.

Project

ForkBase: An Efficient Storage Engine for Blockchain and Forkable Applications

ForkBase is our attempt to build a storage system that supports high-level properties demanded by many modern applications, such as blockchains and collaborative analytics. In particular, ForkBase provides immutability, collaboration and security. The system enables rapid development of many classes of scalable, distributed applications, thanks to its versatile programming interface, rich semantics and high performance.

SINGA: A Distributed Deep Learning Platform

SINGA is an open-source Apache Incubator project that provides a distributed training platform for deep learning models. It is designed around three principles: usability, scalability and extensibility. A variety of popular deep learning models are supported. The SINGA architecture is sufficiently flexible to run synchronous, asynchronous and hybrid training frameworks. I have been one of the main developers since the project started in 2014.

LogBase: Scalable Log-structured Database

The LogBase project aims to develop a scalable log-structured database that supports very high write throughput in addition to other functionalities including dynamic scalability, multi-version data access, transactional semantics for bundled read and write operations, and fast recovery from machine failures. I joined the LogBase project in August 2011.

epiC: Elastic Power-aware data Intensive Cloud

The epiC project aims to build an elastic, power-aware, data-intensive cloud computing platform for large-scale services, supporting high-throughput, low-latency transactions and high-performance, reliable query processing. It aims to bridge the performance gap between data-intensive analytical jobs and online transactions. I joined the epiC project in November 2011.

Publication

  • Sheng Wang, Tien Tuan Anh Dinh, Qian Lin, Zhongle Xie, Meihui Zhang, Qingchao Cai, Gang Chen, Wanzeng Fu, Beng Chin Ooi, Pingcheng Ruan. ForkBase: An Efficient Storage Engine for Blockchain and Forkable Applications. Int'l Conference on Very Large Data Bases (VLDB), 2018. (to appear)
  • Tien Tuan Anh Dinh, Ji Wang, Sheng Wang, Gang Chen, Wei-Ngan Chin, Qian Lin, Beng Chin Ooi, Pingcheng Ruan, Kian-Lee Tan, Zhongle Xie, Hao Zhang, Meihui Zhang. UStore: A Distributed Storage with Rich Semantics. arXiv:1702.02799, February 2017.
  • Sheng Wang, David Maier, Beng Chin Ooi. Fast and Adaptive Indexing of Multi-Dimensional Observational Data. Int'l Conference on Very Large Data Bases (VLDB), 2016.
  • Wei Wang, Gang Chen, Haibo Chen, Tien Tuan Anh Dinh, Jinyang Gao, Beng Chin Ooi, Kian-Lee Tan, Sheng Wang, Meihui Zhang. Deep learning at scale and at ease. Transactions on Multimedia Computing Communications and Applications, 2016.
  • Beng Chin Ooi, Kian-Lee Tan, Sheng Wang, Wei Wang, Qingchao Cai, Gang Chen, Jinyang Gao, Zhaojing Luo, Anthony K.H. Tung, Yuan Wang, Zhongle Xie, Meihui Zhang, Kaiping Zheng. SINGA: A Distributed Deep Learning Platform. ACM Multimedia, 2015.
  • Wei Wang, Gang Chen, Tien Tuan Anh Dinh, Jinyang Gao, Beng Chin Ooi, Kian-Lee Tan, Sheng Wang. SINGA: Putting Deep Learning in the Hands of Multimedia Users. ACM Multimedia, 2015.
  • Jinyang Gao, H.V. Jagadish, Beng Chin Ooi, Sheng Wang. Selective Hashing: Closing the Gap between Radius Search and k-NN Search. ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2015.
  • Sheng Wang, David Maier, Beng Chin Ooi. Lightweight Indexing of Observational Data in Log Structured Storage. Int'l Conference on Very Large Data Bases (VLDB), 2014. [code]
  • Sai Wu, Xiaoli Wang, Sheng Wang, Zhenjie Zhang, Anthony K.H. Tung. K-Anonymity for Crowdsourcing Database. IEEE Transactions on Knowledge and Data Engineering, 15 May 2013 (preprint).
  • Hoang Tam Vo, Sheng Wang, Divyakant Agrawal, Gang Chen, Beng Chin Ooi. LogBase: A Scalable Log-Structured Database System in the Cloud. Int'l Conference on Very Large Data Bases (VLDB), 2012.