Welcome!

Lin Yuting 林煜庭


Student in
Department of Computer Science
School of Computing
National University of Singapore
Computing 1, Computing Drive, Singapore 117590

lin36 AT comp.nus.edu.sg

General Info

I joined NUS in July 2008. I am now a master's student in the Department of Computer Science, supervised by Prof. Ooi Beng Chin. I'm honored to be a member of the DB System Research Group.

I obtained my BSc from Sichuan University in 2008.

Research

I am interested in solving problems at large data scale. My main work covers dataset scaling, storage systems, and query processing in the cloud environment.

Projects

UpSizeR: Synthetically scaling up a given database state
Engineering for rapid growth in a highly competitive setting requires intensive testing with scaled-up datasets. To address this dataset scaling problem, UpSizeR was conceived not only for scalability testing (scale factor s>1): it can also be used to make an anonymized copy (s=1) of a proprietary dataset, or to scale down a production dataset (s<1) for non-production testing.

Llama: A columnar data warehouse for complex analytical query on MapReduce
MapReduce is highly desirable due to its scalability and fault tolerance. However, if a task requires joining multiple datasets, the join is performed as a sequential composition of MapReduce phases. Llama is designed to process queries with multiple joins. By leveraging columnar storage, it not only reduces I/O overhead but also cuts the number of phases, and thus significantly improves performance.

epiC: elastic power-aware data intensive Cloud
The goal is to design and implement an efficient multi-tenant cloud system that supports high-throughput, low-latency transactions and high-performance, reliable query processing. It aims to bridge the performance gap between data-intensive analytical jobs and online transactions.

Publications

  • Yuting Lin, Divyakant Agrawal, Chun Chen, Beng Chin Ooi and Sai Wu: Llama: Leveraging Columnar Storage for Scalable Join Processing in the MapReduce Framework. ACM Int'l. Conference on Management of Data (SIGMOD), 2011.

  • Yu Cao, Chun Chen, Fei Guo, Dawei Jiang, Yuting Lin, Beng Chin Ooi, Hoang Tam Vo, Sai Wu and Quanqing Xu: A Cloud Data Storage System for Supporting Both OLTP and OLAP. 27th IEEE International Conference on Data Engineering (ICDE), 2011.

Links

    Database research conferences: SIGMOD, VLDB, ICDE
    Open source: Hadoop, HBase, Hive
    Other resources: DBMS2, Facebook Engineering's Notes