MLCask

A Git-like end-to-end ML life-cycle management system

View our publication in the 2021 IEEE International Conference on Data Engineering (ICDE):

MLCask: Efficient Management of Component Evolution in Collaborative Data Analytics Pipelines

Research Highlights

MLCask is a Git-like end-to-end ML life-cycle management system. In real-world machine learning (ML) applications, maintaining an ML pipeline in a collaborative environment is significant and challenging. The costs of frequent retraining and asynchronous component update by different users need to be taken into consideration. MLCask supports both linear and non-linear version control semantics for efficient management of ML pipelines.

People

Professors

Avatar

Beng Chin Ooi

Lee Kong Chian Centennial Professor, National University of Singapore

Avatar

Gang Chen

Professor, Dean of computer college, Zhejiang University

Avatar

Meihui Zhang

Professor, School of Computer Science & Technology, Beijing Institute of Technology

Researchers

Avatar

Zhaojing Luo

Research Fellow in the Database Systems Research Group at NUS

Avatar

Yuncheng Wu

Research Fellow in the Database Systems Research Group at NUS

Avatar

Kaiping Zheng

Research Fellow in the Database Systems Research Group at NUS

Avatar

Kaiyuan Yang

Research Assistant in the Database Systems Research Group at NUS

Avatar

Zhu Lei

PhD student, NUS School of Computing

Avatar

Qingpeng Cai

PhD student, NUS School of Computing

Avatar

Naili Xing

PhD student, NUS School of Computing

Contact