Publications

Data Cleaning

  1. Judice Koh, Mong Li Lee, Wynne Hsu, Wee Tiong Ang. Correlation-based Attribute Outlier Detection in XML (Poster), in 24th International Conference on Data Engineering, Cancun, Mexico, April 2008.

  2. Qiangfeng Peter Lau, Wynne Hsu, Judice L.Y. Koh, and Mong Li Lee. DeepDetect: An Extensible System for Detecting Attribute Outliers & Duplicates in XML (Invited), in DASFAA2008 Workshop on Data Quality in Collaborative Information Systems, New Delhi, India, March 2008.

  3. Judice Koh, Mong Li Lee, Wynne Hsu, Kai Tak Lam. Correlation-based Detection of Attribute Outliers, in 12th International Conference on Database Systems for Advanced Applications, Bangkok, Thailand, April 2007.

  4. Judice L.Y. Koh, Mong Li Lee, Vladimir Brusic. A Classification of Biological Data Artifacts, in ICDT Workshop on Database Issues in Biological Databases (DBiBD), Edinburgh, Scotland, UK, January 2005.

  5. Judice L.Y. Koh, Mong Li Lee, Asif M. Khan, Paul T.J. Tan, Vladimir Brusic. Duplicate Detection in Biological Data using Association Rule Mining, in ECML/PKDD Workshop on Data Mining and Text Mining for Bioinformatics, Pisa, Italy, September 2004.

  6. Judice L.Y. Koh, S.P.T. Krishnan, Seah Seng Hong, Paul T.J. Tan, Asif M. Khan, Mong Li Lee, Vladimir Brusic. BioWare: A framework for bioinformatics data retrieval, annotation and publishing, in ACM SIGIR Workshop on Search and Discovery in Bioinformatics (SIGIRBIO), Sheffield, UK, July 2004.

  7. Ren Lu, Mong Li Lee, Wynne Hsu. Using Interval Association Rules to Identify Dubious Values, in 5th International Conference on Web-Age Information Management (WAIM), Dalian, China, July 2004.

  8. Mong Li Lee, Wynne Hsu, Vijay Kothari. Cleaning the Spurious Links in Data, in IEEE Intelligent Systems: Special issue on Data and Information Cleaning and Preprocessing, Volume 19, No. 2, March/April 2004.

  9. Wai Lup Low, Wee Hyong Tok, Mong Li Lee, Tok Wang Ling. Data Cleaning and XML: The DBLP Experience, in IEEE 18th International Conference on Data Engineering (ICDE), San Jose, California, 2002. (full paper in technical report TRA1/03)

  10. Wai Lup Low, Mong Li Lee, Tok Wang Ling. A Knowledge-Based Framework for Duplicates Elimination, in Information Systems: Special Issue on Data Extraction, Cleaning and Reconciliation, Volume 26, Issue 8, Elsevier Science, 2001.

  11. Mong Li Lee, Tok Wang Ling and Wai Lup Low. IntelliClean: A Knowledge-Based Intelligent Data Cleaner (Poster), in Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Boston, MA, 2000.

  12. Mong Li Lee, Hongjun Lu, Tok Wang Ling and Yee Teng Ko. Cleansing Data for Mining and Warehousing, in Proceedings of the 10th International Conference on Database and Expert Systems Applications (DEXA99), Florence, Italy, August 1999.