Publications are listed in reverse chronological order.
Presentation slides associated with the publications are supplied
when available. You can also review most recent foils and posters from talks, as well as older
presentations from my Ph.D. years.
Publications are also labeled by general research area:
DL
IR
NLP
Web
Others: DB EduTech HCI ML MedInfo MM
- Pu-Jen Cheng, Min-Yen Kan, Wai Lam, Preslav I. Nakov (Eds.) (2010) Information Retrieval Technology: 6th Asia Information Retrieval Societies Conference, AIRS 2010, Lecture Notes in Computer Science (LNCS), Volume 6458, Taipei, Taiwan, December 1-3, 2010. 627 pp. ISBN: 978-3-642-17186-4
[ Volume at SpringerLink ]
IR
Web
- Simone Teufel and Min-Yen Kan (2009) Proceedings of NLPIR4DL 2009: 2009 Workshop on Text and Citation Analysis for Scholarly Digital Libraries. Association for Computational Linguistics and the Asian Federation of Natural Language Processing, Singapore, August 2009.
[ ACL Anthology (W09-36)] [ Local copy ] [ Preface only ]
DL
IR
NLP
- Min-Yen Kan, Dongwon Lee and Ee-Peng Lim (2008) Scholarly digital libraries at scale: introduction to the special issue on very large digital libraries. International Journal on Digital Libraries 9(2): pp. 81-82
[ doi:10.1007/s00799-008-0042-0 ] [ Local copy (as allowed by self-archiving, courtesy Springer-Verlag) ]
DL
- Hwee Tou Ng, Mun-Kew Leong, Min-Yen Kan and Donghong Ji (Eds.) (2006) Information Retrieval Technology: Third Asia Information Retrieval Symposium, AIRS 2006, Lecture Notes in Computer Science (LNCS), Volume 4182. Singapore, October 16-18, Springer, 684 pp. IBSN: 978-3-540-45780-0
[ Volume at SpringerLink ]
IR
Web
- Aobo Wang, Cong Duy Vu Hoang and Min-Yen Kan (2013) Perspectives on Crowdsourcing Annotations for Natural Language Processing. Language Resources and Evaluation, 47(1) (2013), pages 9-31.
[ Local copy (.pdf) as allowed by Springer Verlag self-archiving policy ]
[ doi:10.1007/s10579-012-9176-1 ]
IR
NLP
- Su Nam Kim, Olena Medelyan, Min-Yen Kan and Timothy Baldwin (2012). Automatic keyphrase extraction from scientific articles. In Language Resources and Evaluation. December 2012.
[ Local copy (.pdf) as allowed by Springer self-archiving policy ] [ doi:10.1007/s10579-012-9210-3 ]
NLP
- Ziheng Lin, Hwee Tou Ng and Min-Yen Kan (2012). A PDTB-Styled End-to-End Discourse Parser. Forthcoming in Natural Language Engineering.
[ Local copy (.pdf) as allowed by Cambridge University Press self-archiving policy ]
NLP
- Tao Chen and Min-Yen Kan (2012). Creating a live, public short message service corpus: the NUS SMS corpus. Language Resources and Evaluation. August 2012.
[ doi:10.1007/s10579-012-9197-9 ] [ Local copy (.pdf) as allowed by Springer's self-archiving policy ]
NLP
- Jesse Prabawa Gozali and Min-Yen Kan (2012). Rich and Dynamic Library Catalogs: A Case Study of Online Search Interfaces. In Jesus Tramullas and Piedad Garrido (Eds.) Library Automation and OPAC 2.0: Information Access and Services in the 2.0 Landscape, Idea Group Publishing.
[ doi:10.4018/978-1-4666-1912-8.ch002 ]
[ Local copy (.pdf) ]
[ From IGI ]
DL
HCI
- Simone Teufel and Min-Yen Kan (2011) Robust Argumentative Zoning for Sensemaking in Scholarly Documents. Advanced Language Technologies for Digital Libraries (ALT4DL). Lecture Notes in Computer Science, 2011, Volume 6699/2011, pp. 154-170
[ doi:10.1007/978-3-642-23160-5_10 ]
[ Local copy (.pdf) as allowed by Springer Verlag self-archiving policy ]
[ RAZ (@ GitHub) ]
DL
NLP
- Minh-Thang Luong, Thuy Dung Nguyen and Min-Yen Kan (2010) Logical Structure Recovery in Scholarly Articles with Rich Document Features. International Journal of Digital Library Systems, 1(4). pp. 1-23.
[ doi:10.4018/jdls.2010100101 ]
[ Local copy (.pdf) ]
DL
NLP
- Min-Yen Kan and Yee Fan Tan (2008) Record Matching in Digital Library Metadata. Communications of the ACM (CACM), Technical Opinion Column, 51(2), pp 91-94, February.
[ doi:10.1145/1314215.1314231 ]
[ Local copy (.pdf) ]
IR
DB
ML
- Min-Yen Kan, Ye Wang, Denny Iskandar, Tin Lay Nwe and Arun Shenoy (2008) LyricAlly: Automatic Synchronization of Textual Lyrics to Acoustic Music Signals. IEEE Transactions on Audio, Speech, and Language Processing, 16(2), February. pp. 338-349.
[ doi:10.1109/TASL.2007.911559 ]
[ Local copy (.pdf) as allowed by IEEE author self-archiving policy ]
NLP
MM
- Shiren Ye, Tat-Seng Chua, Min-Yen Kan and Long Qiu (2007) Document concept lattice for text understanding and summarization, Information Processing and Management, 43(6), pp. 1643-1662.
[ doi:10.1016/j.ipm.2007.03.010 ]
NLP
- Hang Cui, Min-Yen Kan and Tat-Seng Chua (2007) Soft Pattern Matching Models for Definitional Question Answering, ACM Transactions on Information Systems (TOIS), 25(2), April.
[ doi:10.1145/12291799.1229182 ]
[ Local copy (.pdf) as allowed by ACM author self-archiving policy ]
IR
NLP
- Wei Lu and Min-Yen Kan (2007) Supervised Categorization of Javascript using Program Analysis Features, Information Processing and Management, 43(2), pages 431-444.
[ doi:10.1016/j.ipm.2006.07.019 ]
IR
Web
- Min-Yen Kan (2005) Using multi-document summarisation to assist in semi-structured literature retrieval: A case study in consumer healthcare In Theng, Yin Leng and Foo, Schubert (Eds.) "Design and Usability of Digital Libraries : Case Studies in the Asia Pacific", Idea Group Publishing.
DL
NLP
- Noemie Elhadad, Min-Yen Kan, Judith Klavans and Kathleen McKeown (2005) Customization in a Unified Framework for Summarizing Medical Literature, Journal of Artificial Intelligence in Medicine, 33 (2), pp. 179-198.
[ doi:10.1016/j.artmed.2004.07.018 ]
NLP
MedInfo
2013
- Jovian Lin, Kazunari Sugiyama, Min-Yen Kan and Tat-Seng Chua (2013) Addressing Cold-Start in App Recommendation: Latent User Models Constructed from Twitter Followers. To appear in the Proceedings of Special Interest Group on Information Retrieval (SIGIR '13). 28 July-1 August. Dublin, Ireland.
[ .pdf (preprint) ]
IR
Web
- Aobo Wang and Min-Yen Kan (2013) Mining Informal Language from Chinese Microtext: Joint Word Recognition and Segmentation. To appear in the Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL '13). 4-9 August. Sofia, Bulgaria.
[ .pdf (preprint) ]
NLP
- Kazunari Sugiyama and Min-Yen Kan (2013) Exploiting Potential Citation Papers in Scholarly Paper Recommendation. To appear in the Proceedings of the Joint Conference on Digital Libraries (JCDL '13). 22-26 July, Indianapolis, USA.
[ .pdf (preprint) ]
DL
- Huy Do Hoang Nhat, Muthu Kumar C., Philip S. Cho and Min-Yen Kan (2013) Extracting and Matching Authors and Affiliations in Scholarly Documents. To appear in the Proceedings of the Joint Conference on Digital Libraries (JCDL '13). 22-26 July, Indianapolis, USA.
[ .pdf (preprint) ]
DL
- Bamdad Bahrani and Min-Yen Kan (2013) Multimodal Alignment of Scholarly Documents and Their Presentations. To appear in the Proceedings of the Joint Conference on Digital Libraries (JCDL '13). 22-26 July, Indianapolis, USA. Short Paper.
[ .pdf (preprint) ]
DL
MM
- Jesse Prabawa Gozali, Min-Yen Kan and Hari Sundaram (2013) Constructing an Anonymous Dataset From the Personal Digital Photo Libraries of Mac App Store Users. To appear in the Proceedings of the Joint Conference on Digital Libraries (JCDL '13). 22-26 July, Indianapolis, USA. Short Paper.
[ .pdf (preprint) ]
DL
MM
2012
- Jun-Ping Ng and Min-Yen Kan (2012) Improved Temporal Relation Classification using Dependency Parses and Selective Crowdsourced Annotations. Forthcoming in Proceedings of the International Conference on Computational Linguistics (COLING 2012). Mumbai, India. 8-15 December.
[ .pdf (preprint) ]
[ Slides (.pdf) ]
NLP
- Jun-Ping Ng, Praveen Bysani, Ziheng Lin, Min-Yen Kan and Chew-Lim Tan (2012) Exploiting Category-Specific Information for Multi-Document Summarization. Forthcoming in Proceedings of the International Conference on Computational Linguistics (COLING 2012). Mumbai, India. 8-15 December.
[ .pdf (preprint) ]
[ Slides (.pdf) ]
NLP
- Jin Zhao, Praveen Bysani and Min-Yen Kan (2012) Exploiting Classification Correlations for the Extraction of Evidence-based Practice Information. In Proceedings of the AMIA 2012 Annual Symposium. November 3-7, Chicago, USA.
[ .pdf (preprint) ]
IR
MedInfo
- Anqi Cui, Liner Yang, Dejun Hou, Min-Yen Kan, Yiqun Liu, Min Zhang and Shaoping Ma (2012) PrEV: Preservation Explorer and Vault for Web 2.0 User-Generated Content. In Proceedings of the Theory and Practice of Digital Libraries (TPDL 2012), Paphos, Cyprus. pp. 101-112. Lecture Notes in Computer Science, Volume 7489/2012.
[ .pdf (preprint) ]
[ doi:10.1007/978-3-642-33290-6_12 ]
[ Slides (.pdf) ]
DL
Web
- Praveen Bysani and Min-Yen Kan (2012) Integrating User-Generated Content in the ACL Anthology. In Proceedings of the ACL Special Workshop 2012 on Rediscovering 50 years of Discoveries. pp. 83-87.
[ .pdf (preprint) ]
[ ACL Anthology (W12-3209) ]
[ Slides (.pdf) ]
NLP
- Aobo Wang, Tao Chen and Min-Yen Kan (2012) Re-tweeting from a Linguistic Perspective. In Proceedings of the NAACL-HLT 2012 Workshop on Language in Social Media. Montréal, Canada. pp. 46-55.
[ .pdf (preprint) ]
[ ACL Anthology (W12-2106) ]
[ Slides (.pdf) ]
[ Demo and Corpus ]
NLP
- Ziheng Lin, Chang Liu, Hwee Tou Ng and Min-Yen Kan (2012) Combining Coherence Models and Machine Translation Evaluation Metrics for Summarization Evaluation. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL 2012). Jeju, Korea. pp. 1006-1014.
[ .pdf (preprint) ]
[ ACL Anthology (P12-1106) ]
[ Poster (.pdf) ]
[ Software (.zip) ]
NLP
- Jesse Prabawa Gozali, Min-Yen Kan and Hari Sundaram (2012) How Do People Organize Their Photos in Each Event and How Does It Affect Storytelling, Searching and Interpretation Tasks? In Proceedings of the 12th ACM/IEEE Joint Conference on Digital Libraries (JCDL '12). Washington, DC, USA. pp. 315-324.
[ .pdf (preprint) ]
[ doi:10.1145/2232817.2232875 ]
[ Slides (.pdf) ]
DL
HCI
MM
- Jesse Prabawa Gozali, Min-Yen Kan and Hari Sundaram (2012) Hidden Markov Model for Event Photo Stream Segmentation. In Proceedings of the IEEE ICME 2012 Workshop on Human-Focused Communications in the 3D Continuum (HFC3D). Melbourne, Australia. pp. 25-30.
[ .pdf (preprint) ]
[ Slides (.pdf) ]
DL
MM
- Jonathan Y. H. Poon, Kazunari Sugiyama, Yee Fan Tan and Min-Yen Kan (2012) Instructor-Centric Source Code Plagiarism Detection and Plagiarism Corpus. In Proceedings of the 17th Annual ACM SIGCSE Conference on Innovation and Technology in Computer Science Education (ITiCSE 2012). Haifa, Israel. pp. 122-127.
[ .pdf (preprint) ]
[ Slides (.pdf) ]
[ SSID system (@ GitHub) ]
[ SSID homepage ]
DL
IR
NLP
EduTech
HCI
2011
- Ziheng Lin, Hwee Tou Ng and Min-Yen Kan (2011) Automatically Evaluating Text Coherence Using Discourse Relations. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-HLT 2011). Portland, Oregon, USA. pp. 997-1006.
[ .pdf (preprint) ]
[ ACL Anthology (P11-1100) ]
[ Slides (.pdf) ]
NLP
- Kazunari Sugiyama and Min-Yen Kan (2011) Serendipitous Recommendation for Scholarly Papers Considering Relations Among Researchers. In Proceedings of the 11th ACM/IEEE Joint Conference on Digital Libraries (JCDL 2011) Short Papers. Ottawa, Canada. pp. 307-310.
[ .pdf ] [ Slides (.pdf) ]
DL
IR
- Duy Khang Ly, Kazunari Sugiyama, Ziheng Lin and Min-Yen Kan (2011). Product Review Summarization from a Deeper Perspective. In Proceedings of the 11th ACM/IEEE Joint Conference on Digital Libraries (JCDL 2011) Short Papers. Ottawa, Canada. pp. 311-314.
[ .pdf ]
[ Slides (.pdf) ]
IR
NLP
2010
- Minh-Thang Luong, Preslav I. Nakov and Min-Yen Kan (2010) A Hybrid Morpheme-Word Representation for Machine Translation of Morphologically Rich Languages. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2010), Boston, Massachusetts, USA. pp. 148-157.
[ .pdf ]
[ ACL Anthology (D10-1015) ]
[ Slides (.pdf) ]
NLP
- Jin Zhao, Min-Yen Kan, Paula M. Procter, Siti Zubaidah, Wai Kin Yip and Goh Mien Li (2010) eEvidence: Information Seeking Support for Evidence-based Practice: An Implementation Case Study. In the Proceedings of the AMIA 2010 Annual Symposium. Washington, DC, USA. pp. 937-941.
[ .pdf ]
[ Slides (.pdf) ]
DL
IR
HCI
MedInfo
- Jin Zhao, Min-Yen Kan, Paula M. Procter, Siti Zubaidah, Wai Kin Yip and Goh Mien Li (2010) Improving Search for Evidence-based Practice using Information Extraction. In the Proceedings of the AMIA 2010 Annual Symposium. Washington, DC, USA. pp. 932-936.
[ .pdf ]
[ Slides (.pdf) ]
DL
IR
NLP
MedInfo
- Minh-Thang Luong and Min-Yen Kan (2010) Enhancing Morphological Alignment for Translating Highly Inflected Languages. In Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010), Beijing, China. pp. 743-751.
[ .pdf ]
[ ACL Anthology (C10-1084) ]
[ Slides (.htm) ]
NLP
- Cong Duy Vu Hoang and Min-Yen Kan (2010) Towards Automated Related Work Summarization. In Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010), Beijing, China. pp. 427-435.
[ .pdf ]
[ ACL Anthology (C10-2049) ]
[ Poster (.png) ]
NLP
- Su Nam Kim, Timothy Baldwin and Min-Yen Kan (2010) Evaluating N-gram based Evaluation Metrics for Automatic Keyphrase Extraction. In Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010), Beijing, China. pp 572-580.
[ .pdf ]
[ ACL Anthology (C10-1065) ]
[ Slides (.pdf) ]
NLP
- Su Nam Kim, Alyona Medelyan, Timothy Baldwin and Min-Yen Kan (2010) SemEval-2010 Task 5: Automatic Keyphrase Extraction from Scientific Articles. In the Proceedings of SemEval2. Uppsala, Sweden. pp. 21-26.
[ .pdf ]
[ ACL Anthology (S10-1004) ]
[ Slides (.pdf) ]
NLP
- Yee Fan Tan and Min-Yen Kan (2010) Hierarchical Cost-sensitive Web Resource Acquisition for Record Matching. In Proceedings of the 2010 IEEE / WIC / ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT 2010). Toronto, Canada. August-September. pp. 382-389.
[ .pdf ]
[ doi:10.1109/WI-IAT.2010.14 ]
[ Slides (.pdf) ]
DL
IR
DL
- Justin Sein Lin, Jun Ping Ng, Shreyasee Pradhan, Jatin Shah, Ricardo Pietrobon and Min-Yen Kan (2010) Extracting Formulaic and Free Text Clinical Research Articles Metadata using Conditional Random Fields. In Proceedings of the Second Conference on Text and Data Mining of Clinical Documents (Louhi 2010). Los Angeles, USA. June. pp 90-95.
[ Pre-print (.pdf) ]
[ ACL Anthology (W10-1114) ]
[ Slides (.htm) ]
DL
NLP
MedInfo
- Jin Zhao and Min-Yen Kan (2010) Domain-Specific Iterative Readability Computation. Proceedings of the Joint Conference on Digital Libraries (JCDL '10), Brisbane, Australia, June. pp. 205-214.
[ Pre-print (.pdf) ] [
[ doi:10.1145/1816123.1816155 ]
[ Slides (.htm) ]
DL
IR
- Kazunari Sugiyama and Min-Yen Kan (2010) Scholarly Paper Recommendation via User's Recent Research Interests. Proceedings of the Joint Conference on Digital Libraries (JCDL '10), Brisbane, Australia, June. pp. 29-38.
[ Pre-print (.pdf) ]
[ doi:10.1145/1816123.1816129 ]
[ Slides (.pdf) ]
DL
IR
- Markus Hänse, Min-Yen Kan and Achim P. Karduck (2010) Kairos: Proactive Harvesting of Research Paper Metadata from Scientific Conference Web Sites. Proceedings of the International Conference on Asia-Pacific Digital Libraries (ICADL '10), Brisbane, Australia, June. pp. 226-235.
[ Pre-print (.pdf) ]
[ doi:10.1007/978-3-642-13654-2_28 ]
[ Slides (.htm) ]
DL
IR
- Min-Yen Kan, Tarun Kumar and Himanshu Gahlot (2010) Prastava: An Open-Source Ruby-Based Generic Recommendation System. Joint JCDL/ICADL Demo Session (JCDL/ICADL '10), Brisbane, Australia, June. Demo Paper.
[ Abstract (.pdf) ]
DL
IR
- Thuy Dung Nguyen, Min-Yen Kan, Dinh-Trung Dang, Markus Hänse, Ching Hoi Andy Hong, Minh-Thang Luong, Jesse Prabawa Gozali, Kazunari Sugiyama and Yee Fan Tan (2010) ForeCite: towards a reader-centric scholarly digital library. In Proceedings of the Joint Conference on Digital Libraries (JCDL '10), Brisbane, Australia, June. Poster Paper. pp. 387-388.
[ doi:10.1145/1816123.1816193 ]
[ Local copy (.pdf) ]
[ Poster (.png) ]
DL
- Kazunari Sugiyama, Tarun Kumar, Min-Yen Kan and Ramesh C. Tripathi (2010). Identifying Citing Sentences in Research Papers Using Supervised Learning. In Proceedings of the 2010 International Conference on Information Retrieval and Knowledge Management (CAMP '10), Shah Alam, Malaysia, March, pp. 67-72.
[ .pdf ] [ Slides (.pdf) ]
DL
NLP
2009
- Su Nam Kim, Timothy Baldwin and Min-Yen Kan (2009). The Use of Topic Representative Words in Text Categorization. In Proceedings of the Australasian Document Computing Symposium (ADCS:B).
[ .pdf ] [ Slides (.pdf) ]
IR
NLP
- Su Nam Kim, Timothy Baldwin and Min-Yen Kan (2009). An Unsupervised Approach to Domain-Specific Term Extraction. In Proceedings of the Australasian Language Technology Association Workshop (ALTW:B), pp. 94-98.
[ .pdf ] [ Slides (.pdf) ]
DL
NLP
- Ching Hoi Andy Hong, Jesse Prabawa Gozali and Min-Yen Kan (2009). FireCite: Lightweight real-time reference string extraction from webpages. In Proceedings of ACL-IJCNLP 2009 Workshop on text and citation analysis for scholarly digital libraries (NLPIR4DL), Singapore, August 2009.
[ .pdf ]
[ ACL Anthology (W09-3609) ]
[ Slides (.pdf) ]
DL
IR
NLP
- Ziheng Lin, Min-Yen Kan and Hwee Tou Ng (2009). Recognizing Implicit Discourse Relations in the Penn Discourse Treebank. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing (EMNLP 2009), Singapore, August 2009.
[ pre-print .pdf ]
[ ACL Anthology (D09-1036) ]
[ Slides (.htm) ]
NLP
- Hung Huu Hoang, Su Nam Kim and Min-Yen Kan (2009). A re-examination of lexical association measures. In Proceedings of ACL-IJCNLP 2009 Workshop on Multiword Expressions: Identification, Interpretation, Disambiguation and Applications, Singapore, August 2009.
[ .pdf ]
[ ACL Anthology (W09-2905) ]
[ Slides (.htm) ]
IR
- Hendra Setiawan, Min-Yen Kan, Haizhou Li and Philip Resnik (2009). Topological Ordering of Function Words in Hierarchical Phrase-based Translation. In Proceedings of ACL-IJCNLP 2009, Singapore, August 2009.
[ .pdf ]
[ ACL Anthology (P09-1037) ]
[ Slides (.pdf) ]
[ Slides (.htm) ]
NLP
- Su Nam Kim and Min-Yen Kan (2009). Re-examining Automatic Keyphrase Extraction Approaches in Scientific Articles. In Proceedings of ACL-IJCNLP 2009 Workshop on Multiword Expressions: Identification, Interpretation, Disambiguation and Applications, Singapore, August 2009.
[ .pdf ]
[ ACL Anthology (W09-2902) ]
[ Slides (.pdf) ]
DL
NLP
2008
- Dinh-Trung Dang, Yee Fan Tan and Min-Yen Kan (2008) Towards a Webpage-based Bibliographic Manager. In Proceedings of the 11th International Conference on Asian Digital Libraries (ICADL), pp. 313-316, Bali, Indonesia. December. Short paper.
[ pre-print .pdf ] [ doi:10.1007/978-3-540-89533-6_33 ] [ Slides (.htm) ] [ Video (.m4v) ]
DL
Web
- Yin-Leng Theng, Natalie Pang, Min-Yen Kan, Chunyan Miao and Ai Chee Tang (2008). Investigating Students Perceptions of the NTUs edveNTUre: Implications for Design Patterns in E-learning Systems. In Proceedings of World Conference on Educational Multimedia, Hypermedia and Telecommunications 2008 (pp. 863-871). Chesapeake, VA: AACE.
[ From EdItLib ] [ Local copy ]
HCI
EduTech
- Yin-Leng Theng, Hong-Ren Wong, Ai Chee Tang, Chunyan Miao and Min-Yen Kan (2008). Claims Analysis Meets Structuration Theory: Analysing Qualitative Students Interactions with NTUs edveNTUre. In Proceedings of World Conference on Educational Multimedia, Hypermedia and Telecommunications 2008 (pp. 1494-1503). Chesapeake, VA: AACE.
[ From EdItLib ] [ Local copy ]
HCI
EduTech
- Yee Fan Tan, Ergin Elmacioglu, Min-Yen Kan and Dongwon Lee (2008). Efficient Web-Based Linkage of Short to Long Forms. International Workshop on the Web and Databases (WebDB), Vancouver, Canada, June 2008.
[ .pdf pre-print ] [ Slides (.htm) ]
IR
Web
- Jin Zhao, Min-Yen Kan and Yin Leng Theng (2008) Math Information Retrieval: User Requirements and Prototype Implementation. In Proceedings of the Joint Conference on Digital Libraries (JCDL '08). Pittsburgh, Pennsylvania, June, pages 187-196.
[ .pdf pre-print ]
[ Slides (.htm) ]
DL
IR
HCI
- Guo Min Liew and Min-Yen Kan (2008) Slide Image Retrieval: A Preliminary Study. In Proceedings of the Joint Conference on Digital Libraries (JCDL '08). Pittsburgh, Pennsylvania, June, pages 359-362. Short paper.
[ .pdf pre-print ]
[ Slides (.htm) ]
DL
IR
NLP
- Steven Bird, Robert Dale, Bonnie Dorr, Bryan Gibson, Mark Joseph, Min-Yen Kan, Dongwon Lee, Brett Powley, Dragomir Radev and Yee Fan Tan (2008) The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics. In Language Resources and Evaluation Conference (LREC 08). Marrakesh, Morocco, May.
[ .pdf pre-print ]
[ Slides (.htm) ]
[ To the ACL ARC bibliographic reference corpus ]
NLP
- Isaac G. Councill, C. Lee Giles and Min-Yen Kan (2008) ParsCit: An open-source CRF reference string parsing package. In Language Resources and Evaluation Conference (LREC 08). Marrakesh, Morocco, May.
[ .pdf pre-print ]
[ Poster (.png) ]
[ To ParsCit website and downloads ]
DL
NLP
- Long Qiu, Min-Yen Kan and Tat-Seng Chua (2008) Modeling Context in Scenario Template Creation, In Proceedings of the Third International Joint Conference on Natural Language Processing (IJCNLP '08), Hyderabad, India.
[ .pdf ]
[ Slides (.pdf) ]
NLP
2007 and earlier
- Thuy Dung Nguyen and Min-Yen Kan (2007) Keyphrase Extraction in Scientific Publications. In Proc. of International Conference on Asian Digital Libraries (ICADL '07). Hanoi, Vietnam, December. pp. 317-326.
[ .pdf ]
[ Slides (.htm) ]
[ Download the corpus ]
DL
NLP
- Ergin Elmacioglu, Min-Yen Kan, Dongwon Lee and Yi Zhang (2007) Web Based Linkage. In Proc. of Workshop on Web Information and Data Management (WIDM '07). Lisboa, Portugal, September, pp. 121-128.
[ .pdf ]
[ Slides (.htm) ]
IR
Web
DB
- Ergin Elmacioglu, Yee Fan Tan, Su Yan, Min-Yen Kan and Dongwon Lee (2007) PSNUS: Web People Name Disambiguation by Simple Clustering with Rich Features. In Proceedings of SemEval 2007 Workshop, Association of Computational Linguistics (ACL), Prague, Czech Republic, June. Placed third out of 16 teams in the WEPS competition.
[ .pdf ]
[ ACL Anthology (W07-2058) ]
[ Slides (.htm) ]
IR
NLP
Web
- Hendra Setiawan, Min-Yen Kan and Haizhou Li (2007) Ordering Phrases with Function Words, In Proceedings of the Association of Computational Linguistics, (ACL 07). Prague, Czech Republic, June.
[ .pdf ]
[ ACL Anthology (W07-2058) ]
[ Slides (.htm) ]
NLP
- Min-Yen Kan (2007) SlideSeer: A Digital Library of Aligned Document and Presentation Pairs, In Proceedings of the Joint Conference on Digital Libraries (JCDL '07). Vancouver, Canada, June, pp. 81-90.
[ .pdf ]
[ Slides (.htm) ]
DL
IR
NLP
- Su Yan, Dongwon Lee, Min-Yen Kan and C. Lee Giles (2007) Adaptive Sorted Neighborhood Methods for Efficient Record Linkage, In Proceedings of the Joint Conference on Digital Libraries (JCDL '07). Vancouver, Canada, June, pp. 185-194.
[ .pdf ]
[ Slides (.pdf,6-up) ]
DB
- Jesse Prabawa Gozali and Min-Yen Kan (2007) A Rich OPAC User Interface with AJAX, In Proceedings of the Joint Conference on Digital Libraries (JCDL '07). Vancouver, Canada, June, pp. 329-330. Short paper.
[ .pdf ]
[ Slides (.htm) ]
DL
HCI
- Bang Viet Nguyen and Min-Yen Kan (2007) Functional Faceted Web Query Classification. In the Proceedings of Query Log Analysis: Social and Technological Challenges, Banff, Canada, May.
[ .pdf ]
[ Slides (.htm) ]
IR
Web
- Ziheng Lin and Min-Yen Kan (2007) Timestamped Graphs: Evolutionary Models of Text for Multi-document Summarization, In Proceedings of Textgraphs-2: Workshop on Graph-based Methods for Natural Language Processing Rochester, NY, USA, April.
[ .pdf ]
[ Slides (.htm) ]
NLP
- Denny Iskandar, Ye Wang, Min-Yen Kan and Haizhou Li (2006) Syllabic Level Automatic Synchronization of Music Signals and Text Lyrics, In Proceedings of ACM Multimedia (MM '06), Santa Barbara, CA, USA, October.
[ .pdf ]
[ Poster (.png) ]
NLP
MM
- Long Qiu, Min-Yen Kan and Tat-Seng Chua (2006) Paraphrase Recognition via Dissimilarity Significance Classification, In Proceedings of the Empirical Methods for Natural Language Processing (EMNLP '06), Syndey, Australia, July, pp. 18-26.
[ local .pdf ]
[ ACL Anthology (W06-1603) ]
[ Slides (.pdf) ]
NLP
- Shi-yong Neo, Jin Zhao, Min-Yen Kan and Tat-Seng Chua (2006) Video Retrieval using High Level Features: Exploiting Query Matching and Confidence-based Weighting, In Proceedings of the Conference on Image and Video Retrieval (CIVR), Tempe, Arizona, USA, July 2006. pp. 143-152.
[ .pdf ]
[ Slides (.htm) ]
IR
MM
- Fei Wang and Min-Yen Kan (2006) NPIC: Hierarchical synthetic image classification using image search and generic features, In Proceedings of the Conference on Image and Video Retrieval (CIVR), Tempe, Arizona, USA, July 2006. pp. 473-482.
[ .pdf ]
[ Poster (.png) ]
IR
Web
- Yee Fan Tan, Min-Yen Kan and Dongwon Lee (2006) Search Engine Driven Author Disambiguation. In Proceedings of the ACM/IEEE Joint Conference on Digital Libraries (JCDL), Chapel Hill, North Carolina, USA, June 2006. pp. 314-315. (Short Paper)
[ .pdf ]
[ Slides (.htm) ]
IR
Web
- Yee Fan Tan, Min-Yen Kan and Hang Cui (2006) Extending corpus-based identification of light verb constructions using a supervised learning framework. In Proceedings of the EACL 2006 Workshop on Multi-word-expressions in a multilingual context (MWEmc), Trento, Italy, April, pages 47-54.
[ .pdf ]
[ Slides (.htm) ]
NLP
- Renxu Sun, Jing Jiang, Yee Fan Tan, Hang Cui, Tat-Seng Chua and Min-Yen Kan (2005) Using Syntactic and Semantic Relation Analysis in Question Answering, In Proceedings of the 14th Text Retrieval Conference (TREC), Gaithersburg, Maryland, USA, November 2005.
[ .pdf ]
IR
NLP
Web
- Min-Yen Kan and Hoang Oanh Nguyen Thi (2005) Fast webpage classification using URL features. In Proc. of Conf. on Info and Knowledge Management (CIKM '05). Bremen, Germany, November 2005. Poster Paper. pp. 325-236.
[ .pdf ] [ Poster (.png) ]
IR
Web
- Wei Lu and Min-Yen Kan (2005) Supervised Categorization of Javascript using Program Analysis Features. In Asian Information Retrieval Symposium (AIRS 05). Jeju Island, Korea, October 2005. pp. 160-173.
[ (pre-print draft) .pdf ]
[ Slides (.htm) ]
IR
Web
- Cui Hang, Min-Yen Kan and Tat-Seng Chua (2005) Generic Pattern Models for Definitional Question Answering. In Proc. of ACM SIG on Information Retrieval (SIGIR 05). Brazil, August 2005. pp. 384-391.
[ .pdf ]
[ Slides (.htm) ]
IR
Web
- Cui Hang, Renxu Sun, Keya Li, Min-Yen Kan and Tat-Seng Chua (2005) Question Answering Passage Retrieval Using Depedency Relations. In Proc. of ACM SIG on Information Retrieval R 05). Brazil, August 2005. pp. 400-407.
[ .pdf ]
[ Slides (.htm) ]
IR
Web
- Bageshree Shevade, Hari Sundaram and Min Yen-Kan (2005) A Collaborative Annotation Framework. In Proceedings of the International Conference on Multimedia and Expo (ICME '05), Amsterdam, Netherlands, July 2005.
[ .pdf ]
MM
- Renxu Sun, Hang Cui, Keya Li, Min-Yen Kan and Tat-Seng Chua (2005) Dependency Relation Matching for Answer Selection. In Proc. of ACM SIG on Information Retrieval R 05). pp. 651-652. (Poster Paper)
[ .pdf ]
IR
NLP
- Yijue How and Min-Yen Kan (2005) Optimizing predictive text entry for short message service on mobile phones. In M. J. Smith & G. Salvendy (Eds.) Proc. of Human Computer Interfaces International (HCII 05). Lawrence Erlbaum Associates. Las Vegas, July 2005. ISBN 0805858075
[ .pdf ]
[ Slides (.htm) ]
NLP
HCI
- Min-Yen Kan and Danny C. C. Poo (2005) Detecting and supporting known item queries in online public access catalogs. Proceedings of the 5th ACM/IEEE Joint Conference on Digital Libraries (JCDL 05). Denver, 7-11 June 2005. pp. 91-99.
[ doi:10.1145/1065385.1065406 ]
[ .pdf ]
[ Slides (.htm) ]
DL
NLP
- Chee How Lee, Min-Yen Kan and Sandra Lai (2004) Stylistic and Lexical Co-training for Web Block Classification. In Proceedings of Workshop on Web Information and Data Management (WIDM '04), Washington, D.C., USA, 12-13 November.
[ doi:10.1145/1031453.1031478 ]
[ .pdf ]
[ Slides (.htm) ]
IR
Web
- Wang Ye, Min-Yen Kan, Tin Lay Nwe, Arun Shenoy and Jun Yin (2004) LyricAlly: Automatic Synchronization of Acoustic Musical Signals and Textual Lyrics. In Proceedings of ACM Multimedia 2004 (MM '04), New York, USA, 10-16 October. pp. 212-219.
Winner of best student paper award.
[ .pdf (preprint) ]
[ Slides (.htm) ]
NLP
MM
- Jeffry Komarjaya, Danny C.C. Poo and Min-Yen Kan (2004) Corpus-Based Query Expansion in Online Public Access Catalogs. In Proceedings of the European Conference on Digital Libraries (ECDL '04), Bath, United Kingdom, 12-17 September.
[ .pdf ]
[ Slides ]
DL
IR
Web
- Hang Cui, Min-Yen Kan, Tat-Seng Chua and Jing Xiao (2004) A Comparative Study on Sentence Retrieval for Definitional Question Answering. In Proceedings of the Workshop on Information Retrieval for Question Answering (IR4QA), SIGIR '04. Sheffield, United Kingdom.
[ .pdf ]
IR
NLP
Web
- Min-Yen Kan (2004) Web Page Classification Without the Web Page. In Proceedings of the 13th International World Wide Web Conference (WWW2004), May 2004. New York, New York, USA. Poster Paper.
[ .pdf ]
[ MeURLin demo ]
[ Poster (.png) ]
IR
Web
- Long Qiu, Min-Yen Kan and Tat-Seng Chua (2004) A Public Reference Implementation of the RAP Anaphora Resolution Algorithm. In Proceedings of the Language Resources and Evaluation Conference 2004 (LREC 04), Lisbon, Portugal.
[ .pdf ] [ Poster (.png) ]
NLP
- Hang Cui, Min-Yen Kan and Tat-Seng Chua (2004) Unsupervised Learning of Soft Patterns for Generating Definitions from Online News. In Proceedings of the 13th International World Wide Web Conference (WWW2004), May 2004. New York, New York, USA.
[ .pdf ]
[ DefSearch Demo ]
IR
NLP
Web
- Simon Lok and Min-Yen Kan (2003) Employing Natural Language Summarization and Automated Layout for Effective Presentation and Navigation of Information Retrieval Results. Proceedings of the 12th International World Wide Web Conference (WWW2003), May 2003. Poster paper.
[ .pdf ]
IR
NLP
Web
- Andre W. Kushniruk, Min-Yen Kan, Kathleen R. McKeown, Judith L. Klavans, Desmond Jordan, Mark LaFlamme and Vimla L. Patel (2002) Usability Evaluation of an Experimental Text Summarization System and Three Search Engines: Implications for the Reengineering of Health Care Interfaces. In Proceedings of the American Medical Informatics Association Annual Symposium (AMIA 2002), San Antonio, Texas, USA: November 2002.
DL
NLP
MedInfo
- Andre W. Kushniruk, Min-Yen Kan, Kathleen R. McKeown, Judith L. Klavans and Vimla L. Patel (2002) Evaluating the Content and Usability of an Experimental Text Summarization System and Three Web-Based Search Engines. In Proceedings of the Human Factors and Ergonomics 46th Annual Meeting (HFES 2002), Baltimore, Maryland, USA: September 2002.
DL
IR
MedInfo
- Min-Yen Kan and Judith L. Klavans (2002) Using Librarian Techniques in Automatic Text Summarization for Information Retrieval. Proceedings of the Joint Conference on Digital Libraries (JCDL 2002), Portland, Oregon, USA: July 2002. pp. 36-45. Nominated for best paper award.
[ Postscript, GZIPped ]
[ .pdf ]
[ Presentation Slides ]
DL
NLP
- Min-Yen Kan and Kathleen R. McKeown (2002) Corpus-trained text generation for summarization. Proceedings of the Second International Natural Language Generation Conference (INLG 2002), Harriman, New York, USA: July 2002. pp. 1-8.
[ Postscript, GZIPped ]
[ .pdf ]
[ Presentation Slides ]
NLP
- Min-Yen Kan, Judith L. Klavans and Kathleen R. McKeown (2002) Using the Annotated Bibliography as a Resource for Indicative Summarization. In Proceedings of the Language Resources and Evaluation Conference (LREC 2002), Las Palmas, Spain: May 2002. pp. 1746-1752. (Posted to cmp-lg)
[ Postscript, GZIPped ]
[ .pdf ]
[ poster (.jpg) ]
[ poster (MS .ppt file) ]
NLP
- Min-Yen Kan, Kathleen R. McKeown and Judith L. Klavans (2001) Domain-specific informative and indicative summarization for information retrieval. In Proceedings of the Document Understanding Workshop (DUC 2001), New Orleans, USA: September 2001.
[ Postscript, GZIPped ]
[ .pdf ]
[ Poster Session Foils/Slides ]
NLP
- Kathleen R. McKeown, Regina Barzilay, David Evans, Vasileios Hatzivassiloglou, Min-Yen Kan, Barry Schiffman and Simone Teufel (2001) Columbia Multi-Document Summarization: Approach and Evaluation. In Proceedings of the Document Understanding Workshop (DUC 2001), New Orleans, USA: September 2001.
[ Postscript, GZIPped ]
[ .pdf ]
NLP
- Min-Yen Kan, Kathleen R. McKeown and Judith L. Klavans (2001) Applying Natural Language Generation to Indicative Summarization. In Proceedings of 8th European Workshop on Natural Language Generation, Toulouse, France: July 2001. pp. 92-100. (Posted to cmp-lg)
[ Postscript, GZIPped ]
[ .pdf ]
[ Poster Session Foils/Slides ]
NLP
- Vasileios Hatzivassiloglou, Judith L. Klavans, Melissa L. Holcombe, Regina Barzilay, Min-Yen Kan and Kathleen R. McKeown (2001) Simfinder: A Flexible Clustering Tool for Summarization. In Proceedings of the Workshop on Summarization in NAACL `01, Pittsburg, Pennsylvania, USA: June 2001.
[ Postscript, GZIPped ]
[ .pdf ]
NLP
- Judith L. Klavans and Min-Yen Kan (1998) Role of Verbs in Document Analysis. In Proceedings of COLING/ACL 98, Montréal, Québec, Canada: Aug. 1998. pp. 680-686. (Posted to cmp-lg archives)
[ Postscript, GZIPped ]
[ .pdf ]
NLP
- Judith L. Klavans, Kathleen R. McKeown, Min-Yen Kan and Susan Lee (1998) Resources for the Evaluation of Summarization Techniques. Proceedings of the 1st International Conference on Language Resources and Evaluation, Grenada, Spain: May 1998. (Posted to cmp-lg)
[ Postscript, GZIPped ]
[ .pdf ]
[ LaTeX source, GZIPped (uses lrec98.sty, and LaTeX 2.09e) ]
[ Poster Session Foils/Slides]
NLP
- Min-Yen Kan, Judith L. Klavans and Kathleen R. McKeown (1998) Linear Segmentation and Segment Relevence. Proceedings of 6th International Workshop of Very Large Corpora (WVLC-6), Montréal, Québec, Canada: August 1998. pp. 197-205. (Posted to cmp-lg) archives
[ Postscript, GZIPped ]
[ .pdf ]
[ Presentation Slides ]
NLP
- Pascale Fung, Min-Yen Kan and Yurie Horita (1996) Extracting Japanese Domain and Technical Terms is Relatively Easy. Second International Conference in New Methods for Language Processing, (NEMLP) Bilkent, Turkey: September 1996. pp. 148-159.
[ Postscript, GZIPped ]
[ .pdf ]
NLP
- Jesse Prabawa Gozali, Min-Yen Kan and Hari Sundaram. (2012b). How Do People Organize Their Photos in Each Event and How Does It Affect Storytelling, Searching and Interpretation Tasks? Technical Report TRC4/12, National University of Singapore Department of Computer Science.
[ .pdf ]
DL
HCI
MM
- Tao Chen and Min-Yen Kan (2011) Creating a Live, Public Short Message Service Corpus: The NUS SMS Corpus. (Posted to cmp-lg) World's large publicly available SMS Corpus.
[ .pdf ]
[ Corpus Website ]
NLP
- Jun Ping Ng, Praveen Bysani, Ziheng Lin, Min-Yen Kan and Chew Lim Tan (2011) SWING: Exploiting Category-Specific Information for Guided Summarization. In Proceedings of the Text Analysis Conference 2011 (TAC 2011). Gaithersburg, Maryland, USA. 1st place in automated ROUGE measures among all teams.
[ .pdf ] [ Slides (.pdf) ]
NLP
- Duy Khang Ly, Kazunari Sugiyama, Ziheng Lin and Min-Yen Kan (2011). Product Review Summarization based on Facet Identification and Sentence Clustering. National University of Singapore, Department of Computer Science Technical Report, TR 30/11. (Posted to cmg-lg)
[ .pdf ]
NLP
- Aobo Wang, Cong Duy Vu Hoang and Min-Yen Kan (2010). Perspectives on Crowdsourcing Annotations for Natural Language Processing. National University of Singapore. Department of Computer Science Technical Report, TRB 7/10.
[ .pdf ]
NLP
- Yee Fan Tan and Min-Yen Kan (2010). A Framework for Hierarchical Cost-sensitive Web Resource Acquisition. National University of Singapore. Department of Computer Science Technical Report, TRA 3/10.
[ .pdf ]
IR
- Yee Fan Tan and Min-Yen Kan (2010). Cost-sensitive Attribute Value Acquisition for Support Vector Machines.
National University of Singapore
Department of Computer Science Technical Report, TRB 3/10.
[ .pdf ]
ML
-
Paula M. Procter, Min-Yen Kan, Siu Yin Lee, Siti Zubaidah, Wai Kin Yip, Jin Zhao, David Arthur, Goh Mien Li (2009) eEvidence: Supplying Evidence to the Patient Interaction. In Saranto, K. et al. (Eds) NI2009 Nursing Informatics: Connecting Health and Humans. pp. 488-492. IOS Press. ISBN 978-1-60750-024-7. Highest Scholarship Poster Abstract Award Winner.
[ doi:10.3233/978-1-60750-024-7-488 ] [ Local copy (.pdf) (corrects misspelled author name; as allowed by self-archiving, courtesy IOS press) ] [ Poster (.pdf) ]
DL
IR
MedInfo
- Ziheng Lin, Huu Hung Hoang, Min-Yen Kan, Long Qiu and Shiren Ye (2008) NUS at TAC 2008: Augmenting Timestamped Graphs with Event Information and Selectively Expanding Opinion Contexts. In Proc. of Text Analysis Conference.
[ Pre-print (.pdf) ]
NLP
- Jesse Prabawa Gozali and Min-Yen Kan (2007). Rich and Dynamic Library Catalogs: A Case Study of Online Search Interfaces.
National University of Singapore
Department of Computer Science Technical Report, TRA 8/07.
[ .pdf ]
DL
HCI
- Ziheng Lin, Tat-Seng Chua, Min-Yen Kan, Wee Sun Lee, Long Qiu
and Shiren Ye (2007). NUS at DUC 2007: Using Evolutionary Models of
Text. In Proceedings of the Document Understanding Conference (DUC
'07), Rochester, NY, USA.
[ .pdf ] [ Slides (.htm) ]
NLP
- Shiren Ye, Long Qiu, Tat-Seng Chua and Min-Yen Kan
(2005). NUS at DUC 2005: Understanding Documents via Concept
Links. In Proceedings of Document Understanding Conference
(DUC '05), Vancouver, Canada.
Placed first of 31 teams in the DUC 2005 competition based on ROUGE scoring.
[ .pdf ] [ Slides (.htm) ]
NLP
- Min-Yen Kan and Hoang Oanh Nguyen Thi (2005) Fast webpage
classification using URL features.
National University of Singapore
Department of Computer Science Technical Report, TRC 8/05.
[ .pdf ]
IR
- Yee Fan Tan, Min-Yen Kan and Hang Cui (2005) Extending
corpus-based identification of light verb constructions using a
supervised learning framework. National University of Singapore
Department of Computer Science Technical Report, TRB 8/05.
[ http://hdl.handle.net/1900.100/1850
] [ .pdf ]
NLP
- Hang Cui, Keya Li, Renxu Sun, Tat-Seng Chua and Min-Yen Kan
(2004) National University of Singapore at the TREC-13 Question Answering Main Task. In Proceedings of
TREC 13.
[ .pdf ] [ Slides (.htm) ] [ Poster (.htm) ]
NLP
IR
- Hui Yang, Hang Cui, Mstislav Maslennikov, Long Qiu, Min-Yen Kan, Tat-Seng Chua
(2003) QUALIFIER In TREC-12 QA Main Task. In Proceedings of
TREC 12, pages 480-488.
[ .pdf ]
NLP
IR
- Min-Yen Kan (2003) Metadata extraction and text categorization using Universal Resource Locator expansions. National University of Singapore Department of Computer Science Technical Report, TR 10/03.
[ Postscript, GZIPped ]
[ .pdf ]
IR
- Min-Yen Kan, Judith L. Klavans, Kathleen R. McKeown (2001)
Synthesizing composite topic structure trees for multiple
domain specific documents. Columbia University Computer
Science Technical Report, CUCS-003-01.
[ Postscript, GZIPped ]
[ .pdf ]
NLP
- Min-Yen Kan (2001)
Combining visual layout and lexical cohesion features for text
segmentation. Columbia University Computer
Science Technical Report, CUCS-002-01.
[ Postscript, GZIPped ]
[ .pdf ]
NLP
- Min-Yen Kan and Kathleen R. McKeown (1999)
Information Extraction and Summarization: Domain
Independence through Focus Types. Columbia University Computer
Science Technical Report, CUCS-030-99.
[ Postscript, GZIPped ]
[ .pdf ]
[ .html (via latex2html) ]
NLP
- Martin Braschler, Min-Yen Kan, Peter Schäuble and Judith L. Klavans
The Eurospider Retreival System and the TREC-8 Cross-Language Task.
Proceedings of TREC-8, Gaithersburg, Maryland, USA:
Nov. 1999.
[ Postscript, GZIPped ]
[ .pdf ]
[ .html (via MS Word) ]
IR
Min-Yen Kan
Automatic text summarization as applied to information retrieval: Using indicative and informative summaries, New York, New York, USA:
Feb 2003. Ph.D. Thesis.
[ Postscript, GZIPped ]
[ .pdf ]
[ .html (via latex2html) ]
Presentation slides: [ .html ]
[ .ppt ]
NLP