sg.edu.nus.comp.nlp.ims.util
Class CAmendLexeltCorpus

java.lang.Object
  extended by sg.edu.nus.comp.nlp.ims.util.CAmendLexeltCorpus

public final class CAmendLexeltCorpus
extends java.lang.Object

amend senseval 3 lexical sample task training corpus. because in SensEval 3 lexical sample task training corpus, there may be two or more training instances in one tag. this file is to separate the instances described above. I will add a "-number" at the end of the instance id and create a file with only one instance in one tag.

Author:
zhongzhi

Field Summary
protected  java.util.regex.Pattern m_AnswerPattern
           
protected  java.util.regex.Pattern m_InstancePattern
           
protected  java.util.regex.Pattern m_SplitPattern
           
 
Constructor Summary
CAmendLexeltCorpus()
           
 
Method Summary
 void amend(java.lang.String p_XmlFile)
          amend an input xml file
 void amend2(java.lang.String p_XmlFile)
           
static void main(java.lang.String[] p_Args)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

m_InstancePattern

protected java.util.regex.Pattern m_InstancePattern

m_AnswerPattern

protected java.util.regex.Pattern m_AnswerPattern

m_SplitPattern

protected java.util.regex.Pattern m_SplitPattern
Constructor Detail

CAmendLexeltCorpus

public CAmendLexeltCorpus()
Method Detail

amend2

public void amend2(java.lang.String p_XmlFile)
            throws java.io.IOException
Throws:
java.io.IOException

amend

public void amend(java.lang.String p_XmlFile)
           throws org.jdom.JDOMException,
                  java.io.IOException
amend an input xml file

Parameters:
p_XmlFile - input xml file
Throws:
java.io.IOException - io excpetion
org.jdom.JDOMException - jdom exception

main

public static void main(java.lang.String[] p_Args)
Parameters:
p_Args - arguments