Motivation and problem
definition
n Transcripts are widely used to index speech
information in videos.
n Current automatic speech recognition (ASR)
introduces errors during the transcription
process.
n This error can cascade and result in errors in
the indexing as well, which affects retrieval.
n In this paper, we will discuss how to use
auxiliary data and NLP technology to correct
speech recognition errors.