Speech Recognition for Index Generation
- Integrate closed captioning with transcripts generated by speech recognition
- Improve accuracy by automatically expanding the language model each day with new terms from closed captioning, e.g. “Dodi Fayed”
- Participated (with Claritech) in the TREC Spoken Document track
- large text retrieval evaluation benchmarks (NIST/DARPA)
- scored second, held back by out-of-vocabulary (OOV) words (CIA, well-known, torched)
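
The daily vocabulary expansion described above can be illustrated with a minimal sketch. It assumes the recognizer's vocabulary is available as a plain set of lowercase words and that the day's closed-caption text is a single string; the function names (`tokenize`, `find_oov`, `expand_vocabulary`) and the `min_count` threshold are hypothetical and are not taken from the original system, which would also need to update pronunciations and language model probabilities.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def find_oov(caption_text, vocabulary):
    """Return {word: count} for caption words missing from the vocabulary."""
    counts = Counter(tokenize(caption_text))
    return {word: n for word, n in counts.items() if word not in vocabulary}


def expand_vocabulary(vocabulary, caption_text, min_count=1):
    """Add OOV caption words seen at least min_count times (assumed policy).

    Returns the enlarged vocabulary and the sorted list of added words.
    """
    oov = find_oov(caption_text, vocabulary)
    added = sorted(word for word, n in oov.items() if n >= min_count)
    return vocabulary | set(added), added


# Toy example: a tiny vocabulary and one line of caption text.
vocab = {"the", "was", "with", "princess", "in", "paris"}
captions = "Dodi Fayed was with the princess in Paris."
new_vocab, added = expand_vocabulary(vocab, captions)
print(added)
```

In this toy run the two name tokens from the caption are the only words missing from the vocabulary, so they are the ones added. Raising `min_count` would be one way to avoid adding caption typos that appear only once.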