News-on-Demand: Types of Errors
Of course, a
fully automated system like News-on-Demand will inevitably make errors.
We distinguish five types of errors, all of which are subjects of active research
aimed at improving the system:
- False segmentation of stories. This happens when we incorrectly
identify the beginning and end of a video paragraph associated with a
single news story. Incorrect segmentation is usually
due to inaccurate transcription, either from true closed-captioning errors,
errors in processing the closed-captioning text into stories, or errors
in the segmentation based on the speech transcription.
- False words in transcripts. Errors in the transcript
are either the result of faulty speech recognition or
errors in the closed-captioned text. The result is the appearance of
incorrect words in stories
and consequent errors in the index and in retrieval.
- False synchronization designates retrieved words that
were actually spoken elsewhere in the video. This is generally
due to closed-captioning that does not match the words returned by
speech recognition.
- Incorrectly recognized query. This is the result of an incorrect
speech recognition during the library exploration. The user can
edit and correct query misrecognitions through typing, or simply
repeat or rephrase the query.
- Incorrect set of stories returned for a query. This type of error
is measured through information recall and precision. The user
might get stories that are not relevant to the query or miss relevant
stories. Some of these are caused by previously mentioned problems,
and others are the result of shortcomings in the processing of the
query into retrieval keywords.
Next: Conclusions