CCExtractor Development

Validate status of our multiprogram support

Introduction

CCExtractor is a tool to extract subtitles/closed captions from video streams. It's used by universities for research purposes, as well by regular users that want to extract captions from various formats.

The task

In the digital world, a number of programs are transmitted simultaneously (multiplexed) in a single channel. This means that for example you can have a stream that carries BBC1, BBC2, etc. A couple years ago we added support for this - meaning that if you have such a stream you can get the subtitles for all the programs in the stream at the same time.

When this was done however (by one of our best ever Summer of Code students, by the way) we didn't have many samples to test; so it's probably a bit rough around the edges.

The task here is to get a few sample streams (maybe 20 or so) and validate the output.

We will of course provide the streams.

If subtitles are perfect - great. We're done. If they are not - we need to add detailed bug reports in GitHub, which as much information as we can so we can work on this.

Requirements to complete this task

As stated above, we either require a report stating that the streams you tested were perfect (and why), or new bug reports on our GitHub repository.

Relevant extra links

Students who completed this task

Ivan Makarov

Task type

  • done_all Quality Assurance
close

2018