CCExtractor Development

Debugging issues with Spanish recordings

CCExtractor supports many standards for extracting captions and subtitles from video files, in almost any language. Occasionally, however, we still encounter recordings it cannot handle, and this is one such case: the samples we received produce either empty files or garbage output.

The issue linked in the external URL provides the two samples we have. To solve this task, we'd like you to investigate why these samples produce incorrect output.

This can be done in several ways, but will most likely require stepping through the code in a debugger.

We recommend following this procedure:

  • run CCExtractor on the samples to see what output they produce
  • check whether popular media players can parse the captions
  • check whether other tooling indicates that captions are present
  • consult the relevant caption standards if needed
  • run CCExtractor again under a debugger to see where it goes wrong
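
For the "check whether tooling indicates captions are present" step, a small homemade scanner can complement existing tools. The sketch below assumes the samples are MPEG transport streams (the task only tags them as DVB, so this is an assumption) and counts how many packets each PID carries, which shows whether a suspected subtitle PID contains any data at all. The names ts_packet_pid and ts_count_pids are hypothetical helpers written for this illustration; they are not part of CCExtractor.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define TS_PACKET_SIZE 188   /* fixed MPEG-TS packet length in bytes */
#define TS_SYNC_BYTE   0x47  /* first byte of every TS packet */
#define TS_PID_COUNT   8192  /* PIDs are 13-bit values: 0..8191 */

/* Hypothetical helper: return the 13-bit PID of one TS packet,
 * or -1 if the packet does not start with the sync byte. */
int ts_packet_pid(const uint8_t *pkt)
{
    assert(pkt != NULL);
    if (pkt[0] != TS_SYNC_BYTE)
        return -1;
    /* PID = low 5 bits of byte 1 followed by all 8 bits of byte 2. */
    return ((pkt[1] & 0x1F) << 8) | pkt[2];
}

/* Hypothetical helper: count packets per PID in an in-memory buffer.
 * Assumes the buffer starts on a packet boundary. Packets with a bad
 * sync byte are skipped and a trailing partial packet is ignored.
 * Returns the number of well-formed packets seen. */
size_t ts_count_pids(const uint8_t *buf, size_t len,
                     uint32_t counts[TS_PID_COUNT])
{
    size_t good = 0;

    assert(buf != NULL && counts != NULL);
    memset(counts, 0, TS_PID_COUNT * sizeof counts[0]);

    for (size_t off = 0; off + TS_PACKET_SIZE <= len; off += TS_PACKET_SIZE) {
        int pid = ts_packet_pid(buf + off);
        if (pid < 0)
            continue;   /* lost sync: skip this 188-byte slot */
        counts[pid]++;
        good++;
    }
    return good;
}
```

To use it, read a chunk of the sample into a buffer, call ts_count_pids, and print the non-zero entries of counts. A PID that is announced for subtitles but has a count of zero, or many slots with a bad sync byte, would point to a problem in the recording rather than in the caption decoder. The sketch does not resynchronise after lost sync and does not parse the PAT or PMT, so mapping a PID to "subtitles" still needs another tool or a look at the relevant standard.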

We expect either a report explaining why it is impossible to extract captions from these samples or, if extraction should be possible, the root cause of the failure. Bonus points if you can open a PR with a fix.

Task tags

  • c
  • spanish
  • closed captions
  • dvb

Students who completed this task

Harry Yu

Task type

  • Code

2017