CCExtractor Development

Find out a way to get a current topic based on captions and implement it

Suppose you have a set of captions lines from a few minutes of newscast. When a person reads these captions he can immediately tell you which topic is being discussed, e.g. “it’s about US president election” or “it’s about a species of fish that can jump out of water ”. We want you to do some research (write a brief survey) on how to summarise a set of given captions lines to tell a topic of them. As this problem is well known and there are algorithms for it on the Internet, we also want you to find a good one and implement it.

Task tags

  • text summarization
  • machine learning

Students who completed this task

Evgeny Shulgin, Manveer_Basra

Task type

  • code Code
  • assessment Outreach / Research
close

2016