CCExtractor Development

[AI][Medium] Chinese Word Segmentation

Chinese language is linguistically very different from English and one of its distinguished linguistic property is that it doesn't contain spaces in the sentences which makes us difficult to distinguish between the words or tokens that can be aligned to the words in other languages. The task is to make computer program that will take a sentence as an input and will output the segmented sentence or a list of words in the original sentence. Use of external libraries is allowed.

Task tags

  • natural language processing
  • machine learning

Students who completed this task

abacles, T1duS, Jed Lim, lyect, Ivan Makarov

Task type

  • code Code
  • web Design
  • assessment Outreach / Research
close

2018