Apertium

Find content that the phenny/begiak wiki modules don't handle well

Identify at least 10 pages or sections on Wikipedia or the Apertium wiki for which the respective Begiak module doesn't return good output. These may include pages that open directly with a subsection, pages where the first element is a table or infobox, or pages where the first period (.) doesn't end the sentence. Document generalisable scenarios describing what the preferred behaviour would be.
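To make the "first period doesn't end the sentence" case concrete, here is a minimal Python sketch. It is not Begiak's actual code: the function names, the abbreviation list, and the sample strings are all hypothetical and exist only to illustrate the kind of scenario worth documenting.

```python
import re

# Hypothetical abbreviation list for illustration; a real module would need more.
ABBREVIATIONS = {"e.g.", "i.e.", "dr.", "mr.", "mrs.", "st.", "vs.", "etc."}


def naive_first_sentence(text):
    """Cut at the very first period, which goes wrong on abbreviations."""
    head, sep, _ = text.partition(".")
    return head + sep


def first_sentence(text):
    """Return the text up to the first period that plausibly ends a sentence."""
    # Only consider periods followed by whitespace or the end of the text,
    # so decimals such as "3.5" are never treated as sentence ends.
    for match in re.finditer(r"\.(?=\s|$)", text):
        end = match.end()
        last_word = text[:end].split()[-1].lower()
        # Skip periods that belong to a known abbreviation or a single initial.
        if last_word in ABBREVIATIONS or re.fullmatch(r"[a-z]\.", last_word):
            continue
        return text[:end]
    # No plausible sentence end found: return the text unchanged.
    return text


if __name__ == "__main__":
    sample = "Dr. Smith founded the project in 2004. It is open source."
    print(naive_first_sentence(sample))
    print(first_sentence(sample))
```

A heuristic like this still fails on abbreviations missing from the list or wrapped in punctuation, such as "(e.g.", so cases like those are good candidates for the documented scenarios.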

One current example is the page on Begiak itself: .awik Begiak returns the first sentence of the first subsection rather than the first sentence of the page.
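One way to describe the preferred behaviour for pages like this is "use the first prose paragraph that appears before any heading, skipping infoboxes and tables". The Python sketch below illustrates that rule on raw wikitext. It is an assumption about how such a rule could look, not how the Begiak modules work: the function name and the sample wikitext are hypothetical, and it only recognises templates and tables that start at the beginning of a paragraph.

```python
import re


def lead_paragraph(wikitext):
    """Return the first prose paragraph before any heading, or None."""
    # Keep only the lead: everything before the first "== Heading ==" line.
    lead = re.split(
        r"^=+[^=\n]+=+\s*$", wikitext, maxsplit=1, flags=re.MULTILINE
    )[0]
    # Paragraphs are separated by blank lines.
    for block in re.split(r"\n\s*\n", lead):
        block = block.strip()
        # Skip empty blocks, templates/infoboxes ({{...}}) and tables ({|...|}).
        if not block or block.startswith(("{{", "{|")):
            continue
        return block
    # The page has no lead prose, e.g. it opens directly with a subsection.
    return None


if __name__ == "__main__":
    # Hypothetical sample wikitext, not taken from a real page.
    page = "{{Infobox\n|name=x}}\n\nExample is a bot.\n\n== Usage ==\nUse it."
    print(lead_paragraph(page))
```

Returning None when a page has no lead prose leaves the fallback decision to the caller, for instance using the first subsection and saying so in the output.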

For further information and guidance on this task, you are encouraged to come to our IRC channel.

Task tags

  • basic_nlp
  • python
  • IRC
  • bot
  • begiak

Students who completed this task

Darkgaia

Task type

  • Quality Assurance

2015