Wikimedia

Add scraping of BE Press tags to html-metadata node library

Full description here: https://phabricator.wikimedia.org/T118635

We use the node.js html-metadata library to scrape metadata from webpages. This metadata is then used to generate a citation for the webpage using citoid.

BE Press tags are a type of metadata in a webpage that gives information about how to cite the resource. You should create a function called exports.parseBEPress in https://github.com/wikimedia/html-metadata/blob/master/lib/index.js that scrapes all the BE Press tags, and creates a javascript object.

Task tags

  • node
  • javascript
  • testing
  • html
  • web scraping

Students who completed this task

Geoffrey Mon

Task type

  • code Code
close

2015