Wikimedia

Add scraping of Highwire Press tags to html-metadata node library

Full description here: https://phabricator.wikimedia.org/T118633

We use the node.js html-metadata library to scrape metadata from webpages. This metadata is then used to generate a citation for the webpage using citoid.

Highwire Press Tags are a type of metadata in a webpage that gives information about how to cite the resource. You should create a function called exports.parseHighwirePress that scrapes all the highwire press tags, and creates a javascript object as well as tests.

Task tags

  • node, node.js, javascript, web scraping

Students who completed this task

neonowy

Task type

  • code Code
close

2015