Script: Scrape Public Parliament Data using Python

FOSSASIA

Scraping is the art of taking information from websites. In the European Parliament Members may pose written and oral questions to the European Institutions. E.g.

http://www.europarl.europa.eu/rss/doc/questions-declarations/en.xml
http://www.europarl.europa.eu/RegistreWeb/search/typedoc.htm?codeTypeDocu=QECR
http://www.europarl.europa.eu/sides/getDoc.do?type=WQ&reference=P-2015-015758&format=XML&language=EN

Expected Outcome

Your task is to build a script that automatically scrapes all Answers to Written Questions from 2015
The script downloads all data into a directory
Hint: The Answers to Written Questions are provided by different sources. You try to get them in Word (.doc) or Portable Document Format (.pdf)
Accepted pull request: Script and data should be added into the repository at https://github.com/fossasia/parliament-scaper
Step by step process added in the Readme.md

Task tags

python
scraping
opendata
opengov

Students who completed this task

Yasoob, Scott Deng, pythad, sampriti, rajdeep, Pol Baladas

Task type

Code