FOSSASIA
Script: Scrape Public Parliament Data using Python
Scraping is the art of taking information from websites. In the European Parliament Members may pose written and oral questions to the European Institutions. E.g.
- http://www.europarl.europa.eu/rss/doc/questions-declarations/en.xml
- http://www.europarl.europa.eu/RegistreWeb/search/typedoc.htm?codeTypeDocu=QECR
- http://www.europarl.europa.eu/sides/getDoc.do?type=WQ&reference=P-2015-015758&format=XML&language=EN
Expected Outcome
- Your task is to build a script that automatically scrapes all Answers to Written Questions from 2015
- The script downloads all data into a directory
- Hint: The Answers to Written Questions are provided by different sources. You try to get them in Word (.doc) or Portable Document Format (.pdf)
- Accepted pull request: Script and data should be added into the repository at https://github.com/fossasia/parliament-scaper
- Step by step process added in the Readme.md
Task tags
Students who completed this task
Yasoob, Scott Deng, pythad, sampriti, rajdeep, Pol Baladas