Ben Hancock

Computational Journalism, Python, and Linux

Web Scraping Articles

Collecting Data from Messy Websites

If you're a journalist (or a student, or a researcher, or pretty much anyone else who relies on the internet for information), you've undoubtedly been in this situation: you want to save a bunch of data from a website, but it's in messy or uncopyable format. With so many sites …