Introduction

Background

For the last few years I have been specializing at web scraping and collected what I found useful into this library.

All code is pure Python and has been run across multiple Linux servers, Windows machines, as well as Google App Engine.

Install

Some options to install the webscraping package:

  1. Checkout the repository: hg clone https://code.google.com/p/webscraping/

  2. Download the zip: https://pypi.python.org/pypi/webscraping/

  3. Install with pypi: pip install webscraping

The only dependency is python 2.5 or higher.

License

This code is licensed under the LGPL license.