[S]crape is a tool developed to help researchers extract selective data from web publications. It is particularly useful for serial web publications which have similar structure over many issues. You interactively develop a selection and extraction set of commands, and run them across a series of issues, generating output (JSON or CSV).
Software and browser requirements.
Examples of just some of the alternatives to [S]crape include:
You can also look through the notable tools section on Wikipedia.
Many of these are either not interactive, or are programmers libraries or toolkits. [S]crape is an interactive script development tool which, with a modicum of knowledge, is both powerful and simple. [S]crape scripts use [S]crape commands, shell commands, and commands provided by extensions.
This document is currently a work-in-progress. Here are a list of known items left to do:
Todo
Have yet to debug the scrape.gz install file (installation does not mirror setup.py).
(The original entry is located in /home/docs/checkouts/readthedocs.org/user_builds/scrape/checkouts/latest/source/installation.rst, line 92.)