[S]crape - the Documentation

[S]crape is a tool developed to help researchers selectively extract data from web publications. It is particularly useful for serial web publications that keep a similar structure across many issues. You interactively develop a set of selection and extraction commands, then run them across a series of issues to generate output (JSON or CSV).
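The workflow described above (one set of selection rules, applied to every issue in a series, written out as JSON or CSV) can be pictured with a minimal Python sketch. This is not [S]crape syntax and does not use [S]crape itself; it relies only on the Python standard library, and the issue names, the HTML snippets, and the `h2 class="title"` selection rule are all hypothetical, chosen purely for illustration.

```python
import csv
import io
import json
from html.parser import HTMLParser


class TitleExtractor(HTMLParser):
    """Collect the text of every <h2 class="title"> element (hypothetical rule)."""

    def __init__(self):
        super().__init__()
        self._in_title = False
        self.titles = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs
        if tag == "h2" and ("class", "title") in attrs:
            self._in_title = True
            self.titles.append("")

    def handle_endtag(self, tag):
        if tag == "h2":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.titles[-1] += data


def extract_titles(html):
    """Apply the selection rule to one issue's HTML."""
    parser = TitleExtractor()
    parser.feed(html)
    parser.close()
    return [title.strip() for title in parser.titles]


# Hypothetical issues of a serial publication that share one page structure.
ISSUES = {
    "issue-1": '<h2 class="title">On Tides</h2><p>...</p>'
               '<h2 class="title">On Winds</h2>',
    "issue-2": '<h2 class="title">On Stars</h2>',
}


def scrape_series(issues):
    """Run the same extraction over every issue and collect flat records."""
    rows = []
    for issue, html in issues.items():
        for title in extract_titles(html):
            rows.append({"issue": issue, "title": title})
    return rows


def to_csv(rows):
    """Serialize the records as CSV text."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=["issue", "title"],
                            lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


rows = scrape_series(ISSUES)
print(json.dumps(rows, indent=2))  # JSON output
print(to_csv(rows))                # CSV output
```

The point of the sketch is the shape of the task: because every issue shares the same structure, the selection rule is written once and reused unchanged, and only the output serializer differs between JSON and CSV.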

Overview

What’s involved, required knowledge, and the basic modes of operation.

Installation

Software and browser requirements.

Tutorials

Get started with concrete examples.

Alternatives

Examples of just some of the alternatives to [S]crape include:

You can also look through the notable tools section on Wikipedia.

Many of these are either not interactive or are programmers' libraries and toolkits. [S]crape is an interactive script development tool which, with a modicum of knowledge, is both powerful and simple. [S]crape scripts use [S]crape commands, shell commands, and commands provided by extensions.

Copyright Notice

This license applies to the program, [S]crape, and its documentation.

Work in Progress

This document is currently a work in progress. Here is a list of known items left to do:

Todo

Have yet to debug the scrape.gz install file (installation does not mirror setup.py).


