Python module reference

This module reference extends the manual with a comprehensive overview of the available functionality. Each module in the package is documented by a general summary of its purpose and the list of classes and functions it provides.

Commands

crawl Interface for crawling a webpage and push extracted data into a dataset
crawl_init Interface for a generic template in which arguments are specified by the user

Pipelines

pipeline Pipeline functionality.