nasa-jpl-memex/nutch-python

Name: nutch-python

Owner: NASA JPL MEMEX

Owner: NASA JPL MEMEX

Description: Python port of Nutch that allows controlling Apache Nutch via its REST API.

Forked from: chrismattmann/nutch-python

Created: 2015-09-10 19:56:03.0

Updated: 2018-02-26 02:09:11.0

Pushed: 2015-12-02 05:58:49.0

Homepage: http://nutch.apache.org/

Size: 66

Language: Python

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

nutch-python

A Python client library for the Apache Nutch that makes Nutch 1.x capabilities available using the Nutch REST Server.

See (https://wiki.apache.org/nutch/NutchTutorial) for installing Nutch 1.x and alternatively operating it via the command line.

This Python client library for Nutch is installable via Setuptools, Pip and Easy Install.

Installation (with pip)
  1. pip install nutch
Installation (without pip)
  1. python setup.py build
  2. python setup.py install

Wiki Documentation

See the wiki for instructions on how to use Nutch-Python and its API.

New Command Line Tool

When you install Nutch-Python you also get a new command line client tool, nutch-python installed in your /path/to/python/bin directory.

The options and help for the command line tool can be seen by typing nutch-python without any arguments.

Questions, comments?

Send them to Chris A. Mattmann.

Contributors

License

Apache License, version 2


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.