This is the documentation for CDH 5.1.0.
Documentation for other versions is available at Cloudera Documentation.

MapReduce Batch Indexing Reference

Cloudera Search provides the ability to batch index documents using MapReduce jobs.

If you have not yet installed the MapReduce tools required for Cloudera Search, install them on each node from which you want to submit a batch indexing job, as described in Installing MapReduce Tools for use with Cloudera Search.

For information on tools related to batch indexing, see:

Running an Example Indexing Job

See the Cloudera Search Tutorial for examples of running a MapReduce job to index documents.
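
As a rough orientation before working through the tutorial, the following sketch shows how the command line for a batch indexing job might be assembled. It is not taken from this page: it assumes the job is submitted with hadoop jar and the org.apache.solr.hadoop.MapReduceIndexerTool class, using the --morphline-file, --output-dir, --zk-host, --collection, and --go-live options. The helper name build_indexer_command and every path, host name, and collection name below are hypothetical placeholders; check the tool's --help output on your cluster for the authoritative option list.

```python
import shlex


def build_indexer_command(job_jar, morphline_file, output_dir, zk_host,
                          collection, input_paths, go_live=True):
    """Return the argv list for a MapReduceIndexerTool batch indexing job.

    Hypothetical helper for illustration only; it builds the command
    but does not run it.
    """
    cmd = [
        "hadoop", "jar", job_jar,
        "org.apache.solr.hadoop.MapReduceIndexerTool",
        "--morphline-file", morphline_file,  # morphline that parses the input
        "--output-dir", output_dir,          # HDFS directory for index shards
        "--zk-host", zk_host,                # ZooKeeper ensemble used by Solr
        "--collection", collection,          # target Solr collection
    ]
    if go_live:
        # Merge the generated shards into the live Solr servers.
        cmd.append("--go-live")
    cmd.extend(input_paths)                  # HDFS files or directories to index
    return cmd


# All values below are placeholders (assumptions), not real cluster settings.
example = build_indexer_command(
    job_jar="/path/to/search-mr-job.jar",
    morphline_file="/path/to/morphline.conf",
    output_dir="hdfs://namenode.example.com:8020/user/someuser/outdir",
    zk_host="zk01.example.com:2181/solr",
    collection="collection1",
    input_paths=["hdfs://namenode.example.com:8020/user/someuser/indir"],
)

# Print a shell-safe command line for review instead of executing it.
print(" ".join(shlex.quote(arg) for arg in example))
```

Printing the command rather than executing it keeps the sketch safe to run on a machine without Hadoop installed; once the values match your environment, the printed line can be pasted into a shell on a node where the MapReduce tools are installed.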