This is the documentation for CDH 5.1.0.
Documentation for other versions is available at Cloudera Documentation.

Installing the Lily HBase Indexer Service

If you want to query data stored in HBase, you must install the Lily HBase Indexer service. The Lily HBase Indexer service indexes the stream of records being added to HBase tables. This process is scalable, fault tolerant, transactional, and operates at Near Real Time (NRT). Typically there is only a few seconds between the time when data arrives and the same data appears in search results.

Choosing where to Deploy the Lily HBase Indexer service Processes

You can have as many Lily HBase Indexer services running on different nodes as is required to accommodate the HBase ingest load. Consult HBase replication documentation for details on how to plan the capacity. You can co-locate Lily HBase Indexer service processes with SolrCloud on the same set of nodes.

To install the Lily HBase Indexer service on RHEL systems:

$ sudo yum install hbase-solr-indexer hbase-solr-doc

To install the Lily HBase Indexer service on Ubuntu and Debian systems:

$ sudo apt-get install hbase-solr-indexer hbase-solr-doc

To install the Lily HBase Indexer service on SUSE-based systems:

$ sudo zypper install hbase-solr-indexer hbase-solr-doc
  Important: For the Lily HBase Indexer to work with CDH 5, users may have to run the following command before issuing Lily HBase MapReduce jobs.
export HADOOP_CLASSPATH=<Path to hbase-protocol-**.jar>