This is the documentation for Cloudera Search CDH 5 Beta 2 and 1.2.0 for CDH 4.
Documentation for other versions is available at Cloudera Documentation.

Known Issues Fixed in Cloudera Search for CDH 5 Beta 2

Sample twitter-flume.conf included with Search refers to outdated package location

The file examples/solr-nrt/twitter-flume.conf includes the following reference:

agent.sources.twitterSrc.type = org.apache.flume.sink.solr.morphline.TwitterSource

TwitterSource is now bundled with Flume itself, so the reference should be:

agent.sources.twitterSrc.type = org.apache.flume.source.twitter.TwitterSource

Severity: Medium

Workaround: Edit twitter-flume.conf so that agent.sources.twitterSrc.type is set to org.apache.flume.source.twitter.TwitterSource.

readSequenceFile morphline command reuses values

The readSequenceFile morphline command reuses key and value Hadoop Writable objects across rows.

Downstream commands such as loadSolr, or the HBase indexer, buffer records before sending them to Solr. When several buffered records hold a reference to the same Hadoop Writable object as their primary key (id), they appear to be the same record, because they all have the same ID.
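
The aliasing effect can be reproduced without Hadoop. The following is a minimal sketch, not Cloudera Search or Hadoop code: ReusedKey is a hypothetical stand-in for a mutable Writable, and the list stands in for the buffer a downstream command might keep.

```java
import java.util.ArrayList;
import java.util.List;

class WritableReuseDemo {
    // Hypothetical stand-in for a mutable Hadoop Writable; not the real Hadoop class.
    static final class ReusedKey {
        private String value = "";
        void set(String v) { value = v; }
        @Override public String toString() { return value; }
    }

    // Simulates a reader that reuses one key object across rows
    // while a downstream step buffers a reference to it for each row.
    static List<Object> bufferWithReusedKey(String... ids) {
        ReusedKey key = new ReusedKey();      // single instance shared by all rows
        List<Object> buffer = new ArrayList<>();
        for (String id : ids) {
            key.set(id);                      // overwritten in place for each row
            buffer.add(key);                  // the buffer stores a reference, not a copy
        }
        return buffer;
    }

    public static void main(String[] args) {
        // Every buffered entry prints "doc-3", the last key written.
        for (Object id : bufferWithReusedKey("doc-1", "doc-2", "doc-3")) {
            System.out.println(id);
        }
    }
}
```

Because all three buffered entries point at the same object, they report the same ID once the buffer is flushed.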

Severity: Low

Workaround: Immediately after the readSequenceFile command in your morphline, insert the following commands:

toString { field : key }
toString { field : value }

This converts the key and value from Hadoop Writable objects to distinct String objects, so the key and value of each row have their own identity.
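
Why a string conversion is enough can be illustrated with a small sketch. This is not the morphline implementation: ReusedKey is a hypothetical stand-in for a reused Writable, and calling toString() before buffering plays the role that the toString command has in the workaround.

```java
import java.util.ArrayList;
import java.util.List;

class ToStringCopyDemo {
    // Hypothetical stand-in for a reused Hadoop Writable; not the real Hadoop class.
    static final class ReusedKey {
        private String value = "";
        void set(String v) { value = v; }
        @Override public String toString() { return value; }
    }

    // Converts the reused key to a String before buffering,
    // so each buffered entry keeps the value of its own row.
    static List<Object> bufferWithStringCopy(String... ids) {
        ReusedKey key = new ReusedKey();      // still a single reused instance
        List<Object> buffer = new ArrayList<>();
        for (String id : ids) {
            key.set(id);
            buffer.add(key.toString());       // immutable String, unaffected by later set() calls
        }
        return buffer;
    }

    public static void main(String[] args) {
        // Prints [doc-1, doc-2, doc-3]
        System.out.println(bufferWithStringCopy("doc-1", "doc-2", "doc-3"));
    }
}
```

The reader still reuses its key object, but the buffer no longer depends on it, so each row keeps a separate ID.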