DataStax Enterprise Search
Document MetaData Search using Tika and DSEFS
DSEFS (DataStax Enterprise file system) is a fault-tolerant, general-purpose, distributed file system within DataStax Enterprise. DSEFS is similar to HDFS, but avoids the deployment complexity and single point of failure typical of HDFS.
In this example, we load all the documents in a directory into DSEFS and extract their metadata for indexing in DSE Search. First, we query the indexed data in DSE using cqlsh.
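The metadata-extraction step can be pictured with a minimal sketch. It is not the Tika-based loader this example uses. It is a standard-library stand-in that walks a directory and builds one record per file, so the shape of the data headed for the search index is easier to see. The function name `collect_metadata` and the record fields (`file_name`, `size_bytes`, `content_type`) are hypothetical, and the sketch neither writes to DSEFS nor reads document contents the way Tika would.

```python
import mimetypes
from pathlib import Path


def collect_metadata(directory):
    """Return one metadata record per regular file under `directory`.

    Stand-in for Tika: the content type is only guessed from the file
    extension, and the record fields are illustrative assumptions.
    """
    records = []
    for path in sorted(Path(directory).rglob("*")):
        if not path.is_file():
            continue  # skip subdirectories themselves
        mime_type, _ = mimetypes.guess_type(path.name)
        records.append({
            "file_name": path.name,
            "size_bytes": path.stat().st_size,
            "content_type": mime_type or "application/octet-stream",
        })
    return records
```

Each record would correspond to one row to be indexed. A real loader would also copy the file itself into DSEFS and take far richer metadata from Tika.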
To search for a particular word in a document, we simply amend the initial query.
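The exact queries are not reproduced here, so the following is only a sketch of what the initial and amended queries might look like. It relies on the `solr_query` column that DSE Search exposes to CQL. The table name `docs.metadata`, the field name `content`, and the helper `build_search_query` are assumptions made for illustration.

```python
import re


def build_search_query(table, term=None, field="content"):
    """Build a CQL SELECT that filters through DSE Search's solr_query.

    With no term, match every indexed row; with a term, restrict the
    search to rows whose `field` contains that word. The table and
    field names are hypothetical placeholders.
    """
    if term is None:
        solr_query = "*:*"  # match-all query
    else:
        # Accept a single plain word only, so no quoting or escaping is needed.
        if not re.fullmatch(r"\w+", term):
            raise ValueError("term must be a single word")
        solr_query = f"{field}:{term}"
    return f"SELECT * FROM {table} WHERE solr_query = '{solr_query}';"


# Initial query, then the amended query for a particular word.
print(build_search_query("docs.metadata"))
print(build_search_query("docs.metadata", "cassandra"))
```

The printed statements can be pasted into cqlsh. The only difference between them is the `solr_query` string, which is the amendment the text describes.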