DataStax Enterprise Search

Find Exactly What You're Looking For

Built for distributed, real-time applications that need to easily search data quickly and easily at cloud scale.

Document MetaData Search using Tika and DSEFS

Document MetaData Search using Tika and DSEFS

DSEFS (DataStax Enterprise file system) is a fault-tolerant, general-purpose, distributed file system within DataStax Enterprise. DSEFS is similar to HDFS, but avoids the deployment complexity and single point of failure typical of HDFS.


In this example, we load all the documents in a directory into DSEFS while extracting the metadata for indexing into DSE Search. First we query the data in DSE using cqlsh.

Document MetaData Search using Tika and DSEFS

Document MetaData Search using Tika and DSEFS

To then query for a particular word in the document we simply amend the initial query.

Key Features

Resources

READY TO TRY DATASTAX?

Spin up a cluster in the cloud with DataStax Astra, the best way to get started with Cassandra in just a few clicks with 10 GB for free!

Try For Free