Back to top

HDFS Connector for Elasticsearch

HDFS (Hadoop Distributed File System) is the main distributed storage used by Hadoop applications.

The HDFS connector for Elasticsearch is a part of Search Technologies’ range of connectors designed to support data connectivity between search engines and third-party repositories. This connector brings content from HDFS into Elasticsearch securely and is customizable based on your organization’s specific requirements.

See our complete list of available connectors for Elasticsearch.

STANDARD HDFS-ELASTICSEARCH CONNECTOR FEATURES

  • Metadata extraction
  • Incremental crawling 
  • Runs from any machine with HTTP access to the given HDFS Namenode
  • Filters the crawled documents by paths (including file names) using regex patterns
  • Support for Kerberized Clusters 

The ones above are standard features. Additional functionalities can be requested.

CUSTOMIZATION, SUPPORT, AND MAINTENANCE

Search Technologies provides planning, implementation, support, and maintenance for the HDFS connector through our professional service engagement. 


Contact us for details and pricing.

0