Skip to content

Ingest processor that extracts emails and store it in an array of different fields

License

Notifications You must be signed in to change notification settings

nthreads/elasticsearch-ingest-email-extract

Repository files navigation

Elasticsearch email-extract Ingest Processor

Explain the use case of this processor in a TLDR fashion.

Usage

PUT _ingest/pipeline/email-extract-pipeline
{
  "description": "Ingest processor that extracts emails and store it in an array of different fields",
  "processors": [
    {
      "email_extract" : {
        "field" : "my_field"
      }
    }
  ]
}

PUT /my-index/my-type/1?pipeline=email-extract-pipeline
{
  "my_field" : "You can reach out to me at [email protected] or at [email protected] or commenting on www.nthreads.com"
}

GET /my-index/my-type/1
{
  "my_field" : "Some content"
  "potentially_enriched_field": "potentially_enriched_value"
}

Configuration

Parameter Use
some.setting Configure x
other.setting Configure y

Setup

In order to install this plugin, you need to create a zip distribution first by running

gradle clean check

This will produce a zip file in build/distributions.

After building the zip file, you can install it like this

bin/elasticsearch-plugin install file:///path/to/ingest-email-extract/build/distribution/ingest-email-extract-0.0.1-SNAPSHOT.zip

Bugs & TODO

  • Add Link extraction

About

Ingest processor that extracts emails and store it in an array of different fields

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages