allegro/elasticsearch-analysis-morfologik

Name: elasticsearch-analysis-morfologik

Owner: Allegro Tech

Description: Morfologik Polish Lemmatizer plugin for Elasticsearch

Created: 2017-03-22 08:10:35.0

Updated: 2018-04-23 10:20:18.0

Pushed: 2018-04-23 10:21:57.0

Homepage: null

Size: 207

Language: Java

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

Morfologik Polish Lemmatizer plugin for Elasticsearch

Build Status Maven Central

Morfologik plugin for elasticsearch 6.x, 5.x and 2.x. It's lucene-analyzers-morfologik wrapper for elasticsearch.

Plugin provide “morfologik” analyzer and “morfologik_stem” token filter.

Originally created by https://github.com/monterail/elasticsearch-analysis-morfologik

Build

./gradlew clean build

Examples
“morfologik” analyzer

Request:

_analyze

nalyzer": "morfologik",
ext": "jestem"

Response:


okens": [
{
  "token": "by?",
  "start_offset": 0,
  "end_offset": 6,
  "type": "<ALPHANUM>",
  "position": 0
}


“morfologik_stem” token filter

Request:

_analyze

okenizer": "standard",
ilter": ["morfologik_stem"],
ext": "jestem"

Response:


okens": [
{
  "token": "by?",
  "start_offset": 0,
  "end_offset": 6,
  "type": "<ALPHANUM>",
  "position": 0
}


Supported elasticsearch versions:

All ready to install plugins are deployed to maven central: http://central.maven.org/maven2/pl/allegro/tech/elasticsearch/plugin/elasticsearch-analysis-morfologik/

Elasticsearch 6.x
Elasticsearch 5.x
Install in Elasticsearch 5.x and 6.x
directory>/bin/elasticsearch-plugin install \
.allegro.tech.elasticsearch.plugin:elasticsearch-analysis-morfologik:6.2.4

for version <= 5.4.0

directory>/bin/elasticsearch-plugin install \
tp://central.maven.org/maven2/pl/allegro/tech/elasticsearch/plugin/elasticsearch-analysis-morfologik/5.3.1/elasticsearch-analysis-morfologik-5.3.1-plugin.zip

tip: select proper version in url to plugin

Elasticsearch 2.x
Install in Elasticsearch 2.x
directory>/bin/plugin install \
tp://central.maven.org/maven2/pl/allegro/tech/elasticsearch/plugin/elasticsearch-analysis-morfologik/2.4.2/elasticsearch-analysis-morfologik-2.4.2-plugin.zip

tip: select proper version in url to plugin


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.