13 Nov 08:22

github-actions

487e21e

v3.3.0 Latest

Latest

Highlights

Offset correction of SudachiSplitFilter now works properly with CharFilter #149
SPI is changed to implement #149
- New methods are added to MorphemeAttribute
Add allow_empty_morpheme setting to the tokenizer (#151)
- If false (default), when a char is split into multiple morphemes (e.g. ㍿), all morphemes will contain the char in their span.
- If true, only the first morpheme will contain the char and the span of other morphemes may be empty.
  - Previously this was set true by default.

Assets 35

elasticsearch-7.10.2-analysis-sudachi-3.3.0.zip

2.11 MB 2024-11-13T08:09:17Z
elasticsearch-7.14.2-analysis-sudachi-3.3.0.zip

2.11 MB 2024-11-13T08:09:17Z
elasticsearch-7.17.24-analysis-sudachi-3.3.0.zip

2.11 MB 2024-11-13T08:09:17Z
elasticsearch-7.17.8-analysis-sudachi-3.3.0.zip

2.11 MB 2024-11-13T08:09:17Z
elasticsearch-8.10.4-analysis-sudachi-3.3.0.zip

2.11 MB 2024-11-13T08:09:17Z
elasticsearch-8.11.4-analysis-sudachi-3.3.0.zip

2.11 MB 2024-11-13T08:09:17Z
elasticsearch-8.12.2-analysis-sudachi-3.3.0.zip

2.11 MB 2024-11-13T08:09:17Z
elasticsearch-8.13.4-analysis-sudachi-3.3.0.zip

2.11 MB 2024-11-13T08:09:17Z
elasticsearch-8.14.3-analysis-sudachi-3.3.0.zip

2.11 MB 2024-11-13T08:09:17Z
elasticsearch-8.15.2-analysis-sudachi-3.3.0.zip

2.11 MB 2024-11-13T08:09:17Z
Source code (zip)

2024-11-13T07:59:08Z
Source code (tar.gz)

2024-11-13T07:59:08Z

16 Oct 07:22

github-actions

v3.2.3

65c1f21

v3.2.3

Highlights

support latest elasticsearch and opensearch versions (#144)
- es: 8.14.3, 8.15.2, 7.17.24
- os: 2.15.0, 2.16.0, 2.17.1

Assets 35

04 Jul 06:29

mh-northlander

v3.2.2

0e56378

Release v3.2.2

Highlights

Use lazyTokenizeSentences for the analysis to fix the problem of input chunking (#137).

Breaking Changes

Chunking behavior in v3.2.1 is fixed.
- Analysis works same as v.3.2.0.

Assets 30

14 Jun 05:53

mh-northlander

v3.2.1

6b73b1a

Release v3.2.1

Highlights

Fix OOM issue with a huge input by @kenmasumitsu in #132
- Huge input now split into relatively small (1M char) chunks now.
- Analysis maybe broken around the edge of chunks (open issue, see #131).
Add documentation about Sudachi synonym dict by @sorami in #65

Breaking Changes

Huge (>1M char) input now split into chunks to avoid OOM and the analysis may be broken around the edge of chunks (open issue, see #131).

Contributors

sorami and kenmasumitsu

Assets 29

30 May 08:12

mh-northlander

v3.2.0

7d9f7da

Release v3.2.0

Highlights

Explain with morpheme attribute (#121)
Synonym filter and Sudachi filters can be used in any order (#122)

Breaking Change

MorphemeConsumerAttribute is removed from SPI.
- You can just remove related code to migrate.

Assets 29

17 May 04:49

mh-northlander

v3.1.1

932df4e

Release v3.1.1

Highlights

Support Elasticsearch -8.13.4 and OpenSearch -2.14.0
Fix dictionary caching problem

Assets 28

26 Jun 01:49

eiennohito

v3.1.0

3b179a1

Release v3.1.0

Highlights

OpenSearch support
Fix trimming problems
Extensibility support

OpenSearch support

We now support OpenSearch in addition to Elasticsearch. Plugins should work the same way as with Elasticsearch.
For the time being we test only on 2.6.0 and upper. There are no plans for supporting 1.* branch at the time being.

Because of OpenSearch support we changed the naming scheme of distribution zip to <engine kind>-<engine version>-analysis-sudachi-<plugin-version>.zip

Extensibility support

analysis-sudachi plugin now support being extended by other plugins, both in OpenSearch and Elasticsearch.
When extending analysis-sudachi please use sudachi-search-spi artifact as a provided dependency. We plan to have SPI stable, but the internal implementation of analysis-sudachi will not be stable.

In Elasticsearch 8.3.*+ we utilize SPI-aware packaging and internal implementation will not be available to extending plugins.

Internal & Testing Improvements

For improving quality of releases we have greatly improved testing. Previously there were only unit tests for analysis logic, from 3.0.0 there are additionally (tier-2) integration tests which spawn full Elasticsearch instance and execute workload which perform parallel document indexing following by validation of results.

From 3.1.0 we have added tier-1 integration tests which perform relatively simple validation, however these tests are executed inside SecurityManager-present JVMs, simulating fully-fledged Elasticsearch instance. We hope that this procedure will increase quality of releases and help us to catch issues faster.

Assets 17

10 Mar 00:46

eiennohito

v3.0.1

1a1fb60

v3.0.1

Highlights

Upgrade Sudachi to 0.7.1 which contains serious fixes for streaming analysis

Assets 11

19 Jan 09:05

eiennohito

v3.0.0

c4270bd

v3.0.0

Highlights

Sudachi is updated to 0.7.0
Analysis results are cached within a single index
All versions of ElasticSearch are supported by a single branch with some conditional compilation Gradle magic
Implementation now uses Kotlin inside

Analysis cache

Previous versions of ES Sudachi plugin were analyzing the input multiple times when using multiple analyzer chains (e.g. mode A, mode B, mode C, readings, dictionary form) for the same field. From this version, the underlying analysis is done only once, yielding n times speedup, where n is a number of configured analysis chains which stem from Sudachi.

Assets 17

29 Dec 05:00

kazuma-t

v2.1.0-es5.6

ae2c25f

v2.1.0 for Elasticsearch 5.6

Added a new property additional_settings to write Sudachi settings directly in config
Added support for specifying Elasticsearch version at build time

Assets 3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Highlights

Highlights

Highlights

Breaking Changes

Highlights

Breaking Changes

Contributors

Highlights

Breaking Change

Highlights

Highlights

OpenSearch support

Extensibility support

Internal & Testing Improvements

Highlights

Highlights

Analysis cache

Releases: WorksApplications/elasticsearch-sudachi

v3.3.0

Highlights

v3.2.3

Highlights

Release v3.2.2

Highlights

Breaking Changes

Release v3.2.1

Highlights

Breaking Changes

Contributors

Release v3.2.0

Highlights

Breaking Change

Release v3.1.1

Highlights

Release v3.1.0

Highlights

OpenSearch support

Extensibility support

Internal & Testing Improvements

v3.0.1

Highlights

v3.0.0

Highlights

Analysis cache

v2.1.0 for Elasticsearch 5.6