Update snowball

PostgreSQL - Peter Eisentraut [eisentraut.org] - 8 June 2020 06:07 UTC

Update to snowball tag v2.0.0. Major changes are new stemmers for Basque, Catalan, and Hindi.

Discussion: https://www.postgresql.org/message-id/flat/a8eeabd6-2be1-43fe-401e-a97594c38478%402ndquadrant.com

cbcc8726bb Update snowball
src/backend/snowball/Makefile | 8 +
src/backend/snowball/README | 9 +-
src/backend/snowball/dict_snowball.c | 10 +
src/backend/snowball/libstemmer/api.c | 1 -
.../snowball/libstemmer/stem_ISO_8859_1_basque.c | 1184 ++++++++++++++
.../snowball/libstemmer/stem_ISO_8859_1_catalan.c | 1447 ++++++++++++++++++
.../snowball/libstemmer/stem_ISO_8859_1_danish.c | 15 +-
.../snowball/libstemmer/stem_ISO_8859_1_dutch.c | 100 +-
.../snowball/libstemmer/stem_ISO_8859_1_english.c | 41 +-
.../snowball/libstemmer/stem_ISO_8859_1_finnish.c | 27 +-
.../snowball/libstemmer/stem_ISO_8859_1_french.c | 64 +-
.../snowball/libstemmer/stem_ISO_8859_1_german.c | 26 +-
.../libstemmer/stem_ISO_8859_1_indonesian.c | 38 +-
.../snowball/libstemmer/stem_ISO_8859_1_irish.c | 13 +-
.../snowball/libstemmer/stem_ISO_8859_1_italian.c | 40 +-
.../libstemmer/stem_ISO_8859_1_norwegian.c | 11 +-
.../snowball/libstemmer/stem_ISO_8859_1_porter.c | 41 +-
.../libstemmer/stem_ISO_8859_1_portuguese.c | 49 +-
.../snowball/libstemmer/stem_ISO_8859_1_spanish.c | 34 +-
.../snowball/libstemmer/stem_ISO_8859_1_swedish.c | 11 +-
.../libstemmer/stem_ISO_8859_2_hungarian.c | 23 +-
.../snowball/libstemmer/stem_ISO_8859_2_romanian.c | 42 +-
.../snowball/libstemmer/stem_KOI8_R_russian.c | 46 +-
.../snowball/libstemmer/stem_UTF_8_arabic.c | 188 ++-
.../snowball/libstemmer/stem_UTF_8_basque.c | 1186 ++++++++++++++
.../snowball/libstemmer/stem_UTF_8_catalan.c | 1450 ++++++++++++++++++
.../snowball/libstemmer/stem_UTF_8_danish.c | 15 +-
src/backend/snowball/libstemmer/stem_UTF_8_dutch.c | 100 +-
.../snowball/libstemmer/stem_UTF_8_english.c | 41 +-
.../snowball/libstemmer/stem_UTF_8_finnish.c | 27 +-
.../snowball/libstemmer/stem_UTF_8_french.c | 64 +-
.../snowball/libstemmer/stem_UTF_8_german.c | 26 +-
src/backend/snowball/libstemmer/stem_UTF_8_greek.c | 1614 +++++++-------------
src/backend/snowball/libstemmer/stem_UTF_8_hindi.c | 332 ++++
.../snowball/libstemmer/stem_UTF_8_hungarian.c | 23 +-
.../snowball/libstemmer/stem_UTF_8_indonesian.c | 38 +-
src/backend/snowball/libstemmer/stem_UTF_8_irish.c | 13 +-
.../snowball/libstemmer/stem_UTF_8_italian.c | 40 +-
.../snowball/libstemmer/stem_UTF_8_lithuanian.c | 20 +-
.../snowball/libstemmer/stem_UTF_8_nepali.c | 20 +-
.../snowball/libstemmer/stem_UTF_8_norwegian.c | 11 +-
.../snowball/libstemmer/stem_UTF_8_porter.c | 41 +-
.../snowball/libstemmer/stem_UTF_8_portuguese.c | 49 +-
.../snowball/libstemmer/stem_UTF_8_romanian.c | 42 +-
.../snowball/libstemmer/stem_UTF_8_russian.c | 46 +-
.../snowball/libstemmer/stem_UTF_8_spanish.c | 34 +-
.../snowball/libstemmer/stem_UTF_8_swedish.c | 11 +-
src/backend/snowball/libstemmer/stem_UTF_8_tamil.c | 49 +-
.../snowball/libstemmer/stem_UTF_8_turkish.c | 14 +-
src/backend/snowball/libstemmer/utilities.c | 104 +-
src/bin/initdb/initdb.c | 6 +
src/include/snowball/libstemmer/header.h | 1 -
.../snowball/libstemmer/stem_ISO_8859_1_basque.h | 15 +
.../snowball/libstemmer/stem_ISO_8859_1_catalan.h | 15 +
.../snowball/libstemmer/stem_ISO_8859_1_danish.h | 3 +-
.../snowball/libstemmer/stem_ISO_8859_1_dutch.h | 3 +-
.../snowball/libstemmer/stem_ISO_8859_1_english.h | 3 +-
.../snowball/libstemmer/stem_ISO_8859_1_finnish.h | 3 +-
.../snowball/libstemmer/stem_ISO_8859_1_french.h | 3 +-
.../snowball/libstemmer/stem_ISO_8859_1_german.h | 3 +-
.../libstemmer/stem_ISO_8859_1_indonesian.h | 3 +-
.../snowball/libstemmer/stem_ISO_8859_1_irish.h | 3 +-
.../snowball/libstemmer/stem_ISO_8859_1_italian.h | 3 +-
.../libstemmer/stem_ISO_8859_1_norwegian.h | 3 +-
.../snowball/libstemmer/stem_ISO_8859_1_porter.h | 3 +-
.../libstemmer/stem_ISO_8859_1_portuguese.h | 3 +-
.../snowball/libstemmer/stem_ISO_8859_1_spanish.h | 3 +-
.../snowball/libstemmer/stem_ISO_8859_1_swedish.h | 3 +-
.../libstemmer/stem_ISO_8859_2_hungarian.h | 3 +-
.../snowball/libstemmer/stem_ISO_8859_2_romanian.h | 3 +-
.../snowball/libstemmer/stem_KOI8_R_russian.h | 3 +-
.../snowball/libstemmer/stem_UTF_8_arabic.h | 3 +-
.../snowball/libstemmer/stem_UTF_8_basque.h | 15 +
.../snowball/libstemmer/stem_UTF_8_catalan.h | 15 +
.../snowball/libstemmer/stem_UTF_8_danish.h | 3 +-
src/include/snowball/libstemmer/stem_UTF_8_dutch.h | 3 +-
.../snowball/libstemmer/stem_UTF_8_english.h | 3 +-
.../snowball/libstemmer/stem_UTF_8_finnish.h | 3 +-
.../snowball/libstemmer/stem_UTF_8_french.h | 3 +-
.../snowball/libstemmer/stem_UTF_8_german.h | 3 +-
src/include/snowball/libstemmer/stem_UTF_8_greek.h | 3 +-
src/include/snowball/libstemmer/stem_UTF_8_hindi.h | 15 +
.../snowball/libstemmer/stem_UTF_8_hungarian.h | 3 +-
.../snowball/libstemmer/stem_UTF_8_indonesian.h | 3 +-
src/include/snowball/libstemmer/stem_UTF_8_irish.h | 3 +-
.../snowball/libstemmer/stem_UTF_8_italian.h | 3 +-
.../snowball/libstemmer/stem_UTF_8_lithuanian.h | 3 +-
.../snowball/libstemmer/stem_UTF_8_nepali.h | 3 +-
.../snowball/libstemmer/stem_UTF_8_norwegian.h | 3 +-
.../snowball/libstemmer/stem_UTF_8_porter.h | 3 +-
.../snowball/libstemmer/stem_UTF_8_portuguese.h | 3 +-
.../snowball/libstemmer/stem_UTF_8_romanian.h | 3 +-
.../snowball/libstemmer/stem_UTF_8_russian.h | 3 +-
.../snowball/libstemmer/stem_UTF_8_spanish.h | 3 +-
.../snowball/libstemmer/stem_UTF_8_swedish.h | 3 +-
src/include/snowball/libstemmer/stem_UTF_8_tamil.h | 3 +-
.../snowball/libstemmer/stem_UTF_8_turkish.h | 3 +-
97 files changed, 6914 insertions(+), 2166 deletions(-)

Upstream: git.postgresql.org

