aboutsummaryrefslogtreecommitdiff
path: root/src/backend/utils/adt/tsquery_gist.c
diff options
context:
space:
mode:
authorTom Lane <tgl@sss.pgh.pa.us>2007-10-23 20:46:12 +0000
committerTom Lane <tgl@sss.pgh.pa.us>2007-10-23 20:46:12 +0000
commitdbaec70c153239224c0288d865b96c2f939fbdf5 (patch)
treea2309acc315e5d4b9f9b0cd8b2ad60dc999ba93d /src/backend/utils/adt/tsquery_gist.c
parent344d0cae64dbf398559b855806fc7338ec0a2e64 (diff)
downloadpostgresql-dbaec70c153239224c0288d865b96c2f939fbdf5.tar.gz
postgresql-dbaec70c153239224c0288d865b96c2f939fbdf5.zip
Rename and slightly redefine the default text search parser's "word"
categories, as per discussion. asciiword (formerly lword) is still ASCII-letters-only, and numword (formerly word) is still the most general mixed-alpha-and-digits case. But word (formerly nlword) is now any-group-of-letters-with-at-least-one-non-ASCII, rather than all-non-ASCII as before. This is no worse than before for parsing mixed Russian/English text, which seems to have been the design center for the original coding; and it should simplify matters for parsing most European languages. In particular it will not be necessary for any language to accept strings containing digits as being regular "words". The hyphenated-word categories are adjusted similarly.
Diffstat (limited to 'src/backend/utils/adt/tsquery_gist.c')
0 files changed, 0 insertions, 0 deletions