postgresql - postgresql mirror

	Commit message	Author	Age
*	Remove unnecessary type violation in tsvectorrecv().	Tom Lane	2025-04-02
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	compareentry() is declared to work on WordEntryIN structs, but tsvectorrecv() is using it in two places to work on WordEntry structs. This is almost okay, since WordEntry is the first field of WordEntryIN. But on machines with 8-byte pointers, WordEntryIN will have a larger alignment spec than WordEntry, and it's at least theoretically possible that the compiler could generate code that depends on the larger alignment. Given the lack of field reports, this may be just a hypothetical bug that upsets nothing except sanitizer tools. Or it may be real on certain hardware but nobody's tried to use tsvectorrecv() on such hardware. In any case we should fix it, and the fix is trivial: just change compareentry() so that it works on WordEntry without any mention of WordEntryIN. We can also get rid of the quite-useless intermediate function WordEntryCMP. Bug: #18875 Reported-by: Alexander Lakhin <exclusion@gmail.com> Author: Tom Lane <tgl@sss.pgh.pa.us> Discussion: https://postgr.es/m/18875-07a29c49c825a608@postgresql.org Backpatch-through: 13
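The shape of that fix can be sketched as follows. This is a minimal illustration, not the in-tree code: `SketchEntry`, `SketchEntryIN`, and `compare_sketch_entry` are hypothetical stand-ins for `WordEntry`, `WordEntryIN`, and `compareentry()`, and the single `key` field is an assumption made to keep the example short. The comparator only ever dereferences the narrower type, so it never assumes the stricter alignment of the wrapper struct.

```c
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical narrow entry: needs only 4-byte alignment. */
typedef struct
{
    uint32_t    key;
} SketchEntry;

/* Hypothetical wrapper: the pointer member raises the alignment requirement. */
typedef struct
{
    SketchEntry entry;          /* must stay the first member */
    void       *extra;
} SketchEntryIN;

/* Comparator declared purely in terms of the narrow type. */
static int
compare_sketch_entry(const void *a, const void *b)
{
    const SketchEntry *ea = a;
    const SketchEntry *eb = b;

    return (ea->key > eb->key) - (ea->key < eb->key);
}

/* The same comparator serves arrays of either struct. */
static void
sort_plain(SketchEntry *v, size_t n)
{
    qsort(v, n, sizeof(*v), compare_sketch_entry);
}

static void
sort_wide(SketchEntryIN *v, size_t n)
{
    qsort(v, n, sizeof(*v), compare_sketch_entry);
}
```

Reading a wrapper through a pointer to its first member is well defined in C; the reverse direction (reading a narrow entry through the wrapper type) is what the commit removes.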
*	Update copyright for 2025	Bruce Momjian	2025-01-01
\| \| \| \|	Backpatch-through: 13
*	Remove unused #include's from backend .c files	Peter Eisentraut	2024-03-04
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	as determined by include-what-you-use (IWYU) While IWYU also suggests to add a bunch of #include's (which is its main purpose), this patch does not do that. In some cases, a more specific #include replaces another less specific one. Some manual adjustments of the automatic result: - IWYU currently doesn't know about includes that provide global variable declarations (like -Wmissing-variable-declarations), so those includes are being kept manually. - All includes for port(ability) headers are being kept for now, to play it safe. - No changes of catalog/pg_foo.h to catalog/pg_foo_d.h, to keep the patch from exploding in size. Note that this patch touches just *.c files, so nothing declared in header files changes in hidden ways. As a small example, in src/backend/access/transam/rmgr.c, some IWYU pragma annotations are added to handle a special case there. Discussion: https://www.postgresql.org/message-id/flat/af837490-6b2f-46df-ba05-37ea6a6653fc%40eisentraut.org
*	Use new overflow-safe integer comparison functions.	Nathan Bossart	2024-02-16
\| \| \| \| \| \| \| \| \| \| \| \|	Commit 6b80394781 introduced integer comparison functions designed to be as efficient as possible while avoiding overflow. This commit makes use of these functions in many of the in-tree qsort() comparators to help ensure transitivity. Many of these comparator functions should also see a small performance boost. Author: Mats Kindahl Reviewed-by: Andres Freund, Fabrízio de Royes Mello Discussion: https://postgr.es/m/CA%2B14426g2Wa9QuUpmakwPxXFWG_1FaY0AsApkvcTBy-YfS6uaw%40mail.gmail.com
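Why a subtraction-based comparator is risky, and what an overflow-free one looks like, can be sketched as below. `cmp_s32` is a hypothetical stand-in written for this illustration; it is not claimed to be the function introduced by commit 6b80394781.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

/*
 * Hypothetical overflow-safe comparison. Each relational expression
 * yields 0 or 1, so the result is always -1, 0, or 1.
 */
static inline int
cmp_s32(int32_t a, int32_t b)
{
    return (a > b) - (a < b);
}

/*
 * qsort() comparator built on the helper. Writing "return a - b" here
 * would overflow for inputs such as INT32_MIN and 1, which is undefined
 * behavior and can make the ordering non-transitive.
 */
static int
cmp_int32_elems(const void *pa, const void *pb)
{
    return cmp_s32(*(const int32_t *) pa, *(const int32_t *) pb);
}

static void
sort_int32(int32_t *v, size_t n)
{
    assert(v != NULL || n == 0);
    qsort(v, n, sizeof(int32_t), cmp_int32_elems);
}
```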
*	Update copyright for 2024	Bruce Momjian	2024-01-03
\| \| \| \| \| \| \| \|	Reported-by: Michael Paquier Discussion: https://postgr.es/m/ZZKTDPxBBMt3C0J9@paquier.xyz Backpatch-through: 12
*	Fix datalen calculation in tsvectorrecv().	Tom Lane	2023-10-01
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	After receiving position data for a lexeme, tsvectorrecv() advanced its "datalen" value by (npos+1)*sizeof(WordEntry) where the correct calculation is (npos+1)*sizeof(WordEntryPos). This accidentally failed to render the constructed tsvector invalid, but it did result in leaving some wasted space approximately equal to the space consumed by the position data. That could have several bad effects: * Disk space is wasted if the received tsvector is stored into a table as-is. * A legal tsvector could get rejected with "maximum total lexeme length exceeded" if the extra space pushes it over the MAXSTRPOS limit. * In edge cases, the finished tsvector could be assigned a length larger than the allocated size of its palloc chunk, conceivably leading to SIGSEGV when the tsvector gets copied somewhere else. The odds of a field failure of this sort seem low, though valgrind testing could probably have found this. While we're here, let's express the calculation as "sizeof(uint16) + npos * sizeof(WordEntryPos)" to avoid the type pun implicit in the "npos + 1" formulation. It's not wrong given that WordEntryPos had better be 2 bytes to avoid padding problems, but it seems clearer this way. Report and patch by Denis Erokhin. Back-patch to all supported versions. Discussion: https://postgr.es/m/009801d9f2d9$f29730c0$d7c59240$@datagile.ru
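The arithmetic behind the wasted space can be checked with a small sketch. The types below are hypothetical stand-ins sized on the assumption that a `WordEntry` occupies 4 bytes and a `WordEntryPos` 2 bytes; the function names are illustrative only.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Stand-ins with the assumed sizes: 4-byte entry, 2-byte position. */
typedef struct
{
    uint32_t    packed;
} SketchWordEntry;

typedef uint16_t SketchWordEntryPos;

/* Old formulation: scales by the size of the wrong type. */
static size_t
pos_bytes_buggy(size_t npos)
{
    return (npos + 1) * sizeof(SketchWordEntry);
}

/* New formulation: a uint16 count followed by npos positions. */
static size_t
pos_bytes_fixed(size_t npos)
{
    return sizeof(uint16_t) + npos * sizeof(SketchWordEntryPos);
}
```

Under those assumptions, three positions need 8 bytes but the old code advanced by 16, so the surplus equals the size of the position data itself, which matches the "approximately equal" description above.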
*	Remove useless casts to (void *) in arguments of some system functions	Peter Eisentraut	2023-02-07
\| \| \| \| \| \| \| \|	The affected functions are: bsearch, memcmp, memcpy, memset, memmove, qsort, repalloc Reviewed-by: Corey Huinker <corey.huinker@gmail.com> Discussion: https://www.postgresql.org/message-id/flat/fd9adf5d-b1aa-e82f-e4c7-263c30145807%40enterprisedb.com
*	New header varatt.h split off from postgres.h	Peter Eisentraut	2023-01-10
\| \| \| \| \| \| \| \| \|	This new header contains all the variable-length data types support (TOAST support) from postgres.h, which isn't needed by large parts of the backend code. Reviewed-by: Tom Lane <tgl@sss.pgh.pa.us> Discussion: https://www.postgresql.org/message-id/flat/ddcce239-0f29-6e62-4b47-1f8ca742addf%40enterprisedb.com
*	Update copyright for 2023	Bruce Momjian	2023-01-02
\| \| \| \|	Backpatch-through: 11
*	Convert tsqueryin and tsvectorin to report errors softly.	Tom Lane	2022-12-27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is slightly tedious because the adjustments cascade through a couple of levels of subroutines, but it's not very hard. I chose to avoid changing function signatures more than absolutely necessary, by passing the escontext pointer in existing structs where possible. tsquery's nuisance NOTICEs about empty queries are suppressed in soft-error mode, since they're not errors and we surely don't want them to be shown to the user anyway. Maybe that whole behavior should be reconsidered. Discussion: https://postgr.es/m/3824377.1672076822@sss.pgh.pa.us
*	Update copyright for 2022	Bruce Momjian	2022-01-07
\| \| \| \|	Backpatch-through: 10
*	Update copyright for 2021	Bruce Momjian	2021-01-02
\| \| \| \|	Backpatch-through: 9.5
*	Update copyrights for 2020	Bruce Momjian	2020-01-01
\| \| \| \|	Backpatch-through: update all files in master, backpatch legal files through 9.4
*	Add reusable routine for making arrays unique.	Thomas Munro	2019-11-07
\| \| \| \| \| \| \| \| \| \|	Introduce qunique() and qunique_arg(), which can be used after qsort() and qsort_arg() respectively to remove duplicate values. Use it where appropriate. Author: Thomas Munro Reviewed-by: Tom Lane (in an earlier version) Discussion: https://postgr.es/m/CAEepm%3D2vmFTNpAmwbGGD2WaryM6T3hSDVKQPfUwjdD_5XY6vAA%40mail.gmail.com
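The sort-then-deduplicate pattern described here can be sketched as follows. `unique_sorted` is a hypothetical helper written for this illustration and is not presented as the actual qunique() source; it assumes the array has already been sorted with the same comparator.

```c
#include <stdlib.h>
#include <string.h>

/*
 * Remove adjacent duplicates from a sorted array in place and return the
 * new element count. Hypothetical sketch of the technique.
 */
static size_t
unique_sorted(void *array, size_t n, size_t width,
              int (*cmp) (const void *, const void *))
{
    char       *base = array;
    size_t      i;
    size_t      j;              /* index of the last element kept */

    if (n <= 1)
        return n;

    for (i = 1, j = 0; i < n; i++)
    {
        /* Keep element i only if it differs from the last kept element. */
        if (cmp(base + i * width, base + j * width) != 0 && ++j != i)
            memcpy(base + j * width, base + i * width, width);
    }
    return j + 1;
}

static int
cmp_int(const void *a, const void *b)
{
    int         x = *(const int *) a;
    int         y = *(const int *) b;

    return (x > y) - (x < y);
}

/* Sort, then drop duplicates; returns the number of distinct values. */
static size_t
sort_unique_ints(int *v, size_t n)
{
    qsort(v, n, sizeof(int), cmp_int);
    return unique_sorted(v, n, sizeof(int), cmp_int);
}
```

The `++j != i` test also skips the copy when source and destination would be the same slot.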
*	Update copyright for 2019	Bruce Momjian	2019-01-02
\| \| \| \|	Backpatch-through: certain files through 9.4
*	Add websearch_to_tsquery	Teodor Sigaev	2018-04-05
\| \| \| \| \| \| \| \| \| \| \| \|	Error-tolerant conversion function with web-like syntax for search query, it simplifies constraining search engine with close to habitual interface for users. Bump catalog version Authors: Victor Drobny, Dmitry Ivanov with editorization by me Reviewed by: Aleksander Alekseev, Tomas Vondra, Thomas Munro, Aleksandr Parfenov Discussion: https://www.postgresql.org/message-id/flat/fe931111ff7e9ad79196486ada79e268@postgrespro.ru
*	Update copyright for 2018	Bruce Momjian	2018-01-02
\| \| \| \|	Backpatch-through: certain files through 9.3
*	Replace remaining uses of pq_sendint with pq_sendint{8,16,32}.	Andres Freund	2017-10-11
\| \| \| \| \| \| \|	pq_sendint() remains, so extension code doesn't unnecessarily break. Author: Andres Freund Discussion: https://postgr.es/m/20170914063418.sckdzgjfrsbekae4@alap3.anarazel.de
*	Generate fmgr prototypes automatically	Peter Eisentraut	2017-01-17
\| \| \| \| \| \| \| \| \| \| \| \|	Gen_fmgrtab.pl creates a new file fmgrprotos.h, which contains prototypes for all functions registered in pg_proc.h. This avoids having to manually maintain these prototypes across a random variety of header files. It also automatically enforces a correct function signature, and since there are warnings about missing prototypes, it will detect functions that are defined but not registered in pg_proc.h (or otherwise used). Reviewed-by: Pavel Stehule <pavel.stehule@gmail.com>
*	Update copyright via script for 2017	Bruce Momjian	2017-01-03
\|
*	Rename comparePos() to compareWordEntryPos()	Teodor Sigaev	2016-04-08
\| \| \| \| \| \| \|	Rename comparePos() to compareWordEntryPos() to prevent export of too generic name. Per gripe from Tom Lane.
*	Phrase full text search.	Teodor Sigaev	2016-04-07
\| \| \| \| \| \| \| \| \| \| \| \| \|	Patch introduces new text search operator (<-> or <DISTANCE>) into tsquery. On-disk and binary in/out format of tsquery are backward compatible. It has two side effects: - change of ordering for tsquery, so users who have a btree index over tsquery should reindex it - fewer parentheses in tsquery output, so tsquery becomes more readable Authors: Teodor Sigaev, Oleg Bartunov, Dmitry Ivanov Reviewers: Alexander Korotkov, Artur Zakirov
*	Update copyright for 2016	Bruce Momjian	2016-01-02
\| \| \| \|	Backpatch certain files through 9.1
*	Update copyright for 2015	Bruce Momjian	2015-01-06
\| \| \| \|	Backpatch certain files through 9.0
*	Avoid memcpy() with same source and destination address.	Heikki Linnakangas	2014-03-07
\| \| \| \| \| \| \|	The behavior of that is undefined, although unlikely to lead to problems in practice. Found by running regression tests with Valgrind.
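A typical place this arises is an in-place compaction loop, where the read and write cursors coincide until the first element is dropped. The sketch below is a hypothetical illustration of the guard (the `SketchItem` type and the "negative key means drop" rule are assumptions), not the code changed by this commit.

```c
#include <string.h>

typedef struct
{
    int         key;
    int         payload;
} SketchItem;

/*
 * Compact the array in place, dropping items with a negative key.
 * Returns the number of items kept.
 */
static size_t
compact_items(SketchItem *items, size_t n)
{
    size_t      out = 0;

    for (size_t i = 0; i < n; i++)
    {
        if (items[i].key < 0)
            continue;

        /* Skip the copy while source and destination are the same slot. */
        if (out != i)
            memcpy(&items[out], &items[i], sizeof(SketchItem));
        out++;
    }
    return out;
}
```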
*	Update copyright for 2014	Bruce Momjian	2014-01-07
\| \| \| \| \|	Update all files in head, and files COPYRIGHT and legal.sgml in all back branches.
*	Update copyrights for 2013	Bruce Momjian	2013-01-01
\| \| \| \| \|	Fully update git head, and update back branches in ./COPYRIGHT and legal.sgml files.
*	Replace int2/int4 in C code with int16/int32	Peter Eisentraut	2012-06-25
\| \| \| \| \| \| \| \| \| \|	The latter was already the dominant use, and it's preferable because in C the convention is that intXX means XX bits. Therefore, allowing mixed use of int2, int4, int8, int16, int32 is obviously confusing. Remove the typedefs for int2 and int4 for now. They don't seem to be widely used outside of the PostgreSQL source tree, and the few uses can probably be cleaned up by the time this ships.
*	Remove unnecessary pg_verifymbstr() calls from tsvector/query in functions.	Heikki Linnakangas	2012-05-14
\| \| \| \| \|	The input should've been validated well before it hits the input function. Doing so again is a waste of cycles.
*	Update copyright notices for year 2012.	Bruce Momjian	2012-01-01
\|
*	Remove unnecessary #include references, per pgrminclude script.	Bruce Momjian	2011-09-01
\|
*	Stamp copyrights for year 2011.	Bruce Momjian	2011-01-01
\|
*	Remove cvs keywords from all files.	Magnus Hagander	2010-09-20
\|
*	Update copyright for the year 2010.	Bruce Momjian	2010-01-02
\|
*	8.4 pgindent run, with new combined Linux/FreeBSD/MinGW typedef list	Bruce Momjian	2009-06-11
\| \| \| \|	provided by Andrew.
*	Resort tsvector's lexemes in tsvectorrecv instead of emitting an error.	Teodor Sigaev	2009-05-21
\| \| \| \| \| \| \|	Basically, it's needed to support binary dump from 8.3 because ordering rule was changed. Per discussion with Bruce.
*	Removed comparison of unsigned expression < 0.	Michael Meskes	2009-05-21
\|
*	Update copyright for 2009.	Bruce Momjian	2009-01-01
\|
*	Extend GIN to support partial-match searches, and extend tsquery to support	Tom Lane	2008-05-16
\| \| \| \| \| \|	prefix matching using this facility. Teodor Sigaev and Oleg Bartunov
*	Fix unportable coding of new error message, per Kris Jurka.	Tom Lane	2008-03-10
\|
*	When text search string is too long, in error message report actual and	Bruce Momjian	2008-03-05
\| \| \| \|	maximum number of bytes allowed.
*	Update copyrights in source tree to 2008.	Bruce Momjian	2008-01-01
\|
*	Make a cleanup pass over error reports in tsearch code. Use ereport	Tom Lane	2007-11-28
\| \| \| \| \|	for user-facing errors, fix some poor choices of errcode, adhere to message style guide.
*	Fix tsvectorout() and tsqueryout() to escape backslash, add test of that.	Teodor Sigaev	2007-11-16
\| \| \| \| \| \|	Patch by Bruce Momjian <bruce@momjian.us> Backpatch is needed, but it's impossible to apply it directly
*	Re-run pgindent with updated list of typedefs. (Updated README should	Bruce Momjian	2007-11-15
\| \| \| \|	avoid this problem in the future.)
*	pgindent run for 8.3.	Bruce Momjian	2007-11-15
\|
*	Fix several bugs in tsvectorin, including crash due to uninitialized field and	Tom Lane	2007-10-23
\| \| \| \| \| \| \| \| \| \| \| \| \|	miscomputation of required palloc size. The crash could only occur if the input contained lexemes both with and without positions, which is probably not common in practice. The miscomputation would definitely result in wasted space. Also fix some inconsistent coding around alignment of strings and positions in a tsvector value; these errors could also lead to crashes given mixed with/without position data and a machine that's picky about alignment. And be more careful about checking for overflow of string offsets. Patch is only against HEAD --- I have not looked to see if same bugs are in back-branch contrib/tsearch2 code.
*	Fix shared tsvector/tsquery input code so that we don't say "syntax error in	Tom Lane	2007-10-21
\| \| \| \| \|	tsvector" when we are really parsing a tsquery. Report the bogus input, too. Make styles of some related error messages more consistent.
*	Improvements from Heikki Linnakangas <heikki@enterprisedb.com>	Teodor Sigaev	2007-09-07
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	- change the alignment requirement of lexemes in TSVector slightly. Lexeme strings were always padded to 2-byte aligned length to make sure that if there's position array (uint16[]) it has the right alignment. The patch changes that so that the padding is not done when there's no positions. That makes the storage of tsvectors without positions slightly more compact. - added some #include "miscadmin.h" lines I missed in the earlier when I added calls to check_stack_depth(). - Reimplement the send/recv functions, and added a comment above them describing the on-wire format. The CRC is now recalculated in tsquery as well per previous discussion.
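The padding rule from the first item can be sketched as a small helper. `lexeme_stored_len` is a hypothetical name, and the rounding expression is an assumption about how 2-byte alignment would be computed, not a quotation of the in-tree macros.

```c
#include <stddef.h>

/*
 * Bytes occupied by a lexeme string of length len. Padding to an even
 * length is applied only when a uint16 position array follows it.
 */
static size_t
lexeme_stored_len(size_t len, int haspos)
{
    if (haspos)
        return (len + 1) & ~(size_t) 1;     /* round up to a multiple of 2 */
    return len;
}
```

For a 3-byte lexeme this gives 4 bytes with positions and 3 bytes without, which is the saving the message refers to.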
*	Refactoring by Heikki Linnakangas <heikki@enterprisedb.com> with	Teodor Sigaev	2007-09-07
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	small editorization by me - Break the QueryItem struct into QueryOperator and QueryOperand. Type was really the only common field between them. QueryItem still exists, and is used in the TSQuery struct as before, but it's now a union of the two. Many other changes fell from that, like separation of pushval_asis function into pushValue, pushOperator and pushStop. - Moved some structs that were for internal use only from header files to the right .c-files. - Moved tsvector parser to a new tsvector_parser.c file. Parser code was about half of the size of tsvector.c, it's also used from tsquery.c, and it has some data structures of its own, so it seems better to separate it. Cleaned up the API so that TSVectorParserState is not accessed from outside tsvector_parser.c. - Separated enumerations (#defines, really) used for QueryItem.type field and as return codes from gettoken_query. It was just accidental code sharing. - Removed ParseQueryNode struct used internally by makepol and friends. push*-functions now construct QueryItems directly. - Changed int4 variables to just ints for variables like "i" or "array size", where the storage-size was not significant.