postgresql - postgresql mirror

	Commit message (Collapse)	Author	Age
*	pgindent run for 9.0	Bruce Momjian	2010-02-26
\|
*	Generic implementation of red-black binary tree. It's planned to use in	Teodor Sigaev	2010-02-11
\| \| \| \| \| \|	several places, but for now only GIN uses it during index creation. Using self-balanced tree greatly speeds up index creation in corner cases with preordered data.
*	Fix bug in GIN WAL redo cleanup function: don't free fake relcache entry	Heikki Linnakangas	2010-02-09
\| \| \| \| \| \|	while it's still being used. Backpatch to 8.4, where the fake relcache method was introduced.
*	Remove old-style VACUUM FULL (which was known for a little while as	Tom Lane	2010-02-08
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	VACUUM FULL INPLACE), along with a boatload of subsidiary code and complexity. Per discussion, the use case for this method of vacuuming is no longer large enough to justify maintaining it; not to mention that we don't wish to invest the work that would be needed to make it play nicely with Hot Standby. Aside from the code directly related to old-style VACUUM FULL, this commit removes support for certain WAL record types that could only be generated within VACUUM FULL, redirect-pointer removal in heap_page_prune, and nontransactional generation of cache invalidation sinval messages (the last being the sticking point for Hot Standby). We still have to retain all code that copes with finding HEAP_MOVED_OFF and HEAP_MOVED_IN flag bits on existing tuples. This can't be removed as long as we want to support in-place update from pre-9.0 databases.
*	Fix incorrect comparison of scan key in GIN. Per report from	Teodor Sigaev	2010-01-18
\| \| \| \|	Vyacheslav Kalinin <vka@mgcp.com>
*	Update copyright for the year 2010.	Bruce Momjian	2010-01-02
\|
*	Allow read only connections during recovery, known as Hot Standby.	Simon Riggs	2009-12-19
\| \| \| \| \| \| \| \| \| \| \| \|	Enabled by recovery_connections = on (default) and forcing archive recovery using a recovery.conf. Recovery processing now emulates the original transactions as they are replayed, providing full locking and MVCC behaviour for read only queries. Recovery must enter consistent state before connections are allowed, so there is a delay, typically short, before connections succeed. Replay of recovering transactions can conflict and in some cases deadlock with queries during recovery; these result in query cancellation after max_standby_delay seconds have expired. Infrastructure changes have minor effects on normal running, though introduce four new types of WAL record. New test mode "make standbycheck" allows regression tests of static command behaviour on a standby server while in recovery. Typical and extreme dynamic behaviours have been checked via code inspection and manual testing. Few port specific behaviours have been utilised, though primary testing has been on Linux only so far. This commit is the basic patch. Additional changes will follow in this release to enhance some aspects of behaviour, notably improved handling of conflicts, deadlock detection and query cancellation. Changes to VACUUM FULL are also required. Simon Riggs, with significant and lengthy review by Heikki Linnakangas, including streamlined redesign of snapshot creation and two-phase commit. Important contributions from Florian Pflug, Mark Kirkwood, Merlin Moncure, Greg Stark, Gianni Ciolli, Gabriele Bartolini, Hannu Krosing, Robert Haas, Tatsuo Ishii, Hiroyuki Yamada plus support and feedback from many other community members.
*	Fix multicolumn GIN's wrong results with fastupdate enabled.	Teodor Sigaev	2009-11-13
\| \| \| \| \| \| \| \|	User-defined consistent functions believes the check array contains at least one true element which was not a true for scanning pending list. Per report from Yury Don <yura@vpcit.ru>
*	Make sure that GIN fast-insert and regular code paths enforce the same	Tom Lane	2009-10-02
\| \| \| \| \| \| \| \| \| \| \|	tuple size limit. Improve the error message for index-tuple-too-large so that it includes the actual size, the limit, and the index name. Sync with the btree occurrences of the same error. Back-patch to 8.4 because it appears that the out-of-sync problem is occurring in the field. Teodor and Tom
*	Fix two distinct errors in creation of GIN_INSERT_LISTPAGE xlog records.	Tom Lane	2009-09-15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In practice these mistakes were always masked when full_page_writes was on, because XLogInsert would always choose to log the full page, and then ginRedoInsertListPage wouldn't try to do anything. But with full_page_writes off a WAL replay failure was certain. The GIN_INSERT_LISTPAGE record type could probably be eliminated entirely in favor of using XLOG_HEAP_NEWPAGE, but I refrained from doing that now since it would have required a significantly more invasive patch. In passing do a little bit of code cleanup, including making the accounting for free space on GIN list pages more precise. (This wasn't a bug as the errors were always in the conservative direction.) Per report from Simon. Back-patch to 8.4 which contains the identical code.
*	Support deferrable uniqueness constraints.	Tom Lane	2009-07-29
\| \| \| \| \| \| \| \| \| \|	The current implementation fires an AFTER ROW trigger for each tuple that looks like it might be non-unique according to the index contents at the time of insertion. This works well as long as there aren't many conflicts, but won't scale to massive unique-key reassignments. Improving that case is a TODO item. Dean Rasheed
*	8.4 pgindent run, with new combined Linux/FreeBSD/MinGW typedef list	Bruce Momjian	2009-06-11
\| \| \| \|	provided by Andrew.
*	Improve the IndexVacuumInfo/IndexBulkDeleteResult API to allow somewhat sane	Tom Lane	2009-06-06
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	behavior in cases where we don't know the heap tuple count accurately; in particular partial vacuum, but this also makes the API a bit more useful for ANALYZE. This patch adds "estimated_count" flags to both structs so that an approximate count can be flagged as such, and adjusts the logic so that approximate counts are not used for updating pg_class.reltuples. This fixes my previous complaint that VACUUM was putting ridiculous values into pg_class.reltuples for indexes. The actual impact of that bug is limited, because the planner only pays attention to reltuples for an index if the index is partial; which probably explains why beta testers hadn't noticed a degradation in plan quality from it. But it needs to be fixed. The whole thing is a bit messy and should be redesigned in future, because reltuples now has the potential to drift quite far away from reality when a long period elapses with no non-partial vacuums. But this is as good as it's going to get for 8.4.
*	Fix a serious bug introduced into GIN in 8.4: now that MergeItemPointers()	Tom Lane	2009-06-06
\| \| \| \| \| \| \| \| \|	is supposed to remove duplicate heap TIDs, we have to be sure to reduce the tuple size and posting-item count accordingly in addItemPointersToTuple(). Failing to do so resulted in the effective injection of garbage TIDs into the index contents, ie, whatever happened to be in the memory palloc'd for the new tuple. I'm not sure that this fully explains the index corruption reported by Tatsuo Ishii, but the test case I'm using no longer fails.
*	Fix bug #4814 (wrong subscript in consistent-function call), and add some	Tom Lane	2009-05-19
\| \| \| \|	minimal regression test coverage for matchPartialInPendingList().
*	Fix infinite loop while checking of partial match in pending list.	Teodor Sigaev	2009-04-05
\| \| \| \| \|	Improve comments. Now GIN-indexable operators should be strict. Per Tom's questions/suggestions.
*	Adjust the APIs for GIN opclass support functions to allow the extractQuery()	Tom Lane	2009-03-25
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	method to pass extra data to the consistent() and comparePartial() methods. This is the core infrastructure needed to support the soon-to-appear contrib/btree_gin module. The APIs are still upward compatible with the definitions used in 8.3 and before, although not with the previous 8.4devel function definitions. catversion bump for changes in pg_proc entries (although these are just cosmetic, since GIN doesn't actually look at the function signature before calling it...) Teodor Sigaev and Oleg Bartunov
*	Install a search tree depth limit in GIN bulk-insert operations, to prevent	Tom Lane	2009-03-24
\| \| \| \| \| \| \| \| \| \| \| \|	them from degrading badly when the input is sorted or nearly so. In this scenario the tree is unbalanced to the point of becoming a mere linked list, so insertions become O(N^2). The easiest and most safely back-patchable solution is to stop growing the tree sooner, ie limit the growth of N. We might later consider a rebalancing tree algorithm, but it's not clear that the benefit would be worth the cost and complexity. Per report from Sergey Burladyan and an earlier complaint from Heikki. Back-patch to 8.2; older versions didn't have GIN indexes.
*	Implement "fastupdate" support for GIN indexes, in which we try to accumulate	Tom Lane	2009-03-24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	multiple index entries in a holding area before adding them to the main index structure. This helps because bulk insert is (usually) significantly faster than retail insert for GIN. This patch also removes GIN support for amgettuple-style index scans. The API defined for amgettuple is difficult to support with fastupdate, and the previously committed partial-match feature didn't really work with it either. We might eventually figure a way to put back amgettuple support, but it won't happen for 8.4. catversion bumped because of change in GIN's pg_am entry, and because the format of GIN indexes changed on-disk (there's a metapage now, and possibly a pending list). Teodor Sigaev
*	Add a new option to RestoreBkpBlocks() to indicate if a cleanup lock should	Heikki Linnakangas	2009-01-20
\| \| \| \| \| \| \| \| \|	be used instead of the normal exclusive lock, and make WAL redo functions responsible for calling RestoreBkpBlocks(). They know better what kind of a lock they need. At the moment, this just moves things around with no functional change, but makes the hot standby patch that's under review cleaner.
*	Revise the TIDBitmap API to support multiple concurrent iterations over a	Tom Lane	2009-01-10
\| \| \| \| \| \|	bitmap. This is extracted from Greg Stark's posix_fadvise patch; it seems worth committing separately, since it's potentially useful independently of posix_fadvise.
*	Change the reloptions machinery to use a table-based parser, and provide	Alvaro Herrera	2009-01-05
\| \| \| \| \| \| \| \|	a more complete framework for writing custom option processing routines by user-defined access methods. Catalog version bumped due to the general API changes, which are going to affect user-defined "amoptions" routines.
*	Update copyright for 2009.	Bruce Momjian	2009-01-01
\|
*	Rethink the way FSM truncation works. Instead of WAL-logging FSM	Heikki Linnakangas	2008-11-19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	truncations in FSM code, call FreeSpaceMapTruncateRel from smgr_redo. To make that cleaner from modularity point of view, move the WAL-logging one level up to RelationTruncate, and move RelationTruncate and all the related WAL-logging to new src/backend/catalog/storage.c file. Introduce new RelationCreateStorage and RelationDropStorage functions that are used instead of calling smgrcreate/smgrscheduleunlink directly. Move the pending rel deletion stuff from smgrcreate/smgrscheduleunlink to the new functions. This leaves smgr.c as a thin wrapper around md.c; all the transactional stuff is now in storage.c. This will make it easier to add new forks with similar truncation logic, like the visibility map.
*	Prevent synchronous scan during GIN index build, because GIN is optimized	Tom Lane	2008-11-13
\| \| \| \| \| \| \| \| \|	for inserting tuples in increasing TID order. It's not clear whether this fully explains Ivan Sergio Borgonovo's complaint, but simple testing confirms that a scan that doesn't start at block 0 can slow GIN build by a factor of three or four. Backpatch to 8.3. Sync scan didn't exist before that.
*	Clean up the messy semantics (not to mention inefficiency) of PageGetTempPage	Tom Lane	2008-11-03
\| \| \| \| \| \|	by splitting it into three functions with better-defined behaviors. Zdenek Kotala
*	Unite ReadBufferWithFork, ReadBufferWithStrategy, and ZeroOrReadBuffer	Heikki Linnakangas	2008-10-31
\| \| \| \| \| \| \| \| \| \| \| \|	functions into one ReadBufferExtended function, that takes the strategy and mode as argument. There's three modes, RBM_NORMAL which is the default used by plain ReadBuffer(), RBM_ZERO, which replaces ZeroOrReadBuffer, and a new mode RBM_ZERO_ON_ERROR, which allows callers to read corrupt pages without throwing an error. The FSM needs the new mode to recover from corrupt pages, which could happend if we crash after extending an FSM file, and the new page is "torn". Add fork number to some error messages in bufmgr.c, that still lacked it.
*	Remove mark/restore support in GIN and GiST indexes.	Teodor Sigaev	2008-10-20
\| \| \| \| \|	Per Tom's comment. Also revome useless GISTScanOpaque->flags field.
*	Index FSMs needs to be vacuumed as well. Report by Jeff Davis.	Heikki Linnakangas	2008-10-06
\|
*	Rewrite the FSM. Instead of relying on a fixed-size shared memory segment, the	Heikki Linnakangas	2008-09-30
\| \| \| \| \| \| \| \| \| \| \| \| \|	free space information is stored in a dedicated FSM relation fork, with each relation (except for hash indexes; they don't use FSM). This eliminates the max_fsm_relations and max_fsm_pages GUC options; remove any trace of them from the backend, initdb, and documentation. Rewrite contrib/pg_freespacemap to match the new FSM implementation. Also introduce a new variant of the get_raw_page(regclass, int4, int4) function in contrib/pageinspect that let's you to return pages from any relation fork, and a new fsm_page_contents() function to inspect the new FSM pages.
*	Fix strategy propagation to scanEntry for partial match by moving propagation	Teodor Sigaev	2008-09-04
\| \| \| \|	to initializaion of scanEntry.
*	Multi-column GIN indexes. Teodor Sigaev	Tom Lane	2008-07-11
\|
*	Minor improvements to the Gin internal documentation.	Neil Conway	2008-07-08
\|
*	Fix initialization of GinScanEntryData.partialMatch	Teodor Sigaev	2008-07-04
\|
*	Remove unnecessary coziness of GIN code with datum copying. Now that	Tom Lane	2008-06-29
\| \| \| \| \| \|	space is tracked via GetMemoryChunkSpace, there's really no advantage to duplicating datumCopy's innards here. This is one bit of my toast indirection patch that should go in anyway.
*	Improve our #include situation by moving pointer types away from the	Alvaro Herrera	2008-06-19
\| \| \| \| \| \| \|	corresponding struct definitions. This allows other headers to avoid including certain highly-loaded headers such as rel.h and relscan.h, instead using just relcache.h, heapam.h or genam.h, which are more lightweight and thus cause less unnecessary dependencies.
*	Refactor XLogOpenRelation() and XLogReadBuffer() in preparation for relation	Heikki Linnakangas	2008-06-12
\| \| \| \| \| \| \| \| \| \|	forks. XLogOpenRelation() and the associated light-weight relation cache in xlogutils.c is gone, and XLogReadBuffer() now takes a RelFileNode as argument, instead of Relation. For functions that still need a Relation struct during WAL replay, there's a new function called CreateFakeRelcacheEntry() that returns a fake entry like XLogOpenRelation() used to.
*	Move BufferGetPageSize and BufferGetPage from bufpage.h to bufmgr.h. It is	Alvaro Herrera	2008-06-08
\| \| \| \| \| \| \| \| \| \|	more logical that way, and also it reduces the amount of unnecessary includes in bufpage.h, which is widely used. Zdenek Kotala. My previous patch to bufpage.h should also have credited him as author, but I forgot (sorry about that).
*	Extend GIN to support partial-match searches, and extend tsquery to support	Tom Lane	2008-05-16
\| \| \| \| \| \|	prefix matching using this facility. Teodor Sigaev and Oleg Bartunov
*	Persuade GIN to react to control-C in a reasonable amount of time	Tom Lane	2008-05-16
\| \| \| \|	while building a GIN index.
*	Put back bufmgr.h in bufpage.h -- it is needed by some macros.	Alvaro Herrera	2008-05-12
\| \| \| \| \|	Remove #include bufmgr.h from (most?) source files which already include bufpage.h.
*	Restructure some header files a bit, in particular heapam.h, by removing some	Alvaro Herrera	2008-05-12
\| \| \| \| \| \| \| \| \| \| \| \|	unnecessary #include lines in it. Also, move some tuple routine prototypes and macros to htup.h, which allows removal of heapam.h inclusion from some .c files. For this to work, a new header file access/sysattr.h needed to be created, initially containing attribute numbers of system columns, for pg_dump usage. While at it, make contrib ltree, intarray and hstore header files more consistent with our header style.
*	Fix using too many LWLocks bug, reported by Craig Ringer	Teodor Sigaev	2008-04-22
\| \| \| \| \| \| \| \| \|	<craig@postnewspapers.com.au>. It was my mistake, I missed limitation of number of held locks, now GIN doesn't use continiuous locks, but still hold buffers pinned to prevent interference with vacuum's deletion algorithm. Backpatch is needed.
*	Push index operator lossiness determination down to GIST/GIN opclass	Tom Lane	2008-04-14
\| \| \| \| \| \| \| \| \| \| \|	"consistent" functions, and remove pg_amop.opreqcheck, as per recent discussion. The main immediate benefit of this is that we no longer need 8.3's ugly hack of requiring @@@ rather than @@ to test weight-using tsquery searches on GIN indexes. In future it should be possible to optimize some other queries better than is done now, by detecting at runtime whether the index match is exact or not. Tom Lane, after an idea of Heikki's, and with some help from Teodor.
*	Phase 2 of project to make index operator lossiness be determined at runtime	Tom Lane	2008-04-13
\| \| \| \| \| \| \| \| \| \| \| \|	instead of plan time. Extend the amgettuple API so that the index AM returns a boolean indicating whether the indexquals need to be rechecked, and make that rechecking happen in nodeIndexscan.c (currently the only place where it's expected to be needed; other callers of index_getnext are just erroring out for now). For the moment, GIN and GIST have stub logic that just always sets the recheck flag to TRUE --- I'm hoping to get Teodor to handle pushing that control down to the opclass consistent() functions. The planner no longer pays any attention to amopreqcheck, and that catalog column will go away in due course.
*	Replace "amgetmulti" AM functions with "amgetbitmap", in which the whole	Tom Lane	2008-04-10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	indexscan always occurs in one call, and the results are returned in a TIDBitmap instead of a limited-size array of TIDs. This should improve speed a little by reducing AM entry/exit overhead, and it is necessary infrastructure if we are ever to support bitmap indexes. In an only slightly related change, add support for TIDBitmaps to preserve (somewhat lossily) the knowledge that particular TIDs reported by an index need to have their quals rechecked when the heap is visited. This facility is not really used yet; we'll need to extend the forced-recheck feature to plain indexscans before it's useful, and that hasn't been coded yet. The intent is to use it to clean up 8.3's horrid @@@ kluge for text search with weighted queries. There might be other uses in future, but that one alone is sufficient reason. Heikki Linnakangas, with some adjustments by me.
*	Make source code READMEs more consistent. Add CVS tags to all README files.	Bruce Momjian	2008-03-20
\|
*	Refactor backend makefiles to remove lots of duplicate code	Peter Eisentraut	2008-02-19
\|
*	Update copyrights in source tree to 2008.	Bruce Momjian	2008-01-01
\|
*	Improve GIN index build's tracking of memory usage by using	Tom Lane	2007-11-16
\| \| \| \| \| \| \| \| \|	GetMemoryChunkSpace, not just the palloc request size. This brings the allocatedMemory counter close enough to reality (as measured by MemoryContextStats printouts) that I think we can get rid of the arbitrary factor-of-2 adjustment that was put into the code initially. Given the sensitivity of GIN build to work memory size, not using as much of work memory as we're allowed to seems a pretty bad idea.