postgresql - postgresql mirror

	Commit message	Author	Age
*	Update copyrights for 2013	Bruce Momjian	2013-01-01
\| \| \| \| \|	Fully update git head, and update back branches in ./COPYRIGHT and legal.sgml files.
*	In our source code, make a copy of getopt's 'optarg' string arguments,	Bruce Momjian	2012-10-12
\| \| \| \|	rather than just storing a pointer.
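A minimal sketch of the idea in this commit: keep a private copy of an option argument rather than the pointer handed back by getopt(). The helper name `demo_copy_arg` is hypothetical and is not taken from the PostgreSQL tree.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/*
 * Hypothetical helper: return a heap-allocated copy of an option argument
 * (for example, getopt's optarg), so the caller owns storage that stays
 * valid even if the original argument memory is later changed.
 * Returns NULL if memory cannot be allocated.
 */
char *
demo_copy_arg(const char *arg)
{
    size_t  len;
    char   *copy;

    assert(arg != NULL);
    len = strlen(arg) + 1;          /* include the terminating NUL */
    copy = malloc(len);
    if (copy != NULL)
        memcpy(copy, arg, len);
    return copy;
}
```

A caller would store `demo_copy_arg(optarg)` instead of `optarg` itself and free the copy when it is no longer needed.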
*	Split tuple struct defs from htup.h to htup_details.h	Alvaro Herrera	2012-08-30
\| \| \| \| \| \| \| \| \| \| \| \|	This reduces unnecessary exposure of other headers through htup.h, which is very widely included by many files. I have chosen to move the function prototypes to the new file as well, because that means htup.h no longer needs to include tupdesc.h. In itself this doesn't have much effect in indirect inclusion of tupdesc.h throughout the tree, because it's also required by execnodes.h; but it's something to explore in the future, and it seemed best to do the htup.h change now while I'm busy with it.
*	Fix management of pendingOpsTable in auxiliary processes.	Tom Lane	2012-07-18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	mdinit() was misusing IsBootstrapProcessingMode() to decide whether to create an fsync pending-operations table in the current process. This led to creating a table not only in the startup and checkpointer processes as intended, but also in the bgwriter process, not to mention other auxiliary processes such as walwriter and walreceiver. Creation of the table in the bgwriter is fatal, because it absorbs fsync requests that should have gone to the checkpointer; instead they just sit in bgwriter local memory and are never acted on. So writes performed by the bgwriter were not being fsync'd which could result in data loss after an OS crash. I think there is no live bug with respect to walwriter and walreceiver because those never perform any writes of shared buffers; but the potential is there for future breakage in those processes too. To fix, make AuxiliaryProcessMain() export the current process's AuxProcType as a global variable, and then make mdinit() test directly for the types of aux process that should have a pendingOpsTable. Having done that, we might as well also get rid of the random bool flags such as am_walreceiver that some of the aux processes had grown. (Note that we could not have fixed the bug by examining those variables in mdinit(), because it's called from BaseInit() which is run by AuxiliaryProcessMain() before entering any of the process-type-specific code.) Back-patch to 9.2, where the problem was introduced by the split-up of bgwriter and checkpointer processes. The bogus pendingOpsTable exists in walwriter and walreceiver processes in earlier branches, but absent any evidence that it causes actual problems there, I'll leave the older branches alone.
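The fix described above can be sketched as a direct test on the process type. This is an illustration only: the enum, its values, and the function name below are hypothetical stand-ins, not the actual AuxProcType definition or the mdinit() code, and the sketch ignores any non-auxiliary cases.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-in for the auxiliary process type exported as a global. */
typedef enum DemoAuxProcType
{
    DEMO_STARTUP,
    DEMO_BGWRITER,
    DEMO_CHECKPOINTER,
    DEMO_WALWRITER,
    DEMO_WALRECEIVER,
    DEMO_NUM_AUX_TYPES
} DemoAuxProcType;

/*
 * Decide whether this auxiliary process should own an fsync
 * pending-operations table. Per the commit message, only the startup and
 * checkpointer processes should; the bgwriter must not, or it would absorb
 * fsync requests meant for the checkpointer.
 */
bool
demo_needs_pending_ops_table(DemoAuxProcType type)
{
    assert(type >= DEMO_STARTUP && type < DEMO_NUM_AUX_TYPES);
    return type == DEMO_STARTUP || type == DEMO_CHECKPOINTER;
}
```

Testing the process type explicitly avoids the original mistake of inferring it from a broader mode flag that happened to be true for every auxiliary process.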
*	Update copyright notices for year 2012.	Bruce Momjian	2012-01-01
\|
*	Refactor xlog.c to create src/backend/postmaster/startup.c	Simon Riggs	2011-11-02
\| \| \| \| \|	Startup process now has its own dedicated file, just like all other special/background processes. Reduces role and size of xlog.c
*	Split work of bgwriter between 2 processes: bgwriter and checkpointer.	Simon Riggs	2011-11-01
\| \| \| \| \| \| \| \| \| \| \| \| \|	bgwriter is now a much less important process, responsible for page cleaning duties only. checkpointer is now responsible for checkpoints and so has a key role in shutdown. Later patches will correct doc references to the now old idea that bgwriter performs checkpoints. Has beneficial effect on performance at high write rates, but mainly refactoring to more easily allow changes for power reduction by simplifying the previously tortuous code required to allow page cleaning and checkpointing to time-slice in the same process. Patch by me, review by Dickson Guedes
*	Simplify handling of the timezone GUC by making initdb choose the default.	Tom Lane	2011-09-09
\| \| \| \| \| \| \| \| \| \| \|	We were doing some amazingly complicated things in order to avoid running the very expensive identify_system_timezone() procedure during GUC initialization. But there is an obvious fix for that, which is to do it once during initdb and have initdb install the system-specific default into postgresql.conf, as it already does for most other GUC variables that need system-environment-dependent defaults. This means that the timezone (and log_timezone) settings no longer have any magic behavior in the server. Per discussion.
*	Correct ancient logic mistake in assertion	Peter Eisentraut	2011-09-06
\| \| \| \|	Found by gcc -Wlogical-op
*	Clean up the #include mess a little.	Tom Lane	2011-09-04
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	walsender.h should depend on xlog.h, not vice versa. (Actually, the inclusion was circular until a couple hours ago, which was even sillier; but Bruce broke it in the expedient rather than logically correct direction.) Because of that poor decision, plus blind application of pgrminclude, we had a situation where half the system was depending on xlog.h to include such unrelated stuff as array.h and guc.h. Clean up the header inclusion, and manually revert a lot of what pgrminclude had done so things build again. This episode reinforces my feeling that pgrminclude should not be run without adult supervision. Inclusion changes in header files in particular need to be reviewed with great care. More generally, it'd be good if we had a clearer notion of module layering to dictate which headers can sanely include which others ... but that's a big task for another day.
*	Remove unnecessary #include references, per pgrminclude script.	Bruce Momjian	2011-09-01
\|
*	Move Trigger and TriggerDesc structs out of rel.h into a new reltrigger.h	Alvaro Herrera	2011-07-04
\| \| \| \| \|	This lets us stop including rel.h into execnodes.h, which is a widely used header.
*	Avoid changing an index's indcheckxmin horizon during REINDEX.	Tom Lane	2011-04-19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	There can never be a need to push the indcheckxmin horizon forward, since any HOT chains that are actually broken with respect to the index must pre-date its original creation. So we can just avoid changing pg_index altogether during a REINDEX operation. This offers a cleaner solution than my previous patch for the problem found a few days ago that we mustn't try to update pg_index while we are reindexing it. System catalog indexes will always be created with indcheckxmin = false during initdb, and with this modified code we should never try to change their pg_index entries. This avoids special-casing system catalogs as the former patch did, and should provide a performance benefit for many cases where REINDEX formerly caused an index to be considered unusable for a short time. Back-patch to 8.3 to cover all versions containing HOT. Note that this patch changes the API for index_build(), but I believe it is unlikely that any add-on code is calling that directly.
*	Per-column collation support	Peter Eisentraut	2011-02-08
\| \| \| \| \| \| \| \|	This adds collation support for columns and domains, a COLLATE clause to override it per expression, and B-tree index support. Peter Eisentraut reviewed by Pavel Stehule, Itagaki Takahiro, Robert Haas, Noah Misch
*	Stamp copyrights for year 2011.	Bruce Momjian	2011-01-01
\|
*	Remove cvs keywords from all files.	Magnus Hagander	2010-09-20
\|
*	Install a data-type-based solution for protecting pg_get_expr(). (tag: REL9_1_ALPHA1)	Tom Lane	2010-09-03
\| \| \| \| \| \| \| \| \| \| \| \|	Since the code underlying pg_get_expr() is not secure against malformed input, and can't practically be made so, we need to prevent miscreants from feeding arbitrary data to it. We can do this securely by declaring pg_get_expr() to take a new datatype "pg_node_tree" and declaring the system catalog columns that hold nodeToString output to be of that type. There is no way at SQL level to create a non-null value of type pg_node_tree. Since the backend-internal operations that fill those catalog columns operate below the SQL level, they are oblivious to the datatype relabeling and don't need any changes.
*	Move the responsibility for calling StartupXLOG into InitPostgres, for	Tom Lane	2010-04-20
\| \| \| \| \| \| \| \| \| \| \| \| \|	those process types that go through InitPostgres; in particular, bootstrap and standalone-backend cases. This ensures that we have set up a PGPROC and done some other basic initialization steps (corresponding to the if (IsUnderPostmaster) block in AuxiliaryProcessMain) before we attempt to run WAL recovery in a standalone backend. As was discovered last September, this is necessary for some corner-case code paths during WAL recovery, particularly end-of-WAL cleanup. Moving the bootstrap case here too is not necessary for correctness, but it seems like a good idea since it reduces the number of distinct code paths.
*	pgindent run for 9.0	Bruce Momjian	2010-02-26
\|
*	Create a "relation mapping" infrastructure to support changing the relfilenodes	Tom Lane	2010-02-07
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	of shared or nailed system catalogs. This has two key benefits: * The new CLUSTER-based VACUUM FULL can be applied safely to all catalogs. * We no longer have to use an unsafe reindex-in-place approach for reindexing shared catalogs. CLUSTER on nailed catalogs now works too, although I left it disabled on shared catalogs because the resulting pg_index.indisclustered update would only be visible in one database. Since reindexing shared system catalogs is now fully transactional and crash-safe, the former special cases in REINDEX behavior have been removed; shared catalogs are treated the same as non-shared. This commit does not do anything about the recently-discussed problem of deadlocks between VACUUM FULL/CLUSTER on a system catalog and other concurrent queries; will address that in a separate patch. As a stopgap, parallel_schedule has been tweaked to run vacuum.sql by itself, to avoid such failures during the regression tests.
*	Replace ALTER TABLE ... SET STATISTICS DISTINCT with a more general mechanism.	Robert Haas	2010-01-22
\| \| \| \| \| \| \| \| \|	Attributes can now have options, just as relations and tablespaces do, and the reloptions code is used to parse, validate, and store them. For simplicity and because these options are not performance critical, we store them in a separate cache rather than the main relcache. Thanks to Alex Hunsaker for the review.
*	Rethink the way walreceiver is linked into the backend. Instead of shoving	Heikki Linnakangas	2010-01-20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	walreceiver as a whole into a dynamically loaded module, split the libpq-specific parts of it into a dynamically loaded module and keep the rest in the main backend binary. Although Tom fixed the Windows compilation problems with the old walreceiver module already, this is a cleaner division of labour and makes the code more readable. There's also the prospect of adding new transport methods as pluggable modules in the future, which this patch makes easier, though for now the API between libpqwalreceiver and walreceiver process should be considered private. The libpq-specific module is now in src/backend/replication/libpqwalreceiver, and the part linked with postgres binary is in src/backend/replication/walreceiver.c.
*	Introduce Streaming Replication.	Heikki Linnakangas	2010-01-15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This includes two new kinds of postmaster processes, walsenders and walreceiver. Walreceiver is responsible for connecting to the primary server and streaming WAL to disk, while walsender runs in the primary server and streams WAL from disk to the client. Documentation still needs work, but the basics are there. We will probably pull the replication section to a new chapter later on, as well as the sections describing file-based replication. But let's do that as a separate patch, so that it's easier to see what has been added/changed. This patch also adds a new section to the chapter about FE/BE protocol, documenting the protocol used by walsender/walreceiver. Bump catalog version because of two new functions, pg_last_xlog_receive_location() and pg_last_xlog_replay_location(), for monitoring the progress of replication. Fujii Masao, with additional hacking by me
*	Update copyright for the year 2010.	Bruce Momjian	2010-01-02
\|
*	Add exclusion constraints, which generalize the concept of uniqueness to	Tom Lane	2009-12-07
\| \| \| \| \| \| \| \|	support any indexable commutative operator, not just equality. Two rows violate the exclusion constraint if "row1.col OP row2.col" is TRUE for each of the columns in the constraint. Jeff Davis, reviewed by Robert Haas
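The violation rule stated above can be sketched as a loop over the constrained columns. This is a toy model with integer columns and function-pointer operators; all names are hypothetical, and the real feature works through index operators rather than a pairwise row comparison like this.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical operator type: returns true when "left OP right" holds. */
typedef bool (*DemoOperator) (int left, int right);

/* Two example commutative operators. */
bool
demo_op_equal(int a, int b)
{
    return a == b;
}

bool
demo_op_within_five(int a, int b)
{
    int diff = (a > b) ? a - b : b - a;

    return diff <= 5;
}

/*
 * Two rows conflict only if the operator for every constrained column
 * returns true; a single false result means the pair is allowed.
 */
bool
demo_rows_conflict(const int *row1, const int *row2,
                   const DemoOperator *ops, size_t ncols)
{
    assert(row1 != NULL && row2 != NULL && ops != NULL && ncols > 0);
    for (size_t i = 0; i < ncols; i++)
    {
        if (!ops[i] (row1[i], row2[i]))
            return false;
    }
    return true;
}
```

With equality as the only operator this reduces to an ordinary uniqueness check, which is the sense in which exclusion constraints generalize uniqueness.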
*	Simplify the bootstrap (BKI) code by getting rid of a useless table of all	Tom Lane	2009-09-27
\| \| \| \| \| \| \| \| \| \| \| \|	the strings seen during the bootstrap run. There might have been some actual point to doing that, many years ago, but as far as I can see the only value now is to conserve a bit of memory. Even if we cared about wasting a megabyte or so during the initdb run, it'd be far more effective to arrange to release memory at the end of each BKI command, instead of intentionally hanging onto strings that might never be used again. Not maintaining the table probably makes it faster too; but the main point of this patch is to get rid of a couple hundred lines of unnecessary and rather crufty code.
*	Add ALTER TABLE ... ALTER COLUMN ... SET STATISTICS DISTINCT	Tom Lane	2009-08-02
\| \| \| \|	Robert Haas
*	Create a multiplexing structure for signals to Postgres child processes.	Tom Lane	2009-07-31
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch gets us out from under the Unix limitation of two user-defined signal types. We already had done something similar for signals directed to the postmaster process; this adds multiplexing for signals directed to backends and auxiliary processes (so long as they're connected to shared memory). As proof of concept, replace the former usage of SIGUSR1 and SIGUSR2 for backends with use of the multiplexing mechanism. There are still some hard-wired definitions of SIGUSR1 and SIGUSR2 for other process types, but getting rid of those doesn't seem interesting at the moment. Fujii Masao
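A rough single-process sketch of the multiplexing idea: the sender records a reason in a flag array and then raises one signal, and the handler inspects the flags to learn which reasons are pending. The names are hypothetical; in the scheme the commit describes, the flags live in shared memory and the signal is sent to another process, which this sketch replaces with process-local state and raise().

```c
#include <assert.h>
#include <signal.h>

/* Hypothetical reasons multiplexed over a single SIGUSR1. */
enum
{
    DEMO_REASON_CATCHUP,
    DEMO_REASON_NOTIFY,
    DEMO_NUM_REASONS
};

static volatile sig_atomic_t demo_pending[DEMO_NUM_REASONS];
static volatile sig_atomic_t demo_handled[DEMO_NUM_REASONS];

/* One handler serves every reason: check each flag and act on those set. */
static void
demo_sigusr1_handler(int signo)
{
    /* Re-install first, in case signal() resets the disposition on delivery. */
    signal(signo, demo_sigusr1_handler);
    for (int i = 0; i < DEMO_NUM_REASONS; i++)
    {
        if (demo_pending[i])
        {
            demo_pending[i] = 0;
            demo_handled[i]++;
        }
    }
}

void
demo_install_handler(void)
{
    signal(SIGUSR1, demo_sigusr1_handler);
}

/* Record the reason, then deliver the one shared signal. */
int
demo_send(int reason)
{
    assert(reason >= 0 && reason < DEMO_NUM_REASONS);
    demo_pending[reason] = 1;
    return raise(SIGUSR1);
}

int
demo_handled_count(int reason)
{
    assert(reason >= 0 && reason < DEMO_NUM_REASONS);
    return (int) demo_handled[reason];
}
```

Adding a new reason then only requires a new enum value and a new branch in the handler, not another signal number.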
*	Start background writer during archive recovery. Background writer now performs	Heikki Linnakangas	2009-02-18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	its usual buffer cleaning duties during archive recovery, and it's responsible for performing restartpoints. This requires some changes in postmaster. When the startup process has done all the initialization and is ready to start WAL redo, it signals the postmaster to launch the background writer. The postmaster is signaled again when the point in recovery is reached where we know that the database is in a consistent state. Postmaster isn't interested in that at the moment, but that's the point where we could let other backends in to perform read-only queries. The postmaster is signaled a third time when the recovery has ended, so that postmaster knows that it's safe to start accepting connections. The startup process now traps SIGTERM, and performs a "clean" shutdown. If you do a fast shutdown during recovery, a shutdown restartpoint is performed, like a shutdown checkpoint, and postmaster kills the processes cleanly. You still have to continue the recovery at next startup, though. Currently, the background writer is only launched during archive recovery. We could launch it during crash recovery as well, but it seems better to keep that codepath as simple as possible, for the sake of robustness. And it couldn't do any restartpoints during crash recovery anyway, so it wouldn't be that useful. log_restartpoints is gone. Use log_checkpoints instead. This is yet to be documented. This whole operation is a prerequisite for Hot Standby, but has some value of its own whether the hot standby patch makes 8.4 or not. Simon Riggs, with lots of modifications by me.
*	Support column-level privileges, as required by SQL standard.	Tom Lane	2009-01-22
\| \| \| \|	Stephen Frost, with help from KaiGai Kohei and others
*	Update copyright for 2009.	Bruce Momjian	2009-01-01
\|
*	Remove all uses of the deprecated functions heap_formtuple, heap_modifytuple,	Tom Lane	2008-11-02
\| \| \| \| \| \| \| \| \| \| \|	and heap_deformtuple in favor of the newer functions heap_form_tuple et al (which do the same things but use bool control flags instead of arbitrary char values). Eliminate the former duplicate coding of these functions, reducing the deprecated functions to mere wrappers around the newer ones. We can't get rid of them entirely because add-on modules probably still contain many instances of the old coding style. Kris Jurka
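The "mere wrappers" arrangement can be sketched as follows. This is a toy model, not the heap tuple code: the function names are hypothetical, rows are plain int arrays, and the use of 'n' to mark a null in the old char-flag style is an assumption made for the illustration.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdlib.h>

/*
 * Newer-style function: bool flags say which columns are null.
 * Copies the non-null values into out and returns how many were copied.
 */
size_t
demo_form_row(const int *values, const bool *isnull, size_t ncols, int *out)
{
    size_t  n = 0;

    assert(values != NULL && isnull != NULL && out != NULL);
    for (size_t i = 0; i < ncols; i++)
    {
        if (!isnull[i])
            out[n++] = values[i];
    }
    return n;
}

/*
 * Deprecated-style wrapper: accepts char flags ('n' assumed to mean null),
 * converts them to bools, and delegates to the newer function, so the
 * logic exists in only one place. Returns 0 if allocation fails.
 */
size_t
demo_formrow_legacy(const int *values, const char *nulls, size_t ncols,
                    int *out)
{
    bool   *isnull;
    size_t  n;

    assert(nulls != NULL && ncols > 0);
    isnull = malloc(ncols * sizeof(bool));
    if (isnull == NULL)
        return 0;
    for (size_t i = 0; i < ncols; i++)
        isnull[i] = (nulls[i] == 'n');
    n = demo_form_row(values, isnull, ncols, out);
    free(isnull);
    return n;
}
```

Keeping the old entry point as a thin translation layer lets existing callers compile unchanged while removing the duplicated implementation.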
*	Rewrite the FSM. Instead of relying on a fixed-size shared memory segment, the	Heikki Linnakangas	2008-09-30
\| \| \| \| \| \| \| \| \| \| \| \| \|	free space information is stored in a dedicated FSM relation fork, with each relation (except for hash indexes; they don't use FSM). This eliminates the max_fsm_relations and max_fsm_pages GUC options; remove any trace of them from the backend, initdb, and documentation. Rewrite contrib/pg_freespacemap to match the new FSM implementation. Also introduce a new variant of the get_raw_page(regclass, int4, int4) function in contrib/pageinspect that lets you return pages from any relation fork, and a new fsm_page_contents() function to inspect the new FSM pages.
*	Add a bunch of new error location reports to parse-analysis error messages.	Tom Lane	2008-09-01
\| \| \| \| \|	There are still some weak spots around JOIN USING and relation alias lists, but most errors reported within backend/parser/ now have locations.
*	Reduce the alignment requirement of type "name" from int to char, and arrange	Tom Lane	2008-06-24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	to suppress zero-padding of "name" entries in indexes. The alignment change is unlikely to save any space, but it is really needed anyway to make the world safe for our widespread practice of passing plain old C strings to functions that are declared as taking Name. In the previous coding, the C compiler was entitled to assume that a Name pointer was word-aligned; but we were failing to guarantee that. I think the reason we'd not seen failures is that usually the only thing that gets done with such a pointer is strcmp(), which is hard to optimize in a way that exploits word-alignment. Still, some enterprising compiler guy will probably think of a way eventually, or we might change our code in a way that exposes more-obvious optimization opportunities. The padding change is accomplished in one-liner fashion by declaring the "name" index opclasses to use storage type "cstring" in pg_opclass.h. Normally btree and hash don't allow a nondefault storage type, because they don't have any provisions for converting the input datum to another type. However, because name and cstring are effectively the same thing except for padding, no conversion is needed --- we only need index_form_tuple() to treat the datum as being cstring not name, and this is sufficient. This seems to make for about a one-third reduction in the typical sizes of system catalog indexes that involve "name" columns, of which we have many. These two changes are only weakly related, but the alignment change makes me feel safer that the padding change won't introduce problems, so I'm committing them together.
*	Restructure some header files a bit, in particular heapam.h, by removing some	Alvaro Herrera	2008-05-12
\| \| \| \| \| \| \| \| \| \| \| \|	unnecessary #include lines in it. Also, move some tuple routine prototypes and macros to htup.h, which allows removal of heapam.h inclusion from some .c files. For this to work, a new header file access/sysattr.h needed to be created, initially containing attribute numbers of system columns, for pg_dump usage. While at it, make contrib ltree, intarray and hstore header files more consistent with our header style.
*	Allow float8, int8, and related datatypes to be passed by value on machines	Tom Lane	2008-04-21
\| \| \| \| \| \| \| \| \| \|	where Datum is 8 bytes wide. Since this will break old-style C functions (those still using version 0 calling convention) that have arguments or results of these types, provide a configure option to disable it and retain the old pass-by-reference behavior. Likewise, provide a configure option to disable the recently-committed float4 pass-by-value change. Zoltan Boszormenyi, plus configurability stuff by me.
*	Modify the float4 datatype to be pass-by-val. Along the way, remove the last	Alvaro Herrera	2008-04-18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	uses of the long-deprecated float32 in contrib/seg; the definitions themselves are still there, but no longer used. fmgr/README updated to match. I added a CREATE FUNCTION to account for existing seg_center() code in seg.c too, and some tests for it and the neighbor functions. At the same time, remove checks for NULL which are not needed (because the functions are declared STRICT). I had to do some adjustments to contrib's btree_gist too. The choices for representation there are not ideal for changing the underlying types :-( Original patch by Zoltan Boszormenyi, with some adjustments by me.
*	Move the HTSU_Result enum definition into snapshot.h, to avoid including	Alvaro Herrera	2008-03-26
\| \| \| \| \| \|	tqual.h into heapam.h. This makes all inclusion of tqual.h explicit. I also sorted alphabetically the includes on some source files.
*	Add back #include <time.h> in a couple of files that seem to need it	Tom Lane	2008-02-17
\| \| \| \|	on Linux.
*	Update copyrights in source tree to 2008.	Bruce Momjian	2008-01-01
\|
*	pgindent run for 8.3.	Bruce Momjian	2007-11-15
\|
*	Move session_start out of MyProcPort structure and make it a global called	Andrew Dunstan	2007-08-02
\| \| \| \| \| \| \| \|	MyStartTime, so that we will be able to create a cookie for all processes for CSVlogs. It is set wherever MyProcPid is set. Take the opportunity to remove the now unnecessary session-only restriction on the %s and %c escapes in log_line_prefix.
*	Create a new dedicated Postgres process, "wal writer", which exists to write	Tom Lane	2007-07-24
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	and fsync WAL at convenient intervals. For the moment it just tries to offload this work from backends, but soon it will be responsible for guaranteeing a maximum delay before asynchronously-committed transactions will be flushed to disk. This is a portion of Simon Riggs' async-commit patch, committed to CVS separately because a background WAL writer seems like it might be a good idea independently of the async-commit feature. I rebased walwriter.c on bgwriter.c because it seemed like a more appropriate way of handling signals; while the startup/shutdown logic in postmaster.c is more like autovac because we want walwriter to quit before we start the shutdown checkpoint.
*	Implement "distributed" checkpoints in which the checkpoint I/O is spread	Tom Lane	2007-06-28
\| \| \| \| \| \| \| \| \| \| \| \| \|	over a fairly long period of time, rather than being spat out in a burst. This happens only for background checkpoints carried out by the bgwriter; other cases, such as a shutdown checkpoint, are still done at full speed. Remove the "all buffers" scan in the bgwriter, and associated stats infrastructure, since this seems no longer very useful when the checkpoint itself is properly throttled. Original patch by Itagaki Takahiro, reworked by Heikki Linnakangas, and some minor API editorialization by me.
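One way to picture the throttling is a pacing test: compare the share of buffers already written with the share of the target interval that has elapsed, and pause whenever the writer is on or ahead of schedule. The function below is a hypothetical sketch of that comparison, with made-up names and parameters; it is not the bgwriter code.

```c
#include <assert.h>
#include <stdbool.h>

/*
 * Return true when the checkpoint has written at least as large a share of
 * its buffers as the share of the target interval that has elapsed, meaning
 * the writer may pause before issuing the next write. A shutdown-style
 * checkpoint would simply skip this test and write at full speed.
 */
bool
demo_checkpoint_on_schedule(int buffers_written, int buffers_total,
                            double elapsed_secs, double target_secs)
{
    double  progress;
    double  elapsed_fraction;

    assert(buffers_total > 0 && target_secs > 0.0);
    progress = (double) buffers_written / buffers_total;
    elapsed_fraction = elapsed_secs / target_secs;
    return progress >= elapsed_fraction;
}
```

A write loop would call this after each buffer and sleep briefly while it returns true, which spreads the I/O across the interval instead of emitting it in a burst.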
*	Clean up the bootstrap code a little, and rename "dummy procs" in the code	Alvaro Herrera	2007-03-07
\| \| \| \|	comments and variables to "auxiliary proc", per Heikki's request.
*	Remove useless database name from bootstrap argument processing (including	Alvaro Herrera	2007-02-16
\| \| \| \|	startup and bgwriter processes), and the -y flag. It's not used anywhere.
*	Restructure autovacuum in two processes: a dummy process, which runs	Alvaro Herrera	2007-02-15
\| \| \| \| \| \| \| \| \|	continuously, and requests vacuum runs of "autovacuum workers" to postmaster. The workers do the actual vacuum work. This allows for future improvements, like allowing multiple autovacuum jobs running in parallel. For now, the code keeps the original behavior of having a single autovac process at any time by sleeping until the previous worker has finished.
*	StrNCpy -> strlcpy (not complete)	Peter Eisentraut	2007-02-10
\|
*	Add COST and ROWS options to CREATE/ALTER FUNCTION, plus underlying pg_proc	Tom Lane	2007-01-22
\| \| \| \| \| \| \| \| \| \| \| \|	columns procost and prorows, to allow simple user adjustment of the estimated cost of a function call, as well as control of the estimated number of rows returned by a set-returning function. We might eventually wish to extend this to allow function-specific estimation routines, but there seems to be consensus that we should try a simple constant estimate first. In particular this provides a relatively simple way to control the order in which different WHERE clauses are applied in a plan node, which is a Good Thing in view of the fact that the recent EquivalenceClass planner rewrite made that much less predictable than before.