postgresql - postgresql mirror

	Commit message (Collapse)	Author	Age
*	Add some enumeration commas, for consistency	Peter Eisentraut	2012-02-24
\|
*	Fix the general case of quantified regex back-references.	Tom Lane	2012-02-24
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Cases where a back-reference is part of a larger subexpression that is quantified have never worked in Spencer's regex engine, because he used a compile-time transformation that neglected the need to check the back-reference match in iterations before the last one. (That was okay for capturing parens, and we still do it if the regex has only capturing parens ... but it's not okay for backrefs.) To make this work properly, we have to add an "iteration" node type to the regex engine's vocabulary of sub-regex nodes. Since this is a moderately large change with a fair risk of introducing new bugs of its own, apply to HEAD only, even though it's a fix for a longstanding bug.
*	Correctly handle NULLs in JSON output.	Andrew Dunstan	2012-02-23
\| \| \| \|	Error reported by David Wheeler.
*	Remove arbitrary limitation on length of common name in SSL certificates.	Tom Lane	2012-02-23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Both libpq and the backend would truncate a common name extracted from a certificate at 32 bytes. Replace that fixed-size buffer with dynamically allocated string so that there is no hard limit. While at it, remove the code for extracting peer_dn, which we weren't using for anything; and don't bother to store peer_cn longer than we need it in libpq. This limit was not so terribly unreasonable when the code was written, because we weren't using the result for anything critical, just logging it. But now that there are options for checking the common name against the server host name (in libpq) or using it as the user's name (in the server), this could result in undesirable failures. In the worst case it even seems possible to spoof a server name or user name, if the correct name is exactly 32 bytes and the attacker can persuade a trusted CA to issue a certificate in which that string is a prefix of the certificate's common name. (To exploit this for a server name, he'd also have to send the connection astray via phony DNS data or some such.) The case that this is a realistic security threat is a bit thin, but nonetheless we'll treat it as one. Back-patch to 8.4. Older releases contain the faulty code, but it's not a security problem because the common name wasn't used for anything interesting. Reported and patched by Heikki Linnakangas Security: CVE-2012-0867
*	Require execute permission on the trigger function for CREATE TRIGGER.	Tom Lane	2012-02-23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This check was overlooked when we added function execute permissions to the system years ago. For an ordinary trigger function it's not a big deal, since trigger functions execute with the permissions of the table owner, so they couldn't do anything the user issuing the CREATE TRIGGER couldn't have done anyway. However, if a trigger function is SECURITY DEFINER, that is not the case. The lack of checking would allow another user to install it on his own table and then invoke it with, essentially, forged input data; which the trigger function is unlikely to realize, so it might do something undesirable, for instance insert false entries in an audit log table. Reported by Dinesh Kumar, patch by Robert Haas Security: CVE-2012-0866
*	Remove inappropriate quotes	Peter Eisentraut	2012-02-23
\| \| \| \|	And adjust wording for consistency.
*	Fix build without OpenSSL	Peter Eisentraut	2012-02-23
\| \| \| \|	This is a fixup for commit a445cb92ef5b3a31313ebce30e18cc1d6e0bdecb.
*	Make EXPLAIN (BUFFERS) track blocks dirtied, as well as those written.	Robert Haas	2012-02-22
\| \| \| \| \| \|	Also expose the new counters through pg_stat_statements. Patch by me. Review by Fujii Masao and Greg Smith.
*	Fix typo in comment.	Robert Haas	2012-02-22
\| \| \| \|	Sandro Santilli
*	Add parameters for controlling locations of server-side SSL files	Peter Eisentraut	2012-02-22
\| \| \| \| \| \| \| \| \| \| \| \|	This allows changing the location of the files that were previously hard-coded to server.crt, server.key, root.crt, root.crl. server.crt and server.key continue to be the default settings and are thus required to be present by default if SSL is enabled. But the settings for the server-side CA and CRL are now empty by default, and if they are set, the files are required to be present. This replaces the previous behavior of ignoring the functionality if the files were not found.
*	REASSIGN OWNED: Support foreign data wrappers and servers	Alvaro Herrera	2012-02-22
\| \| \| \| \| \| \|	This was overlooked when implementing those kinds of objects, in commit cae565e503c42a0942ca1771665243b4453c5770. Per report from Pawel Casperek.
*	Don't clear btpo_cycleid during _bt_vacuum_one_page.	Tom Lane	2012-02-21
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	When "vacuuming" a single btree page by removing LP_DEAD tuples, we are not actually within a vacuum operation, but rather in an ordinary insertion process that could well be running concurrently with a vacuum. So clearing the cycleid is incorrect, and could cause the concurrent vacuum to miss removing tuples that it needs to remove. This is a longstanding bug introduced by commit e6284649b9e30372b3990107a082bc7520325676 of 2006-07-25. I believe it explains Maxim Boguk's recent report of index corruption, and probably some other previously unexplained reports. In 9.0 and up this is a one-line fix; before that we need to introduce a flag to tell _bt_delitems what to do.
*	Cosmetic cleanup for commit a760893dbda9934e287789d54bbd3c4ca3914ce0.	Tom Lane	2012-02-21
\| \| \| \|	Mostly, fixing overlooked comments.
*	Avoid double close of file handle in syslogger on win32	Magnus Hagander	2012-02-21
\| \| \| \| \| \| \|	This causes an exception when running under a debugger or in particular when running on a debug version of Windows. Patch from MauMau
*	Fix typo, noticed by Will Crawford.	Andrew Dunstan	2012-02-21
\|
*	Fix a couple of cases of JSON output.	Andrew Dunstan	2012-02-20
\| \| \| \| \| \|	First, as noted by Itagaki Takahiro, a datum of type JSON doesn't need to be escaped. Second, ensure that numeric output not in the form of a legal JSON number is quoted and escaped.
*	Fix regex back-references that are directly quantified with *.	Tom Lane	2012-02-20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The syntax "\n", that is a backref with a quantifier directly applied to it, has never worked correctly in Spencer's library. This has been an open bug in the Tcl bug tracker since 2005: https://sourceforge.net/tracker/index.php?func=detail&aid=1115587&group_id=10894&atid=110894 The core of the problem is in parseqatom(), which first changes "\n" to "\n+\|" and then applies repeat() to the NFA representing the backref atom. repeat() thinks that any arc leading into its "rp" argument is part of the sub-NFA to be repeated. Unfortunately, since parseqatom() already created the arc that was intended to represent the empty bypass around "\n+", this arc gets moved too, so that it now leads into the state loop created by repeat(). Thus, what was supposed to be an "empty" bypass gets turned into something that represents zero or more repetitions of the NFA representing the backref atom. In the original example, in place of ^([bc])\1$ we now have something that acts like ^([bc])(\1+\|[bc])$ At runtime, the branch involving the actual backref fails, as it's supposed to, but then the other branch succeeds anyway. We could no doubt fix this by some rearrangement of the operations in parseqatom(), but that code is plenty ugly already, and what's more the whole business of converting "x" to "x+\|" probably needs to go away to fix another problem I'll mention in a moment. Instead, this patch suppresses the *-conversion when the target is a simple backref atom, leaving the case of m == 0 to be handled at runtime. This makes the patch in regcomp.c a one-liner, at the cost of having to tweak cbrdissect() a little. In the event I went a bit further than that and rewrote cbrdissect() to check all the string-length-related conditions before it starts comparing characters. It seems a bit stupid to possibly iterate through many copies of an n-character backreference, only to fail at the end because the target string's length isn't a multiple of n --- we could have found that out before starting. The existing coding could only be a win if integer division is hugely expensive compared to character comparison, but I don't know of any modern machine where that might be true. This does not fix all the problems with quantified back-references. In particular, the code is still broken for back-references that appear within a larger expression that is quantified (so that direct insertion of the quantification limits into the BACKREF node doesn't apply). I think fixing that will take some major surgery on the NFA code, specifically introducing an explicit iteration node type instead of trying to transform iteration into concatenation of modified regexps. Back-patch to all supported branches. In HEAD, also add a regression test case for this. (It may seem a bit silly to create a regression test file for just one test case; but I'm expecting that we will soon import a whole bunch of regex regression tests from Tcl, so might as well create the infrastructure now.)
*	Add caching of ctype.h/wctype.h results in regc_locale.c.	Tom Lane	2012-02-19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	While this doesn't save a huge amount of runtime, it still seems worth doing, especially since I realized that the data copying I did in my first draft was quite unnecessary. In this version, once we have the results cached, getting them back for re-use is really very cheap. Also, remove the hard-wired limitation to not consider wctype.h results for character codes above 255. It turns out that we can't push the limit as far up as I'd originally hoped, because the regex colormap code is not efficient enough to cope very well with character classes containing many thousand letters, which a Unicode locale is entirely capable of producing. Still, we can push it up to U+7FF (which I chose as the limit of 2-byte UTF8 characters), which will at least make Eastern Europeans happy pending a better solution. Thus, this commit resolves the specific complaint in bug #6457, but not the more general issue that letters of non-western alphabets are mostly not recognized as matching [[:alpha:]].
*	Create the beginnings of internals documentation for the regex code.	Tom Lane	2012-02-19
\| \| \| \| \| \| \| \| \| \|	Create src/backend/regex/README to hold an implementation overview of the regex package, and fill it in with some preliminary notes about the code's DFA/NFA processing and colormap management. Much more to do there of course. Also, improve some code comments around the colormap and cvec code. No functional changes except to add one missing assert.
*	Improve pretty printing of viewdefs.	Andrew Dunstan	2012-02-19
\| \| \| \| \| \| \| \| \|	Some line feeds are added to target lists and from lists to make them more readable. By default they wrap at 80 columns if possible, but the wrap column is also selectable - if 0 it wraps after every item. Andrew Dunstan, reviewed by Hitoshi Harada.
*	Sync regex code with Tcl 8.5.11.	Tom Lane	2012-02-17
\| \| \| \| \| \| \| \| \|	Sync our regex code with upstream changes since last time we did this, which was Tcl 8.5.0 (see commit df1e965e12cdd48c11057ee6e15346ee2b8b02f5). There are no functional changes here; the main point is just to lay down a commit-log marker that somebody has looked at this recently, and to do what we can to keep the two codebases comparable.
*	Improve statistics estimation to make some use of DISTINCT in sub-queries.	Tom Lane	2012-02-16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Formerly, we just punted when trying to estimate stats for variables coming out of sub-queries using DISTINCT, on the grounds that whatever stats we might have for underlying table columns would be inapplicable. But if the sub-query has only one DISTINCT column, we can consider its output variable as being unique, which is useful information all by itself. The scope of this improvement is pretty narrow, but it costs nearly nothing, so we might as well do it. Per discussion with Andres Freund. This patch differs from the draft I submitted yesterday in updating various comments about vardata.isunique (to reflect its extended meaning) and in tweaking the interaction with security_barrier views. There does not seem to be a reason why we can't use this sort of knowledge even when the sub-query is such a view.
*	Run a portal's cleanup hook immediately when pushing it to FAILED state.	Tom Lane	2012-02-15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This extends the changes of commit 6252c4f9e201f619e5eebda12fa867acd4e4200e so that we run the cleanup hook earlier for failure cases as well as success cases. As before, the point is to avoid an assertion failure from an Assert I added in commit a874fe7b4c890d1fe3455215a83ca777867beadd, which was meant to check that no user-written code can be called during portal cleanup. This fixes a case reported by Pavan Deolasee in which the Assert could be triggered during backend exit (see the new regression test case), and also prevents the possibility that the cleanup hook is run after portions of the portal's state have already been recycled. That doesn't really matter in current usage, but it foreseeably could matter in the future. Back-patch to 9.1 where the Assert in question was added.
*	Fix VPATH builds, broken by my recent commit to speed up tuplesorting.	Robert Haas	2012-02-15
\| \| \| \|	The relevant commit is 337b6f5ecf05b21b5e997986884d097d60e4e3d0.
*	Speed up in-memory tuplesorting.	Robert Haas	2012-02-15
\| \| \| \| \| \| \| \| \| \| \| \| \|	Per recent work by Peter Geoghegan, it's significantly faster to tuplesort on a single sortkey if ApplySortComparator is inlined into quicksort rather reached via a function pointer. It's also faster in general to have a version of quicksort which is specialized for sorting SortTuple objects rather than objects of arbitrary size and type. This requires a couple of additional copies of the quicksort logic, which in this patch are generate using a Perl script. There might be some benefit in adding further specializations here too, but thus far it's not clear that those gains are worth their weight in code footprint.
*	Make CREATE/ALTER FUNCTION support NOT LEAKPROOF.	Robert Haas	2012-02-15
\| \| \| \|	Because it isn't good to be able to turn things on, and not off again.
*	Preserve column names in the execution-time tupledesc for a RowExpr.	Tom Lane	2012-02-14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The hstore and json datatypes both have record-conversion functions that pay attention to column names in the composite values they're handed. We used to not worry about inserting correct field names into tuple descriptors generated at runtime, but given these examples it seems useful to do so. Observe the nicer-looking results in the regression tests whose results changed. catversion bump because there is a subtle change in requirements for stored rule parsetrees: RowExprs from ROW() constructs now have to include field names. Andrew Dunstan and Tom Lane
*	Allow LEAKPROOF functions for better performance of security views.	Robert Haas	2012-02-13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We don't normally allow quals to be pushed down into a view created with the security_barrier option, but functions without side effects are an exception: they're OK. This allows much better performance in common cases, such as when using an equality operator (that might even be indexable). There is an outstanding issue here with the CREATE FUNCTION / ALTER FUNCTION syntax: there's no way to use ALTER FUNCTION to unset the leakproof flag. But I'm committing this as-is so that it doesn't have to be rebased again; we can fix up the grammar in a future commit. KaiGai Kohei, with some wordsmithing by me.
*	Fix heap_multi_insert to set t_self field in the caller's tuples.	Heikki Linnakangas	2012-02-13
\| \| \| \| \| \| \| \|	If tuples were toasted, heap_multi_insert didn't update the ctid on the original tuples. This caused a failure if there was an after trigger (including a foreign key), on the table, and a tuple got toasted. Per off-list report and test case from Ted Phelps
*	Add a comment to AdjustIntervalForTypmod to reduce chance of future bugs.	Robert Haas	2012-02-09
\| \| \| \| \| \|	It's not entirely evident how the logic here relates to the interval_transform function, so let's clue people in that they need to check that if the rules change.
*	Improve interval_transform function to detect a few more cases.	Robert Haas	2012-02-09
\| \| \| \|	Noah Misch, per a review comment from me.
*	Add new keywords SNAPSHOT and TYPES to the keyword list in gram.y	Heikki Linnakangas	2012-02-09
\| \| \| \| \| \| \| \|	These were added to kwlist.h as unreserved keywords in separate patches, but authors forgot to add them to the corresponding list in gram.y. Because of that, even though they were supposed to be unreserved keywords, they could not be used as identifiers. src/tools/check_keywords.pl is your friend.
*	Throw error sooner for unlogged GiST indexes.	Tom Lane	2012-02-08
\| \| \| \| \| \|	Throwing an error only after we've built the main index fork is pretty unfriendly when the table already contains data. Per gripe from Jay Levitt.
*	Check misplaced window functions before checking aggregate/group by sanity.	Tom Lane	2012-02-08
\| \| \| \| \| \| \| \| \| \| \|	If somebody puts a window function in WHERE, we should complain about that in so many words. The previous coding tended to complain about the window function's arguments instead, which is likely to be misleading to users who are unclear on the semantics of window functions; as seen for example in bug #6440 from Matyas Novak. Just another example of how "add new code at the end" is frequently a bad heuristic.
*	Add transform functions for various temporal typmod coercisions.	Robert Haas	2012-02-08
\| \| \| \| \| \|	This enables ALTER TABLE to skip table and index rebuilds in some cases. Noah Misch, with trivial changes by me.
*	Rename LWLockWaitUntilFree to LWLockAcquireOrWait.	Heikki Linnakangas	2012-02-08
\| \| \| \| \|	LWLockAcquireOrWait makes it more clear that the lock is acquired if it's free.
*	Fix typos pointed out by Noah Misch.	Robert Haas	2012-02-07
\|
*	Add a transform function for varbit typmod coercisions.	Robert Haas	2012-02-07
\| \| \| \| \| \| \| \|	This enables ALTER TABLE to skip table and index rebuilds when the new type is unconstraint varbit, or when the allowable number of bits is not decreasing. Noah Misch, with review and a fix for an OID collision by me.
*	Add a transform function for numeric typmod coercisions.	Robert Haas	2012-02-07
\| \| \| \| \| \| \| \| \|	This enables ALTER TABLE to skip table and index rebuilds when a column is changed to an unconstrained numeric, or when the scale is unchanged and the precision does not decrease. Noah Misch, with a few stylistic changes and a fix for an OID collision by me.
*	Add TIMING option to EXPLAIN, to allow eliminating of timing overhead.	Robert Haas	2012-02-07
\| \| \| \| \| \| \| \|	Sometimes it may be useful to get actual row counts out of EXPLAIN (ANALYZE) without paying the cost of timing every node entry/exit. With this patch, you can say EXPLAIN (ANALYZE, TIMING OFF) to get that. Tomas Vondra, reviewed by Eric Theise, with minor doc changes by me.
*	When building with LWLOCK_STATS, initialize the stats in LWLockWaitUntilFree.	Heikki Linnakangas	2012-02-07
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	If LWLockWaitUntilFree was called before the first LWLockAcquire call, you would either crash because of access to uninitialized array or account the acquisition incorrectly. LWLockConditionalAcquire doesn't have this problem because it doesn't update the lwlock stats. In practice, this never happens because there is no codepath where you would call LWLockWaitUntilfree before LWLockAcquire after a new process is launched. But that's just accidental, there's no guarantee that that's always going to be true in the future. Spotted by Jeff Janes.
*	Fix postmaster to attempt restart after a hot-standby crash.	Tom Lane	2012-02-06
\| \| \| \| \| \| \| \| \| \| \| \|	The postmaster was coded to treat any unexpected exit of the startup process (i.e., the WAL replay process) as a catastrophic crash, and not try to restart it. This was OK so long as the startup process could not have any sibling postmaster children. However, if a hot-standby backend crashes, we SIGQUIT the startup process along with everything else, and the resulting exit is hardly "unexpected". Treating it as such meant we failed to restart a standby server after any child crash at all, not only a crash of the WAL replay process as intended. Adjust that. Back-patch to 9.0 where hot standby was introduced.
*	Avoid throwing ERROR during WAL replay of DROP TABLESPACE.	Tom Lane	2012-02-06
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Although we will not even issue an XLOG_TBLSPC_DROP WAL record unless removal of the tablespace's directories succeeds, that does not guarantee that the same operation will succeed during WAL replay. Foreseeable reasons for it to fail include temp files created in the tablespace by Hot Standby backends, wrong directory permissions on a standby server, etc etc. The original coding threw ERROR if replay failed to remove the directories, but that is a serious overreaction. Throwing an error aborts recovery, and worse means that manual intervention will be needed to get the database to start again, since otherwise the same error will recur on subsequent attempts to replay the same WAL record. And the consequence of failing to remove the directories is only that some probably-small amount of disk space is wasted, so it hardly seems justified to throw an error. Accordingly, arrange to report such failures as LOG messages and keep going when a failure occurs during replay. Back-patch to 9.0 where Hot Standby was introduced. In principle such problems can occur in earlier releases, but Hot Standby increases the odds of trouble significantly. Given the lack of field reports of such issues, I'm satisfied with patching back as far as the patch applies easily.
*	Add locking around WAL-replay modification of shared-memory variables.	Tom Lane	2012-02-06
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Originally, most of this code assumed that no Postgres backends could be running concurrently with it, and so no locking could be needed. That assumption fails in Hot Standby. While it's still true that Hot Standby backends should never change values like nextXid, they can examine them, and consistency is important in some cases such as when computing a snapshot. Therefore, prudence requires that WAL replay code obtain the relevant locks when modifying such variables, even though it can examine them without taking a lock. We were following that coding rule in some places but not all. This commit applies the coding rule uniformly to all updates of ShmemVariableCache and MultiXactState fields; a search of the replay routines did not find any other cases that seemed to be at risk. In addition, this commit fixes a longstanding thinko in replay of NEXTOID and checkpoint records: we tried to advance nextOid only if it was behind the value in the WAL record, but the comparison would draw the wrong conclusion if OID wraparound had occurred since the previous value. Better to just unconditionally assign the new value, since OID assignment shouldn't be happening during replay anyway. The additional locking seems to be more in the nature of future-proofing than fixing any live bug, so I am not going to back-patch it. The NEXTOID fix will be back-patched separately.
*	Fix transient clobbering of shared buffers during WAL replay.	Tom Lane	2012-02-05
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	RestoreBkpBlocks was in the habit of zeroing and refilling the target buffer; which was perfectly safe when the code was written, but is unsafe during Hot Standby operation. The reason is that we have coding rules that allow backends to continue accessing a tuple in a heap relation while holding only a pin on its buffer. Such a backend could see transiently zeroed data, if WAL replay had occasion to change other data on the page. This has been shown to be the cause of bug #6425 from Duncan Rance (who deserves kudos for developing a sufficiently-reproducible test case) as well as Bridget Frey's re-report of bug #6200. It most likely explains the original report as well, though we don't yet have confirmation of that. To fix, change the code so that only bytes that are supposed to change will change, even transiently. This actually saves cycles in RestoreBkpBlocks, since it's not writing the same bytes twice. Also fix seq_redo, which has the same disease, though it has to work a bit harder to meet the requirement. So far as I can tell, no other WAL replay routines have this type of bug. In particular, the index-related replay routines, which would certainly be broken if they had to meet the same standard, are not at risk because we do not have coding rules that allow access to an index page when not holding a buffer lock on it. Back-patch to 9.0 where Hot Standby was added.
*	Improve comment.	Tom Lane	2012-02-04
\|
*	Add missing Assert and fix inaccurate elog message in standby_redo().	Tom Lane	2012-02-04
\| \| \| \| \| \|	All other WAL redo routines either call RestoreBkpBlocks() or Assert that they haven't been passed any backup blocks. Make this one do likewise. Also, fix incorrect routine name in its failure message.
*	Allow SQL-language functions to reference parameters by name.	Tom Lane	2012-02-04
\| \| \| \|	Matthew Draper, reviewed by Hitoshi Harada
*	Add array_to_json and row_to_json functions.	Andrew Dunstan	2012-02-03
\| \| \| \| \| \| \|	Also move the escape_json function from explain.c to json.c where it seems to belong. Andrew Dunstan, Reviewd by Abhijit Menon-Sen.
*	Allow spgist's text_ops to handle pattern-matching operators.	Robert Haas	2012-02-02
\| \| \| \| \| \| \|	This was presumably intended to work this way all along, but a few key bits of indxpath.c didn't get the memo. Robert Haas and Tom Lane