postgresql - postgresql mirror

diff options

author	Thomas Munro <tmunro@postgresql.org>	2023-03-31 11:01:51 +1300
committer	Thomas Munro <tmunro@postgresql.org>	2023-03-31 11:34:03 +1300
commit	11c2d6fdf5af1aacec9ca2005543f1b0fc4cc364 (patch)
tree	24f7dcba5bd58fbf207e8e58b3f97291b2a873b4 /src/backend/utils/adt/pgstatfuncs.c
parent	ca7b3c4c00042038ba9c282c4807e05c0a527e42 (diff)
download	postgresql-11c2d6fdf5af1aacec9ca2005543f1b0fc4cc364.tar.gz postgresql-11c2d6fdf5af1aacec9ca2005543f1b0fc4cc364.zip

Parallel Hash Full Join.

Full and right outer joins were not supported in the initial implementation of Parallel Hash Join because of deadlock hazards (see discussion). Therefore FULL JOIN inhibited parallelism, as the other join strategies can't do that in parallel either. Add a new PHJ phase PHJ_BATCH_SCAN that scans for unmatched tuples on the inner side of one batch's hash table. For now, sidestep the deadlock problem by terminating parallelism there. The last process to arrive at that phase emits the unmatched tuples, while others detach and are free to go and work on other batches, if there are any, but otherwise they finish the join early. That unfairness is considered acceptable for now, because it's better than no parallelism at all. The build and probe phases are run in parallel, and the new scan-for-unmatched phase, while serial, is usually applied to the smaller of the two relations and is either limited by some multiple of work_mem, or it's too big and is partitioned into batches and then the situation is improved by batch-level parallelism. Author: Melanie Plageman <melanieplageman@gmail.com> Author: Thomas Munro <thomas.munro@gmail.com> Reviewed-by: Thomas Munro <thomas.munro@gmail.com> Discussion: https://postgr.es/m/CA%2BhUKG%2BA6ftXPz4oe92%2Bx8Er%2BxpGZqto70-Q_ERwRaSyA%3DafNg%40mail.gmail.com

Diffstat (limited to 'src/backend/utils/adt/pgstatfuncs.c')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: