PostgreSQL extension to schedules and manages jobs in a job queue similar to Oracle DBMS_JOB package.
- Description
- Installation
- Manage the extension
- Running the scheduler
- Configuration
- Jobs definition
- View ALL_JOBS
- Security
- Jobs execution history
- Procedures
- Limitations
- Authors
- License
This PostgreSQL extension provided full compatibility with the DBMS_JOB Oracle module.
It allows to manage scheduled jobs from a job queue or to execute immediately jobs asynchronously. A job definition consist on a code to execute, the next date of execution and how often the job is to be run. A job runs a SQL command, plpgsql code or an existing stored procedure.
If the submit stored procedure is called without the next_date (when) and interval (how often) attributes, the job is executed immediately in an asynchronous process. If interval is NULL and that next_date is lower or equal to current timestamp the job is also executed immediately as an asynchronous process. In all other cases the job is to be started when appropriate but if interval is NULL the job is executed only once and the job is deleted.
If a scheduled job completes successfully, then its new execution date is placed in next_date. The new date is calculated by evaluating the SQL expression defined as interval. The interval parameter must evaluate to a time in the future.
This extension consist in a SQL script to create all the objects related to its operation and a daemon that must be run attached to the database where jobs are defined. The daemon is responsible to execute the queued asynchronous jobs and the scheduled ones. It can be run on the same host of the database, where the jobs are defined, or on any other host. The schedule time is taken from the database host not where the daemon is running.
The number of jobs that can be executed at the same time is limited to 1000 by default. If this limit is reached the daemon will wait that a process ends to run a new one.
The use of an external scheduler daemon instead of a background worker is a choice, being able to fork thousands of sub-processes from a background worker is not a good idea.
The job execution is caused by a NOTIFY event received by the scheduler when a new job is submitted or modified. The notifications are polled every 0.1 second. When there is no notification the scheduler polls every job_queue_interval
seconds (5 seconds by default) the tables where job definition are stored. This mean that at worst a job will be executed job_queue_interval
seconds after the next execution date defined.
There is no special requirement to run this extension but your PostgreSQL version must support extensions (>= 9.1) and Perl must be available as well as the DBI, DBD::Pg and Time::Hires Perl modules. If your distribution doesn't include these Perl modules you can always install them using CPAN:
perl -MCPAN -e 'install DBI'
perl -MCPAN -e 'install DBD::Pg'
or in Debian like distribution use:
apt install libdbi-perl libpg-perl
and on RPM based system:
yum install perl-DBI perl-DBD-Pg perl-Time-HiRes
To install the extension execute
make
sudo make install
Test of the extension can be done using:
make installcheck
Each database that needs to use pg_dbms_job
must creates the extension:
psql -d mydb -c "CREATE EXTENSION pg_dbms_job"
To upgrade to a new version execute:
psql -d mydb -c 'ALTER EXTENSION pg_dbms_job UPDATE TO "1.2.0"'
If you doesn't have the privileges to create an extension you can just import the extension file into the database, for example:
psql -d mydb -f sql/pg_dbms_job--1.2.0.sql
This is especially useful for database in DBaas cloud services. To upgrade just import the extension upgrade files using psql.
A dedicated scheduler per database using the extension must be started.
The scheduler is a Perl program that runs in background it can be executed by any system user as follow:
pg_dbms_job -c /etc/pg_dbms_job/mydb-dbms_job.conf
There must be one scheduler daemon running per database using the extension with a dedicated configuration file.
The configuration file must define the database connection settings where the pg_dbms_job extension is used. This connection must be the extension tables owner or have the superuser privileges to be able to bypass the Row Level Security rules defined on the pg_dbms_job tables.
usage: pg_dbms_job [options]
options:
-c, --config file configuration file. Default: /etc/pg_dbms_job/pg_dbms_job.conf
-d, --debug run in debug mode.
-k, --kill stop current running daemon gracefully waiting
for all job completion.
-m, --immediate stop running daemon and jobs immediatly.
-r, --reload reload configuration file and jobs definition.
-s, --single do not detach and run in single loop mode and exit.
To stop gracefully the scheduler daemon after all running jobs are terminated, you can run the same command but with the -k
option:
pg_dbms_job -c /etc/pg_dbms_job/mydb-dbms_job.conf -k
you can also send the TERM signal to the main process:
$ ps auwx | grep "pg_dbms_job:main" | grep -v grep
gilles 14754 0.0 0.0 39636 17492 ? Ss 10:15 0:00 pg_dbms_job:main
$ kill -15 14754
To force the scheduler to stop immedialely interrupting the running jobs use the -m
option:
pg_dbms_job -c /etc/pg_dbms_job/mydb-dbms_job.conf -m
or send the INT signal:
$ ps auwx | grep "pg_dbms_job:main" | grep -v grep
gilles 14754 0.0 0.0 39636 17492 ? Ss 10:15 0:00 pg_dbms_job:main
$ kill -2 14754
The format of the configuration file is the same as postgresql.conf
.
debug
: debug mode. Default 0, disabled.pidfile
: path to pid file. Default to/tmp/pg_dbms_job.pid
.logfile
: log file name pattern, can include strftime() escapes, for example to have a log file per week day use%a
in the log file name. Default/tmp/pg_dbms_job.log
.log_truncate_on_rotation
: If activated an existing log file with the same name as the new log file will be truncated rather than appended to. But such truncation only occurs on time-driven rotation, not on restarts. Default0
, disabled.job_queue_interval
: poll interval of the jobs queue. Default 5 seconds.job_queue_processes
: Maximum number of job processed at the same time. Default1000
.
host
: ip adresse or hostname where the PostgreSQL cluster is running.port
: port where the PostgreSQL cluster is listening.database
: name of the database where to connect.user
: username used to connect to the database, it must be a superuser role.passwd
: password for this role.
#-----------
# General
#-----------
# Toogle debug mode
debug=0
# Path to the pid file
pidfile=/tmp/pg_dbms_job.pid
# log file name pattern, can include strftime() escapes, for example
# to have a log file per week day use %a in the log file name.
logfile=/tmp/pg_dbms_job.log
# If activated an existing log file with the same name as the new log
# file will be truncated rather than appended to. But such truncation
# only occurs on time-driven rotation, not on restarts.
log_truncate_on_rotation=0
# Poll interval of the job queue
job_queue_interval=5
#Maximum number of job processed at the same time
job_queue_processes=1000
#-----------
# Database
#-----------
host=localhost
port=5432
database=dbms_job
user=gilles
passwd=gilles
To force the scheduler to reread the configuration file after changes you can use the -r
option:
pg_dbms_job -c /etc/pg_dbms_job/mydb-dbms_job.conf -r
or send the HUP signal:
$ ps auwx | grep "pg_dbms_job:main" | grep -g grep
gilles 14758 0.0 0.0 39636 17492 ? Ss 10:17 0:00 pg_dbms_job:main
$ kill -1 14758
Jobs to run are stored in table dbms_job.all_scheduled_jobs
which is the same structure as the one in Oracle. Some columns are just here for compatibility but are not used. They are executed when current timestamp of the database polled by the scheduler is upper or equal to the date defined in the next_date
attribute.
Unlike with cron-like scheduler, when the pg_dbms_job scheduler starts it executes all active jobs with a next date in the past. That also mean that the interval of execution will be the same but the first execution date will change.
CREATE TABLE dbms_job.all_scheduled_jobs
(
job bigint DEFAULT nextval('dbms_job.jobseq') PRIMARY KEY, -- identifier of job
log_user name DEFAULT current_user, -- user that submit the job
priv_user name DEFAULT current_user, -- user whose default privileges apply to this job (not used)
schema_user text DEFAULT current_setting('search_path'), -- default schema used to parse the job
last_date timestamp with time zone, -- date on which this job last successfully executed
last_sec text, -- same as last_date (not used)
this_date timestamp with time zone, -- date that this job started executing, null when the job is not running
this_sec text, -- same as this_date (not used)
next_date timestamp(0) with time zone NOT NULL, -- date that this job will next be executed
next_sec timestamp with time zone, -- same as next_date (not used)
total_time interval, -- total wall clock time spent by the system on this job, in seconds
broken boolean DEFAULT false, -- true: no attempt is made to run this job, false: an attempt is made to run this job
interval text, -- a date function, evaluated at the start of execution, becomes next next_date
failures bigint, -- number of times the job has started and failed since its last success
what text NOT NULL, -- body of the anonymous pl/sql block that the job executes
nls_env text, -- session parameters describing the nls environment of the job (not used)
misc_env bytea, -- Other session parameters that apply to this job (not used)
instance integer DEFAULT 0 -- ID of the instance that can execute or is executing the job (not used)
);
Job submitted without execution date are jobs that need to be executed asynchronously as soon as possible after being created. They are stored in the queue (FIFO) table dbms_job.all_async_jobs
.
Same as for scheduled jobs, if jobs exist in the queue at start of the scheduler, they are executed immediately.
CREATE TABLE dbms_job.all_async_jobs
(
job bigint DEFAULT nextval('dbms_job.jobseq') PRIMARY KEY, -- identifier of job
log_user name DEFAULT current_user, -- user that submit the job
schema_user text DEFAULT current_setting('search_path'), -- default search_path used to execute the job
create_date timestamp with time zone DEFAULT current_timestamp, -- date on which this job has been created.
what text NOT NULL -- body of the anonymous pl/sql block that the job executes
);
All jobs that have to be executed can be listed from the view dbms_job.all_jobs
, this is the equivalent of the Oracle table DBMS_JOB.ALL_JOBS. This view reports all jobs to be run by execution a union between the two tables described in previous chapters.
Jobs are only visible by their own creator. A user can not access to a job defined by an other user unless it has the superuser privileges or it is the owner of the pg_dbms_job tables.
By default a user can not use pg_dbms_job, he must be granted privileges to the pg_dbms_job objects as follow.
GRANT USAGE ON SCHEMA dbms_job TO <role>;
GRANT ALL ON ALL TABLES IN SCHEMA dbms_job TO <role>;
GRANT ALL ON ALL SEQUENCES IN SCHEMA dbms_job TO <role>;
GRANT EXECUTE ON ALL FUNCTIONS IN SCHEMA dbms_job TO <role>;
GRANT EXECUTE ON ALL PROCEDURES IN SCHEMA dbms_job TO <role>;
A job will be taken in account by the scheduler only when the transaction where it has been created is committed. It is transactional so no risk that it will be executed if the transaction is aborted.
When starting or when it is reloaded the pg_dbms_job daemon first checks that another daemon is not already attached to the same database. If this is the case it will refuse to continue. This is a double verification, the first one is on an existing pid file and the second is done by looking at pg_stat_activity to see if a pg_dbms_job:main
process already exists.
By default the scheduler allow 1000 job to be executed at the same time, you may want to control this value to a lower or a upper value. This limit can be changed in the configuration file with directive job_queue_processes
. Note that if your system doesn't enough resources to run all the job at the same time it could be problematic. You must also take attention to who is authorised to submit jobs because this could affect the performances of the server.
Jobs are executed with as the user that defined the job and with the search path used at the time of the job submission. This information is available in attributes log_user
and schema_user
of table dbms_job.all_scheduled_jobs
and dbms_job.all_async_jobs
. That mean that the database connection user of the scheduler must have the privilege to change the user using SET ROLE <jobuser>.
. This allow the user that have submitted the job to view its entries in the history table.
Oracle DBMS_JOB doesn't provide a log history. This feature is available in DBMS_SCHEDULER and the past activity of the scheduler can be seen in table ALL_SCHEDULER_JOB_RUN_DETAILS. This extension stores all PG_DBMS_JOB activity in a similar table named dbms_job.all_scheduler_job_run_details
.
CREATE TABLE dbms_job.all_scheduler_job_run_details
(
log_id bigserial PRIMARY KEY, -- unique identifier of the log entry
log_date timestamp with time zone DEFAULT current_timestamp, -- date of the log entry
owner name, -- owner of the scheduler job
job_name varchar(261), -- name of the scheduler job
job_subname varchar(261), -- Subname of the Scheduler job (for a chain step job)
status varchar(128), -- status of the job run
error char(5), -- error code in the case of an error
req_start_date timestamp with time zone, -- requested start date of the job run
actual_start_date timestamp with time zone, -- actual date on which the job was run
run_duration bigint, -- duration of the job run
instance_id integer, -- identifier of the instance on which the job was run
session_id integer, -- session identifier of the job run
slave_pid integer, -- process identifier of the slave on which the job was run
cpu_used integer, -- amount of cpu used for the job run
additional_info text -- additional information on the job run, error message, etc.
);
Disables or suspend job execution. This procedure sets the broken flag. Broken jobs are never run.
Syntax:
pg_dbms_job.broken (
job IN bigint,
broken IN boolean
next_date IN timestamp DEFAULT current_timestamp);
Parameters:
- job : ID of the job being run.
- broken : Sets the job as broken or not broken.
true
sets it as broken;false
sets it as not broken. - next_date : Next date when the job will be run, default is
current_timestamp
.
If you set job as broken while it is running, unlike Oracle, the scheduler will not reset the job's status to normal after the job completes. Therefore, you can execute this procedure for jobs when they are running they will be disabled.
Example:
BEGIN;
CALL pg_dbms_job.broken(12345, true);
COMMIT;
Alters any of the user-definable parameters associated with a job. Any value you do not want to change can be specified as NULL.
Syntax:
dbms_job.change (
job IN bigint,
what IN text,
next_date IN timestamp with time zone,
interval IN text
[, instance IN integer DEFAULT 0,
force IN boolean DEFAULT false ]);
Parameters:
- job : ID of the job being run.
- what : PL/SQL procedure to run.
- next_date : Next date when the job will be run.
- interval : Date function; evaluated immediately before the job starts running.
- instance : unused
- force : unused
Your job change will not be available for processing by the job queue in the background until it is committed. If the parameters what, next_date, or interval are NULL, then leave that value as it is.
Example:
Change the interval of execution of job 12345 to run every 3 days
BEGIN;
CALL pg_dbms_job.change(12345, null, null, 'current_timestamp + ''3 days''::interval');
COMMIT;
Alters the interval between executions for a specified job
Syntax:
dbms_job.interval (
job IN bigint,
interval IN text);
Parameters:
- job : ID of the job being run.
- interval : Code of the date function, evaluated immediately before the job starts running.
If the job completes successfully, then this new date is placed in next_date. interval
is evaluated by plugging it into the statement select interval into next_date;
The interval parameter must evaluate to a time in the future.
If interval evaluates to NULL and if a job completes successfully, then the job is automatically deleted from the queue.
With Oracle this is the kind of interval values that we can find:
- Execute daily:
SYSDATE + 1
- Execute once per week:
SYSDATE + 7
- Execute hourly:
SYSDATE + 1/24
- Execute every 2 hour:
SYSDATE + 2/24
- Execute every 12 hour:
SYSDATE + 12/24
- Execute every 10 min.:
SYSDATE + 10/1440
- Execute every 30 sec.:
SYSDATE + 30/86400
The equivalent to use with pg_dbms_job are the following:
- Execute daily:
date_trunc('second',LOCALTIMESTAMP) + '1 day'::interval
- Execute once per week:
date_trunc('second',LOCALTIMESTAMP) + '7 days'::interval
ordate_trunc('second',current_timestamp) + '1 week'::interval
- Execute hourly:
date_trunc('second',LOCALTIMESTAMP) + '1 hour'::interval
- Execute every 2 hour:
date_trunc('second',LOCALTIMESTAMP) + '2 hours'::interval
- Execute every 12 hour:
date_trunc('second',LOCALTIMESTAMP) + '12 hours'::interval
- Execute every 10 min.:
date_trunc('second',LOCALTIMESTAMP) + '10 minutes'::interval
- Execute every 30 sec.:
date_trunc('second',LOCALTIMESTAMP) + '30 secondes'::interval
Example:
BEGIN;
CALL pg_dbms_job.interval(12345, 'current_timestamp + '10 seconds'::interval);
COMMIT;
Alters the next execution time for a specified job
Syntax:
dbms_job.next_date (
job IN bigint,
next_date IN timestamp with time zone);
Parameters:
- job : ID of the job being run.
- next_date : Date of the next refresh: it is when the job will be automatically run, assuming there are background processes attempting to run it.
Example:
BEGIN;
CALL pg_dbms_job.next_date(12345, current_timestamp + '1 day'::interval);
COMMIT;
Removes specified job from the job queue. You can only remove jobs that you own. If this is run while the job is executing, it will not be interrupted but will not be run again.
Syntax:
dbms_job.remove (
job IN bigint);
Parameters:
- job : ID of the job being run.
Example:
BEGIN;
CALL pg_dbms_job.remove(12345);
COMMIT;
Forces a specified job to run. This procedure runs the job now. It runs even if it is broken. If it was broken and it runs successfully, the job is updated to indicates that it is no longer broken and goes back to running on its schedule.
Running the job recomputes next_date based on the time you run the procedure.
When runs in foreground there is no logging to the jobs history table but information on the dbms_job.all_scheduled_jobs table are updated in case of error or success. In case of error the exception is raise to the client.
Syntax:
dbms_job.run (
job IN bigint);
Parameters:
- job : ID of the job being run.
Example:
BEGIN;
CALL pg_dbms_job.run(12345, false);
COMMIT;
Submits a new job to the job queue. It chooses the job from the sequence dbms_job.jobseq.
Actually this is a function as PostgreSQL < 14 do not support out parameters.
Syntax
dbms_job.submit (
job OUT bigint,
what IN text,
[ next_date IN timestamp(0) with time zone DEFAULT current_timestamp
[ , interval IN text DEFAULT NULL
[ , no_parse IN boolean DEFAULT false ] ] ] );
Parameters:
- job : ID of the job being run.
- what : text of the code to the job to be run. This must be a valid SQL statement or block of plpgsql code. The SQL code that you submit in the
what
parameter is wrapped in the following plpgsql block:
DO $$
DECLARE
job bigint := $jobid;
next_date timestamp with time zone := current_timestamp;
broken boolean := false;
BEGIN
WHAT
END;
$$;
Ensure that you include the ; semi-colon with the statement.
- next_date : Next date when the job will be run.
- interval : Date function that calculates the next time to run the job. The default is NULL. This must evaluate to a either a future point in time or NULL.
- no_parse : Unused.
Example:
This submits a new job to the job queue. The job calls ANALYZE to generate optimizer statistics for the table public.accounts. The job is run every 24 hours:
BEGIN;
DO $$
DECLARE
jobno bigint;
BEGIN
SELECT dbms_job.submit(
'ANALYZE public.accounts.',
LOCALTIMESTAMP, 'LOCALTIMESTAMP + ''1 day''::interval') INTO jobno;
END;
COMMIT;
Alters the job description for a specified job. This procedure changes what an existing job does, and replaces its environment.
Syntax:
dbms_job.what (
job IN bigint,
what IN text);
Parameters:
- job : ID of the job being run.
- what : PL/SQL procedure to run.
Example:
BEGIN;
CALL dbms_job.what('ANALYZE public.accounts.');
COMMIT;
Following the job activity a certain amount of bloat can be created in queues tables which can slow down the collect of job to execute by the scheduler. In this case it is recommended to execute a VACUUM FULL on these tables periodically when there is no activity.
VACUUM FULL dbms_job.all_scheduled_jobs, dbms_job.all_async_jobs;
If you have a very high job execution use that generates thousands of NOTIFY per seconds you should better disable this feature to avoid filling the notify queue. The queue is quite large (8GB in a standard installation) but when it is full the transaction that emit the NOTIFY will fail. Once the queue is half full you will see warnings in the log file. If you experience this limitation you can disable this feature by dropping the triggers responsible of the notification.
DROP TRIGGER dbms_job_scheduled_notify_trg ON dbms_job.all_scheduled_jobs;
DROP TRIGGER dbms_job_async_notify_trg ON dbms_job.all_async_jobs;
Once the trigger are dropped the polling of job will only be done every job_queue_interval
seconds (5 seconds by default).
- Gilles Darold
This extension is free software distributed under the PostgreSQL License.
Copyright (c) 2021-2022 MigOps Inc.