Adds background job scheduling and execution #10906

Closed
wants to merge 217 commits into from

Conversation

@cannikin (Member) commented Jul 2, 2024

This new package provides scheduling and processing of background jobs. We want everything needed to run a modern web application to be included in Redwood itself: you shouldn't need any third-party integrations if you don't want them. Background jobs have been sorely missed, but the time has come! (If you do want to use a third-party service, we have had an integration with Inngest since May of 2023!)

What's Included

  • A base RedwoodJob class that your own custom jobs extend. You only need to fill out the details of a single perform() method, accepting whatever arguments you want, and the underlying RedwoodJob code will take care of scheduling, delaying, running, and, if your job fails, recording the error and rescheduling it for a future retry.
  • Backend adapters for storing your jobs. Today we're shipping with a PrismaAdapter, but we also provide a BaseAdapter you can extend to build your own.
  • A persistent process to watch for new jobs and execute them. It can be run in dev mode, which stays attached to your console so you can monitor and execute jobs in development, or in daemon mode, which detaches from the console and runs in the background forever (you'll use this mode in production).

Decoupling the jobs from their backends means you can swap out backends as your app grows, or even use different backends for different jobs!

The actual Worker and Executor classes that know how to find a job and work on it are self-contained, so you can write your own runner if you want.

Features

  • Named queues: you can schedule jobs in separate named queues and have a different number of workers monitoring each one—makes it much easier to scale your background processing
  • Priority: give your jobs a priority from 1 (highest) to 100 (lowest). Workers will sort available jobs by priority, working the most important ones first.
  • Configurable delay: run your job as soon as possible (default), wait a number of seconds before running, or run at a specific time in the future
  • Run inline: instead of scheduling to run in the background, run immediately
  • Auto-retries with backoff: if your job fails it will back off at a rate of attempts ** 4, for a default of 24 tries; the wait between the last two attempts is a little over three days. The maximum number of retries is configurable per job (see the sketch after this list).
  • Integrates with Redwood's logger: use your existing one in api/src/lib/logger or create a new one just for job logging
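
For illustration, combining several of these options when scheduling might look like the sketch below. The option names queue, priority, and maxAttempts inside set() are assumptions made for this example (only wait and waitUntil appear elsewhere in this description), and ProcessPaymentJob is a hypothetical job:

// Hypothetical sketch: queue, priority, and maxAttempts are assumed option names
ProcessPaymentJob.set({
  queue: 'critical',  // a named queue other than the default one
  priority: 1,        // 1 is highest, 100 is lowest; workers run higher priorities first
  wait: 60,           // wait 60 seconds before the job is eligible to run
  maxAttempts: 10,    // assumed name for the per-job retry limit
}).performLater(payment.id)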

How it Works

Using the PrismaAdapter means your jobs are stored in your database. The yarn rw setup jobs script will add a BackgroundJob model to your schema.prisma file. Any job that is invoked with .performLater() will add a row to this table:

WelcomeEmailJob.performLater(user.email)

If using the PrismaAdapter, any arguments you want to give to your job must be serializable as JSON since the values will be stored in the database as text.
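
For example, prefer plain, JSON-serializable values (an email string, an ID, a simple object) over rich objects. This is only a sketch using the job from this PR's examples; anything beyond "the values are stored as text" is an assumption:

// Fine: plain values and plain objects serialize cleanly
WelcomeEmailJob.performLater(user.email)
WelcomeEmailJob.performLater({ email: user.email, locale: 'en' })

// Riskier: values like Date instances won't round-trip unchanged through JSON
// (a Date comes back as a string), so pass an ISO string or timestamp instead
WelcomeEmailJob.performLater({ email: user.email, signedUpAt: user.createdAt.toISOString() })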

The persistent job workers (started in dev with yarn rw jobs work or detached to run in the background with yarn rw jobs start) will periodically check the database for any jobs that are qualified to run: not already locked by another worker and with a runAt time before or equal to right now. They'll lock the record, instantiate your job class and call perform() on it, passing in the arguments you gave when scheduling it.

  • If the job succeeds it is removed from the database
  • If the job fails the error is recorded, the job is rescheduled to try again, and the lock is removed

Repeat until the queue is empty!
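
In pseudocode, the loop each worker runs looks roughly like the sketch below. This only illustrates the behavior described above, not the actual Worker/Executor source; the adapter method names and the loadJobClass helper are assumptions:

// Sketch of a single worker's loop; adapter method names are illustrative only
const sleep = (ms) => new Promise((resolve) => setTimeout(resolve, ms))

const workLoop = async ({ adapter, queue, logger }) => {
  while (true) {
    // find an unlocked job in this queue whose runAt is now or earlier, and lock it
    const job = await adapter.findAndLockNextJob({ queue, now: new Date() })

    if (!job) {
      await sleep(5000) // nothing to run; poll again shortly (interval is a guess)
      continue
    }

    try {
      // import the job class from the stored name/path and run it with the stored args
      const JobClass = await loadJobClass(job) // assumed helper
      await new JobClass().perform(...job.args)
      await adapter.delete({ job }) // success: remove the row from the database
    } catch (error) {
      logger.error(`Error in job: ${error.message}`)
      // failure: record the error, reschedule with backoff, and release the lock
      await adapter.reschedule({ job, error, wait: job.attempts ** 4 })
    }
  }
}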

Usage

Setup

To simplify the setup, run the included setup script:

yarn rw setup jobs

This creates api/src/lib/jobs with the basic config needed to get up and running, and adds the BackgroundJob model to your schema.prisma file.
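
The exact contents of the generated config aren't shown here, but conceptually it wires an adapter up to your database and logger. A rough sketch, where the package import path and export name are assumptions:

// api/src/lib/jobs.js (sketch only; the real generated file may differ)
import { PrismaAdapter } from '@redwoodjs/jobs' // package path is an assumption
import { db } from 'src/lib/db'
import { logger } from 'src/lib/logger'

// store jobs in the database via the BackgroundJob model added by the setup script
export const adapter = new PrismaAdapter({ db, logger })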

You can generate a job with the shell ready to go:

yarn rw g job WelcomeEmail

This creates a file at api/src/jobs/WelcomeEmailJob.js along with the shell of your job. All you need to do is fill out the perform() function:

// api/src/jobs/WelcomeEmailJob.js

export class WelcomeEmailJob extends RedwoodJob {
  perform(email) {
    // send email...
  }
}

Scheduling

A typical place you'd use this job would be in a service. In this case, let's add it to the users service after creating a user:

// api/src/services/users/users.js

export const createUser = async ({ input }) => {
  const user = await db.user.create({ data: input })
  await WelcomeEmailJob.performLater(user.email)
  return user
}

With the above syntax your job will run as soon as possible, in the queue named "default" and with a priority of 50. You can also delay your job for, say, 5 minutes:

OnboardingJob.set({ wait: 300 }).performLater(user.email)

Or run it at a specific time in the future:

MilleniumReminderJob.set({ waitUntil: new Date(2999, 11, 31, 12, 0, 0) }).performLater(user.email)

There are lots of ways to customize the scheduling and worker processes. Check out the docs for the full list!

Execution

To run your jobs, start up the runner:

yarn rw jobs work

This process will stay attached to the console and continually look for new jobs, executing them as they're found. To work through whatever outstanding jobs there are and then exit, use the workoff mode instead.

To run the worker(s) in the background, use the start mode:

yarn rw jobs start

To stop them:

yarn rw jobs stop

You can start more than one worker by passing the -n flag:

yarn rw jobs start -n 4

If you want to specify that some workers only work on certain named queues:

yarn rw jobs start -n default:2,email:1

Make sure you pass the same flags to the stop command as you did to start so it knows which workers to stop. You can restart your workers as well.
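
For example, if you started workers with the queue flags shown above, pass the same flags when stopping:

yarn rw jobs stop -n default:2,email:1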

In production you'll want to hook the workers up to a process monitor as, just like with any other process, they could die unexpectedly. More on this in the docs.

The Future

  • More adapters (maybe the community wants to get involved?): Redis, SQS, RabbitMQ
  • Studio integration: monitor the state of your outstanding jobs
  • Baremetal integration: if jobs are enabled, monitor the workers
  • Recurring jobs (performEvery?)
  • Lifecycle hooks: beforePerform(), afterPerform(), afterSuccess(), afterFailure()

The review thread below refers to this excerpt from the job runner code:

    await this.adapter.success({
      job: this.job,
      deleteJob: DEFAULT_DELETE_SUCCESSFUL_JOBS,
    })
  } catch (error: any) {
    this.logger.error(`Error in job ${this.job.id}: ${error.message}`)
    // TODO(jgmw): Handle the error 'any' better
    this.logger.error(`Error in job ${this.jobId}: ${error.message}`)

Member Author

I wanted this to be the job's actual ID so you can find it in the database...it was very common for me to see an error in the logs and then go find the job to get more info.

I do like the idea of having the name/path output in the logs though. I'd probably use a different name than jobId because I'd assume that's just the integer ID...jobName or jobPath or something? Kind of weird that we have job.name and job.path as well. fullJobName, jobResourceId, jobLocation ...

Collaborator

The reason I made this change was because we haven't enforced that the general BaseJob must have an ID. That's only available from the Prisma job right now. I agree this is less actionable when you see it in the terminal now, but I didn't want to go and add the constraint that any job must have a numeric ID without discussing it with you.

Collaborator

Kind of weird that we have job.name and job.path as well.

Not sure I follow what you mean by weird that these are here?

Member Author

Not sure I follow what you mean by weird that these are here?

haha Sorry, I meant that if we added this.jobName, but there's also this.job.name, it could be confusing, just like seeing this.jobId I assumed it would be the same value as this.job.id.

Member Author

The reason I made this change was because we haven't enforced that the general BaseJob must have an ID. That's only available from the Prisma job right now. I agree this is less actionable when you see it in the terminal now, but I didn't want to go and add the constraint that any job must have a numeric ID without discussing it with you.

Good point! I should have added id to the required fields in BaseJob I think, otherwise there isn't enough info between name, path, and args to uniquely identify it.

I don't think we can force it to be only numeric though; some people like UUIDs and GUIDs for their id fields. GROSS

@cannikin closed this Aug 13, 2024
@cannikin (Member Author)

Closing this one as most of the conversation is now irrelevant after the refactor. Opened a new one: #11238

@Josh-Walker-GM removed this from the next-release milestone Sep 4, 2024
Labels
fixture-ok (Override the test project fixture check), release:feature (This PR introduces a new feature)