BatchIt is a very simple F# library that allows you to write batched async functions and consume them as if they were non-batched.
It’s useful if you have DB queries that are executed very frequently. This may for example be the case when using GraphQL resolvers or similar (e.g. JSON-API with Felicity). BatchIt will also transparently take care of deduplicating your input arguments, so that the batched function receives a distinct set of inputs.
Install the BatchIt package from NuGet.
BatchIt centers around these two function signatures:
- Normal:
'a -> Async<'b>
- Accepts a single argument
'a
(which can be a tuple or a record if you need multiple values) and returns a single value'b
- Accepts a single argument
- Batched:
'a [] -> Async<('a * 'b) []>'
- Accepts an array of
'a
(all the values to use for a single batch) and returns an array of output values'b
together with their corresponding input values (so that BatchIt can correlate inputs with outputs).
- Accepts an array of
Your job is to implement the batched function, and then you use BatchIt to configure timing and convert it to the normal version. Any calls to the normal version will then be batched using your configuration and your batched function.
type Person = { Name: string }
type FindArgs = { PersonId: Guid; TenantId: Guid }
let getPersonBatched (args: FindArgs []) : Async<(FindArgs * Person option) []> =
async {
// BatchIt is not concerned with what actually happens inside this function.
// Here you can e.g. query a DB with all the args. If using SQL, you might
// use TVP or XML to get the args into the query and use INNER JOIN instead
// of WHERE to filter your results.
// Note that the query must return the input args for each row along with the
// output values, so that you can create the FindArgs to return, too.
// Example:
let! rows = args |> ... // get rows for all input args
return
rows
|> Seq.toArray
|> Array.groupBy (fun r ->
{ PersonId = r.inputPersonId; TenantId = r.inputTenantId })
|> Array.map (fun (batchKey, rows) ->
batchKey,
rows |> Array.tryHead |> Option.map (fun r -> { Name = r.name }))
}
// Now create the "normal" function using BatchIt:
open BatchIt
let getPerson : PersonArgs -> Async<Person option> =
Batch.Create(getPersonBatched, 50, 100, 1000)
// Now any calls to the seemingly normal getPerson function will be batched.
// Each call will reset wait time to 50ms, and it will wait at least 100ms
// and at most 1000ms.
The non-batched function needs to be defined without arguments (as a module-level function-typed value), as shown above. Don’t do the following:
// Don't do this!
let getPerson args = Batch.Create(getPersonBatched, 50, 100, 1000) args
The above would result in a new batched version to be created for every invocation, and no batching would take place ( all batch sizes would be 1, plus there’d be a big overhead for each call).
If you want to make the non-batched function prettier (e.g. to name the arguments), simply wrap it in another function:
let private getPerson' = Batch.Create(getPersonBatched, 50, 100, 1000)
let getPerson args = getPerson' args
BatchIt’s only public API is the Batch.Create
overloads. You have the following options:
- Throttle-like: Execute the batch X ms after the first call
- Optionally also limit max batch size
- Debounce-like: Execute the batch X ms after the last call (timer resets when new calls come in before X ms), but wait
at least Y ms and at most Z ms
- Optionally also limit max batch size
- Specify the value the “normal” function should return if the batched output did not contain an item corresponding to
the supplied input
- For
option
,voption
,list
, andarray
, will automatically useNone
,ValueNone
,List.empty
, andArray.empty
, respectively - For other types, you must supply the missing value yourself
- For
BatchIt supports functions that accept one extra parameter in addition to the argument array. This can be used e.g. for
passing a connection string. (To pass multiple values, use a tuple or record.) The extra argument will be passed through
from the non-batched function returned by Batch.Create
.
Example:
let getPersonBatched
(connStr: string)
(args: FindArgs array)
: Async<(FindArgs * Person option) array> =
... // like shown previously, but obviously uses the connection string somewhere
open BatchIt
let getPerson : string -> PersonArgs -> Async<Person option> =
Batch.Create(getPersonBatched, 50, 100, 1000)
If the non-batched function is called with different values for the extra argument, all values are still collected and run as part of the same batch (regarding timing, max size, etc.), but the batched arguments are grouped by the corresponding extra argument value and the batched function is then invoked once for each unique extra value (with the corresponding batched arguments).
Contributions and ideas are welcome! Please see Contributing.md for details.