[NB: Documentation is out of sync with development]
XSLT functions for SVG plotting
- Linear smoothing
- [In progress] Weighted smoothing
- [In progress] LOWESS smoothing
- Linear regression lines
- [In progress] Linear regression parabolas
- Bézier splines
Author: David J. Birnbaum
Email: [email protected]
URL: http://www.obdurodon.org
Repo: https://github.com/djbpitt/plot
- Clone the repo.
- Edit ee_config.xml or he_config.xml to adapt for your filesystem locations.
- The regression, lowess, smoothing. and spline packages can be incorporating into your code with
<xsl:use-package>
. The lib package is used by the others, and does not have to be imported directly.
- All point data is assumed to be in SVG coordinates, that is, with negative Y pointing up and positive Y pointing down. Input data in regular Cartesian coordinates must be preprocessed accordingly.
- Debug versions of the functions are intended to be run on their own. The subdirectories include driver XSLT stylesheets that generate sample output. Run with
saxon -config:path_to_config -it -xsl:driver_filename.xsl
.
The regression_line()
functions draw a regression line. Support for drawing a parabola is under development.
djb:regression_line($points as xs:string+, $debug as xs:boolean) as (element(svg:g), map(*))
Argument | Type | Default | Meaning |
---|---|---|---|
$points |
xs:string+ | (required) | A sequence of at least three strings, representing points in SVG coordinate space, each of which has the form X,Y , so that the X and Y values are separated by a comma without intervening whitespace. For example, the input "50,182", "100,166", "150,87", "200,191", "250,106" describes five points. Each X and Y value may include a decimal point and may be preceded by + or - . |
$debug |
xs:boolean | False |
If set to True , the function returns debugging information, in the form of a map, along with the primary SVG <g> result. |
Result | element(svg:g) | (none) | If $debug is False , an SVG <g> element that contains just an SVG <line> element representing a regression line through the data points. The <line> element has on @x1 , @y1 , x2 , and @y2 attributes, plus a @class attribute with the value "regression", which can be used by the classing program to control the rendering (specifying values for @stroke , stroke-width , etc.). If $debug is True , the <g> also plots the original points and a connecting SVG <polyline> , includes a background SVG <rect> , and specifies CSS for @stroke and @stroke-width . |
Result | map(*) | (none) | If $debug is True , in addition to the SVG <g> element, a map(*) with xs:double values for "m" (slope) and "b" (intercept) is returned. If $debug is False , this map is not returned. |
The function raises a fatal error if:
$points
contains fewer than three strings, each representing a point, or- any point does not match the pattern
X,Y
, with no whitespace and whereX
andY
are doubles with optional leading+
or-
signs, or - X is not ordered monotonically (consecutive equal values are allowed).
djb:regression_line($points as xs:string+) as element(svg:g)
Argument | Type | Default | Meaning |
---|---|---|---|
$points |
xs:string+ | (required) | A sequence of at least three strings, representing points in SVG coordinate space, each of which has the form X,Y , so that the X and Y values are separated by a comma without intervening whitespace. For example, the input "50,182", "100,166", "150,87", "200,191", "250,106" describes five points. Each X and Y value may include a decimal point and may be preceded by + or - . |
Result | element(svg:g) | (none) | An SVG <g> element that contains just an SVG <line> element representing a regression line through the data points. The <line> element has on @x1 , @y1 , x2 , and @y2 attributes, plus a @class attribute with the value "regression", which can be used by the classing program to control the rendering (specifying values for @stroke , stroke-width , etc.). |
- The function raises a fatal error if:
$points
contains fewer than three strings, each representing a point, or- any point does not match the pattern
X,Y
, with no whitespace and whereX
andY
are doubles with optional leading+
or-
signs, or - X is not ordered monotonically (consecutive equal values are allowed).
- The arity-1 version of
djb:regression_line()
is equivalent to calling the arity-2 version and specifyingFalse
as the value of the$debug
argument.
The smoothing package implements a simple moving average by returning each input point with the X
value unchanged, while the Y
value is replaced by the average of the points within a window of size $window
(default value is 3
) centered on the point at X
. The windows is always the same size; if there are not enough points (toward the beginning and end of the sequence of points) to center the current point, missing points are made up on the other side. For example, with a window size of 3
, both the first and second points return the average of the first three points, and the fourth returns the average of the second, third, and fourth.
Smoothing reduces the extent of the impact of outliers, making it easier to see a trend. But because the neighboring points carry as much weight as the central one, a local peak or trough does not mean the correspnding original data point is necessarily higher (resp. lower) than its immediate neighors.
djb:smoothing($points as xs:string+, $window as xs:integer) as xs:string+
Argument | Type | Default | Meaning |
---|---|---|---|
$points |
xs:string | (required) | A sequence of at least three strings, representing points in SVG coordinate space, each of which has the form X,Y , so that the X and Y values are separated by a comma without intervening whitespace. For example, the input "50,182", "100,166", "150,87", "200,191", "250,106" describes five points. Each X and Y value may include a decimal point and may be preceded by + or - . |
$window |
xs:integer | 3 | The number of points whose Y value is incorporated into the simple moving average returned for each input point. $window must be a positive odd number. |
Result | xs:string+ | (none) | A sequence of strings in the same format as $points , but with the Y values modified by smoothing. |
The function raises a fatal error if:
$points
contains fewer than three strings, each representing a point, or- any point does not match the pattern
X,Y
, with no whitespace and whereX
andY
are doubles with optional leading+
or-
signs, or - X is not ordered monotonically (consecutive equal values are allowed), or
$window
is not an odd positive integer.
djb:smoothing($points as xs:string+) as xs:string+
Argument | Type | Default | Meaning |
---|---|---|---|
$points |
xs:string | (required) | A sequence of at least three strings, representing points in SVG coordinate space, each of which has the form X,Y , so that the X and Y values are separated by a comma without intervening whitespace. For example, the input "50,182", "100,166", "150,87", "200,191", "250,106" describes five points. Each X and Y value may include a decimal point and may be preceded by + or - . |
Result | xs:string+ | (none) | A sequence of strings in the same format as $points , but with the Y values modified by smoothing. |
- The function raises a fatal error if:
$points
contains fewer than three strings, each representing a point, or- any point does not match the pattern
X,Y
, with no whitespace and whereX
andY
are doubles with optional leading+
or-
signs, or - X is not ordered monotonically (consecutive equal values are allowed).
- The arity-1 version of
djb:smoothing()
is equivalent to calling the arity-2 version and specifying3
as the value of the$window
argument.