Skip to content

Commit 28f8d3d

Browse files
authored
Merge pull request #288 from stan-dev/v1.0
Release version 1.0
2 parents 41b523f + 8f296a1 commit 28f8d3d

File tree

357 files changed

+3242
-378
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

357 files changed

+3242
-378
lines changed

.github/workflows/posteriordb_content.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -34,7 +34,7 @@ jobs:
3434
3535
- name: Cache R packages
3636
if: runner.os != 'Windows'
37-
uses: actions/cache@v2
37+
uses: actions/cache@v4
3838
with:
3939
path: ${{ env.R_LIBS_USER }}
4040
key: ${{ runner.os }}-r-${{ matrix.config.r }}-${{ hashFiles('DESCRIPTION') }}

LICENCE.md

Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,14 @@
1+
BSD 3-Clause License
2+
3+
Copyright (c) 2011-2024, Stan Developers and their Assignees
4+
All rights reserved.
5+
6+
Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met:
7+
8+
* Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer.
9+
10+
* Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution.
11+
12+
* Neither the name of the copyright holder nor the names of its contributors may be used to endorse or promote products derived from this software without specific prior written permission.
13+
14+
THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

README.Rmd

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -34,6 +34,11 @@ See [DATABASE_CONTENT.md](https://github.com/stan-dev/posteriordb/blob/master/do
3434

3535
We are happy with any help in adding posteriors, data, and models to the database! See [CONTRIBUTING.md](https://github.com/stan-dev/posteriordb/blob/master/doc/CONTRIBUTING.md) for the details on how to contribute.
3636

37+
## Licensing
38+
The posteriordb is licensed under the [new BSD
39+
license](https://github.com/stan-dev/posteriordb/LICENCE.md).
40+
41+
Most model code are using the the same BSD-3 licence. Although, some models and some data might have other open licences such as MIT. Each model has a ```licence``` element in the info JSON files that specifies the actual licence of the model. Some data might also have licences, then that is specified in a similar way.
3742

3843
## Using `posteriordb`
3944

README.md

Lines changed: 26 additions & 23 deletions
Original file line numberDiff line numberDiff line change
@@ -7,19 +7,16 @@ Content](https://github.com/stan-dev/posteriordb/actions/workflows/posteriordb_c
77
coverage](https://codecov.io/gh/stan-dev/posteriordb-r/branch/main/graph/badge.svg)](https://codecov.io/gh/stan-dev/posteriordb-r?branch=main)
88
[![Python](https://github.com/stan-dev/posteriordb-python/actions/workflows/push.yml/badge.svg)](https://github.com/stan-dev/posteriordb-python/actions/workflows/push.yml)
99

10-
`posteriordb`: a database of Bayesian posterior inference
11-
=========================================================
10+
# `posteriordb`: a database of Bayesian posterior inference
1211

13-
What is `posteriordb`?
14-
----------------------
12+
## What is `posteriordb`?
1513

1614
`posteriordb` is a set of posteriors, i.e. Bayesian statistical models
1715
and data sets, reference implementations in probabilistic programming
1816
languages, and reference posterior inferences in the form of posterior
1917
samples.
2018

21-
Why use `posteriordb`?
22-
----------------------
19+
## Why use `posteriordb`?
2320

2421
`posteriordb` is designed to test inference algorithms across a wide
2522
range of models and data sets. Applications include testing for
@@ -36,34 +33,42 @@ definitions, well-curated data sets, and reference posteriors.
3633
Python.
3734

3835
For more details regarding the use cases of `posteriordb`, see
39-
[doc/use\_cases.md](https://github.com/stan-dev/posteriordb/blob/master/doc/use_cases.md).
36+
[doc/use_cases.md](https://github.com/stan-dev/posteriordb/blob/master/doc/use_cases.md).
4037

41-
Content
42-
-------
38+
## Content
4339

4440
See
45-
[DATABASE\_CONTENT.md](https://github.com/stan-dev/posteriordb/blob/master/doc/DATABASE_CONTENT.md)
41+
[DATABASE_CONTENT.md](https://github.com/stan-dev/posteriordb/blob/master/doc/DATABASE_CONTENT.md)
4642
for the details content of the posterior database.
4743

48-
Contributing
49-
------------
44+
## Contributing
5045

5146
We are happy with any help in adding posteriors, data, and models to the
5247
database! See
5348
[CONTRIBUTING.md](https://github.com/stan-dev/posteriordb/blob/master/doc/CONTRIBUTING.md)
5449
for the details on how to contribute.
5550

56-
Using `posteriordb`
57-
-------------------
51+
## Licensing
52+
53+
The posteriordb is licensed under the [new BSD
54+
license](https://github.com/stan-dev/posteriordb/LICENCE.md).
55+
56+
Most model code are using the the same BSD-3 licence. Although, some
57+
models and some data might have other open licences such as MIT. Each
58+
model has a `licence` element in the info JSON files that specifies the
59+
actual licence of the model. Some data might also have licences, then
60+
that is specified in a similar way.
61+
62+
## Using `posteriordb`
5863

5964
To simplify the use of `posteriordb`, there are convenience functions
60-
both in R and in Python.
61-
- For R, see the [posteriordb-r](https://github.com/stan-dev/posteriordb-r) repository.
62-
- For Python, see the [posteriordb-python](https://github.com/stan-dev/posteriordb-python)
65+
both in Python and in R. To use R, see the
66+
[posteriordb-r](https://github.com/stan-dev/posteriordb-r) repository,
67+
and to use Python, see the
68+
[posteriordb-python](https://github.com/stan-dev/posteriordb-python)
6369
repository.
6470

65-
Citing `posteriordb`
66-
--------------------
71+
## Citing `posteriordb`
6772

6873
Developing and maintaining open-source software is an important yet
6974
often underappreciated contribution to scientific progress. Thus, please
@@ -81,8 +86,7 @@ M., Brubaker M., Guo J., Li P., and Riddell A. (2017). Stan: A
8186
probabilistic programming language. Journal of Statistical Software.
8287
76(1). 10.18637/jss.v076.i01
8388

84-
Design choices (so far)
85-
-----------------------
89+
## Design choices (so far)
8690

8791
The main focus of the database is simplicity, both in understanding and
8892
in use.
@@ -108,8 +112,7 @@ database.
108112
included in the PPL syntax files as comments, not in the
109113
model.info.json files.
110114

111-
Versioning of models
112-
--------------------
115+
## Versioning of models
113116

114117
We might update models included in posteriordb over time. However, the
115118
models will only have the same name in posteriordb if the log density is

croissant.json

Lines changed: 118 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,118 @@
1+
{
2+
"@context": {
3+
"@language": "en",
4+
"@vocab": "https://schema.org/",
5+
"citeAs": "cr:citeAs",
6+
"column": "cr:column",
7+
"conformsTo": "dct:conformsTo",
8+
"cr": "http://mlcommons.org/croissant/",
9+
"rai": "http://mlcommons.org/croissant/RAI/",
10+
"data": {
11+
"@id": "cr:data",
12+
"@type": "@json"
13+
},
14+
"dataType": {
15+
"@id": "cr:dataType",
16+
"@type": "@vocab"
17+
},
18+
"dct": "http://purl.org/dc/terms/",
19+
"examples": {
20+
"@id": "cr:examples",
21+
"@type": "@json"
22+
},
23+
"extract": "cr:extract",
24+
"field": "cr:field",
25+
"fileProperty": "cr:fileProperty",
26+
"fileObject": "cr:fileObject",
27+
"fileSet": "cr:fileSet",
28+
"format": "cr:format",
29+
"includes": "cr:includes",
30+
"isLiveDataset": "cr:isLiveDataset",
31+
"jsonPath": "cr:jsonPath",
32+
"key": "cr:key",
33+
"md5": "cr:md5",
34+
"parentField": "cr:parentField",
35+
"path": "cr:path",
36+
"recordSet": "cr:recordSet",
37+
"references": "cr:references",
38+
"regex": "cr:regex",
39+
"repeated": "cr:repeated",
40+
"replace": "cr:replace",
41+
"sc": "https://schema.org/",
42+
"separator": "cr:separator",
43+
"source": "cr:source",
44+
"subField": "cr:subField",
45+
"transform": "cr:transform"
46+
},
47+
"@type": "sc:Dataset",
48+
"name": "posteriordb",
49+
"description": " Probabilistic programming languages (PPL), such as Stan, Tensorflow Probability, PyMC, Pyro, and Turing.jl,are becoming increasingly popular for Bayesian probabilistic data analysis and predictive modelling. These languages all rely on general inference algorithms such as Markov chain Monte Carlo sampling or black-box variational inference. We introduce posteriordb, a database of models, compatible data sets, and reference posteriors intended to make evaluatingand comparing models easier and more trustworthy. Posteriordb currently includes mixed effect regressions, time series, hierarchical Gaussian processes, hidden Markov models, mixture models, ordinary differential equation models, and more. ",
50+
"conformsTo": "http://mlcommons.org/croissant/1.0",
51+
"url": "https://https://github.com/stan-dev/posteriordb",
52+
"distribution": [
53+
{
54+
"@type": "cr:FileObject",
55+
"@id": "github-repository",
56+
"name": "github-repository",
57+
"description": "PosteriorDB's GitHub repository.",
58+
"contentUrl": "https://https://github.com/stan-dev/posteriordb",
59+
"encodingFormat": "git+https",
60+
"sha256": "main"
61+
},
62+
{
63+
"@type": "cr:FileSet",
64+
"@id": "data-zip-files",
65+
"name": "data-zip-files",
66+
"description": "Datasets in json format stored in zip files.",
67+
"containedIn": {
68+
"@id": "github-repository"
69+
},
70+
"encodingFormat": "application/zip",
71+
"includes": "posterior_database/data/data/*.zip"
72+
},
73+
{
74+
"@type": "cr:FileSet",
75+
"@id": "data-csv-files",
76+
"name": "data-csv-files",
77+
"description": "Datasets stored in csv format.",
78+
"containedIn": {
79+
"@id": "github-repository"
80+
},
81+
"encodingFormat": "application/zip",
82+
"includes": "posterior_database/data/data-raw/*.csv"
83+
},
84+
{
85+
"@type": "cr:FileSet",
86+
"@id": "draws-zip-files",
87+
"name": "draws-zip-files",
88+
"description": "Posterior samples in json format stored in zip files.",
89+
"containedIn": {
90+
"@id": "github-repository"
91+
},
92+
"encodingFormat": "application/zip",
93+
"includes": "posterior_database/reference_posteriors/draws/draws/*.zip"
94+
},
95+
{
96+
"@type": "cr:FileSet",
97+
"@id": "stan-model-files",
98+
"name": "stan-model-files",
99+
"description": "Posterior models in .stan format.",
100+
"containedIn": {
101+
"@id": "github-repository"
102+
},
103+
"encodingFormat": "application/stan",
104+
"includes": "posterior_database/models/stan/*.stan"
105+
},
106+
{
107+
"@type": "cr:FileSet",
108+
"@id": "pymc3-model-files",
109+
"name": "pymc3-model-files",
110+
"description": "Posterior pymc3 models in .py format.",
111+
"containedIn": {
112+
"@id": "github-repository"
113+
},
114+
"encodingFormat": "application/py",
115+
"includes": "posterior_database/models/pymc3/*.py"
116+
}
117+
]
118+
}

doc/use_cases.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@
33
posteriordb has different uses. Below is list of example use cases and what features are important in each use case.
44

55

6-
## Unit testing
6+
## Testing
77

88
Developers of different algorithms can use the posterior database to get data, models and reference posteriors to asses that the software is working as it should.
99

posterior_database/models/info/2pl_latent_reg_irt.info.json

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@
33
"keywords": "item response model",
44
"title": "Two-parameter logistic item theory model with latent regression",
55
"description": "The Rasch model (Rasch 1960) is an item response theory model for dichotomous items. The two-parameter logistic model (2PL) (Swaminathan and Gifford 1985) is an item response theory model that includes parameters for both the difficulty and discrimination of dichotomous items. The version presented includes a latent regression. However, the latent regression part of the model may be restricted to an intercept only, resulting in a regular 2PL.",
6-
"urls": "https://github.com/danielcfurr/edstan",
6+
"urls": ["https://github.com/danielcfurr/edstan", "https://mc-stan.org/users/documentation/case-studies/rasch_and_2pl.html"],
77
"references": null,
88
"added_by": "Keane Nguyen",
99
"added_date": "2021-08-03",
@@ -12,5 +12,6 @@
1212
"model_code": "models/stan/2pl_latent_reg_irt.stan",
1313
"stan_version": ">=2.26.0"
1414
}
15-
}
15+
},
16+
"licence": "BSD3"
1617
}

posterior_database/models/info/GLMM1_model.info.json

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -12,5 +12,6 @@
1212
"model_code": "models/stan/GLMM1_model.stan",
1313
"stan_version": ">=2.26.0"
1414
}
15-
}
15+
},
16+
"licence": "BSD3"
1617
}

posterior_database/models/info/GLMM_Poisson_model.info.json

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -12,5 +12,6 @@
1212
"model_code": "models/stan/GLMM_Poisson_model.stan",
1313
"stan_version": ">=2.26.0"
1414
}
15-
}
15+
},
16+
"licence": "BSD3"
1617
}

posterior_database/models/info/GLM_Binomial_model.info.json

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -12,5 +12,6 @@
1212
"model_code": "models/stan/GLM_Binomial_model.stan",
1313
"stan_version": ">=2.26.0"
1414
}
15-
}
15+
},
16+
"licence": "BSD3"
1617
}

0 commit comments

Comments
 (0)