
AkomaNtoso go-libs for pardocs + dundocs data extraction



Sinar/go-akomantoso


go-akomantoso

AkomaNtoso Go libraries for extracting data from pardocs and dundocs. For now the two usages are mixed; the input is the extracted multiline content for each question/answer.
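As a rough illustration of that input shape, the sketch below turns one block of multiline question/answer content into an AKN-style element using only the standard library. The type `Turn`, the helpers `newTurn` and `render`, and the `#member-a` reference are hypothetical names chosen for this example; they are not this library's API.

```go
package main

import (
	"encoding/xml"
	"fmt"
	"strings"
)

// Turn models a <question> or <answer> element (hypothetical type,
// not part of go-akomantoso). The element name is taken from XMLName.
type Turn struct {
	XMLName xml.Name
	By      string   `xml:"by,attr"`
	Paras   []string `xml:"p"`
}

// newTurn splits multiline content into one <p> per non-empty line.
func newTurn(element, by, content string) Turn {
	t := Turn{XMLName: xml.Name{Local: element}, By: by}
	for _, line := range strings.Split(content, "\n") {
		if line = strings.TrimSpace(line); line != "" {
			t.Paras = append(t.Paras, line)
		}
	}
	return t
}

// render serialises the element with two-space indentation.
func render(t Turn) (string, error) {
	out, err := xml.MarshalIndent(t, "", "  ")
	return string(out), err
}

func main() {
	q := newTurn("question", "#member-a", "First line of the question.\n\nSecond line.")
	s, err := render(q)
	if err != nil {
		panic(err)
	}
	fmt.Println(s)
}
```

Blank lines are dropped and each remaining line becomes its own paragraph; whether that is the right paragraph rule for real pardocs/dundocs output is an assumption.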

NOTE: Only a subset of AKN is used, as described in

Output should pass validation against the AKN 3.0 schema in schema.xsd. Go validators looked at so far all need C, so they are skipped:

--> https://github.com/lestrrat-go/libxml2 (needs C; skip!)
--> https://github.com/terminalstatic/go-xsd-validate (needs C; skip!)
--> https://github.com/krolaw/xsd (needs C; skip!)

Native validation would need something like --> https://xmlschema.readthedocs.io/en/latest/intro.html
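Until a native XSD validator is available, a pure-Go first pass can at least check well-formedness and the root element. The sketch below does only that with `encoding/xml`; it is not schema validation, and the function name `checkWellFormed` is made up for this example.

```go
package main

import (
	"encoding/xml"
	"errors"
	"fmt"
	"io"
	"strings"
)

// checkWellFormed walks every token of doc and returns the local name of
// the root element. It reports a syntax error for malformed XML, but it
// does NOT validate against schema.xsd.
func checkWellFormed(doc string) (string, error) {
	dec := xml.NewDecoder(strings.NewReader(doc))
	root := ""
	for {
		tok, err := dec.Token()
		if err == io.EOF {
			break
		}
		if err != nil {
			return "", err
		}
		if se, ok := tok.(xml.StartElement); ok && root == "" {
			root = se.Name.Local
		}
	}
	if root == "" {
		return "", errors.New("no root element found")
	}
	return root, nil
}

func main() {
	root, err := checkWellFormed("<akomaNtoso><debate/></akomaNtoso>")
	if err != nil {
		panic(err)
	}
	fmt.Println("root element:", root)
}
```

A document that passes this check can still fail the AKN 3.0 schema, so the Java validator remains the real test.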

Java Validators

Sample given by SayIt does NOT comply with the latest AKN standard!

# Works --> https://github.com/amouat/xsd-validator
$ ./xsdv.sh ../go-akomantoso/schema.xsd ../go-akomantoso/testdata/the-tempest.an.xml

If the goal is just to extract into structs, this looks OK --> https://github.com/wagner-aos/go-xsd
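For a small subset, extraction into structs can also be sketched by hand with `encoding/xml`, without generating types from the XSD. The struct names, the flat section layout, and the sample fragment below are assumptions made for illustration; real files (for example those in testdata) may nest sections and carry namespaces and metadata.

```go
package main

import (
	"encoding/xml"
	"fmt"
)

// Speech, Section and Document are hypothetical types covering only a
// few elements of a debate document.
type Speech struct {
	By    string   `xml:"by,attr"`
	From  string   `xml:"from"`
	Paras []string `xml:"p"`
}

type Section struct {
	Heading  string   `xml:"heading"`
	Speeches []Speech `xml:"speech"`
}

type Document struct {
	XMLName  xml.Name  `xml:"akomaNtoso"`
	Sections []Section `xml:"debate>debateBody>debateSection"`
}

// sample is a made-up fragment, not taken from the repository.
const sample = `<akomaNtoso>
  <debate>
    <debateBody>
      <debateSection>
        <heading>Sample heading</heading>
        <speech by="#speaker-1">
          <from>Speaker One</from>
          <p>First paragraph.</p>
          <p>Second paragraph.</p>
        </speech>
      </debateSection>
    </debateBody>
  </debate>
</akomaNtoso>`

// parse decodes the subset above and ignores every other element.
func parse(data []byte) (Document, error) {
	var doc Document
	err := xml.Unmarshal(data, &doc)
	return doc, err
}

func main() {
	doc, err := parse([]byte(sample))
	if err != nil {
		panic(err)
	}
	for _, sec := range doc.Sections {
		fmt.Println(sec.Heading, len(sec.Speeches))
	}
}
```

Elements not named in the struct tags are silently skipped, which suits a subset-only reader but hides anything the structs do not model.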

Use https://github.com/shabbyrobe/cmdy to run this standalone on the split output from go-pardocs + go-dundocs.

How to convert data to the standard we use – Akoma Ntoso

Akoma Ntoso is a comprehensive XML schema for several Parliamentary document types such as bills, acts, and debates. Various bodies around the world are starting to use or interoperate with Akoma Ntoso to model their data. Whilst it was designed for Parliamentary document types, the schema is general enough that it can be used for many different types of debate.

SayIt can import a subset of Akoma Ntoso, and below we describe which aspects of it we currently cover. You can export an Akoma Ntoso representation of any section on SayIt by adding .an to the end of any section URI, for example Shakespeare’s The Tempest: https://shakespeare.sayit.mysociety.org/the-tempest.an.
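The `.an` export rule can be written as a one-line helper. This is a minimal sketch; the function name `exportURL` is hypothetical, and stripping a trailing slash before appending the suffix is an assumption.

```go
package main

import (
	"fmt"
	"strings"
)

// exportURL appends ".an" to a SayIt section URI, dropping a trailing
// slash first (an assumption) so the suffix attaches to the section name.
func exportURL(sectionURI string) string {
	return strings.TrimSuffix(sectionURI, "/") + ".an"
}

func main() {
	fmt.Println(exportURL("https://shakespeare.sayit.mysociety.org/the-tempest"))
}
```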

Intro

Standard (Full)

Schema Viewer

Examples

Test data

In testdata folder ...
