– eXtended S-eXpressions

import "git.fractalqb.de/fractalqb/xsx"

Package XSX provides tools for parsing something I call eXtended S-eXpressions. Extended means the following things compared to SEXP S-expressions:

Nested structures are delimited by balanced braces '()', '[]' or '{}’ – not only by '()'.
XSX provides a notation for "Meta Values", i.e. XSXs that provide some sort of meta information that is not part of the "normal" data.

On the other hand some properties from SEXP were dropped, e.g. typing of the so called "octet strings". Things like that are completely left to the application.

Somewhat more formal description

Frist of all, XSX is not about datatypes, in this it is comparable to e.g. XML (No! don't leave… its much simpler). Instead its building block is the atom, i.e. nothing else than a sequence of characters, aka a 'string'. Atoms come as quoted atoms and as unquoted atoms. One needs to quote an atom when the atom's string contains characters that have a special meaning in XSX: ()[]{}\ and white-space.

Regexp style definition of Atom

atom     := nq-atom | q-atom
nq-atom  := ([^()[]{}]|\s)+
q-atom   := "([^"\]|(\")(\\))+"
XSX      := atom

I.e. x is an atom and foo, bar and baz are atoms. An atom that contains a '"' or '' would be "quote: \" and backslash: \\ in a quoted atom". Also "(" is an atom but ( is not an atom. We need '(' for other things!

Sequences now BNF Style

Each atom is an XSX and from XSX'es one can build sequences:

XSX  ::= atom | seq1 | seq2 | seq3
seq1 ::= '(' ws* ')' | '(' ws* xsxs ws* ')'
seq2 ::= '[' ws* ']' | '[' ws* xsxs ws* ']'
seq3 ::= '{' ws* '}' | '{' ws* xsxs ws* '}'
xsxs ::= XSX | XSX ws* xsxs
ws   ::= “Unicode's White Space”

Out-Of-Band Information with Meta XSXs

You can prefix each XSX with a backslash to make that expression a meta-expression. A meta-expression is not considered to be a XSX, i.e. you cannot create meta-meta-expressions or meta-meta-meta-expressions… hmm… and not event meta-meta-meta-meta-expressions! I think it became clear?

E.g. \4711 is a meta-atom and \{foo 1 bar false baz 3.1415} is a meta-sequence. What meta means is completely up to the application. Imagine e.g. (div hiho) and (div \{class green} hiho) to be a translation from <div>hiho</div> and <div class="green">hiho</div>.

Rationale

None! … despite the fact that I found it to be fun – and useful in some situations.

Because XSX syntax so simple it is easy to use the PullParser as a tokenizer to build customized parsers for proprietary data files. E.g. see the table sub-package. On the other hand the low level parser and scanner API is inspired by the expat streaming parser that allows one to push some data into the paring machinery and it will fire appropriate callbacks when tokes are detected.

So, if you are looking for something that's even simpler than JSON or YAML you might give it a try… Happy coding!

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
doc		doc
gem		gem
table		table
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
VERSION		VERSION
autocover.sh		autocover.sh
doc.go		doc.go
parser.go		parser.go
parser_test.go		parser_test.go
pcompact.go		pcompact.go
pcompact_test.go		pcompact_test.go
pnewline.go		pnewline.go
ppretty.go		ppretty.go
ppretty_test.go		ppretty_test.go
printer.go		printer.go
pull.go		pull.go
pull_test.go		pull_test.go
scanner.go		scanner.go
scanner_test.go		scanner_test.go
scanspeed_test.go		scanspeed_test.go
scanws_test.go		scanws_test.go
version.go		version.go
write.go		write.go
write_test.go		write_test.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

– eXtended S-eXpressions

Somewhat more formal description

Regexp style definition of Atom

Sequences now BNF Style

Out-Of-Band Information with Meta XSXs

Rationale

About

Releases

Packages

Contributors 2

Languages

License

fractalqb/xsx

Folders and files

Latest commit

History

Repository files navigation

– eXtended S-eXpressions

Somewhat more formal description

Regexp style definition of Atom

Sequences now BNF Style

Out-Of-Band Information with Meta XSXs

Rationale

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages