diff --git a/CHANGELOG.md b/CHANGELOG.md index 7e0ac718..a9eb617c 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -1,5 +1,13 @@ # TOML Changelog +## 0.3.0 / 2014-11-10 + +* Add scientific notation for floats. +* Allow optional + prefix on integers. +* Switch to RFC 3339 for datetimes (allowing offsets and fractional seconds). +* Add multiline and literal strings. +* Clarify what characters valid keys can contain. + ## 0.2.0 / 2013-09-24 * Use "table" instead of "key group" terminology. diff --git a/README.md b/README.md index e4deb7c0..f049177c 100644 --- a/README.md +++ b/README.md @@ -6,7 +6,7 @@ Tom's Obvious, Minimal Language. By Tom Preston-Werner. Latest tagged version: -[v0.2.0](https://github.com/mojombo/toml/blob/master/versions/toml-v0.2.0.md). +[v0.3.0](https://github.com/mojombo/toml/blob/master/versions/toml-v0.3.0.md). Be warned, this spec is still changing a lot. Until it's marked as 1.0, you should assume that it is unstable and act accordingly. diff --git a/versions/toml-v0.3.0.md b/versions/toml-v0.3.0.md new file mode 100644 index 00000000..6b1dede8 --- /dev/null +++ b/versions/toml-v0.3.0.md @@ -0,0 +1,601 @@ +TOML v0.3.0 +=========== + +Tom's Obvious, Minimal Language. + +By Tom Preston-Werner. + +Be warned, this spec is still changing a lot. Until it's marked as 1.0, you +should assume that it is unstable and act accordingly. + +Objectives +---------- + +TOML aims to be a minimal configuration file format that's easy to read due to +obvious semantics. TOML is designed to map unambiguously to a hash table. TOML +should be easy to parse into data structures in a wide variety of languages. + +Example +------- + +```toml +# This is a TOML document. Boom. + +title = "TOML Example" + +[owner] +name = "Lance Uppercut" +dob = 1979-05-27T07:32:00-0800 # First class dates? Why not? + +[database] +server = "192.168.1.1" +ports = [ 8001, 8001, 8002 ] +connection_max = 5000 +enabled = true + +[servers] + + # You can indent as you please. Tabs or spaces. TOML don't care. + [servers.alpha] + ip = "10.0.0.1" + dc = "eqdc10" + + [servers.beta] + ip = "10.0.0.2" + dc = "eqdc10" + +[clients] +data = [ ["gamma", "delta"], [1, 2] ] + +# Line breaks are OK when inside arrays +hosts = [ + "alpha", + "omega" +] +``` + +Spec +---- + +* TOML is case sensitive. +* Whitespace means tab (0x09) or space (0x20). + +Comment +------- + +Speak your mind with the hash symbol. They go from the symbol to the end of the +line. + +```toml +# I am a comment. Hear me roar. Roar. +key = "value" # Yeah, you can do this. +``` + +String +------ + +There are four ways to express strings: basic, multi-line basic, literal, and +multi-line literal. All strings must contain only valid UTF-8 characters. + +**Basic strings** are surrounded by quotation marks. Any Unicode character may +be used except those that must be escaped: quotation mark, backslash, and the +control characters (U+0000 to U+001F). + +```toml +"I'm a string. \"You can quote me\". Name\tJos\u00E9\nLocation\tSF." +``` + +For convenience, some popular characters have a compact escape sequence. + +``` +\b - backspace (U+0008) +\t - tab (U+0009) +\n - linefeed (U+000A) +\f - form feed (U+000C) +\r - carriage return (U+000D) +\" - quote (U+0022) +\/ - slash (U+002F) +\\ - backslash (U+005C) +\uXXXX - unicode (U+XXXX) +\UXXXXXXXX - unicode (U+XXXXXXXX) +``` + +Any Unicode character may be escaped with the `\uXXXX` or `\UXXXXXXXX` forms. +Note that the escape codes must be valid Unicode code points. + +Other special characters are reserved and, if used, TOML should produce an +error. + +ProTip™: You may notice that the above string specification is the same as +JSON's string definition, except that TOML requires UTF-8 encoding. This is on +purpose. + +Sometimes you need to express passages of text (e.g. translation files) or would +like to break up a very long string into multiple lines. TOML makes this easy. +**Multi-line basic strings** are surrounded by three quotation marks on each +side and allow newlines. If the first character after the opening delimiter is a +newline (`0x0A`), then it is trimmed. All other whitespace remains intact. + +```toml +# The following strings are byte-for-byte equivalent: +key1 = "One\nTwo" +key2 = """One\nTwo""" +key3 = """ +One +Two""" +``` + +For writing long strings without introducing extraneous whitespace, end a line +with a `\`. The `\` will be trimmed along with all whitespace (including +newlines) up to the next non-whitespace character or closing delimiter. If the +first two characters after the opening delimiter are a backslash and a newline +(`0x5C0A`), then they will both be trimmed along with all whitespace (including +newlines) up to the next non-whitespace character or closing delimiter. All of +the escape sequences that are valid for basic strings are also valid for +multi-line basic strings. + +```toml +# The following strings are byte-for-byte equivalent: +key1 = "The quick brown fox jumps over the lazy dog." + +key2 = """ +The quick brown \ + + + fox jumps over \ + the lazy dog.""" + +key3 = """\ + The quick brown \ + fox jumps over \ + the lazy dog.\ + """ +``` + +Any Unicode character may be used except those that must be escaped: backslash +and the control characters (U+0000 to U+001F). Quotation marks need not be +escaped unless their presence would create a premature closing delimiter. + +If you're a frequent specifier of Windows paths or regular expressions, then +having to escape backslashes quickly becomes tedious and error prone. To help, +TOML supports literal strings where there is no escaping allowed at all. +**Literal strings** are surrounded by single quotes. Like basic strings, they +must appear on a single line: + +```toml +# What you see is what you get. +winpath = 'C:\Users\nodejs\templates' +winpath2 = '\\ServerX\admin$\system32\' +quoted = 'Tom "Dubs" Preston-Werner' +regex = '<\i\c*\s*>' +``` + +Since there is no escaping, there is no way to write a single quote inside a +literal string enclosed by single quotes. Luckily, TOML supports a multi-line +version of literal strings that solves this problem. **Multi-line literal +strings** are surrounded by three single quotes on each side and allow newlines. +Like literal strings, there is no escaping whatsoever. If the first character +after the opening delimiter is a newline (`0x0A`), then it is trimmed. All other +content between the delimiters is interpreted as-is without modification. + +```toml +regex2 = '''I [dw]on't need \d{2} apples''' +lines = ''' +The first newline is +trimmed in raw strings. + All other whitespace + is preserved. +''' +``` + +For binary data it is recommended that you use Base64 or another suitable ASCII +or UTF-8 encoding. The handling of that encoding will be application specific. + +Integer +------- + +Integers are whole numbers. Positive numbers may be prefixed with a plus sign. +Negative numbers are prefixed with a minus sign. + +```toml ++99 +42 +0 +-17 +``` + +Leading zeros are not allowed. Hex, octal, and binary forms are not allowed. +Values such as "infinity" and "not a number" that cannot be expressed as a +series of digits are not allowed. + +64 bit (signed long) range expected (−9,223,372,036,854,775,808 to +9,223,372,036,854,775,807). + +Float +----- + +A float consists of an integer part (which may be prefixed with a plus or minus +sign) followed by a fractional part and/or an exponent part. If both a +fractional part and exponent part are present, the fractional part must precede +the exponent part. + +```toml +# fractional ++1.0 +3.1415 +-0.01 + +# exponent +5e+22 +1e6 +-2E-2 + +# both +6.626e-34 +``` + +A fractional part is a decimal point followed by one or more digits. + +An exponent part is an E (upper or lower case) followed by an integer part +(which may be prefixed with a plus or minus sign). + +64-bit (double) precision expected. + +Boolean +------- + +Booleans are just the tokens you're used to. Always lowercase. + +```toml +true +false +``` + +Datetime +-------- + +Datetimes are [RFC 3339](http://tools.ietf.org/html/rfc3339) dates. + +```toml +1979-05-27T07:32:00Z +1979-05-27T00:32:00-0700 +1979-05-27T00:32:00.999999-0700 +``` + +Array +----- + +Arrays are square brackets with other primitives inside. Whitespace is ignored. +Elements are separated by commas. Data types may not be mixed. + +```toml +[ 1, 2, 3 ] +[ "red", "yellow", "green" ] +[ [ 1, 2 ], [3, 4, 5] ] +[ [ 1, 2 ], ["a", "b", "c"] ] # this is ok +[ 1, 2.0 ] # note: this is NOT ok +``` + +Arrays can also be multiline. So in addition to ignoring whitespace, arrays also +ignore newlines between the brackets. Terminating commas are ok before the +closing bracket. + +```toml +key = [ + 1, 2, 3 +] + +key = [ + 1, + 2, # this is ok +] +``` + +Table +----- + +Tables (also known as hash tables or dictionaries) are collections of key/value +pairs. They appear in square brackets on a line by themselves. You can tell them +apart from arrays because arrays are only ever values. + +```toml +[table] +``` + +Under that, and until the next table or EOF are the key/values of that table. +Keys are on the left of the equals sign and values are on the right. Keys start +with the first character that isn't whitespace or `[` and end with the last +non-whitespace character before the equals sign. Keys cannot contain a `#` +character. Key/value pairs within tables are not guaranteed to be in any +specific order. + +```toml +[table] +key = "value" +``` + +You can indent keys and their values as much as you like. Tabs or spaces. Knock +yourself out. Why, you ask? Because you can have nested tables. Snap. + +Nested tables are denoted by table names with dots in them. Name your tables +whatever crap you please, just don't use `#`, `.`, `[` or `]`. + +```toml +[dog.tater] +type = "pug" +``` + +In JSON land, that would give you the following structure: + +```json +{ "dog": { "tater": { "type": "pug" } } } +``` + +You don't need to specify all the super-tables if you don't want to. TOML knows +how to do it for you. + +```toml +# [x] you +# [x.y] don't +# [x.y.z] need these +[x.y.z.w] # for this to work +``` + +Empty tables are allowed and simply have no key/value pairs within them. + +As long as a super-table hasn't been directly defined and hasn't defined a +specific key, you may still write to it. + +```toml +[a.b] +c = 1 + +[a] +d = 2 +``` + +You cannot define any key or table more than once. Doing so is invalid. + +```toml +# DO NOT DO THIS + +[a] +b = 1 + +[a] +c = 2 +``` + +```toml +# DO NOT DO THIS EITHER + +[a] +b = 1 + +[a.b] +c = 2 +``` + +All table names and keys must be non-empty. + +```toml +# NOT VALID TOML +[] +[a.] +[a..b] +[.b] +[.] + = "no key name" # not allowed +``` + +Array of Tables +--------------- + +The last type that has not yet been expressed is an array of tables. These can +be expressed by using a table name in double brackets. Each table with the same +double bracketed name will be an element in the array. The tables are inserted +in the order encountered. A double bracketed table without any key/value pairs +will be considered an empty table. + +```toml +[[products]] +name = "Hammer" +sku = 738594937 + +[[products]] + +[[products]] +name = "Nail" +sku = 284758393 +color = "gray" +``` + +In JSON land, that would give you the following structure. + +```json +{ + "products": [ + { "name": "Hammer", "sku": 738594937 }, + { }, + { "name": "Nail", "sku": 284758393, "color": "gray" } + ] +} +``` + +You can create nested arrays of tables as well. Just use the same double bracket +syntax on sub-tables. Each double-bracketed sub-table will belong to the most +recently defined table element above it. + +```toml +[[fruit]] + name = "apple" + + [fruit.physical] + color = "red" + shape = "round" + + [[fruit.variety]] + name = "red delicious" + + [[fruit.variety]] + name = "granny smith" + +[[fruit]] + name = "banana" + + [[fruit.variety]] + name = "plantain" +``` + +The above TOML maps to the following JSON. + +```json +{ + "fruit": [ + { + "name": "apple", + "physical": { + "color": "red", + "shape": "round" + }, + "variety": [ + { "name": "red delicious" }, + { "name": "granny smith" } + ] + }, + { + "name": "banana", + "variety": [ + { "name": "plantain" } + ] + } + ] +} +``` + +Attempting to define a normal table with the same name as an already established +array must produce an error at parse time. + +```toml +# INVALID TOML DOC +[[fruit]] + name = "apple" + + [[fruit.variety]] + name = "red delicious" + + # This table conflicts with the previous table + [fruit.variety] + name = "granny smith" +``` + +Seriously? +---------- + +Yep. + +But why? +-------- + +Because we need a decent human-readable format that unambiguously maps to a hash +table and the YAML spec is like 80 pages long and gives me rage. No, JSON +doesn't count. You know why. + +Oh god, you're right +-------------------- + +Yuuuup. Wanna help? Send a pull request. Or write a parser. BE BRAVE. + +Implementations +--------------- + +If you have an implementation, send a pull request adding to this list. Please +note the commit SHA1 or version tag that your parser supports in your Readme. + +- C#/.NET - https://github.com/LBreedlove/Toml.net +- C#/.NET - https://github.com/rossipedia/toml-net +- C#/.NET - https://github.com/RichardVasquez/TomlDotNet +- C#/.NET - https://github.com/azyobuzin/HyperTomlProcessor +- C (@ajwans) - https://github.com/ajwans/libtoml +- C (@mzgoddard) - https://github.com/mzgoddard/tomlc +- C++ (@evilncrazy) - https://github.com/evilncrazy/ctoml +- C++ (@skystrife) - https://github.com/skystrife/cpptoml +- Clojure (@lantiga) - https://github.com/lantiga/clj-toml +- Clojure (@manicolosi) - https://github.com/manicolosi/clojoml +- CoffeeScript (@biilmann) - https://github.com/biilmann/coffee-toml +- Common Lisp (@pnathan) - https://github.com/pnathan/pp-toml +- Erlang - https://github.com/kalta/etoml.git +- Erlang - https://github.com/kaos/tomle +- Emacs Lisp (@gongoZ) - https://github.com/gongo/emacs-toml +- Go (@thompelletier) - https://github.com/pelletier/go-toml +- Go (@laurent22) - https://github.com/laurent22/toml-go +- Go w/ Reflection (@BurntSushi) - https://github.com/BurntSushi/toml +- Go (@achun) - https://github.com/achun/tom-toml +- Go (@naoina) - https://github.com/naoina/toml +- Haskell (@seliopou) - https://github.com/seliopou/toml +- Haxe (@raincole) - https://github.com/raincole/haxetoml +- Java (@agrison) - https://github.com/agrison/jtoml +- Java (@johnlcox) - https://github.com/johnlcox/toml4j +- Java (@mwanji) - https://github.com/mwanji/toml4j +- Java - https://github.com/asafh/jtoml +- Java w/ ANTLR (@MatthiasSchuetz) - https://github.com/mschuetz/toml +- Julia (@pygy) - https://github.com/pygy/TOML.jl +- Literate CoffeeScript (@JonathanAbrams) - https://github.com/JonAbrams/tomljs +- node.js/browser - https://github.com/ricardobeat/toml.js (npm install tomljs) +- node.js - https://github.com/BinaryMuse/toml-node +- node.js/browser (@redhotvengeance) - https://github.com/redhotvengeance/topl (topl npm package) +- node.js/browser (@alexanderbeletsky) - https://github.com/alexanderbeletsky/toml-js (npm browser amd) +- Objective C (@mneorr) - https://github.com/mneorr/toml-objc.git +- Objective-C (@SteveStreza) - https://github.com/amazingsyco/TOML +- OCaml (@mackwic) https://github.com/mackwic/to.ml +- Perl (@alexkalderimis) - https://github.com/alexkalderimis/config-toml.pl +- Perl - https://github.com/dlc/toml +- PHP (@leonelquinteros) - https://github.com/leonelquinteros/php-toml.git +- PHP (@jimbomoss) - https://github.com/jamesmoss/toml +- PHP (@coop182) - https://github.com/coop182/toml-php +- PHP (@checkdomain) - https://github.com/checkdomain/toml +- PHP (@zidizei) - https://github.com/zidizei/toml-php +- PHP (@yosymfony) - https://github.com/yosymfony/toml +- Python (@f03lipe) - https://github.com/f03lipe/toml-python +- Python (@uiri) - https://github.com/uiri/toml +- Python - https://github.com/bryant/pytoml +- Python (@elssar) - https://github.com/elssar/tomlgun +- Python (@marksteve) - https://github.com/marksteve/toml-ply +- Python (@hit9) - https://github.com/hit9/toml.py +- Racket (@greghendershott) - https://github.com/greghendershott/toml +- Ruby (@jm) - https://github.com/jm/toml (toml gem) +- Ruby (@eMancu) - https://github.com/eMancu/toml-rb (toml-rb gem) +- Ruby (@charliesome) - https://github.com/charliesome/toml2 (toml2 gem) +- Ruby (@sandeepravi) - https://github.com/sandeepravi/tomlp (tomlp gem) +- Rust (@mneumann) - https://github.com/mneumann/rust-toml +- Rust (@alexcrichton) - https://github.com/alexcrichton/toml-rs +- Scala - https://github.com/axelarge/tomelette + +Validators +---------- + +- Go (@BurntSushi) - https://github.com/BurntSushi/toml/tree/master/cmd/tomlv + +Language agnostic test suite for TOML decoders and encoders +----------------------------------------------------------- + +- toml-test (@BurntSushi) - https://github.com/BurntSushi/toml-test + +Editor support +-------------- + +- Emacs (@dryman) - https://github.com/dryman/toml-mode.el +- Sublime Text 2 & 3 (@lmno) - https://github.com/lmno/TOML +- TextMate (@infininight) - https://github.com/textmate/toml.tmbundle +- Vim (@cespare) - https://github.com/cespare/vim-toml +- Notepad++ (@fireforge) - https://github.com/fireforge/toml-notepadplusplus + +Encoder +-------------- +- Go w/ Reflection (@BurntSushi) - https://github.com/BurntSushi/toml +- PHP (@ayushchd) - https://github.com/ayushchd/php-toml-encoder + +Converters +---------- + +- remarshal (@dbohdan) - https://github.com/dbohdan/remarshal