Go-catdoc, get text and metadata from .doc files.

semvis123/go-catdoc

Uses Wazero to run catdoc as WebAssembly in Go. The catdoc source is slightly modified to support reading metadata from .doc files.
The msdoc.hexpat file is a pattern file for ImHex that can parse the SummaryInformation OLE object inside a .doc file.

To compile the WebAssembly binary, go to ./catdoc/src/ and run make catdoc-wasm.
To run the tests, run go test ./...

Usage:

f, err := os.Open("test.doc")
text, err := gocatdoc.GetTextFromFile(f)