Make the text-to-speech chain flexible #46

rhdunn · 2013-04-11T17:09:18Z

It should be possible to specify the specific call chain of the different text analysis parts:

text reader -- splits the document events into words, numbers and punctuation;
context analysis -- identifies the type of punctuation (comma, etc.) and number (ordinal, year, etc.)
word stream -- converts the numbers and audible punctuation to words
part of speech tagger -- tags words with their associated part of speech
part of speech disambiguator -- resolves ambiguous part of speech tag assignments

It should be possible to build a pipeline of these and others in an arbitary order. This means:

Creating an abstract class:

struct text_event_reader
{
    virtual const text_event &event() const = 0;
    virtual bool read() = 0;
};

Having all the analysis parts above implement the abstract class.
Making the classes take a std::shared_ptr<text_event_reader> instead of a std::shared_ptr<document_reader> (except for text_reader which starts the process).
Optionally hiding them behind a createXYZ method.

This will be a performance hit (need to measure to see how much), but it adds flexibility -- especially for using different grapheme to phoneme rules.

Want to back this issue? Post a bounty on it! We accept bounties via Bountysource.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make the text-to-speech chain flexible #46

Make the text-to-speech chain flexible #46

rhdunn commented Apr 11, 2013 •

edited

Loading

Make the text-to-speech chain flexible #46

Make the text-to-speech chain flexible #46

Comments

rhdunn commented Apr 11, 2013 • edited Loading

rhdunn commented Apr 11, 2013 •

edited

Loading