
Performance enhancements #22

Open
anselor opened this issue Aug 2, 2018 · 1 comment
Labels
enhancement (New feature or request), question (Further information is requested)

Comments

@anselor
Contributor

anselor commented Aug 2, 2018

When generating large tables the performance drops significantly.
Much of this is likely because tableformatter makes multiple passes over every field of every row: once to generate the display text from the data, again to measure that text, and again to format/wrap it to fit the column width and alignment.

Some of the following could be used to improve performance:

  • Limit the number of rows analyzed to determine column widths (configurable limit)
  • Once all columns have reached the maximum allowable width, stop measuring rows
  • If all columns have pre-defined fixed widths, skip the measuring step
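The first two ideas above can be combined in a single measuring pass. As a rough sketch (hypothetical function, not tableformatter's actual API), a configurable row limit and an early-exit check once every column is capped might look like:

```python
def measure_widths(rows, num_cols, max_width, max_rows=1000):
    """Hypothetical sketch: measure column widths over at most `max_rows`
    rows, stopping early once every column hits the maximum allowable width."""
    widths = [0] * num_cols
    for i, row in enumerate(rows):
        if i >= max_rows:
            break  # configurable analysis limit reached
        for col, field in enumerate(row):
            widths[col] = min(max(widths[col], len(str(field))), max_width)
        if all(w >= max_width for w in widths):
            break  # every column is already capped; further rows change nothing
    return widths
```

With pre-defined fixed widths for all columns, this step could be skipped entirely.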

Also, tableformatter currently generates a full in-memory model of the entire table before rendering to a string. This can use a lot of memory. We can reduce the memory usage by only building the in-memory model up until the maximum analysis depth and then process/render the remaining rows on-demand.
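A minimal sketch of that idea, assuming a hypothetical `render_table` helper (not tableformatter's real API): buffer only the rows needed for width analysis, then stream everything through a generator rather than materializing the full table.

```python
import itertools

def render_table(rows, analysis_depth=1000):
    """Hypothetical sketch: buffer only the first `analysis_depth` rows to
    measure column widths, then yield formatted lines one row at a time
    (buffered rows first, remaining rows on demand)."""
    it = iter(rows)
    head = list(itertools.islice(it, analysis_depth))  # bounded in-memory model
    widths = [max(len(str(f)) for f in col) for col in zip(*head)]
    for row in itertools.chain(head, it):
        yield ' | '.join(str(f).ljust(w) for f, w in zip(row, widths))
```

Peak memory then scales with `analysis_depth` rather than the total row count; rows past the analysis depth wider than the measured widths would simply wrap or truncate per the existing column rules.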

@tleonhardt
Member

Regardless of which approach we take, we should probably add a section at the bottom of the README called something like "Performance considerations" where we explain what types of things are likely to be slow and why.

You say we are seeking through all of the fields and all of the rows multiple times. Is that more than twice? I can see why we may need to parse through all of it twice, but I can't see why it would ever need to be more than that.

Perhaps we could add an API function that accepts the data along with an integer specifying the maximum number of lines of text and/or rows of data to return at a time, and returns a generator yielding the next N rows?
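The chunking part of that suggestion could be sketched roughly like this (`paginate` is a hypothetical name, not an existing tableformatter function):

```python
def paginate(rows, n):
    """Hypothetical sketch: yield the input data N rows at a time, so the
    caller can format and display one chunk before pulling the next."""
    batch = []
    for row in rows:
        batch.append(row)
        if len(batch) == n:
            yield batch
            batch = []
    if batch:
        yield batch  # final partial chunk
```

Each yielded chunk could then be handed to the formatter independently, keeping at most N rows in memory at once.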

Alternatively, I like the concept of limiting the number of rows analyzed for performance reasons.

@tleonhardt added the enhancement (New feature or request) and question (Further information is requested) labels Aug 10, 2018
@anselor added this to the 0.2.0 milestone Aug 12, 2019