Add analysis of schema structure: decomposition of field keys and subtypes #12

ivbeg · 2022-08-06T08:00:34Z

Flat table datasets (CSV files), database tables, and sometimes objects with nested objects often include elements that could be grouped.

For example, the CSV file Zaara_D.csv includes the following fields: title, text, date, place, placeURL, placeLocation, placeType, reviewScore, avgScore

We can see that the prefix 'place' is a subtype identifier. It could be decomposed as
place:

The postfix 'Score' identifies the value type, either integer or float.

Most data tables use a case change or the "_" symbol as a divider. Very rarely, the '-' symbol is also used.
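
A minimal sketch of how field names could be split on these dividers, assuming plain ASCII names. The helper name `split_field_name` is hypothetical and is not part of any existing code in this project; it only illustrates the idea.

```python
import re

# Hypothetical helper, not an existing project function.
# A token is: an all-caps run (e.g. "URL"), a word with an optional
# leading capital (e.g. "place", "Location"), or a run of digits.
_TOKEN_RE = re.compile(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+|\d+")


def split_field_name(name):
    """Split a field name into lowercase tokens.

    Dividers handled: "_" and "-" symbols, whitespace, and case changes.
    """
    tokens = []
    # First cut on explicit divider symbols, then on case changes.
    for part in re.split(r"[_\-\s]+", name):
        tokens.extend(m.group(0).lower() for m in _TOKEN_RE.finditer(part))
    return tokens


if __name__ == "__main__":
    for field in ["placeURL", "placeLocation", "review_score", "place-type"]:
        print(field, split_field_name(field))
```

With this approach `placeURL` becomes `['place', 'url']` and `review_score` becomes `['review', 'score']`, so the same tokens come out regardless of which divider style a table uses.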

Detection of field groups and decomposition of field names could help with:

Add group detection to the final report as a field_group property.
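
A rough sketch of prefix-based group detection, using the field list from the example above. The function names, the `min_size` threshold, and the shape of the returned report are all assumptions for illustration; the real report format may differ. Only shared prefixes are handled here, not postfixes such as Score.

```python
import re
from collections import defaultdict


def first_token(name):
    """Return the leading token of a field name, lowercased.

    The token ends at a "_" / "-" divider or at a case change.
    """
    m = re.match(r"[A-Z]?[a-z]+|[A-Z]+(?![a-z])|\d+", name)
    return m.group(0).lower() if m else name.lower()


def detect_field_groups(fields, min_size=2):
    """Assign a field_group to every field whose prefix is shared.

    Hypothetical report shape: {field_name: {"field_group": str | None}}.
    A prefix counts as a group only if at least min_size fields share it.
    """
    by_prefix = defaultdict(list)
    for name in fields:
        by_prefix[first_token(name)].append(name)

    report = {}
    for name in fields:
        prefix = first_token(name)
        group = prefix if len(by_prefix[prefix]) >= min_size else None
        report[name] = {"field_group": group}
    return report


if __name__ == "__main__":
    fields = ["title", "text", "date", "place", "placeURL",
              "placeLocation", "placeType", "reviewScore", "avgScore"]
    for name, props in detect_field_groups(fields).items():
        print(name, props)
```

For the Zaara_D.csv fields this puts place, placeURL, placeLocation and placeType into the group `place` and leaves the other fields ungrouped.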

ivbeg added the enhancement New feature or request label Aug 6, 2022

ivbeg self-assigned this Aug 6, 2022

ivbeg added this to APICrafter stack (DataCrafter, Metacrafter and e.t.c) Aug 6, 2022

ivbeg moved this to Current in APICrafter stack (DataCrafter, Metacrafter and e.t.c) Aug 6, 2022