stats: make --infer-boolean configurable #2147

Open
jqnatividad opened this issue Sep 17, 2024 · 0 comments
Labels
enhancement: New feature or request. Once marked with this label, it's in the backlog.

Comments

@jqnatividad
Collaborator

jqnatividad commented Sep 17, 2024

Currently, booleans are inferred when --infer-boolean is enabled. The heuristic for inferring booleans is as follows:

  • When a column's cardinality is 2, and
  • the first characters of its 2 values are 0/1, t/f, or y/n (case-insensitive)
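
For reference, the heuristic described above can be sketched roughly as follows. This is an illustration in Python only, not qsv's actual implementation, and the names `BOOLEAN_FIRST_CHAR_PAIRS` and `looks_boolean` are hypothetical:

```python
# Illustrative sketch of the two conditions listed above; not qsv's real code.
# Assumption: "cardinality" means the number of distinct raw values in the column.
BOOLEAN_FIRST_CHAR_PAIRS = [("0", "1"), ("t", "f"), ("y", "n")]


def looks_boolean(values):
    """Return True if the column's values satisfy the first-character heuristic."""
    distinct = set(values)
    if len(distinct) != 2:
        return False
    # Compare only the first character of each value, ignoring case.
    first_chars = {v[:1].lower() for v in distinct}
    return any(first_chars == set(pair) for pair in BOOLEAN_FIRST_CHAR_PAIRS)
```

Under this sketch, `looks_boolean(["Yes", "No", "Yes"])` is true, but so is `looks_boolean(["Tom", "Frank"])`, since only the first characters are compared.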

Make the first-character heuristic configurable, as t/f (True/False) and y/n (Yes/No) are English-centric.

Further, allow the user to specify more than just the first character, to eliminate false positives (e.g., a column with a cardinality of 2 and a domain of "Tom" and "Frank" will be incorrectly inferred as boolean).
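
One possible shape for this, sketched in Python purely for discussion: the user supplies a list of (truthy, falsy) pattern pairs and chooses between prefix and whole-value matching. The function names, the `patterns` parameter, and the `exact` switch are all hypothetical and do not correspond to any existing qsv option.

```python
# Hypothetical sketch of a configurable boolean heuristic; not an existing qsv API.
def _matches(value, pattern, exact):
    """Case-insensitive match: whole value if exact, otherwise prefix."""
    value, pattern = value.lower(), pattern.lower()
    return value == pattern if exact else value.startswith(pattern)


def looks_boolean_configurable(values, patterns, exact=False):
    """patterns is a list of (truthy, falsy) strings, e.g. [("true", "false")]."""
    distinct = set(values)
    if len(distinct) != 2:
        return False
    a, b = distinct
    for truthy, falsy in patterns:
        # Accept either assignment of the two values to the pair.
        if _matches(a, truthy, exact) and _matches(b, falsy, exact):
            return True
        if _matches(b, truthy, exact) and _matches(a, falsy, exact):
            return True
    return False
```

With `patterns=[("1", "0"), ("t", "f"), ("y", "n")]` and prefix matching, this reproduces the current behavior, including the "Tom"/"Frank" false positive. With `patterns=[("true", "false")]` and `exact=True`, "Tom"/"Frank" is rejected, and a pair such as `("oui", "non")` would cover a non-English column.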

@jqnatividad added the enhancement label Sep 17, 2024