Make case ids shorter and easier to read #2973

maciej-zarzeczny · 2023-03-09T10:58:16Z

Currently cases are identified by default MongoDB IDs. We should make them shorter and easier to read for curators.

abhidg · 2023-03-15T11:53:42Z

One approach is to keep the ObjectIds as is in the DB, but use a frontend function to make it more readable. Any reduction in information could lead to more collisions, but I think this scheme will make it extremely unlikely:

Number in three parts, separated by hyphens, obtained from the timestamp embedded in ObjectId:

Number of days since outbreak (should be well-defined as we have OUTBREAK_DATE)
Number of seconds elapsed on the day in the timestamp, integer divided by 100, so this will go from 0 to 864
Last two bytes (0 - 65536) from incrementing number in the last block of ObjectId (A 3-byte incrementing counter, initialized to a random value)

Assuming an outbreak lasts upto 1000 days (3 years, which would be a pandemic, and thus unlikely to happen frequently), this would give a maximum number of digits as 11, while not touching the DB at all. In most cases, curators working on a single day’s cases would only need the second two bits as the number of days since outbreak would be the same.

This is assuming numerical IDs only, if we can do alphanumeric, we can shorten further by using hex or by using one of several naming systems such as https://pypi.org/project/human-id/ mapping UUIDs to a string of words; disadvantage is that alphanumeric systems usually lack monotonicity.

maciej-zarzeczny · 2023-03-15T12:47:00Z

@abhidg Those are all great ideas! I think it all depends on Curator's preferences. @aimeehan1 is there any solution that works for you better than the other?

maciej-zarzeczny added Data UI Bug is related to Data frontend functionality turnkey labels Mar 9, 2023

maciej-zarzeczny assigned stanislaw-zakrzewski and maciej-zarzeczny Mar 9, 2023

stanislaw-zakrzewski mentioned this issue Mar 27, 2023

[LIST-2973] shorter case ids globaldothealth/turnkey-curator-portal#4

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make case ids shorter and easier to read #2973

Make case ids shorter and easier to read #2973

maciej-zarzeczny commented Mar 9, 2023 •

edited

Loading

abhidg commented Mar 15, 2023

maciej-zarzeczny commented Mar 15, 2023

Make case ids shorter and easier to read #2973

Make case ids shorter and easier to read #2973

Comments

maciej-zarzeczny commented Mar 9, 2023 • edited Loading

abhidg commented Mar 15, 2023

maciej-zarzeczny commented Mar 15, 2023

maciej-zarzeczny commented Mar 9, 2023 •

edited

Loading