Provide statistics #29

Open
nichtich opened this issue May 6, 2021 · 5 comments
Labels
data issues related to decomposition data
Milestone

Comments

@nichtich
Member

nichtich commented May 6, 2021

Statistics can be calculated once and stored on disk (I'd use jq). Numbers such as:

  • Timestamp of last update
  • Number of analyzed DDC numbers
  • Number of incompletely analyzed DDC numbers
  • Number of distinct incompletely analyzed DDC numbers
  • Number of distinct elements and frequency of their use
  • ...
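
A minimal sketch of how the numbers listed above could be computed in JavaScript rather than jq. The record shape is an assumption for illustration only (a `uri` plus a `memberList` array in which a `null` entry marks a part that could not be analyzed), and `computeStats` is a hypothetical name, not part of the project:

```javascript
// Assumed record shape (hypothetical, not the project's confirmed schema):
//   { uri: "concept-uri", memberList: [ { uri: "element-uri" }, null ] }
// A null entry in memberList is taken to mean "this part was not analyzed".
function computeStats(records) {
  const incompleteUris = new Set() // distinct incompletely analyzed numbers
  const elementFrequency = {}      // element URI -> number of uses
  let analyzed = 0
  let incomplete = 0

  for (const record of records) {
    analyzed += 1
    const members = record.memberList || []
    if (members.some((member) => member === null)) {
      incomplete += 1
      incompleteUris.add(record.uri)
    }
    for (const member of members) {
      if (member === null) continue
      elementFrequency[member.uri] = (elementFrequency[member.uri] || 0) + 1
    }
  }

  return {
    analyzed,
    incomplete,
    distinctIncomplete: incompleteUris.size,
    distinctElements: Object.keys(elementFrequency).length,
    elementFrequency,
  }
}
```

If each concept URI occurs only once in the input, `incomplete` and `distinctIncomplete` come out equal; they differ only when the same URI appears in more than one record.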
@stefandesu
Member

> • Number of incompletely analyzed DDC numbers
> • Number of distinct incompletely analyzed DDC numbers

What's the difference between these two? The database uses the concept URI as the primary key, so there should be no duplicates.

> I'd use jq

How would you use jq for this? The data is currently only in the PostgreSQL database.

@nichtich
Member Author

nichtich commented May 6, 2021

> How would you use jq for this? The data is currently only in the PostgreSQL database.

The data is converted to JSON and then imported, so we could also create a JSON dump and work with that. It's just a matter of convenience.

@stefandesu
Member

> The data is converted to JSON and then imported, so we could also create a JSON dump and work with that. It's just a matter of convenience.

I'll check whether the convert script's conversion to JSON works properly at the moment. I haven't tested it, since we haven't used it at all.

If it's easier for you to write a jq call that calculates the data than to write a small helper script in JavaScript, then go ahead. I wouldn't even know where to start. 😅

@stefandesu
Member

Added a small fix in 6f63368. Now the convert script correctly outputs ndjson when used without the --import flag.
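
For reference, ndjson is one JSON document per line, so the output can be read back with very little code. The following is only a sketch; `parseNdjson` is a hypothetical helper name and not something the convert script provides:

```javascript
// Parse newline-delimited JSON: one JSON value per non-empty line.
// Blank lines (including a trailing newline) are skipped.
function parseNdjson(text) {
  return text
    .split(/\r?\n/)
    .filter((line) => line.trim() !== "")
    .map((line) => JSON.parse(line))
}
```

The text could come from a file holding the redirected output of the convert script; for a large dump, reading line by line (for example with Node's `readline` module) would avoid loading everything into memory at once.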

@nichtich
Member Author

nichtich commented May 7, 2021

There is a statistics script on the dev branch. The server still needs to be adjusted to serve stats.json (if available) and/or show a summary in the interface. The date of the last update is also not included yet.
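
A rough sketch of the "if available" part, independent of any particular server framework. `statsResponse` and the `readStats` callback are hypothetical names introduced here for illustration; how the actual server would wire this in is not decided in this issue:

```javascript
// Build a response for the statistics endpoint.
// readStats is a caller-supplied function that returns the raw contents of
// stats.json as a string, or throws if the file is missing or unreadable.
function statsResponse(readStats) {
  try {
    return { status: 200, body: JSON.parse(readStats()) }
  } catch (error) {
    // Missing file or invalid JSON: report that no statistics are available.
    return { status: 404, body: { message: "Statistics not available" } }
  }
}
```

In a Node server, `readStats` could be something like `() => fs.readFileSync("stats.json", "utf8")`, with the returned `status` and `body` passed on to whatever response object the server uses.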

@nichtich nichtich added this to the 0.2.0 milestone May 10, 2021
@nichtich nichtich added the data issues related to decomposition data label Nov 25, 2021
@stefandesu stefandesu modified the milestones: 0.2.0, 0.3.0 Dec 1, 2021
@nichtich nichtich modified the milestones: 0.3.0, 0.4.0 Jan 21, 2022
2 participants