-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WFCatalog metadata dependency #23
Comments
@jbienkowski, Question: Is reprocessing a crucial issue. I mean performance-wise? The workflow could be something like:
Note, that this change requires adjusting the collector's delete facilities, too.
I'm not sure if I would implement this approach. Imagine a request which queries the entire waveform metadata inventory. Then, filtering becomes costly. I'm aware that there are service level configuration parameters such as wfcatalog/service/configuration.json Lines 12 to 14 in 93f3d7b
wfcatalog/service/configuration.json Line 19 in 93f3d7b
|
@jbienkowski, have you seen already this wfcatalog/collector/config.json Lines 20 to 22 in 93f3d7b
See also wfcatalog/collector/WFCatalogCollector.py Lines 475 to 512 in 93f3d7b
wfcatalog/collector/WFCatalogCollector.py Lines 279 to 299 in 93f3d7b
|
We originally decided not to include just the channels in the metadata because a lot of nodes wanted to have their full archive processed, and not just what is exposed through FDSNWS. I guess updating the white list is too much manual labor.. It is probably better to add another option and add the FDSNWS response [net.sta.loc.cha] to a hashmap and do a lookup on whether to skip or not. |
I would be in favor of processing everything, and filtering the output. This way, as soon as the metadata is available, wfcatalog can spit all the information out and there is no need to start looking for all the data to index each time some metadata is submitted . Of course, it should be implemented without making a fdsnws-station call for every wfcatalog request. |
@jschaeff, I get your points. However, this approach implies:
|
I often hit the wall of ignoring when the metadata changes. Could wfcatalog manage a cache of the StationXML metadata for each network he knows about (or just the part he needs) ? The cache can be refreshed at arbitrary frequency or manualy. |
Unfortunately, that's not enough. The
Versioning most probably requires more than just a simple time stamp.
OT: Interestingly, not caching StationXML metadata was a requirement when designing |
Sorry, github sent my comment with some keyboard shortcut I hit ... |
Currently, WFCatalog does not depend on station metadata - it calculates metrics for acquired data even if some channels are not defined in StationXML. In those cases users can retrieve the metrics, but are not able to download the data itself via FDSNWS-Dataselect web service which strongly depends on metadata.
Possible solutions:
metadata
query parameter with default valuetrue
in the WFCatalog implementation which would still allow retrieval of all available metricsThe text was updated successfully, but these errors were encountered: