-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Exclusion of unnecessary parameters from AdvancedTreeSearchLmImageAndGlobalCacheJob hash calculation #514
Comments
This is possibly a duplicate of #430 I am not really familiar with the crp handling of RASR, so I can not tell what the best solution is, but deleting non-relevant parameters sounds like breaking existing setups. |
I edited the original text, of course I meant that there is a flag that conserves the default behavior |
@michelwi still had open changes requested, and I assume @NeoLegends just changed his priorities as there was not enough interest from the rest to have it solved. |
@NeoLegends could you do a summary of why #446 was not merged yet and what were the open questions? |
#446 would reduce the problem but not mitigate it completely: We would not duplicate the lm image which takes up most of the wasted storage, but we would still have multiple unnecessary duplicates of the global cache. So we can leave the LmImageAndGlobalCacheJob as is (and deprecate it) but apply the same discussion points to the separate GlobalCacheJob. |
Exactly.
Not sure if I understand what you are proposing. |
We already have two separate jobs. I say:
|
Yeah agreed.
If this is not a merged PR, then I would not say that we have them. Is it correct as @JackTemaki said, that in order to merge it we should address your last list of comments? Or is there anything changed in the meanwhile? |
The jobs exist. They are just not used currently in the i6 standard pipelines.
I haven't looked at it in a while, but I assume I still hold to my comments. But I can give it another look later. |
that would be great, thank you |
Upon inspection I found all previous points (of me and others) still valid and I found even more problems that I added to the review. |
Thank you, I will look into this next week. |
Recently, I discovered that one terabyte of my space is occupied by different
AdvancedTreeSearchLmImageAndGlobalCacheJob
s that differ only by transition distortion penalty (TDP) values. However, all TDPs are part of our dynamic search procedure meaning that they are not relevant for the static search space and the global cache.I am opening this issue to discuss the most clean way to exclude this or other parameters from the hash calculation. After a short discussion with @vieting, we think there are two different scenarios one can explore:
crp
s to theAdvancedTreeSearchJob
, where the additionalcrp
, if notNone
, is used here. Otherwise the old behavior is performed.RasrConfig
if a default value within RASR is definedPlease let me know what are your thoughts, before I prepare a PR.
The text was updated successfully, but these errors were encountered: