Retrieval/train #50
I have not run inference yet, but I know how to do so; there is an example in our Google Drive. As a prior step, I need to be able to save the model after training.
Nice work! I see a few cosmetic things and have some questions, but none of that is important.
@artificialwisdomai/maintainers Hi gang. I think this PR is mergeable as is. This will enable future development. There are many gaps. The main problems with this PR are
and then:
After merge, let's solve these problems. I will file issues for the problems not identified by Robert.
Intel oneAPI must be installed, and the linker must be told about the oneAPI libraries. I did this by dropping a file:
```
sdake@beast-06:/etc/ld.so.conf.d$ cat artificial_wisdom_intel_one_api.conf
/opt/intel/oneapi/mkl/2023.1.0/lib/intel64
```
then rebuilding the linker cache:
```
sudo ldconfig
```
- You may have to build faiss on your local system with GPU support.
- The safety checks on the inputs could be better.
- 100G appears to break faiss.
- Render the loss rate in the train loop.
- Add the epoch to the saved name.
- Move the dataloader to within the train loop.
- Requirements files are independently named. Change the requirements file name.
- Remove the index_name parameter.
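Two of the follow-ups above, rendering the loss in the train loop and adding the epoch to the saved name, could look roughly like the sketch below. This is not the code in this PR: `checkpoint_path`, `train`, the `step` and `save` callables, and the `-epochNNNN.pt` naming scheme are all hypothetical names chosen for illustration.

```python
from pathlib import Path
from typing import Callable, List, Sequence


def checkpoint_path(directory: str, model_name: str, epoch: int) -> Path:
    """Build a checkpoint file name that includes the zero-padded epoch.

    The naming scheme is an assumption, not the project's convention.
    """
    return Path(directory) / f"{model_name}-epoch{epoch:04d}.pt"


def train(
    num_epochs: int,
    batches: Sequence,
    step: Callable[[object], float],
    save: Callable[[Path], None],
    directory: str = "checkpoints",
    model_name: str = "retrieval",
) -> List[Path]:
    """Run a bare-bones loop that reports the mean loss and saves once per epoch.

    `step` is assumed to run one optimisation step and return the loss as a float.
    `save` is assumed to write the model to the given path.
    `batches` must be re-iterable (for example a list), since it is walked every epoch.
    """
    saved: List[Path] = []
    for epoch in range(1, num_epochs + 1):
        total, count = 0.0, 0
        for batch in batches:
            total += step(batch)
            count += 1
        mean_loss = total / max(count, 1)  # avoid dividing by zero on empty input
        print(f"epoch {epoch}/{num_epochs} mean loss {mean_loss:.4f}")
        path = checkpoint_path(directory, model_name, epoch)
        save(path)
        saved.append(path)
    return saved
```

With PyTorch, `save` could be something like `lambda path: torch.save(model.state_dict(), path)`, assuming `model` is the module being trained.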
Let's get this training loop merged. Remaining work from this PR:
Longer term we need to decide how our natural language processing library will interface with the |
LGTM
Developers spend a significant amount of time waiting for things to build. At least make the process visually pleasing.
LGTM
Train our custom retrieval transformer (based upon RETRO):