trim input to TGI, moved clustering and summarization to dataprep and store in DB #893
… DB store Signed-off-by: Rita Brugarolas <[email protected]>
for more information, see https://pre-commit.ci
…1 cluster call for full graph in database Signed-off-by: Rita Brugarolas <[email protected]>
Signed-off-by: theresa <[email protected]>
ashahba
left a comment
Thanks @rbrugaro for this PR.
My feedback should be pretty straightforward.
Most of my questions about timeouts are just to bring them to your attention; it's your call to decide on the final default values.
ashahba
left a comment
LGTM!
… store in DB (opea-project#893)

* trim input to TGI, moved clustering and summarization to dataprep and DB store
  Signed-off-by: Rita Brugarolas <[email protected]>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
  for more information, see https://pre-commit.ci
* removed inspect_db causing error in precommit
  Signed-off-by: Rita Brugarolas <[email protected]>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
  for more information, see https://pre-commit.ci
* add HF token to dataprep container because tokenizer is used now
  Signed-off-by: Rita Brugarolas <[email protected]>
* updated READMEs to reflect latest changes
  Signed-off-by: Rita Brugarolas <[email protected]>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
  for more information, see https://pre-commit.ci
* bug fix all files are ingested and graph extracted first followed by 1 cluster call for full graph in database
  Signed-off-by: Rita Brugarolas <[email protected]>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
  for more information, see https://pre-commit.ci
* update README based on fix for multifile
  Signed-off-by: Rita Brugarolas <[email protected]>
* Changes to make graphrag ui work
  Signed-off-by: theresa <[email protected]>
* fix bug build communities done once at end of ingestion
  Signed-off-by: Rita Brugarolas <[email protected]>
* minor fixes
  Signed-off-by: Rita Brugarolas <[email protected]>
* README fixes
  Signed-off-by: Rita Brugarolas <[email protected]>

---------

Signed-off-by: Rita Brugarolas <[email protected]>
Signed-off-by: theresa <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: theresa <[email protected]>
Several enhancements: