forked from huggingface/chat-ui
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Web Search: Playwright, spatial parsing, markdown (huggingface#1094)
* feat: playwright, spatial parsing, markdown for web search Co-authored-by: Aaditya Sahay <[email protected]> * feat: choose multiple clusters if necessary (#2) * chore: resolve linting failures * feat: improve paring performance and error messages * feat: combine embeddable chunks together on cpu * feat: reduce parsed pages from 10 to 8 * feat: disable javascript in playwright by default * feat: embedding and parsing error messages * feat: move isURL, fix type errors, misc * feat: misc cleanup * feat: change serializedHtmlElement to interface * fix: isUrl filename * fix: add playwright dependencies to docker * feat: add playwright browsers to docker image * feat: enable javascript by default * feat: remove error message from console on failed page --------- Co-authored-by: Aaditya Sahay <[email protected]> Co-authored-by: Aaditya Sahay <[email protected]>
- Loading branch information
1 parent
18fba9f
commit 9ec5d84
Showing
39 changed files
with
1,871 additions
and
481 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.