-
Notifications
You must be signed in to change notification settings - Fork 609
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
September 2024 release #362
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
* add agentic workflow module and cli * update pyproject.toml and split google into its own import * remove unused import * update default search chain * Update __init__.py * Update search.py * move reqs into pyproject.toml, addressing PR comments * rename search_chain to openai_get_search_query * remove get_llm_name * move table_formatter to helpers.py * Update paperqa/agents/main.py Co-authored-by: James Braza <[email protected]> * Update paperqa/agents/main.py Co-authored-by: James Braza <[email protected]> * Update paperqa/agents/main.py Co-authored-by: James Braza <[email protected]> * Update paperqa/agents/docs.py Co-authored-by: James Braza <[email protected]> * Update paperqa/types.py Co-authored-by: James Braza <[email protected]> * rename compute_cost to compute_total_model_token_cost * remove stream_answer * rename to stub_manifest, and use Path for all paths * Update paperqa/llms.py Co-authored-by: James Braza <[email protected]> * move SKIP_AGENT_TESTS = False * nix _ = assignments * add test comments * types in conftest.py * split libs into llms * link openai chat timeout to query.timeout * Update paperqa/agents/__init__.py Co-authored-by: James Braza <[email protected]> * logging revamp and renaming * Update tests/test_cli.py Co-authored-by: James Braza <[email protected]> * Update tests/test_cli.py Co-authored-by: James Braza <[email protected]> * move vertex import to func call, add docstring to SupportsPickle * docstring * remove _ = * remove bool return type from set * update gitignore * add config attribute to baase LLMModel class * replace get_current_settings -> get_settings * replace get_current_settings -> get_settings * PR simplifications * remove all stream_* functions * avoid modifying the root logger * re-organize logger import location * move hashlib into utils * refactor strip_answer into Answer object * label circular imports * ensure absolute paths are used in index name * limit select to be used only when DOI is not present in crossref * Update paperqa/agents/search.py Co-authored-by: James Braza <[email protected]> * Update paperqa/agents/search.py Co-authored-by: James Braza <[email protected]> * Update paperqa/agents/search.py Co-authored-by: James Braza <[email protected]> * Update paperqa/agents/search.py Co-authored-by: James Braza <[email protected]> * Update paperqa/agents/models.py Co-authored-by: James Braza <[email protected]> * reconfigure logging to not prevent propagation * remove newlines in the current year * use required fields as a subset * replace . with Path.cwd() --------- Co-authored-by: James Braza <[email protected]>
* add unpaywall provider * remove unused clean query method, update test cassettes to use [email protected]
…tests (#311) * making SearchIndex.fields a list * replace ids with md5 sum * rename stub for test_agent_sharing_state * run github actions on PRs into release branches too * ruff formatting * remove nested quotes for <3.12 * added staging data into tests; reimplemented tests to avoid downloads and keep consistent hashes; aded env vars to test.yaml * conftest removes extension from near match filenames * add match_on for sequential runs since they may be out of order * refresh casettes * replace deprecated pytest-vcr with pytest-recording * change matching criteria * set unpaywall email for CI * try explicit conditions on cassette files, set the cassette dir * removed vcr for sequential client tests * add fixture to ensure that log levels are reset * added cassettes back in for sequential tests after determining it was a log level failure * limit cli logger mutation to avoid clobbering other library defaults --------- Co-authored-by: Siddharth Narayanan <[email protected]>
Large refactor to decouple Docs, LLMs, and Config. Moves to centralized config that can be loaded from package data or defaults. This changes many function calls so that they pass config as an object or name of a config object. Removed doc_match and doc_index - these were only sometimes useful and not really part of current usage. We can add back if needed, but just complicated things. Removed the unnecessary complicated get_callbacks factories. Now, callbacks can have a kwarg of name to get access to name of chain being called. Switched to contextvars for setting answer_id in LLMResults so that we do not need to have so much back and forth for callbacks. Deferred updates to Answer objects until end of functions so that retrying is possible (except token counts) Generally moved all config out of Docs, except LLMModel which will be fixed in port to litellm Co-authored-by: Michael Skarlinski <[email protected]> Co-authored-by: James Braza <[email protected]> Co-authored-by: mskarlin <[email protected]>
I accidentally put an extra struct=True and broke everything
Co-authored-by: Michael Skarlinski <[email protected]>
Co-authored-by: James Braza <[email protected]>
Check if a journal name starts with `the`
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This comment is a draft for release notes below (this will be a merge commit)
New Features
Improvements
Regressions/Deprecation
We've removed the following features to keep our library focused:
doc_match
- we do not have enough data to support that this method actually helps for very large corpusesDetailed Changes:
list
and dynamically compute md5 hash in tests by @mskarlin in Rollback search fields tolist
and dynamically compute md5 hash in tests #311LLMModel.axyz_iter
type hints by @jamesbraza in FixingLLMModel.axyz_iter
type hints #324black
ened code to prevent IDE scrolling by @jamesbraza inblack
ened code to prevent IDE scrolling #330pytest-mock
plugin by @jamesbraza in Removedpytest-mock
plugin #328pytest-xdist
plugin by @jamesbraza in Addingpytest-xdist
plugin #329mypy
by @jamesbraza in Passingmypy
#332make_chain
in favor ofrun_prompt
by @jamesbraza in Removingmake_chain
in favor ofrun_prompt
#325refurb
tool, andlint
CI by @jamesbraza in Addingrefurb
tool, andlint
CI #333make_chain
in favor ofrun_prompt
#325 by @jamesbraza in Fixing arg ordering after #325 #334parse_text
after Passingmypy
#332 by @jamesbraza in Fixingparse_text
after #332 #335the
by @geemi725 in Check if a journal name starts withthe
#320ANN
autofixes by @jamesbraza in All RuffANN
autofixes #341.mailmap
by @jamesbraza in Adding in.mailmap
#342LangchainVectorStore
,llms
extra, and fixing upREADME
by @jamesbraza in RemovedLangchainVectorStore
,llms
extra, and fixing upREADME
#343requests
dependency by @jamesbraza in Droppingrequests
dependency #346html2text
requirement by @jamesbraza in Removedhtml2text
requirement #347litellm_get_search_query
by @jamesbraza in Better control flow inlitellm_get_search_query
#351aviary
andldp
for agents overlangchain
by @jamesbraza inaviary
andldp
for agents overlangchain
#358pylint
by @jamesbraza in Addingpylint
#349New Contributors
the
#320Full Changelog: v4.9.0...vnew