issues Search Results · repo:Unstructured-IO/unstructured-python-client language:Python
Filter by
37 results
(91 ms)37 results
inUnstructured-IO/unstructured-python-client (press backspace or delete to remove)Is your feature request related to a problem? Please describe. . Ex. I m always frustrated when Yes. I m currently using
the Unstructured API with chunking_strategy= by_title to segment documents for downstream ...
enhancement
mahendra867
- Opened 10 days ago
- #258
Hi! I am developing an ETL pipeline using restate and Unstructured, using python. I have the following problem:
- I do not want to use Partition Endpoint because I also need the enrichment step.
- ...
edoproch
- Opened 18 days ago
- #254
Describe the bug I m following the quick start on colab and when I run
response = unstructured_client.workflows.run_workflow(
request={
workflow_id : workflow_id,
}
)
I got the following ...
bug
edoproch
- 4
- Opened on Mar 7
- #245
Overview
Request to upgrade the project s Pydantic dependency from the current version to Pydantic 2.10 to leverage significant
performance improvements.
Motivation
Pydantic 2.10 brings substantial ...
enhancement
Tibiritabara
- 1
- Opened on Jan 14
- #218
Describe the bug For the python-client, setting split_pdf_cache_tmp_data_dir does not work. After inspecting the code I
see that the wrong key is being used in the parameters for PartitionParameters
...
bug
sam-ayo
- 1
- Opened on Dec 3, 2024
- #213
Describe the bug Getting following error when working with pdf (split document) ocr:
ERROR: Request (page 1) failed (LocalProtocolError( Illegal header value b XpNzdq29l4PsoAvw4Hh2VJWeLvjIox )). Waiting ...
bug
tkanhe-karini
- 1
- Opened on Nov 4, 2024
- #206
Describe the bug When specifying output_format as csv, the response from the api is different when split_pdf_page is
True or False. When False, the elements contain an extra metadata field: text_as_html. ...
bug
Coniferish
- Opened on Oct 28, 2024
- #201
Describe the bug When I call the partition function for a 100-page PDF, it raises an AsyncLibraryNotFoundError. This
issue does not always reproduce.
my parameters: files=files, pdf_infer_table_structure=True, ...
bug
felixchen464atrc
- 1
- Opened on Oct 21, 2024
- #196
Describe the bug When I set split_pdf_page=True,split_pdf_concurrency_level=15. Assuming the pdf is divided into 10
sets, it will report an error: ERROR: Failed to send request for page 1 ... WARNING: ...
issj6
- 1
- Opened on Oct 14, 2024
- #220
Describe the bug There is a hardcoded dummy GET request to api.unstructuredapp.io. There does not appear to be an easy
way to override this, and it also doesn t look necessary. This causes a timeout error ...
bug
alex-atkins
- 2
- Opened on Oct 9, 2024
- #191

Learn how you can use GitHub Issues to plan and track your work.
Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub IssuesProTip!
Press the /
key to activate the search input again and adjust your query.
Learn how you can use GitHub Issues to plan and track your work.
Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub IssuesProTip!
Restrict your search to the title by using the in:title qualifier.