Document parsing jam #5011

GAowc-stack · 2025-02-16T10:06:29Z


RAGFlow local deployment: parsing of the uploaded file is stuck.

KevinHuSh · 2025-02-17T05:28:10Z

Maintainer

The logs seem fine.
By default, files are parsed one at a time.

2 replies

troubleshootme Jun 16, 2025

How do we enable async parsing?

ZhenhangTung Jun 18, 2025
Collaborator

how do we enable async parsing?

Could you explain more?

troubleshootme · 2025-06-18T19:30:16Z


I believe I found the env vars pertaining to this: DOC_BULK_SIZE=50 and EMBEDDING_BATCH_SIZE=25. Those are the settings I am currently using.

The issue is that when using graph, it takes forever to parse and embed. I am using external providers such as OpenAI and JINA, and the number of tokens they receive is very low. The VM I am running RAGFlow in is currently allocated 64 GB of RAM and 32 cores. Utilization is fairly low, and I can't help but think things could move along much faster.

I guess my question is: how do those two values look, and would I get any benefit from increasing them further? Are there any other config values that would allow RAGFlow to send a much higher RPM to external embedding APIs?
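For a rough sense of what a batch-size setting changes, here is a minimal sketch of the batching arithmetic. It is not RAGFlow's implementation. The helper names (`read_int_env`, `embedding_requests`, `batches`) are hypothetical, the fallback values are simply the ones quoted in this post rather than RAGFlow defaults, and it assumes each embedding request carries up to EMBEDDING_BATCH_SIZE chunks.

```python
import math
import os


def read_int_env(name: str, fallback: int) -> int:
    """Read a positive integer from the environment, else return the fallback."""
    raw = os.environ.get(name, "").strip()
    try:
        value = int(raw)
    except ValueError:
        return fallback
    return value if value > 0 else fallback


def embedding_requests(n_chunks: int, batch_size: int) -> int:
    """Requests needed if each request carries up to batch_size chunks (assumption)."""
    return math.ceil(n_chunks / batch_size)


def batches(items, batch_size):
    """Yield consecutive slices of at most batch_size items."""
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


if __name__ == "__main__":
    # Fallbacks are the values from this post, not documented defaults.
    doc_bulk_size = read_int_env("DOC_BULK_SIZE", 50)
    embedding_batch_size = read_int_env("EMBEDDING_BATCH_SIZE", 25)
    n_chunks = 1000  # illustrative chunk count
    print("DOC_BULK_SIZE:", doc_bulk_size)
    print("requests for", n_chunks, "chunks:",
          embedding_requests(n_chunks, embedding_batch_size))
```

Under that assumption, doubling the batch size halves the number of requests for the same chunks, but it does not by itself raise the request rate. If the batches are sent one after another, throughput is still bounded by per-request latency and by the provider's rate limits.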


1 reply

ZhenhangTung Jun 19, 2025
Collaborator

Please check this PR to see if it answers your question: #7845
