You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The issue you're experiencing with the Doc Extractor not preserving the order of elements in a DOCX document might be related to how the parse_docx method processes the document. This method is designed to maintain the order of paragraphs and tables as they appear in the document's XML structure [1][2]. If tables are being placed at the beginning, it could be due to a bug or an unexpected behavior in the code.
To address this, you might want to check if there are any modifications or customizations in your setup that could affect the order. Additionally, reviewing the implementation of the parse_docx method in your version of Dify could help identify any discrepancies. If the issue persists, consider reporting it as a bug in the repository, providing detailed steps to reproduce the problem and any relevant code snippets or configurations.
Self Checks
Dify version
0.15.2
Cloud or Self Hosted
Self Hosted (Docker)
Steps to reproduce
Hello,
When I have a docx document with tables, the extractor places all the tables at the beginning and the order of the document is no longer respected.
✔️ Expected Behavior
respect the order
❌ Actual Behavior
No response
The text was updated successfully, but these errors were encountered: