You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have built a RAG pipeline with metadata extraction. I parsed 3 pdfs and got 3 Metadatas, one for each.
from llama_parse import LlamaParse
parser = LlamaParse(result_type="text")
docs = parser.load_data(file_path=full_files)
# attach metadata
for metadata, doc in zip(metadatas, docs):
doc.metadata.update(metadata)
Using the above code, I obtained five docs since the documents are divided page-wise, and two PDFs contain two pages each. The first three documents have metadata, while the last two do not.
The text was updated successfully, but these errors were encountered:
I have built a RAG pipeline with metadata extraction. I parsed 3 pdfs and got 3 Metadatas, one for each.
Using the above code, I obtained five docs since the documents are divided page-wise, and two PDFs contain two pages each. The first three documents have metadata, while the last two do not.
The text was updated successfully, but these errors were encountered: