Do you need to file an issue?
I have searched the existing issues and this bug is not already filed.
My model is hosted on OpenAI or Azure. If not, please look at the "model providers" issue and don't file a new one here.
I believe this is a legitimate bug, not just a question. If this is a question, please use the Discussions area.
Describe the bug
When following the Global Search notebook (link), I am getting a KeyError raised from the read_indexer_communities method.
KeyError: "Column(s) ['sub_community'] do not exist"
I also noticed that the read_indexer_communities method was not present when I installed graphrag via pip. To follow the notebook, I copied the method manually into the file, deleted the cache and output dirs, and re-ran indexing.
At line 227
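For anyone triaging this, a minimal sketch of how to confirm which columns are missing before calling the reader. The helper name `missing_columns` and both column lists are hypothetical, chosen only for illustration; the only name taken from the report is `sub_community`.

```python
from typing import Iterable, List


def missing_columns(available: Iterable[str], required: Iterable[str]) -> List[str]:
    """Return the required column names that are absent, in the order given."""
    present = set(available)
    return [name for name in required if name not in present]


# Hypothetical column sets, for illustration only (not the real schema).
older_output = ["id", "title", "level", "relationship_ids", "text_unit_ids"]
expected = ["id", "title", "level", "sub_community"]

print(missing_columns(older_output, expected))  # prints ['sub_community']
```

With pandas, the same check can be run by passing `df.columns` of the communities table written by the indexer as `available`. If `sub_community` shows up as missing, the indexed output and the reader code probably come from different GraphRAG versions.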
Steps to reproduce
No response
Expected Behavior
No response
GraphRAG Config Used
encoding_model: cl100k_base
skip_workflows: []
llm:
  api_key: ${GRAPHRAG_API_KEY}
  type: azure_openai_chat
  model_supports_json: true # recommended if this is available for your model.
  max_tokens: 4096
  # request_timeout: 180.0
  api_base: <REDACTED>
  api_version: 2024-02-15-preview
  deployment_name: <REDACTED>
  temperature: 0 # temperature for sampling
  top_p: 0.999 # top-p sampling
  n: 1 # Number of completions to generate

parallelization:
  stagger: 0.3
  # num_threads: 50 # the number of threads to use for parallel processing

async_mode: threaded # or asyncio

embeddings:
  ## parallelization: override the global parallelization settings for embeddings
  async_mode: threaded # or asyncio
  # target: required # or all
  # batch_size: 16 # the number of documents to send in a single request
  # batch_max_tokens: 8191 # the maximum number of tokens to send in a single request
  vector_store:
    type: lancedb
    db_uri: 'output/lancedb'
    container_name: default # A prefix for the vector store to create embedding containers. Default: 'default'.
    overwrite: true
  llm:
    api_key: ${GRAPHRAG_API_KEY}
    type: azure_openai_embedding
    api_base: <REDACTED>
    api_version: 2024-02-15-preview
    deployment_name: <REDACTED>

chunks:
  size: 1200
  overlap: 100
  group_by_columns: [id] # by default, we don't allow chunks to cross documents

input:
  type: file # or blob
  file_type: text # or csv
  base_dir: "input"
  file_encoding: utf-8
  file_pattern: ".*\\.txt$"

cache:
  type: file # or blob
  base_dir: "cache"

storage:
  type: file # or blob
  base_dir: "output"

reporting:
  type: file # or console, blob
  base_dir: "logs"

entity_extraction:
  prompt: "prompts/entity_extraction.txt"
  entity_types: [organization,person,geo,event]
  max_gleanings: 1

summarize_descriptions:
  prompt: "prompts/summarize_descriptions.txt"
  max_length: 500

claim_extraction:
  prompt: "prompts/claim_extraction.txt"
  description: "Any claims or facts that could be relevant to information discovery."
  max_gleanings: 1

community_reports:
  prompt: "prompts/community_report.txt"
  max_length: 2000
  max_input_length: 8000

cluster_graph:
  max_cluster_size: 10

umap:
  enabled: false # if true, will generate UMAP embeddings for nodes

snapshots:
  graphml: false
  raw_entities: false
  top_level_nodes: false
Logs and screenshots
No response
Additional Information
GraphRAG Version: 0.4.1
Operating System: MacOS
Python Version: 3.12.5
Related Issues:
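Since the report says read_indexer_communities was missing from the pip install, a quick check of what is actually installed may help narrow this down. This is only a sketch: the helpers `installed_version` and `has_attribute` are hypothetical, and the module path `graphrag.query.indexer_adapter` is an assumption about where the function lives, not something stated in this issue.

```python
import importlib
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed distribution version, or None if it is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def has_attribute(module_name: str, attribute: str) -> bool:
    """Return True if the module imports cleanly and exposes the attribute."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return False
    return hasattr(module, attribute)


print("graphrag version:", installed_version("graphrag"))
# Assumption: the reader function is expected in this module.
print(
    "read_indexer_communities available:",
    has_attribute("graphrag.query.indexer_adapter", "read_indexer_communities"),
)
```

If the second line prints False for the installed release, the notebook is likely ahead of that release. In that case the indexed output would need to come from the same source version as the notebook code, rather than from a manually copied function.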
adni03 added the bug and triage labels on Nov 15, 2024.