You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have searched the existing issues and this bug is not already filed.
My model is hosted on OpenAI or Azure. If not, please look at the "model providers" issue and don't file a new one here.
I believe this is a legitimate bug, not just a question. If this is a question, please use the Discussions area.
Describe the bug
I'm trying to use GRAPHRA_API.build_index to create my indexes, but I'm getting the following error:
[2025-01-2319:21:28,418: INFO/ForkPoolWorker-1] foundtextfilesfrominput, found [('Ata_Reuniao_Condominio_2.pdf__0.txt', {})]
[2025-01-2319:21:28,427: INFO/ForkPoolWorker-1] Found1files, loading1
[2025-01-2319:21:28,457: INFO/ForkPoolWorker-1] Final# of rows loaded: 1
[2025-01-2319:21:28,551: INFO/ForkPoolWorker-1] readingtablefromstorage: input.parquet
[2025-01-2319:21:29,097: INFO/ForkPoolWorker-1] readingtablefromstorage: input.parquet
[2025-01-2319:21:29,104: INFO/ForkPoolWorker-1] readingtablefromstorage: create_base_text_units.parquet
[2025-01-2319:21:29,134: INFO/ForkPoolWorker-1] readingtablefromstorage: create_base_text_units.parquet
[2025-01-2319:21:29,405: ERROR/ForkPoolWorker-1] errorextractinggraphTraceback (mostrecentcalllast):
File"/usr/local/lib/python3.12/site-packages/graphrag/index/operations/extract_entities/graph_extractor.py", line127, in__call__result=awaitself._process_document(text, prompt_variables)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^File"/usr/local/lib/python3.12/site-packages/graphrag/index/operations/extract_entities/graph_extractor.py", line156, in_process_documentself._extraction_prompt.format(**{
ValueError: unexpected'{'infieldname
[2025-01-2319:21:29,407: INFO/ForkPoolWorker-1] EntityExtractionErrordetails={'doc_index': 0, 'text': 'Ata de Reunião do Condomínio Residencial Monte Aurora\n\nData: 04 de dezembro de 2024\n\nHorário: 20:00\n\nLocal: Auditório Privativo do Condomínio\n\n1. Abertura da Reunião\n\nA reunião foi iniciada pelo síndico, Sr. Eduardo Fontana, pontualmente às\n\n20:00. Estiveram presentes 45 moradores, representando 75% das unidades,\n\nalém de representantes da administradora de condomínios LuxGest.\n\n2. Pautas Discutidas\n\n2.1 Instalação de Painéis Solares\n\nFoi apresentada a proposta de instalação de painéis solares nas áreas comuns,\n\ncom o objetivo de reduzir os custos de energia elétrica e promover\n\nsustentabilidade.\n\nO projeto foi aprovado por unanimidade, com um orçamento estimado de R$\n\n150.000,00, a ser financiado em 12 parcelas.\n\n2.2 Ampliação da Academia\n\nOs condôminos discutiram a necessidade de modernizar e ampliar a academia.\n\nUm arquiteto será contratado para projetar a reforma, e os custos serão\n\napresentados na próxima reunião.\n\n2.3 Segurança e Monitoramento\n\nDiante de recentes relatos de tentativas de invasão na região, foi aprovada a\n\n\x0cimplementação de novas câmeras de segurança e a contratação de uma\n\nempresa de monitoramento 24 horas.\n\nO investimento inicial será de R$ 25.000,00, com uma taxa mensal de R$\n\n3.500,00.\n\n2.4 Regras para Uso do Spa e Sauna\n\nMoradores sugeriram melhorias nas regras de agendamento e higiene do spa e\n\nsauna. Decidiu-se criar um sistema online de reservas e exigir o uso\n\nobrigatório de toalhas para proteção dos assentos.\n\n3. Encerramento\n\nO síndico agradeceu a presença e participação de todos e encerrou a reunião\n\nàs 22:15.\n\nA próxima reunião foi marcada para o dia 10 de março de 2025.\n\nAssinam a presente ata'}
[2025-01-2319:21:29,411: ERROR/ForkPoolWorker-1] errorextractinggraphTraceback (mostrecentcalllast):
File"/usr/local/lib/python3.12/site-packages/graphrag/index/operations/extract_entities/graph_extractor.py", line127, in__call__result=awaitself._process_document(text, prompt_variables)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^File"/usr/local/lib/python3.12/site-packages/graphrag/index/operations/extract_entities/graph_extractor.py", line156, in_process_documentself._extraction_prompt.format(**{
ValueError: unexpected'{'infieldname
[2025-01-2319:21:29,412: INFO/ForkPoolWorker-1] EntityExtractionErrordetails={'doc_index': 0, 'text': 'um sistema online de reservas e exigir o uso\n\nobrigatório de toalhas para proteção dos assentos.\n\n3. Encerramento\n\nO síndico agradeceu a presença e participação de todos e encerrou a reunião\n\nàs 22:15.\n\nA próxima reunião foi marcada para o dia 10 de março de 2025.\n\nAssinam a presente ata:\n\n__________________________ __________________________\n\nSíndico Eduardo Fontana Moradora Mariana Torres, Secretária da\n\nReunião'}
[2025-01-2319:21:29,416: ERROR/ForkPoolWorker-1] errorrunningworkflowextract_graphTraceback (mostrecentcalllast):
File"/usr/local/lib/python3.12/site-packages/graphrag/index/run/run_workflows.py", line166, in_run_workflowsresult=awaitrun_workflow(
^^^^^^^^^^^^^^^^^^^File"/usr/local/lib/python3.12/site-packages/graphrag/index/workflows/extract_graph.py", line45, inrun_workflowbase_entity_nodes, base_relationship_edges=awaitextract_graph(
^^^^^^^^^^^^^^^^^^^^File"/usr/local/lib/python3.12/site-packages/graphrag/index/flows/extract_graph.py", line33, inextract_graphentities, relationships=awaitextract_entities(
^^^^^^^^^^^^^^^^^^^^^^^File"/usr/local/lib/python3.12/site-packages/graphrag/index/operations/extract_entities/extract_entities.py", line136, inextract_entitiesentities=_merge_entities(entity_dfs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^File"/usr/local/lib/python3.12/site-packages/graphrag/index/operations/extract_entities/extract_entities.py", line168, in_merge_entitiesall_entities.groupby(["title", "type"], sort=False)
File"/usr/local/lib/python3.12/site-packages/pandas/core/frame.py", line9183, ingroupbyreturnDataFrameGroupBy(
^^^^^^^^^^^^^^^^^File"/usr/local/lib/python3.12/site-packages/pandas/core/groupby/groupby.py", line1329, in__init__grouper, exclusions, obj=get_grouper(
^^^^^^^^^^^^File"/usr/local/lib/python3.12/site-packages/pandas/core/groupby/grouper.py", line1043, inget_grouperraiseKeyError(gpr)
KeyError: 'title'
[2025-01-2319:21:29,425: INFO/ForkPoolWorker-1] Errorrunningpipeline! details=None
[2025-01-2319:21:29,506: ERROR/MainProcess] Taskhandlerraisederror: WorkerLostError('Worker exited prematurely: exitcode 0 Job: 0.')
Traceback (mostrecentcalllast):
File"/usr/local/lib/python3.12/site-packages/billiard/pool.py", line1265, inmark_as_worker_lostraiseWorkerLostError(
billiard.einfo.ExceptionWithTraceback:
"""Traceback (most recent call last): File "/usr/local/lib/python3.12/site-packages/billiard/pool.py", line1265, inmark_as_worker_lostraiseWorkerLostError(
billiard.exceptions.WorkerLostError: Workerexitedprematurely: exitcode0Job: 0.
Steps to reproduce
I'm not sure if this is important tbh, but I'm using the following:
Do you need to file an issue?
Describe the bug
I'm trying to use
GRAPHRA_API.build_index
to create my indexes, but I'm getting the following error:Steps to reproduce
I'm not sure if this is important tbh, but I'm using the following:
Please let me know if I'm doing anything wrong.
Expected Behavior
My documents should be indexed.
GraphRAG Config Used
Logs and screenshots
No response
Additional Information
The text was updated successfully, but these errors were encountered: