404 error on chembl_28_molecule data #458

Open
caufieldjh opened this issue Jun 13, 2022 · 0 comments
Labels
bug Something isn't working

Describe the bug

The most recent Jenkins build hit a 404 error when the downloader tried to retrieve the chembl_28_molecule dataset: Elasticsearch reports that there is no such index. This is not new; it has been happening since at least March 2022. Log output from the build:

querying for index: chembl_28_molecule: 0it [00:00, ?it/s]
WARNING:elasticsearch:POST https://www.ebi.ac.uk:443/chembl/elk/es/chembl_28_molecule/_search?scroll=1m [status:404 request:0.183s]
[2022-06-10T16:04:50.546Z] 
querying for index: chembl_28_molecule: 0it [00:00, ?it/s]
[2022-06-10T16:04:50.546Z] 
Downloading files:  86%|████████▌ | 25/29 [04:56<00:47, 11.86s/it]
[2022-06-10T16:04:50.796Z] Traceback (most recent call last):
[2022-06-10T16:04:50.796Z]   File "run.py", line 166, in <module>
[2022-06-10T16:04:50.796Z]     cli()
[2022-06-10T16:04:50.796Z]   File "/var/lib/jenkins/workspace/dge-graph-hub_kg-covid-19_master/gitrepo/venv/lib/python3.8/site-packages/click/core.py", line 829, in __call__
[2022-06-10T16:04:50.796Z]     return self.main(*args, **kwargs)
[2022-06-10T16:04:50.796Z]   File "/var/lib/jenkins/workspace/dge-graph-hub_kg-covid-19_master/gitrepo/venv/lib/python3.8/site-packages/click/core.py", line 782, in main
[2022-06-10T16:04:50.796Z]     rv = self.invoke(ctx)
[2022-06-10T16:04:50.796Z]   File "/var/lib/jenkins/workspace/dge-graph-hub_kg-covid-19_master/gitrepo/venv/lib/python3.8/site-packages/click/core.py", line 1259, in invoke
[2022-06-10T16:04:50.796Z]     return _process_result(sub_ctx.command.invoke(sub_ctx))
[2022-06-10T16:04:50.796Z]   File "/var/lib/jenkins/workspace/dge-graph-hub_kg-covid-19_master/gitrepo/venv/lib/python3.8/site-packages/click/core.py", line 1066, in invoke
[2022-06-10T16:04:50.796Z]     return ctx.invoke(self.callback, **ctx.params)
[2022-06-10T16:04:50.796Z]   File "/var/lib/jenkins/workspace/dge-graph-hub_kg-covid-19_master/gitrepo/venv/lib/python3.8/site-packages/click/core.py", line 610, in invoke
[2022-06-10T16:04:50.796Z]     return callback(*args, **kwargs)
[2022-06-10T16:04:50.796Z]   File "run.py", line 39, in download
[2022-06-10T16:04:50.796Z]     kg_download(*args, **kwargs)
[2022-06-10T16:04:50.796Z]   File "/var/lib/jenkins/workspace/dge-graph-hub_kg-covid-19_master/gitrepo/kg_covid_19/download.py", line 20, in download
[2022-06-10T16:04:50.796Z]     download_from_yaml(yaml_file=yaml_file, output_dir=output_dir,
[2022-06-10T16:04:50.796Z]   File "/var/lib/jenkins/workspace/dge-graph-hub_kg-covid-19_master/gitrepo/venv/lib/python3.8/site-packages/kghub_downloader/download_utils.py", line 78, in download_from_yaml
[2022-06-10T16:04:50.796Z]     download_from_api(item, outfile)
[2022-06-10T16:04:50.796Z]   File "/var/lib/jenkins/workspace/dge-graph-hub_kg-covid-19_master/gitrepo/venv/lib/python3.8/site-packages/kghub_downloader/download_utils.py", line 166, in download_from_api
[2022-06-10T16:04:50.796Z]     records = elastic_search_query(es_conn, index=yaml_item['index'], query=query_data)
[2022-06-10T16:04:50.796Z]   File "/var/lib/jenkins/workspace/dge-graph-hub_kg-covid-19_master/gitrepo/venv/lib/python3.8/site-packages/kghub_downloader/download_utils.py", line 200, in elastic_search_query
[2022-06-10T16:04:50.796Z]     for item in tqdm(results, desc="querying for index: " + index):
[2022-06-10T16:04:50.796Z]   File "/var/lib/jenkins/workspace/dge-graph-hub_kg-covid-19_master/gitrepo/venv/lib/python3.8/site-packages/tqdm/std.py", line 1195, in __iter__
[2022-06-10T16:04:50.796Z]     for obj in iterable:
[2022-06-10T16:04:50.796Z]   File "/var/lib/jenkins/workspace/dge-graph-hub_kg-covid-19_master/gitrepo/venv/lib/python3.8/site-packages/elasticsearch/helpers/actions.py", line 584, in scan
[2022-06-10T16:04:50.796Z]     resp = client.search(**search_kwargs)
[2022-06-10T16:04:50.796Z]   File "/var/lib/jenkins/workspace/dge-graph-hub_kg-covid-19_master/gitrepo/venv/lib/python3.8/site-packages/elasticsearch/client/utils.py", line 347, in _wrapped
[2022-06-10T16:04:50.796Z]     return func(*args, params=params, headers=headers, **kwargs)
[2022-06-10T16:04:50.796Z]   File "/var/lib/jenkins/workspace/dge-graph-hub_kg-covid-19_master/gitrepo/venv/lib/python3.8/site-packages/elasticsearch/client/__init__.py", line 1821, in search
[2022-06-10T16:04:50.797Z]     return self.transport.perform_request(
[2022-06-10T16:04:50.797Z]   File "/var/lib/jenkins/workspace/dge-graph-hub_kg-covid-19_master/gitrepo/venv/lib/python3.8/site-packages/elasticsearch/transport.py", line 466, in perform_request
[2022-06-10T16:04:50.797Z]     raise e
[2022-06-10T16:04:50.797Z]   File "/var/lib/jenkins/workspace/dge-graph-hub_kg-covid-19_master/gitrepo/venv/lib/python3.8/site-packages/elasticsearch/transport.py", line 427, in perform_request
[2022-06-10T16:04:50.797Z]     status, headers_response, data = connection.perform_request(
[2022-06-10T16:04:50.797Z]   File "/var/lib/jenkins/workspace/dge-graph-hub_kg-covid-19_master/gitrepo/venv/lib/python3.8/site-packages/elasticsearch/connection/http_urllib3.py", line 291, in perform_request
[2022-06-10T16:04:50.797Z]     self._raise_error(response.status, raw_data)
[2022-06-10T16:04:50.797Z]   File "/var/lib/jenkins/workspace/dge-graph-hub_kg-covid-19_master/gitrepo/venv/lib/python3.8/site-packages/elasticsearch/connection/base.py", line 328, in _raise_error
[2022-06-10T16:04:50.797Z]     raise HTTP_EXCEPTIONS.get(status_code, TransportError)(
[2022-06-10T16:04:50.797Z] elasticsearch.exceptions.NotFoundError: NotFoundError(404, 'index_not_found_exception', 'no such index [chembl_28_molecule]', chembl_28_molecule, index_or_alias)

The cached copy of the dataset was used instead.
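
For reference, here is a minimal sketch of the fall-back behaviour described above, so that a missing index produces a warning rather than an unhandled traceback. This is not the actual kghub_downloader code: `fetch_or_use_cache`, `IndexMissingError`, and the JSON cache layout are all hypothetical names chosen for illustration. In the real downloader the exception to catch would presumably be `elasticsearch.exceptions.NotFoundError`, as seen in the traceback.

```python
import json
from pathlib import Path
from typing import Callable, Iterable, List, Type


class IndexMissingError(Exception):
    """Hypothetical stand-in for elasticsearch.exceptions.NotFoundError."""


def fetch_or_use_cache(
    fetch: Callable[[], Iterable[dict]],
    cache_file: Path,
    missing_exc: Type[Exception] = IndexMissingError,
) -> List[dict]:
    """Return records from fetch(), or from cache_file if the index is gone.

    fetch is any zero-argument callable that yields records (for example a
    wrapper around a scan/scroll query). missing_exc is the exception type
    that signals a missing index.
    """
    try:
        # list() forces iteration, so errors raised lazily by a generator
        # are caught here as well.
        records = list(fetch())
    except missing_exc as err:
        if cache_file.exists():
            print(f"WARNING: {err}; using cached copy at {cache_file}")
            return json.loads(cache_file.read_text())
        # No cache to fall back on: let the original error propagate.
        raise
    # Refresh the cache after every successful download.
    cache_file.write_text(json.dumps(records))
    return records
```

Passing `missing_exc=NotFoundError` (imported from `elasticsearch.exceptions`) would be the assumed way to hook this up to the real client. Note that this only hides the symptom; the download configuration still points at an index the server says does not exist.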

To Reproduce

See build 20220610

caufieldjh added the bug label on Jun 13, 2022