Below is an overview of all parsing and processing issues that appeared while importing the data.
Issue #6475c5ce4ac09112991c891eb8c98521bbb507aa: us_nv_med_exclusions

| error_str | HTTPSConnectionPool(host='api.zyte.com', port=443): Max retries exceeded with url: /v1/extract (Caused by ResponseError('too many 520 error responses')) |
|---|---|
| url | https://api.zyte.com/v1/extract |
| exception | urllib3.exceptions.ResponseError: too many 520 error responses
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/venv/lib/python3.12/site-packages/requests/adapters.py", line 644, in send
resp = conn.urlopen(
^^^^^^^^^^^^^
File "/venv/lib/python3.12/site-packages/urllib3/connectionpool.py", line 942, in urlopen
return self.urlopen(
^^^^^^^^^^^^^
File "/venv/lib/python3.12/site-packages/urllib3/connectionpool.py", line 942, in urlopen
return self.urlopen(
^^^^^^^^^^^^^
File "/venv/lib/python3.12/site-packages/urllib3/connectionpool.py", line 942, in urlopen
return self.urlopen(
^^^^^^^^^^^^^
[Previous line repeated 7 more times]
File "/venv/lib/python3.12/site-packages/urllib3/connectionpool.py", line 932, in urlopen
retries = retries.increment(method, url, response=response, _pool=self)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/venv/lib/python3.12/site-packages/urllib3/util/retry.py", line 519, in increment
raise MaxRetryError(_pool, url, reason) from reason # type: ignore[arg-type]
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
urllib3.exceptions.MaxRetryError: HTTPSConnectionPool(host='api.zyte.com', port=443): Max retries exceeded with url: /v1/extract (Caused by ResponseError('too many 520 error responses'))
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/opensanctions/zavod/zavod/crawl.py", line 35, in crawl_dataset
entry_point(context)
File "${ZAVOD_DATASETS_PATH}/us/nv/med_exclusions/crawler.py", line 104, in crawl
context, "source.pdf", crawl_pdf_url(context), PDF, geolocation="US"
^^^^^^^^^^^^^^^^^^^^^^
File "${ZAVOD_DATASETS_PATH}/us/nv/med_exclusions/crawler.py", line 93, in crawl_pdf_url
doc = zyte_api.fetch_html(
^^^^^^^^^^^^^^^^^^^^
File "/opensanctions/zavod/zavod/shed/zyte_api.py", line 391, in fetch_html
zyte_result = fetch(
^^^^^^
File "/opensanctions/zavod/zavod/shed/zyte_api.py", line 238, in fetch
api_response = context.http.post(
^^^^^^^^^^^^^^^^^^
File "/venv/lib/python3.12/site-packages/requests/sessions.py", line 637, in post
return self.request("POST", url, data=data, json=json, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/venv/lib/python3.12/site-packages/requests/sessions.py", line 589, in request
resp = self.send(prep, **send_kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/venv/lib/python3.12/site-packages/requests/sessions.py", line 703, in send
r = adapter.send(request, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/venv/lib/python3.12/site-packages/requests/adapters.py", line 668, in send
raise RetryError(e, request=request)
requests.exceptions.RetryError: HTTPSConnectionPool(host='api.zyte.com', port=443): Max retries exceeded with url: /v1/extract (Caused by ResponseError('too many 520 error responses')) |
| dataset | us_nv_med_exclusions |
| exc_info | true |
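
The traceback above contains three linked exceptions: a `ResponseError` that is the direct cause of a `MaxRetryError`, which is in turn being handled when a `RetryError` is raised. The following is a minimal, standard-library-only sketch of how such a chain can arise. The exception classes and the `increment`, `urlopen`, and `send` functions are simplified local stand-ins written for illustration. They are not the real urllib3 or requests implementations, and the retry budget of 3 is an assumed value.

```python
class ResponseError(Exception):
    """Local stand-in for urllib3.exceptions.ResponseError (illustrative only)."""


class MaxRetryError(Exception):
    """Local stand-in for urllib3.exceptions.MaxRetryError (illustrative only)."""


class RetryError(Exception):
    """Local stand-in for requests.exceptions.RetryError (illustrative only)."""


def increment(retries_left, status):
    """Use up one retry and fail once the budget drops below zero."""
    retries_left -= 1
    if retries_left < 0:
        reason = ResponseError(f"too many {status} error responses")
        # "raise ... from reason" sets __cause__, which the traceback prints as
        # "The above exception was the direct cause of the following exception".
        raise MaxRetryError(
            f"Max retries exceeded (Caused by {reason!r})"
        ) from reason
    return retries_left


def urlopen(fetch_status, retries_left):
    """Call fetch_status() and recurse while it keeps returning 520."""
    status = fetch_status()
    if status == 520:
        retries_left = increment(retries_left, status)
        # The recursion mirrors the repeated urlopen frames in the traceback.
        return urlopen(fetch_status, retries_left)
    return status


def send(fetch_status, retries=3):
    """Translate the low-level failure into a higher-level error."""
    try:
        return urlopen(fetch_status, retries)
    except MaxRetryError as exc:
        # Raising inside "except" without "from" sets __context__, printed as
        # "During handling of the above exception, another exception occurred".
        raise RetryError(str(exc))


if __name__ == "__main__":
    try:
        send(lambda: 520)  # a hypothetical upstream that always answers 520
    except RetryError as err:
        print(type(err.__context__).__name__)
        print(type(err.__context__.__cause__).__name__)
```

In this sketch, a caller that wants to treat an exhausted retry budget as a transient upstream failure would catch `RetryError` at the outermost level, which corresponds to the last exception shown in the traceback.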