Warnings and errors: md_companies JSON

Below is an overview of all parsing and processing issues that appeared while importing the data.

Go back to the global issues overview or view the dataset page.

Issue #6f69bced6c542540d7fbf7198ca3dc713046a70c: zavod.runtime.safety
HTML/XSS suspicion in property value: Ungheni, Costuleni, str. <, ap.(of.) >
entity_idoc-companies-md-1016609001891
propaddress
prop_typeaddress
raw_valueUngheni, Costuleni, str. <, ap.(of.) >
cleaned_valueUngheni, Costuleni, str. <, ap.(of.) >
datasetmd_companies
Issue #269064171c442f412cc87517cdf9133495c3dde0: zavod.runtime.safety
HTML/XSS suspicion in property value: Ungheni, Floriţoaia Veche, str. <, ap.(of.) >
entity_idoc-companies-md-1016609001905
propaddress
prop_typeaddress
raw_valueUngheni, Floriţoaia Veche, str. <, ap.(of.) >
cleaned_valueUngheni, Floriţoaia Veche, str. <, ap.(of.) >
datasetmd_companies
Issue #e34db6989be302aa3bd36ef356962fc5340f1509: zavod.runtime.safety
HTML/XSS suspicion in property value: Ungheni, Costuleni, str. <, ap.(of.) >
entity_idoc-companies-md-1016609001938
propaddress
prop_typeaddress
raw_valueUngheni, Costuleni, str. <, ap.(of.) >
cleaned_valueUngheni, Costuleni, str. <, ap.(of.) >
datasetmd_companies
Issue #28e7bc0742e3bc24fb98a9e3f403974614a372df: zavod.runtime.safety
HTML/XSS suspicion in property value: Rezina, Mateuţi, str. <, >
entity_idoc-companies-md-1020606001361
propaddress
prop_typeaddress
raw_valueRezina, Mateuţi, str. <, >
cleaned_valueRezina, Mateuţi, str. <, >
datasetmd_companies
Issue #410a54ea911945654a8f314c4c6fb72aa5dc60c1: md_companies
Runner failed with HTTPError on https://dataset.gov.md/dataset/18516-date-din-registrul-de-stat-al-unitatilor-de-drept-privind-organizatiile-necomerciale
error_str404 Client Error: Not Found for url: https://dataset.gov.md/dataset/18516-date-din-registrul-de-stat-al-unitatilor-de-drept-privind-organizatiile-necomerciale
urlhttps://dataset.gov.md/dataset/18516-date-din-registrul-de-stat-al-unitatilor-de-drept-privind-organizatiile-necomerciale
response_code404
response_text<!DOCTYPE html> <html lang="en"> <head> <meta charset="utf-8"> <meta http-equiv="X-UA-Compatible" content="IE=edge"> <meta name="viewport" content="width=device-width, initial-scale=1"> <!-- The above 3 meta tags *must* come first in the head; any other head content must come *after* these tags --> <title>Resursa Inexistentă</title> <!-- Bootstrap --> <link href="https://maintenance.itsec.md/css/custom.css" rel="stylesheet"> <link href="https://maintenance.itsec.md/css/bootstrap.min.css" rel="stylesheet"> <!-- HTML5 shim and Respond.js for IE8 support of HTML5 elements and media queries --> <!-- WARNING: Respond.js doesn't work if you view the page via file:// --> <!--[if lt IE 9]> <script src="https://oss.maxcdn.com/html5shiv/3.7.3/html5shiv.min.js"></script> <script src="https://oss.maxcdn.com/respond/1.4.2/respond.min.js"></script> <![endif]--> </head> <body class="midnightblue"> <nav id="myNavbar" class="navbar navbar-default navbar-inverse" role="navigation"> <!-- Brand and toggle get grouped for better mobile display --> <div class="container"> <div class="navbar-header"> <button type="button" class="navbar-toggle" data-toggle="collapse" data-target="#navbarCollapse"> <span class="sr-only">Toggle navigation</span> <span class="icon-bar"></span> <span class="icon-bar"></span> <span class="icon-bar"></span> </button> <a class="navbar-brand" href="#"><img src="https://maintenance.itsec.md/img/stemapng.png" class="icon" align="left" /> <span>GUVERNUL REPUBLICII MOLDOVA</span></a> </div> <!-- Collect the nav links, forms, and other content for toggling --> <div class="collapse navbar-collapse" id="navbarCollapse"> <ul class="nav navbar-nav"> <li class="active"><a href="https://stisc.gov.md" target="_blank">STISC</a></li> <li><a href="https://stisc.gov.md/ro/content/date-de-contact" target="_blank">Contacte</a></li> <!--<li><a href="https://www.tutorialrepublic.com/contact-us.php" target="_blank">Contact</a></li>--> </ul> </div> </div> </nav> <div class="content container"> <img src="https://maintenance.itsec.md/img/error-404.png" class="img-responsive" /> <div class="text-first row">Resursa nu a fost găsită</div> <div class="text-second row">Resursa accesată de Dvs nu a fost găsită, Vă rugăm să verificați corectitudinea adresei URL.</div> </div> <!-- jQuery (necessary for Bootstrap's JavaScript plugins) --> <script src="https://ajax.googleapis.com/ajax/libs/jquery/1.12.4/jquery.min.js"></script> <!-- Include all compiled plugins (below), or include individual files as needed --> <script src="https://maintenance.itsec.md/js/bootstrap.min.js"></script> </body> </html>
exceptionTraceback (most recent call last): File "/opensanctions/zavod/zavod/crawl.py", line 36, in crawl_dataset entry_point(context) File "${ZAVOD_DATASETS_PATH}/md/companies/parse.py", line 246, in crawl nonprofits_url = read_ckan(context, NONPROFITS_URL, "nonprofits") ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "${ZAVOD_DATASETS_PATH}/md/companies/parse.py", line 28, in read_ckan resource_list_doc = context.fetch_html(source_url) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/opensanctions/zavod/zavod/context.py", line 373, in fetch_html text = self.fetch_text( ^^^^^^^^^^^^^^^^ File "/opensanctions/zavod/zavod/context.py", line 288, in fetch_text response = self.fetch_response( ^^^^^^^^^^^^^^^^^^^^ File "/opensanctions/zavod/zavod/context.py", line 241, in fetch_response response.raise_for_status() File "/venv/lib/python3.12/site-packages/requests/models.py", line 1028, in raise_for_status raise HTTPError(http_error_msg, response=self) requests.exceptions.HTTPError: 404 Client Error: Not Found for url: https://dataset.gov.md/dataset/18516-date-din-registrul-de-stat-al-unitatilor-de-drept-privind-organizatiile-necomerciale
datasetmd_companies
exc_infoTrue