Below is an overview of all parsing and processing issues that appeared while importing the data.
Issue #f0bc9841b5f963b159f46674879f32c9c1bfa0f8: shu_uyghur_companies
error_str | 404 Client Error: Not Found for url: https://www.shu.ac.uk/helena-kennedy-centre-international-justice/research-and-projects/all-projects/useful-resources |
---|---|
url | https://www.shu.ac.uk/helena-kennedy-centre-international-justice/research-and-projects/all-projects/useful-resources |
response_code | 404 |
response_text |
(HTML body of the 404 response from www.shu.ac.uk, rendered with the Sheffield Hallam University site template 'CentersLanding'; the captured markup is truncated and garbled and is omitted here.)
|
exception | Traceback (most recent call last):
  File "/opensanctions/zavod/zavod/crawl.py", line 35, in crawl_dataset
    entry_point(context)
  File "${ZAVOD_DATASETS_PATH}/_global/shu_uyghur_companies/crawler.py", line 168, in crawl
    doc = context.fetch_html(context.data_url, cache_days=1)
          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/opensanctions/zavod/zavod/context.py", line 396, in fetch_html
    text = self.fetch_text(
           ^^^^^^^^^^^^^^^^
  File "/opensanctions/zavod/zavod/context.py", line 314, in fetch_text
    response = self.fetch_response(
               ^^^^^^^^^^^^^^^^^^^^
  File "/opensanctions/zavod/zavod/context.py", line 267, in fetch_response
    response.raise_for_status()
  File "/venv/lib/python3.12/site-packages/requests/models.py", line 1026, in raise_for_status
    raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 404 Client Error: Not Found for url: https://www.shu.ac.uk/helena-kennedy-centre-international-justice/research-and-projects/all-projects/useful-resources |
dataset | shu_uyghur_companies |
exc_info | true |
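
The `error_str` and the last line of the traceback follow the message format that `requests` produces in `raise_for_status()`: the status code, a "Client Error" or "Server Error" label, the reason phrase, and the URL. The sketch below reproduces that format using only the standard library, so it can be read without a network call. `describe_http_error` is a hypothetical helper written for illustration; it is not part of zavod or `requests`. It also assumes the reason phrase can be taken from `http.HTTPStatus`, whereas `requests` uses the reason sent by the server.

```python
from http import HTTPStatus
from typing import Optional


def describe_http_error(status_code: int, url: str) -> Optional[str]:
    """Return a requests-style error message for 4xx/5xx codes, else None.

    Hypothetical helper: mirrors the message layout seen in the log above.
    """
    try:
        # Assumption: use the standard reason phrase for the status code.
        reason = HTTPStatus(status_code).phrase
    except ValueError:
        # Non-standard status codes have no registered phrase.
        reason = "Unknown"

    if 400 <= status_code < 500:
        return f"{status_code} Client Error: {reason} for url: {url}"
    if 500 <= status_code < 600:
        return f"{status_code} Server Error: {reason} for url: {url}"
    # 1xx, 2xx and 3xx responses are not treated as errors.
    return None


if __name__ == "__main__":
    url = (
        "https://www.shu.ac.uk/helena-kennedy-centre-international-justice/"
        "research-and-projects/all-projects/useful-resources"
    )
    print(describe_http_error(404, url))
```

Under these assumptions, a 404 for the `url` in this issue yields the same text as the `error_str` field, which suggests the crawl failed because the source page is no longer available at that address rather than because of a parsing problem.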
OpenSanctions is free for non-commercial users. Businesses must acquire a data license to use the dataset.