Page MenuHomePhabricator

CCC : Timeout error File:JR久慈 駅 - panoramio (1).jpg
Open, HighPublic

Description

2020-02-14 06:43:34,471 corruptBackwards: DEBUG Skipping check on File:JR久慈 駅 - panoramio (1).jpg
Traceback (most recent call last):
  File "fullscan_reversed.py", line 23, in <module>
    scan.process_file() # Do the work
  File "/home/ccc/Commons-image-corruption-detector/base_scan.py", line 65, in process_file
    for image_page in pwb_wrappers.allimages(reverse=self.reverse):
  File "/usr/local/lib/python3.8/site-packages/pywikibot/data/api.py", line 2808, in __iter__
    self.data = self.request.submit()
  File "/usr/local/lib/python3.8/site-packages/pywikibot/data/api.py", line 1985, in submit
    rawdata, use_get = self._http_request(use_get, uri, body, headers,
  File "/usr/local/lib/python3.8/site-packages/pywikibot/data/api.py", line 1767, in _http_request
    self.wait()
  File "/usr/local/lib/python3.8/site-packages/pywikibot/data/api.py", line 2088, in wait
    raise TimeoutError('Maximum retries attempted without success.')
pywikibot.exceptions.TimeoutError: Maximum retries attempted without success.

Revisions and Commits

Event Timeline

TheSandDoctor triaged this task as High priority.EditedFeb 14 2020, 12:49 PM
TheSandDoctor created this task.

Continuously gets stuck at that entry. Further work needed to identify and address.

TheSandDoctor added a commit: Restricted Diffusion Commit.Feb 15 2020, 7:25 PM
TheSandDoctor added a commit: Restricted Diffusion Commit.Feb 15 2020, 7:31 PM

I got pywikibot to fail attempting to access

https://commons.wikimedia.org/w/api.php?gaifrom=%F0%A9%B5%8F&prop=info%7Cimageinfo%7Ccategoryinfo&inprop=protection&iiprop=timestamp%7Cuser%7Ccomment%7Curl%7Csize%7Csha1%7Cmetadata&iilimit=max&generator=allimages&action=query&indexpageids=&continue=gaicontinue%7C%7Cuserinfo&gaidir=descending&gailimit=500&meta=userinfo&uiprop=blockinfo%7Chasmsg&maxlag=5&format=json&gaicontinue=%EF%BC%90%EF%BC%97%E5%BC%8F%E6%A9%9F%E5%8B%95%E6%94%AF%E6%8F%B4%E6%A9%8B%EF%BC%88%E8%A3%85%E5%82%99%E5%93%81%E5%B1%95%E7%A4%BA%E3%83%BB%E6%96%BD%E8%A8%AD%E5%AD%A6%E6%A0%A1%EF%BC%89_%284%29.jpg

Without the percent-encoding, that's

https://commons.wikimedia.org/w/api.php?gaifrom=𩵏&prop=info|imageinfo|categoryinfo&inprop=protection&iiprop=timestamp|user|comment|url|size|sha1|metadata&iilimit=max&generator=allimages&action=query&indexpageids=&continue=gaicontinue||userinfo&gaidir=descending&gailimit=500&meta=userinfo&uiprop=blockinfo|hasmsg&maxlag=5&format=json&gaicontinue=07式機動支援橋(装備品展示・施設学校)_(4).jpg

Pywikibot is using POST and not GET, but it fails the same. Most recent attempt resulted in

{"error":{"code":"internal_api_error_WMFTimeoutException","info":"[XknCBQpAAEsAAJ7w@woAAACH] Caught exception of type WMFTimeoutException","errorclass":"WMFTimeoutException"},"servedby":"mw1280"}

I'd suggest taking this upstream to MediaWiki-API and/or operations so that someone with logstash access can look into what's going on with the media-wiki end.

TheSandDoctor moved this task from Backlog to Upstream on the Commons-Corruption-Checker board.
2020-02-24 08:42:16,590 __main__    : INFO File:�রপক্ষ মন্দির থেকে ভক্তদের বের হওয়াjpgশ্য..
Traceback (most recent call last):
  File "rcworker.py", line 221, in <module>
    main()
  File "rcworker.py", line 214, in main
    run_worker()
  File "rcworker.py", line 63, in run_worker
    file_page = pywikibot.FilePage(site, change.title)
  File "/usr/local/lib/python3.8/site-packages/pywikibot/tools/__init__.py", line 1744, in wrapper
    return obj(*__args, **__kw)
  File "/usr/local/lib/python3.8/site-packages/pywikibot/page.py", line 2478, in __init__
    super(FilePage, self).__init__(source, title, 6)
  File "/usr/local/lib/python3.8/site-packages/pywikibot/tools/__init__.py", line 1744, in wrapper
    return obj(*__args, **__kw)
File "/usr/local/lib/python3.8/site-packages/pywikibot/page.py", line 2478, in __init__
    super(FilePage, self).__init__(source, title, 6)
  File "/usr/local/lib/python3.8/site-packages/pywikibot/tools/__init__.py", line 1744, in wrapper
    return obj(*__args, **__kw)
  File "/usr/local/lib/python3.8/site-packages/pywikibot/page.py", line 2327, in __init__
    super(Page, self).__init__(source, title, ns)
  File "/usr/local/lib/python3.8/site-packages/pywikibot/page.py", line 200, in __init__
    self._link = Link(title, source=source, default_namespace=ns)
  File "/usr/local/lib/python3.8/site-packages/pywikibot/tools/__init__.py", line 1744, in wrapper
    return obj(*__args, **__kw)
  File "/usr/local/lib/python3.8/site-packages/pywikibot/page.py", line 6029, in __init__
    raise pywikibot.Error(
pywikibot.exceptions.Error: Title contains illegal char (\uFFFD 'REPLACEMENT CHARACTER')
TheSandDoctor changed the visibility from "All Users" to "Public (No Login Required)".Apr 8 2020, 5:04 PM