Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

S3 with wikitext dataset is dead #10

Open
snimu opened this issue Feb 23, 2024 · 1 comment
Open

S3 with wikitext dataset is dead #10

snimu opened this issue Feb 23, 2024 · 1 comment

Comments

@snimu
Copy link

snimu commented Feb 23, 2024

When I try to run main.py, I get the following output:

~/hlb-gpt$ python main.py 
downloading data and tokenizing (1-2 min)
Traceback (most recent call last):
  File "/home/ubuntu/hlb-gpt/main.py", line 102, in <module>
    urllib.request.urlretrieve(raw_data_source, raw_data_cache+'data.zip')
  File "/usr/lib/python3.10/urllib/request.py", line 241, in urlretrieve
    with contextlib.closing(urlopen(url, data)) as fp:
  File "/usr/lib/python3.10/urllib/request.py", line 216, in urlopen
    return opener.open(url, data, timeout)
  File "/usr/lib/python3.10/urllib/request.py", line 525, in open
    response = meth(req, response)
  File "/usr/lib/python3.10/urllib/request.py", line 634, in http_response
    response = self.parent.error(
  File "/usr/lib/python3.10/urllib/request.py", line 563, in error
    return self._call_chain(*args)
  File "/usr/lib/python3.10/urllib/request.py", line 496, in _call_chain
    result = func(*args)
  File "/usr/lib/python3.10/urllib/request.py", line 643, in http_error_default
    raise HTTPError(req.full_url, code, msg, hdrs, fp)
urllib.error.HTTPError: HTTP Error 403: Forbidden

This has been the case for at least several days now, so I assume that the S3 instance is dead.

@tysam-code
Copy link
Owner

Yes, it looks like salesforce took down the references to Wikitext as well. Let me look into this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants