How can I get the data? #27
I notice that the file path in your error message seems strange, such as "/mnt/ssd_mnt/pyj/SynGEC/bash/english_exp/../../src/src_gopar/parse.py".
OK, I'll try soon. Then, I wonder how I can get the data. The error is: FileNotFoundError: [Errno 2] No such file or directory: '../../data/wi_locness_train/tgt.txt'
You should download the preprocessed data, unzip it, and put it into https://github.com/HillZhang1999/SynGEC/tree/main/data
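One way to confirm that the unzipped data landed where the scripts look for it is to check the expected paths before running the pipeline. The sketch below is only an illustration: the helper name `missing_files` is hypothetical, the only file name taken from this thread is `wi_locness_train/tgt.txt`, and the relative location of the data directory is an assumption based on the error message.

```python
from pathlib import Path


def missing_files(data_dir, expected):
    """Return the entries of `expected` that are not regular files under data_dir."""
    root = Path(data_dir)
    return [rel for rel in expected if not (root / rel).is_file()]


if __name__ == "__main__":
    # Path taken from the FileNotFoundError in this thread; extend the list as needed.
    expected = ["wi_locness_train/tgt.txt"]
    # Assumption: this is run from bash/english_exp, so the data directory is two levels up.
    for rel in missing_files("../../data", expected):
        print("missing:", rel)
```

If the script prints nothing, every listed file was found; any printed path still needs to be downloaded or moved.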
Is the preprocessed data the same as the data at the link? I downloaded it and unzipped it into ./data/. Apply BPE...
Please enter the directory of this bash file, then run
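The reason the working directory matters is that paths such as '../../data/wi_locness_train/tgt.txt' are resolved relative to wherever the script is launched. A minimal sketch of that resolution follows; `resolve_from` is a hypothetical helper, it works purely on path strings (it does not touch the file system or follow symlinks), and the two directories are the ones that appear in this thread.

```python
import posixpath


def resolve_from(cwd, relative):
    """Lexically resolve `relative` as a process started in `cwd` would see it."""
    return posixpath.normpath(posixpath.join(cwd, relative))


if __name__ == "__main__":
    rel = "../../data/wi_locness_train/tgt.txt"
    # Launched from the bash file's own directory: points inside the repository.
    print(resolve_from("/mnt/ssd_mnt/pyj/SynGEC/bash/english_exp", rel))
    # Launched from the data directory: points outside the repository.
    print(resolve_from("/mnt/ssd_mnt/pyj/SynGEC/data", rel))
```

The first call yields /mnt/ssd_mnt/pyj/SynGEC/data/wi_locness_train/tgt.txt, while the second yields /mnt/ssd_mnt/pyj/data/wi_locness_train/tgt.txt, which is why the same relative path can fail from one directory and work from another.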
First, I unzipped the archive with "tar -zxvf syngec_preprocess.tar.gz". Then I ran the bash file, and the log is:
root@309e7fc0781e:/mnt/ssd_mnt/pyj/SynGEC/data# cd /mnt/ssd_mnt/pyj/SynGEC/bash/english_exp/
warnings.warn(
Setting OMP_NUM_THREADS environment variable for each process to be 1 in default, to avoid your system being overloaded, please further tune the variable for optimal performance in your application as needed.
/opt/conda/bin/python: No module named supar.cmds.biaffine_dep
If you don't want to re-train the parser, you can skip the data preprocessing step entirely. The preprocessed files can be downloaded directly from our Google Drive.
First, I downloaded 'Transformer-en' and renamed it to './model/syngec/english_transformer_baseline.pt'.
Then, I downloaded the preprocessed data.
Then I ran './pipeline_gopar.sh'.
But the error is:
Traceback (most recent call last):
File "/mnt/ssd_mnt/pyj/SynGEC/bash/english_exp/../../src/src_gopar/parse.py", line 17, in <module>
input_sentences = load(sys.argv[1])
File "/mnt/ssd_mnt/pyj/SynGEC/bash/english_exp/../../src/src_gopar/parse.py", line 9, in load
with open(filename, 'r') as f:
FileNotFoundError: [Errno 2] No such file or directory: '../../data/wi_locness_train/tgt.txt'
Loading resources...
Processing parallel files...
Traceback (most recent call last):
File "/opt/conda/bin/errant_parallel", line 8, in <module>
sys.exit(main())
File "/opt/conda/lib/python3.10/site-packages/errant/commands/parallel_to_m2.py", line 16, in main
in_files = [stack.enter_context(open(i)) for i in [args.orig]+args.cor]
File "/opt/conda/lib/python3.10/site-packages/errant/commands/parallel_to_m2.py", line 16, in <listcomp>
in_files = [stack.enter_context(open(i)) for i in [args.orig]+args.cor]
FileNotFoundError: [Errno 2] No such file or directory: '../../data/wi_locness_train/tgt.txt'
Traceback (most recent call last):
File "/mnt/ssd_mnt/pyj/SynGEC/bash/english_exp/../../src/src_gopar/convert_gec_data_to_parsing_data_english.py", line 153, in <module>
with open(conll_file, "r") as f1:
FileNotFoundError: [Errno 2] No such file or directory: '../../data/wi_locness_train/tgt.txt.conll_predict'
/opt/conda/lib/python3.10/site-packages/torch/distributed/launch.py:181: FutureWarning: The module torch.distributed.launch is deprecated
and will be removed in future. Use torchrun.
Note that --use-env is set by default in torchrun.
If your script expects --local-rank argument to be set, please
change it to read from os.environ['LOCAL_RANK'] instead. See
https://pytorch.org/docs/stable/distributed.html#launch-utility for
further instructions
warnings.warn(
WARNING:torch.distributed.run:
Setting OMP_NUM_THREADS environment variable for each process to be 1 in default, to avoid your system being overloaded, please further tune the variable for optimal performance in your application as needed.
/opt/conda/bin/python: No module named supar.cmds.biaffine_dep
/opt/conda/bin/python: No module named supar.cmds.biaffine_dep
/opt/conda/bin/python: No module named supar.cmds.biaffine_dep
/opt/conda/bin/python: No module named supar.cmds.biaffine_dep
/opt/conda/bin/python: No module named supar.cmds.biaffine_dep
/opt/conda/bin/python: No module named supar.cmds.biaffine_dep
/opt/conda/bin/python: No module named supar.cmds.biaffine_dep
/opt/conda/bin/python: No module named supar.cmds.biaffine_dep
ERROR:torch.distributed.elastic.multiprocessing.api:failed (exitcode: 1) local_rank: 0 (pid: 30676) of binary: /opt/conda/bin/python
Traceback (most recent call last):
File "/opt/conda/lib/python3.10/runpy.py", line 196, in _run_module_as_main
return _run_code(code, main_globals, None,
File "/opt/conda/lib/python3.10/runpy.py", line 86, in _run_code
exec(code, run_globals)
File "/opt/conda/lib/python3.10/site-packages/torch/distributed/launch.py", line 196, in <module>
main()
File "/opt/conda/lib/python3.10/site-packages/torch/distributed/launch.py", line 192, in main
launch(args)
File "/opt/conda/lib/python3.10/site-packages/torch/distributed/launch.py", line 177, in launch
run(args)
File "/opt/conda/lib/python3.10/site-packages/torch/distributed/run.py", line 785, in run
elastic_launch(
File "/opt/conda/lib/python3.10/site-packages/torch/distributed/launcher/api.py", line 134, in __call__
return launch_agent(self._config, self._entrypoint, list(args))
File "/opt/conda/lib/python3.10/site-packages/torch/distributed/launcher/api.py", line 250, in launch_agent
raise ChildFailedError(
torch.distributed.elastic.multiprocessing.errors.ChildFailedError:
supar.cmds.biaffine_dep FAILED
Failures:
[1]:
time : 2023-08-24_08:10:36
host : 309e7fc0781e
rank : 1 (local_rank: 1)
exitcode : 1 (pid: 30677)
error_file: <N/A>
traceback : To enable traceback see: https://pytorch.org/docs/stable/elastic/errors.html
[2]:
time : 2023-08-24_08:10:36
host : 309e7fc0781e
rank : 2 (local_rank: 2)
exitcode : 1 (pid: 30678)
error_file: <N/A>
traceback : To enable traceback see: https://pytorch.org/docs/stable/elastic/errors.html
[3]:
time : 2023-08-24_08:10:36
host : 309e7fc0781e
rank : 3 (local_rank: 3)
exitcode : 1 (pid: 30679)
error_file: <N/A>
traceback : To enable traceback see: https://pytorch.org/docs/stable/elastic/errors.html
[4]:
time : 2023-08-24_08:10:36
host : 309e7fc0781e
rank : 4 (local_rank: 4)
exitcode : 1 (pid: 30680)
error_file: <N/A>
traceback : To enable traceback see: https://pytorch.org/docs/stable/elastic/errors.html
[5]:
time : 2023-08-24_08:10:36
host : 309e7fc0781e
rank : 5 (local_rank: 5)
exitcode : 1 (pid: 30681)
error_file: <N/A>
traceback : To enable traceback see: https://pytorch.org/docs/stable/elastic/errors.html
[6]:
time : 2023-08-24_08:10:36
host : 309e7fc0781e
rank : 6 (local_rank: 6)
exitcode : 1 (pid: 30682)
error_file: <N/A>
traceback : To enable traceback see: https://pytorch.org/docs/stable/elastic/errors.html
[7]:
time : 2023-08-24_08:10:36
host : 309e7fc0781e
rank : 7 (local_rank: 7)
exitcode : 1 (pid: 30683)
error_file: <N/A>
traceback : To enable traceback see: https://pytorch.org/docs/stable/elastic/errors.html
Root Cause (first observed failure):
[0]:
time : 2023-08-24_08:10:36
host : 309e7fc0781e
rank : 0 (local_rank: 0)
exitcode : 1 (pid: 30676)
error_file: <N/A>
traceback : To enable traceback see: https://pytorch.org/docs/stable/elastic/errors.html
nohup: appending output to 'nohup.out'
nohup: appending output to 'nohup.out'
nohup: appending output to 'nohup.out'
nohup: appending output to 'nohup.out'
nohup: appending output to 'nohup.out'
nohup: appending output to 'nohup.out'
How can I fix it? Please help me.
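The repeated "No module named supar.cmds.biaffine_dep" lines suggest that the interpreter at /opt/conda/bin/python cannot find that submodule, separately from the missing data files. As a rough diagnostic sketch (the helper name `module_available` is hypothetical, and how the package should be installed for this repository is not covered here), the standard library can report whether a dotted module name is importable by the interpreter running the check:

```python
import importlib.util
import sys


def module_available(dotted_name):
    """Return True if `dotted_name` can be located by the current interpreter."""
    try:
        return importlib.util.find_spec(dotted_name) is not None
    except ModuleNotFoundError:
        # Raised when a parent package (e.g. the top-level package) is itself missing.
        return False


if __name__ == "__main__":
    # Show which interpreter is being tested; it should match the one in the log.
    print("interpreter:", sys.executable)
    for name in ("supar", "supar.cmds", "supar.cmds.biaffine_dep"):
        print(name, "->", "found" if module_available(name) else "NOT found")
```

Running this with the same python that the pipeline uses shows at which level the lookup fails: the whole package, the `cmds` subpackage, or only the `biaffine_dep` module.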