The dataset is the multilingual ACE2005 dataset, which I am using for pre-training.
Code:
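import json
from os import path

output_dir = "./data/ace-event/processed-data/json"
for fold in ["train", "dev", "test"]:
    # Output file for the converted examples of this split.
    g_convert = open(path.join(output_dir, fold + "_convert.json"), "w")
    with open(path.join(output_dir, fold + ".json"), "r") as g:
        for line in g:
            # Each line is expected to hold one complete JSON object; this call raises the error.
            line = json.loads(line)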
Error:
Traceback (most recent call last):
  File "./scripts/data/ace-event/convert_examples.py", line 10, in <module>
    line = json.loads(line)
  File "/root/miniconda3/lib/python3.7/json/__init__.py", line 348, in loads
    return _default_decoder.decode(s)
  File "/root/miniconda3/lib/python3.7/json/decoder.py", line 337, in decode
    obj, end = self.raw_decode(s, idx=_w(s, 0).end())
  File "/root/miniconda3/lib/python3.7/json/decoder.py", line 353, in raw_decode
    obj, end = self.scan_once(s, idx)
json.decoder.JSONDecodeError: Expecting property name enclosed in double quotes: line 2 column 1 (char 2)
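The position reported in the message (line 2 column 1, char 2) may say more than the wording does. A minimal sketch for comparing error positions, assuming only the standard `json` module; `decode_error_position` is a hypothetical helper name, not part of the original script:

```python
import json


def decode_error_position(text):
    """Return (lineno, colno, pos) of the JSON decode error, or None if text parses."""
    try:
        json.loads(text)
    except json.JSONDecodeError as err:
        return err.lineno, err.colno, err.pos
    return None


# A lone opening brace followed by a newline, which is what `for line in g`
# yields first when the file holds one pretty-printed JSON object.
print(decode_error_position("{\n"))         # (2, 1, 2)

# A single-quoted record on one line fails earlier, still on line 1.
print(decode_error_position("{'a': 1}\n"))  # (1, 2, 1)
```

The traceback matches the first case, so it seems likely that the first line of the file is just `{` and that the file is one multi-line JSON document rather than one record per line. That is an inference from the error position only; the file itself is not shown here.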
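1. The train.json file contains too much data, so I gave up on manually changing the single quotes ('') to double quotes ("").
I would like to fix this problem so that the code runs normally.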
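One possible way to avoid editing the quotes by hand is a loader that accepts several layouts. The sketch below is an assumption about what the files might look like, not a confirmed fix. It tries the whole file as a single document first, then falls back to one record per line, and uses `ast.literal_eval` for text written as Python literals with single quotes. `parse_text` and `load_records` are hypothetical names introduced for illustration.

```python
import ast
import json


def parse_text(text):
    """Parse text as strict JSON, falling back to a Python literal."""
    try:
        return json.loads(text)
    except json.JSONDecodeError:
        # Assumption: the text is a Python dict/list repr using single quotes.
        return ast.literal_eval(text)


def load_records(file_path):
    """Return a list of records from a single-document or one-record-per-line file."""
    with open(file_path, "r", encoding="utf-8") as f:
        content = f.read()
    try:
        # Case 1: the whole file is one document, possibly spread over many lines.
        doc = parse_text(content)
        return doc if isinstance(doc, list) else [doc]
    except (ValueError, SyntaxError):
        pass
    # Case 2: one record per line; blank lines are skipped.
    return [parse_text(line.strip()) for line in content.splitlines() if line.strip()]
```

In the original loop, `for line in g:` could then become `for line in load_records(path.join(output_dir, fold + ".json")):`, with the `json.loads(line)` call dropped. This reads each file fully into memory, which may matter for a large train.json, and it assumes each record is a dict; a file whose only content is a top-level list is returned as that list.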