Yike Wang*, Shangbin Feng*, Heng Wang, Weijia Shi, Vidhisha Balachandran, Tianxing He, Yulia Tsvetkov
The top-level keys in the json file correspond to primary fields, and each data point within a field is represented as a dictionary, with the following key-value pairs:
main_entity
(str): an entity from the generated entity listparametric_knowledge
(str): extracted parametric knowledge about themain_entity
named_entity_lst
(lst): named entities with corresponding types returned by NER modelsconflict_generation_method
(str): either "substitution" or "shuffling", representing in-domain named entity substitution and in-domain entity shuffling respectivelyentity_before
(str): an entity originally presents in theparametric_knowledge
before substitution or shufflingentity_after
(str): the entity that replaces theentity_before
in cases of substitution or shufflingconflicting_knowledge
(str): the conflicting knowledge created by substitution or shufflingquestion_about_conflicting_segments
(str): a question related to the conflicting segments ofconflicting_knowledge
question_about_nonconflicting_segments
(str): a question related to the nonconflicting segments ofconflicting_knowledge
If you found this work helpful, please consider starring this repository and citing our paper as shown below:
@article{wang2023resolving,
title={Resolving knowledge conflicts in large language models},
author={Wang, Yike and Feng, Shangbin and Wang, Heng and Shi, Weijia and Balachandran, Vidhisha and He, Tianxing and Tsvetkov, Yulia},
journal={arXiv preprint arXiv:2310.00935},
year={2023}
}