
Different label categories than expected in a spark-nlp NER model #13167

Answered by maziyarpanahi
ag-din asked this question in Q&A

Hi,

  1. Yes! I am not sure about your own CoNLL file, but it does not contain any LABELONE or LABELTWO labels. Instead, it contains the following labels (Spark NLP does not make labels up, so either you are reading the wrong CoNLL index or this is what is actually inside that CoNLL file):
U_SPECSKILL
I-SPECSKILL
I-HUMANSKILL
B-SPECSKILL
B-HUMANSKILL
U-HUMANSKILL
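
One way to check which labels a CoNLL file really contains is to count the values in the label column yourself. The snippet below is only a minimal sketch in plain Python, not part of Spark NLP: the helper name `count_conll_labels`, the sample rows, and the assumption that the file has four whitespace-separated columns with the NER tag at index 3 are all illustrative. If your file keeps its tags in a different column, change `label_index` to match.

```python
from collections import Counter


def count_conll_labels(lines, label_index=3):
    """Count the tags found in one column of CoNLL-style lines.

    Assumes whitespace-separated columns; `label_index` is the
    zero-based column holding the NER tag (hypothetical default of 3).
    """
    counts = Counter()
    for line in lines:
        line = line.strip()
        # Skip sentence separators and document markers.
        if not line or line.startswith("-DOCSTART-"):
            continue
        columns = line.split()
        if len(columns) > label_index:
            counts[columns[label_index]] += 1
    return counts


# Made-up sample rows, only to illustrate the layout.
sample = """-DOCSTART- -X- -X- O

Python NNP B-NP B-SPECSKILL
programming NN I-NP I-SPECSKILL
and CC O O
teamwork NN B-NP U-HUMANSKILL
""".splitlines()

label_counts = count_conll_labels(sample)
print(sorted(label_counts))

# Reading the wrong column yields a different label set entirely.
wrong_column = count_conll_labels(sample, label_index=2)
print(sorted(wrong_column))
```

For a real file, pass an open file handle instead of `sample`. If the printed set does not match what you expected, either the column index or the file contents differ from your assumption.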

Labels are counted with their B- and I- prefixes, plus O. That is why the CoNLL2003 file you tested, which has 4 entities (not labels), gives you 9 labels: 8 distinct labels starting with B- or I-, plus O, which makes 9 unique labels to learn during training. You have mistaken the entities for labe…
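
The arithmetic above can be sketched in a few lines of Python. This is an illustration only: the helper `bio_labels` is hypothetical, and the entity names PER, LOC, ORG and MISC are the four entity types of the standard CoNLL2003 data.

```python
def bio_labels(entities):
    """Expand entity names into the labels a B-/I-/O scheme produces."""
    labels = ["O"]
    for entity in entities:
        labels.append(f"B-{entity}")
        labels.append(f"I-{entity}")
    return labels


# The four CoNLL2003 entity types.
conll2003_entities = ["PER", "LOC", "ORG", "MISC"]
labels = bio_labels(conll2003_entities)

print(labels)
print(len(labels))  # 2 * 4 + 1 = 9
```

A file that also uses other prefixes, such as the U- tags listed earlier, will have more unique labels than this B-/I-/O count.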

Answer selected by ag-din