Skip to content

ssplit.eolonly rises NullPointerException at edu.stanford.nlp.pipeline.NERCombinerAnnotato #20

@verosol

Description

@verosol

So, basically we have an already tokenised corpus with golden sentence segmentation, which we want to preserve. Evidently, we found this parameters :
tokenize.whitespace = true
ssplit.eolonly = true

They work alright together with tokenize,ssplit,pos,lemma and parses, but it we want to pass all the annotators needed for the coreference resolution
annotators = tokenize,ssplit,pos,lemma,ner,parse,coref

it gives error Nullpointer exception specifically on NER annotation part.

Processing file /Users/nikahelicopter/Dropbox/data/new_gold/txt/xx00.txt ... writing to /Users/nikahelicopter/Downloads/stanford-corenlp-full-2018-10-05/xx00.txt.xml
Exception in thread "main" java.lang.NullPointerException
at edu.stanford.nlp.pipeline.NERCombinerAnnotator.annotate(NERCombinerAnnotator.java:322)
at edu.stanford.nlp.pipeline.AnnotationPipeline.annotate(AnnotationPipeline.java:76)
at edu.stanford.nlp.pipeline.StanfordCoreNLP.annotate(StanfordCoreNLP.java:637)
at edu.stanford.nlp.pipeline.StanfordCoreNLP.annotate(StanfordCoreNLP.java:647)
at edu.stanford.nlp.pipeline.StanfordCoreNLP.processFiles(StanfordCoreNLP.java:1226)
at edu.stanford.nlp.pipeline.StanfordCoreNLP.processFiles(StanfordCoreNLP.java:1060)
at edu.stanford.nlp.pipeline.StanfordCoreNLP.run(StanfordCoreNLP.java:1326)
at edu.stanford.nlp.pipeline.StanfordCoreNLP.main(StanfordCoreNLP.java:1389)

We use stanford-corenlp-full-2018-10-05 version 3.9.2
An example file:
xx01.txt

Parameters:
annotators = tokenize,ssplit,pos,lemma,ner,parse,coref
tokenize.whitespace = true
ssplit.eolonly = true
coref.algorithm = neural
file = /Users/nikahelicopter/Dropbox/data/new_gold/txt/xx00.txt

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions