logofirst
logofirst GitHub

HTML Documents NER

Perform named entity recognition for HTML documents.

Run

label-studio init html_document_project
label-studio start html_document_project

After starting Label Studio, set up the labeling interface and browse to this template.

Config

<View>
  <Labels name="ner" toName="text">
    <Label value="Person"></Label>
    <Label value="Organization"></Label>
  </Labels>
  <HyperText name="text" value="$text"></HyperText>
</View>