Transformer-Based Multilabel NER Using Wikipedia Corpora in Multiple Languages

wikidata 2025-05-20

Summary:

The high cost of manual data labeling and privacy concerns result in a considerable dearth of medical annotations in non-English texts. Recent work by Frei and Kramer [1] introduces an unsupervised approach for constructing ontology-annotated corpora from Wikipedia (https://www.wikidata.org) for German medical NER. We evaluate the proposed approach across English, German, Spanish, and French for medication and diagnosis entity recognition. Our multilabel corpora yield notable improvements in...

Link:

https://pubmed.ncbi.nlm.nih.gov/40380596/?utm_source=Other&utm_medium=rss&utm_campaign=pubmed-2&utm_content=1VSjW0JqT_vVo4exSnaEa8DS8viTn4bOW9m_0JY8UcVGX5Esjj&fc=20220129234853&ff=20250520152136&v=2.18.0.post9+e462414

From feeds:

📚BioDBS Bibliography » wikidata

Tags:

Authors:

Yelyzaveta Ahapova, Johann Frei, Frank Kramer

Date tagged:

05/20/2025, 15:21

Date published:

05/17/2025, 06:00