HAL feed: HAL L3i
https://hal.archives-ouvertes.fr/rss.php?lab=EA2118
Articles
[cea-04363097] The importance of character-level information in an event detection model
24 December 2023, by Emanuela Boros
This paper tackles the task of event detection that aims at identifying and categorizing event mentions in texts. One of the difficulties of this task is the problem of event mentions corresponding to misspelled, custom, or out-of-vocabulary words. To analyze the impact of character-level (...)
[hal-04351020] Can Cross-domain Term Extraction Benefit from Cross-lingual Transfer?
18 December 2023, by Tran Thi Hong Hanh
Automatic term extraction (ATE) is a natural language processing task that eases the effort of manually identifying terms from domain-specific corpora by providing a list of candidate terms. In this paper, we experiment with XLM-RoBERTa to evaluate the abilities of cross-lingual and (...)
[hal-04350988] Experimenting with Unsupervised Multilingual Event Detection in Historical Newspapers
18 December 2023, by Emanuela Boros
To prevent historical knowledge's fading, research in event detection could facilitate access to digitized collections. In this paper, we propose a method for annotating multilingual historical documents for event detection in an unsupervised manner by leveraging entities and semantic notions (...)
[hal-03026933] Linking Named Entities across Languages using Multilingual Word Embeddings
18 December 2023, by Elvys Linhares Pontes
Digital libraries are online collections of digital objects that can include text, images, audio, or videos in several languages. It has long been observed that named entities (NEs) are key to the access to digital library portals as they are contained in most user queries. However, NEs can (...)
[hal-02518252] Post-OCR Error Detection by Generating Plausible Candidates
18 December 2023, by Thi-Tuyet-Hai Nguyen
[...]