Multi-task learning approaches have shown significant improvements in different fields by training different related tasks simultaneously. The multi-task model learns common features among different tasks where they share some layers. However, it is observed that the multi-task learning approach can suffer performance degradation with respect to single task learning in some of the natural language processing tasks, specifically in sequence labelling problems. To tackle this limitation we formulate a simple but effective approach that combines multi-task learning with transfer learning. We use a simple model that comprises of bidirectional long-short term memory and conditional random field. With this simple model, we are able to achieve better F1-score compared to our single task and the multi-task models as well as state-of-the-art multi-task models.

Combining multi-task learning with transfer learning for biomedical named entity recognition

Mehmood T.;Gerevini A. E.;Serina I.
2020-01-01

Abstract

Multi-task learning approaches have shown significant improvements in different fields by training different related tasks simultaneously. The multi-task model learns common features among different tasks where they share some layers. However, it is observed that the multi-task learning approach can suffer performance degradation with respect to single task learning in some of the natural language processing tasks, specifically in sequence labelling problems. To tackle this limitation we formulate a simple but effective approach that combines multi-task learning with transfer learning. We use a simple model that comprises of bidirectional long-short term memory and conditional random field. With this simple model, we are able to achieve better F1-score compared to our single task and the multi-task models as well as state-of-the-art multi-task models.
2020
Procedia Computer Science
Ateneo di appartenenza
PE6_2 Computer systems, parallel/distributed systems, sensor networks, embedded systems, cyber-physical systems
PE6_9 Human computer interaction and interface, visualization and natural language processing
Esperti anonimi
Inglese
24th KES International Conference on Knowledge-Based and Intelligent Information and Engineering Systems, KES 2020
2020
Internazionale
ELETTRONICO
176
848
857
10
Elsevier B.V.
Biomedical Named Entity Recognition; Deep Learning; Long Short-Term Memory; Multi-task Learning; Transfer Learning
no
none
Mehmood, T.; Gerevini, A. E.; Lavelli, A.; Serina, I.
273
info:eu-repo/semantics/conferenceObject
4
4 Contributo in Atti di Convegno (Proceeding)::4.1 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11379/535657
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 27
  • ???jsp.display-item.citation.isi??? ND
social impact