Background: Emergency room reports pose specific challenges to natural language processing techniques. In this setting, violence episodes on women, elderly and children are often under-reported. Categorizing textual descriptions as containing violence-related injuries (V) vs. non-violence-related injuries (NV) is thus a relevant task to the ends of devising alerting mechanisms to track (and prevent) violence episodes. Methods: We present VIDES (so dubbed after VIOLENCE DETECTION SYSTEM), a system to detect episodes of violence from narrative texts in emergency room reports. It employs a deep neural network for categorizing textual ER reports data, and complements such output by making explicit which elements corroborate the interpretation of the record as reporting about violence-related injuries. To these ends we designed a novel hybrid technique for filling semantic frames that employs distributed representations of terms herein, along with syntactic and semantic information. The system has been validated on real data annotated with two sorts of information: about the presence vs. absence of violence-related injuries, and about some semantic roles that can be interpreted as major cues for violent episodes, such as the agent that committed violence, the victim, the body district involved, etc.. The employed dataset contains over 150K records annotated with class (V,NV) information, and 200 records with finer-grained information on the aforementioned semantic roles. Results: WeuseddatacomingfromanItalianbranchoftheEU-InjuryDatabase(EU-IDB)project,compiledby hospital staff. Categorization figures approach full precision and recall for negative cases and .97 precision and .94 recall on positive cases. As regards as the recognition of semantic roles, we recorded an accuracy varying from .28 to .90 according to the semantic roles involved. Moreover, the system allowed unveiling annotation errors committed by hospital staff. Conclusions: Explainingsystems’results,sotomaketheiroutputmorecomprehensibleandconvincing,istoday necessary for AI systems. Our proposal is to combine distributed and symbolic (frame-like) representations as a possible answer to such pressing request for interpretability. Although presently focused on the medical domain, the proposed methodology is general and, in principle, it can be extended to further application areas and categorization tasks.

Violence detection explanation via semantic roles embeddings

Enrico Mensa;Davide Colla;Daniele P. Radicioni
2020-01-01

Abstract

Background: Emergency room reports pose specific challenges to natural language processing techniques. In this setting, violence episodes on women, elderly and children are often under-reported. Categorizing textual descriptions as containing violence-related injuries (V) vs. non-violence-related injuries (NV) is thus a relevant task to the ends of devising alerting mechanisms to track (and prevent) violence episodes. Methods: We present VIDES (so dubbed after VIOLENCE DETECTION SYSTEM), a system to detect episodes of violence from narrative texts in emergency room reports. It employs a deep neural network for categorizing textual ER reports data, and complements such output by making explicit which elements corroborate the interpretation of the record as reporting about violence-related injuries. To these ends we designed a novel hybrid technique for filling semantic frames that employs distributed representations of terms herein, along with syntactic and semantic information. The system has been validated on real data annotated with two sorts of information: about the presence vs. absence of violence-related injuries, and about some semantic roles that can be interpreted as major cues for violent episodes, such as the agent that committed violence, the victim, the body district involved, etc.. The employed dataset contains over 150K records annotated with class (V,NV) information, and 200 records with finer-grained information on the aforementioned semantic roles. Results: WeuseddatacomingfromanItalianbranchoftheEU-InjuryDatabase(EU-IDB)project,compiledby hospital staff. Categorization figures approach full precision and recall for negative cases and .97 precision and .94 recall on positive cases. As regards as the recognition of semantic roles, we recorded an accuracy varying from .28 to .90 according to the semantic roles involved. Moreover, the system allowed unveiling annotation errors committed by hospital staff. Conclusions: Explainingsystems’results,sotomaketheiroutputmorecomprehensibleandconvincing,istoday necessary for AI systems. Our proposal is to combine distributed and symbolic (frame-like) representations as a possible answer to such pressing request for interpretability. Although presently focused on the medical domain, the proposed methodology is general and, in principle, it can be extended to further application areas and categorization tasks.
2020
20
1
263
275
https://link.springer.com/article/10.1186/s12911-020-01237-4
XAI,Explanation,Textcategorization,Categorizationexplanation,Wordembeddings,Semanticframes, Slot filling, Event extraction, Violent event tracking
Enrico Mensa, Davide Colla, Marco Dalmasso, Marco Giustini, Carlo Mamo, Alessio Pitidis, Daniele P. Radicioni
File in questo prodotto:
File Dimensione Formato  
Mensa2020_Article_ViolenceDetectionExplanationVi.pdf

Accesso aperto

Descrizione: pdf ottenibile all'URL https://link.springer.com/article/10.1186/s12911-020-01237-4
Tipo di file: PDF EDITORIALE
Dimensione 1.28 MB
Formato Adobe PDF
1.28 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1762844
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 9
  • ???jsp.display-item.citation.isi??? 4
social impact