New developments in deep learning for natural language processing and explanation

Simeri, Andrea; Tagarelli, Andrea; Fortino, Giancarlo

New developments in deep learning for natural language processing and explanation

dc.contributor.author	Simeri, Andrea
dc.contributor.author	Tagarelli, Andrea
dc.contributor.author	Fortino, Giancarlo
dc.date.accessioned	2026-04-09T09:48:53Z
dc.date.issued	2024-01
dc.description	Università della Calabria Dipartimento di Ingegneria Informatica, Modellistica, Elettronica e Sistemistica (DIMES). Dottorato di ricerca in INFORMATION AND COMMUNICATION TECHNOLOGIES. Ciclo XXXVI
dc.description.abstract	This PhD thesis introduces LamBERTa, a pioneering BERT-based language understanding framework tailored for law article retrieval as a prediction task. LamBERTa addresses the challenges posed by a multi-class classification scenario featuring hundreds or thousands of classes and minimal per-class training instances, generated in an unsupervised manner. The study explores domain-adaptation coupled with task-adaptation to train LamBERTa models for the Italian Civil Code (ICC) article prediction, incorporating a recently defined Italian legal BERT and investigating the impact of enhancing the tokenizer with terms from the target legal corpus. The work’s contributions extend to the exploration of LamBERTa models’ explainability, employing model-level and instance-level explainability techniques. While initially focused on the Italian Civil Code, the LamBERTa architecture proves adaptable to other law code corpora such as the General Data Protection Regulation (GDPR), further powered by selfsupervised task-adaptive pre-training stages, with or without data enrichment based on recitals, marking the first comprehensive study of domain-adaptive and task-adaptive legal pre-trained language models for GDPR article retrieval. Furthermore, the thesis conducts a network analysis and mining study, presenting the first examination of citation networks inferred from ICC articles. Insights into structural features reveal hidden patterns, facilitating new interpretations and studies of the ICC. This methodology is adaptable to other civil law code systems organized similarly to the ICC, supporting tasks like statute law retrieval, entailment, and question answering. The study also introduces LawNet-Viz, a web-based tool for modeling, analyzing, and visualizing law reference networks, aiming to assist legal experts and citizens in legal information processing and understanding. Finally, the thesis outlines the TrustSearch project’s ambitious goal to develop an AI tool that enhances the search experience promoting critical thinking. Two stance detection approaches, Definition-based and User-driven, are presented, showcasing the effectiveness of AI in transforming the search and data discovery experience. In particular, Chapter 1 introduces the motivations behind this research work, then it overviews recent works that address legal classification and retrieval problems based on deep learning methods and network analysis techniques as well as entailment resolution using Transformers and LLMs. Chapter 2 provides the reader with essential information about the research topic and it serves to establish the context around The Italian Civil Code (ICC) and The General Data Protection Regulation (GDPR). Chapter 3 presents our proposed framework for the civil-law article retrieval problem named as LamBERTa with the application on the ICC corpus. Within it, we investigate the performance of the models through an extensive quantitative and qualitative set of experiments. Furthermore, we study the effects of domain adaptive pre-training strategies and task adaptive training strategies and we scrutinize the concept of Explainability leveraging on the use of different techniques. Chapter 4 presents the LamBERTa models applied on the General Data Protection Regulation (GDPR) corpus, also powering them through self-supervised task-adaptive pre-training schemes. Chapter 5 explores the Italian Civil Code (ICC) from an unprecedented perspective based on network analysis. Furthermore, we present LawNet-Viz, a web-based tool for the modeling, analysis and visualization of law reference networks extracted from a statute law corpus. Chapter 6 is based on the research I carried out during the PhD visiting period abroad at (UPV) University Polytechnic of Valencia, Spain, where I actively participated in the TrustSearch project, a research project funded by the European Union, developing an AI tool that offers a new search experience to promote critical thinking. In this regard, we developed two different approaches, the first one which is based on the state of art LLMs prompt engineer, the second one consists is an application of an encoder-decoder Transformer-based model for the stance detection leveraging on the entailment task. Chapter 7 summarizes findings and discusses future studies which could extend the research.
dc.identifier.uri	http://hdl.handle.net/10955/5754
dc.language.iso	en
dc.publisher	Università della Calabria
dc.relation.ispartofseries	FIS/02
dc.subject	nlp
dc.subject	transformers
dc.subject	llm
dc.subject	legal ai
dc.subject	question answering
dc.title	New developments in deep learning for natural language processing and explanation
dc.type	Thesis

Files

Original bundle

Now showing 1 - 1 of 1

Name:: PhD_Thesis_Andrea_Simeri_signed_signed_signed.pdf
Size:: 7.72 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 16 B
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Dipartimento di Ingegneria Informatica, Modellistica, Elettronica e Sistemistica - Tesi di Dottorato