Building a Production-Ready Multi-Label Classifier for Legal Documents with Digital-Twin-Distiller

Csányi Gergely Márk; Vági Renátó; Nagy Dániel; Üveges István; Vadász János Pál; Megyeri Andrea; Orosz Tamás

DOI : 10.3390/app12031470

URI : http://hdl.handle.net/20.500.12944/17573

MTMT : 32634154

Publication date : 2022

Journal title : Applied Sciences (Switzerland)

Volume : 12

Issue : 3

Document type : journal article

Keywords : Digital twin, Legal document classification, Legaltech, Multi-label classification, Multi-label documents, Social sciences, Social sciences/Political and legal sciences

Abstract :

One of the most time-consuming parts of an attorney’s job is finding similar legal cases. Categorizing legal documents by subject matter can significantly increase the discoverability of digitized court decisions. This is a multi-label classification problem, in which each relatively long text can belong to more than one legal category. This paper presents a solution in which the multi-label classification problem is decomposed into more than a hundred binary classification problems. Several approaches were tested, including different machine-learning and text-augmentation techniques, to produce a practically applicable model. The proposed models and methodologies were encapsulated and deployed as a digital twin in a production environment. The performance of the resulting machine-learning-based application matches, and could even exceed, human experts’ performance on this monotonous and labor-intensive task. It could increase the e-discoverability of the documents by about 50%.

Files in this item

Name: Building_a_Production_Ready.pdf

Size: 480.3Kb

Format: PDF

Description: journal article