Date: 2021
Type: Contribution to book
A corpus for multilingual analysis of online terms of service
DRAZEWSKI, Kasper
; GALASSI, Andrea; JABŁONOWSKA, Agnieszka
; LAGIOIA, Francesca
; LIPPI, Marco
; MICKLITZ, Hans-Wolfgang
; SARTOR, Giovanni
; TAGIURI, Giacomo
; TORRONI, Paolo












Proceedings of the Natural Legal Language Processing Workshop 2021 [conference proceedings], Stroudsburg : Association for Computational Linguistics, 2021, pp. 1-8
DRAZEWSKI, Kasper, GALASSI, Andrea, JABŁONOWSKA, Agnieszka, LAGIOIA, Francesca, LIPPI, Marco, MICKLITZ, Hans-Wolfgang, SARTOR, Giovanni, TAGIURI, Giacomo, TORRONI, Paolo, A corpus for multilingual analysis of online terms of service, in Proceedings of the Natural Legal Language Processing Workshop 2021 [conference proceedings], Stroudsburg : Association for Computational Linguistics, 2021, pp. 1-8
- https://hdl.handle.net/1814/74836
Retrieved from Cadmus, EUI Research Repository
We present the first annotated corpus for multilingual analysis of potentially unfair clauses in online Terms of Service. The data set comprises a total of 100 contracts, obtained from 25 documents annotated in four different languages: English, German, Italian, and Polish. For each contract, potentially unfair clauses for the consumer are annotated, for nine different unfairness categories. We show how a simple yet efficient annotation projection technique based on sentence embeddings could be used to automatically transfer annotations across languages.
Cadmus permanent link: https://hdl.handle.net/1814/74836
Full-text via DOI: 10.18653/v1/2021.nllp-1.1
ISBN: 9781954085985
Publisher: Association for Computational Linguistics
Files associated with this item
- Name:
- A_corpus_for_multilingual_anal ...
- Size:
- 170.4Kb
- Format:
- Description:
- Full-text in Open Access