The I.Sicily Sketch Engine Corpus
The dataset consists of the 723 early imperial (1 BC to AD 401) funerary and honorific inscriptions in Greek, Latin, and Hebrew from the I.Sicily database. The dataset is stored on Zenodo in the form of two .conllu and two .vert files, one for the (primarily) Latin and one for the (primarily) Greek...
Saved in:
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Ubiquity Press
2024-12-01
|
Series: | Journal of Open Humanities Data |
Subjects: | |
Online Access: | https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/258 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | The dataset consists of the 723 early imperial (1 BC to AD 401) funerary and honorific inscriptions in Greek, Latin, and Hebrew from the I.Sicily database. The dataset is stored on Zenodo in the form of two .conllu and two .vert files, one for the (primarily) Latin and one for the (primarily) Greek inscriptions, containing the tokenized, lemmatized, part-of-speech tagged, and verticalized inscription texts with their ISicXXXXXX identifiers. The Python scripts used to prepare the dataset and the modified corpus configuration file to implement the .vert files into Sketch Engine are provided for reuse and development. |
---|---|
ISSN: | 2059-481X |