Digital Humanities and Scholarship Open access Peer reviewed

The Latin Text Archive. A Platform for Historical Semantics and Text Mining

Tim Geelhaar

Umanistica Digitale | May 21, 2026

Abstract

Abstract

The Latin Text Archive (LTA) is an online platform hosted by the Berlin-Brandenburg Academy of Sciences (BBAW) since 2020 (https://LTA.bbaw.de). Its primary objective is to facilitate computer-assisted semantic analysis of Latin texts and corpora spanning various epochs and genres. The LTA collaborates with prominent text providers and related projects in this field. Its core activities center on post-philological editorial text preparation, which is essential for implementing text mining techniques in corpus-based historical semantics. The archive lemmatizes and stores Latin texts, augments them with relevant metadata, and organizes them within thematic or genre-specific corpora. These texts can be also read online and downloaded in various formats. Currently in a beta version, the LTA offers already 12,960 curated texts authored by 1,280 identified individuals, amounting to 54 million words. Furthermore, the LTA supplies access to its morphological lexicon, which supports the lemmatization process. Through the 'Latin Universe', users may also access additional texts not yet fully curated. Both texts and corpora are searchable via third-party tools such as 'Voyant-Tools' or through integrated functionalities like the 'Time series query' — which allows for diachronic comparison of keywords and lemmas — and 'Diacollo', which analyses co-occurring lemmas over time.

Direct answer

What can I do from this paper page?

Use this page to scan "The Latin Text Archive. A Platform for Historical Semantics and Text Mining" quickly: start with the summary and abstract, then check the authors, source, topics, and related papers. From here, open Scollr to follow Digital Humanities and Scholarship research, save the paper, or map adjacent work.

Authors

Researchers on this paper

Tim Geelhaar

first | Goethe University Frankfurt | ORCID 0000-0002-7653-5859

Research areas

Follow related topics

Citation

BibTeX

@article{Geelhaar2026Latin,
  title = {The Latin Text Archive. A Platform for Historical Semantics and Text Mining},
  author = {Tim Geelhaar},
  journal = {Umanistica Digitale},
  year = {2026},
  doi = {10.60923/issn.2532-8816/23548},
  url = {https://doi.org/10.60923/issn.2532-8816/23548}
}

FAQ

Using this paper in a discovery workflow

How do I find related work for this paper?

Use the related papers and topic links on this page as starting points. In Scollr, you can also open the paper and build a literature map around its references, citing papers, and related work.

How can I keep up with new Digital Humanities and Scholarship research papers?

Follow Digital Humanities and Scholarship research in Scollr. New papers from the topic flow into a personalized feed, and you can save useful studies to revisit later.

Can I cite this paper from this page?

This page includes a static BibTeX block for The Latin Text Archive. A Platform for Historical Semantics and Text Mining. Always verify the DOI, source, and publication details against the publisher record before submitting a manuscript.

Follow this research in Scollr

Follow the topics and authors behind this paper, save useful studies, and build a literature map when you are ready to go deeper.

Get the app