eTRACES
This project has ended.
Whether in science or in everyday life, our language carries numerous traces of our cultural legacy in the form of winged words and quotations. Scientists have now created new software tools that make this cultural legacy accessible in digital libraries. With the help of these programs, the origin and dissemination of text passages, quotes and common phrases can be reconstructed quickly and easily.
The main focus of the “eTRACES” project lies on the temporal traces and interconnections of text passages in German-language novels written between 1500 and 1900, as well as in social science texts created since 1909. The project partners are the chair for Natural Language Processing at the University of Leipzig (ASV), the Göttingen Centre for Digital Humanities (GCDH), and GESIS – Leibniz-Institute for the Social Sciences in Bonn.
Funding of about €1.2 million is granted by the Federal Ministry of Education and Research (BMBF), covering a period of three years. “The cooperation of computer science experts with the specialists from the humanities and the social sciences bears great potential for the advancement of all three disciplines,” explains State Secretary Cornelia Quennet-Thielen from the BMBF. She further emphasized that eTRACES is an exemplary implementation of the “Recommendations on Research Infrastructure for the Humanities and Social Sciences” recently issued by the science council.
Building on the latest text mining methods, new approaches are to be developed and tested for distinguishing the intentional re-use of a text passage from its use as a common text block or boilerplate. Further attention is given to analyzing and visualizing the geographical, temporal and semantic cross-linking of citations.
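To make the distinction between re-use and boilerplate more concrete, the following Python sketch shows one simple way it could be approached. It is not the method used in eTRACES: the function names, the word trigram size and the `boilerplate_share` threshold are all assumptions chosen for illustration. The idea is that a phrase shared by a few documents is a candidate for intentional re-use, while a phrase found in a large share of the corpus is more likely a stock text block.

```python
from collections import defaultdict


def word_ngrams(text, n=3):
    """Return the set of lowercase word n-grams contained in a text."""
    words = text.lower().split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def classify_shared_ngrams(docs, n=3, boilerplate_share=0.5):
    """Split n-grams shared by at least two documents into two groups.

    docs maps a document id to its text. An n-gram that occurs in more
    than `boilerplate_share` of all documents is treated as a boilerplate
    candidate; any other shared n-gram is treated as a re-use candidate.
    The threshold is an illustrative assumption, not a project value.
    """
    doc_ids_by_ngram = defaultdict(set)
    for doc_id, text in docs.items():
        for gram in word_ngrams(text, n):
            doc_ids_by_ngram[gram].add(doc_id)

    reuse, boilerplate = {}, {}
    for gram, ids in doc_ids_by_ngram.items():
        if len(ids) < 2:
            continue  # not shared, so neither re-use nor boilerplate
        share = len(ids) / len(docs)
        target = boilerplate if share > boilerplate_share else reuse
        target[gram] = sorted(ids)
    return reuse, boilerplate
```

Calling `classify_shared_ngrams` on a small dictionary of texts returns two dictionaries that map each shared trigram to the sorted ids of the documents containing it. A realistic system would additionally need normalization for historical spelling variants, which this sketch leaves out.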
In the literary studies application (partner Göttingen), the central question is which practices of text passage re-use shaped the history of German novels. The initial subject of research is the Luther Bible. The pivotal question for the partner GESIS is how qualitative and quantitative social research can be differentiated on a textual basis. The computer science application (ASV Leipzig) uses the information on text re-use to build a search engine that also considers the citation frequency of a text or text fragment when determining its relevance.
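As a rough illustration of a search engine that takes citation frequency into account, the Python sketch below boosts a basic term-overlap score by how often a passage has been re-used. It is only a minimal sketch under stated assumptions: `rank_passages`, the `reuse_counts` input, the logarithmic boost and the `weight` parameter are hypothetical and do not describe the ranking actually built at ASV Leipzig.

```python
import math


def rank_passages(query, passages, reuse_counts, weight=1.0):
    """Rank passage ids by term overlap, boosted by re-use frequency.

    passages maps a passage id to its text; reuse_counts maps a passage id
    to the number of times it was found re-used elsewhere (assumed to come
    from a separate text re-use analysis). The logarithm dampens the boost
    so that very frequently quoted passages do not dominate every query.
    """
    query_terms = set(query.lower().split())
    scored = []
    for pid, text in passages.items():
        overlap = len(query_terms & set(text.lower().split()))
        if overlap == 0:
            continue  # passage does not match the query at all
        boost = 1.0 + weight * math.log1p(reuse_counts.get(pid, 0))
        scored.append((overlap * boost, pid))
    # Highest score first; ties are broken by passage id for stable output.
    return [pid for score, pid in sorted(scored, key=lambda item: (-item[0], item[1]))]
```

With equal term overlap, a passage that has been quoted more often ranks above one that has not. Setting `weight=0` switches the boost off and falls back to plain term overlap.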
Project Data
Project execution organisation: DLR (project execution organisation for the BMBF)
Project number: 01UA1101A (ASV), 01UA1101B (GCDH), 01UA1101C (GESIS)
Partners: Göttingen Centre for Digital Humanities (GCDH); GESIS – Leibniz-Institute for the Social Sciences, Bonn
Time frame: 2011-07-01 to 2014-06-30
Contact: Marco Büchler
Publications
- BCMBH2011 - One Step Closer To Paraphrase Detection On Historical Texts: About The Quality of Text Re-use Techniques and the Ability to Learn Paradigmatic Relations
- BCMB2012 - Increasing Recall for Text Re-use in Historical Documents to Support Research in the Humanities
- BGBE2012 - Measuring the Influence of a Work by Text Re-Use
- BGH2012 - Historical Relevance Feedback Detection by Text Re-use Mining
- BPW2012 - Transmission of Greek Gnomologia in Syriac and Arabic: The Corpus of Arabic and Syriac Gnomologia
- BBEP2011 - Collecting Ancient Greek, Arabic, and Syriac Sapiential Statements - The Gnomology Database of Arabic and Syriac
- BBE2010 - Discovering Latent Relations of Concepts by Graph Mining Approaches on Ancient Greek Texts
- HB2010 - Some Challenges Posed to Computer Science by the eHumanities
- BH2010 - Salton and Wittgenstein in the Humanities: About Semantics in Philosophical Texts
