Graph – analysing text reuse in rare books

Analysing text reuse is a task that is of particular relevance to architectural history. The platform Graph automates this process by applying large-scale textual processing for detecting similar text passages to a large corpus of digitised books.

The Plattform Graph, a collaboration between the ETH Library and the Chair of the History and Theory of Architecture at the Institute for the History and Theory of Architecture, visualises patterns of text reuse across a corpus of works to create a map outlining their various connections.

Pinpoint the origin of certain text passages with Graph

Tracing the relationships between books by finding out which sections of text they share helps us understand which books authors were referencing when writing their own texts and enables us to trace the genealogy, dissemination and transformation of certain ideas across various publications, both temporally and geographically. This means we are able to pinpoint the origin of certain formulations or constellations of words within the mass of printed material. Mapping the reuse of texts roots the circulation of ideas in material objects, each with its own history.

By playing the video you accept the privacy policy of YouTube.Learn more OK

Advantages of text reuse for architectural history

Text reuse is particularly relevant to architectural history, with a great number of texts being rooted in Vitruvius’ De architectura, the only remaining architecture treatise from Antiquity. From the fifteenth century up to the present day, architects and authors have continuously been editing, translating, citing, commenting on, and adapting this key work. This growing body of texts itself also provides reference material for subsequent works. Exploring the connections within this body of texts may help to generate new insights into how ideas surrounding architecture were developed, and, perhaps more importantly, could raise new questions about the provenance, dissemination and transformation of architectural thought.

How to use the Graph platform

The extracted passages can be used in various ways:

  • Find correlations between books in the graphical visualisation, which can be arranged chronologically and makes it possible to quickly identify works containing many passages that have been reused.
  • Filter or scroll through the full list of books and find out which other works they are related to, enabling you to switch between wider and more local contexts.
  • Use the comparator view for a side-by-side reading of text passages extracted from two different works.
  • New: Use the ‘overview’ to see how passages taken from one book are distributed in another.

The current dataset is based on roughly 1,300 rare architecture books from the holdings of the ETH Library. A total of 800,000 pages can be viewed on external page e-rara, the platform for digitised rare books from Swiss institution.

More than just data – Benefit from know-how

#KnowMore – The prepared content issued by the ETH Library is at your free disposal and gives you a head start.

JavaScript has been disabled in your browser