Graph – analysing text reuse in rare books

Analysing text reuse is particularly relevant to architectural history. The new platform Graph automates this task by applying large-scale text processing to a large corpus of digitised architecture books in order to detect similar passages.

The Graph platform, a collaboration between the ETH Library and the Chair of the History and Theory of Architecture at the Institute for the History and Theory of Architecture, visualises patterns of text reuse across a corpus of works to create a map outlining their various connections.

Pinpoint the origin of certain text passages with Graph

Finding out which sections of text books share helps us understand which works authors were referencing when writing their own texts. It also enables us to trace the genealogy, dissemination and transformation of certain ideas across publications, both over time and geographically. This means we are able to pinpoint the origin of certain formulations or constellations of words within the mass of printed material. Mapping the reuse of texts roots the circulation of ideas in material objects, each with its own history.
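To make the idea of shared sections of text concrete, here is a minimal sketch in Python. It is not the method used by Graph, which this article does not describe. It assumes a simple and widely used technique, comparing overlapping word sequences ("shingles") between two texts. The function names, the window size n and the sample sentences are all hypothetical and chosen purely for illustration.

```python
import re


def shingles(text, n=4):
    """Return the set of overlapping word n-grams (shingles) in a text."""
    # Lower-case and keep only word characters so that punctuation and
    # capitalisation do not hide a shared passage.
    words = re.findall(r"\w+", text.lower())
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def shared_passages(text_a, text_b, n=4):
    """Return the word sequences of length n that occur in both texts."""
    common = shingles(text_a, n) & shingles(text_b, n)
    return sorted(" ".join(s) for s in common)


def reuse_score(text_a, text_b, n=4):
    """Jaccard similarity of the two shingle sets (0.0 means no overlap)."""
    a, b = shingles(text_a, n), shingles(text_b, n)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


# Hypothetical sample sentences, invented for this illustration.
book_a = ("Architecture depends on order, arrangement, eurythmy, "
          "symmetry, propriety and economy.")
book_b = ("As the ancients wrote, architecture depends on order, "
          "arrangement, eurythmy and proportion.")

for passage in shared_passages(book_a, book_b):
    print(passage)
print(round(reuse_score(book_a, book_b), 3))
```

A real corpus would also have to cope with OCR errors, historical spelling variants and translations, none of which this sketch handles.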

Advantages of text reuse for architectural history

Text reuse is particularly relevant to architectural history because a great number of texts are rooted in Vitruvius’ De architectura, the only architectural treatise to survive from Antiquity. From the fifteenth century up to the present day, architects and authors have continuously edited, translated, cited, commented on and adapted this key work. This growing body of texts has in turn provided reference material for subsequent works. Exploring the connections within it may help to generate new insights into how ideas about architecture developed and, perhaps more importantly, could raise new questions about the provenance, dissemination and transformation of architectural thought.


How to use the Graph platform

The extracted passages can be used in three different ways:

  • Find correlations between books in the graphical visualisation, which can be arranged chronologically and lets you quickly identify works that contain many reused passages.
  • Filter or scroll through the full list of books and find out which other works they are related to, enabling you to switch between wider and more local contexts.
  • Use the comparator view for a side-by-side reading of text passages extracted from two different works.

The current dataset is based on roughly 1,000 rare architecture books from the holdings of the ETH Library. A total of 500,000 pages can be viewed on e-rara, the platform for digitised rare books from Swiss institutions.

