- About the Project
- How we did it
About the Project
Margaret Cavendish (1623-1673) wrote numerous works of philosophy, plays, and poetry, as well as a science fiction work, an autobiography and a biography of her husband. While many of her works are available online, her 1663 edition of Philosophical and Physical Opinions has not yet had an open access and easily searchable edition until now. The work is important in that is it a substantial revision of her 1655 edition of Philosophical and Physical Opinions. The 1655 edition was a mere 174 pages, while the revised and expanded 1663 edition comes in at 510 pages including the unpaginated prefaces and index.
Given the growing interest in Cavendish’s philosophical works by early modern scholars, it seemed it was time to make this work more widely available. It was clear that we needed to hand transcribe the work, but its length was too much for a small group to undertake. Thus, we decided to get help from the community by hosting a transcribe-a-thon on September 27, 2019. With support from KU Institute for Research in the Digital Humanities, KU Libraries, and the KU Philosophy Department, as well as over 70 transcribers from around the world, the text was completely transcribed by October 4th, 2019.
In transcribing the text, we decided to ignore most of the elements that were due to features of the printing of the book (such as running headers and hyphenations at the margins, catch words were ignored except when hyphenated), but the one feature of the book we did maintain was the pagination as this is valuable to scholars and students. We maintained the features of the text as written by Cavendish in that we followed Cavendish’s spelling, capitalization, and punctuation. We added notes to document Cavendish’s corrections in the text as noted in the Errata.
If you have questions or comments about this project, please contact Marcy Lascano at firstname.lastname@example.org.
How we did it
In developing this project we were guided by principles of minimal computing to create an electronic edition of the text that would be:
- lightweight (suitable for reading, downloading, and sharing);
- flexible (designed to be built upon with annotations and bibliographic enancements); and
- durable (easy to maintain with minimal risk of broken links or functonality).
It also required that we do this with no budget.
Transforming the text from the scanned page of the 1663 edition (available in the Early English Books Online (EEBO) database) to a minimal full text edition involved two general stages of the project.
The first step was to create editable, electronic text from the scanned page images. Optical character recognition (OCR) was not a viable approach due to the layout, structure and font of the original publication availabe in EEBO; it would have required too much reformatting and cleanup. Transcribing the text from scratch was the way to go, but at over 500 pages it was too long for a single person or small group to achieve in a reasonable time period.
We decided to enlist the help of collaborators, both local and around the world, to transcribe the text online. Inspired by such crowdsouring transcription services as From the Page, Brian Rosenblum of the IDRH built a simple website to facilitate this process. Participants could select and “check out” a page and type a plain-text transcription following a set of instructions on how to capture and represent abbreviations, chapter headings, page numbers, ligatures, catch words, unclear text, and other elements in the book. On September 27, 2019 we launched the Cavendish Transcribe-a-Thon, open to participants locally at the University of Kansas and online to anyone who wanted to participate.
About 15 people participated in person over the course the day, sustained by coffee and donuts, and several dozen participants joined online, from Australia to the U.K. to California. Within a week the text was completely transcribed, with contributions by over 70 participants from around the world. All transcribers are listed below.
Editing and Ed.
The second phase of the project included editing the transcriptions and publishing the text online. Using a simple text editor, Marcy Lascano worked through the text page by page to clean up the transcriptions, double checking against the original and resolving inconsistencies in formatting among the contributions of the 70 transcribers. She also added some basic structural formatting using markdown, a simple language for structuring plain text documents.
Meanwhile, Brian Rosenblum built the website using the Ed publishing platform. Ed is designed specifically for minimal scholarly or reading editions and contains templates for publishing prose, poetry and drama in elegant, flexible formats meant to be easy to maintain and share. Ed itself is built on Jekyll, a static website generator that turns our markdown-structured text into static HTML pages that can be published directly online without the need for a database-dependent back end, or for the often complex set of processes needed to create and maintain other markup-based digital editions.
Conceived and created by Alex Gil at Columbia University, Ed is designed to be a low-cost but highly functional platform that enables the kind of text recovery work undertaken here in the Cavendish project.
Plain text, github project, and digital tools
The markdown-structured plain text, and the entire Ed/Jekyll projects files, are available on the digital tools page of this website, along with some links to web-based tools for visualization, analysis, and annotation.
This project was made possible by the generous support of the University of Kansas Philosophy Department and Institute for Research in the Digital Humanities. Special thanks goes to Brian Rosenblum for setting up the transcription site and the final Ed. site, as well as providing other technical guidance and expertise. In addition, a debt of gratitude goes to Rachel Henderson, the project research assistant, who worked tirelessly on various aspects of the project, and to David Tamez in IRDH who helped with the transcribe-a-thon. A special thanks goes to Jon Shaheen for re-starting the Cavendish Reading Group in the Spring of 2020 and for passing along a weekly typo list. Finally, thank you to all the transcribers who put in their time to help make this text available:
Christopher P. Noble
Jonathan L. Shaheen
Jonathan P. Lamb
Larry M. Jorgensen
Mary Ann Baker
Michael Bennett McNulty
Michael J Trout
Renata Martins Prado Matos Augusto
Saron T. Tran
William A. B. Parkhurst
And thanks to those who transcribed anonymously as well!