Data Paper and Normalized Datasets for Hahn’s and Zahn’s Reconstructions of Marcion’s Gospel Published Today in the Journal of Open Humanities Data

The DOI for this data paper is: https://doi.org/10.5334/johd.63. This makes the third data paper and batch of normalized datasets of major reconstructions of Marcion’s Gospel now published in JOHD, following after the papers and datasets based on the Harnack and Roth reconstructions. The fourth (based on Klinghardt’s and Nicolotti’s recent reconstructions) has been accepted, cleared for copyright, and typeset, so it should be published soon. Those four papers and datasets together represent all major Greek reconstructions of Marcion’s Gospel published thus far, comprising a new historical computational linguistic corpus of Postclassical Greek that contains 57241 tokens altogether.

We are also in talks with Jason BeDuhn about creating a Greek reconstruction derived from his 2013 English reconstruction of Marcion’s Gospel. In my view, BeDuhn’s reconstruction is the closest to the actual contents of Marcion’s Gospel in size and content, so a corresponding Greek edition and related normalized datasets would make for a very significant contribution to scholarship. We are also in talks with various international institutions about a collaborative project focused on the restoration of Marcion’s Gospel and resolution of the Synoptic Problem using data science and computational linguistics methods. Interested parties are welcome to connect.

Data Paper and Normalized Datasets for Roth’s 2015 GMarc Reconstruction Accepted for Journal of Open Humanities Data (LODLIB 2.15)

This week’s LODLIB version now contains the author’s accepted version of our data paper and related transformational, normalized datasets based on Roth’s 2015 reconstruction of the Gospel of Marcion (GMarc). As we note in the paper, Roth’s is the most widely accepted reconstruction in scholarship today. Sincere thanks go to Dr. Roth and to Tanja Cowall at Brill’s copyright office for securing an agreement for the distribution of these datasets under a CC-BY-NC-ND license. We also would like to thank the editor-in-chief and three anonymous reviewers at JOHD for their excellent and constructive feedback on this work. While the data paper is still at print, we are happy to go ahead and share the DOIs that have already been minted for the paper (https://doi.org/10.5334/johd.57) and datasets (https://doi.org/10.7910/DVN/BYPOOR). [The latter DOI was randomly (providentially?) generated, by the way! We have no control over what specific DOI is generated when we upload datasets to the Harvard JOHD Dataverse.]

Lemmatization and Part of Speech Tagging Completed for All Greek GMarc Datasets (LODLIB v2.13)

Following up on today’s publication in Journal of Open Humanities Data of my data paper and accompanying normalized, lemmatized, morphologized, born-digital, and peer-reviewed version of Harnack’s reconstruction of the Gospel of Marcion (GMarc), in v2.13 of my LODLIB I’ve now released a lemmatized and morphologized dataset of August Hahn’s 1832 reconstruction of GMarc. After many grueling months of work on these Greek texts in parallel, I’ve also completed lemmatizing and morphologizing the reconstructions of GMarc by Zahn, Klinghardt, and Nicolotti. Since the latter two are based on works still under copyright, we will start conversations to see how best to publish these and would like to take this opportunity to invite Klinghardt and Nicolotti publicly to join as collaborators on these datasets. The Zahn dataset will appear in next week’s LODLIB, and I will soon submit both the Hahn and Zahn datasets and accompanying data papers for peer-review and formal publication.

In other related news, Jason BeDuhn and I are in talks about how best to structure next month’s Westar SBL session on Q and the Gospel of Marcion. If any scholars specializing in GMarc and/or Q would like to be respondents in the session, please let me know.

Normalization and Quality Control Pass Completed for All GMarc Datasets (LODLIB v2.12)

Today’s LODLIB update reflects datatype normalization and quality control checks across all of our GMarc datasets (Hahn, Zahn, Harnack, Tsutsui, BeDuhn, Roth, Klinghardt, Nicolotti). While we have only released the full text of the first three, since their print works are in the public domain, we have made use of all of this normalized data in our new data tabulations (3.7) and data visualizations (3.8). While our own iterative critical edition is still in progress, the counts and graphs for all earlier editions should now remain static, thus we are now comfortable building these data tabulations and visualizations into forthcoming journal articles and book reviews.

In other related news, Jason BeDuhn and I are meeting later today to discuss the Westar SBL session on Q and the Gospel of Marcion. Given our overlapping scholarly work, I’m very much looking forward to the conversation. I also received just today the proofs of my forthcoming data paper for the Journal of Open Humanities Data. It’s always nice to see one’s work as it’s about to go to (digital) press.

Cycles of Continuous Improvement (LODLIB v2.11)

Today’s LODLIB update reflects a major quality control check and normalization of our Hahn (1832) dataset of human-readable Greek, as well as minor corrections to some of the calculations in our Cluster Analysis and Statistical Analysis sections. We’re also happy to confirm via David Galston that Westar will be hosting an online/virtual session devoted to Q and the Gospel of Marcion as part of the upcoming Society of Biblical Literature annual meeting. If you are interested in planning or participating in that session, please let me or David know! Here’s hoping that venue provides a launching pad for a new kind of Jesus Seminar focused on the scientific restoration and reconstruction of the many historical voices embedded within early canonical and non-canonical gospels. This week we also made a few minor corrections to our Harnack JOHD data paper, which should be published very soon.

Channeling the Ghost of Thomas Kuhn (LODLIB v2.10)

This week’s LODLIB spices things up with numerous inspirational/illuminating quotations from Thomas Kuhn’s The Structure of Scientific Revolutions strewn throughout Part One of our LODLIB. These quotations inherently convey an outlandish confidence from someone who really thinks to be leading a scientific revolution in the study of the Gospels. Whether our five hypotheses, triangulation theorem, and various other proofs and methods are mostly right or mostly wrong, the field will eventually decide! All I can do is keep writing and moving forward.

Read More »

Data Paper and Datasets Accepted for Journal of Open Humanities Data (LODLIB v2.09)

This week’s LODLIB contains the author’s accepted version of our data paper and related datasets of Harnack’s 1924 reconstruction of the Gospel of Marcion (GMarc). Heartfelt thanks go to the journal’s editor-in-chief, Barbara McGillivray, to the four anonymous reviewers for their patient and thorough feedback, and to Paul Dilley for advising me to submit this work to JOHD, one of the many excellent Open Access journals hosted by Ubiquity Press. Because of them, both the paper and the datasets are far better than what I initially submitted. Their constructive criticism is ultimately what pushed me to develop consistent data normalization standards, both for the Harnack datasets and all other reconstructions of GMarc. These standards will allow for consistent and meaningful Computational Linguistics analysis. The fruits of this work are already evident in our data tabulations and visualizations in our LODLIB (a freshly released sample below) and will become more evident as we submit additional datasets and related papers for peer-review and formal publication. We’ll be sure to share DOIs for the paper (https://doi.org/10.5334/johd.47) and datasets (https://doi.org/10.7910/DVN/5TEA5A) as they are published.

Epiphanius Citation Journey Complete (LODLIB v2.02)

The major milestone in today’s upload is that we have finished combing through the nearly 230 references to the text of Marcion’s Gospel made by Epiphanius in his Panarion. Our LODLIB now includes quotations, translations, and page references for this content according to the latest critical editions of the series Griechischen Christlichen Schriftsteller. As a condensation of this work, we have assembled a catalog of these passages, a helpful panoramic overview of the journey Epiphanius took through Marcion’s Gospel, with references sorted in order of appearance. A screenshot of half of this literary itinerary is appended below. We hope it is useful to other scholars and the interested public.

Lots of other improvements have been made, but we will leave those for readers to discover. One other important bit of news is that our Harnack GMarc dataset is now in the third round of review for the Journal of Open Humanities Data. The review process has been quick and very helpful, and we are grateful to the reviewers for their constructive feedback. Fingers crossed on its acceptance! As always, we welcome constructive feedback on our LODLIB as well as offers of collaboration and research fellowship/position opportunities.