Below are some resources I’ve found to be particularly helpful
biblehub.com
Bible Hub has become my go-to study resource when reading the Bible. This website contains a wealth of information, including different translations, commentaries, dictionaries and lexicons. Because it’s a one-stop-shop, this is also a great resource for scraping data and building datasets.
Christian Classics Ethereal Library
ccel.org is an excellent resource for classic Christian texts. They have a large selection of public domain works stretching back to the generation after the Apostles. Formats come in readable versions (like epub or pdf) and in machine-friendly formats (like raw text files).
TENAGRA
TENAGRA is the python package I built while studying the SBC resolutions. This code provides analysis and visualization tools to enable one to perform the same sort of work as I did on the SBC’s resolutions. And yes, it’s named for the Star Trek Next Generation Episode Darmok…
viz.bible
viz.bible contains lots of really interesting visualizations of information related to the Bible, including timelines, cross-references, and geographical information. The real attraction is the dataset powering everything which, among other things, connects the verses of the Bible to people, places, and events in a structured manner.
derekgreene.com
While I have never met Derek Green, I have relied predominantly on his work using Non-negative Matrix Factorization to identify topics within texts. His website contains some publications that I found particularly helpful, along with datasets that have proven useful for validating and developing models.
Berean Study Bible
The Berean Study Bible is an incredible resource primarily because of its openness. The BSB website provides a host of useful files for anyone interested in knowing the Bible better or using it for data science applications. They’re also very open with licensing and usage, making this one of the few modern translations that seeks to truly make the Word of God free to all.
openbible.info
openbible.info contains both a way to search the Bible using topics, rather than specific key words, and a data set providing geospatial links between the books of the Bible and the locations they reference.
Apostolic Bible Polyglot
The Apostolic Bible Polyglot is an incredible resource, from a data science perspective; it is an English translation of the Greek Old Testament and New Testament that contains a consistent numbering of Greek words. This means that, were someone to parse this text they would have a database with the entire text of the Bible tied to a single language.
Internet Sacred Texts Archive
The Internet Sacred Texts Archive contains a large number of spiritual writings from different faiths, all in the public domain (or close enough to it). Of particular interest is the full text of several commentaries which, while in public domain, are notoriously difficult to obtain in a useful form for creative works or study.