Dr Donald Sturgeon donald.j.sturgeon@durham.ac.uk
Assistant Professor
Chinese Text Project: a dynamic digital library of premodern Chinese
Sturgeon, Donald
Authors
Abstract
This article presents technical approaches and innovations in digital library design developed during the design and implementation of the Chinese Text Project, a widely-used, large-scale full-text digital library of premodern Chinese writing. By leveraging a combination of domain-optimized Optical Character Recognition, a purpose-designed crowdsourcing system, and an Application Programming Interface (API), this project simultaneously provides a sustainable transcription system, search interface and reading environment, as well as an extensible platform for transcribing and working with premodern Chinese textual materials. By means of the API, intentionally loosely integrated text mining tools are used to extend the platform, while also being reusable independently with materials from other sources and in other languages.
Citation
Sturgeon, D. (2021). Chinese Text Project: a dynamic digital library of premodern Chinese. Digital Scholarship in the Humanities, 36(S1), i101-i112. https://doi.org/10.1093/llc/fqz046
Journal Article Type | Article |
---|---|
Online Publication Date | Aug 29, 2019 |
Publication Date | 2021-06 |
Deposit Date | Aug 30, 2019 |
Publicly Available Date | Nov 2, 2021 |
Journal | Digital Scholarship in the Humanities |
Print ISSN | 2055-7671 |
Electronic ISSN | 2055-768X |
Publisher | Oxford University Press |
Peer Reviewed | Peer Reviewed |
Volume | 36 |
Issue | S1 |
Pages | i101-i112 |
DOI | https://doi.org/10.1093/llc/fqz046 |
Public URL | https://durham-repository.worktribe.com/output/1323952 |
Files
Accepted Journal Article
(1.3 Mb)
PDF
Copyright Statement
This is a pre-copyedited, author-produced PDF of an article accepted for publication in Digital Scholarship in the Humanities following peer review. The version of record: Sturgeon, Donald (2021). Chinese Text Project: a dynamic digital library of premodern Chinese. Digital Scholarship in the Humanities 36(S1), i101-i112 is available online at:https://doi.org/10.1093/llc/fqz046
You might also like
Crowdsourcing the Historical Record: Creating Linked Open Data for Chinese History at Scale
(2022)
Journal Article
Digitizing Premodern Text with the Chinese Text Project
(2020)
Journal Article
Digital Approaches to Text Reuse in the Early Chinese Corpus
(2018)
Journal Article
Zhuangzi, Perspectives, and Greater Knowledge
(2015)
Journal Article
Large-scale Optical Character Recognition of Pre-modern Chinese Texts
(2018)
Journal Article
Downloadable Citations
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search