DH200 Lecture Notes - Lecture 3: Xml, Text Encoding Initiative, Roberto Busa
Document Summary
Long tradition of scholars using whatever it available: research, dissemination, interaction with each other and public. Beginnings of digital humanities are linked to roberto busa and index thomesticus. Sought out head of ibm in 1949 to aid indexing works of thomas aquina for text searches. Project lasted decades with printed (cid:448)olu(cid:373)es (cid:894)(cid:1005)(cid:1013)(cid:1011)(cid:1004)"s(cid:895), cd-rom (1989), and online (cid:448)e(cid:396)sio(cid:374)s (cid:894)(cid:1006)(cid:1004)(cid:1004)(cid:1004)"s(cid:895) Powerful new indexing techniques; lemmatization (more complex form of indexing) e. g. distinguishing nouns and verbs but can also group together similar words e. g. Powerful new indexing techniques; lemmatization (more complex form of indexing) e. g. distinguishing nouns and verbs but can also group together similar words e. g. mark vs. marks, and good better best. Digital li(cid:271)(cid:396)a(cid:396)ies a(cid:374)d (cid:862)ebooks(cid:863): p(cid:396)oje(cid:272)t gute(cid:374)(cid:271)e(cid:396)g (cid:894)est. (cid:1005)(cid:1013)(cid:1011)(cid:1005)(cid:895) a(cid:374)d mi(cid:272)hael ha(cid:396)t. Make available public domain works in simple text. Began on arpanet, connected computer (early part of internet) New online interface in 2004, and today around a million titles and many languages.