XML Tagging in Entry-Files


Table of Contents

XML Tagging in Entry-Files
2017.March.12, 4:00pm
2017.March.13, 11:00am
2017.March.17, 7:00pm
2017.March.19, 5:00pm
Name: Sohini Chattopadhyay
Date and Time:

2017.March.12, 4:00pm

Location: 112th Street, NY 10025
Subject: Begin exercise

I began working on the xml files. I began the task by using sublime text.

I followed the sample xml sheet but it did not have an example of tagging images. I decided to also shift to notepad++ just to play around with different text editors. I saw that it's easier to work on the tagging after saving the file with an xml extension. This was so because once notepad++ recognizes that it is editing an xml document, it automatically renders the elements and the attributes into different colours.

Going back to tagging images, I kept on working on some tags while figuring out how to use them correctly. In order to vaguely check for errors, I copied the xml files to the web browser which would highlight errors. This appeared to be very helpful. I eventually realized that some of my errors were due to the clashing of elements. I was not tagging the image as a sub element within the entry element which created a problem. Once I figured this, it became easier to tag.
Many online xml tutorials had files that showed an xml declaration. I included one xml declaration by checking the encoding I was using on the bottom right of my notepad++ editor. I did this as a learning process - I wanted to understand what are the versions of XML, and how the structure of the declaration is different from structures of attributes. I understand that XML document without a declaration is an XML 1.0 document, therefore it's better to specify it since XML 1.1 makes the XML declaration mandatory. Moreover, the encoding of the document is for convenience for transmitting the document, and since the foundation of XML is Unicode. If we do not specify the encoding through an XML declaration, XML processor assumes its UTF-8 or UTF-16.
I began searching for a list because it would throw up some challenges in tagging and require some editorial interpretation as well. I tagged each content of the list 169r separately while putting them within the broad element of <list>. I wanted to do this because I explicitly wanted to highlight that each are semantically distinct.



Name: Sohini Chattopadhyay
Date and Time:

2017.March.13, 11:00am

Location: 112th Street, NY 10025
Subject: Selection and modification of some unique entries

entry p169v_2 continued into different folios so I chose this to experiment with my xml taggings.

for entry p158v_3 I tried to follow one of the user stories that cropped up in our earlier discussions on user story: to juxtapose the digitized texts and the folio facsimile for the viewers to see both. Therefore I created a new entry <manuscript> to tag the images. I did not want to use the term folio as we were already using that to signify the folio number and the url. I used the word manuscript for want of a better word to substitute folio.



Name: Sohini Chattopadhyay
Date and Time:

2017.March.17, 7:00pm

Location: 112th Street, NY 10025
Subject: Committed and sent xml files to remote repository

I fetched/pulled, committed and pushed the xml files to the remote github repository on 17th March.
___

Name: Sohini Chattopadhyay
Date and Time:

2017.March.19, 5:00pm

Location: 112th Street, NY 10025
Subject: Working on another unique file

began working on an entry with historical names and documents following tei guidelines in http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ND.html.
I picked this file because it had sub-headings on the left margins. I checked with the facsimile of the folio to check the positions. I created a new element called subheading to demarcate the subheadings. While in the margin, these cannot be categorized as marginalia. I assume that instead these were the author's way of categorizing for this particular entry. This also contained names of groups of people like 'Greeks' which I described through the element <people> as I did not want to use the term <nation> without a sound knowledge of the prevalent meaning of the term contemporaneous to the author of the manuscript. It actually made me realize that tagging is a strong intellectual exercise that allows one to attribute new characteristics and stable meanings to words. Therefore, it's not just a way of rendering a manuscript into a digital form, but the process itself is an active form of reading as well as interpreting and editing. For the purpose of learning and to experiment with xml, I tried to insert a link to the mention of Herodotus. I renders questions about editorial liberty, for which reason I put the link in the background of the text. I am also unsure if I did it correctly at all or not. I also want to learn more about creating a 'class' as shown in the tei guidelines. So far, my last three attempts to do so were very vague at best.


References:

http://www.tei-c.org/release/doc/tei-p5-doc/en/html/

http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SG.html

http://dh.obdurodon.org/what-is-xml.xhtml

http://www.ibm.com/developerworks/library/x-tipdecl/

https://www.w3.org/standards/xml/core

I found this website helpful to clarify some of my doubts: https://www.tutorialspoint.com/xml/index.htm