Skip to content
This repository has been archived by the owner on Nov 16, 2020. It is now read-only.

TEI Export: output might not be well-formed XML/valid TEI #26

Open
2 tasks
kahlep opened this issue Oct 13, 2017 · 0 comments
Open
2 tasks

TEI Export: output might not be well-formed XML/valid TEI #26

kahlep opened this issue Oct 13, 2017 · 0 comments

Comments

@kahlep
Copy link
Contributor

kahlep commented Oct 13, 2017

For documents including overlapping tags, the TEI XML created by the export routine is not always well-formed and is thus not usable with TEI-conformant tools such as Voyant Tools.

An example can be found in Document 8393, page 141. The produced TEI XML for this page includes:

<l facs='#facs_141_line_1490048050651_115'>given <hi rend='underlined:true;'>by the King of Sweden to his Subjects <choice><expan></expan><abbr>Aug</hi><hi rend='underlined:true; superscript:true;'>t</hi><hi rend='underlined:true;'>.</abbr></choice> 21</hi><hi rend='underlined:true; superscript:true;'>st</hi><hi rend='underlined:true;'>.</hi> 1772</l>

Oxygen error message:
The element type "abbr" must be terminated by the matching end-tag "".

There should also be an export option to exclude user-defined tags without a valid TEI equivalent, in order to produce valid TEI XML.

  • fix nesting of tags
  • add "exclude invalid tags" option to export
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

1 participant