Skip to content
This repository has been archived by the owner on Nov 16, 2020. It is now read-only.

Invalid PAGE XML caused by PrintSpace with negative PageCoords #45

Open
stweil opened this issue May 23, 2020 · 0 comments
Open

Invalid PAGE XML caused by PrintSpace with negative PageCoords #45

stweil opened this issue May 23, 2020 · 0 comments

Comments

@stweil
Copy link

stweil commented May 23, 2020

The NZZ PAGE XML file was created by Transkribus, and it contains data which is reported as invalid:

<Page imageFilename="0111_nzz_18901222_0_0_a1_p1_1.tif" imageWidth="3839" imageHeight="5551">
    <PrintSpace>
        <Coords points="4,-27 3842,-27 3842,5524 4,5524"/>
    </PrintSpace>

ocr-validate.py reports that negative values are invalid here.
PRImA page viewer refuses to load PAGE XML with such data.

See also issue #38 with other PAGE XML related problems.

@stweil stweil changed the title Invalid PAGE XML caused by negative PageCoords Invalid PAGE XML caused by PrintSpace with negative PageCoords May 23, 2020
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant