Article Segmentation
The Automatic-Article-Segmentation (AAS) – a software developed by PPS – structures the blocks generated from the FR native XMLs using complex algorithms and various analysis methods and assigns them to the correct reading order. Advertisements are also filtered out and obituaries are recognized and tagged accordingly. Furthermore the AAS recognizes uptitle, main titles, subtitles, opening credits and the article text as well as picture captions by a typographic analysis and tagged them accordingly.
On request we also deliver msh Web:digiPaper-, DC-X- and fink & PARTNER huGO-compliant.
We analyse the following article elements:
- uptitle
- title
- subtitles
- description
- picture caption
- image
- article text
- authors
- department
- columns