Jiang, Congfeng; Liu, Junming; Ou, Dongyang; Wang, Yumei; … - In: Journal of Database Management (JDM) 29 (2018) 2, pp. 1-22
The authors propose to use formatting templates and implicit formatting semantics information for automatic metadata identification and segmentation. The pure texts and their corresponding formatting information including line height, font type, and font size, are recognized in parallel to guide...