Feature based Text Extraction System using Connected Component Method
Text detection and segmentation system serves as important method for document analysis as it helps in many content based image analysis tasks. This research paper proposes a connected component technique for text extraction and character segmentation using maximally stable extremal regions (MSERs) for text line formation followed by connected components to determined separate characters. The system uses a cluster size of five which is selected by experimental evaluation for identifying characters. Sobel edge detector is used as it reduces the execution time but at the same time maintains quality of the results. The algorithm is tested along a set of JPEG, PNG and BMP images over varying features like font size, style, colour, background colour and text variation. Further the CPU time in execution of the algorithm with three different edge detectors namely prewitt, sobel and canny is observed. Text identification using MSER gave very good results whereas character segmentation gave on average 94.572% accuracy for the various test cases considered for this study.
Year of publication: |
2016
|
---|---|
Authors: | Sambyal, Nitigya ; Abrol, Pawanesh |
Published in: |
International Journal of Synthetic Emotions (IJSE). - IGI Global, ISSN 1947-9107, ZDB-ID 2703808-7. - Vol. 7.2016, 1 (01.01.), p. 41-57
|
Publisher: |
IGI Global |
Subject: | Canny | Character Segmentation | Cluster | Connected Component Method | Edge Detectors | Maximally Stable Extremal Regions | Prewitt | Sobel | Text Detection |
Saved in:
Online Resource
Saved in favorites
Similar items by subject
-
Medsem : a Stata package for statistical mediation analysis
Mehmetoglu, Mehmet, (2018)
-
LUPU, Catalin, (2011)
-
Miao, Ruiqiang, (2024)
- More ...