Abstract
This paper presents mathematical formula detection in heterogeneous document images that may contain figures, tables, text, and math formulas. We adopt the method originally proposed for sign detection in natural images to detect non-homogeneous regions and accordingly achieve text line detection and segmentation. Novel features based on centroid fluctuation information of non-homogeneous regions are proposed to more appropriately characterize both displayed formulas and embedded formulas. By comparing the proposed method with previous works, we demonstrate the effectiveness of the proposed features.
Original language | English |
---|---|
Pages | 140-145 |
Number of pages | 6 |
DOIs | |
Publication status | Published - 2013 |
Event | 2013 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2013 - Taipei, Taiwan Duration: 2013 Dec 6 → 2013 Dec 8 |
Other
Other | 2013 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2013 |
---|---|
Country/Territory | Taiwan |
City | Taipei |
Period | 13-12-06 → 13-12-08 |
All Science Journal Classification (ASJC) codes
- Artificial Intelligence