This paper presents mathematical formula detection in heterogeneous document images that may contain figures, tables, text, and math formulas. We adopt the method originally proposed for sign detection in natural images to detect non-homogeneous regions and accordingly achieve text line detection and segmentation. Novel features based on centroid fluctuation information of non-homogeneous regions are proposed to more appropriately characterize both displayed formulas and embedded formulas. By comparing the proposed method with previous works, we demonstrate the effectiveness of the proposed features.
|出版狀態||Published - 2013 一月 1|
|事件||2013 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2013 - Taipei, Taiwan|
持續時間: 2013 十二月 6 → 2013 十二月 8
|Other||2013 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2013|
|期間||13-12-06 → 13-12-08|
All Science Journal Classification (ASJC) codes