Mathematical formula detection in heterogeneous document images

Wei Ta Chu, Fan Liu

Research output: Contribution to conferencePaper

7 Citations (Scopus)

Abstract

This paper presents mathematical formula detection in heterogeneous document images that may contain figures, tables, text, and math formulas. We adopt the method originally proposed for sign detection in natural images to detect non-homogeneous regions and accordingly achieve text line detection and segmentation. Novel features based on centroid fluctuation information of non-homogeneous regions are proposed to more appropriately characterize both displayed formulas and embedded formulas. By comparing the proposed method with previous works, we demonstrate the effectiveness of the proposed features.

Original languageEnglish
Pages140-145
Number of pages6
DOIs
Publication statusPublished - 2013 Jan 1
Event2013 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2013 - Taipei, Taiwan
Duration: 2013 Dec 62013 Dec 8

Other

Other2013 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2013
CountryTaiwan
CityTaipei
Period13-12-0613-12-08

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence

Cite this

Chu, W. T., & Liu, F. (2013). Mathematical formula detection in heterogeneous document images. 140-145. Paper presented at 2013 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2013, Taipei, Taiwan. https://doi.org/10.1109/TAAI.2013.38
Chu, Wei Ta ; Liu, Fan. / Mathematical formula detection in heterogeneous document images. Paper presented at 2013 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2013, Taipei, Taiwan.6 p.
@conference{13ddac5f286e4b109dcc941c8dfff49b,
title = "Mathematical formula detection in heterogeneous document images",
abstract = "This paper presents mathematical formula detection in heterogeneous document images that may contain figures, tables, text, and math formulas. We adopt the method originally proposed for sign detection in natural images to detect non-homogeneous regions and accordingly achieve text line detection and segmentation. Novel features based on centroid fluctuation information of non-homogeneous regions are proposed to more appropriately characterize both displayed formulas and embedded formulas. By comparing the proposed method with previous works, we demonstrate the effectiveness of the proposed features.",
author = "Chu, {Wei Ta} and Fan Liu",
year = "2013",
month = "1",
day = "1",
doi = "10.1109/TAAI.2013.38",
language = "English",
pages = "140--145",
note = "2013 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2013 ; Conference date: 06-12-2013 Through 08-12-2013",

}

Chu, WT & Liu, F 2013, 'Mathematical formula detection in heterogeneous document images', Paper presented at 2013 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2013, Taipei, Taiwan, 13-12-06 - 13-12-08 pp. 140-145. https://doi.org/10.1109/TAAI.2013.38

Mathematical formula detection in heterogeneous document images. / Chu, Wei Ta; Liu, Fan.

2013. 140-145 Paper presented at 2013 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2013, Taipei, Taiwan.

Research output: Contribution to conferencePaper

TY - CONF

T1 - Mathematical formula detection in heterogeneous document images

AU - Chu, Wei Ta

AU - Liu, Fan

PY - 2013/1/1

Y1 - 2013/1/1

N2 - This paper presents mathematical formula detection in heterogeneous document images that may contain figures, tables, text, and math formulas. We adopt the method originally proposed for sign detection in natural images to detect non-homogeneous regions and accordingly achieve text line detection and segmentation. Novel features based on centroid fluctuation information of non-homogeneous regions are proposed to more appropriately characterize both displayed formulas and embedded formulas. By comparing the proposed method with previous works, we demonstrate the effectiveness of the proposed features.

AB - This paper presents mathematical formula detection in heterogeneous document images that may contain figures, tables, text, and math formulas. We adopt the method originally proposed for sign detection in natural images to detect non-homogeneous regions and accordingly achieve text line detection and segmentation. Novel features based on centroid fluctuation information of non-homogeneous regions are proposed to more appropriately characterize both displayed formulas and embedded formulas. By comparing the proposed method with previous works, we demonstrate the effectiveness of the proposed features.

UR - http://www.scopus.com/inward/record.url?scp=84899460005&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84899460005&partnerID=8YFLogxK

U2 - 10.1109/TAAI.2013.38

DO - 10.1109/TAAI.2013.38

M3 - Paper

AN - SCOPUS:84899460005

SP - 140

EP - 145

ER -

Chu WT, Liu F. Mathematical formula detection in heterogeneous document images. 2013. Paper presented at 2013 Conference on Technologies and Applications of Artificial Intelligence, TAAI 2013, Taipei, Taiwan. https://doi.org/10.1109/TAAI.2013.38