Journal ArticleAuthors: Tran, Vinh Quang; Nguyen, Thi Ngoc Diep (2021)
Table is one of the most common ways to represent structured data in documents. Existing
researches on image-based table structure recognition often rely on limited datasets with the largest
amount of 3,789 human-labeled tables as ICDAR 19 Track B dataset. A recent Table Bank dataset
for table structures contains 145K tables, however, the tables are labeled in an HTML tag sequence
format, which impedes the development of image-based recognition methods. In this paper, we
propose several processing methods that automatically convert an HTML tag sequence annotation
into bounding box annotation for table cells in one table image. By assembling these methods, we
could convert 42,02...