MUSTARD (Multilingual Scanned and Scene Table Structure Recognition Dataset)
MUSTARD (Multilingual Scanned and Scene Table Structure Recognition Dataset) is a multilingual dataset for table structure recognition. It contains tables taken from magazines in printed, scanned, and scene-text form, each labeled with an Optimized Table Structure Language (OTSL) sequence. It is intended to support research on multilingual table structure recognition, particularly for non-English documents.
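To make the OTSL labels more concrete, the sketch below shows one way such a sequence could be turned into a grid with cell spans. It assumes the five-token OTSL vocabulary ("C" for a cell, "L" for a cell merged with its left neighbour, "U" for a cell merged with the one above, "X" for a cell merged in both directions, "NL" for end of row). The exact token spelling and file layout used in MUSTARD's annotations are not stated here, so treat the token names and the helper functions `otsl_to_grid` and `cell_spans` as illustrative assumptions, not the dataset's own tooling.

```python
from typing import Dict, List, Tuple

# Assumed OTSL vocabulary; MUSTARD's files may spell these tokens differently.
OTSL_TOKENS = {"C", "L", "U", "X", "NL"}


def otsl_to_grid(tokens: List[str]) -> List[List[str]]:
    """Split a flat OTSL token sequence into rows at each "NL" token."""
    rows: List[List[str]] = []
    current: List[str] = []
    for tok in tokens:
        if tok not in OTSL_TOKENS:
            raise ValueError(f"unknown OTSL token: {tok!r}")
        if tok == "NL":
            rows.append(current)
            current = []
        else:
            current.append(tok)
    if current:  # tolerate a missing trailing "NL"
        rows.append(current)
    # A well-formed OTSL sequence describes a rectangular grid.
    if len({len(row) for row in rows}) > 1:
        raise ValueError("rows have different lengths")
    return rows


def cell_spans(grid: List[List[str]]) -> Dict[Tuple[int, int], Tuple[int, int]]:
    """Map each "C" position (row, col) to its (rowspan, colspan)."""
    spans: Dict[Tuple[int, int], Tuple[int, int]] = {}
    n_rows = len(grid)
    n_cols = len(grid[0]) if grid else 0
    for r in range(n_rows):
        for c in range(n_cols):
            if grid[r][c] != "C":
                continue
            # Count "L" tokens to the right: they extend this cell horizontally.
            colspan = 1
            while c + colspan < n_cols and grid[r][c + colspan] == "L":
                colspan += 1
            # Count "U" tokens below: they extend this cell vertically.
            rowspan = 1
            while r + rowspan < n_rows and grid[r + rowspan][c] == "U":
                rowspan += 1
            spans[(r, c)] = (rowspan, colspan)
    return spans


# Hypothetical 2x3 table: a header cell spanning two columns,
# and a last-column cell spanning two rows.
example = ["C", "L", "C", "NL", "C", "C", "U", "NL"]
grid = otsl_to_grid(example)
spans = cell_spans(grid)
```

In this hypothetical example the first cell spans two columns and the top-right cell spans two rows; the remaining cells are ordinary 1x1 cells.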
CC0 1.0 Public Domain
3 files, 12 directories
1 file, 4 directories
109.14 KB
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.