Related references
Note: Only part of the references are listed.TextMountain: Accurate scene text detection via instance segmentation
Yixing Zhu et al.
PATTERN RECOGNITION (2021)
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes
Minghui Liao et al.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2021)
Accuracy vs. complexity: A trade-off in visual question answering models
Moshiur Farazi et al.
PATTERN RECOGNITION (2021)
Linguistically-aware attention for reducing the semantic gap in vision-language tasks
K. V. Gouthaman et al.
PATTERN RECOGNITION (2021)
Track, Attend, and Parse (TAP): An End-to-End Framework for Online Handwritten Mathematical Expression Recognition
Jianshu Zhang et al.
IEEE TRANSACTIONS ON MULTIMEDIA (2019)
ASTER: An Attentional Scene Text Recognizer with Flexible Rectification
Baoguang Shi et al.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2019)
Dense semantic embedding network for image captioning
Xinyu Xiao et al.
PATTERN RECOGNITION (2019)
DeCNT: Deep Deformable CNN for Table Detection
Shoaib Ahmed Siddiqui et al.
IEEE ACCESS (2018)
Deep Visual-Semantic Alignments for Generating Image Descriptions
Andrej Karpathy et al.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2017)
Watch, attend and parse: An end-to-end neural network based approach to handwritten mathematical expression recognition
Jianshu Zhang et al.
PATTERN RECOGNITION (2017)
Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Networks
Xiao Yang et al.
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)
Tree edit distance: Robust and memory-efficient
Mateusz Pawlik et al.
INFORMATION SYSTEMS (2016)