- Jiahuan Cao, Dezhi Peng, Peirong Zhang, Yongxin Shi, Yang Liu, Kai Ding, Lianwen Jin:
TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language Models. CoRR abs/2407.03937 (2024)
- Chenfan Qu, Yiwu Zhong, Fengjun Guo, Lianwen Jin:
Generalized Tampered Scene Text Detection in the era of Generative AI. CoRR abs/2407.21422 (2024)
- 2023
- Yuliang Liu, Jiaxin Zhang, Dezhi Peng, Mingxin Huang, Xinyu Wang, Jingqun Tang, Can Huang, Dahua Lin, Chunhua Shen, Xiang Bai, Lianwen Jin:
SPTS v2: Single-Point Scene Text Spotting. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 15665-15679 (2023)
- Hongyi Wang, Yang Xue, Jiaxin Zhang, Lianwen Jin:
Scene table structure recognition with segmentation collaboration and alignment. Pattern Recognit. Lett. 165: 146-153 (2023)
- Dezhi Peng, Lianwen Jin, Weihong Ma, Canyu Xie, Hesuo Zhang, Shenggao Zhu, Jing Li:
Recognition of Handwritten Chinese Text by Segmentation: A Segment-Annotation-Free Approach. IEEE Trans. Multim. 25: 2368-2381 (2023)
- Canjie Luo, Yuanzhi Zhu, Lianwen Jin, Zhe Li, Dezhi Peng:
SLOGAN: Handwriting Style Synthesis for Arbitrary-Length and Out-of-Vocabulary Text. IEEE Trans. Neural Networks Learn. Syst. 34(11): 8503-8515 (2023)
- Bingyan Liu, Weifeng Lin, Zhongjie Duan, Chengyu Wang, Ziheng Wu, Zhang Zipeng, Kui Jia, Lianwen Jin, Cen Chen, Jun Huang:
Rapid Diffusion: Building Domain-Specific Text-to-Image Synthesizers with Fast Inference Speed. ACL (industry) 2023: 295-304
- Jiapeng Wang, Chengyu Wang, Xiaodan Wang, Jun Huang, Lianwen Jin:
ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval. ACL (industry) 2023: 71-80
- Weifeng Lin, Canyu Xie, Dezhi Peng, Jiapeng Wang, Lianwen Jin, Wei Ding, Cong Yao, Mengchao He:
Building A Mobile Text Recognizer via Truncated SVD-based Knowledge Distillation-Guided NAS. BMVC 2023: 375
- Hiuyi Cheng, Peirong Zhang, Sihang Wu, Jiaxin Zhang, Qiyuan Zhu, Zecheng Xie, Jing Li, Kai Ding, Lianwen Jin:
M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis. CVPR 2023: 15138-15147
- Chenfan Qu, Chongyu Liu, Yuliang Liu, Xinhong Chen, Dezhi Peng, Fengjun Guo, Lianwen Jin:
Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution. CVPR 2023: 5937-5946
- Yingxiu Zhao, Bowen Yu, Bowen Li, Haiyang Yu, Jinyang Li, Chao Wang, Fei Huang, Yongbin Li, Nevin L. Zhang:
Causal Document-Grounded Dialogue Pre-training. EMNLP 2023: 7160-7174
- Mingxin Huang, Jiaxin Zhang, Dezhi Peng, Hao Lu, Can Huang, Yuliang Liu, Xiang Bai, Lianwen Jin:
ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer. ICCV 2023: 19438-19448
- Qing Jiang, Jiapeng Wang, Dezhi Peng, Chongyu Liu, Lianwen Jin:
Revisiting Scene Text Recognition: A Data Perspective. ICCV 2023: 20486-20497
- Weifeng Lin, Ziheng Wu, Jiayu Chen, Jun Huang, Lianwen Jin:
Scale-Aware Modulation Meet Transformer. ICCV 2023: 5992-6003
- Chenyang Gao, Yuliang Liu, Shiyu Yao, Jinfeng Bai, Xiang Bai, Lianwen Jin, Cheng-Lin Liu:
ICDAR 2023 Competition on Recognition of Multi-line Handwritten Mathematical Expressions. ICDAR (2) 2023: 566-576
- Liufeng Huang, Bangdong Chen, Chongyu Liu, Dezhi Peng, Weiying Zhou, Yaqiang Wu, Hui Li, Hao Ni, Lianwen Jin:
EnsExam: A Dataset for Handwritten Text Erasure on Examination Papers. ICDAR (3) 2023: 470-485
- Jiarong Huang, Dezhi Peng, Hongliang Li, Hao Ni, Lianwen Jin:
SegCTC: Offline Handwritten Chinese Text Recognition via Better Fusion Between Explicit and Implicit Segmentation. ICDAR (4) 2023: 332-349
- Cheng Jian, Lianwen Jin, Lingyu Liang, Chongyu Liu:
HisDoc R-CNN: Robust Chinese Historical Document Text Line Detection with Dynamic Rotational Proposal Network and Iterative Attention Head. ICDAR (1) 2023: 428-445
- Haiyang Li, Chongyu Liu, Jiapeng Wang, Mingxin Huang, Weiying Zhou, Lianwen Jin:
DTDT: Highly Accurate Dense Text Line Detection in Historical Documents via Dynamic Transformer. ICDAR (1) 2023: 381-396
- Zhuoming Li, Fan Peng, Yang Xue, Ni Hao, Lianwen Jin:
Scene Table Structure Recognition with Segmentation and Key Point Collaboration. ICDAR (2) 2023: 295-310
- Dongliang Luo, Yu Zhou, Rui Yang, Yuliang Liu, Xianjin Liu, Jishen Zeng, Enming Zhang, Biao Yang, Ziming Huang, Lianwen Jin, Xiang Bai:
ICDAR 2023 Competition on Detecting Tampered Text in Images. ICDAR (2) 2023: 587-600
- Wentao Yang, Zhe Li, Dezhi Peng, Lianwen Jin, Mengchao He, Cong Yao:
Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition. ACM Multimedia 2023: 2066-2077
- Yongxin Shi, Chongyu Liu, Dezhi Peng, Cheng Jian, Jiarong Huang, Lianwen Jin:
M5HisDoc: A Large-scale Multi-style Chinese Historical Document Analysis Benchmark. NeurIPS 2023
- Zongyuan Jiang, Jiapeng Wang, Jiahuan Cao, Xue Gao, Lianwen Jin:
Towards Better Translations from Classical to Modern Chinese: A New Dataset and a New Method. NLPCC (1) 2023: 387-399
- Yuliang Liu, Jiaxin Zhang, Dezhi Peng, Mingxin Huang, Xinyu Wang, Jingqun Tang, Can Huang, Dahua Lin, Chunhua Shen, Xiang Bai, Lianwen Jin:
SPTS v2: Single-Point Scene Text Spotting. CoRR abs/2301.01635 (2023)
- Yuliang Liu, Zhang Li, Hongliang Li, Wenwen Yu, Mingxin Huang, Dezhi Peng, Mingyu Liu, Mingrui Chen, Chunyuan Li, Lianwen Jin, Xiang Bai:
On the Hidden Mystery of OCR in Large Multimodal Models. CoRR abs/2305.07895 (2023)
- Hiuyi Cheng, Peirong Zhang, Sihang Wu, Jiaxin Zhang, Qiyuan Zhu, Zecheng Xie, Jing Li, Kai Ding, Lianwen Jin:
M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis. CoRR abs/2305.08719 (2023)
- Yingxiu Zhao, Bowen Yu, Haiyang Yu, Bowen Li, Jinyang Li, Chao Wang, Fei Huang, Yongbin Li, Nevin L. Zhang:
Causal Document-Grounded Dialogue Pre-training. CoRR abs/2305.10927 (2023)
- Jiapeng Wang, Chengyu Wang, Xiaodan Wang, Jun Huang, Lianwen Jin:
ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval. CoRR abs/2305.17652 (2023)