


Lianwen Jin
2020 – today
- 2025
- [j104]Chongyu Liu, Qing Jiang, Dezhi Peng, Yuxin Kong, Jiaxin Zhang, Longfei Xiong, Jiwei Duan, Cheng Sun, Lianwen Jin:
QT-TextSR: Enhancing scene text image super-resolution via efficient interaction with text recognition using a Query-aware Transformer. Neurocomputing 620: 129241 (2025) - [j103]Yuyi Zhang, Yuanzhi Zhu, Dezhi Peng, Peirong Zhang, Zhenhua Yang, Zhibo Yang, Cong Yao, Lianwen Jin:
HierCode: A lightweight hierarchical codebook for zero-shot Chinese text recognition. Pattern Recognit. 158: 110963 (2025) - [j102]Hongliang Li, Dezhi Peng, Lianwen Jin:
EGO-LM: An efficient, generic, and out-of-the-box language model for handwritten text recognition. Pattern Recognit. 159: 111130 (2025) - [i108]Ling Fu, Biao Yang, Zhebin Kuang, Jiajun Song, Yuzhe Li, Linghao Zhu, Qidi Luo, Xinyu Wang, Hao Lu, Mingxin Huang, Zhang Li, Guozhi Tang, Bin Shan, Chunhui Lin, Qi Liu, Binghong Wu, Hao Feng, Hao Liu, Can Huang, Jingqun Tang, Wei Chen, Lianwen Jin, Yuliang Liu, Xiang Bai:
OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning. CoRR abs/2501.00321 (2025)
- 2024
- [j101]Zhishu Sun, Luojun Lin, Yuanlong Yu, Lianwen Jin:
Learning feature alignment across attribute domains for improving facial beauty prediction. Expert Syst. Appl. 249: 123644 (2024) - [j100]Zhile Chen, Yuhui Quan, Ruotao Xu, Lianwen Jin, Yong Xu:
Enhancing texture representation with deep tracing pattern encoding. Pattern Recognit. 146: 109959 (2024) - [j99]Ziyan Li, Yuhao Huang, Dezhi Peng, Mengchao He, Lianwen Jin:
SideNet: Learning representations from interactive side information for zero-shot Chinese character recognition. Pattern Recognit. 148: 110208 (2024) - [j98]Zhe Li, Wentao Yang, Hengnian Qi, Lianwen Jin, Yichao Huang, Kai Ding:
A tree-based model with branch parallel decoding for handwritten mathematical expression recognition. Pattern Recognit. 149: 110220 (2024) - [j97]Ziyan Li, Lianwen Jin, Chengquan Zhang, Jiaxin Zhang, Zecheng Xie, Pengyuan Lyu, Kun Yao:
Irregular text block recognition via decoupling visual, linguistic, and positional information. Pattern Recognit. 153: 110516 (2024) - [j96]Jiaxin Zhang, Lingyu Liang, Kai Ding, Fengjun Guo, Lianwen Jin:
Appearance Enhancement for Camera-Captured Document Images in the Wild. IEEE Trans. Artif. Intell. 5(5): 2319-2330 (2024) - [j95]Peirong Zhang, Lianwen Jin:
Online Writer Retrieval With Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach. IEEE Trans. Inf. Forensics Secur. 19: 10387-10399 (2024) - [j94]Zhe Li, Xinyu Wang, Yuliang Liu, Lianwen Jin, Yichao Huang, Kai Ding:
Improving Handwritten Mathematical Expression Recognition via Similar Symbol Distinguishing. IEEE Trans. Multim. 26: 90-102 (2024) - [j93]Jiale Cheng, Dongzi Shi, Chenyang Li, Yu Li, Hao Ni, Lianwen Jin, Xin Zhang:
Skeleton-Based Gesture Recognition With Learnable Paths and Signature Features. IEEE Trans. Multim. 26: 3951-3961 (2024) - [c202]Dezhi Peng, Chongyu Liu, Yuliang Liu, Lianwen Jin:
ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining. AAAI 2024: 4468-4477 - [c201]Ruilu Wang, Yang Xue, Lianwen Jin:
DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degradations. AAAI 2024: 5563-5571 - [c200]Zhenhua Yang, Dezhi Peng, Yuxin Kong, Yuyi Zhang, Cong Yao, Lianwen Jin:
FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning. AAAI 2024: 6603-6611 - [c199]Ning Zhang, Hiuyi Cheng, Jiayu Chen, Zongyuan Jiang, Jun Huang, Yang Xue, Lianwen Jin:
M2Doc: A Multi-Modal Fusion Approach for Document Layout Analysis. AAAI 2024: 7233-7241 - [c198]Jiapeng Wang, Chengyu Wang, Tingfeng Cao, Jun Huang, Lianwen Jin:
DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation. ACL (Findings) 2024: 8826-8840 - [c197]Wenhui Liao, Jiapeng Wang, Zening Lin, Longfei Xiong, Lianwen Jin:
PPTSER: A Plug-and-Play Tag-guided Method for Few-shot Semantic Entity Recognition on Visually-rich Documents. ACL (Findings) 2024: 10522-10539 - [c196]Haisu Guan, Huanxin Yang, Xinyu Wang, Shengwei Han, Yongge Liu, Lianwen Jin, Xiang Bai, Yuliang Liu:
Deciphering Oracle Bone Language with Diffusion Models. ACL (1) 2024: 15554-15567 - [c195]Chenfan Qu, Yiwu Zhong, Chongyu Liu, Guitao Xu, Dezhi Peng, Fengjun Guo, Lianwen Jin:
Towards Modern Image Manipulation Localization: A Large-Scale Dataset and Novel Methods. CVPR 2024: 10781-10790 - [c194]Mingxin Huang, Hongliang Li, Yuliang Liu, Xiang Bai, Lianwen Jin:
Bridging the Gap Between End-to-End and Two-Step Text Spotting. CVPR 2024: 15608-15618 - [c193]Jiaxin Zhang, Dezhi Peng, Chongyu Liu, Peirong Zhang, Lianwen Jin:
DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks. CVPR 2024: 15654-15664 - [c192]Jiahuan Cao, Dezhi Peng, Peirong Zhang, Yongxin Shi, Yang Liu, Kai Ding, Lianwen Jin:
TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language Models. EMNLP (Findings) 2024: 4196-4210 - [c191]Jiapeng Wang, Chengyu Wang, Kunzhe Huang, Jun Huang, Lianwen Jin:
VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models. EMNLP 2024: 16061-16075 - [c190]Qixiang Li, Zhaoya Wang, Lianwen Jin, Nurbiya Yadikar, Kurban Ubul:
MMHSV: A Multimodal Handwritten Signature Verification Fusing Dynamic and Static Feature. ICASSP 2024: 4730-4734 - [c189]Zhen Xu, Ziqiang Chen, Yaqiang Wu, Hui Li, Wanjun Lv, Lianwen Jin, Qianying Wang:
A Multi-Scale Bimodal Fusion Network for Robust and Accurate Online Handwriting Recognition. ICASSP 2024: 6460-6464 - [c188]Linger Deng, Mingxin Huang, Xudong Xie, Yuliang Liu, Lianwen Jin, Xiang Bai:
Progressive Evolution from Single-Point to Polygon for Scene Text. ICDAR (5) 2024: 111-128 - [c187]Pengjie Wang, Kaile Zhang, Xinyu Wang, Shengwei Han, Yongge Liu, Lianwen Jin, Xiang Bai, Yuliang Liu:
Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction. ICDAR (1) 2024: 169-187 - [c186]Yuxin Kong, Weihong Ma, Lianwen Jin, Yang Xue:
GARDEN: Generative Prior Guided Network for Scene Text Image Super-Resolution. ICDAR (5) 2024: 196-214 - [c185]Xinhong Chen, Bangdong Chen, Chenfan Qu, Dezhi Peng, Chongyu Liu, Lianwen Jin:
DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator. ICDAR (1) 2024: 438-452 - [c184]Zongyuan Jiang, Jiayu Chen, Chongyu Liu, Ning Zhang, Jun Huang, Xue Gao, Lianwen Jin:
RISC: Boosting High-quality Referring Image Segmentation via Foundation Model CLIP. ICME 2024: 1-6 - [c183]Dezhi Peng, Zhenhua Yang, Jiaxin Zhang, Chongyu Liu, Yongxin Shi, Kai Ding, Fengjun Guo, Lianwen Jin:
UPOCR: Towards Unified Pixel-Level OCR Interface. ICML 2024 - [c182]Zening Lin, Jiapeng Wang, Wenhui Liao, Weicong Dai, Longfei Xiong, Lianwen Jin:
ROISER: Towards Real World Semantic Entity Recognition from Visually-Rich Documents. ICPR (31) 2024: 76-90 - [c181]Yaqiang Wu, Wanjun Lyu, Xianchen Liang, Qinghua Zheng, Jin Wei, Lianwen Jin:
MRCI: Multi-range Context Interaction for Boundary Refinement in Image Segmentation. ICPR (33) 2024: 211-226 - [c180]Zening Lin, Jiapeng Wang, Teng Li, Wenhui Liao, Dayi Huang, Longfei Xiong, Lianwen Jin:
PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction. ACM Multimedia 2024: 5171-5180 - [c179]Yaqiang Wu, Zhen Xu, Yong Duan, Yanlai Wu, Qinghua Zheng, Hui Li, Xiaochen Hu, Lianwen Jin:
RDLNet: A Novel and Accurate Real-world Document Localization Method. ACM Multimedia 2024: 9847-9855 - [c178]Jiahuan Cao, Yang Liu, Yongxin Shi, Kai Ding, Lianwen Jin:
WenMind: A Comprehensive Benchmark for Evaluating Large Language Models in Chinese Classical Literature and Language Arts. NeurIPS 2024 - [c177]Teng Li, Jiapeng Wang, Lianwen Jin:
Enhancing Visual Information Extraction with Large Language Models Through Layout-Aware Instruction Tuning. PRCV (7) 2024: 276-289 - [i107]Zening Lin, Jiapeng Wang, Teng Li, Wenhui Liao, Dayi Huang, Longfei Xiong, Lianwen Jin:
PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction. CoRR abs/2401.03472 (2024) - [i106]Mingxin Huang, Dezhi Peng, Hongliang Li, Zhenghao Peng, Chongyu Liu, Dahua Lin, Yuliang Liu, Xiang Bai, Lianwen Jin:
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting. CoRR abs/2401.07641 (2024) - [i105]Haisu Guan, Jinpeng Wan, Yuliang Liu, Pengjie Wang, Kaile Zhang, Zhebin Kuang, Xinyu Wang, Xiang Bai, Lianwen Jin:
An open dataset for the evolution of oracle bone characters: EVOBC. CoRR abs/2401.12467 (2024) - [i104]Pengjie Wang, Kaile Zhang, Yuliang Liu, Jinpeng Wan, Haisu Guan, Zhebin Kuang, Xinyu Wang, Lianwen Jin, Xiang Bai:
An open dataset for oracle bone script recognition and decipherment. CoRR abs/2401.15365 (2024) - [i103]Yang Liu, Jiahuan Cao, Chongyu Liu, Kai Ding, Lianwen Jin:
Datasets for Large Language Models: A Comprehensive Survey. CoRR abs/2402.18041 (2024) - [i102]Jiapeng Wang, Chengyu Wang, Tingfeng Cao, Jun Huang, Lianwen Jin:
DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation. CoRR abs/2403.04997 (2024) - [i101]Yuyi Zhang, Yuanzhi Zhu, Dezhi Peng, Peirong Zhang, Zhenhua Yang, Zhibo Yang, Cong Yao, Lianwen Jin:
HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition. CoRR abs/2403.13761 (2024) - [i100]Mingxin Huang, Hongliang Li, Yuliang Liu, Xiang Bai, Lianwen Jin:
Bridging the Gap Between End-to-End and Two-Step Text Spotting. CoRR abs/2404.04624 (2024) - [i99]Yuliang Liu, Mingxin Huang, Hao Yan, Linger Deng, Weijia Wu, Hao Lu, Chunhua Shen, Lianwen Jin, Xiang Bai:
VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization. CoRR abs/2404.19652 (2024) - [i98]Jiaxin Zhang, Dezhi Peng, Chongyu Liu, Peirong Zhang, Lianwen Jin:
DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks. CoRR abs/2405.04408 (2024) - [i97]Jiahuan Cao, Yongxin Shi, Dezhi Peng, Yang Liu, Lianwen Jin:
C3Bench: A Comprehensive Classical Chinese Understanding Benchmark for Large Language Models. CoRR abs/2405.17732 (2024) - [i96]Haisu Guan, Huanxin Yang, Xinyu Wang, Shengwei Han, Yongge Liu, Lianwen Jin, Xiang Bai, Yuliang Liu:
Deciphering Oracle Bone Language with Diffusion Models. CoRR abs/2406.00684 (2024) - [i95]Pengjie Wang, Kaile Zhang, Xinyu Wang, Shengwei Han, Yongge Liu, Lianwen Jin, Xiang Bai, Yuliang Liu:
Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction. CoRR abs/2406.03019 (2024) - [i94]Jiaxin Zhang, Wentao Yang, Songxuan Lai, Zecheng Xie, Lianwen Jin:
DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming. CoRR abs/2406.19101 (2024) - [i93]Jiahuan Cao, Dezhi Peng, Peirong Zhang, Yongxin Shi, Yang Liu, Kai Ding, Lianwen Jin:
TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language Models. CoRR abs/2407.03937 (2024) - [i92]Chenfan Qu, Yiwu Zhong, Fengjun Guo, Lianwen Jin:
Generalized Tampered Scene Text Detection in the era of Generative AI. CoRR abs/2407.21422 (2024) - [i91]Mingxin Huang, Yuliang Liu, Dingkang Liang, Lianwen Jin, Xiang Bai:
Mini-Monkey: Multi-Scale Adaptive Cropping for Multimodal Large Language Models. CoRR abs/2408.02034 (2024) - [i90]Yujin Ren, Jiaxin Zhang, Lianwen Jin:
LEGO: Self-Supervised Representation Learning for Scene Text Images. CoRR abs/2408.02036 (2024) - [i89]Wenhui Liao, Jiapeng Wang, Hongliang Li, Chengyu Wang, Jun Huang, Lianwen Jin:
DocLayLLM: An Efficient and Effective Multi-modal Extension of Large Language Models for Text-rich Document Understanding. CoRR abs/2408.15045 (2024) - [i88]Jiapeng Wang, Chengyu Wang, Kunzhe Huang, Jun Huang, Lianwen Jin:
VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models. CoRR abs/2410.00741 (2024) - [i87]Chenfan Qu, Yiwu Zhong, Fengjun Guo, Lianwen Jin:
Omni-IML: Towards Unified Image Manipulation Localization. CoRR abs/2411.14823 (2024) - [i86]Zhenhua Yang, Dezhi Peng, Yongxin Shi, Yuyi Zhang, Chongyu Liu, Lianwen Jin:
Predicting the Original Appearance of Damaged Historical Documents. CoRR abs/2412.11634 (2024) - [i85]Peirong Zhang, Lianwen Jin:
Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach. CoRR abs/2412.11668 (2024) - [i84]Chenfan Qu, Jian Liu, Haoxing Chen, Baihan Yu, Jingjing Liu, Weiqiang Wang, Lianwen Jin:
Explainable Tampered Text Detection via Multimodal Large Models. CoRR abs/2412.14816 (2024)
- 2023
- [j92]Yuliang Liu, Jiaxin Zhang, Dezhi Peng, Mingxin Huang, Xinyu Wang, Jingqun Tang, Can Huang, Dahua Lin, Chunhua Shen, Xiang Bai, Lianwen Jin:
SPTS v2: Single-Point Scene Text Spotting. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 15665-15679 (2023) - [j91]Hongyi Wang, Yang Xue, Jiaxin Zhang, Lianwen Jin:
Scene table structure recognition with segmentation collaboration and alignment. Pattern Recognit. Lett. 165: 146-153 (2023) - [j90]Dezhi Peng, Lianwen Jin, Weihong Ma, Canyu Xie, Hesuo Zhang, Shenggao Zhu, Jing Li:
Recognition of Handwritten Chinese Text by Segmentation: A Segment-Annotation-Free Approach. IEEE Trans. Multim. 25: 2368-2381 (2023) - [j89]Canjie Luo, Yuanzhi Zhu, Lianwen Jin, Zhe Li, Dezhi Peng:
SLOGAN: Handwriting Style Synthesis for Arbitrary-Length and Out-of-Vocabulary Text. IEEE Trans. Neural Networks Learn. Syst. 34(11): 8503-8515 (2023) - [c176]Jiapeng Wang, Chengyu Wang, Xiaodan Wang, Jun Huang, Lianwen Jin:
ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval. ACL (industry) 2023: 71-80 - [c175]Bingyan Liu, Weifeng Lin, Zhongjie Duan, Chengyu Wang, Ziheng Wu, Zhang Zipeng, Kui Jia, Lianwen Jin, Cen Chen, Jun Huang:
Rapid Diffusion: Building Domain-Specific Text-to-Image Synthesizers with Fast Inference Speed. ACL (industry) 2023: 295-304 - [c174]Weifeng Lin, Canyu Xie, Dezhi Peng, Jiapeng Wang, Lianwen Jin, Wei Ding, Cong Yao, Mengchao He:
Building A Mobile Text Recognizer via Truncated SVD-based Knowledge Distillation-Guided NAS. BMVC 2023: 375 - [c173]Chenfan Qu, Chongyu Liu, Yuliang Liu, Xinhong Chen, Dezhi Peng, Fengjun Guo, Lianwen Jin:
Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution. CVPR 2023: 5937-5946 - [c172]Hiuyi Cheng, Peirong Zhang, Sihang Wu, Jiaxin Zhang, Qiyuan Zhu, Zecheng Xie, Jing Li, Kai Ding, Lianwen Jin:
M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis. CVPR 2023: 15138-15147 - [c171]Weifeng Lin, Ziheng Wu, Jiayu Chen, Jun Huang, Lianwen Jin:
Scale-Aware Modulation Meet Transformer. ICCV 2023: 5992-6003 - [c170]Mingxin Huang, Jiaxin Zhang, Dezhi Peng, Hao Lu, Can Huang, Yuliang Liu, Xiang Bai, Lianwen Jin:
ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer. ICCV 2023: 19438-19448 - [c169]Qing Jiang, Jiapeng Wang, Dezhi Peng, Chongyu Liu, Lianwen Jin:
Revisiting Scene Text Recognition: A Data Perspective. ICCV 2023: 20486-20497 - [c168]Zhuoming Li, Fan Peng, Yang Xue, Ni Hao, Lianwen Jin:
Scene Table Structure Recognition with Segmentation and Key Point Collaboration. ICDAR (2) 2023: 295-310 - [c167]Jiarong Huang, Dezhi Peng, Hongliang Li, Hao Ni, Lianwen Jin:
SegCTC: Offline Handwritten Chinese Text Recognition via Better Fusion Between Explicit and Implicit Segmentation. ICDAR (4) 2023: 332-349 - [c166]Haiyang Li, Chongyu Liu, Jiapeng Wang, Mingxin Huang, Weiying Zhou, Lianwen Jin:
DTDT: Highly Accurate Dense Text Line Detection in Historical Documents via Dynamic Transformer. ICDAR (1) 2023: 381-396 - [c165]Cheng Jian, Lianwen Jin, Lingyu Liang, Chongyu Liu:
HisDoc R-CNN: Robust Chinese Historical Document Text Line Detection with Dynamic Rotational Proposal Network and Iterative Attention Head. ICDAR (1) 2023: 428-445 - [c164]Liufeng Huang, Bangdong Chen, Chongyu Liu, Dezhi Peng, Weiying Zhou, Yaqiang Wu, Hui Li, Hao Ni, Lianwen Jin:
EnsExam: A Dataset for Handwritten Text Erasure on Examination Papers. ICDAR (3) 2023: 470-485 - [c163]Chenyang Gao, Yuliang Liu, Shiyu Yao, Jinfeng Bai, Xiang Bai, Lianwen Jin, Cheng-Lin Liu:
ICDAR 2023 Competition on Recognition of Multi-line Handwritten Mathematical Expressions. ICDAR (2) 2023: 566-576 - [c162]Dongliang Luo, Yu Zhou, Rui Yang, Yuliang Liu, Xianjin Liu, Jishen Zeng, Enming Zhang, Biao Yang, Ziming Huang, Lianwen Jin, Xiang Bai:
ICDAR 2023 Competition on Detecting Tampered Text in Images. ICDAR (2) 2023: 587-600 - [c161]Wentao Yang, Zhe Li, Dezhi Peng, Lianwen Jin, Mengchao He, Cong Yao:
Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition. ACM Multimedia 2023: 2066-2077 - [c160]Yongxin Shi, Chongyu Liu, Dezhi Peng, Cheng Jian, Jiarong Huang, Lianwen Jin:
M5HisDoc: A Large-scale Multi-style Chinese Historical Document Analysis Benchmark. NeurIPS 2023 - [c159]Zongyuan Jiang, Jiapeng Wang, Jiahuan Cao, Xue Gao, Lianwen Jin:
Towards Better Translations from Classical to Modern Chinese: A New Dataset and a New Method. NLPCC (1) 2023: 387-399 - [i83]Yuliang Liu, Jiaxin Zhang, Dezhi Peng, Mingxin Huang, Xinyu Wang, Jingqun Tang, Can Huang, Dahua Lin, Chunhua Shen, Xiang Bai, Lianwen Jin:
SPTS v2: Single-Point Scene Text Spotting. CoRR abs/2301.01635 (2023) - [i82]Yuliang Liu, Zhang Li, Hongliang Li, Wenwen Yu, Mingxin Huang, Dezhi Peng, Mingyu Liu, Mingrui Chen, Chunyuan Li, Lianwen Jin, Xiang Bai:
On the Hidden Mystery of OCR in Large Multimodal Models. CoRR abs/2305.07895 (2023) - [i81]Hiuyi Cheng, Peirong Zhang, Sihang Wu, Jiaxin Zhang, Qiyuan Zhu, Zecheng Xie, Jing Li, Kai Ding, Lianwen Jin:
M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis. CoRR abs/2305.08719 (2023) - [i80]Jiapeng Wang, Chengyu Wang, Xiaodan Wang, Jun Huang, Lianwen Jin:
ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval. CoRR abs/2305.17652 (2023) - [i79]Jiaxin Zhang, Bangdong Chen, Hiuyi Cheng, Lianwen Jin, Kai Ding, Fengjun Guo:
DocAligner: Annotating Real-world Photographic Document Images by Simply Taking Pictures. CoRR abs/2306.05749 (2023) - [i78]Dezhi Peng, Chongyu Liu, Yuliang Liu, Lianwen Jin:
ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining. CoRR abs/2306.12106 (2023) - [i77]Weifeng Lin, Ziheng Wu, Jiayu Chen, Jun Huang, Lianwen Jin:
Scale-Aware Modulation Meet Transformer. CoRR abs/2307.08579 (2023) - [i76]Qing Jiang, Jiapeng Wang, Dezhi Peng, Chongyu Liu, Lianwen Jin:
Revisiting Scene Text Recognition: A Data Perspective. CoRR abs/2307.08723 (2023) - [i75]Mingxin Huang, Jiaxin Zhang, Dezhi Peng, Hao Lu, Can Huang, Yuliang Liu, Xiang Bai, Lianwen Jin:
ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer. CoRR abs/2308.10147 (2023) - [i74]Weifeng Lin, Ziheng Wu, Jiayu Chen, Wentao Yang, Mingxin Huang, Jun Huang, Lianwen Jin:
Hierarchical Side-Tuning for Vision Transformers. CoRR abs/2310.05393 (2023) - [i73]Yongxin Shi, Dezhi Peng, Wenhui Liao, Zening Lin, Xinhong Chen, Chongyu Liu, Yuyi Zhang, Lianwen Jin:
Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation. CoRR abs/2310.16809 (2023) - [i72]Dezhi Peng, Zhenhua Yang, Jiaxin Zhang, Chongyu Liu, Yongxin Shi, Kai Ding, Fengjun Guo, Lianwen Jin:
UPOCR: Towards Unified Pixel-Level OCR Interface. CoRR abs/2312.02694 (2023) - [i71]Zhenhua Yang, Dezhi Peng, Yuxin Kong, Yuyi Zhang, Cong Yao, Lianwen Jin:
FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning. CoRR abs/2312.12142 (2023)
- 2022
- [j88]Xiaoxue Chen, Lianwen Jin, Yuanzhi Zhu, Canjie Luo, Tianwei Wang:
Text Recognition in the Wild: A Survey. ACM Comput. Surv. 54(2): 42:1-42:35 (2022) - [j87]Dezhi Peng, Lianwen Jin, Yuliang Liu, Canjie Luo, Songxuan Lai:
PageNet: Towards End-to-End Weakly Supervised Page-Level Handwritten Chinese Text Recognition. Int. J. Comput. Vis. 130(11): 2623-2645 (2022) - [j86]Jiajia Jiang, Songxuan Lai, Lianwen Jin, Yecheng Zhu, Jiaxin Zhang, Bangdong Chen:
Forgery-free signature verification with stroke-aware cycle-consistent generative adversarial network. Neurocomputing 507: 345-357 (2022) - [j85]Songxuan Lai, Lianwen Jin, Yecheng Zhu, Zhe Li, Luojun Lin:
SynSig2Vec: Forgery-Free Learning of Dynamic Signature Representations by Sigma Lognormal-Based Synthesis and 1D CNN. IEEE Trans. Pattern Anal. Mach. Intell. 44(10): 6472-6485 (2022) - [j84]Yuliang Liu, Chunhua Shen, Lianwen Jin, Tong He, Peng Chen, Chongyu Liu, Hao Chen:
ABCNet v2: Adaptive Bezier-Curve Network for Real-Time End-to-End Text Spotting. IEEE Trans. Pattern Anal. Mach. Intell. 44(11): 8048-8064 (2022) - [j83]Ying Cai, Yuliang Liu, Chunhua Shen, Lianwen Jin, Yidong Li, Daji Ergu:
Arbitrarily shaped scene text detection with dynamic convolution. Pattern Recognit. 127: 108608 (2022) - [j82]Ruben Tolosana, Rubén Vera-Rodríguez, Carlos Gonzalez-Garcia, Julian Fiérrez, Aythami Morales, Javier Ortega-Garcia, Juan-Carlos Ruiz-Garcia, Sergio Romero-Tapiador, Santiago Rengifo, Miguel Caruana, Jiajia Jiang, Songxuan Lai, Lianwen Jin, Yecheng Zhu, Javier Galbally, Moisés Díaz, Miguel Ángel Ferrer, Marta Gomez-Barrero, Ilya A. Hodashinsky, Konstantin S. Sarin, Artem Slezkin, Marina Bardamova, Mikhail Svetlakov, Mohammad Saleem, Cintia Lia Szücs, Bence Kovári, Falk Pulsmeyer, Mohamad Wehbi, Dario Zanca, Sumaiya Ahmad, Sarthak Mishra, Suraiya Jabin:
SVC-onGoing: Signature verification competition. Pattern Recognit. 127: 108609 (2022) - [j81]