Остановите войну!
for scientists:
default search action
Cha Zhang
Publications
- 2023
- [c87]Minghao Li, Tengchao Lv, Jingye Chen, Lei Cui, Yijuan Lu, Dinei A. F. Florêncio, Cha Zhang, Zhoujun Li, Furu Wei:
TrOCR: Transformer-Based Optical Character Recognition with Pre-trained Models. AAAI 2023: 13094-13102 - [c86]Li Sun, Florian Luisier, Kayhan Batmanghelich, Dinei A. F. Florêncio, Cha Zhang:
From Characters to Words: Hierarchical Pre-trained Language Model for Open-vocabulary Language Understanding. ACL (1) 2023: 3605-3620 - [c84]Liu He, Yijuan Lu, John Corring, Dinei A. F. Florêncio, Cha Zhang:
Diffusion-Based Document Layout Generation. ICDAR (1) 2023: 361-378 - [i20]Liu He, Yijuan Lu, John Corring, Dinei A. F. Florêncio, Cha Zhang:
Diffusion-based Document Layout Generation. CoRR abs/2303.10787 (2023) - [i19]Li Sun, Florian Luisier, Kayhan Batmanghelich, Dinei A. F. Florêncio, Cha Zhang:
From Characters to Words: Hierarchical Pre-trained Language Model for Open-vocabulary Language Understanding. CoRR abs/2305.14571 (2023) - 2022
- [c83]Yiheng Xu, Tengchao Lv, Lei Cui, Guoxin Wang, Yijuan Lu, Dinei A. F. Florêncio, Cha Zhang, Furu Wei:
XFUND: A Benchmark Dataset for Multilingual Visually Rich Form Understanding. ACL (Findings) 2022: 3214-3224 - [c81]Guoxin Wang, Yijuan Lu, Lei Cui, Tengchao Lv, Dinei A. F. Florêncio, Cha Zhang:
A Simple yet Effective Learnable Positional Encoding Method for Improving Document Transformer Model. AACL/IJCNLP (Findings) 2022: 453-463 - [i16]Hai Pham, Guoxin Wang, Yijuan Lu, Dinei A. F. Florêncio, Cha Zhang:
Understanding Long Documents with Different Position-Aware Attentions. CoRR abs/2208.08201 (2022) - 2021
- [c79]Yang Xu, Yiheng Xu, Tengchao Lv, Lei Cui, Furu Wei, Guoxin Wang, Yijuan Lu, Dinei A. F. Florêncio, Cha Zhang, Wanxiang Che, Min Zhang, Lidong Zhou:
LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding. ACL/IJCNLP (1) 2021: 2579-2591 - [c77]Zhengyuan Yang, Yijuan Lu, Jianfeng Wang, Xi Yin, Dinei Florêncio, Lijuan Wang, Cha Zhang, Lei Zhang, Jiebo Luo:
TAP: Text-Aware Pre-Training for Text-VQA and Text-Caption. CVPR 2021: 8751-8761 - [i13]Yiheng Xu, Tengchao Lv, Lei Cui, Guoxin Wang, Yijuan Lu, Dinei Florêncio, Cha Zhang, Furu Wei:
LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding. CoRR abs/2104.08836 (2021) - [i12]Minghao Li, Tengchao Lv, Lei Cui, Yijuan Lu, Dinei A. F. Florêncio, Cha Zhang, Zhoujun Li, Furu Wei:
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models. CoRR abs/2109.10282 (2021) - [i11]Baoguang Shi, Wenfeng Cheng, Yijuan Lu, Cha Zhang, Dinei A. F. Florêncio:
Improving Structured Text Recognition with Regular Expression Biasing. CoRR abs/2111.06738 (2021) - 2020
- [i8]Zhengyuan Yang, Yijuan Lu, Jianfeng Wang, Xi Yin, Dinei A. F. Florêncio, Lijuan Wang, Cha Zhang, Lei Zhang, Jiebo Luo:
TAP: Text-Aware Pre-training for Text-VQA and Text-Caption. CoRR abs/2012.04638 (2020) - [i7]Yang Xu, Yiheng Xu, Tengchao Lv, Lei Cui, Furu Wei, Guoxin Wang, Yijuan Lu, Dinei A. F. Florêncio, Cha Zhang, Wanxiang Che, Min Zhang, Lidong Zhou:
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding. CoRR abs/2012.14740 (2020) - 2019
- [c73]Aaditya Prakash, James A. Storer, Dinei A. F. Florêncio, Cha Zhang:
RePr: Improved Training of Convolutional Filters. CVPR 2019: 10666-10675 - 2018
- [i4]Aaditya Prakash, James A. Storer, Dinei A. F. Florêncio, Cha Zhang:
RePr: Improved Training of Convolutional Filters. CoRR abs/1811.07275 (2018) - 2016
- [j26]Pengfei Wan, Gene Cheung, Dinei A. F. Florêncio, Cha Zhang, Oscar C. Au:
Image Bit-Depth Enhancement via Maximum A Posteriori Estimation of AC Signal. IEEE Trans. Image Process. 25(6): 2896-2909 (2016) - 2015
- [j24]Pengfei Wan, Gene Cheung, Philip A. Chou, Dinei A. F. Florêncio, Cha Zhang, Oscar C. Au:
Precision Enhancement of 3-D Surfaces from Compressed Multiview Depth Maps. IEEE Signal Process. Lett. 22(10): 1676-1680 (2015) - 2014
- [j21]Wenxiu Sun, Gene Cheung, Philip A. Chou, Dinei A. F. Florêncio, Cha Zhang, Oscar C. Au:
Rate-Constrained 3D Surface Estimation From Noise-Corrupted Multiview Depth Videos. IEEE Trans. Image Process. 23(7): 3138-3151 (2014) - [c66]Cha Zhang, Dinei A. F. Florêncio, Charles T. Loop:
Point cloud attribute compression with graph transform. ICIP 2014: 2066-2070 - [c65]Pengfei Wan, Gene Cheung, Dinei A. F. Florêncio, Cha Zhang, Oscar C. Au:
Image bit-depth enhancement via maximum-a-posteriori estimation of graph AC component. ICIP 2014: 4052-4056 - [i1]Pengfei Wan, Gene Cheung, Philip A. Chou, Dinei A. F. Florêncio, Cha Zhang, Oscar C. Au:
Precision Enhancement of 3D Surfaces from Multiple Compressed Depth Maps. CoRR abs/1405.2062 (2014) - 2013
- [j18]Cha Zhang, Dinei A. F. Florêncio:
Analyzing the Optimality of Predictive Transform Coding Using Graph-Based Models. IEEE Signal Process. Lett. 20(1): 106-109 (2013) - [c56]Wenxiu Sun, Gene Cheung, Philip A. Chou, Dinei A. F. Florêncio, Cha Zhang, Oscar C. Au:
Rate-distortion optimized 3D reconstruction from noise-corrupted multiview depth videos. ICME 2013: 1-6 - [c55]Pengfei Wan, Gene Cheung, Philip A. Chou, Dinei Florêncio, Cha Zhang, Oscar C. Au:
Precision enhancement of 3D surfaces from multiple quantized depth maps. IVMSP 2013: 1-4 - 2012
- [j16]Flavio P. Ribeiro, Dinei A. F. Florêncio, Demba E. Ba, Cha Zhang:
Geometrically Constrained Room Modeling With Compact Microphone Arrays. IEEE Trans. Speech Audio Process. 20(5): 1449-1460 (2012) - 2011
- [j15]Cha Zhang, Dinei A. F. Florêncio, Zhengyou Zhang:
Improving Immersive Experiences in Telecommunication with Motion Parallax [Applications Corner]. IEEE Signal Process. Mag. 28(1): 139-144 (2011) - [j14]Myung-Suk Song, Cha Zhang, Dinei A. F. Florêncio, Hong-Goo Kang:
An Interactive 3-D Audio System With Loudspeakers. IEEE Trans. Multim. 13(5): 844-855 (2011) - [c51]Flavio P. Ribeiro, Dinei A. F. Florêncio, Cha Zhang, Michael L. Seltzer:
CROWDMOS: An approach for crowdsourcing mean opinion score studies. ICASSP 2011: 2416-2419 - 2010
- [j13]Flavio P. Ribeiro, Cha Zhang, Dinei A. F. Florêncio, Demba E. Ba:
Using Reverberation to Improve Range and Elevation Discrimination for Small Array Sound Source Localization. IEEE Trans. Speech Audio Process. 18(7): 1781-1792 (2010) - [c46]Demba E. Ba, Flavio P. Ribeiro, Cha Zhang, Dinei A. F. Florêncio:
L1 regularized room modeling with compact microphone arrays. ICASSP 2010: 157-160 - [c45]Flavio P. Ribeiro, Demba E. Ba, Cha Zhang, Dinei A. F. Florêncio:
Turning enemies into friends: Using reflections to improve sound source localization. ICME 2010: 731-736 - [c44]Myung-Suk Song, Cha Zhang, Dinei A. F. Florêncio, Hong-Goo Kang:
Personal 3D audio system with loudspeakers. ICME 2010: 1600-1605 - [c43]Myung-Suk Song, Cha Zhang, Dinei A. F. Florêncio, Hong-Goo Kang:
Enhancing loudspeaker-based 3D audio with room modeling. MMSP 2010: 34-39 - [c42]Cha Zhang, Dinei Florêncio:
Joint tracking and multiview video compression. VCIP 2010: 77440P - 2009
- [c39]Dinei A. F. Florêncio, Cha Zhang:
Multiview video compression and streaming based on predicted viewer position. ICASSP 2009: 657-660 - [c37]Cha Zhang, Zhaozheng Yin, Dinei A. F. Florêncio:
Improving depth perception with motion parallax and its application in teleconferencing. MMSP 2009: 1-6 - 2008
- [j10]Cha Zhang, Dinei A. F. Florêncio, Demba E. Ba, Zhengyou Zhang:
Maximum Likelihood Sound Source Localization and Beamforming for Directional Microphone Arrays in Distributed Meetings. IEEE Trans. Multim. 10(3): 538-548 (2008) - [c35]Cha Zhang, Dinei A. F. Florêncio, Zhengyou Zhang:
Why does PHAT work well in lownoise, reverberative environments? ICASSP 2008: 2565-2568 - 2007
- [c32]Cha Zhang, Zhengyou Zhang, Dinei A. F. Florêncio:
Maximum Likelihood Sound Source Localization for Multiple Directional Microphones. ICASSP (1) 2007: 125-128 - [c30]Demba E. Ba, Dinei A. F. Florêncio, Cha Zhang:
Enhanced MVDR Beamforming for Arrays of Directional Microphones. ICME 2007: 1307-1310
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-05-09 00:00 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint