


Остановите войну!
for scientists:


default search action
Zhou Zhao
Person information

Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [b1]Zhou Zhao:
Heart Segmentation and Evaluation of Fibrosis. (Segmentation cardiaque et évaluation de la fibrose). Sorbonne University, Paris, France, 2023 - [j64]Zhou Zhao, Qingkai Guo, Yu Sun
, Ningli An, Pengzhe Hui, Laihao Yang, Xuefeng Chen:
Bioinspired Hierarchical Structure for an Ultrawide-Range Multifunctional Flexible Sensor Using Porous Expandable Polyethylene/Loofah-Like Polyurethane Sponge Material. Adv. Intell. Syst. 5(1) (2023) - [i110]Rongjie Huang, Jiawei Huang, Dongchao Yang, Yi Ren, Luping Liu, Mingze Li, Zhenhui Ye, Jinglin Liu, Xiang Yin, Zhou Zhao:
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models. CoRR abs/2301.12661 (2023) - [i109]Zhenhui Ye, Ziyue Jiang, Yi Ren, Jinglin Liu, Jinzheng He, Zhou Zhao:
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis. CoRR abs/2301.13430 (2023) - [i108]Zijian Zhang, Zhou Zhao, Jun Yu, Qi Tian:
ShiftDDPMs: Exploring Conditional Diffusion Models by Shifting Diffusion Trajectories. CoRR abs/2302.02373 (2023) - [i107]Xize Cheng, Linjun Li, Tao Jin, Rongjie Huang, Wang Lin, Zehan Wang, Huangdai Liu, Ye Wang, Aoxiong Yin, Zhou Zhao:
MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition. CoRR abs/2303.05309 (2023) - [i106]Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
Overview of the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG). CoRR abs/2303.13932 (2023) - [i105]Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
MUG: A General Meeting Understanding and Generation Benchmark. CoRR abs/2303.13939 (2023) - [i104]Haoyuan Li, Hao Jiang, Tao Jin, Mengyan Li, Yan Chen, Zhijie Lin, Yang Zhao, Zhou Zhao:
DATE: Domain Adaptive Product Seeker for E-commerce. CoRR abs/2304.03669 (2023) - [i103]Jiong Wang, Zhou Zhao, Fei Wu:
Set-Based Face Recognition Beyond Disentanglement: Burstiness Suppression With Variance Vocabulary. CoRR abs/2304.06249 (2023) - [i102]Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, Zhenhui Ye, Yuning Wu, Zhiqing Hong, Jiawei Huang, Jinglin Liu, Yi Ren, Zhou Zhao, Shinji Watanabe:
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head. CoRR abs/2304.12995 (2023) - [i101]Zhenhui Ye, Jinzheng He, Ziyue Jiang, Rongjie Huang, Jiawei Huang, Jinglin Liu, Yi Ren, Xiang Yin, Zejun Ma, Zhou Zhao:
GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation. CoRR abs/2305.00787 (2023) - [i100]Dong Yao, Shengyu Zhang, Zhou Zhao, Jieming Zhu, Wenqiao Zhang, Rui Zhang, Xiaofei He, Fei Wu:
Denoising Multi-modal Sequential Recommenders with Contrastive Learning. CoRR abs/2305.01915 (2023) - [i99]Zhou Yu, Lixiang Zheng, Zhou Zhao, Fei Wu, Jianping Fan, Kui Ren, Jun Yu:
ANetQA: A Large-scale Benchmark for Fine-grained Compositional Reasoning over Untrimmed Videos. CoRR abs/2305.02519 (2023) - [i98]Ruiqi Li, Rongjie Huang, Lichao Zhang, Jinglin Liu, Zhou Zhao:
AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment. CoRR abs/2305.04476 (2023) - [i97]Jinzheng He, Jinglin Liu, Zhenhui Ye, Rongjie Huang, Chenye Cui, Huadai Liu, Zhou Zhao:
RMSSinger: Realistic-Music-Score based Singing Voice Synthesis. CoRR abs/2305.10686 (2023) - [i96]Zhenhui Ye, Rongjie Huang, Yi Ren, Ziyue Jiang, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao:
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training. CoRR abs/2305.10763 (2023) - [i95]Huadai Liu, Rongjie Huang, Jinzheng He, Gang Sun, Ran Shen, Xize Cheng, Zhou Zhao:
Wav2SQL: Direct Generalizable Speech-To-SQL Parsing. CoRR abs/2305.12552 (2023) - [i94]Huadai Liu, Rongjie Huang, Xuan Lin, Wenqiang Xu, Maozong Zheng, Hong Chen, Jinzheng He, Zhou Zhao:
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer. CoRR abs/2305.12708 (2023) - 2022
- [j63]Tao Jin, Zhou Zhao, Peng Wang, Jun Yu, Fei Wu:
Interaction augmented transformer with decoupled decoding for video captioning. Neurocomputing 492: 496-507 (2022) - [j62]Pengcheng Zhang
, Zhou Zhao
, Nannan Wang
, Jun Yu
, Fei Wu
:
Local-Global Graph Pooling via Mutual Information Maximization for Video-Paragraph Retrieval. IEEE Trans. Circuits Syst. Video Technol. 32(10): 7133-7146 (2022) - [j61]Jingkuan Song
, Jingqiu Zhang, Lianli Gao
, Zhou Zhao
, Heng Tao Shen
:
AgeGAN++: Face Aging and Rejuvenation With Dual Conditional GANs. IEEE Trans. Multim. 24: 791-804 (2022) - [j60]Zhaoyu Guo
, Zhou Zhao
, Weike Jin
, Dazhou Wang, Ruitao Liu, Jun Yu
:
TaoHighlight: Commodity-Aware Multi-Modal Video Highlight Detection in E-Commerce. IEEE Trans. Multim. 24: 2606-2616 (2022) - [j59]Wenhua Wang, Yuqun Zhang, Yulei Sui
, Yao Wan, Zhou Zhao
, Jian Wu, Philip S. Yu
, Guandong Xu
:
Reinforcement-Learning-Guided Source Code Summarization Using Hierarchical Attention. IEEE Trans. Software Eng. 48(2): 102-119 (2022) - [c175]Jinzheng He, Zhou Zhao, Yi Ren, Jinglin Liu, Baoxing Huai, Nicholas Jing Yuan:
Flow-Based Unconstrained Lip to Speech Generation. AAAI 2022: 843-851 - [c174]Jinglin Liu, Zhiying Zhu, Yi Ren, Wencan Huang, Baoxing Huai, Nicholas Jing Yuan, Zhou Zhao:
Parallel and High-Fidelity Text-to-Lip Generation. AAAI 2022: 1738-1746 - [c173]Jinglin Liu, Chengxi Li, Yi Ren, Feiyang Chen, Zhou Zhao:
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism. AAAI 2022: 11020-11028 - [c172]Tao Jin, Zhou Zhao, Meng Zhang, Xingshan Zeng:
Prior Knowledge and Memory Enriched Transformer for Sign Language Translation. ACL (Findings) 2022: 3766-3775 - [c171]Jinglin Liu, Chengxi Li, Yi Ren, Zhiying Zhu, Zhou Zhao:
Learning the Beauty in Songs: Neural Singing Voice Beautifier. ACL (1) 2022: 7970-7983 - [c170]Yi Ren, Xu Tan, Tao Qin, Zhou Zhao, Tie-Yan Liu:
Revisiting Over-Smoothness in Text to Speech. ACL (1) 2022: 8197-8213 - [c169]Mengze Li, Tianbao Wang, Haoyu Zhang, Shengyu Zhang, Zhou Zhao, Jiaxu Miao, Wenqiao Zhang, Wenming Tan
, Jin Wang, Peng Wang, Shiliang Pu, Fei Wu:
End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding. ACL (1) 2022: 8707-8717 - [c168]Wenwen Pan, Haonan Shi, Zhou Zhao, Jieming Zhu, Xiuqiang He, Zhigeng Pan, Lianli Gao, Jun Yu, Fei Wu, Qi Tian:
Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross- Modal Denoising Networks. CVPR 2022: 1310-1321 - [c167]Aoxiong Yin, Zhou Zhao, Weike Jin, Meng Zhang, Xingshan Zeng, Xiaofei He:
MLSLT: Towards Multilingual Sign Language Translation. CVPR 2022: 5099-5109 - [c166]Xinyu Lyu, Lianli Gao, Yuyu Guo, Zhou Zhao, Hao Huang, Heng Tao Shen, Jingkuan Song:
Fine-Grained Predicates Learning for Scene Graph Generation. CVPR 2022: 19445-19453 - [c165]Yan Xia, Zhou Zhao:
Cross-modal Background Suppression for Audio-Visual Event Localization. CVPR 2022: 19957-19966 - [c164]Lichao Zhang, Yi Ren, Liqun Deng, Zhou Zhao:
HiFiDenoise: High-Fidelity Denoising Text to Speech with Adversarial Networks. ICASSP 2022: 7232-7236 - [c163]Yi Ren, Ming Lei, Zhiying Huang, Shiliang Zhang, Qian Chen, Zhijie Yan, Zhou Zhao:
Prosospeech: Enhancing Prosody with Quantized Vector Pre-Training in Text-To-Speech. ICASSP 2022: 7577-7581 - [c162]Luping Liu, Yi Ren, Zhijie Lin, Zhou Zhao:
Pseudo Numerical Methods for Diffusion Models on Manifolds. ICLR 2022 - [c161]Rongjie Huang, Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu, Yi Ren, Zhou Zhao:
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis. IJCAI 2022: 4157-4163 - [c160]Zhenhui Ye, Zhou Zhao, Yi Ren, Fei Wu:
SyntaSpeech: Syntax-Aware Generative Adversarial Text-to-Speech. IJCAI 2022: 4468-4474 - [c159]Lichao Zhang, Zhou Zhao, Yi Ren, Liqun Deng:
EditSinger: Zero-Shot Text-Based Singing Voice Editing System with Diverse Prosody Modeling. IJCAI 2022: 4503-4509 - [c158]Zhou Zhao, Zhenyu Lu:
Multi-purpose Tactile Perception Based on Deep Learning in a New Tendon-driven Optical Tactile Sensor. IROS 2022: 2099-2104 - [c157]Yuxiao Lin, Zhihao Du, Shiliang Zhang, Fan Yu, Zhou Zhao, Fei Wu:
Separate-to-Recognize: Joint Multi-target Speech Separation and Speech Recognition for Speaker-attributed ASR. ISCSLP 2022: 150-154 - [c156]Rongjie Huang, Chenye Cui, Feiyang Chen, Yi Ren, Jinglin Liu, Zhou Zhao, Baoxing Huai, Zhefeng Wang:
SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation. ACM Multimedia 2022: 2525-2535 - [c155]Rongjie Huang, Zhou Zhao, Huadai Liu, Jinglin Liu, Chenye Cui, Yi Ren:
ProDiff: Progressive Fast Diffusion Model for High-Quality Text-to-Speech. ACM Multimedia 2022: 2595-2605 - [c154]Mengze Li, Tianbao Wang, Haoyu Zhang, Shengyu Zhang, Zhou Zhao, Wenqiao Zhang, Jiaxu Miao, Shiliang Pu, Fei Wu:
HERO: HiErarchical spatio-tempoRal reasOning with Contrastive Action Correspondence for End-to-End Video Object Grounding. ACM Multimedia 2022: 3801-3810 - [c153]Tao Jin, Zhou Zhao, Meng Zhang, Xingshan Zeng:
MC-SLT: Towards Low-Resource Signer-Adaptive Sign Language Translation. ACM Multimedia 2022: 4939-4947 - [c152]Yan Xia, Zhou Zhao, Shangwei Ye, Yang Zhao, Haoyuan Li, Yi Ren:
Video-Guided Curriculum Learning for Spoken Video Grounding. ACM Multimedia 2022: 5191-5200 - [c151]Ziqi Jiang, Shengyu Zhang, Siyuan Yao, Wenqiao Zhang, Sihan Zhang, Juncheng Li, Zhou Zhao, Fei Wu:
Weakly-supervised Disentanglement Network for Video Fingerspelling Detection. ACM Multimedia 2022: 5446-5455 - [c150]Wencan Huang, Zhou Zhao, Jinzheng He, Mingmin Zhang:
DualSign: Semi-Supervised Sign Language Production with Balanced Multi-Modal Multi-Task Dual Transformation. ACM Multimedia 2022: 5486-5495 - [c149]Yongqi Wang, Zhou Zhao:
FastLTS: Non-Autoregressive End-to-End Unconstrained Lip-to-Speech Synthesis. ACM Multimedia 2022: 5678-5687 - [c148]Jiong Wang, Zhou Zhao, Fei Wu:
Set-Based Face Recognition Beyond Disentanglement: Burstiness Suppression With Variance Vocabulary. ACM Multimedia 2022: 6125-6135 - [c147]Rongjie Huang, Yi Ren, Jinglin Liu, Chenye Cui, Zhou Zhao:
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech. NeurIPS 2022 - [c146]Ziyue Jiang, Su Zhe, Zhou Zhao, Qian Yang, Yi Ren, Jinglin Liu:
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech. NeurIPS 2022 - [c145]Lichao Zhang, Ruiqi Li, Shoutong Wang, Liqun Deng, Jinglin Liu, Yi Ren, Jinzheng He, Rongjie Huang, Jieming Zhu, Xiao Chen, Zhou Zhao:
M4Singer: A Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus. NeurIPS 2022 - [c144]Zijian Zhang, Zhou Zhao, Zhijie Lin:
Unsupervised Representation Learning from Pre-trained Diffusion Probabilistic Models. NeurIPS 2022 - [c143]Yang Zhao, Chen Zhang, Haifeng Huang, Haoyuan Li, Zhou Zhao:
Towards Effective Multi-Modal Interchanges in Zero-Resource Sounding Object Localization. NeurIPS 2022 - [c142]Ziqi Tan, Shengyu Zhang, Nuanxin Hong, Kun Kuang, Yifan Yu, Jin Yu, Zhou Zhao, Hongxia Yang, Shiyuan Pan, Jingren Zhou, Fei Wu:
Uncovering Causal Effects of Online Short Videos on Consumer Behaviors. WSDM 2022: 997-1006 - [c141]Shengyu Zhang, Lingxiao Yang, Dong Yao, Yujie Lu, Fuli Feng, Zhou Zhao, Tat-Seng Chua, Fei Wu:
Re4: Learning to Re-contrast, Re-attend, Re-construct for Multi-interest Recommendation. WWW 2022: 2216-2226 - [c140]Dong Yao, Zhou Zhao, Shengyu Zhang, Jieming Zhu, Yudong Zhu, Rui Zhang, Xiuqiang He:
Contrastive Learning with Positive-Negative Frame Mask for Music Representation. WWW 2022: 2906-2915 - [i93]Lei Li, Fuping Wu, Sihan Wang, Xinzhe Luo, Carlos Martín-Isla, Shuwei Zhai, Jianpeng Zhang, Yanfei Liu, Zhen Zhang, Markus J. Ankenbrand, Haochuan Jiang, Xiaoran Zhang, Linhong Wang, Tewodros Weldebirhan Arega, Elif Altunok, Zhou Zhao, Feiyan Li, Jun Ma, Xiaoping Yang, Élodie Puybareau, Ilkay Öksüz, Stéphanie Bricq, Weisheng Li, Kumaradevan Punithakumar, Sotirios A. Tsaftaris, Laura Maria Schreiber, Mingjing Yang, Guocai Liu, Yong Xia, Guotai Wang, Sergio Escalera, Xiahai Zhuang:
MyoPS: A Benchmark of Myocardial Pathology Segmentation Combining Three-Sequence Cardiac Magnetic Resonance Images. CoRR abs/2201.03186 (2022) - [i92]Shoutong Wang, Jinglin Liu, Yi Ren, Zhen Wang, Changliang Xu, Zhou Zhao:
MR-SVS: Singing Voice Synthesis with Multi-Reference Encoder. CoRR abs/2201.03864 (2022) - [i91]Yi Ren, Ming Lei, Zhiying Huang, Shiliang Zhang, Qian Chen, Zhijie Yan, Zhou Zhao:
ProsoSpeech: Enhancing Prosody With Quantized Vector Pre-training in Text-to-Speech. CoRR abs/2202.07816 (2022) - [i90]Luping Liu, Yi Ren, Zhijie Lin, Zhou Zhao:
Pseudo Numerical Methods for Diffusion Models on Manifolds. CoRR abs/2202.09778 (2022) - [i89]Jiong Wang, Zhou Zhao, Weike Jin, Xinyu Duan, Zhen Lei, Baoxing Huai, Yiling Wu, Xiaofei He:
VLAD-VSA: Cross-Domain Face Presentation Attack Detection with Vocabulary Separation and Adaptation. CoRR abs/2202.10301 (2022) - [i88]Yi Ren, Xu Tan, Tao Qin, Zhou Zhao, Tie-Yan Liu:
Revisiting Over-Smoothness in Text to Speech. CoRR abs/2202.13066 (2022) - [i87]Jinglin Liu, Chengxi Li, Yi Ren, Zhiying Zhu, Zhou Zhao:
Learning the Beauty in Songs: Neural Singing Voice Beautifier. CoRR abs/2202.13277 (2022) - [i86]Mengze Li, Tianbao Wang, Haoyu Zhang, Shengyu Zhang, Zhou Zhao, Jiaxu Miao, Wenqiao Zhang, Wenming Tan, Jin Wang, Peng Wang, Shiliang Pu, Fei Wu:
End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding. CoRR abs/2203.08013 (2022) - [i85]Dong Yao, Zhou Zhao, Shengyu Zhang, Jieming Zhu, Yudong Zhu, Rui Zhang, Xiuqiang He:
Contrastive Learning with Positive-Negative Frame Mask for Music Representation. CoRR abs/2203.09129 (2022) - [i84]Xinyu Lyu, Lianli Gao, Yuyu Guo, Zhou Zhao, Hao Huang, Heng Tao Shen, Jingkuan Song:
Fine-Grained Predicates Learning for Scene Graph Generation. CoRR abs/2204.02597 (2022) - [i83]Rongjie Huang, Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu, Yi Ren, Zhou Zhao:
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis. CoRR abs/2204.09934 (2022) - [i82]Zhenhui Ye, Zhou Zhao, Yi Ren, Fei Wu:
SyntaSpeech: Syntax-Aware Generative Adversarial Text-to-Speech. CoRR abs/2204.11792 (2022) - [i81]Rongjie Huang, Yi Ren, Jinglin Liu, Chenye Cui, Zhou Zhao:
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech Synthesis. CoRR abs/2205.07211 (2022) - [i80]Rongjie Huang, Zhou Zhao, Jinglin Liu, Huadai Liu, Yi Ren, Lichao Zhang, Jinzheng He:
TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation. CoRR abs/2205.12523 (2022) - [i79]Ziyue Jiang, Su Zhe, Zhou Zhao, Qian Yang, Yi Ren, Jinglin Liu, Zhenhui Ye:
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech. CoRR abs/2206.02147 (2022) - [i78]Yang Zhao, Xuan Lin, Wenqiang Xu, Maozong Zheng, Zhengyong Liu, Zhou Zhao:
AntPivot: Livestream Highlight Detection via Hierarchical Attention Mechanism. CoRR abs/2206.04888 (2022) - [i77]Yongqi Wang, Zhou Zhao:
FastLTS: Non-Autoregressive End-to-End Unconstrained Lip-to-Speech Synthesis. CoRR abs/2207.03800 (2022) - [i76]Rongjie Huang, Zhou Zhao, Huadai Liu, Jinglin Liu, Chenye Cui, Yi Ren:
ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech. CoRR abs/2207.06389 (2022) - [i75]Mengze Li, Tianbao Wang, Haoyu Zhang, Shengyu Zhang, Zhou Zhao, Wenqiao Zhang, Jiaxu Miao, Shiliang Pu, Fei Wu:
HERO: HiErarchical spatio-tempoRal reasOning with Contrastive Action Correspondence for End-to-End Video Object Grounding. CoRR abs/2208.05818 (2022) - [i74]Shengyu Zhang, Lingxiao Yang, Dong Yao, Yujie Lu, Fuli Feng, Zhou Zhao, Tat-Seng Chua, Fei Wu:
Re4: Learning to Re-contrast, Re-attend, Re-construct for Multi-interest Recommendation. CoRR abs/2208.08011 (2022) - [i73]Shengyu Zhang, Bofang Li, Dong Yao, Fuli Feng, Jieming Zhu, Wenyan Fan, Zhou Zhao, Xiaofei He, Tat-Seng Chua, Fei Wu:
CCL4Rec: Contrast over Contrastive Learning for Micro-video Recommendation. CoRR abs/2208.08024 (2022) - [i72]Yang Zhao, Wenqiang Xu, Xuan Lin, Jingjing Huo, Hong Chen, Zhou Zhao:
AntCritic: Argument Mining for Free-Form and Visually-Rich Financial Comments. CoRR abs/2208.09612 (2022) - [i71]Yan Xia, Zhou Zhao, Shangwei Ye, Yang Zhao, Haoyuan Li, Yi Ren:
Video-Guided Curriculum Learning for Spoken Video Grounding. CoRR abs/2209.00277 (2022) - [i70]Jiong Wang, Zhou Zhao, Weike Jin:
Frame-Subtitle Self-Supervision for Multi-Modal Video Question Answering. CoRR abs/2209.03609 (2022) - [i69]Chenye Cui, Yi Ren, Jinglin Liu, Rongjie Huang, Zhou Zhao:
VarietySound: Timbre-Controllable Video to Sound Generation via Unsupervised Information Disentanglement. CoRR abs/2211.10666 (2022) - [i68]Luping Liu, Yi Ren, Xize Cheng, Zhou Zhao:
Diffusion Denoising Process for Perceptron Bias in Out-of-distribution Detection. CoRR abs/2211.11255 (2022) - [i67]Zijian Zhang, Zhou Zhao, Zhijie Lin:
Unsupervised Representation Learning from Pre-trained Diffusion Probabilistic Models. CoRR abs/2212.12990 (2022) - 2021
- [j58]Lianli Gao
, Daiyuan Chen, Zhou Zhao
, Jie Shao, Heng Tao Shen:
Lightweight dynamic conditional GAN with pyramid attention for text-to-image synthesis. Pattern Recognit. 110: 107384 (2021) - [j57]Zhaoyu Guo
, Zhou Zhao
, Weike Jin
, Zhicheng Wei, Min Yang
, Nannan Wang
, Nicholas Jing Yuan:
Multi-Turn Video Question Generation via Reinforced Multi-Choice Attention Network. IEEE Trans. Circuits Syst. Video Technol. 31(5): 1697-1710 (2021) - [j56]Aming Wu
, Yahong Han
, Zhou Zhao
, Yi Yang:
Hierarchical Memory Decoder for Visual Narrating. IEEE Trans. Circuits Syst. Video Technol. 31(6): 2438-2449 (2021) - [j55]Mao Gu
, Zhou Zhao
, Weike Jin
, Richang Hong, Fei Wu
:
Graph-Based Multi-Interaction Network for Video Question Answering. IEEE Trans. Image Process. 30: 2758-2770 (2021) - [j54]Weike Jin
, Zhou Zhao
, Xiaochun Cao
, Jieming Zhu, Xiuqiang He, Yueting Zhuang:
Adaptive Spatio-Temporal Graph Enhanced Vision-Language Representation for Video QA. IEEE Trans. Image Process. 30: 5477-5489 (2021) - [j53]Zijian Zhang
, Zhou Zhao
, Zhu Zhang
, Zhijie Lin
, Qi Wang, Richang Hong:
Temporal Textual Localization in Video via Adversarial Bi-Directional Interaction Networks. IEEE Trans. Multim. 23: 3306-3317 (2021) - [j52]Min Yang
, Chengming Li
, Ying Shen
, Qingyao Wu
, Zhou Zhao
, Xiaojun Chen
:
Hierarchical Human-Like Deep Neural Networks for Abstractive Text Summarization. IEEE Trans. Neural Networks Learn. Syst. 32(6): 2744-2757 (2021) - [j51]Min Yang
, Qiang Qu
, Ying Shen, Zhou Zhao
, Xiaojun Chen
, Chengming Li
:
An Effective Hybrid Learning Model for Real-Time Event Summarization. IEEE Trans. Neural Networks Learn. Syst. 32(10): 4419-4431 (2021) - [c139]Dong Yao, Shengyu Zhang, Zhou Zhao, Wenyan Fan, Jieming Zhu, Xiuqiang He, Fei Wu:
Modeling High-order Interactions across Multi-interests for Micro-video Reommendation (Student Abstract). AAAI 2021: 15945-15946 - [c138]Shengyu Zhang, Tan Jiang, Tan Wang, Kun Kuang, Zhou Zhao, Jianke Zhu, Jin Yu, Hongxia Yang, Fei Wu:
DeVLBert: Out-of-Distribution Visio-Linguistic Pretraining With Causality. CVPR Workshops 2021: 1744-1747 - [c137]Shengyu Zhang, Tan Jiang, Qinghao Huang, Ziqi Tan, Kun Kuang, Zhou Zhao, Siliang Tang, Jin Yu, Hongxia Yang, Yi Yang, Fei Wu:
Grounded, Controllable and Debiased Image Completion With Lexical Semantics. CVPR Workshops 2021: 1748-1751 - [c136]Yawen Zeng, Da Cao, Xiaochi Wei, Meng Liu, Zhou Zhao, Zheng Qin:
Multi-Modal Relational Graph for Cross-Modal Video Moment Retrieval. CVPR 2021: 2215-2224 - [c135]Yang Zhao, Zhou Zhao, Zhu Zhang, Zhijie Lin:
Cascaded Prediction Network via Segment Tree for Temporal Video Grounding. CVPR 2021: 4197-4206 - [c134]Min Zhang, Yang Guo, Na Lei, Zhou Zhao, Jianfeng Wu, Xiaoyin Xu, Yalin Wang, Xianfeng Gu
:
Cortical Surface Shape Analysis Based on Alexandrov Polyhedra. ICCV 2021: 14224-14232 - [c133]Yi Ren, Chenxu Hu, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu:
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech. ICLR 2021 - [c132]Zhu Zhang, Chang Zhou, Jianxin Ma, Zhijie Lin, Jingren Zhou, Hongxia Yang, Zhou Zhao:
Learning to Rehearse in Long Sequence Memorization. ICML 2021: 12663-12673 - [c131]Ziyue Jiang, Yi Ren, Ming Lei, Zhou Zhao:
FedSpeech: Federated Text-to-Speech with Continual Learning. IJCAI 2021: 3829-3835 - [c130]Kexun Zhang, Yi Ren, Changliang Xu, Zhou Zhao:
WSRGlow: A Glow-Based Waveform Generative Model for Audio Super-Resolution. Interspeech 2021: 1649-1653 - [c129]Chenye Cui, Yi Ren, Jinglin Liu, Feiyang Chen, Rongjie Huang, Ming Lei, Zhou Zhao:
EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model. Interspeech 2021: 2766-2770 - [c128]Zhijie Lin, Zhou Zhao, Haoyuan Li, Jinglin Liu, Meng Zhang, Xingshan Zeng, Xiaofei He:
SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory. ACM Multimedia 2021: 1359-1367 - [c127]Jiong Wang, Zhou Zhao, Weike Jin, Xinyu Duan, Zhen Lei, Baoxing Huai, Yiling Wu, Xiaofei He:
VLAD-VSA: Cross-Domain Face Presentation Attack Detection with Vocabulary Separation and Adaptation. ACM Multimedia 2021: 1497-1506 - [c126]Wencan Huang, Wenwen Pan, Zhou Zhao, Qi Tian:
Towards Fast and High-Quality Sign Language Production. ACM Multimedia 2021: 3172-3181 - [c125]Jiahao Xun, Shengyu Zhang, Zhou Zhao, Jieming Zhu,