


5th MIPR 2022: Virtual Event, USA
- 5th IEEE International Conference on Multimedia Information Processing and Retrieval, MIPR 2022, Virtual Event, USA, August 2-4, 2022. IEEE 2022, ISBN 978-1-6654-9548-6

- Zhongzheng Yuan, Samyak Rawlekar, Siddharth Garg, Elza Erkip, Yao Wang: Feature Compression for Rate Constrained Object Detection on the Edge. 1-6
- Zhuoyi Wang, Yibo Hu, Latifur Khan, Kevin W. Hamlen, Bhavani Thuraisingham: CAPT: Contrastive Pre-Training based Semi-Supervised Open-Set Learning. 7-13
- Chih-Fan Hsu, Ming-Ching Chang, Wei-Chao Chen: A Robust Collaborative Learning Framework Using Data Digests and Synonyms to Represent Absent Clients. 14-19
- Qisheng He, Soumyanil Banerjee, Loren Schwiebert, Ming Dong: AgileGCN: Accelerating Deep GCN with Residual Connections using Structured Pruning. 20-26
- Chris Henry, Birendra Kathariya, M. Salman Asif, Zhu Li, George York: Aerial Image Classification through Thin Lensless Camera. 27-30
- Thanh Hong-Phuoc, Ling Guan: Learning Rotational Invariant Dictionary for Sparse Coding based Key-point Detection. 35-40
- Maryna Veksler, Ramazan Aygun, Kemal Akkaya, S. Sitharama Iyengar: Video Origin Camera Identification using Ensemble CNNs of Positional Patches. 41-46
- Lei Gao, Ling Guan: Interpretable Learning-Based Multi-Modal Hashing Analysis for Multi-View Feature Representation Learning. 47-52
- Ju Wang, Wookjin Choi, Igor Schtau, Taylor Ferro, Wei-bang Chen, Cutrell Trott, Grant Patterson: Improving Angular Estimation Using a Deep CNN network in 6D pose estimation. 53-58
- Jianqiang Wang, Zhan Ma: Sparse Tensor-based Point Cloud Attribute Compression. 59-64
- Yi Chen, Yunhao Mao, Shiqi Wang, Xianguo Zhang, Sam Kwong: Machine-Learning Based High Efficiency Rate Control for AV1. 65-70
- Hoontaek Oh, Jerry D. Gibson: Recursive Randomized Tree Coding of Speech. 71-76
- Kyriakos Lite, Bernhard Rinner: Information-Seeking in Localization and Mission Planning of Multi-Agent Systems. 77-83
- Ziheng Zhang, Chang-Hong Fu, Kai Xie, Hong Hong, Guan-Ming Su: Fast VVC Intra Coding by Skipping Redundant Coding Block Structures and Unnecessary Directional Partition. 84-89
- Ranjit Kumar Tulabandu, Jayasanker Jayaprakash, Sanampudi Venkata Rao, Cherma Rajan A, Neeraj Gadgil, Frank Galligan, Wan-Teh Chang: Evolution of AVIF Encoder: Speed and Memory Optimizations. 90-95
- Yixiang Mao, Yueyu Hu, Yao Wang: Learning to Predict on Octree for Scalable Point Cloud Geometry Coding. 96-102
- Zixiao Yu, Chenyu Yu, Haohong Wang, Jian Ren: Enabling Automatic Cinematography with Reinforcement Learning. 103-108
- Yang Lei, Viktor Shkolnikov, Daisy Xin: Spatially Isotropic 3D Volumetric Reconstruction of Live Biological Cells with Multi-View Geometry. 109-114
- Omkar N. Kulkarni, Shashank Arora, Pradeep K. Atrey: GARGI: Selecting Gaze-Aware Representative Group Image from a Live Photo. 115-120
- Shlomo Dubnov, Gérard Assayag, Vignesh Gokul: Creative Improvised Interaction with Generative Musical Systems. 121-126
- Jie Cai, Zibo Meng, Jiaming Ding, Chiu Man Ho: Real-Time Super-Resolution for Real-World Images on Mobile Devices. 127-132
- Yuxin Lin, Jian Li, Lanqing Guo, Bihan Wen: RE2L: A Real-World Dataset for Outdoor Low-Light Image Enhancement. 133-138
- Peixi Wu, Ge Li, Thomas H. Li: MOAC: Multi-level Perception Optimizer Based on Dual Augmented Cost for Structure-from-Motion. 139-145
- Congying Cao, Lijun Zhao, Jinjing Zhang, Xinlu Wang, Anhong Wang: D2-UTransformer: Deep Modulated Dual-UTransformer for Multiple Description Image Enhancement. 146-151
- Hao Cheng, Joey Tianyi Zhou, Wee Peng Tay, Bihan Wen: Attentive Graph Neural Networks for Few-Shot Learning. 152-157
- Daisuke Miyazaki, Hodaka Tanida: Image enhancement for dichromats using image pyramid based on saturation. 158-161
- Akihito Watanabe, Daisuke Miyazaki: Surface normal estimation of thin transparent objects from polarization of transmitted light. 162-165
- Liangshan Lou, Ke Lu, Jian Xue: Skipped-Connection Transformer for Image Captioning. 166-171
- Dexiang Hong, Guorong Li, Bineng Zhong, Zhenjun Han, Li Su, Qingming Huang: CRNet: Collaborative Refinement Network for Self-Supervised Video Object Segmentation. 172-177
- Chin-Chia Yang, Yi-Chou Chen, Shan-Ling Chen, Homer H. Chen: Disparity-Guided Light Field Video Synthesis with Temporal Consistency. 178-183
- Fityanul Akhyar, Ledya Novamizanti, Trianusa Putra, Elvin Nur Furqon, Ming-Ching Chang, Chih-Yang Lin: Lightning YOLOv4 for a Surface Defect Detection System for Sawn Lumber. 184-189
- Kazuhiro Yamawaki, Xian-Hua Han: Deep Unsupervised Blind Learning for Single Image Super Resolution. 190-193
- Da Huo, Marc A. Kastner, Takahiro Komamizu, Ichiro Ide: Action Semantic Alignment for Image Captioning. 194-197
- Chengwei Wei, C.-C. Jay Kuo, Rafael Luiz Testa, Ariane Machado-Lima, Fátima L. S. Nunes: ExpressionHop: A Lightweight Human Facial Expression Classifier. 198-203
- Praneet Singh, Haoyu Chen, Edward J. Delp, Amy R. Reibman: Evaluating Image Quality Estimators for Face Matching. 204-209
- Jiaran Zhou, Yuezun Li: Detection-by-Simulation: Exposing DeepFake via Simulating Forgery using Face Reconstruction. 210-215
- Nathan Galea, Dylan Seychell: Facial Expression Recognition in the Wild: Dataset Configurations. 216-219
- Pengcheng Gao, Bin Huang, Jiayi Lyu, Haifeng Ma, Jian Xue: A Local-Global Metric Learning Method for Facial Expression Animation. 220-223
- Hui Guo, Shu Hu, Xin Wang, Ming-Ching Chang, Siwei Lyu: Open-Eye: An Open Platform to Study Human Performance on Identifying AI-Synthesized Faces. 224-227
- Zinan Xiong, Chenxi Wang, Ying Li, Yan Luo, Yu Cao: Swin-Pose: Swin Transformer Based Human Pose Estimation. 228-233
- Astha Verma, A. Venkata Subramanyam, Rajiv Ratn Shah: Wasserstein Metric Attack on Person Re-identification. 234-239
- Hirotaka Kato, Takatsugu Hirayama, Keisuke Doman, Ichiro Ide, Yasutomo Kawanishi, Takahiro Komamizu, Daisuke Deguchi, Hiroshi Murase: Intuitive Gait Modeling using Mimetic-Words for Gait Description and Generation. 240-245
- Wei-An Teng, Su-Ling Yeh, Homer H. Chen: Comparison of Virtual-Real Integration Efficiency between Light Field and Conventional Near-Eye AR Displays. 246-251
- Vineet Joshi, A. V. Subramanyam: Contextual Active Learning for Person Re-Identification. 252-257
- Junyi Liu, Esha Naidu, Jialian Wu, Shira Gabriel, Edward Steinfeld, Junsong Yuan: Personalized Prediction of Indoor Comfort Using Graph Convolutional Matrix Completion. 258-261
- Gautham Vinod, Zeman Shao, Fengqing Zhu: Image Based Food Energy Estimation With Depth Domain Adaptation. 262-267
- Zixiao Yu, Enhao Guo, Haohong Wang, Jian Ren: Bridging Script and Animation Utilizing a New Automatic Cinematography Model. 268-273
- Mehmet N. Akcay, Burak Kara, Ali C. Begen, Saba Ahsan, Igor D. D. Curcio, Emre Aksu: Rate-Adaptive Streaming of 360-Degree Videos with Head-Motion-Aware Viewport Margins. 274-280
- Qian Zhou, Klara Nahrstedt: Ultra-Sparse 360-Degree Camera View Synthesis for Immersive Virtual Tourism. 281-286
- Maria E. Presa Reyes, Yudong Tao, Rui Ma, Shu-Ching Chen, Mei-Ling Shyu: Multi-Source Weak Supervision Fusion for Disaster Scene Recognition in Videos. 287-292
- Luntian Mou, Yiyuan Zhao, Chao Zhou, Baocai Yin, Wen Gao, Ramesh C. Jain: A Review of Personalized Health Navigation for Drivers. 293-299
- Saeed Ranjbar Alvar, Korcan Uyanik, Ivan V. Bajic: License Plate Privacy in Collaborative Visual Analysis of Traffic Scenes. 300-305
- Neha Kumari, Min Chen: Malware and Piracy Detection in Android Applications. 306-311
- Haoming Guo, Tianyi Huang, Huixuan Huang, Mingyue Fan, Gerald Friedland: A Systematic Review of Multimodal Approaches to Online Misinformation Detection. 312-317
- Chih-Fan Hsu, Jing-Lun Huang, Feng-Hao Liu, Ming-Ching Chang, Wei-Chao Chen: FedTrust: Towards Building Secure Robust and Trustworthy Moderators for Federated Learning. 318-323
- Kratika Bhagtani, Amit Kumar Singh Yadav, Emily R. Bartusiak, Ziyue Xiang, Ruiting Shao, Sriram Baireddy, Edward J. Delp: An Overview of Recent Work in Multimedia Forensics. 324-329
- Tankut Akgul, Deniz Ugur, Ali C. Begen: Automated Adaptive Playback for Encoder-Adjudicated Live Sports. 330-335
- Nikolaos Passalis, Maria Tzelepi, Polychronis Charitidis, Stavros Doropoulos, Stavros Vologiannidis, Anastasios Tefas: Deep Video Stream Information Analysis and Retrieval: Challenges and Opportunities. 336-341
- Yuwei Chen, Ming-Ching Chang: Towards Multimodal Semantic Consistency Analysis of Long Form Articles. 342-347
- Jiajun Song, Weiqing Min, Yuxin Liu, Zhuo Li, Shuqiang Jiang, Yong Rui: A Noise-robust Locality Transformer for Fine-grained Food Image Retrieval. 348-353
- Jawaher Alghamdi, Yuqing Lin, Suhuai Luo: Modeling Fake News Detection Using BERT-CNN-BiLSTM Architecture. 354-357
- Zheng Guo, Thanh Hong-Phuoc, Naimul Khan, Ling Guan: A Highly Optimized GPU Batched Elasticnet Solver (BENS) with Application to Real-Time Keypoint Detection for Image Retrieval. 358-361
- Keiichi Suekane, Ryo Osawa, Aozora Inagaki, Taiga Matsui, Tomohiro Tanabe, Keita Ishikawa, Tomohiro Takagi: Personalized Fashion Sequential Recommendation with Visual Feature Based on Conditional Hierarchical VAE. 362-365
- Chengxuan Huang, Dalei Wu, Yu Liang: Adaptive Acquisition of Airborne Lidar Point Cloud Based on Deep Reinforcement Learning. 366-371
- Gabriel Lugo Bustillo, Amit Upreti, Irene Cheng: Multiscale point feature object localization for hydrant surveying using LiDAR. 372-378
- Yuexi Zhang, Ming Chen, Yikang Li, Jenhao Hsiao, Octavia I. Camps, Chiuman Ho: Generic Action Start Detection. 379-382
- Cheng Yang, Weigang Zhang: Weakly Supervised Temporal Action Localization Through Contrastive Learning. 383-386
- Tom Liao, Jun-Cheng Chen, Shyh-Kang Jeng, Chunhwei Tai: Cross-Domain Knowledge Transfer for Skeleton-based Action Recognition based on Graph Convolutional Gradient Reversal Layer. 387-390
- Ziruo Yi, Eduardo Blanco, Heng Fan, Mark V. Albert: BAPO: A Large-Scale Multimodal Corpus for Ball Possession Prediction in American Football Games. 391-394
- Oguz M. Aranay, Pradeep K. Atrey: Active Genetic Learning with Evidential Uncertainty for Identifying Mushroom Toxicity. 395-400
- Garima Singhal, Priyankar Choudhary, Vusirikala Abhishek, Seela Sweety, Srinivas Subramanian, Neeraj Goel: Cattle Collar: An End-to-End Multi-Model Framework for Cattle Monitoring. 401-407
- Sushil Ghildiyal, Neeraj Goel, Mukesh Saini: Cloud Removal in Satellite Imagery Using Adversarial Network and RGB-Optical Data Fusion. 407-412
- Pratham Goyal, Anjali Raj, Puneet Kumar, Kishore Babu Nampalle: Automatic Evaluation of Machine Generated Feedback For Text and Image Data. 413-418
- Charan Charupalli, Karthick Seshadri: Fine-tuning the Robust Temporal Feature Magnitude Model for Enhancing the Accuracy of Anomaly Detection. 419-424
- Kishore Babu Nampalle, Balasubramanian Raman: An efficient multi-functional deep learning model for effective medical image classification using skin lesion database. 425-429















