


default search action
4th MIPR 2021: Tokyo, Japan
- 4th IEEE International Conference on Multimedia Information Processing and Retrieval, MIPR 2021, Tokyo, Japan, September 8-10, 2021. IEEE 2021, ISBN 978-1-6654-1865-2

- Shixing Chen, Caojin Zhang, Ming Dong, Chengcui Zhang:

Cross-domain Person Re-Identification with Identity-preserving Style Transfer. 1-7 - Maria E. Presa Reyes, Shu-Ching Chen:

Weakly-Supervised Damaged Building Localization and Assessment with Noise Regularization. 8-14 - Zheng Guo, Lei Gao, Ling Guan:

A Manifold Semantic Canonical Correlation Framework for Effective Feature Fusion. 15-20 - Takuya Yonezawa, Yuanyuan Wang

, Yukiko Kawai, Kazutoshi Sumiya:
An Interactive Cooking Support System for Short Recipe Videos based on User Browsing Behavior. 21-27 - Itsuki Hashimoto, Yuanyuan Wang

, Yukiko Kawai, Kazutoshi Sumiya:
Topic Detection for Video Stream based on Geographical Relationships and its Interactive Viewing System. 28-34 - Yao Fu Jan, Kuan-Wei Tseng, Peng-Yuan Kao, Yi-Ping Hung:

Augmented Tai-Chi Chuan Practice Tool with Pose Evaluation. 35-41 - Deli Yu, Peipei Yang, Cheng-Lin Liu:

Learning-based Tensor Decomposition with Adaptive Rank Penalty for CNNs Compression. 42-48 - Makoto Takamoto, Yusuke Morishita:

An Empirical Study of the Effects of Sample-Mixing Methods for Efficient Training of Generative Adversarial Networks. 49-55 - Carina Miwa Yoshimura, Hiroyuki Kitagawa:

TLV-Bandit: Bandit Method for Collecting Topic-related Local Tweets. 56-62 - Honghui Yuan, Keiji Yanai:

Multi-Style Transfer Generative Adversarial Network for Text Images. 63-69 - Takahiro Komamizu, Shoi Ito, Yasuhiro Ogawa, Katsuhiko Toyama:

FPX-G: First Person Exploration for Graph. 70-76 - Andrew Brown, Ernesto Coto, Andrew Zisserman:

Automated Video Labelling: Identifying Faces by Corroborative Evidence. 77-83 - Mickael Cormier, Houraalsadat Mortazavi Moshkenan, Franz Lörch, Jürgen Metzler, Jürgen Beyerer:

Do as we do: Multiple Person Video-To-Video Transfer. 84-90 - Priyanka Singh

:
Robust Homomorphic Video Hashing. 91-96 - Abhinav Ravi, Sandeep Repakula, Ujjal Kr Dutta, Maulik Parmar:

Buy Me That Look: An Approach for Recommending Similar Fashion Products. 97-103 - Abdullah Alfarrarjeh, Xiao Yang, Amani Abu Jabal, Seon Ho Kim, Cyrus Shahabi:

Exploring the Spatial-Visual Locality of Geo-tagged Urban Street Images. 104-110 - Peng-Yuan Kao, Sheng-Wen Shih, Yi-Ping Hung, Aye Mon Tun:

Recalibration of Structured-Light RGB-D Cameras with Parametric Depth Error Correction. 111-117 - Yingjin Wang, Chuanming Wang, Yuchao Zheng, Huiyuan Fu, Huadong Ma:

Transformer based Neural Network for Fine-Grained Classification of Vehicle Color. 118-124 - Yushu Liu, Weigang Zhang, Guorong Li, Li Su, Qingming Huang:

One-Shot Example Videos Localization Network for Weakly-Supervised Temporal Action Localization. 125-130 - Shisheng Wang, Hideki Nakayama:

Stochastic Observation Prediction for Efficient Reinforcement Learning in Robotics. 131-137 - Aozora Inagaki, Shosuke Haji, Ryoko Nakamura, Ryo Osawa, Tomohiro Takagi, Isshu Munemasa:

Predicting Human Behavior Using User's Contextual Embedding by Convolution of Action Graph. 138-143 - Ryo Osawa, Keiichi Suekane, Ryoko Nakamura, Aozora Inagaki, Tomohiro Takagi, Isshu Munemasa:

Predicting Human Behavior with Transformer Considering the Mutual Relationship between Categories and Regions. 144-150 - Yash Garg, K. Selçuk Candan:

XM2A: Multi-Scale Multi-Head Attention with Cross-Talk for Multi-Variate Time Series Analysis. 151-157 - Eric Brewer, Yiu-Kai Ng:

Identifying Maturity Rating Levels of Online Books. 165-171 - Zanyah Ailsworth, Wei-bang Chen, Yongjin Lu, Xiaoliang Wang

, Melissa Tsui, Huda Al-Ghaib, Ben Zimmerman:
A Hybrid Image Segmentation Approach for Thermal Barrier Coating Quality Assessments. 172-178 - Lei Gao, Ling Guan:

A Novel Correntropy Analysis Method with Application to Multi-view Feature Representation. 179-184 - Ryoko Nakamura, Hirofumi Sano, Aozora Inagaki, Ryo Osawa, Tomohiro Takagi, Isshu Munemasa:

Dynamic Topic-Enhanced Memory Networks: Time-series Behavior Prediction based on Changing Intrinsic Consciousnesses. 185-191 - Wenhao Fang, Xian-Hua Han, Xu Qiao, Huiyan Jiang, Yen-Wei Chen:

Multi-Scale Context Interaction Learning network for Medical Image Segmentation. 192-198 - Yixin Zhang

, Yoko Yamakata, Keishi Tajima:
Supplementing Omitted Named Entities in Cooking Procedural Text with Attached Images. 199-205 - Shunta Komatsu, Ryosuke Furuta, Yukinobu Taniguchi:

Passenger Flow Estimation with Bipartite Matching on Bus Surveillance Cameras. 206-212 - Marc A. Kastner, Chihaya Matsuhira, Ichiro Ide, Shin'ichi Satoh:

A multi-modal dataset for analyzing the imageability of concepts across modalities. 213-218 - Satoshi Yamazaki, Hui Lam Ong, Jianquan Liu, Wei Jian Peh, Hong Yen Ong, Qinyu Huang, Xinlai Jiang:

Practice-Oriented Real-time Person Occurrence Search System. 219-222 - Shu Naritomi, Keiji Yanai:

Pop'n Food: 3D Food Model Estimation System from a Single Image. 223-226 - Naoto Homma, Aiko Uemura, Tetsuro Kitahara:

Are Theme Songs Usable for Anime Retrieval? 227-230 - Taku Okamoto, Hisashi Miyamori:

Demo Paper: Ad Hoc Search On Statistical Data Based On Categorization And Metadata Augmentation. 231-234 - Min Chen, Jignasha Borad, Mizuki Miyashita, James Randall:

Integrated Cloud-based System for Endangered Language Documentation and Application. 235-238 - Jing Xu, Junjie Sun, Taishan Li, Qiang Ma

:
Kyoto Sightseeing Map 2.0 for User-Experience Oriented Tourism. 239-242 - Tomoya Furuta, Yu Suzuki:

A Fact-checking Assistant System for Textual Documents*. 243-246 - Saed Rezayi, Saber Soleymani

, Hamid R. Arabnia, Sheng Li:
Socially Aware Multimodal Deep Neural Networks for Fake News Classification. 253-259 - Xiaofeng Wu, Rui Zhang, Lin Li:

Clustering Trajectories via Sparse Auto-encoders. 260-266 - Lin Li, Turghun Tayir

:
Multimodal Machine Translation Enhancement by Fusing Multimodal-attention and Fine-grained Image Features. 267-272 - Tianqi Li, Takuya Akiyama, Liang Wei:

Constructing a highly accurate price prediction model in real estate investment using LightGBM. 273-276 - Youiti Kado, Takashi Hirokata, Koji Matsumura, Xueting Wang, Toshihiko Yamasaki:

Entity Resolution of Japanese Apartment Property Information Using Neural Networks. 277-282 - Takeshi So, Yuta Arai:

Predicting inquiry from potential renters using property listing information. 283-286 - Aaron Louis Bramson, Megumi Hori:

Effect of Walkability on Rental Prices in Tokyo. 287-292 - Mantaro Yamada, Xueting Wang, Toshihiko Yamasaki:

Preference Analysis of Shopping Malls' Followers and Keyword Recommendation on Twitter. 293-298 - Peng Tan, Yi Ji, Yuqing Xu:

Rethinking of Intangible Cultural Heritage Teaching with Creative Programming in China. 299-302 - Pengcheng Shang, Shan Ni, Li Zhou:

A probabilistic and random method for the generation of Bai nationality music fragments. 303-307 - Ling Fan, Yifang Bao, Shuyu Gong, Sida Yan, Harry Jiannan Wang:

The Brain-Machine-Ratio Model for Designer and AI Collaboration. 308-313 - Kejun Zhang, Xinda Wu, Ruiyuan Tang, Qiaoqiao Huang, Changyuan Yang, Hui Zhang:

The JinYue Database for Huqin Music Emotion, Scene and Imagery Recognition. 314-319 - Ellen Pearlman:

AIBO - A Sicko AI Brainwave Opera. 320-322 - Zhijie Qin, Wei Zhong, Fei Hu, Xinyan Yang, Long Ye, Qin Zhang:

Layout Structure Assisted Indoor Image Generation. 323-329 - Yuting Ma, Fan Tang, Weiming Dong, Changsheng Xu:

Text Style Transfer With Decorative Elements. 330-336 - Gen-Fang Chen

:
Distinguishing the "strong/weak" in the 60 Jingfang tones and their optimal distribution. 337-340 - Luntian Mou, Jueying Li, Juehui Li, Feng Gao, Ramesh C. Jain, Baocai Yin:

MemoMusic: A Personalized Music Recommendation Framework Based on Emotion and Memory. 341-347 - Rongfeng Li, Meng Zhao, Xianlin Zhang, Xueming Li:

Dance to Music: Generative Choreography with Music using Mixture Density Networks. 348-353 - Hongwei Li, Hongjian Bo, Lin Ma, Lexiang Wang, Haifeng Li:

Music Emotion Recognition through Sparse Canonical Correlation Analysis. 354-359 - Zhengxin Zheng, Wei Zhong, Long Ye, Li Fang, Qin Zhang:

Violent Scene Detection of Film Videos Based on Multi-Task Learning of Temporal-Spatial Features. 360-365 - Feng Gao, Chengjia Lei

, Xingguo Long, Jin Wang, Peiheng Song:
Design and Development of an Intelligent Pet-Type Quadruped Robot. 366-371 - Lin Gan, Li Lv, Cuicui Wang, Mu Zhang:

Smart Portable Musical Simulation System Based on Unified Temperament. 372-376 - Minghao Wang, Long Ye, Fei Hu, Li Fang, Wei Zhong, Qin Zhang:

Respective Volumetric Heatmap Autoencoder for Multi-Person 3D Pose Estimation. 377-381 - Yufan Li

, Jinggang Zhuo, Ling Fan, Harry Jiannan Wang:
Culture-inspired Multi-modal Color Palette Generation and Colorization: A Chinese Youth Subculture Case. 382-385 - Xiaodan Hu, Pengfei Yu, Kevin Knight, Heng Ji, Bo Li, Honghui Shi:

MUSE: Textual Attributes Guided Portrait Painting Generation. 386-392 - Vanessa Utz, Steve DiPaola:

Exploring the Application of AI-generated Artworks for the Study of Aesthetic Processing. 393-398 - Fabian Kilger, Alexandre Kabil, Volker Tippmann, Gudrun Klinker, Marc-Oliver Pahl

:
Detecting and Preventing Faked Mixed Reality. 399-405 - Frederik Temmermans

, Deepayan Bhowmik, Fernando Pereira, Touradj Ebrahimi
:
An Introduction to the JPEG Fake Media Initiative. 406-411 - Arun Kumar Singh, Priyanka Singh

:
Detection of AI-Synthesized Speech Using Cepstral & Bispectral Statistics. 412-417 - Lucas Florin, Andreas Specker

, Arne Schumann, Jürgen Beyerer:
Hardness Prediction for More Reliable Attribute-based Person Re-identification. 418-424

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














