CVPR 2018: Salt Lake City, UT, USA - Workshops
- 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPR Workshops 2018, Salt Lake City, UT, USA, June 18-22, 2018. Computer Vision Foundation / IEEE Computer Society 2018
Disguised Faces in the Wild
- Vineet Kushwaha, Maneet Singh, Richa Singh, Mayank Vatsa, Nalini K. Ratha, Rama Chellappa:
Disguised Faces in the Wild. 1-9
- Ankan Bansal, Rajeev Ranjan, Carlos Domingo Castillo, Rama Chellappa:
Deep Features for Recognizing Disguised Faces in the Wild. 10-16
- Naman Kohli, Daksha Yadav, Afzel Noore:
Face Verification With Disguise Variations via Deep Disguise Recognizer. 17-24
- Skand Vishwanath Peri, Abhinav Dhall:
DisguiseNet: A Contrastive Approach for Disguised Face Verification in the Wild. 25-31
- Kaipeng Zhang, Ya-Liang Chang, Winston H. Hsu:
Deep Disguised Faces Recognition. 32-36
- Evgeny Smirnov, Aleksandr Melnikov, Andrei Oleinik, Elizaveta Ivanova, Ilya Kalinovskiy, Eugene Luckyanets:
Hard Example Mining With Auxiliary Embeddings. 37-46
- Jun Liu, Ajay Kumar:
Detecting Presentation Attacks From 3D Face Masks Under Multispectral Imaging. 47-52
NVIDIA AI City Challenge
- Milind Naphade, Ming-Ching Chang, Anuj Sharma, David C. Anastasiu, Vamsi Jagarlamudi, Pranamesh Chakraborty, Tingting Huang, Shuo Wang, Ming-Yu Liu, Rama Chellappa, Jenq-Neng Hwang, Siwei Lyu:
The 2018 NVIDIA AI City Challenge. 53-60
- Ming-Ching Chang, Yi Wei, Nenghui Song, Siwei Lyu:
Video Analytics in Smart Transportation for the AIC'18 Challenge. 61-68
- Weitao Feng, Deyi Ji, Yiru Wang, Shuorong Chang, Hansheng Ren, Weihao Gan:
Challenges on Large Scale Surveillance Video Analysis. 69-76
- Jakub Sochor, Jakub Spanhel, Roman Juránek, Petr Dobes, Adam Herout:
Graph@FIT Submission to the NVIDIA AI City Challenge 2018. 77-84
- Tingyu Mao, Wei Zhang, Haoyu He, Yanjun Lin, Vinay Kale, Alexander Stein, Zoran Kostic:
AIC2018 Report: Traffic Surveillance Research. 85-92
- Panagiotis Giannakeris, Vagia Kaltsa, Konstantinos Avgerinakis, Alexia Briassouli, Stefanos Vrochidis, Ioannis Kompatsiaris:
Speed Estimation and Abnormality Detection From Surveillance Cameras. 93-99
- Minh-Triet Tran, Tung Dinh Duy, Thanh-Dat Truong, Vinh Ton-That, Thanh-Nhon Do, Quoc-An Luong, Thanh-An Nguyen, Vinh-Tiep Nguyen, Minh N. Do:
Traffic Flow Analysis With Multiple Adaptive Vehicle Detectors and Velocity Estimation With Landmark-Based Scanlines. 100-107
- Zheng Tang, Gaoang Wang, Hao Xiao, Aotian Zheng, Jenq-Neng Hwang:
Single-Camera and Inter-Camera Vehicle Tracking and 3D Speed Estimation Based on Fusion of Visual and Semantic Features. 108-115
- Honghui Shi, Zhonghao Wang, Yang Zhang, Xinchao Wang, Thomas S. Huang:
Geometry-Aware Traffic Flow Analysis by Detection and Tracking. 116-120
- Chih-Wei Wu, Chih-Ting Liu, Cheng-En Chiang, Wei-Chih Tu, Shao-Yi Chien:
Vehicle Re-Identification With the Space-Time Prior. 121-128
- Jiayi Wei, Jianfei Zhao, Yanyun Zhao, Zhicheng Zhao:
Unsupervised Anomaly Detection for Traffic Surveillance Based on Background Modeling. 129-136
- Amit Kumar, Pirazh Khorramshahi, Wei-An Lin, Prithviraj Dhar, Jun-Cheng Chen, Rama Chellappa:
A Semi-Automatic 2D Solution for Vehicle Speed Estimation From Monocular Videos. 137-144
- Yan Xu, Xi Ouyang, Yu Cheng, Shining Yu, Lin Xiong, Choon-Ching Ng, Sugiri Pranata, Shengmei Shen, Junliang Xing:
Dual-Mode Vehicle Motion Pattern Learning for High Performance Road Traffic Anomaly Detection. 145-152
- Shuai Hua, Manika Kapoor, David C. Anastasiu:
Vehicle Tracking and Speed Estimation From Traffic Videos. 153-160
- Tingting Huang:
Traffic Speed Estimation From Surveillance Video Data. 161-165
- Pedro A. Marín-Reyes, Andrea Palazzi, Luca Bergamini, Simone Calderara, Javier Lorenzo-Navarro, Rita Cucchiara:
Unsupervised Vehicle Re-Identification Using Triplet Networks. 166-171
DeepGlobe: A Challenge for Parsing the Earth through Satellite Images
- Ilke Demir, Krzysztof Koperski, David Lindenbaum, Guan Pang, Jing Huang, Saikat Basu, Forest Hughes, Devis Tuia, Ramesh Raskar:
DeepGlobe 2018: A Challenge to Parse the Earth Through Satellite Images. 172-181
- Lichen Zhou, Chuang Zhang, Ming Wu:
D-LinkNet: LinkNet With Pretrained Encoder and Dilated Convolution for High Resolution Satellite Imagery Road Extraction. 182-186
- Ryuhei Hamaguchi, Shuhei Hikosaka:
Building Detection From Satellite Imagery Using Ensemble of Size-Specific Detectors. 187-191
- Chao Tian, Cong Li, Jianping Shi:
Dense Fusion Classmate Network for Land Cover Classification. 192-196
- Shubhra Aich, William van der Kamp, Ian Stavness:
Semantic Binary Segmentation Using Convolutional Networks Without Decoders. 197-201
- Tao Sun, Zehui Chen, Wenxiang Yang, Yin Wang:
Stacked U-Nets With Multi-Output for Road Extraction. 202-206
- Alexander Buslaev, Selim S. Seferbekov, Vladimir Iglovikov, Alexey Shvets:
Fully Convolutional Network for Automatic Road Extraction From Satellite Imagery. 207-210
- Oleksandr Filin, Anton Zapara, Serhii Panchenko:
Road Detection With EOSResUNet and Post Vectorizing Algorithm. 211-215
- Jigar Doshi:
Residual Inception Skip Network for Binary Segmentation. 216-219
- Dragos Costea, Alina Marcu, Emil Slusanschi, Marius Leordeanu:
Roadmap Generation Using a Multi-Stage Ensemble of Deep Neural Networks With Smoothing-Based Optimization. 220-224
- Matt Dickenson, Lionel Gueguen:
Rotated Rectangles for Symbolized Building Footprint Extraction. 225-228
- Sergey Golovanov, Rauf Kurbanov, Aleksey Artamonov, Alex Davydow, Sergey I. Nikolenko:
Building Detection From Satellite Imagery Using a Composite Loss Function. 229-232
- Vladimir Iglovikov, Selim S. Seferbekov, Alexander Buslaev, Alexey Shvets:
TernausNetV2: Fully Convolutional Network for Instance Segmentation. 233-237
- Weijia Li, Conghui He, Jiarui Fang, Haohuan Fu:
Semantic Segmentation Based Building Extraction Method Using Multi-Source GIS Map Datasets and Satellite Imagery. 238-241
- Rémi Delassus, Romain Giot:
CNNs Fusion for Building Detection in Aerial Images for the Building Detection Challenge. 242-246
- Kang Zhao, Jungwon Kang, Jaewook Jung, Gunho Sohn:
Building Extraction From Satellite Images Using Mask R-CNN With Building Boundary Regularization. 247-251
- Tzu-Sheng Kuo, Keng-Sen Tseng, Jia-Wei Yan, Yen-Cheng Liu, Yu-Chiang Frank Wang:
Deep Aggregation Net for Land Cover Classification. 252-256
- Arthita Ghosh, Max Ehrlich, Sohil Shah, Larry S. Davis, Rama Chellappa:
Stacked U-Nets for Ground Material Segmentation in Remote Sensing Imagery. 257-261
- Alexander Rakhlin, Alex Davydow, Sergey I. Nikolenko:
Land Cover Classification From Satellite Imagery With U-Net and Lovasz-Softmax Loss. 262-266
- Mohamed Samy, Karim Amer, Kareem Eissa, Mahmoud Shaker, Mohamed ElHelw:
NU-Net: Deep Residual Wide Field of View Convolutional Neural Network for Semantic Segmentation. 267-271
- Selim S. Seferbekov, Vladimir Iglovikov, Alexander Buslaev, Alexey Shvets:
Feature Pyramid Network for Multi-Class Land Segmentation. 272-275
- Guillem Pascual, Santi Seguí, Jordi Vitrià:
Uncertainty Gated Network for Land Cover Segmentation. 276-279
- Alex Davydow, Sergey I. Nikolenko:
Land Cover Classification With Superpixels and Jaccard Index Post-Optimization. 280-284
Visual Understanding of Humans in Crowd Scene and Look Into Person Challenge
- Yu-Jhe Li, Fu-En Yang, Yen-Cheng Liu, Yu-Ying Yeh, Xiaofei Du, Yu-Chiang Frank Wang:
Adaptation and Re-Identification Network: An Unsupervised Deep Transfer Learning Approach to Person Re-Identification. 172-178
- Aske R. Lejbølle, Benjamin Krogh, Kamal Nasrollahi, Thomas B. Moeslund:
Attention in Multimodal Neural Networks for Person Re-Identification. 179-187
- Girum G. Demisse, Konstantinos Papadopoulos, Djamila Aouada, Björn E. Ottersten:
Pose Encoding for Robust Skeleton-Based Action Recognition. 188-194
- Diptodip Deb, Jonathan Ventura:
An Aggregated Multicolumn Dilated Convolution Network for Perspective-Free Counting. 195-204
- Mihai Fieraru, Anna Khoreva, Leonid Pishchulin, Bernt Schiele:
Learning to Refine Human Pose Estimation. 205-214
- Meng Yang, Lida Rashidi, Sutharshan Rajasegarar, Christopher Leckie, Aravinda S. Rao, Marimuthu Palaniswami:
Crowd Activity Change Point Detection in Videos via Graph Stream Mining. 215-223
Deep Learning for Visual SLAM
- Daniel DeTone, Tomasz Malisiewicz, Andrew Rabinovich:
SuperPoint: Self-Supervised Interest Point Detection and Description. 224-236
- Emilio Parisotto, Devendra Singh Chaplot, Jian Zhang, Ruslan Salakhutdinov:
Global Pose Estimation With an Attention-Based Recurrent Network. 237-246
- Stefan Milz, Georg Arbeiter, Christian Witt, Bassam Abdallah, Senthil Kumar Yogamani:
Visual SLAM for Automated Driving: Exploring the Applications of Deep Learning. 247-257
- Masaya Kaneko, Kazuya Iwami, Toru Ogawa, Toshihiko Yamasaki, Kiyoharu Aizawa:
Mask-SLAM: Robust Feature-Based Monocular SLAM by Masking Using Semantic Segmentation. 258-266
- Ganesh Iyer, J. Krishna Murthy, Gunshi Gupta, K. Madhava Krishna, Liam Paull:
Geometric Consistency for Self-Supervised End-to-End Visual Odometry. 267-275
- Sungil Choi, Seungryong Kim, Kihong Park, Kwanghoon Sohn:
Learning Descriptor, Confidence, and Depth Estimation in Multi-View Stereo. 276-282
- Arun C. S. Kumar, Suchendra M. Bhandarkar, Mukta Prasad:
DepthNet: A Recurrent Neural Network Architecture for Monocular Depth Prediction. 283-291
- Luis Contreras, Walterio W. Mayol-Cuevas:
Towards CNN Map Representation and Compression for Camera Relocalisation. 292-299
- Arun C. S. Kumar, Suchendra M. Bhandarkar, Mukta Prasad:
Monocular Depth Prediction Using Generative Adversarial Networks. 300-308
- Bo Yang, Zihang Lai, Xiaoxuan Lu, Shuyu Lin, Hongkai Wen, Andrew Markham, Niki Trigoni:
Learning 3D Scene Semantics and Structure From a Single Depth Image. 309-312
- Lachlan Nicholson, Michael Milford, Niko Sünderhauf:
QuadricSLAM: Dual Quadrics As SLAM Landmarks. 313-314
Diff-CVML: Differential Geometry in Computer Vision and Machine Learning
- Hang Shao, Abhishek Kumar, P. Thomas Fletcher:
The Riemannian Geometry of Deep Generative Models. 315-323
- Kyungmin Ahn, J. Derek Tucker, Wei Wu, Anuj Srivastava:
Elastic Handling of Predictor Phase in Functional Regression Models. 324-331
- Maxime Louis, Benjamin Charlier, Stanley Durrleman:
Geodesic Discriminant Analysis for Manifold-Valued Data. 332-340
- Rudrasis Chakraborty, Chun-Hao Yang, Baba C. Vemuri:
A Mixture Model for Aggregation of Multiple Pre-Trained Weak Classifiers. 341-348
- Hongjun Choi, Qiao Wang, Meynard John Toledo, Pavan K. Turaga, Matthew P. Buman, Anuj Srivastava:
Temporal Alignment Improves Feature Quality: An Experiment on Activity Recognition With Accelerometer Data. 349-357
- Justin D. Strait, Sebastian Kurtek, Steven N. MacEachern:
Locally-Weighted Elastic Comparison of Planar Shapes. 358-366
- Dinesh Acharya, Zhiwu Huang, Danda Pani Paudel, Luc Van Gool:
Covariance Pooling for Facial Expression Recognition. 367-374
- Mehran Javanmardi, Ricardo Bigolin Lanfredi, Müjdat Çetin, Tolga Tasdizen:
Image Segmentation by Deep Learning of Disjunctive Normal Shape Model Shape Representation. 375-382
- Suhas Lohit, Ankan Bansal, Nitesh Shroff, Jaishanker K. Pillai, Pavan K. Turaga, Rama Chellappa:
Predicting Dynamical Evolution of Human Activities From a Single Image. 383-392
- Ioana Ilea, Lionel Bombrun, Salem Said, Yannick Berthoumieu:
Covariance Matrices Encoding Based on the Log-Euclidean and Affine Invariant Riemannian Metrics. 393-402
- Somenath Das, Suchendra M. Bhandarkar:
Principal Curvature Guided Surface Geometry Aware Global Shape Representation. 403-412
Biometrics
- Yoanna Martínez-Díaz, Heydi Méndez-Vázquez, Leyanis López-Avila, Leonardo Chang, Luis Enrique Sucar, Massimo Tistarelli:
Toward More Realistic Face Recognition Evaluation Protocols for the YouTube Faces Database. 413-421
- Yefei Chen, Jianbo Su:
Dict Layer: A Structured Dictionary Layer. 422-431
- Elias N. Zois, Marianna Papagiannopoulou, Dimitrios Tsourounis, George Economou:
Hierarchical Dictionary Learning and Sparse Coding for Static Signature Verification. 432-442
- Mohsen Jenadeleh, Marius Pedersen, Dietmar Saupe:
Realtime Quality Assessment of Iris Biometrics Under Visible Light. 443-452
- Narsi Reddy, Dewan Fahim Noor, Zhu Li, Reza Derakhshani:
Multi-Frame Super Resolution for Ocular Biometrics. 453-461
- Arun Kumar Jindal, Srinivas Chalamala, Santosh Kumar Jami:
Face Template Protection Using Deep Convolutional Neural Network. 462-470
- Ruben Tolosana, Rubén Vera-Rodríguez, Julian Fiérrez, Javier Ortega-Garcia:
Incorporating Touch Biometrics to Mobile One-Time Passwords: Exploration of Digits. 471-478
- Maneet Singh, Shruti Nagpal, Mayank Vatsa, Richa Singh, Angshul Majumdar:
Identity Aware Synthesis for Cross Resolution Face Recognition. 479-488
- Sivaram Prasad Mudunuri, Soubhik Sanyal, Soma Biswas:
GenLR-Net: Deep Framework for Very Low Resolution Face and Object Recognition With Generalization to Unseen Categories. 489-498
- Hadi Kazemi, Sobhan Soleymani, Ali Dabouei, Seyed Mehdi Iranmanesh, Nasser M. Nasrabadi:
Attribute-Centered Loss for Soft-Biometrics Guided Face Sketch-Photo Recognition. 499-507
- Jude Ezeobiejesi, Bir Bhanu:
Latent Fingerprint Image Quality Assessment Using Deep Learning. 508-516
- Shaan Chopra, Aakarsh Malhotra, Mayank Vatsa, Richa Singh:
Unconstrained Fingerphoto Database. 517-525
- Mustafa Berkay Yilmaz, Kagan Öztürk:
Hybrid User-Independent and User-Dependent Offline Signature Verification With a Two-Channel CNN. 526-534
- Siqi Yang, Arnold Wiliem, Brian C. Lovell:
It Takes Two to Tango: Cascading Off-the-Shelf Face Detectors. 535-543
- Javier Hernandez-Ortega, Julian Fiérrez, Aythami Morales, Pedro Tome:
Time Analysis of Pulse-Based Face Anti-Spoofing in Visible and NIR. 544-552
- Fariborz Taherkhani, Nasser M. Nasrabadi, Jeremy M. Dawson:
A Deep Face Identification Network Enhanced by Facial Attributes Prediction. 553-560
- Yasushi Makihara, Daisuke Adachi, Chi Xu, Yasushi Yagi:
Gait Recognition by Deformable Registration. 561-571
- Daksha Yadav, Naman Kohli, Akshay Agarwal, Mayank Vatsa, Richa Singh, Afzel Noore:
Fusion of Handcrafted and Deep Learning Features for Large-Scale Multiple Iris Presentation Attack Detection. 572-579
- Gee-Sern Jison Hsu, Wen-Fong Huang, Jiunn-Horng Kang:
Hierarchical Network for Facial Palsy Detection. 580-586
Embedded Vision
- Mennatullah Siam, Mostafa Gamal, Moemen Abdel-Razek, Senthil Kumar Yogamani, Martin Jägersand, Hong Zhang:
A Comparative Study of Real-Time Semantic Segmentation for Autonomous Driving. 587-597
- Nikitha Vallurupalli, Sriharsha Annamaneni, Girish Varma, C. V. Jawahar, Manu Mathew, Soyeb Nagori:
Efficient Semantic Segmentation Using Gradual Grouping. 598-606
- Hongxing Gao, Wei Tao, Dongchao Wen, Tse-Wei Chen, Kinya Osa, Masami Kato:
IFQ-Net: Integrated Fixed-Point Quantization Networks for Embedded Vision. 607-615
- Takayuki Ujiie, Masayuki Hiromoto, Takashi Sato:
Interpolation-Based Object Detection Using Motion Vectors for Embedded Real-Time Tracking Systems. 616-624
- Cevahir Çigla, Rohan Thakker, Larry H. Matthies:
Onboard Stereo Vision for Drone Pursuit or Sense and Avoid. 625-633
- Andre Ivan, Williem, In Kyu Park:
Light Field Depth Estimation on Off-the-Shelf Mobile GPU. 634-643
- Nicholas F. Y. Chen:
Pseudo-Labels for Supervised Learning on Dynamic Vision Sensor Data, Applied to Object Detection Under Ego-Motion. 644-653
- Cevahir Çigla, Kemal E. Sahin, Fikret Alim:
GPU Based Video Object Tracking on PTZ Cameras. 654-662
- Alexandre Briot, Prashanth Viswanath, Senthil Kumar Yogamani:
Analysis of Efficient CNN Design Techniques for Semantic Segmentation. 663-672
- Pankaj Bhowmik, Md Jubaer Hossain Pantho, Marjan Asadinia, Christophe Bobda:
Design of a Reconfigurable 3D Pixel-Parallel Neuromorphic Architecture for Smart Image Sensor. 673-681
- Paolo Di Febbo, Carlo Dal Mutto, Kinh Tieu, Stefano Mattoccia:
KCNN: Extremely-Efficient Hardware Keypoint Detection With a Compact Convolutional Neural Network. 682-690
New Trends in Image Restoration and Enhancement
- Andrey Ignatov, Nikolay Kobyshev, Radu Timofte, Kenneth Vanhoey, Luc Van Gool:
WESPE: Weakly Supervised Photo Enhancer for Digital Cameras. 691-700
- Yuan Yuan, Siyuan Liu, Jiawei Zhang, Yongbing Zhang, Chao Dong, Liang Lin:
Unsupervised Image Super-Resolution Using Cycle-in-Cycle Generative Adversarial Networks. 701-710
- Honggang Chen, Xiaohai He, Linbo Qing, Shuhua Xiong, Truong Q. Nguyen:
DPW-SDNet: Dual Pixel-Wavelet Domain Deep CNNs for Soft Decoding of JPEG-Compressed Images. 711-720
- Cheng-Han Lee, Kaipeng Zhang, Hu-Cheng Lee, Chia-Wen Cheng, Winston H. Hsu:
Attribute Augmented Convolutional Neural Network for Face Hallucination. 721-729
- Yixin Du, Xin Li:
Recursive Deep Residual Learning for Single Image Dehazing. 730-737
- S. Alireza Golestaneh, Lina J. Karam:
Synthesized Texture Quality Assessment via Multi-Scale Spatial and Statistical Texture Attributes of Image and Gradient Magnitude Coefficients. 738-744
- Meiguang Jin, Michael Hirsch, Paolo Favaro:
Learning Face Deblurring Fast and Wide. 745-753
- Codruta O. Ancuti, Cosmin Ancuti, Radu Timofte, Christophe De Vleeschouwer:
O-HAZE: A Dehazing Benchmark With Real Hazy and Haze-Free Outdoor Images. 754-762
- George Seif, Dimitrios Androutsos:
Large Receptive Field Networks for High-Scale Image Super-Resolution. 763-772
- Pengju Liu, Hongzhi Zhang, Kai Zhang, Liang Lin, Wangmeng Zuo:
Multi-Level Wavelet-CNN for Image Restoration. 773-782
- Asha Anoosheh, Eirikur Agustsson, Radu Timofte, Luc Van Gool:
ComboGAN: Unrestrained Scalability for Image Domain Translation. 783-790
- Namhyuk Ahn, Byungkon Kang, Kyung-Ah Sohn:
Image Super-Resolution via Progressive Cascading Residual Network. 791-799
- Jun-Hyuk Kim, Jong-Seok Lee:
Deep Residual Network With Enhanced Upscaling Module for Super-Resolution. 800-808
- Rong Chen, Yanyun Qu, Kun Zeng, Jinkang Guo, Cuihua Li, Yuan Xie:
Persistent Memory Residual Network for Single Image Super Resolution. 809-816
- Sehwan Ki, Hyeonjun Sim, Jae-Seok Choi, Saehun Kim, Munchurl Kim:
Fully End-to-End Learning Based Conditional Boundary Equilibrium GAN With Receptive Field Sizes Enlarged for Single Ultra-High Resolution Image Dehazing. 817-824
- Deniz Engin, Anil Genç, Hazim Kemal Ekenel:
Cycle-Dehaze: Enhanced CycleGAN for Single Image Dehazing. 825-833
- Manoj Sharma, Rudrabha Mukhopadhyay, Avinash Upadhyay, Sriharsha Koundinya, Ankit Shukla, Santanu Chaudhury:
IRGUN: Improved Residue Based Gradual Up-Scaling Network for Single Image Super Resolution. 834-843
- Sriharsha Koundinya, Himanshu Sharma, Manoj Sharma, Avinash Upadhyay, Raunak Manekar, Rudrabha Mukhopadhyay, Abhijit Karmakar, Santanu Chaudhury:
2D-3D CNN Based Architectures for Spectral Reconstruction From RGB Images. 844-851
- Radu Timofte, Shuhang Gu, Jiqing Wu, Luc Van Gool:
NTIRE 2018 Challenge on Single Image Super-Resolution: Methods and Results. 852-863
- Yifan Wang, Federico Perazzi, Brian McWilliams, Alexander Sorkine-Hornung, Olga Sorkine-Hornung, Christopher Schroers:
A Fully Progressive Approach to Single-Image Super-Resolution. 864-873
- Yijie Bei, Alexandru Damian, Shijia Hu, Sachit Menon, Nikhil Ravi, Cynthia Rudin:
New Techniques for Preserving Global Structure and Denoising With Low Information Loss in Single-Image Super-Resolution. 874-881
- Dongwon Park, Kwan-Young Kim, Se Young Chun:
Efficient Module Based Single Image Super Resolution for Multiple Problems. 882-890
- Cosmin Ancuti, Codruta O. Ancuti, Radu Timofte:
NTIRE 2018 Challenge on Image Dehazing: Methods and Results. 891-901
- He Zhang, Vishwanath Sindagi, Vishal M. Patel:
Multi-Scale Single Image Dehazing Using Perceptual Pyramid Deep Network. 902-911
- Hyeonjun Sim, Sehwan Ki, Jae-Seok Choi, Soomin Seo, Saehun Kim, Munchurl Kim:
High-Resolution Image Dehazing With Respect to Training Losses and Receptive Field Sizes. 912-919
- Ranjan Mondal, Sanchayan Santra, Bhabatosh Chanda:
Image Dehazing by Joint Estimation of Transmittance and Airlight Using Bi-Directional Consistency Loss Minimized FCN. 920-928
- Boaz Arad, Ohad Ben-Shahar, Radu Timofte:
NTIRE 2018 Challenge on Spectral Reconstruction From RGB Images. 929-938
- Zhan Shi, Chang Chen, Zhiwei Xiong, Dong Liu, Feng Wu:
HSCNN+: Advanced CNN-Based Hyperspectral Recovery From RGB Images. 939-947
- Tarek Stiebel, Simon Koppers, Philipp Seltsam, Dorit Merhof:
Reconstructing Spectral Images From RGB-Images Using a Convolutional Neural Network. 948-953
Autonomous Driving
- Xinyu Huang, Xinjing Cheng, Qichuan Geng, Binbin Cao, Dingfu Zhou, Peng Wang, Yuanqing Lin, Ruigang Yang:
The ApolloScape Dataset for Autonomous Driving. 954-960
- JeongYeol Baek, Ioana Veronica Chelu, Livia Iordache, Vlad Paunescu, HyunJoo Ryu, Alexandru Ghiuta, Andrei Petreanu, YunSung Soh, Andrei Leica, ByeongMoon Jeon:
Scene Understanding Networks for Autonomous Driving Based on Around View Monitoring System. 961-968
- Jonathan Tremblay, Aayush Prakash, David Acuna, Mark Brophy, Varun Jampani, Cem Anil, Thang To, Eric Cameracci, Shaad Boochoon, Stan Birchfield:
Training Deep Networks With Synthetic Data: Bridging the Reality Gap by Domain Randomization. 969-977
- Arantxa Casanova, Guillem Cucurull, Michal Drozdzal, Adriana Romero, Yoshua Bengio:
On the Iterative Refinement of Densely Connected Representation Levels for Semantic Segmentation. 978-987
- Satoshi Tsutsui, Tommi Kerola, Shunta Saito, David J. Crandall:
Minimizing Supervision for Free-Space Segmentation. 988-997
- Yu-Hui Huang, Xu Jia, Stamatios Georgoulis, Tinne Tuytelaars, Luc Van Gool:
Error Correction for Dense Semantic Image Labeling. 998-1006
- Nikolai Smolyanskiy, Alexey Kamenev, Stan Birchfield:
On the Importance of Stereo for Accurate Depth Estimation: An Efficient Semi-Supervised Deep Neural Network Approach. 1007-1015
- Shaohui Sun, Ramesh Sarukkai, Jack Kwok, Vinay D. Shet:
Accurate Deep Direct Geo-Localization From Ground Imagery and Phone-Grade GPS. 1016-1023
- Ernest Cheung, Aniket Bera, Dinesh Manocha:
Efficient and Safe Vehicle Navigation Based on Driver Behavior Classification. 1024-1031
- Bhakti Baheti, Suhas S. Gajre, Sanjay N. Talbar:
Detection of Distracted Driver Using Convolutional Neural Network. 1032-1038
- Aniket Bera, Tanmay Randhavane, Austin Wang, Dinesh Manocha, Emily Kubin, Kurt Gray:
Classifying Group Emotions for Socially-Aware Autonomous Vehicle Navigation. 1039-1047
- Andrew Best, Sahil Narang, Lucas Pasqualin, Daniel Barber, Dinesh Manocha:
AutonoVi-Sim: Autonomous Vehicle Simulation Platform With Weather, Sensing, and Traffic Control. 1048-1056
- Arun C. S. Kumar, Suchendra M. Bhandarkar, Mukta Prasad:
Learning Hierarchical Models for Class-Specific Reconstruction From Natural Data. 1057-1065
- Pratik Prabhanjan Brahma, Adrienne Othon:
Subset Replay Based Continual Learning for Scalable Improvement of Autonomous Systems. 1066-1074
Human Pose, Motion, Activities and Shape in 3D
- Endri Dibra, Silvan Melchior, Ali Balkis, Thomas Wolf, Cengiz Öztireli, Markus H. Gross:
Monocular RGB Hand Pose Inference From Unsupervised Refinable Nets. 1075-1085
- Maren Awiszus, Stella Grasshof, Felix Kuhnke, Jörn Ostermann:
Unsupervised Features for Facial Expression Intensity Estimation Over Time. 1086-1094
- Nolan Lunscher, John S. Zelek:
Deep Learning Whole Body Point Cloud Scans From a Single Depth Map. 1095-1102
- Akshay Rangesh, Mohan M. Trivedi:
HandyNet: A One-Stop Solution to Detect, Segment, Localize & Analyze Driver Hands. 1103-1110
Brave New Ideas for Video Understanding
- Debidatta Dwibedi, Pierre Sermanet, Jonathan Tompson:
Temporal Reasoning in Videos Using Convolutional Gated Recurrent Units. 1111-1116
- Ali Diba, Mohsen Fayyaz, Vivek Sharma, Amir Hossein Karami, Mohammad Mahdi Arzani, Rahman Yousefzadeh, Luc Van Gool:
Temporal 3D ConvNets Using Temporal Transition Layer. 1117-1121
- Wonmin Byeon, Qin Wang, Rupesh Kumar Srivastava, Petros Koumoutsakos:
ContextVP: Fully Context-Aware Video Prediction. 1122-1126
- Michael Wray, Davide Moltisanti, Dima Damen:
Towards an Unequivocal Representation of Actions. 1127-1131
- Suman Saha, Rajitha Navarathna, Leonhard Helminger, Romann M. Weber:
Unsupervised Deep Representations for Learning Audience Facial Behaviors. 1132-1137
- Shweta Bhardwaj, Mitesh M. Khapra:
I Have Seen Enough: A Teacher Student Network for Video Classification Using Fewer Frames. 1138-1142
Perception Beyond the Visible Spectrum
- Amanda Berg, Jörgen Ahlberg, Michael Felsberg:
Generating Visible Spectrum Images From Thermal Infrared. 1143-1152
- Shuo Liu, Vijay John, Erik Blasch, Zheng Liu, Ying Huang:
IR2VI: Enhanced Night Environmental Perception by Unsupervised Thermal Image Translation. 1153-1160
- Timothy Doster, Tegan Emerson, Colin C. Olson:
Path Orthogonal Matching Pursuit for Sparse Reconstruction and Denoising of SWIR Maritime Imagery. 1161-1168
- Patricia L. Suarez, Angel Domingo Sappa, Boris Xavier Vintimilla, Riad I. Hammoud:
Deep Learning Based Single Image Dehazing. 1169-1176
- Kin Gwn Lore, Kishore K. Reddy, Michael Giering, Edgar A. Bernal:
Generative Adversarial Networks for Depth Map Estimation From RGB Video. 1177-1185
- Michael Loveday, Toby P. Breckon:
On the Impact of Parallax Free Colour and Infrared Image Co-Registration to Fused Illumination Invariant Adaptive Background Modelling. 1186-1195
- Anthony Ortiz, Alonso Granados, Olac Fuentes, Christopher Kiekintveld, Dalton S. Rosario, Zachary Bell:
Integrated Learning and Feature Selection for Deep Neural Networks in Multispectral Images. 1196-1205
- Jiahang Che, Yuxiang Xing, Li Zhang:
A Comprehensive Solution for Deep-Learning Based Cargo Inspection to Discriminate Goods in Containers. 1206-1213
- Jin-Fu Lin, Yen-Liang Lin, Erh-Kan King, Hung-Ting Su, Winston H. Hsu:
Cross-Domain Hallucination Network for Fine-Grained Object Recognition. 1214-1221
- Brian Millikan, Hassan Foroosh, Qiyu Sun:
Deep Convolutional Neural Networks With Integrated Quadratic Correlation Filters for Automatic Target Recognition. 1222-1229
- Xingyu Wan, Jinjun Wang, Sanping Zhou:
An Online and Flexible Multi-Object Tracking Framework Using Long Short-Term Memory. 1230-1238
- Mark W. Koch, R. Derek West, Robert Riley, Tu-Thach Quach:
Polarimetric Synthetic-Aperture-Radar Change-Type Classification With a Hyperparameter-Free Open-Set Classifier. 1239-1246
- Marcel Sheeny, Andrew M. Wallace, Mehryar Emambakhsh, Sen Wang, Barry Connor:
POL-LWIR Vehicle Detection: Convolutional Neural Networks Meet Polarised Infrared Sensors. 1247-1253
Computer Vision for Physiological Measurement
- Christian S. Pilz, Sebastian Zaunseder, Jarek Krajewski, Vladimir Blazek:
Local Group Invariance for Heart Rate Estimation From Face Videos in the Wild. 1254-1262
- Genki Okada, Kenta Masui, Norimichi Tsumura:
Advertisement Effectiveness Estimation Based on Crowdsourced Multimodal Affective Responses. 1263-1271
- Ewa Magdalena Nowara, Tim K. Marks, Hassan Mansour, Ashok Veeraraghavan:
SparsePPG: Towards Driver Monitoring Using Camera-Based Vital Signs Estimation in Near-Infrared. 1272-1281
- Gregory F. Lewis, Maria I. Davila, Stephen W. Porges:
Novel Algorithms to Monitor Continuous Cardiac Activity With a Video Camera. 1282-1290
- Emmett Kerr, Sonya A. Coleman, T. Martin McGinnity, Andrea Shepherd:
Measurement of Capillary Refill Time (CRT) in Healthy Subjects Using a Robotic Hand. 1291-1298
- Changchen Zhao, Chun-Liang Lin, Weihai Chen, Zhengguo Li:
A Novel Framework for Remote Photoplethysmography Pulse Extraction on Compressed Videos. 1299-1308
- Chuanxiang Tang, Jiwu Lu, Jie Liu:
Non-Contact Heart Rate Monitoring by Combining Convolutional Neural Network Skin Detection and Remote Photoplethysmography via a Low-Cost Camera. 1309-1315
- Puneet Gupta, Brojeshwar Bhowmick, Arpan Pal:
Exploring the Feasibility of Face Video Based Instantaneous Heart-Rate for Micro-Expression Spotting. 1316-1323
- Munenori Fukunishi, Kouki Kurita, Shoji Yamamoto, Norimichi Tsumura:
Video Based Measurement of Heart Rate and Heart Rate Variability Spectrogram From Estimated Hemoglobin Information. 1324-1331
- Richard Macwan, Serge Bobbia, Yannick Benezeth, Julien Dubois, Alamin Mansouri:
Periodic Variance Maximization Using Generalized Eigenvalue Decomposition Applied to Remote Photoplethysmography Estimation. 1332-1340
- Serge Bobbia, Duncan Luguern, Yannick Benezeth, Keisuke Nakamura, Randy Gomez, Julien Dubois:
Real-Time Temporal Superpixels for Unsupervised Remote Photoplethysmography. 1341-1348
- Tom Vogels, Mark van Gastel, Wenjin Wang, Gerard de Haan:
Fully-Automatic Camera-Based Pulse-Oximetry During Sleep. 1349-1357
- Andreia Vieira Moco, Sander Stuijk, Mark van Gastel, Gerard de Haan:
Impairing Factors in Remote-PPG Pulse Transit Time Measurements on the Face. 1358-1366
- Daniel McDuff:
Deep Super Resolution for Recovering Physiological Information From Videos. 1367-1374
- Jaehee Park, Ashutosh Sabharwal, Ashok Veeraraghavan:
Direct-Global Separation for Improved Imaging Photoplethysmography. 1375-1384
Automated Analysis of Marine Video for Environmental Monitoring
- Deborah Levy, Yuval Belfer, Elad Osherov, Eyal Bigal, Aviad P. Scheinin, Hagai Nativ, Dan Tchernov, Tali Treibitz:
Automated Analysis of Marine Video With Limited Data. 1385-1393
- Andrew King, Suchendra M. Bhandarkar, Brian M. Hopkinson:
A Comparison of Deep Learning Methods for Semantic Segmentation of Coral Reef Survey Images. 1394-1402
- Yi-Min Chou, Chien-Hung Chen, Keng-Hao Liu, Chu-Song Chen:
Stingray Detection of Aerial Images Using Augmented Training Images Generated by a Conditional Generative Model. 1403-1409
- Malte Pedersen, Stefan Bengtson, Rikke Gade, Niels Madsen, Thomas B. Moeslund:
Camera Calibration for Underwater 3D Reconstruction Based on Ray Tracing Using Snell's Law. 1410-1417
Joint Detection, Tracking, and Prediction in the Wild
- Emad Barsoum, John R. Kender, Zicheng Liu:
HP-GAN: Probabilistic 3D Human Motion Prediction via GAN. 1418-1427
- Roberto Henschel, Laura Leal-Taixé, Daniel Cremers, Bodo Rosenhahn:
Fusion of Head and Full-Body Detectors for Multi-Object Tracking. 1428-1437
- Neeti Narayan, Nishant Sankaran, Srirangaraj Setlur, Venu Govindaraju:
Re-Identification for Online Person Tracking by Modeling Space-Time Continuum. 1438-1447
- Mehran Khodabandeh, Hamid Reza Vaezi Joze, Ilya Zharkov, Vivek Pradeep:
DIY Human Action Dataset Generation. 1448-1458
- Hilke Kieritz, Wolfgang Hübner, Michael Arens:
Joint Detection and Online Multi-Object Tracking. 1459-1467
- Nachiket Deo, Mohan M. Trivedi:
Convolutional Social Pooling for Vehicle Trajectory Prediction. 1468-1476
Visual Odometry and Computer Vision Applications Based on Location Clues
- Chun-Wei Chen, Yin-Hsi Kuo, Tang Lee, Cheng-Han Lee, Winston H. Hsu:
Drone-View Building Identification by Cross-View Visual Learning and Relative Spatial Estimation. 1477-1485
- Silvio Giancola, Jens Schneider, Peter Wonka, Bernard Ghanem:
Integration of Absolute Orientation Measurements in the KinectFusion Reconstruction Pipeline. 1486-1495
- Xue Iuan Wong, Taewook Lee, Puneet Singla, Manoranjan Majji:
Optimal Linear Attitude Estimator for Alignment of Point Clouds. 1496-1504
- Yi Xu, Yuzhang Wu, Hui Zhou:
Multi-Scale Voxel Hashing and Efficient 3D Representation for Mobile Augmented Reality. 1505-1512
- Ahmed Nassar, Karim Amer, Reda ElHakim, Mohamed ElHelw:
A Deep CNN-Based Framework for Enhanced Aerial Imagery Registration With Applications to UAV Geolocalization. 1513-1523
- Qiong Wu, Ambrose Li:
Automated Virtual Navigation and Monocular Localization of Indoor Spaces From Videos. 1524-1532
- Tristan Swedish, Ramesh Raskar:
Deep Visual Teach and Repeat on Path Networks. 1533-1542
- Liang Yang, Bing Li, Wei Li, Biao Jiang, Jizhong Xiao:
Semantic Metric 3D Reconstruction for Concrete Inspection. 1543-1551
Bright and Dark Sides of Computer Vision: Challenges and Opportunities for Privacy and Security
- Aniket Roy, Diangarti Bhalang Tariang, Rajat Subhra Chakraborty, Ruchira Naskar:
Discrete Cosine Transform Residual Feature Based Filtering Forgery and Splicing Detection in JPEG Images. 1552-1560
- Noa Privman-Horesh, Azmi Haider, Hagit Hel-Or:
Forgery Detection in 3D-Sensor Images. 1561-1569
- Jiawei Chen, Janusz Konrad, Prakash Ishwar:
VGAN-Based Image Representation Learning for Privacy-Preserving Facial Expression Recognition. 1570-1579
- Jinyuan Zhao, Natalia Frumkin, Janusz Konrad, Prakash Ishwar:
Privacy-Preserving Indoor Localization via Active Scene Illumination. 1580-1589
- Yifang Li, Wyatt Troutman, Bart P. Knijnenburg, Kelly Caine:
Human Perceptions of Sensitive Content in Photos. 1590-1596
- Jamie Hayes:
On Visible Adversarial Perturbations & Digital Watermarking. 1597-1604
- Mahmood Sharif, Lujo Bauer, Michael K. Reiter:
On the Suitability of Lp-Norms for Creating and Preventing Adversarial Examples. 1605-1613
- Hossein Hosseini, Radha Poovendran:
Semantic Adversarial Examples. 1614-1619
- Steven Hoffman, Renu Sharma, Arun Ross:
Convolutional Neural Networks for Iris Presentation Attack Detection: Toward Cross-Dataset and Cross-Sensor Generalization. 1620-1628
Efficient Deep Learning for Computer Vision
- Amarjot Singh, Devendra Patil, S. N. Omkar:
Eye in the Sky: Real-Time Drone Surveillance System (DSS) for Violent Individuals Identification Using ScatterNet Hybrid Deep Learning Network. 1629-1637 - Amir Gholami, Kiseok Kwon, Bichen Wu, Zizheng Tai, Xiangyu Yue, Peter H. Jin, Sicheng Zhao, Kurt Keutzer:
SqueezeNext: Hardware-Aware Neural Network Design. 1638-1647 - Lane McIntosh, Niru Maheswaranathan, David Sussillo, Jonathon Shlens:
Recurrent Segmentation for Variable Computational Budgets. 1648-1657 - Oyebade K. Oyedotun, Abd El Rahman Shabayek, Djamila Aouada, Björn E. Ottersten:
Highway Network Block With Gates Constraints for Training Very Deep Networks. 1658-1667 - Dae Ha Kim, Seung Hyun Lee, Byung Cheol Song:
MUNet: Macro Unit-Based Convolutional Neural Network for Mobile Devices. 1668-1676 - Baohua Sun, Lin Yang, Patrick Dong, Wenhan Zhang, Jason Dong, Charles Young:
Ultra Power-Efficient CNN Domain Specific Accelerator With 9.3TOPS/Watt for Mobile and Embedded Applications. 1677-1685 - Yi-Min Chou, Yi-Ming Chan, Jia-Hong Lee, Chih-Yi Chiu, Chu-Song Chen:
Merging Deep Neural Networks for Mobile Devices. 1686-1694 - Qing Zhang, Mengru Zhang, Mengdi Wang, Wanchen Sui, Chen Meng, Jun Yang, Weidan Kong, Xiaoyuan Cui, Wei Lin:
Efficient Deep Learning Inference Based on Model Compression. 1695-1702 - Michael T. Chan, Daniel Scarafoni, Ronald Duarte, Jason Thornton, Luke J. Skelly:
Learning Network Architectures of Deep CNNs Under Resource Constraints. 1703-1710
Computer Vision in Sports
- Silvio Giancola, Mohieddine Amine, Tarek Dghaily, Bernard Ghanem:
SoccerNet: A Scalable Dataset for Action Spotting in Soccer Videos. 1711-1721 - Tharindu Fernando, Sridha Sridharan, Clinton Fookes, Simon Denman:
Deep Decision Trees for Discriminative Dictionary Learning With Adversarial Multi-Agent Trajectories. 1722-1731 - Arda Senocak, Tae-Hyun Oh, Junsik Kim, In So Kweon:
Part-Based Player Identification Using Deep Convolutional Representation and Multi-Scale Pooling. 1732-1739 - A. J. Piergiovanni, Michael S. Ryoo:
Fine-Grained Activity Recognition in Baseball Videos. 1740-1748 - Rajkumar Theagarajan, Federico Pala, Xiu Zhang, Bir Bhanu:
Soccer: Who Has the Ball? Generating Visual Analytics and Player Statistics. 1749-1757 - Vito Renò, Nicola Mosca, Roberto Marani, Massimiliano Nitti, Tiziana D'Orazio, Ettore Stella:
Convolutional Neural Networks Based Ball Detection in Tennis Games. 1758-1764 - Anthony Cioppa, Adrien Deliège, Marc Van Droogenbroeck:
A Bottom-Up Approach Based on Semantics for the Interpretation of the Main Camera Stream in Soccer Games. 1765-1774 - Kosuke Takahashi, Dan Mikami, Mariko Isogawa, Hideaki Kimata:
Human Pose As Calibration Pattern; 3D Human Pose Estimation With Multiple Unsynchronized and Uncalibrated Cameras. 1775-1782 - Gen Li, Shikun Xu, Xiang Liu, Lei Li, Changhu Wang:
Jersey Number Recognition With Semi-Supervised Spatial Transformer Network. 1783-1790 - Dan Zecha, Moritz Einfalt, Christian Eggert, Rainer Lienhart:
Kinematic Pose Rectification for Performance Analysis and Retrieval in Sports. 1791-1799 - Pushkar Shukla, Hemant Sadana, Apaar Bansal, Deepak Verma, Carlos E. L. Elmadjian, Balasubramanian Raman, Matthew A. Turk:
Automatic Cricket Highlight Generation Using Event-Driven and Excitement-Based Features. 1800-1808 - Tomoya Kaichi, Shohei Mori, Hideo Saito, Kosuke Takahashi, Dan Mikami, Mariko Isogawa, Hideaki Kimata:
Estimation of Center of Mass for Sports Scene Using Weighted Visual Hull. 1809-1815 - Mohib Ullah, Faouzi Alaya Cheikh:
A Directed Sparse Graphical Model for Multi-Target Tracking. 1816-1823 - Noor ul Huda, Kasper H. Jensen, Rikke Gade, Thomas B. Moeslund:
Estimating the Number of Soccer Players Using Simulation-Based Occlusion Handling. 1824-1833
Computational Cameras and Displays
- Hajime Nagahara, Toshiki Sonoda, Dengyu Liu, Jinwei Gu:
Space-Time-Brightness Sampling Using an Adaptive Pixel-Wise Coded Exposure. 1834-1842 - Avinash Kumar, Manjula Gururaj, Kalpana Seshadrinathan, Ramkumar Narayanswamy:
Multi-Capture Dynamic Calibration of Multi-Camera Systems. 1843-1851 - Nianyi Li, Scott McCloskey, Jingyi Yu:
Jittered Exposures for Image Super-Resolution. 1852-1859
Women in Computer Vision
- Ilke Demir, Dena Bazazian, Adriana Romero, Viktoriia Sharmanska, Lyne P. Tchapmi:
WiCV 2018: The Fourth Women in Computer Vision Workshop. 1860-1862 - Kumar Rohit Malhotra, Anis Davoudi, Scott Siegel, Azra Bihorac, Parisa Rashidi:
Autonomous Detection of Disruptions in the Intensive Care Unit Using Deep Mask R-CNN. 1863-1865 - Avantika Singh, Aditya Nigam:
Encapsulating the Impact of Transfer Learning, Domain Knowledge and Training Strategies in Deep-Learning Based Architecture: A Biometric Based Case Study. 1866-1868 - Bojana Gajic, Ramón Baldrich:
Cross-Domain Fashion Image Retrieval. 1869-1871 - Dena Bazazian, Dimosthenis Karatzas, Andrew D. Bagdanov:
Word Spotting in Scene Images Based on Character Recognition. 1872-1874 - Ilke Demir:
A Holistic Framework for Addressing the World Using Machine Learning. 1875-1877 - Ivona Tautkute, Tomasz Trzcinski, Adam Bielski:
I Know How You Feel: Emotion Recognition With Facial Landmarks. 1878-1880 - Jyoti Islam, Yanqing Zhang:
Early Diagnosis of Alzheimer's Disease: A Neuroimaging Study With Deep Learning Architectures. 1881-1883 - Kanami Yamagishi, Shintaro Yamamoto, Takuya Kato, Shigeo Morishima:
Cosmetic Features Extraction by a Single Image Makeup Decomposition. 1884-1886 - Ksenia Bittner, Marco Körner:
Automatic Large-Scale 3D Building Shape Refinement Using Conditional Generative Adversarial Networks. 1887-1889 - Marcella Cornia, Lorenzo Baraldi, Giuseppe Serra, Rita Cucchiara:
SAM: Pushing the Limits of Saliency Prediction Models. 1890-1892 - Meng Zheng, Srikrishna Karanam, Richard J. Radke:
RPIfield: A New Dataset for Temporally Evaluating Person Re-Identification. 1893-1895 - Mikayla Timm, Subhransu Maji, Todd Fuller:
Large-Scale Ecological Analyses of Animals in the Wild Using Computer Vision. 1896-1898 - Murium Iqbal, Adair Kovac, Kamelia Aryafar:
Discovering Style Trends Through Deep Visually Aware Latent Item Embeddings. 1899-1901 - Nezihe Merve Gürel:
Towards More Accurate Radio Telescope Images. 1902-1904 - Sima Behpour:
ARC: Adversarial Robust Cuts for Semi-Supervised and Multi-Label Classification. 1905-1907
Mutual Benefits of Cognitive and Computer Vision: How Can We Use One to Understand the Other?
- Vandit Gajjar, Yash Khandhediya, Ayesha Gurnani, Viraj Mavani, Mehul S. Raval:
ViS-HuD: Using Visual Saliency to Improve Human Detection With Convolutional Neural Networks. 1908-1916 - Masaki Nakada, Honglin Chen, Demetri Terzopoulos:
Learning Biomimetic Perception for Human Sensorimotor Control. 1917-1922 - Hossein Hosseini, Baicen Xiao, Mayoore Jaiswal, Radha Poovendran:
Assessing Shape Bias Property of Convolutional Neural Networks. 1923-1931 - Hossein Adeli, Gregory J. Zelinsky:
Deep-BCN: Deep Networks Meet Biased Competition to Create a Brain-Inspired Model of Attention Control. 1932-1942 - Mahmoud Khademi, Oliver Schulte:
Image Caption Generation With Hierarchical Contextual Visual Spatial Attention. 1943-1951 - Ravi Kant Kumar, Jogendra Garain, Dakshina Ranjan Kisku, Goutam Sanyal:
Estimating Attention of Faces Due to Its Growing Level of Emotions. 1952-1960 - Amir Rosenfeld, Markus D. Solbach, John K. Tsotsos:
Totally Looks Like - How Humans Compare, Compared to Machines. 1961-1964 - Lin Qi, Ying Xu, Xiaowei Shang, Junyu Dong:
Fusing Visual Saliency for Material Recognition. 1965-1968 - Mikhail Startsev, Michael Dorr:
Increasing Video Saliency Model Generalizability by Training for Smooth Pursuit Prediction. 1969-1972 - Katerina Malakhova:
Representation of Categories in Filters of Deep Neural Networks. 1973-1975 - Tian Xu, Oliver G. B. Garrod, H. Steven Scholte, Robin A. A. Ince, Philippe G. Schyns:
Using Psychophysical Methods to Understand Mechanisms of Face Identification in a Deep Neural Network. 1976-1984 - Tao Tu, Jonathan Koss, Paul Sajda:
Relating Deep Neural Network Representations to EEG-fMRI Spatiotemporal Dynamics in a Perceptual Decision-Making Task. 1985-1991 - Akram Bayat, Do Hyong Koh, Anubhaw Kumar Nand, Marta Pereira, Marc Pomplun:
Scene Grammar in Human and Machine Recognition of Objects and Scenes. 1992-1999 - Petros Koutras, Georgia Panagiotaropoulou, Antigoni Tsiami, Petros Maragos:
Audio-Visual Temporal Saliency Modeling Validated by fMRI Data. 2000-2010 - Amir Rosenfeld, Mahdi Biparva, John K. Tsotsos:
Priming Neural Networks. 2011-2020
Real World Challenges and New Benchmarks for Deep Learning in Robotic Vision
- Xingchao Peng, Ben Usman, Neela Kaushik, Dequan Wang, Judy Hoffman, Kate Saenko:
VisDA: A Synthetic-to-Real Benchmark for Visual Domain Adaptation. 2021-2026 - Xavier Roynard, Jean-Emmanuel Deschaud, François Goulette:
Paris-Lille-3D: A Point Cloud Dataset for Urban Scene Segmentation and Classification. 2027-2030 - Tyler L. Hayes, Ronald Kemker, Nathan D. Cahill, Christopher Kanan:
New Metrics and Experimental Paradigms for Continual Learning. 2031-2034 - Alan Wu, A. J. Piergiovanni, Michael S. Ryoo:
Action-Conditioned Convolutional Future Regression Models for Robot Imitation Learning. 2035-2037 - Jonathan Tremblay, Thang To, Stan Birchfield:
Falling Things: A Synthetic Dataset for 3D Object Detection and Pose Estimation. 2038-2041 - Deepak Pathak, Yide Shentu, Dian Chen, Pulkit Agrawal, Trevor Darrell, Sergey Levine, Jitendra Malik:
Learning Instance Segmentation by Interaction. 2042-2045 - Phil Ammirato, Alexander C. Berg, Jana Kosecka:
Active Vision Dataset Benchmark. 2046-2049 - Deepak Pathak, Parsa Mahmoudieh, Guanghao Luo, Pulkit Agrawal, Dian Chen, Yide Shentu, Evan Shelhamer, Jitendra Malik, Alexei A. Efros, Trevor Darrell:
Zero-Shot Visual Imitation. 2050-2053 - Abhishek Das, Samyak Datta, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra:
Embodied Question Answering. 2054-2063
Analysis and Modeling of Faces and Gestures
- Yuancheng Ye, Yingli Tian, Matt Huenerfauth, Jingya Liu:
Recognizing American Sign Language Gestures From Within Continuous Videos. 2064-2073 - Nataniel Ruiz, Eunji Chong, James M. Rehg:
Fine-Grained Head Pose Estimation Without Keypoints. 2074-2083 - Sveinn Palsson, Eirikur Agustsson, Radu Timofte, Luc Van Gool:
Generative Adversarial Style Transfer Networks for Face Aging. 2084-2092 - Adam Kortylewski, Bernhard Egger, Andreas Schneider, Thomas Gerig, Andreas Morel-Forster, Thomas Vetter:
Empirically Analyzing the Effect of Dataset Biases on Deep Face Recognition Systems. 2093-2102 - Okan Köpüklü, Neslihan Kose, Gerhard Rigoll:
Motion Fused Frames: Data Level Fusion Strategy for Hand Gesture Recognition. 2103-2111 - Jia Xue, Zibo Meng, Karthik Katipally, Haibo Wang, Kees van Zon:
Clothing Change Aware Person Identification. 2112-2120 - Chieh-Ming Kuo, Shang-Hong Lai, Michel Sarkis:
A Compact Deep Learning Model for Robust Facial Expression Recognition. 2121-2129 - Itir Önal Ertugrul, László A. Jeni, Jeffrey F. Cohn:
FACSCaps: Pose-Independent Facial Action Coding With Capsules. 2130-2139 - Daksha Yadav, Naman Kohli, Ekampreet Kalsi, Mayank Vatsa, Richa Singh, Afzel Noore:
Unraveling Human Perception of Facial Aging Using Eye Gaze. 2140-2147 - Dário Augusto Borges Oliveira, Andréa Britto Mattos, Edmilson Da Silva Morais:
Improving Viseme Recognition Using GAN-Based Frontal View Mapping. 2148-2155 - Rajeev Ranjan, Shalini De Mello, Jan Kautz:
Light-Weight Head Pose Invariant Gaze Tracking. 2156-2164 - Esube Bekele, Wallace E. Lawson, Zachary Horne, Sangeet Khemlani:
Implementing a Robust Explanatory Bias in a Person Re-Identification Network. 2165-2172 - Puspita Majumdar, Saheb Chhabra, Richa Singh, Mayank Vatsa:
On Detecting Domestic Abuse via Faces. 2173-2179
Vision With Biased or Scarce Data
- Maren Awiszus, Bodo Rosenhahn:
Markov Chain Neural Networks. 2180-2187 - Ashish Mishra, M. Shiva Krishna Reddy, Anurag Mittal, Hema A. Murthy:
A Generative Model for Zero Shot Learning Using Conditional Variational Autoencoders. 2188-2196 - Liang Qiu, Hongliang Ren:
Endoscope Navigation and 3D Reconstruction of Oral Cavity by Visual SLAM With Mitigated Data Scarcity. 2197-2204
Computer Vision for Microscopy Image Analysis
- Yuki Hiramatsu, Kazuhiro Hotta, Ayako Imanishi, Michiyuki Matsuda, Kenta Terai:
Cell Image Segmentation by Integrating Multiple CNNs. 2205-2211 - Dongnan Liu, Donghao Zhang, Yang Song, Chaoyi Zhang, Heng Huang, Mei Chen, Weidong Cai:
Large Kernel Refine Fusion Net for Neuron Membrane Segmentation. 2212-2220 - Chichen Fu, Soonam Lee, David Joon Ho, Shuo Han, Paul Salama, Kenneth W. Dunn, Edward J. Delp:
Three Dimensional Fluorescence Microscopy Image Synthesis and Segmentation. 2221-2229 - Abdul Aziz, Harshit Pande, Bharath Cheluvaraju, Tathagato Rai Dastidar:
Improved Extraction of Objects From Urine Microscopy Images With Unsupervised Thresholding and Supervised U-Net Techniques. 2230-2238 - Mina Khoshdeli, Garrett Winkelmaier, Bahram Parvin:
Multilayer Encoder-Decoder Network for 3D Nuclear Segmentation in Spheroid Models of Human Mammary Epithelial Cell Lines. 2239-2245 - Cheng Yang, Haowen Ma, Xu Cao, Xia Hua, Xiaofeng Bu, Limin Zhang, Tao Yue, Feng Yan:
Resolution-Enhanced Lensless Color Shadow Imaging Microscopy Based on Large Field-of-View Submicron-Pixel Imaging Sensors. 2246-2253 - Vibha Gupta, Arnav Bhavsar:
Sequential Modeling of Deep Features for Breast Cancer Histopathological Image Classification. 2254-2261 - Romain Mormont, Pierre Geurts, Raphaël Marée:
Comparison of Deep Transfer Learning Strategies for Digital Pathology. 2262-2271 - Alexandr A. Kalinin, Ari Allyn-Feuer, Alexander S. Ade, Gordon-Victor Fon, Walter Meixner, David Dilworth, Jeffrey R. de Wet, Gerald A. Higgins, Gen Zheng, Amy Creekmore, John W. Wiley, James E. Verdone, Robert W. Veltri, Kenneth J. Pienta, Donald S. Coffey, Brian D. Athey, Ivo D. Dinov:
3D Cell Nuclear Morphology: Microscopy Imaging Dataset and Voxel-Based Morphometry Classification Results. 2272-2280 - Sreetama Basu, Elton Rexhepaj, Nathalie Spassky, Auguste Genovesio, Rasmus Reinhold Paulsen, A. S. M. Shihavuddin:
FastSME: Faster and Smoother Manifold Extraction From 3D Stack. 2281-2289 - Shahira Abousamra, Shai Adar, Natalie Elia, Roy Shilkrot:
Localization and Tracking in 4D Fluorescence Microscopy Imagery. 2290-2298 - Karan Dewan, Tathagato Rai Dastidar, Maroof Ahmad:
Estimation of Sperm Concentration and Total Motility From Microscopic Videos of Human Semen Samples. 2299-2306
Computational Models for Learning Systems and Educational Assessment
- Vijay Rowtula, Varun Bhargavan, Mohan Kumar, C. V. Jawahar:
Scaling Handwritten Student Assessments With a Document Image Workflow System. 2307-2314 - Ömer Sümer, Patricia Goldberg, Kathleen Stürmer, Tina Seidel, Peter Gerjets, Ulrich Trautwein, Enkelejda Kasneci:
Teachers' Perception in the Classroom. 2315-2324
Visual Understanding of Subjective Attributes of Data
- Bo Pang, Kaiwen Zha, Cewu Lu:
Human Action Adverb Recognition: ADHA Dataset and a Three-Stream Hybrid Model. 2325-2334 - Adam Bielski, Tomasz Trzcinski:
Pay Attention to Virality: Understanding Popularity of Social Media Videos With the Attention Mechanism. 2335-2337 - Eli Alshan, Sharon Alpert, Assaf Neuberger, Nathaniel Bubis, Eduard Oks:
Learning Fashion by Simulated Human Supervision. 2338-2344 - Amir Sadovnik, Wassim Gharbi, Thanh Vu, Andrew C. Gallagher:
Finding Your Lookalike: Measuring Face Similarity Rather Than Face Identity. 2345-2353 - Dario Dotti, Mirela Popa, Stylianos Asteriadis:
Behavior and Personality Analysis in a Nonsocial Context Dataset. 2354-2362 - Gülcan Can, Yassir Benkhedda, Daniel Gatica-Perez:
Ambiance in Social Media Venues: Visual Cue Interpretation by Machines and Crowds. 2363-2372 - Albert Clapés, Ozan Bilici, Dariia Temirova, Egils Avots, Gholamreza Anbarjafari, Sergio Escalera:
From Apparent to Real Age: Gender, Age, Ethnic, Makeup, and Expression Bias Analysis in Real Age Estimation. 2373-2382
Sight and Sound
- Ruohan Gao, Rogério Schmidt Feris, Kristen Grauman:
Learning to Separate Object Sounds by Watching Unlabeled Video. 2496-2499 - Yipin Zhou, Zhaowen Wang, Chen Fang, Trung Bui, Tamara L. Berg:
Visual to Sound: Generating Natural Sound for Videos in the Wild. 2500-2503 - Vinicius Signori Furlan, Ruzena Bajcsy, Erickson R. Nascimento:
Fast Forwarding Egocentric Videos by Listening and Watching. 2504-2507 - Arda Senocak, Tae-Hyun Oh, Junsik Kim, Ming-Hsuan Yang, In So Kweon:
On Learning Association of Sound Source and Visual Scenes. 2508-2509 - Yue Qiu, Hirokatsu Kataoka:
Image Generation Associated With Music Data. 2510-2513 - Herman Kamper, Gregory Shakhnarovich, Karen Livescu:
Semantic Speech Retrieval With a Visually Grounded Model of Untranscribed Speech. 2514-2517 - Sanjeel Parekh, Slim Essid, Alexey Ozerov, Ngoc Q. K. Duong, Patrick Pérez, Gaël Richard:
Weakly Supervised Representation Learning for Unsynchronized Audio-Visual Events. 2518-2519 - Michele Merler, Dhiraj Joshi, Khoi-Nguyen C. Mac, Quoc-Bao Nguyen, Stephen Hammer, John Kent, Jinjun Xiong, Minh N. Do, John R. Smith, Rogério Schmidt Feris:
The Excitement of Sports: Automatic Highlights Using Audio/Visual Cues. 2520-2523 - Tawfiq Salem, Menghua Zhai, Scott Workman, Nathan Jacobs:
A Multimodal Approach to Mapping Soundscapes. 2524-2527 - Chiori Hori, Takaaki Hori, Gordon Wichern, Jue Wang, Teng-Yok Lee, Anoop Cherian, Tim K. Marks:
Multimodal Attention for Fusion of Audio and Spatiotemporal Features for Video Description. 2528-2531 - Abe Davis, Maneesh Agrawala:
Visual Rhythm and Beat. 2532-2535 - Zhoutong Zhang, Jiajun Wu, Qiujia Li, Zhengjia Huang, Joshua B. Tenenbaum, William T. Freeman:
Inverting Audio-Visual Simulation for Shape and Material Perception. 2536-2538
Workshop and Challenge on Learned Image Compression
- David Alexandre, Chih-Peng Chang, Wen-Hsiao Peng, Hsueh-Ming Hang:
An Autoencoder-based Learned Image Compressor: Description of Challenge Proposal by NCTU. 2539-2542 - Ming Li, Jianhua Hu, Changsheng Xia, Yundong Zhang:
An Implementation of Picture Compression with A CNN-based Auto-encoder. 2543-2546 - Alekh Karkada Ashok, Nagaraju Palani:
Autoencoders with Variable Sized Latent Vector for Image Compression. 2547-2550 - Çaglar Aytekin, Xingyang Ni, Francesco Cricri, Jani Lainema, Emre Aksu, Miska M. Hannuksela:
Block-optimized Variable Bit Rate Neural Image Compression. 2551-2554 - Danial Maleki, Soheila Nadalian, Mohammad Mahdi Derakhshani, Mohammad Amin Sadeghi:
BlockCNN: A Deep Network for Artifact Removal and Image Compression. 2555-2558 - Zhenzhong Chen, Yiming Li, Feiyang Liu, Zizheng Liu, Xiang Pan, Wanjie Sun, Yingbin Wang, Yan Zhou, Han Zhu, Shan Liu:
CNN-Optimized Image Compression with Uncertainty based Resource Allocation. 2559-2562 - Jianhua Hu, Ming Li, Changsheng Xia, Yundong Zhang:
Combine Traditional Compression Method With Convolutional Neural Networks. 2563-2566 - Zhimin Tang, Linkai Luo:
Compression artifact removal using multi-scale reshuffling convolutional network. 2567-2570 - Kai Cui, Eckehard G. Steinbach:
Decoder Side Image Quality Enhancement exploiting Inter-channel Correlation in a 3-stage CNN: Submission to CLIC 2018. 2571-2574 - Haojie Liu, Tong Chen, Qiu Shen, Tao Yue, Zhan Ma:
Deep Image Compression via End-to-End Learning. 2575-2578 - Dang-Khoa Le Tan, Huu Le, Tuan Hoang, Thanh-Toan Do, Ngai-Man Cheung:
DeepVQ: A Deep Network Architecture for Vector Quantization. 2579-2582 - Tamar Rott Shaham, Tomer Michaeli:
Deformation Aware Image Compression. 2583-2586 - Eirikur Agustsson, Michael Tschannen, Fabian Mentzer, Radu Timofte, Luc Van Gool:
Extreme Learned Image Compression with GANs. 2587-2590 - Aupendu Kar, Sri Phani Krishna Karri, Nirmalya Ghosh, Ramanathan Sethuraman, Debdoot Sheet:
Fully Convolutional Model for Variable Bit Length and Lossy High Density Compression of Mammograms. 2591-2594 - Jonatan Samuelsson, Per Hermansson:
Image compression with xvc. 2595-2597 - Mario González, Javier Preciozzi, Pablo Musé, Andrés Almansa:
Joint denoising and decompression using CNN regularization. 2598-2601 - Ogun Kirmemis, Gonca Bakar, A. Murat Tekalp:
Learned Compression Artifact Removal by Deep Residual Networks. 2602-2605 - Yu-Chuan Su, Kristen Grauman:
Learning Compressible 360° Video Isomers. 2606-2609 - Eli Ben-David, Sharon Carmel, Boris Filippov, Dror Gill, Alexey Martemyanov, Tamar Shoham, Nikolay Terterov, Pavel Tiktov, Tom Vaughan, Alexander Zheludkov:
Perceptually optimized low bit-rate image encoding. 2610-2612 - Zhengxue Cheng, Heming Sun, Masaru Takeuchi, Jiro Katto:
Performance Comparison of Convolutional AutoEncoders, Generative Adversarial Networks and Super-Resolution for Image Compression. 2613-2616 - Lei Zhou, Chunlei Cai, Yue Gao, Sanbao Su, Junmin Wu:
Variational Autoencoder for Low Bit-rate Image Compression. 2617-2620 - Yuchen Fan, Jiahui Yu, Thomas S. Huang:
Wide-activated Deep Residual Networks based Restoration for BPG-compressed Images. 2621-2624 - Dong Wei, Mei Yang:
YASO. 2625-2628