IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2018)

The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2018 is the premier annual computer vision event, comprising the main conference and several co-located workshops and short courses.

Location: Salt Lake City, Utah
Date: June 18-22, 2018
Main Conference and Exhibition: June 19-21
Workshops and Tutorials: June 18 and 22

With over 3,300 main-conference paper submissions and 979 accepted papers, CVPR 2018 offers a program covering a wide variety of state-of-the-art work in computer vision. In addition to the main program, CVPR 2018 includes 21 tutorials, 48 workshops, the annual doctoral consortium, and an industrial exhibition featuring over 115 companies.

CVPR 2018 Papers

Embodied Question Answering
Abhishek Das, Samyak Datta, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra

Learning by Asking Questions
Ishan Misra, Ross Girshick, Rob Fergus, Martial Hebert, Abhinav Gupta, Laurens van der Maaten

Finding Tiny Faces in the Wild With Generative Adversarial Network
Yancheng Bai, Yongqiang Zhang, Mingli Ding, Bernard Ghanem

Learning Face Age Progression: A Pyramid Architecture of GANs
Hongyu Yang, Di Huang, Yunhong Wang, Anil K. Jain

PairedCycleGAN: Asymmetric Style Transfer for Applying and Removing Makeup
Huiwen Chang, Jingwan Lu, Fisher Yu, Adam Finkelstein

GANerated Hands for Real-Time 3D Hand Tracking From Monocular RGB
Franziska Mueller, Florian Bernard, Oleksandr Sotnychenko, Dushyant Mehta, Srinath Sridhar, Dan Casas, Christian Theobalt

Learning Pose Specific Representations by Predicting Different Views
Georg Poier, David Schinagl, Horst Bischof

Weakly and Semi Supervised Human Body Part Parsing via Pose-Guided Knowledge Transfer
Hao-Shu Fang, Guansong Lu, Xiaolin Fang, Jianwen Xie, Yu-Wing Tai, Cewu Lu

Person Transfer GAN to Bridge Domain Gap for Person Re-Identification
Longhui Wei, Shiliang Zhang, Wen Gao, Qi Tian

Cross-Modal Deep Variational Hand Pose Estimation
Adrian Spurr, Jie Song, Seonwook Park, Otmar Hilliges

Disentangled Person Image Generation
Liqian Ma, Qianru Sun, Stamatios Georgoulis, Luc Van Gool, Bernt Schiele, Mario Fritz

Super-FAN: Integrated Facial Landmark Localization and Super-Resolution of Real-World Low Resolution Faces in Arbitrary Poses With GANs
Adrian Bulat, Georgios Tzimiropoulos

Multistage Adversarial Losses for Pose-Based Human Image Synthesis
Chenyang Si, Wei Wang, Liang Wang, Tieniu Tan

Rotation Averaging and Strong Duality
Anders Eriksson, Carl Olsson, Fredrik Kahl, Tat-Jun Chin

Hybrid Camera Pose Estimation
Federico Camposeco, Andrea Cohen, Marc Pollefeys, Torsten Sattler

A Certifiably Globally Optimal Solution to the Non-Minimal Relative Pose Problem
Jesus Briales, Laurent Kneip, Javier Gonzalez-Jimenez

Single View Stereo Matching
Yue Luo, Jimmy Ren, Mude Lin, Jiahao Pang, Wenxiu Sun, Hongsheng Li, Liang Lin

Fight Ill-Posedness With Ill-Posedness: Single-Shot Variational Depth Super-Resolution From Shading
Bjoern Haefner, Yvain Quéau, Thomas Möllenhoff, Daniel Cremers

Deep Depth Completion of a Single RGB-D Image
Yinda Zhang, Thomas Funkhouser

Multi-View Harmonized Bilinear Network for 3D Object Recognition
Tan Yu, Jingjing Meng, Junsong Yuan

PPFNet: Global Context Aware Local Features for Robust 3D Point Matching
Haowen Deng, Tolga Birdal, Slobodan Ilic

FoldingNet: Point Cloud Auto-Encoder via Deep Grid Deformation
Yaoqing Yang, Chen Feng, Yiru Shen, Dong Tian

A Papier-Mâché Approach to Learning 3D Surface Generation
Thibault Groueix, Matthew Fisher, Vladimir G. Kim, Bryan C. Russell, Mathieu Aubry

LEGO: Learning Edge With Geometry All at Once by Watching Videos
Zhenheng Yang, Peng Wang, Yang Wang, Wei Xu, Ram Nevatia

Five-Point Fundamental Matrix Estimation for Uncalibrated Cameras
Daniel Barath

PointFusion: Deep Sensor Fusion for 3D Bounding Box Estimation
Danfei Xu, Dragomir Anguelov, Ashesh Jain

Scalable Dense Non-Rigid Structure-From-Motion: A Grassmannian Perspective
Suryansh Kumar, Anoop Cherian, Yuchao Dai, Hongdong Li

GVCNN: Group-View Convolutional Neural Networks for 3D Shape Recognition
Yifan Feng, Zizhao Zhang, Xibin Zhao, Rongrong Ji, Yue Gao

Depth and Transient Imaging With Compressive SPAD Array Cameras
Qilin Sun, Xiong Dun, Yifan Peng, Wolfgang Heidrich

GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation
Xiaojuan Qi, Renjie Liao, Zhengzhe Liu, Raquel Urtasun, Jiaya Jia

Real-Time Seamless Single Shot 6D Object Pose Prediction
Bugra Tekin, Sudipta N. Sinha, Pascal Fua

Factoring Shape, Pose, and Layout From the 2D Image of a 3D Scene
Shubham Tulsiani, Saurabh Gupta, David F. Fouhey, Alexei A. Efros, Jitendra Malik

Monocular Relative Depth Perception With Web Stereo Data Supervision
Ke Xian, Chunhua Shen, Zhiguo Cao, Hao Lu, Yang Xiao, Ruibo Li, Zhenbo Luo

Spline Error Weighting for Robust Visual-Inertial Fusion
Hannes Ovrén, Per-Erik Forssén

Single-Image Depth Estimation Based on Fourier Domain Analysis
Jae-Han Lee, Minhyeok Heo, Kyung-Rae Kim, Chang-Su Kim

Unsupervised Learning of Monocular Depth Estimation and Visual Odometry With Deep Feature Reconstruction
Huangying Zhan, Ravi Garg, Chamara Saroj Weerasekera, Kejie Li, Harsh Agarwal, Ian Reid

Detect-and-Track: Efficient Pose Estimation in Videos
Rohit Girdhar, Georgia Gkioxari, Lorenzo Torresani, Manohar Paluri, Du Tran

Supervision-by-Registration: An Unsupervised Approach to Improve the Precision of Facial Landmark Detectors
Xuanyi Dong, Shoou-I Yu, Xinshuo Weng, Shih-En Wei, Yi Yang, Yaser Sheikh

Diversity Regularized Spatiotemporal Attention for Video-Based Person Re-Identification
Shuang Li, Slawomir Bak, Peter Carr, Xiaogang Wang

Style Aggregated Network for Facial Landmark Detection
Xuanyi Dong, Yan Yan, Wanli Ouyang, Yi Yang

Learning Deep Models for Face Anti-Spoofing: Binary or Auxiliary Supervision
Yaojie Liu, Amin Jourabloo, Xiaoming Liu

Deep Cost-Sensitive and Order-Preserving Feature Learning for Cross-Population Age Estimation
Kai Li, Junliang Xing, Chi Su, Weiming Hu, Yundong Zhang, Stephen Maybank

First-Person Hand Action Benchmark With RGB-D Videos and 3D Hand Pose Annotations
Guillermo Garcia-Hernando, Shanxin Yuan, Seungryul Baek, Tae-Kyun Kim

A Pose-Sensitive Embedding for Person Re-Identification With Expanded Cross Neighborhood Re-Ranking
M. Saquib Sarfraz, Arne Schumann, Andreas Eberle, Rainer Stiefelhagen

Disentangling 3D Pose in a Dendritic CNN for Unconstrained 2D Face Alignment
Amit Kumar, Rama Chellappa

A Hierarchical Generative Model for Eye Image Synthesis and Eye Gaze Estimation
Kang Wang, Rui Zhao, Qiang Ji

MiCT: Mixed 3D/2D Convolutional Tube for Human Action Recognition
Yizhou Zhou, Xiaoyan Sun, Zheng-Jun Zha, Wenjun Zeng

Learning to Estimate 3D Human Pose and Shape From a Single Color Image
Georgios Pavlakos, Luyang Zhu, Xiaowei Zhou, Kostas Daniilidis

Glimpse Clouds: Human Activity Recognition From Unstructured Feature Points
Fabien Baradel, Christian Wolf, Julien Mille, Graham W. Taylor

Context-Aware Deep Feature Compression for High-Speed Visual Tracking
Jongwon Choi, Hyung Jin Chang, Tobias Fischer, Sangdoo Yun, Kyuewang Lee, Jiyeoup Jeong, Yiannis Demiris, Jin Young Choi

Correlation Tracking via Joint Discrimination and Reliability Learning
Chong Sun, Dong Wang, Huchuan Lu, Ming-Hsuan Yang

PhaseNet for Video Frame Interpolation
Simone Meyer, Abdelaziz Djelouah, Brian McWilliams, Alexander Sorkine-Hornung, Markus Gross, Christopher Schroers

The Best of Both Worlds: Combining CNNs and Geometric Constraints for Hierarchical Motion Segmentation
Pia Bideau, Aruni RoyChowdhury, Rakesh R. Menon, Erik Learned-Miller

Hyperparameter Optimization for Tracking With Continuous Deep Q-Learning
Xingping Dong, Jianbing Shen, Wenguan Wang, Yu Liu, Ling Shao, Fatih Porikli

Scale-Transferrable Object Detection
Peng Zhou, Bingbing Ni, Cong Geng, Jianguo Hu, Yi Xu

A Prior-Less Method for Multi-Face Tracking in Unconstrained Videos
Chung-Ching Lin, Ying Hung

End-to-End Flow Correlation Tracking With Spatial-Temporal Attention
Zheng Zhu, Wei Wu, Wei Zou, Junjie Yan

Deep Texture Manifold for Ground Terrain Recognition
Jia Xue, Hang Zhang, Kristin Dana

Learning Superpixels With Segmentation-Aware Affinity Loss
Wei-Chih Tu, Ming-Yu Liu, Varun Jampani, Deqing Sun, Shao-Yi Chien, Ming-Hsuan Yang, Jan Kautz

Interactive Image Segmentation With Latent Diversity
Zhuwen Li, Qifeng Chen, Vladlen Koltun

The Unreasonable Effectiveness of Deep Features as a Perceptual Metric
Richard Zhang, Phillip Isola, Alexei A. Efros, Eli Shechtman, Oliver Wang

Local Descriptors Optimized for Average Precision
Kun He, Yan Lu, Stan Sclaroff

Recovering Realistic Texture in Image Super-Resolution by Deep Spatial Feature Transform
Xintao Wang, Ke Yu, Chao Dong, Chen Change Loy

Deep Extreme Cut: From Extreme Points to Object Segmentation
Kevis-Kokitsi Maninis, Sergi Caelles, Jordi Pont-Tuset, Luc Van Gool

Learning to Parse Wireframes in Images of Man-Made Environments
Kun Huang, Yifan Wang, Zihan Zhou, Tianjiao Ding, Shenghua Gao, Yi Ma

Occlusion-Aware Rolling Shutter Rectification of 3D Scenes
Subeesh Vasu, Mahesh Mohan M. R., A. N. Rajagopalan

Content-Sensitive Supervoxels via Uniform Tessellations on Video Manifolds
Ran Yi, Yong-Jin Liu, Yu-Kun Lai

Intrinsic Image Transformation via Scale Space Decomposition
Lechao Cheng, Chengyi Zhang, Zicheng Liao

Learned Shape-Tailored Descriptors for Segmentation
Naeemullah Khan, Ganesh Sundaramoorthi

PAD-Net: Multi-Tasks Guided Prediction-and-Distillation Network for Simultaneous Depth Estimation and Scene Parsing
Dan Xu, Wanli Ouyang, Xiaogang Wang, Nicu Sebe

Multi-Image Semantic Matching by Mining Consistent Features
Qianqian Wang, Xiaowei Zhou, Kostas Daniilidis

Density-Aware Single Image De-Raining Using a Multi-Stream Dense Network
He Zhang, Vishal M. Patel

Joint Cuts and Matching of Partitions in One Graph
Tianshu Yu, Junchi Yan, Jieyi Zhao, Baoxin Li

Progressive Attention Guided Recurrent Network for Salient Object Detection
Xiaoning Zhang, Tiantian Wang, Jinqing Qi, Huchuan Lu, Gang Wang

Fast and Accurate Single Image Super-Resolution via Information Distillation Network
Zheng Hui, Xiumei Wang, Xinbo Gao

Hallucinated-IQA: No-Reference Image Quality Assessment via Adversarial Learning
Kwan-Yee Lin, Guanxiang Wang

NAG: Network for Adversary Generation
Konda Reddy Mopuri, Utkarsh Ojha, Utsav Garg, R. Venkatesh Babu

Dynamic-Structured Semantic Propagation Network
Xiaodan Liang, Hongfei Zhou, Eric Xing

Cross-Domain Self-Supervised Multi-Task Feature Learning Using Synthetic Imagery
Zhongzheng Ren, Yong Jae Lee

A Two-Step Disentanglement Method
Naama Hadad, Lior Wolf, Moni Shahar

Robust Facial Landmark Detection via a Fully-Convolutional Local-Global Context Network
Daniel Merget, Matthias Rock, Gerhard Rigoll

Decorrelated Batch Normalization
Lei Huang, Dawei Yang, Bo Lang, Jia Deng

Learning to Sketch With Shortcut Cycle Consistency
Jifei Song, Kaiyue Pang, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales

Towards a Mathematical Understanding of the Difficulty in Learning With Feedforward Neural Networks
Hao Shen

FaceID-GAN: Learning a Symmetry Three-Player GAN for Identity-Preserving Face Synthesis
Yujun Shen, Ping Luo, Junjie Yan, Xiaogang Wang, Xiaoou Tang

A Constrained Deep Neural Network for Ordinal Regression
Yanzhu Liu, Adams Wai Kin Kong, Chi Keong Goh

Modulated Convolutional Networks
Xiaodi Wang, Baochang Zhang, Ce Li, Rongrong Ji, Jungong Han, Xianbin Cao, Jianzhuang Liu

Learning Steerable Filters for Rotation Equivariant CNNs
Maurice Weiler, Fred A. Hamprecht, Martin Storath

Efficient Interactive Annotation of Segmentation Datasets With Polygon-RNN++
David Acuna, Huan Ling, Amlan Kar, Sanja Fidler

SplineCNN: Fast Geometric Deep Learning With Continuous B-Spline Kernels
Matthias Fey, Jan Eric Lenssen, Frank Weichert, Heinrich Müller

GAGAN: Geometry-Aware Generative Adversarial Networks
Jean Kossaifi, Linh Tran, Yannis Panagakis, Maja Pantic

On the Robustness of Semantic Segmentation Models to Adversarial Attacks
Anurag Arnab, Ondrej Miksik, Philip H.S. Torr

Feedback-Prop: Convolutional Neural Network Inference Under Partial Evidence
Tianlu Wang, Kota Yamaguchi, Vicente Ordonez

Super-Resolving Very Low-Resolution Face Images With Supplementary Attributes
Xin Yu, Basura Fernando, Richard Hartley, Fatih Porikli

Frustum PointNets for 3D Object Detection From RGB-D Data
Charles R. Qi, Wei Liu, Chenxia Wu, Hao Su, Leonidas J. Guibas

W2F: A Weakly-Supervised to Fully-Supervised Framework for Object Detection
Yongqiang Zhang, Yancheng Bai, Mingli Ding, Yongqiang Li, Bernard Ghanem

3D Object Detection With Latent Support Surfaces
Zhile Ren, Erik B. Sudderth

Towards Faster Training of Global Covariance Pooling Networks by Iterative Matrix Square Root Normalization
Peihua Li, Jiangtao Xie, Qilong Wang, Zilin Gao

Recurrent Scene Parsing With Perspective Understanding in the Loop
Shu Kong, Charless C. Fowlkes

Improving Occlusion and Hard Negative Handling for Single-Stage Pedestrian Detectors
Junhyug Noh, Soochan Lee, Beomsu Kim, Gunhee Kim

Learning to Act Properly: Predicting and Explaining Affordances From Images
Ching-Yao Chuang, Jiaman Li, Antonio Torralba, Sanja Fidler

Pointwise Convolutional Neural Networks
Binh-Son Hua, Minh-Khoi Tran, Sai-Kit Yeung

Image-Image Domain Adaptation With Preserved Self-Similarity and Domain-Dissimilarity for Person Re-Identification
Weijian Deng, Liang Zheng, Qixiang Ye, Guoliang Kang, Yi Yang, Jianbin Jiao

A Generative Adversarial Approach for Zero-Shot Learning From Noisy Texts
Yizhe Zhu, Mohamed Elhoseiny, Bingchen Liu, Xi Peng, Ahmed Elgammal

Tensorize, Factorize and Regularize: Robust Visual Relationship Learning
Seong Jae Hwang, Sathya N. Ravi, Zirui Tao, Hyunwoo J. Kim, Maxwell D. Collins, Vikas Singh

Transductive Unbiased Embedding for Zero-Shot Learning
Jie Song, Chengchao Shen, Yezhou Yang, Yang Liu, Mingli Song

Hierarchical Novelty Detection for Visual Object Recognition
Kibok Lee, Kimin Lee, Kyle Min, Yuting Zhang, Jinwoo Shin, Honglak Lee

Zero-Shot Visual Recognition Using Semantics-Preserving Adversarial Embedding Networks
Long Chen, Hanwang Zhang, Jun Xiao, Wei Liu, Shih-Fu Chang

Learning Rich Features for Image Manipulation Detection
Peng Zhou, Xintong Han, Vlad I. Morariu, Larry S. Davis

Human Semantic Parsing for Person Re-Identification
Mahdi M. Kalayeh, Emrah Basaran, Muhittin Gökmen, Mustafa E. Kamasak, Mubarak Shah

Stacked Latent Attention for Multimodal Reasoning
Haoqi Fan, Jiatong Zhou

R-FCN-3000 at 30fps: Decoupling Detection and Classification
Bharat Singh, Hengduo Li, Abhishek Sharma, Larry S. Davis

CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes
Yuhong Li, Xiaofan Zhang, Deming Chen

Revisiting Knowledge Transfer for Training Object Class Detectors
Jasper Uijlings, Stefan Popov, Vittorio Ferrari

Deep Sparse Coding for Invariant Multimodal Halle Berry Neurons
Edward Kim, Darryl Hannan, Garrett Kenyon

On the Convergence of PatchMatch and Its Variants
Thibaud Ehret, Pablo Arias

Rethinking the Faster R-CNN Architecture for Temporal Action Localization
Yu-Wei Chao, Sudheendra Vijayanarasimhan, Bryan Seybold, David A. Ross, Jia Deng, Rahul Sukthankar

MoNet: Deep Motion Exploitation for Video Object Segmentation
Huaxin Xiao, Jiashi Feng, Guosheng Lin, Yu Liu, Maojun Zhang

Video Representation Learning Using Discriminative Pooling
Jue Wang, Anoop Cherian, Fatih Porikli, Stephen Gould

Recognizing Human Actions as the Evolution of Pose Estimation Maps
Mengyuan Liu, Junsong Yuan

Video Person Re-Identification With Competitive Snippet-Similarity Aggregation and Co-Attentive Snippet Embedding
Dapeng Chen, Hongsheng Li, Tong Xiao, Shuai Yi, Xiaogang Wang

Mask-Guided Contrastive Attention Model for Person Re-Identification
Chunfeng Song, Yan Huang, Wanli Ouyang, Liang Wang

Blazingly Fast Video Object Segmentation With Pixel-Wise Metric Learning
Yuhua Chen, Jordi Pont-Tuset, Alberto Montes, Luc Van Gool

Learning to Compare: Relation Network for Few-Shot Learning
Flood Sung, Yongxin Yang, Li Zhang, Tao Xiang, Philip H.S. Torr, Timothy M. Hospedales

COCO-Stuff: Thing and Stuff Classes in Context
Holger Caesar, Jasper Uijlings, Vittorio Ferrari

Image Generation From Scene Graphs
Justin Johnson, Agrim Gupta, Li Fei-Fei

Deep Cauchy Hashing for Hamming Space Retrieval
Yue Cao, Mingsheng Long, Bin Liu, Jianmin Wang

Learning to Look Around: Intelligently Exploring Unseen Environments for Unknown Tasks
Dinesh Jayaraman, Kristen Grauman

Multi-Scale Location-Aware Kernel Representation for Object Detection
Hao Wang, Qilong Wang, Mingqi Gao, Peihua Li, Wangmeng Zuo

Clinical Skin Lesion Diagnosis Using Representations Inspired by Dermatologist Criteria
Jufeng Yang, Xiaoxiao Sun, Jie Liang, Paul L. Rosin

Compare and Contrast: Learning Prominent Visual Differences
Steven Chen, Kristen Grauman

Multi-Evidence Filtering and Fusion for Multi-Label Classification, Object Detection and Semantic Segmentation Based on Weakly Supervised Learning
Weifeng Ge, Sibei Yang, Yizhou Yu

HashGAN: Deep Learning to Hash With Pair Conditional Wasserstein GAN
Yue Cao, Bin Liu, Mingsheng Long, Jianmin Wang

Min-Entropy Latent Model for Weakly Supervised Object Detection
Fang Wan, Pengxu Wei, Jianbin Jiao, Zhenjun Han, Qixiang Ye

MAttNet: Modular Attention Network for Referring Expression Comprehension
Licheng Yu, Zhe Lin, Xiaohui Shen, Jimei Yang, Xin Lu, Mohit Bansal, Tamara L. Berg

AttnGAN: Fine-Grained Text to Image Generation With Attentional Generative Adversarial Networks
Tao Xu, Pengchuan Zhang, Qiuyuan Huang, Han Zhang, Zhe Gan, Xiaolei Huang, Xiaodong He

Adversarial Complementary Learning for Weakly Supervised Object Localization
Xiaolin Zhang, Yunchao Wei, Jiashi Feng, Yi Yang, Thomas S. Huang

Conditional Generative Adversarial Network for Structured Domain Adaptation
Weixiang Hong, Zhenzhen Wang, Ming Yang, Junsong Yuan

GroupCap: Group-Based Image Captioning With Structured Relevance and Diversity Constraints
Fuhai Chen, Rongrong Ji, Xiaoshuai Sun, Yongjian Wu, Jinsong Su

Weakly-Supervised Semantic Segmentation by Iteratively Mining Common Object Features
Xiang Wang, Shaodi You, Xi Li, Huimin Ma

Bootstrapping the Performance of Webly Supervised Semantic Segmentation
Tong Shen, Guosheng Lin, Chunhua Shen, Ian Reid

DeepVoting: A Robust and Explainable Deep Network for Semantic Part Detection Under Partial Occlusion
Zhishuai Zhang, Cihang Xie, Jianyu Wang, Lingxi Xie, Alan L. Yuille

Geometry-Aware Scene Text Detection With Instance Transformation Network
Fangfang Wang, Liming Zhao, Xi Li, Xinchao Wang, Dacheng Tao

Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition
Shuyang Sun, Zhanghui Kuang, Lu Sheng, Wanli Ouyang, Wei Zhang

Motion-Guided Cascaded Refinement Network for Video Object Segmentation
Ping Hu, Gang Wang, Xiangfei Kong, Jason Kuen, Yap-Peng Tan

A Memory Network Approach for Story-Based Temporal Summarization of 360° Videos
Sangho Lee, Jinyoung Sung, Youngjae Yu, Gunhee Kim

Cube Padding for Weakly-Supervised Saliency Prediction in 360° Videos
Hsien-Tzu Cheng, Chun-Hung Chao, Jin-Dong Dong, Hao-Kai Wen, Tyng-Luh Liu, Min Sun

Appearance-and-Relation Networks for Video Classification
Limin Wang, Wei Li, Wen Li, Luc Van Gool

Excitation Backprop for RNNs
Sarah Adel Bargal, Andrea Zunino, Donghyun Kim, Jianming Zhang, Vittorio Murino, Stan Sclaroff

One-Shot Action Localization by Learning Sequence Matching Network
Hongtao Yang, Xuming He, Fatih Porikli

Structure Preserving Video Prediction
Jingwei Xu, Bingbing Ni, Zefan Li, Shuo Cheng, Xiaokang Yang

Person Re-Identification With Cascaded Pairwise Convolutions
Yicheng Wang, Zhenzhong Chen, Feng Wu, Gang Wang

On the Importance of Label Quality for Semantic Segmentation
Aleksandar Zlateski, Ronnachai Jaroensri, Prafull Sharma, Frédo Durand

Scalable and Effective Deep CCA via Soft Decorrelation
Xiaobin Chang, Tao Xiang, Timothy M. Hospedales

Duplex Generative Adversarial Network for Unsupervised Domain Adaptation
Lanqing Hu, Meina Kan, Shiguang Shan, Xilin Chen

Edit Probability for Scene Text Recognition
Fan Bai, Zhanzhan Cheng, Yi Niu, Shiliang Pu, Shuigeng Zhou

Global Versus Localized Generative Adversarial Nets
Guo-Jun Qi, Liheng Zhang, Hao Hu, Marzieh Edraki, Jingdong Wang, Xian-Sheng Hua

MoCoGAN: Decomposing Motion and Content for Video Generation
Sergey Tulyakov, Ming-Yu Liu, Xiaodong Yang, Jan Kautz

Recurrent Residual Module for Fast Inference in Videos
Bowen Pan, Wuwei Lin, Xiaolin Fang, Chaoqin Huang, Bolei Zhou, Cewu Lu

Improving Landmark Localization With Semi-Supervised Learning
Sina Honari, Pavlo Molchanov, Stephen Tyree, Pascal Vincent, Christopher Pal, Jan Kautz

Adversarial Data Programming: Using GANs to Relax the Bottleneck of Curated Labeled Data
Arghya Pal, Vineeth N. Balasubramanian

Stochastic Variational Inference With Gradient Linearization
Tobias Plötz, Anne S. Wannenwetsch, Stefan Roth

Multi-Label Zero-Shot Learning With Structured Knowledge Graphs
Chung-Wei Lee, Wei Fang, Chih-Kuan Yeh, Yu-Chiang Frank Wang

MorphNet: Fast & Simple Resource-Constrained Structure Learning of Deep Networks
Ariel Gordon, Elad Eban, Ofir Nachum, Bo Chen, Hao Wu, Tien-Ju Yang, Edward Choi

Deep Adversarial Subspace Clustering
Pan Zhou, Yunqing Hou, Jiashi Feng

Towards Human-Machine Cooperation: Self-Supervised Sample Mining for Object Detection
Keze Wang, Xiaopeng Yan, Dongyu Zhang, Lei Zhang, Liang Lin

Discrete-Continuous ADMM for Transductive Inference in Higher-Order MRFs
Emanuel Laude, Jan-Hendrik Lange, Jonas Schüpfer, Csaba Domokos, Laura Leal-Taixé, Frank R. Schmidt, Bjoern Andres, Daniel Cremers

Robust Physical-World Attacks on Deep Learning Visual Classification
Kevin Eykholt, Ivan Evtimov, Earlence Fernandes, Bo Li, Amir Rahmati, Chaowei Xiao, Atul Prakash, Tadayoshi Kohno, Dawn Song

Generating a Fusion Image: One's Identity and Another's Shape
DongGyu Joo, Doyeon Kim, Junmo Kim

Learning to Promote Saliency Detectors
Yu Zeng, Huchuan Lu, Lihe Zhang, Mengyang Feng, Ali Borji

Image Super-Resolution via Dual-State Recurrent Networks
Wei Han, Shiyu Chang, Ding Liu, Mo Yu, Michael Witbrock, Thomas S. Huang

Deep Back-Projection Networks for Super-Resolution
Muhammad Haris, Gregory Shakhnarovich, Norimichi Ukita

Focus Manipulation Detection via Photometric Histogram Analysis
Can Chen, Scott McCloskey, Jingyi Yu

Compassionately Conservative Balanced Cuts for Image Segmentation
Nathan D. Cahill, Tyler L. Hayes, Renee T. Meinhold, John F. Hamilton

A High-Quality Denoising Dataset for Smartphone Cameras
Abdelrahman Abdelhamed, Stephen Lin, Michael S. Brown

Context-Aware Synthesis for Video Frame Interpolation
Simon Niklaus, Feng Liu

Salient Object Detection Driven by Fixation Prediction
Wenguan Wang, Jianbing Shen, Xingping Dong, Ali Borji

Enhancing the Spatial Resolution of Stereo Images Using a Parallax Prior
Daniel S. Jeon, Seung-Hwan Baek, Inchang Choi, Min H. Kim

HATS: Histograms of Averaged Time Surfaces for Robust Event-Based Object Classification
Amos Sironi, Manuele Brambilla, Nicolas Bourdis, Xavier Lagorce, Ryad Benosman

A Bi-Directional Message Passing Model for Salient Object Detection
Lu Zhang, Ju Dai, Huchuan Lu, You He, Gang Wang

Matching Pixels Using Co-Occurrence Statistics
Rotal Kat, Roy Jevnisek, Shai Avidan

SeedNet: Automatic Seed Generation With Deep Reinforcement Learning for Robust Interactive Segmentation
Gwangmo Song, Heesoo Myeong, Kyoung Mu Lee

Jerk-Aware Video Acceleration Magnification
Shoichiro Takeda, Kazuki Okami, Dan Mikami, Megumi Isogai, Hideaki Kimata

Defense Against Adversarial Attacks Using High-Level Representation Guided Denoiser
Fangzhou Liao, Ming Liang, Yinpeng Dong, Tianyu Pang, Xiaolin Hu, Jun Zhu

Stacked Conditional Generative Adversarial Networks for Jointly Learning Shadow Detection and Shadow Removal
Jifeng Wang, Xiang Li, Jian Yang

Image Correction via Deep Reciprocating HDR Transformation
Xin Yang, Ke Xu, Yibing Song, Qiang Zhang, Xiaopeng Wei, Rynson W.H. Lau

PieAPP: Perceptual Image-Error Assessment Through Pairwise Preference
Ekta Prashnani, Hong Cai, Yasamin Mostofi, Pradeep Sen

Normalized Cut Loss for Weakly-Supervised CNN Segmentation
Meng Tang, Abdelaziz Djelouah, Federico Perazzi, Yuri Boykov, Christopher Schroers

ISTA-Net: Interpretable Optimization-Inspired Deep Network for Image Compressive Sensing
Jian Zhang, Bernard Ghanem

Fast End-to-End Trainable Guided Filter
Huikai Wu, Shuai Zheng, Junge Zhang, Kaiqi Huang

Disentangling Structure and Aesthetics for Style-Aware Image Completion
Andrew Gilbert, John Collomosse, Hailin Jin, Brian Price

Learning a Discriminative Feature Network for Semantic Segmentation
Changqian Yu, Jingbo Wang, Chao Peng, Changxin Gao, Gang Yu, Nong Sang

Kernelized Subspace Pooling for Deep Local Descriptors
Xing Wei, Yue Zhang, Yihong Gong, Nanning Zheng

pOSE: Pseudo Object Space Error for Initialization-Free Bundle Adjustment
Je Hyeong Hong, Christopher Zach

Deformable Shape Completion With Graph Convolutional Autoencoders
Or Litany, Alex Bronstein, Michael Bronstein, Ameesh Makadia

Learning From Millions of 3D Scans for Large-Scale 3D Face Recognition
Syed Zulqarnain Gilani, Ajmal Mian

CarFusion: Combining Point Tracking and Part Detection for Dynamic 3D Reconstruction of Vehicles
N. Dinesh Reddy, Minh Vo, Srinivasa G. Narasimhan

Deep Material-Aware Cross-Spectral Stereo Matching
Tiancheng Zhi, Bernardo R. Pires, Martial Hebert, Srinivasa G. Narasimhan

Augmenting Crowd-Sourced 3D Reconstructions Using Semantic Detections
True Price, Johannes L. Schönberger, Zhen Wei, Marc Pollefeys, Jan-Michael Frahm

Matryoshka Networks: Predicting 3D Geometry via Nested Shape Layers
Stephan R. Richter, Stefan Roth

Triplet-Center Loss for Multi-View 3D Object Retrieval
Xinwei He, Yang Zhou, Zhichao Zhou, Song Bai, Xiang Bai

Learning 3D Shape Completion From Laser Scan Data With Weak Supervision
David Stutz, Andreas Geiger

End-to-End Learning of Keypoint Detector and Descriptor for Pose Invariant 3D Matching
Georgios Georgakis, Srikrishna Karanam, Ziyan Wu, Jan Ernst, Jana Košecká

ICE-BA: Incremental, Consistent and Efficient Bundle Adjustment for Visual-Inertial SLAM
Haomin Liu, Mingyu Chen, Guofeng Zhang, Hujun Bao, Yingze Bao

GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose
Zhichao Yin, Jianping Shi

Radially-Distorted Conjugate Translations
James Pritts, Zuzana Kukelova, Viktor Larsson, Ondřej Chum

Deep Ordinal Regression Network for Monocular Depth Estimation
Huan Fu, Mingming Gong, Chaohui Wang, Kayhan Batmanghelich, Dacheng Tao

Analytical Modeling of Vanishing Points and Curves in Catadioptric Cameras
Pedro Miraldo, Francisco Eiras, Srikumar Ramalingam

Learning Depth From Monocular Videos Using Direct Methods
Chaoyang Wang, José Miguel Buenaposada, Rui Zhu, Simon Lucey

Salience Guided Depth Calibration for Perceptually Optimized Compressive Light Field 3D Display
Shizheng Wang, Wenjuan Liao, Phil Surman, Zhigang Tu, Yuanjin Zheng, Junsong Yuan

MegaDepth: Learning Single-View Depth Prediction From Internet Photos
Zhengqi Li, Noah Snavely

LayoutNet: Reconstructing the 3D Room Layout From a Single RGB Image
Chuhang Zou, Alex Colburn, Qi Shan, Derek Hoiem

CBMV: A Coalesced Bidirectional Matching Volume for Disparity Estimation
Konstantinos Batsos, Changjiang Cai, Philippos Mordohai

Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains
Jiahao Pang, Wenxiu Sun, Chengxi Yang, Jimmy Ren, Ruichao Xiao, Jin Zeng, Liang Lin

Exploring Disentangled Feature Representation Beyond Face Identification
Yu Liu, Fangyin Wei, Jing Shao, Lu Sheng, Junjie Yan, Xiaogang Wang

Learning Facial Action Units From Web Images With Scalable Weakly Supervised Clustering
Kaili Zhao, Wen-Sheng Chu, Aleix M. Martinez

Human Pose Estimation With Parsing Induced Learner
Xuecheng Nie, Jiashi Feng, Yiming Zuo, Shuicheng Yan

Multi-Level Factorisation Net for Person Re-Identification
Xiaobin Chang, Timothy M. Hospedales, Tao Xiang

Attention-Aware Compositional Network for Person Re-Identification
Jing Xu, Rui Zhao, Feng Zhu, Huaming Wang, Wanli Ouyang

Look at Boundary: A Boundary-Aware Face Alignment Algorithm
Wayne Wu, Chen Qian, Shuo Yang, Quan Wang, Yici Cai, Qiang Zhou

Demo2Vec: Reasoning Object Affordances From Online Videos
Kuan Fang, Te-Lin Wu, Daniel Yang, Silvio Savarese, Joseph J. Lim

Monocular 3D Pose and Shape Estimation of Multiple People in Natural Scenes - The Importance of Multiple Scene Constraints
Andrei Zanfir, Elisabeta Marinoiu, Cristian Sminchisescu

3D Human Sensing, Action and Emotion Recognition in Robot Assisted Therapy of Children With Autism
Elisabeta Marinoiu, Mihai Zanfir, Vlad Olaru, Cristian Sminchisescu

Facial Expression Recognition by De-Expression Residue Learning
Huiyuan Yang, Umur Ciftci, Lijun Yin

A Causal And-Or Graph Model for Visibility Fluent Reasoning in Tracking Interacting Objects
Yuanlu Xu, Lei Qin, Xiaobai Liu, Jianwen Xie, Song-Chun Zhu

Weakly Supervised Facial Action Unit Recognition Through Adversarial Training
Guozhu Peng, Shangfei Wang

Non-Linear Temporal Subspace Representations for Activity Recognition
Anoop Cherian, Suvrit Sra, Stephen Gould, Richard Hartley

Towards Pose Invariant Face Recognition in the Wild
Jian Zhao, Yu Cheng, Yan Xu, Lin Xiong, Jianshu Li, Fang Zhao, Karlekar Jayashree, Sugiri Pranata, Shengmei Shen, Junliang Xing, Shuicheng Yan, Jiashi Feng

Unifying Identification and Context Learning for Person Recognition
Qingqiu Huang, Yu Xiong, Dahua Lin

Jointly Optimize Data Augmentation and Network Training: Adversarial Data Augmentation in Human Pose Estimation
Xi Peng, Zhiqiang Tang, Fei Yang, Rogerio S. Feris, Dimitris Metaxas

Wing Loss for Robust Facial Landmark Localisation With Convolutional Neural Networks
Zhen-Hua Feng, Josef Kittler, Muhammad Awais, Patrik Huber, Xiao-Jun Wu

Multiple Granularity Group Interaction Prediction
Taiping Yao, Minsi Wang, Bingbing Ni, Huawei Wei, Xiaokang Yang

Social GAN: Socially Acceptable Trajectories With Generative Adversarial Networks
Agrim Gupta, Justin Johnson, Li Fei-Fei, Silvio Savarese, Alexandre Alahi

Deep Group-Shuffling Random Walk for Person Re-Identification
Yantao Shen, Hongsheng Li, Tong Xiao, Shuai Yi, Dapeng Chen, Xiaogang Wang

Transferable Joint Attribute-Identity Deep Learning for Unsupervised Person Re-Identification
Jingya Wang, Xiatian Zhu, Shaogang Gong, Wei Li

Harmonious Attention Network for Person Re-Identification
Wei Li, Xiatian Zhu, Shaogang Gong

Real-Time Rotation-Invariant Face Detection With Progressive Calibration Networks
Xuepeng Shi, Shiguang Shan, Meina Kan, Shuzhe Wu, Xilin Chen

Deep Regression Forests for Age Estimation
Wei Shen, Yilu Guo, Yan Wang, Kai Zhao, Bo Wang, Alan L. Yuille

Weakly-Supervised Deep Convolutional Neural Network Learning for Facial Action Unit Intensity Estimation
Yong Zhang, Weiming Dong, Bao-Gang Hu, Qiang Ji

Memory Based Online Learning of Deep Representations From Video Streams
Federico Pernici, Federico Bartoli, Matteo Bruni, Alberto Del Bimbo

Efficient and Deep Person Re-Identification Using Multi-Level Similarity
Yiluan Guo, Ngai-Man Cheung

Multi-Level Fusion Based 3D Object Detection From Monocular Images
Bin Xu, Zhenzhong Chen

A Perceptual Measure for Deep Single Image Camera Calibration
Yannick Hold-Geoffroy, Kalyan Sunkavalli, Jonathan Eisenmann, Matthew Fisher, Emiliano Gambaretto, Sunil Hadap, Jean-François Lalonde

Learning to Generate Time-Lapse Videos Using Multi-Stage Dynamic Generative Adversarial Networks
Wei Xiong, Wenhan Luo, Lin Ma, Wei Liu, Jiebo Luo

Document Enhancement Using Visibility Detection
Netanel Kligler, Sagi Katz, Ayellet Tal

A Weighted Sparse Sampling and Smoothing Frame Transition Approach for Semantic Fast-Forward First-Person Videos
Michel Silva, Washington Ramos, João Ferreira, Felipe Chamone, Mario Campos, Erickson R. Nascimento

Context Contrasted Feature and Gated Multi-Scale Aggregation for Scene Segmentation
Henghui Ding, Xudong Jiang, Bing Shuai, Ai Qun Liu, Gang Wang

Deep Layer Aggregation
Fisher Yu, Dequan Wang, Evan Shelhamer, Trevor Darrell

Convolutional Neural Networks With Alternately Updated Clique
Yibo Yang, Zhisheng Zhong, Tiancheng Shen, Zhouchen Lin

Practical Block-Wise Neural Network Architecture Generation
Zhao Zhong, Junjie Yan, Wei Wu, Jing Shao, Cheng-Lin Liu

xUnit: Learning a Spatial Activation Function for Efficient Image Restoration
Idan Kligvasser, Tamar Rott Shaham, Tomer Michaeli

Crafting a Toolchain for Image Restoration by Deep Reinforcement Learning
Ke Yu, Chao Dong, Liang Lin, Chen Change Loy

Deformation Aware Image Compression
Tamar Rott Shaham, Tomer Michaeli

Distributable Consistent Multi-Object Matching
Nan Hu, Qixing Huang, Boris Thibert, Leonidas J. Guibas

Residual Dense Network for Image Super-Resolution
Yulun Zhang, Yapeng Tian, Yu Kong, Bineng Zhong, Yun Fu

Attentive Generative Adversarial Network for Raindrop Removal From a Single Image
Rui Qian, Robby T. Tan, Wenhan Yang, Jiajun Su, Jiaying Liu

FSRNet: End-to-End Learning Face Super-Resolution With Facial Priors
Yu Chen, Ying Tai, Xiaoming Liu, Chunhua Shen, Jian Yang

Burst Denoising With Kernel Prediction Networks
Ben Mildenhall, Jonathan T. Barron, Jiawen Chen, Dillon Sharlet, Ren Ng, Robert Carroll

Unsupervised Sparse Dirichlet-Net for Hyperspectral Image Super-Resolution
Ying Qu, Hairong Qi, Chiman Kwan

Dynamic Scene Deblurring Using Spatially Variant Recurrent Neural Networks
Jiawei Zhang, Jinshan Pan, Jimmy Ren, Yibing Song, Linchao Bao, Rynson W.H. Lau, Ming-Hsuan Yang

SPLATNet: Sparse Lattice Networks for Point Cloud Processing
Hang Su, Varun Jampani, Deqing Sun, Subhransu Maji, Evangelos Kalogerakis, Ming-Hsuan Yang, Jan Kautz

Surface Networks
Ilya Kostrikov, Zhongshi Jiang, Daniele Panozzo, Denis Zorin, Joan Bruna

Self-Supervised Multi-Level Face Model Learning for Monocular Reconstruction at Over 250 Hz
Ayush Tewari, Michael Zollhöfer, Pablo Garrido, Florian Bernard, Hyeongwoo Kim, Patrick Pérez, Christian Theobalt

CodeSLAM — Learning a Compact, Optimisable Representation for Dense Visual SLAM
Michael Bloesch, Jan Czarnowski, Ronald Clark, Stefan Leutenegger, Andrew J. Davison

SGPN: Similarity Group Proposal Network for 3D Point Cloud Instance Segmentation
Weiyue Wang, Ronald Yu, Qiangui Huang, Ulrich Neumann

PlaneNet: Piece-Wise Planar Reconstruction From a Single RGB Image
Chen Liu, Jimei Yang, Duygu Ceylan, Ersin Yumer, Yasutaka Furukawa

Deep Parametric Continuous Convolutional Neural Networks
Shenlong Wang, Simon Suo, Wei-Chiu Ma, Andrei Pokrovsky, Raquel Urtasun

FeaStNet: Feature-Steered Graph Convolutions for 3D Shape Analysis
Nitika Verma, Edmond Boyer, Jakob Verbeek

Image Collection Pop-Up: 3D Reconstruction and Clustering of Rigid and Non-Rigid Categories
Antonio Agudo, Melcior Pijoan, Francesc Moreno-Noguer

Geometry-Aware Learning of Maps for Camera Localization
Samarth Brahmbhatt, Jinwei Gu, Kihwan Kim, James Hays, Jan Kautz

Recurrent Slice Networks for 3D Segmentation of Point Clouds
Qiangui Huang, Weiyue Wang, Ulrich Neumann

Depth-Based 3D Hand Pose Estimation: From Current Achievements to Future Goals
Shanxin Yuan, Guillermo Garcia-Hernando, Björn Stenger, Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee, Pavlo Molchanov, Jan Kautz, Sina Honari, Liuhao Ge, Junsong Yuan, Xinghao Chen, Guijin Wang, Fan Yang, Kai Akiyama, Yang Wu, Qingfu Wan, Meysam Madadi, Sergio Escalera, Shile Li, Dongheui Lee, Iason Oikonomidis, Antonis Argyros, Tae-Kyun Kim

SobolevFusion: 3D Reconstruction of Scenes Undergoing Free Non-Rigid Motion
Miroslava Slavcheva, Maximilian Baust, Slobodan Ilic

AdaDepth: Unsupervised Content Congruent Adaptation for Depth Estimation
Jogendra Nath Kundu, Phani Krishna Uppala, Anuj Pahuja, R. Venkatesh Babu

Learning to Find Good Correspondences
Kwang Moo Yi, Eduard Trulls, Yuki Ono, Vincent Lepetit, Mathieu Salzmann, Pascal Fua

OATM: Occlusion Aware Template Matching by Consensus Set Maximization
Simon Korman, Mark Milam, Stefano Soatto

Deep Learning of Graph Matching
Andrei Zanfir, Cristian Sminchisescu

Unsupervised Discovery of Object Landmarks as Structural Representations
Yuting Zhang, Yijie Guo, Yixin Jin, Yijun Luo, Zhiyuan He, Honglak Lee

Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference
Benoit Jacob, Skirmantas Kligys, Bo Chen, Menglong Zhu, Matthew Tang, Andrew Howard, Hartwig Adam, Dmitry Kalenichenko

Lean Multiclass Crowdsourcing
Grant Van Horn, Steve Branson, Scott Loarie, Serge Belongie, Pietro Perona

Partial Transfer Learning With Selective Adversarial Networks
Zhangjie Cao, Mingsheng Long, Jianmin Wang, Michael I. Jordan

Self-Supervised Feature Learning by Learning to Spot Artifacts
Simon Jenni, Paolo Favaro

LDMNet: Low Dimensional Manifold Regularized Neural Networks
Wei Zhu, Qiang Qiu, Jiaji Huang, Robert Calderbank, Guillermo Sapiro, Ingrid Daubechies

CondenseNet: An Efficient DenseNet Using Learned Group Convolutions
Gao Huang, Shichen Liu, Laurens van der Maaten, Kilian Q. Weinberger

Learning Deep Descriptors With Scale-Aware Triplet Networks
Michel Keller, Zetao Chen, Fabiola Maffra, Patrik Schmuck, Margarita Chli

Decoupled Networks
Weiyang Liu, Zhen Liu, Zhiding Yu, Bo Dai, Rongmei Lin, Yisen Wang, James M. Rehg, Le Song

Deep Adversarial Metric Learning
Yueqi Duan, Wenzhao Zheng, Xudong Lin, Jiwen Lu, Jie Zhou

PU-Net: Point Cloud Upsampling Network
Lequan Yu, Xianzhi Li, Chi-Wing Fu, Daniel Cohen-Or, Pheng-Ann Heng

Real-Time Monocular Depth Estimation Using Synthetic Data With Domain Adaptation via Image Style Transfer
Amir Atapour-Abarghouei, Toby P. Breckon

Learning for Disparity Estimation Through Feature Constancy
Zhengfa Liang, Yiliu Feng, Yulan Guo, Hengzhu Liu, Wei Chen, Linbo Qiao, Li Zhou, Jianfeng Zhang

DeepMVS: Learning Multi-View Stereopsis
Po-Han Huang, Kevin Matzen, Johannes Kopf, Narendra Ahuja, Jia-Bin Huang

Self-Calibrating Polarising Radiometric Calibration
Daniel Teo, Boxin Shi, Yinqiang Zheng, Sai-Kit Yeung

Coding Kendall's Shape Trajectories for 3D Action Recognition
Amor Ben Tanfous, Hassen Drira, Boulbaba Ben Amor

Efficient, Sparse Representation of Manifold Distance Matrices for Classical Scaling
Javier S. Turek, Alexander G. Huth

Motion Segmentation by Exploiting Complementary Geometric Models
Xun Xu, Loong Fah Cheong, Zhuwen Li

Estimation of Camera Locations in Highly Corrupted Scenarios: All About That Base, No Shape Trouble
Yunpeng Shi, Gilad Lerman

4D Human Body Correspondences From Panoramic Depth Maps
Zhong Li, Minye Wu, Wangyiteng Zhou, Jingyi Yu

Reconstructing Thin Structures of Manifold Surfaces by Integrating Spatial Curves
Shiwei Li, Yao Yao, Tian Fang, Long Quan

Multi-View Consistency as Supervisory Signal for Learning Shape and Pose Prediction
Shubham Tulsiani, Alexei A. Efros, Jitendra Malik

Probabilistic Plant Modeling via Multi-View Image-to-Image Translation
Takahiro Isokane, Fumio Okura, Ayaka Ide, Yasuyuki Matsushita, Yasushi Yagi

Deep Marching Cubes: Learning Explicit Surface Representations
Yiyi Liao, Simon Donné, Andreas Geiger

Tags2Parts: Discovering Semantic Regions From Shape Tags
Sanjeev Muralikrishnan, Vladimir G. Kim, Siddhartha Chaudhuri

Uncalibrated Photometric Stereo Under Natural Illumination
Zhipeng Mo, Boxin Shi, Feng Lu, Sai-Kit Yeung, Yasuyuki Matsushita

Robust Depth Estimation From Auto Bracketed Images
Sunghoon Im, Hae-Gon Jeon, In So Kweon

Free Supervision From Video Games
Philipp Krähenbühl

Planar Shape Detection at Structural Scales
Hao Fang, Florent Lafarge, Mathieu Desbrun

Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling
Xingyuan Sun, Jiajun Wu, Xiuming Zhang, Zhoutong Zhang, Chengkai Zhang, Tianfan Xue, Joshua B. Tenenbaum, William T. Freeman

Camera Pose Estimation With Unknown Principal Point
Viktor Larsson, Zuzana Kukelova, Yinqiang Zheng

Inverse Composition Discriminative Optimization for Point Cloud Registration
Jayakorn Vongkulbhisal, Beñat Irastorza Ugalde, Fernando De la Torre, João P. Costeira

SurfConv: Bridging 3D and 2D Convolution for RGBD Images
Hang Chu, Wei-Chiu Ma, Kaustav Kundu, Raquel Urtasun, Sanja Fidler

A Fast Resection-Intersection Method for the Known Rotation Problem
Qianggong Zhang, Tat-Jun Chin, Huu Minh Le

3D Pose Estimation and 3D Model Retrieval for Objects in the Wild
Alexander Grabner, Peter M. Roth, Vincent Lepetit

Structure From Recurrent Motion: From Rigidity to Recurrency
Xiu Li, Hongdong Li, Hanbyul Joo, Yebin Liu, Yaser Sheikh

Learning Patch Reconstructability for Accelerating Multi-View Stereo
Alex Poms, Chenglei Wu, Shoou-I Yu, Yaser Sheikh

Progressively Complementarity-Aware Fusion Network for RGB-D Salient Object Detection
Hao Chen, Youfu Li

Pixels, Voxels, and Views: A Study of Shape Representations for Single View 3D Object Shape Prediction
Daeyun Shin, Charless C. Fowlkes, Derek Hoiem

Learning Dual Convolutional Neural Networks for Low-Level Vision
Jinshan Pan, Sifei Liu, Deqing Sun, Jiawei Zhang, Yang Liu, Jimmy Ren, Zechao Li, Jinhui Tang, Huchuan Lu, Yu-Wing Tai, Ming-Hsuan Yang

Defocus Blur Detection via Multi-Stream Bottom-Top-Bottom Fully Convolutional Network
Wenda Zhao, Fan Zhao, Dong Wang, Huchuan Lu

PiCANet: Learning Pixel-Wise Contextual Attention for Saliency Detection
Nian Liu, Junwei Han, Ming-Hsuan Yang

Curve Reconstruction via the Global Statistics of Natural Curves
Ehud Barnea, Ohad Ben-Shahar

What Do Deep Networks Like to See?
Sebastian Palacio, Joachim Folz, Jörn Hees, Federico Raue, Damian Borth, Andreas Dengel

“Zero-Shot” Super-Resolution Using Deep Internal Learning
Assaf Shocher, Nadav Cohen, Michal Irani

Detect Globally, Refine Locally: A Novel Approach to Saliency Detection
Tiantian Wang, Lihe Zhang, Shuo Wang, Huchuan Lu, Gang Yang, Xiang Ruan, Ali Borji

Beyond the Pixel-Wise Loss for Topology-Aware Delineation
Agata Mosinska, Pablo Márquez-Neila, Mateusz Koziński, Pascal Fua

KIPPI: KInetic Polygonal Partitioning of Images
Jean-Philippe Bauchet, Florent Lafarge

Image Blind Denoising With Generative Adversarial Network Based Noise Modeling
Jingwen Chen, Jiawei Chen, Hongyang Chao, Ming Yang

Multi-Scale Weighted Nuclear Norm Image Restoration
Noam Yair, Tomer Michaeli

MoNet: Moments Embedding Network
Mengran Gou, Fei Xiong, Octavia Camps, Mario Sznaier

Active Fixation Control to Predict Saccade Sequences
Calden Wloka, Iuliia Kotseruba, John K. Tsotsos

Densely Connected Pyramid Dehazing Network
He Zhang, Vishal M. Patel

Universal Denoising Networks: A Novel CNN Architecture for Image Denoising
Stamatios Lefkimmiatis

Learning Convolutional Networks for Content-Weighted Image Compression
Mu Li, Wangmeng Zuo, Shuhang Gu, Debin Zhao, David Zhang

Deep Video Super-Resolution Network Using Dynamic Upsampling Filters Without Explicit Motion Compensation
Younghyun Jo, Seoung Wug Oh, Jaeyeon Kang, Seon Joo Kim

Erase or Fill? Deep Joint Recurrent Rain Removal and Reconstruction in Videos
Jiaying Liu, Wenhan Yang, Shuai Yang, Zongming Guo

Flow Guided Recurrent Neural Encoder for Video Salient Object Detection
Guanbin Li, Yuan Xie, Tianhao Wei, Keze Wang, Liang Lin

Gated Fusion Network for Single Image Dehazing
Wenqi Ren, Lin Ma, Jiawei Zhang, Jinshan Pan, Xiaochun Cao, Wei Liu, Ming-Hsuan Yang

Learning a Single Convolutional Super-Resolution Network for Multiple Degradations
Kai Zhang, Wangmeng Zuo, Lei Zhang

Non-Blind Deblurring: Handling Kernel Uncertainty With CNNs
Subeesh Vasu, Venkatesh Reddy Maligireddy, A. N. Rajagopalan

Boundary Flow: A Siamese Network That Predicts Boundary Motion Without Training on Motion
Peng Lei, Fuxin Li, Sinisa Todorovic

Learning to See in the Dark
Chen Chen, Qifeng Chen, Jia Xu, Vladlen Koltun

BPGrad: Towards Global Optimality in Deep Learning via Branch and Pruning
Ziming Zhang, Yuanwei Wu, Guanghui Wang

Perturbative Neural Networks
Felix Juefei-Xu, Vishnu Naresh Boddeti, Marios Savvides

Unsupervised Correlation Analysis
Yedid Hoshen, Lior Wolf

A Biresolution Spectral Framework for Product Quantization
Lopamudra Mukherjee, Sathya N. Ravi, Jiming Peng, Vikas Singh

Domain Adaptive Faster R-CNN for Object Detection in the Wild
Yuhua Chen, Wen Li, Christos Sakaridis, Dengxin Dai, Luc Van Gool

Low-Shot Learning With Large-Scale Diffusion
Matthijs Douze, Arthur Szlam, Bharath Hariharan, Hervé Jégou

Joint Pose and Expression Modeling for Facial Expression Recognition
Feifei Zhang, Tianzhu Zhang, Qirong Mao, Changsheng Xu

Lightweight Probabilistic Deep Networks
Jochen Gast, Stefan Roth

Adversarially Learned One-Class Classifier for Novelty Detection
Mohammad Sabokrou, Mohammad Khalooei, Mahmood Fathy, Ehsan Adeli

Defense Against Universal Adversarial Perturbations
Naveed Akhtar, Jian Liu, Ajmal Mian

Disentangling Factors of Variation by Mixing Them
Qiyang Hu, Attila Szabó, Tiziano Portenier, Paolo Favaro, Matthias Zwicker

Deformable GANs for Pose-Based Human Image Generation
Aliaksandr Siarohin, Enver Sangineto, Stéphane Lathuilière, Nicu Sebe

Hierarchical Recurrent Attention Networks for Structured Online Maps
Namdar Homayounfar, Wei-Chiu Ma, Shrinidhi Kowshika Lakshmikanth, Raquel Urtasun

Sliced Wasserstein Distance for Learning Gaussian Mixture Models
Soheil Kolouri, Gustavo K. Rohde, Heiko Hoffmann

Aligning Infinite-Dimensional Covariance Matrices in Reproducing Kernel Hilbert Spaces for Domain Adaptation
Zhen Zhang, Mianzhi Wang, Yan Huang, Arye Nehorai

CLEAR: Cumulative LEARning for One-Shot One-Class Image Recognition
Jedrzej Kozerawski, Matthew Turk

Local and Global Optimization Techniques in Graph-Based Clustering
Daiki Ikami, Toshihiko Yamasaki, Kiyoharu Aizawa

Multi-Task Learning by Maximizing Statistical Dependence
Youssef A. Mejjati, Darren Cosker, Kwang In Kim

Robust Classification With Convolutional Prototype Learning
Hong-Ming Yang, Xu-Yao Zhang, Fei Yin, Cheng-Lin Liu

Generative Modeling Using the Sliced Wasserstein Distance
Ishan Deshpande, Ziyu Zhang, Alexander G. Schwing

Learning Time/Memory-Efficient Deep Architectures With Budgeted Super Networks
Tom Véniat, Ludovic Denoyer

Cross-View Image Synthesis Using Conditional GANs
Krishna Regmi, Ali Borji

Sparse, Smart Contours to Represent and Edit Images
Tali Dekel, Chuang Gan, Dilip Krishnan, Ce Liu, William T. Freeman

Anticipating Traffic Accidents With Adaptive Loss and Large-Scale Incident DB
Tomoyuki Suzuki, Hirokatsu Kataoka, Yoshimitsu Aoki, Yutaka Satoh

A Minimalist Approach to Type-Agnostic Detection of Quadrics in Point Clouds
Tolga Birdal, Benjamin Busam, Nassir Navab, Slobodan Ilic, Peter Sturm

Facelet-Bank for Fast Portrait Manipulation
Ying-Cong Chen, Huaijia Lin, Michelle Shu, Ruiyu Li, Xin Tao, Xiaoyong Shen, Yangang Ye, Jiaya Jia

Visual to Sound: Generating Natural Sound for Videos in the Wild
Yipin Zhou, Zhaowen Wang, Chen Fang, Trung Bui, Tamara L. Berg

3D-RCNN: Instance-Level 3D Object Reconstruction via Render-and-Compare
Abhijit Kundu, Yin Li, James M. Rehg

Fast and Furious: Real Time End-to-End 3D Detection, Tracking and Motion Forecasting With a Single Convolutional Net
Wenjie Luo, Bin Yang, Raquel Urtasun

An Analysis of Scale Invariance in Object Detection - SNIP
Bharat Singh, Larry S. Davis

Relation Networks for Object Detection
Han Hu, Jiayuan Gu, Zheng Zhang, Jifeng Dai, Yichen Wei

Zero-Shot Sketch-Image Hashing
Yuming Shen, Li Liu, Fumin Shen, Ling Shao

VizWiz Grand Challenge: Answering Visual Questions From Blind People
Danna Gurari, Qing Li, Abigale J. Stangl, Anhong Guo, Chi Lin, Kristen Grauman, Jiebo Luo, Jeffrey P. Bigham

Divide and Grow: Capturing Huge Diversity in Crowd Images With Incrementally Growing CNN
Deepak Babu Sam, Neeraj N. Sajjan, R. Venkatesh Babu, Mukundhan Srinivasan

Structured Set Matching Networks for One-Shot Part Labeling
Jonghyun Choi, Jayant Krishnamurthy, Aniruddha Kembhavi, Ali Farhadi

Self-Supervised Learning of Geometrically Stable Features Through Probabilistic Introspection
David Novotny, Samuel Albanie, Diane Larlus, Andrea Vedaldi

Link and Code: Fast Indexing With Graphs and Compact Regression Codes
Matthijs Douze, Alexandre Sablayrolles, Hervé Jégou

Textbook Question Answering Under Instructor Guidance With Memory Networks
Juzheng Li, Hang Su, Jun Zhu, Siyu Wang, Bo Zhang

Unsupervised Deep Generative Adversarial Hashing Network
Kamran Ghasedi Dizaji, Feng Zheng, Najmeh Sadoughi, Yanhua Yang, Cheng Deng, Heng Huang

Vision-and-Language Navigation: Interpreting Visually-Grounded Navigation Instructions in Real Environments
Peter Anderson, Qi Wu, Damien Teney, Jake Bruce, Mark Johnson, Niko Sünderhauf, Ian Reid, Stephen Gould, Anton van den Hengel

DenseASPP for Semantic Segmentation in Street Scenes
Maoke Yang, Kun Yu, Chi Zhang, Zhiwei Li, Kuiyuan Yang

Efficient Optimization for Rank-Based Loss Functions
Pritish Mohapatra, Michal Rolínek, C.V. Jawahar, Vladimir Kolmogorov, M. Pawan Kumar

Wasserstein Introspective Neural Networks
Kwonjoon Lee, Weijian Xu, Fan Fan, Zhuowen Tu

Taskonomy: Disentangling Task Transfer Learning
Amir R. Zamir, Alexander Sax, William Shen, Leonidas J. Guibas, Jitendra Malik, Silvio Savarese

Maximum Classifier Discrepancy for Unsupervised Domain Adaptation
Kuniaki Saito, Kohei Watanabe, Yoshitaka Ushiku, Tatsuya Harada

Unsupervised Feature Learning via Non-Parametric Instance Discrimination
Zhirong Wu, Yuanjun Xiong, Stella X. Yu, Dahua Lin

Multi-Task Adversarial Network for Disentangled Feature Learning
Yang Liu, Zhaowen Wang, Hailin Jin, Ian Wassell

Learning From Synthetic Data: Addressing Domain Shift for Semantic Segmentation
Swami Sankaranarayanan, Yogesh Balaji, Arpit Jain, Ser Nam Lim, Rama Chellappa

Empirical Study of the Topology and Geometry of Deep Networks
Alhussein Fawzi, Seyed-Mohsen Moosavi-Dezfooli, Pascal Frossard, Stefano Soatto

Boosting Domain Adaptation by Discovering Latent Domains
Massimiliano Mancini, Lorenzo Porzi, Samuel Rota Bulò, Barbara Caputo, Elisa Ricci

Shape From Shading Through Shape Evolution
Dawei Yang, Jia Deng

Weakly Supervised Instance Segmentation Using Class Peak Response
Yanzhao Zhou, Yi Zhu, Qixiang Ye, Qiang Qiu, Jianbin Jiao

Collaborative and Adversarial Network for Unsupervised Domain Adaptation
Weichen Zhang, Wanli Ouyang, Wen Li, Dong Xu

Environment Upgrade Reinforcement Learning for Non-Differentiable Multi-Stage Pipelines
Shuqin Xie, Zitian Chen, Chao Xu, Cewu Lu

Teaching Categories to Human Learners With Visual Explanations
Oisin Mac Aodha, Shihan Su, Yuxin Chen, Pietro Perona, Yisong Yue

Density Adaptive Point Set Registration
Felix Järemo Lawin, Martin Danelljan, Fahad Shahbaz Khan, Per-Erik Forssén, Michael Felsberg

Left-Right Comparative Recurrent Model for Stereo Matching
Zequn Jie, Pengfei Wang, Yonggen Ling, Bo Zhao, Yunchao Wei, Jiashi Feng, Wei Liu

Im2Pano3D: Extrapolating 360° Structure and Semantics Beyond the Field of View
Shuran Song, Andy Zeng, Angel X. Chang, Manolis Savva, Silvio Savarese, Thomas Funkhouser

Polarimetric Dense Monocular SLAM
Luwei Yang, Feitong Tan, Ao Li, Zhaopeng Cui, Yasutaka Furukawa, Ping Tan

A Unifying Contrast Maximization Framework for Event Cameras, With Applications to Motion, Depth, and Optical Flow Estimation
Guillermo Gallego, Henri Rebecq, Davide Scaramuzza

Modeling Facial Geometry Using Compositional VAEs
Timur Bagautdinov, Chenglei Wu, Jason Saragih, Pascal Fua, Yaser Sheikh

Tangent Convolutions for Dense Prediction in 3D
Maxim Tatarchenko, Jaesik Park, Vladlen Koltun, Qian-Yi Zhou

RayNet: Learning Volumetric 3D Reconstruction With Ray Potentials
Despoina Paschalidou, Osman Ulusoy, Carolin Schmitt, Luc Van Gool, Andreas Geiger

Neural 3D Mesh Renderer
Hiroharu Kato, Yoshitaka Ushiku, Tatsuya Harada

Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation
Dan Xu, Wei Wang, Hao Tang, Hong Liu, Nicu Sebe, Elisa Ricci

Automatic 3D Indoor Scene Modeling From Single Panorama
Yang Yang, Shi Jin, Ruiyang Liu, Sing Bing Kang, Jingyi Yu

Extreme 3D Face Reconstruction: Seeing Through Occlusions
Anh Tuấn Trần, Tal Hassner, Iacopo Masi, Eran Paz, Yuval Nirkin, Gérard Medioni

Beyond Gröbner Bases: Basis Selection for Minimal Solvers
Viktor Larsson, Magnus Oskarsson, Kalle Astrom, Alge Wallis, Zuzana Kukelova, Tomas Pajdla

Lions and Tigers and Bears: Capturing Non-Rigid, 3D, Articulated Shape From Images
Silvia Zuffi, Angjoo Kanazawa, Michael J. Black

Deep Cocktail Network: Multi-Source Unsupervised Domain Adaptation With Category Shift
Ruijia Xu, Ziliang Chen, Wangmeng Zuo, Junjie Yan, Liang Lin

DOTA: A Large-Scale Dataset for Object Detection in Aerial Images
Gui-Song Xia, Xiang Bai, Jian Ding, Zhen Zhu, Serge Belongie, Jiebo Luo, Mihai Datcu, Marcello Pelillo, Liangpei Zhang

Finding Beans in Burgers: Deep Semantic-Visual Embedding With Localization
Martin Engilberge, Louis Chevallier, Patrick Pérez, Matthieu Cord

Feature Super-Resolution: Make Machine See More Clearly
Weimin Tan, Bo Yan, Bahetiyaer Bare

ClusterNet: Detecting Small Objects in Large Scenes by Exploiting Spatio-Temporal Information
Rodney LaLonde, Dong Zhang, Mubarak Shah

MaskLab: Instance Segmentation by Refining Object Detection With Semantic and Direction Features
Liang-Chieh Chen, Alexander Hermans, George Papandreou, Florian Schroff, Peng Wang, Hartwig Adam

Hashing as Tie-Aware Learning to Rank
Kun He, Fatih Cakir, Sarah Adel Bargal, Stan Sclaroff

Classification-Driven Dynamic Image Enhancement
Vivek Sharma, Ali Diba, Davy Neven, Michael S. Brown, Luc Van Gool, Rainer Stiefelhagen

Knowledge Aided Consistency for Weakly Supervised Phrase Grounding
Kan Chen, Jiyang Gao, Ram Nevatia

Who Let the Dogs Out? Modeling Dog Behavior From Visual Data
Kiana Ehsani, Hessam Bagherinezhad, Joseph Redmon, Roozbeh Mottaghi, Ali Farhadi

Pseudo Mask Augmented Object Detection
Xiangyun Zhao, Shuang Liang, Yichen Wei

Dual Skipping Networks
Changmao Cheng, Yanwei Fu, Yu-Gang Jiang, Wei Liu, Wenlian Lu, Jianfeng Feng, Xiangyang Xue

Memory Matching Networks for One-Shot Image Recognition
Qi Cai, Yingwei Pan, Ting Yao, Chenggang Yan, Tao Mei

IQA: Visual Question Answering in Interactive Environments
Daniel Gordon, Aniruddha Kembhavi, Mohammad Rastegari, Joseph Redmon, Dieter Fox, Ali Farhadi

Pose Transferrable Person Re-Identification
Jinxian Liu, Bingbing Ni, Yichao Yan, Peng Zhou, Shuo Cheng, Jianguo Hu

Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning
Yin Cui, Yang Song, Chen Sun, Andrew Howard, Serge Belongie

Data Distillation: Towards Omni-Supervised Learning
Ilija Radosavovic, Piotr Dollár, Ross Girshick, Georgia Gkioxari, Kaiming He

Object Referring in Videos With Language and Human Gaze
Arun Balajee Vasudevan, Dengxin Dai, Luc Van Gool

Feature Selective Networks for Object Detection
Yao Zhai, Jingjing Fu, Yan Lu, Houqiang Li

Learning a Discriminative Filter Bank Within a CNN for Fine-Grained Recognition
Yaming Wang, Vlad I. Morariu, Larry S. Davis

Grounding Referring Expressions in Images by Variational Context
Hanwang Zhang, Yulei Niu, Shih-Fu Chang

Dynamic Graph Generation Network: Generating Relational Knowledge From Diagrams
Daesik Kim, YoungJoon Yoo, Jee-Soo Kim, SangKuk Lee, Nojun Kwak

A Network Architecture for Point Cloud Classification via Automatic Depth Images Generation
Riccardo Roveri, Lukas Rahmann, Cengiz Oztireli, Markus Gross

Towards Dense Object Tracking in a 2D Honeybee Hive
Katarzyna Bozek, Laetitia Hebert, Alexander S. Mikheyev, Greg J. Stephens

Long-Term On-Board Prediction of People in Traffic Scenes Under Uncertainty
Apratim Bhattacharyya, Mario Fritz, Bernt Schiele

Single-Shot Refinement Neural Network for Object Detection
Shifeng Zhang, Longyin Wen, Xiao Bian, Zhen Lei, Stan Z. Li

Video Captioning via Hierarchical Reinforcement Learning
Xin Wang, Wenhu Chen, Jiawei Wu, Yuan-Fang Wang, William Yang Wang

Tips and Tricks for Visual Question Answering: Learnings From the 2017 Challenge
Damien Teney, Peter Anderson, Xiaodong He, Anton van den Hengel

Learning to Segment Every Thing
Ronghang Hu, Piotr Dollár, Kaiming He, Trevor Darrell, Ross Girshick

Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval
Chao Li, Cheng Deng, Ning Li, Wei Liu, Xinbo Gao, Dacheng Tao

Parallel Attention: A Unified Framework for Visual Object Discovery Through Dialogs and Queries
Bohan Zhuang, Qi Wu, Chunhua Shen, Ian Reid, Anton van den Hengel

Zigzag Learning for Weakly Supervised Object Detection
Xiaopeng Zhang, Jiashi Feng, Hongkai Xiong, Qi Tian

Attentive Fashion Grammar Network for Fashion Landmark Detection and Clothing Category Classification
Wenguan Wang, Yuanlu Xu, Jianbing Shen, Song-Chun Zhu

Generalized Zero-Shot Learning via Synthesized Examples
Vinay Kumar Verma, Gundeep Arora, Ashish Mishra, Piyush Rai

Partially Shared Multi-Task Convolutional Neural Network With Local Constraint for Face Attribute Learning
Jiajiong Cao, Yingming Li, Zhongfei Zhang

SYQ: Learning Symmetric Quantization for Efficient Deep Neural Networks
Julian Faraone, Nicholas Fraser, Michaela Blott, Philip H.W. Leong

DS*: Tighter Lifting-Free Convex Relaxations for Quadratic Matching Problems
Florian Bernard, Christian Theobalt, Michael Moeller

Deep Mutual Learning
Ying Zhang, Tao Xiang, Timothy M. Hospedales, Huchuan Lu

Coupled End-to-End Transfer Learning With Generalized Fisher Information
Shixing Chen, Caojin Zhang, Ming Dong

Residual Parameter Transfer for Deep Domain Adaptation
Artem Rozantsev, Mathieu Salzmann, Pascal Fua

High-Order Tensor Regularization With Application to Attribute Ranking
Kwang In Kim, Juhyun Park, James Tompkin

Learning to Localize Sound Source in Visual Scenes
Arda Senocak, Tae-Hyun Oh, Junsik Kim, Ming-Hsuan Yang, In So Kweon

Dynamic Few-Shot Visual Learning Without Forgetting
Spyros Gidaris, Nikos Komodakis

Two-Step Quantization for Low-Bit Neural Networks
Peisong Wang, Qinghao Hu, Yifan Zhang, Chunjie Zhang, Yang Liu, Jian Cheng

Improved Lossy Image Compression With Priming and Spatially Adaptive Bit Rates for Recurrent Networks
Nick Johnston, Damien Vincent, David Minnen, Michele Covell, Saurabh Singh, Troy Chinen, Sung Jin Hwang, Joel Shor, George Toderici

Conditional Probability Models for Deep Image Compression
Fabian Mentzer, Eirikur Agustsson, Michael Tschannen, Radu Timofte, Luc Van Gool

Deep Diffeomorphic Transformer Networks
Nicki Skafte Detlefsen, Oren Freifeld, Søren Hauberg

The Lovász-Softmax Loss: A Tractable Surrogate for the Optimization of the Intersection-Over-Union Measure in Neural Networks
Maxim Berman, Amal Rannen Triki, Matthew B. Blaschko

Generative Adversarial Perturbations
Omid Poursaeed, Isay Katsman, Bicheng Gao, Serge Belongie

Learning Strict Identity Mappings in Deep Residual Networks
Xin Yu, Zhiding Yu, Srikumar Ramalingam

Geometric Robustness of Deep Networks: Analysis and Improvement
Can Kanbak, Seyed-Mohsen Moosavi-Dezfooli, Pascal Frossard

View Extrapolation of Human Body From a Single Image
Hao Zhu, Hao Su, Peng Wang, Xun Cao, Ruigang Yang

Geometry Aware Constrained Optimization Techniques for Deep Learning
Soumava Kumar Roy, Zakaria Mhammedi, Mehrtash Harandi

PointNetVLAD: Deep Point Cloud Based Retrieval for Large-Scale Place Recognition
Mikaela Angelina Uy, Gim Hee Lee

An Efficient and Provable Approach for Mixture Proportion Estimation Using Linear Independence Assumption
Xiyu Yu, Tongliang Liu, Mingming Gong, Kayhan Batmanghelich, Dacheng Tao

VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection
Yin Zhou, Oncel Tuzel

Image to Image Translation for Domain Adaptation
Zak Murez, Soheil Kolouri, David Kriegman, Ravi Ramamoorthi, Kyungnam Kim

MobileNetV2: Inverted Residuals and Linear Bottlenecks
Mark Sandler, Andrew Howard, Menglong Zhu, Andrey Zhmoginov, Liang-Chieh Chen

Im2Struct: Recovering 3D Shape Structure From a Single RGB Image
Chengjie Niu, Jun Li, Kai Xu

Trust Your Model: Light Field Depth Estimation With Inline Occlusion Handling
Hendrik Schilling, Maximilian Diebold, Carsten Rother, Bernd Jähne

Baseline Desensitizing in Translation Averaging
Bingbing Zhuang, Loong-Fah Cheong, Gim Hee Lee

Mining Point Cloud Local Structures by Kernel Correlation and Graph Pooling
Yiru Shen, Chen Feng, Yaoqing Yang, Dong Tian

Large-Scale Point Cloud Semantic Segmentation With Superpoint Graphs
Loic Landrieu, Martin Simonovsky

Very Large-Scale Global SfM by Distributed Motion Averaging
Siyu Zhu, Runze Zhang, Lei Zhou, Tianwei Shen, Tian Fang, Ping Tan, Long Quan

ScanComplete: Large-Scale Scene Completion and Semantic Segmentation for 3D Scans
Angela Dai, Daniel Ritchie, Martin Bokeloh, Scott Reed, Jürgen Sturm, Matthias Nießner

Solving the Perspective-2-Point Problem for Flying-Camera Photo Composition
Ziquan Lan, David Hsu, Gim Hee Lee

Reflection Removal for Large-Scale 3D Point Clouds
Jae-Seong Yun, Jae-Young Sim

Attentional ShapeContextNet for Point Cloud Recognition
Saining Xie, Sainan Liu, Zeyu Chen, Zhuowen Tu

Geometry-Aware Deep Network for Single-Image Novel View Synthesis
Miaomiao Liu, Xuming He, Mathieu Salzmann

InverseFaceNet: Deep Monocular Inverse Face Rendering
Hyeongwoo Kim, Michael Zollhöfer, Ayush Tewari, Justus Thies, Christian Richardt, Christian Theobalt

Sparse Photometric 3D Face Reconstruction Guided by Morphable Models
Xuan Cao, Zhang Chen, Anpei Chen, Xin Chen, Shiying Li, Jingyi Yu

Texture Mapping for 3D Reconstruction With RGB-D Sensor
Yanping Fu, Qingan Yan, Long Yang, Jie Liao, Chunxia Xiao

Learning Less Is More - 6D Camera Localization via 3D Surface Regression
Eric Brachmann, Carsten Rother

Feature Mapping for Learning Fast and Accurate 3D Pose Inference From Synthetic Images
Mahdi Rad, Markus Oberweger, Vincent Lepetit

Indoor RGB-D Compass From a Single Line and Plane
Pyojin Kim, Brian Coltin, H. Jin Kim

Geometry-Aware Network for Non-Rigid Shape Prediction From a Single View
Albert Pumarola, Antonio Agudo, Lorenzo Porzi, Alberto Sanfeliu, Vincent Lepetit, Francesc Moreno-Noguer

Sim2Real Viewpoint Invariant Visual Servoing by Recurrent Control
Fereshteh Sadeghi, Alexander Toshev, Eric Jang, Sergey Levine

DocUNet: Document Image Unwarping via a Stacked U-Net
Ke Ma, Zhixin Shu, Xue Bai, Jue Wang, Dimitris Samaras

Analysis of Hand Segmentation in the Wild
Aisha Urooj, Ali Borji

RoadTracer: Automatic Extraction of Road Networks From Aerial Images
Favyen Bastani, Songtao He, Sofiane Abbar, Mohammad Alizadeh, Hari Balakrishnan, Sanjay Chawla, Sam Madden, David DeWitt

Alternating-Stereo VINS: Observability Analysis and Performance Evaluation
Mrinal K. Paul, Stergios I. Roumeliotis

Soccer on Your Tabletop
Konstantinos Rematas, Ira Kemelmacher-Shlizerman, Brian Curless, Steve Seitz

EPINET: A Fully-Convolutional Neural Network Using Epipolar Geometry for Depth From Light Field Images
Changha Shin, Hae-Gon Jeon, Youngjin Yoon, In So Kweon, Seon Joo Kim

A Hybrid l1-l0 Layer Decomposition Model for Tone Mapping
Zhetong Liang, Jun Xu, David Zhang, Zisheng Cao, Lei Zhang

Deeply Learned Filter Response Functions for Hyperspectral Reconstruction
Shijie Nie, Lin Gu, Yinqiang Zheng, Antony Lam, Nobutaka Ono, Imari Sato

CRRN: Multi-Scale Guided Concurrent Reflection Removal Network
Renjie Wan, Boxin Shi, Ling-Yu Duan, Ah-Hwee Tan, Alex C. Kot

Single Image Reflection Separation With Perceptual Losses
Xuaner Zhang, Ren Ng, Qifeng Chen

A Robust Method for Strong Rolling Shutter Effects Correction Using Lines With Automatic Feature Selection
Yizhen Lao, Omar Ait-Aider

Time-Resolved Light Transport Decomposition for Thermal Photometric Stereo
Kenichiro Tanaka, Nobuhiro Ikeya, Tsuyoshi Takatani, Hiroyuki Kubo, Takuya Funatomi, Yasuhiro Mukaigawa

Efficient Diverse Ensemble for Discriminative Co-Tracking
Kourosh Meshgi, Shigeyuki Oba, Shin Ishii

Rolling Shutter and Radial Distortion Are Features for High Frame Rate Multi-Camera Tracking
Akash Bapat, True Price, Jan-Michael Frahm

A Twofold Siamese Network for Real-Time Object Tracking
Anfeng He, Chong Luo, Xinmei Tian, Wenjun Zeng

Multi-Cue Correlation Filters for Robust Visual Tracking
Ning Wang, Wengang Zhou, Qi Tian, Richang Hong, Meng Wang, Houqiang Li

Learning Attentions: Residual Attentional Siamese Network for High Performance Online Visual Tracking
Qiang Wang, Zhu Teng, Junliang Xing, Jin Gao, Weiming Hu, Stephen Maybank

SINT++: Robust Visual Tracking via Adversarial Positive Instance Generation
Xiao Wang, Chenglong Li, Bin Luo, Jin Tang

High-Speed Tracking With Multi-Kernel Correlation Filters
Ming Tang, Bin Yu, Fan Zhang, Jinqiao Wang

Occlusion Aware Unsupervised Learning of Optical Flow
Yang Wang, Yi Yang, Zhenheng Yang, Liang Zhao, Peng Wang, Wei Xu

Revisiting Video Saliency: A Large-Scale Benchmark and a New Model
Wenguan Wang, Jianbing Shen, Fang Guo, Ming-Ming Cheng, Ali Borji

Learning Spatial-Temporal Regularized Correlation Filters for Visual Tracking
Feng Li, Cheng Tian, Wangmeng Zuo, Lei Zhang, Ming-Hsuan Yang

Multimodal Visual Concept Learning With Weakly Supervised Techniques
Giorgos Bouritsas, Petros Koutras, Athanasia Zlatintsi, Petros Maragos

Efficient Large-Scale Approximate Nearest Neighbor Search on OpenCL FPGA
Jialiang Zhang, Soroosh Khoram, Jing Li

Learning a Complete Image Indexing Pipeline
Himalaya Jain, Joaquin Zepeda, Patrick Pérez, Rémi Gribonval

Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning
David Mascharka, Philip Tran, Ryan Soklaski, Arjun Majumdar

Fooling Vision and Language Models Despite Localization and Attention Mechanism
Xiaojun Xu, Xinyun Chen, Chang Liu, Anna Rohrbach, Trevor Darrell, Dawn Song

Categorizing Concepts With Basic Level for Vision-to-Language
Hanzhang Wang, Hanli Wang, Kaisheng Xu

Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering
Aishwarya Agrawal, Dhruv Batra, Devi Parikh, Aniruddha Kembhavi

Learning Pixel-Level Semantic Affinity With Image-Level Supervision for Weakly Supervised Semantic Segmentation
Jiwoon Ahn, Suha Kwak

From Lifestyle Vlogs to Everyday Interactions
David F. Fouhey, Wei-cheng Kuo, Alexei A. Efros, Jitendra Malik

Cross-Domain Weakly-Supervised Object Detection Through Progressive Domain Adaptation
Naoto Inoue, Ryosuke Furuta, Toshihiko Yamasaki, Kiyoharu Aizawa

RotationNet: Joint Object Categorization and Pose Estimation Using Multiviews From Unsupervised Viewpoints
Asako Kanezaki, Yasuyuki Matsushita, Yoshifumi Nishida

An End-to-End TextSpotter With Explicit Alignment and Attention
Tong He, Zhi Tian, Weilin Huang, Chunhua Shen, Yu Qiao, Changming Sun

WILDTRACK: A Multi-Camera HD Dataset for Dense Unscripted Pedestrian Detection
Tatjana Chavdarova, Pierre Baqué, Stéphane Bouquet, Andrii Maksai, Cijo Jose, Timur Bagautdinov, Louis Lettry, Pascal Fua, Luc Van Gool, François Fleuret

Direct Shape Regression Networks for End-to-End Face Alignment
Xin Miao, Xiantong Zhen, Xianglong Liu, Cheng Deng, Vassilis Athitsos, Heng Huang

Natural and Effective Obfuscation by Head Inpainting
Qianru Sun, Liqian Ma, Seong Joon Oh, Luc Van Gool, Bernt Schiele, Mario Fritz

3D Semantic Trajectory Reconstruction From 3D Pixel Continuum
Jae Shin Yoon, Ziwei Li, Hyun Soo Park

Optimizing Filter Size in Convolutional Neural Networks for Facial Action Unit Recognition
Shizhong Han, Zibo Meng, Zhiyuan Li, James O'Reilly, Jie Cai, Xiaofeng Wang, Yan Tong

V2V-PoseNet: Voxel-to-Voxel Prediction Network for Accurate 3D Hand and Human Pose Estimation From a Single Depth Map
Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee

Ring Loss: Convex Feature Normalization for Face Recognition
Yutong Zheng, Dipan K. Pal, Marios Savvides

Adversarially Occluded Samples for Person Re-Identification
Houjing Huang, Dangwei Li, Zhang Zhang, Xiaotang Chen, Kaiqi Huang

Classifier Learning With Prior Probabilities for Facial Action Unit Recognition
Yong Zhang, Weiming Dong, Bao-Gang Hu, Qiang Ji

4DFAB: A Large Scale 4D Database for Facial Expression Analysis and Biometric Applications
Shiyang Cheng, Irene Kotsia, Maja Pantic, Stefanos Zafeiriou

Seeing Small Faces From Robust Anchor's Perspective
Chenchen Zhu, Ran Tao, Khoa Luu, Marios Savvides

2D/3D Pose Estimation and Action Recognition Using Multitask Deep Learning
Diogo C. Luvizon, David Picard, Hedi Tabia

Dense 3D Regression for Hand Pose Estimation
Chengde Wan, Thomas Probst, Luc Van Gool, Angela Yao

Camera Style Adaptation for Person Re-Identification
Zhun Zhong, Liang Zheng, Zhedong Zheng, Shaozi Li, Yi Yang

PoseTrack: A Benchmark for Human Pose Estimation and Tracking
Mykhaylo Andriluka, Umar Iqbal, Eldar Insafutdinov, Leonid Pishchulin, Anton Milan, Juergen Gall, Bernt Schiele

Exploit the Unknown Gradually: One-Shot Video-Based Person Re-Identification by Stepwise Learning
Yu Wu, Yutian Lin, Xuanyi Dong, Yan Yan, Wanli Ouyang, Yi Yang

Pose-Robust Face Recognition via Deep Residual Equivariant Mapping
Kaidi Cao, Yu Rong, Cheng Li, Xiaoou Tang, Chen Change Loy

DecideNet: Counting Varying Density Crowds Through Attention Guided Detection and Density Estimation
Jiang Liu, Chenqiang Gao, Deyu Meng, Alexander G. Hauptmann

LSTM Pose Machines
Yue Luo, Jimmy Ren, Zhouxia Wang, Wenxiu Sun, Jinshan Pan, Jianbo Liu, Jiahao Pang, Liang Lin

Disentangling Features in 3D Face Shapes for Joint Face Reconstruction and Recognition
Feng Liu, Ronghang Zhu, Dan Zeng, Qijun Zhao, Xiaoming Liu

Convolutional Sequence to Sequence Model for Human Dynamics
Chen Li, Zhen Zhang, Wee Sun Lee, Gim Hee Lee

Gesture Recognition: Focus on the Hands
Pradyumna Narayana, Ross Beveridge, Bruce A. Draper

Crowd Counting via Adversarial Cross-Scale Consistency Pursuit
Zan Shen, Yi Xu, Bingbing Ni, Minsi Wang, Jianguo Hu, Xiaokang Yang

3D Human Pose Estimation in the Wild by Adversarial Learning
Wei Yang, Wanli Ouyang, Xiaolong Wang, Jimmy Ren, Hongsheng Li, Xiaogang Wang

CosFace: Large Margin Cosine Loss for Deep Face Recognition
Hao Wang, Yitong Wang, Zheng Zhou, Xing Ji, Dihong Gong, Jingchao Zhou, Zhifeng Li, Wei Liu

Encoding Crowd Interaction With Deep Neural Network for Pedestrian Trajectory Prediction
Yanyu Xu, Zhixin Piao, Shenghua Gao

Mean-Variance Loss for Deep Age Estimation From a Face
Hongyu Pan, Hu Han, Shiguang Shan, Xilin Chen

Probabilistic Joint Face-Skull Modelling for Facial Reconstruction
Dennis Madsen, Marcel Lüthi, Andreas Schneider, Thomas Vetter

Learning Latent Super-Events to Detect Multiple Activities in Videos
AJ Piergiovanni, Michael S. Ryoo

Temporal Hallucinating for Action Recognition With Few Still Images
Yali Wang, Lei Zhou, Yu Qiao

Deep Progressive Reinforcement Learning for Skeleton-Based Action Recognition
Yansong Tang, Yi Tian, Jiwen Lu, Peiyang Li, Jie Zhou

Gaze Prediction in Dynamic 360° Immersive Videos
Yanyu Xu, Yanbing Dong, Junru Wu, Zhengzhong Sun, Zhiru Shi, Jingyi Yu, Shenghua Gao

When Will You Do What? - Anticipating Temporal Occurrences of Activities
Yazan Abu Farha, Alexander Richard, Juergen Gall

Fusing Crowd Density Maps and Visual Object Trackers for People Tracking in Crowd Scenes
Weihong Ren, Di Kang, Yandong Tang, Antoni B. Chan

Dual Attention Matching Network for Context-Aware Feature Sequence Based Person Re-Identification
Jianlou Si, Honggang Zhang, Chun-Guang Li, Jason Kuen, Xiangfei Kong, Alex C. Kot, Gang Wang

Easy Identification From Better Constraints: Multi-Shot Person Re-Identification From Reference Constraints
Jiahuan Zhou, Bing Su, Ying Wu

Crowd Counting With Deep Negative Correlation Learning
Zenglin Shi, Le Zhang, Yun Liu, Xiaofeng Cao, Yangdong Ye, Ming-Ming Cheng, Guoyan Zheng

Human Appearance Transfer
Mihai Zanfir, Alin-Ionut Popa, Andrei Zanfir, Cristian Sminchisescu

Domain Generalization With Adversarial Feature Learning
Haoliang Li, Sinno Jialin Pan, Shiqi Wang, Alex C. Kot

Pyramid Stereo Matching Network
Jia-Ren Chang, Yong-Sheng Chen

Event-Based Vision Meets Deep Learning on Steering Prediction for Self-Driving Cars
Ana I. Maqueda, Antonio Loquercio, Guillermo Gallego, Narciso García, Davide Scaramuzza

Learning Answer Embeddings for Visual Question Answering
Hexiang Hu, Wei-Lun Chao, Fei Sha

Good View Hunting: Learning Photo Composition From Dense View Pairs
Zijun Wei, Jianming Zhang, Xiaohui Shen, Zhe Lin, Radomír Měch, Minh Hoai, Dimitris Samaras

CleanNet: Transfer Learning for Scalable Image Classifier Training With Label Noise
Kuang-Huei Lee, Xiaodong He, Lei Zhang, Linjun Yang

Independently Recurrent Neural Network (IndRNN): Building a Longer and Deeper RNN
Shuai Li, Wanqing Li, Chris Cook, Ce Zhu, Yanbo Gao

Mix and Match Networks: Encoder-Decoder Alignment for Zero-Pair Image Translation
Yaxing Wang, Joost van de Weijer, Luis Herranz

Structured Uncertainty Prediction Networks
Garoe Dorta, Sara Vicente, Lourdes Agapito, Neill D. F. Campbell, Ivor Simpson

Between-Class Learning for Image Classification
Yuji Tokozume, Yoshitaka Ushiku, Tatsuya Harada

Adversarial Feature Augmentation for Unsupervised Domain Adaptation
Riccardo Volpi, Pietro Morerio, Silvio Savarese, Vittorio Murino

Generative Image Inpainting With Contextual Attention
Jiahui Yu, Zhe Lin, Jimei Yang, Xiaohui Shen, Xin Lu, Thomas S. Huang

CSGNet: Neural Shape Parser for Constructive Solid Geometry
Gopal Sharma, Rishabh Goyal, Difan Liu, Evangelos Kalogerakis, Subhransu Maji

Conditional Image-to-Image Translation
Jianxin Lin, Yingce Xia, Tao Qin, Zhibo Chen, Tie-Yan Liu

Continuous Relaxation of MAP Inference: A Nonconvex Perspective
D. Khuê Lê-Huu, Nikos Paragios

Feature Generating Networks for Zero-Shot Learning
Yongqin Xian, Tobias Lorenz, Bernt Schiele, Zeynep Akata

Joint Optimization Framework for Learning With Noisy Labels
Daiki Tanaka, Daiki Ikami, Toshihiko Yamasaki, Kiyoharu Aizawa

Convolutional Image Captioning
Jyoti Aneja, Aditya Deshpande, Alexander G. Schwing

AON: Towards Arbitrarily-Oriented Text Recognition
Zhanzhan Cheng, Yangliu Xu, Fan Bai, Yi Niu, Shiliang Pu, Shuigeng Zhou

Wrapped Gaussian Process Regression on Riemannian Manifolds
Anton Mallasto, Aasa Feragen

Geometry Guided Convolutional Neural Networks for Self-Supervised Video Representation Learning
Chuang Gan, Boqing Gong, Kun Liu, Hao Su, Leonidas J. Guibas

DiverseNet: When One Right Answer Is Not Enough
Michael Firman, Neill D. F. Campbell, Lourdes Agapito, Gabriel J. Brostow

Deep Face Detector Adaptation Without Negative Transfer or Catastrophic Forgetting
Muhammad Abdullah Jamal, Haoxiang Li, Boqing Gong

Analyzing Filters Toward Efficient ConvNet
Takumi Kobayashi

Regularizing Deep Networks by Modeling and Predicting Label Structure
Mohammadreza Mostajabi, Michael Maire, Gregory Shakhnarovich

In-Place Activated BatchNorm for Memory-Optimized Training of DNNs
Samuel Rota Bulò, Lorenzo Porzi, Peter Kontschieder

DVQA: Understanding Data Visualizations via Question Answering
Kushal Kafle, Brian Price, Scott Cohen, Christopher Kanan

DA-GAN: Instance-Level Image Translation by Deep Attention Generative Adversarial Networks
Shuang Ma, Jianlong Fu, Chang Wen Chen, Tao Mei

Unsupervised Learning of Depth and Ego-Motion From Monocular Video Using 3D Geometric Constraints
Reza Mahjourian, Martin Wicke, Anelia Angelova

FOTS: Fast Oriented Text Spotting With a Unified Network
Xuebo Liu, Ding Liang, Shi Yan, Dagui Chen, Yu Qiao, Junjie Yan

Mobile Video Object Detection With Temporally-Aware Feature Maps
Mason Liu, Menglong Zhu

Weakly Supervised Phrase Localization With Multi-Scale Anchored Transformer Network
Fang Zhao, Jianshu Li, Jian Zhao, Jiashi Feng

Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking
Filip Radenović, Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, Ondřej Chum

Cross-Dataset Adaptation for Visual Question Answering
Wei-Lun Chao, Hexiang Hu, Fei Sha

Globally Optimal Inlier Set Maximization for Atlanta Frame Estimation
Kyungdon Joo, Tae-Hyun Oh, In So Kweon, Jean-Charles Bazin

End-to-End Convolutional Semantic Embeddings
Quanzeng You, Zhengyou Zhang, Jiebo Luo

Referring Image Segmentation via Recurrent Refinement Networks
Ruiyu Li, Kaican Li, Yi-Chun Kuo, Michelle Shu, Xiaojuan Qi, Xiaoyong Shen, Jiaya Jia

Two Can Play This Game: Visual Dialog With Discriminative Question Generation and Answering
Unnat Jain, Svetlana Lazebnik, Alexander G. Schwing

Generative Adversarial Learning Towards Fast Weakly Supervised Detection
Yunhan Shen, Rongrong Ji, Shengchuan Zhang, Wangmeng Zuo, Yan Wang

A Deeper Look at Power Normalizations
Piotr Koniusz, Hongguang Zhang, Fatih Porikli

Dimensionality's Blessing: Clustering Images by Underlying Distribution
Wen-Yan Lin, Siying Liu, Jian-Huang Lai, Yasuyuki Matsushita

Eliminating Background-Bias for Robust Person Re-Identification
Maoqing Tian, Shuai Yi, Hongsheng Li, Shihua Li, Xuesen Zhang, Jianping Shi, Junjie Yan, Xiaogang Wang

Learning to Evaluate Image Captioning
Yin Cui, Guandao Yang, Andreas Veit, Xun Huang, Serge Belongie

Single-Shot Object Detection With Enriched Semantics
Zhishuai Zhang, Siyuan Qiao, Cihang Xie, Wei Shen, Bo Wang, Alan L. Yuille

Low-Shot Learning With Imprinted Weights
Hang Qi, Matthew Brown, David G. Lowe

Neural Motifs: Scene Graph Parsing With Global Context
Rowan Zellers, Mark Yatskar, Sam Thomson, Yejin Choi

Variational Autoencoders for Deforming 3D Mesh Models
Qingyang Tan, Lin Gao, Yu-Kun Lai, Shihong Xia

Fast Monte-Carlo Localization on Aerial Vehicles Using Approximate Continuous Belief Representations
Aditya Dhawale, Kumar Shaurya Shankar, Nathan Michael

DeLS-3D: Deep Localization and Segmentation With a 3D Semantic Map
Peng Wang, Ruigang Yang, Binbin Cao, Wei Xu, Yuanqing Lin

LiDAR-Video Driving Dataset: Learning Driving Policies Effectively
Yiping Chen, Jingkang Wang, Jonathan Li, Cewu Lu, Zhipeng Luo, Han Xue, Cheng Wang

Logo Synthesis and Manipulation With Clustered Generative Adversarial Networks
Alexander Sage, Eirikur Agustsson, Radu Timofte, Luc Van Gool

Egocentric Basketball Motion Planning From a Single First-Person Image
Gedas Bertasius, Aaron Chan, Jianbo Shi

Human-Centric Indoor Scene Synthesis Using Stochastic Grammar
Siyuan Qi, Yixin Zhu, Siyuan Huang, Chenfanfu Jiang, Song-Chun Zhu

Rotation-Sensitive Regression for Oriented Scene Text Detection
Minghui Liao, Zhen Zhu, Baoguang Shi, Gui-song Xia, Xiang Bai

Separating Self-Expression and Visual Content in Hashtag Supervision
Andreas Veit, Maximilian Nickel, Serge Belongie, Laurens van der Maaten

Distort-and-Recover: Color Enhancement Using Deep Reinforcement Learning
Jongchan Park, Joon-Young Lee, Donggeun Yoo, In So Kweon

Im2Flow: Motion Hallucination From Static Images for Action Recognition
Ruohan Gao, Bo Xiong, Kristen Grauman

Finding "It": Weakly-Supervised Reference-Aware Visual Grounding in Instructional Videos
De-An Huang, Shyamal Buch, Lucio Dery, Animesh Garg, Li Fei-Fei, Juan Carlos Niebles

Actor and Action Video Segmentation From a Sentence
Kirill Gavrilyuk, Amir Ghodrati, Zhenyang Li, Cees G. M. Snoek

Egocentric Activity Recognition on a Budget
Rafael Possas, Sheila Pinto Caceres, Fabio Ramos

CNN in MRF: Video Object Segmentation via Inference in a CNN-Based Higher-Order Spatio-Temporal MRF
Linchao Bao, Baoyuan Wu, Wei Liu

Action Sets: Weakly Supervised Action Segmentation Without Ordering Constraints
Alexander Richard, Hilde Kuehne, Juergen Gall

Low-Latency Video Semantic Segmentation
Yule Li, Jianping Shi, Dahua Lin

Fine-Grained Video Captioning for Sports Narrative
Huanyu Yu, Shuo Cheng, Bingbing Ni, Minsi Wang, Jian Zhang, Xiaokang Yang

End-to-End Learning of Motion Representation for Video Understanding
Lijie Fan, Wenbing Huang, Chuang Gan, Stefano Ermon, Boqing Gong, Junzhou Huang

Compressed Video Action Recognition
Chao-Yuan Wu, Manzil Zaheer, Hexiang Hu, R. Manmatha, Alexander J. Smola, Philipp Krähenbühl

Features for Multi-Target Multi-Camera Tracking and Re-Identification
Ergys Ristani, Carlo Tomasi

AVA: A Video Dataset of Spatio-Temporally Localized Atomic Visual Actions
Chunhui Gu, Chen Sun, David A. Ross, Carl Vondrick, Caroline Pantofaru, Yeqing Li, Sudheendra Vijayanarasimhan, George Toderici, Susanna Ricco, Rahul Sukthankar, Cordelia Schmid, Jitendra Malik

Who's Better? Who's Best? Pairwise Deep Ranking for Skill Determination
Hazel Doughty, Dima Damen, Walterio Mayol-Cuevas

MX-LSTM: Mixing Tracklets and Vislets to Jointly Forecast Trajectories and Head Poses
Irtiza Hasan, Francesco Setti, Theodore Tsesmelis, Alessio Del Bue, Fabio Galasso, Marco Cristani

Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
Peter Anderson, Xiaodong He, Chris Buehler, Damien Teney, Mark Johnson, Stephen Gould, Lei Zhang

Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering
Duy-Kien Nguyen, Takayuki Okatani

FlipDial: A Generative Model for Two-Way Visual Dialogue
Daniela Massiceti, N. Siddharth, Puneet K. Dokania, Philip H. S. Torr

Are You Talking to Me? Reasoned Visual Dialog Generation Through Adversarial Learning
Qi Wu, Peng Wang, Chunhua Shen, Ian Reid, Anton van den Hengel

Visual Question Generation as Dual Task of Visual Question Answering
Yikang Li, Nan Duan, Bolei Zhou, Xiao Chu, Wanli Ouyang, Xiaogang Wang, Ming Zhou

Unsupervised Textual Grounding: Linking Words to Image Concepts
Raymond A. Yeh, Minh N. Do, Alexander G. Schwing

Focal Visual-Text Attention for Visual Question Answering
Junwei Liang, Lu Jiang, Liangliang Cao, Li-Jia Li, Alexander G. Hauptmann

SeGAN: Segmenting and Generating the Invisible
Kiana Ehsani, Roozbeh Mottaghi, Ali Farhadi

Cascade R-CNN: Delving Into High Quality Object Detection
Zhaowei Cai, Nuno Vasconcelos

Learning Semantic Concepts and Order for Image and Sentence Matching
Yan Huang, Qi Wu, Chunfeng Song, Liang Wang

Functional Map of the World
Gordon Christie, Neil Fendley, James Wilson, Ryan Mukherjee

MegDet: A Large Mini-Batch Object Detector
Chao Peng, Tete Xiao, Zeming Li, Yuning Jiang, Xiangyu Zhang, Kai Jia, Gang Yu, Jian Sun

Learning Globally Optimized Object Detector via Policy Gradient
Yongming Rao, Dahua Lin, Jiwen Lu, Jie Zhou

Photographic Text-to-Image Synthesis With a Hierarchically-Nested Adversarial Network
Zizhao Zhang, Yuanpu Xie, Lin Yang

Illuminant Spectra-Based Source Separation Using Flash Photography
Zhuo Hui, Kalyan Sunkavalli, Sunil Hadap, Aswin C. Sankaranarayanan

Trapping Light for Time of Flight
Ruilin Xu, Mohit Gupta, Shree K. Nayar

The Perception-Distortion Tradeoff
Yochai Blau, Tomer Michaeli

Label Denoising Adversarial Network (LDAN) for Inverse Lighting of Faces
Hao Zhou, Jin Sun, Yaser Yacoob, David W. Jacobs

Optimal Structured Light à La Carte
Parsa Mirdehghan, Wenzheng Chen, Kiriakos N. Kutulakos

Tracking Multiple Objects Outside the Line of Sight Using Speckle Imaging
Brandon M. Smith, Matthew O'Toole, Mohit Gupta

Inferring Light Fields From Shadows
Manel Baradad, Vickie Ye, Adam B. Yedidia, Frédo Durand, William T. Freeman, Gregory W. Wornell, Antonio Torralba

Modifying Non-Local Variations Across Multiple Views
Tal Tlusty, Tomer Michaeli, Tali Dekel, Lihi Zelnik-Manor

Robust Video Content Alignment and Compensation for Rain Removal in a CNN Framework
Jie Chen, Cheen-Hau Tan, Junhui Hou, Lap-Pui Chau, He Li

SfSNet: Learning Shape, Reflectance and Illuminance of Faces 'in the Wild'
Soumyadip Sengupta, Angjoo Kanazawa, Carlos D. Castillo, David W. Jacobs

Deep Photo Enhancer: Unpaired Learning for Image Enhancement From Photographs With GANs
Yu-Sheng Chen, Yu-Ching Wang, Man-Hsin Kao, Yung-Yu Chuang

LIME: Live Intrinsic Material Estimation
Abhimitra Meka, Maxim Maximov, Michael Zollhöfer, Avishek Chatterjee, Hans-Peter Seidel, Christian Richardt, Christian Theobalt

Learning to Detect Features in Texture Images
Linguang Zhang, Szymon Rusinkiewicz

Learning to Extract a Video Sequence From a Single Motion-Blurred Image
Meiguang Jin, Givi Meishvili, Paolo Favaro

Lose the Views: Limited Angle CT Reconstruction via Implicit Sinogram Completion
Rushil Anirudh, Hyojin Kim, Jayaraman J. Thiagarajan, K. Aditya Mohan, Kyle Champley, Timo Bremer

A Common Framework for Interactive Texture Transfer
Yifang Men, Zhouhui Lian, Yingmin Tang, Jianguo Xiao

AMNet: Memorability Estimation With Attention
Jiri Fajtl, Vasileios Argyriou, Dorothy Monekosso, Paolo Remagnino

Blind Predicting Similar Quality Map for Image Quality Assessment
Da Pan, Ping Shi, Ming Hou, Zefeng Ying, Sizhe Fu, Yuan Zhang

Deep End-to-End Time-of-Flight Imaging
Shuochen Su, Felix Heide, Gordon Wetzstein, Wolfgang Heidrich

Aperture Supervision for Monocular Depth Estimation
Pratul P. Srinivasan, Rahul Garg, Neal Wadhwa, Ren Ng, Jonathan T. Barron

Seeing Temporal Modulation of Lights From Standard Cameras
Naoki Sakakibara, Fumihiko Sakaue, Jun Sato

Statistical Tomography of Microscopic Life
Aviad Levis, Yoav Y. Schechner, Ronen Talmon

Divide and Conquer for Full-Resolution Light Field Deblurring
M. R. Mahesh Mohan, A. N. Rajagopalan

Multispectral Image Intrinsic Decomposition via Subspace Constraint
Qian Huang, Weixin Zhu, Yang Zhao, Linsen Chen, Yao Wang, Tao Yue, Xun Cao

Improving Color Reproduction Accuracy on Cameras
Hakki Can Karaimer, Michael S. Brown

A Closer Look at Spatiotemporal Convolutions for Action Recognition
Du Tran, Heng Wang, Lorenzo Torresani, Jamie Ray, Yann LeCun, Manohar Paluri

Inferring Shared Attention in Social Scene Videos
Lifeng Fan, Yixin Chen, Ping Wei, Wenguan Wang, Song-Chun Zhu

Making Convolutional Networks Recurrent for Visual Sequence Learning
Xiaodong Yang, Pavlo Molchanov, Jan Kautz

Real-World Anomaly Detection in Surveillance Videos
Waqas Sultani, Chen Chen, Mubarak Shah

Viewpoint-Aware Attentive Multi-View Inference for Vehicle Re-Identification
Yi Zhou, Ling Shao

Efficient Video Object Segmentation via Network Modulation
Linjie Yang, Yanran Wang, Xuehan Xiong, Jianchao Yang, Aggelos K. Katsaggelos

Weakly-Supervised Action Segmentation With Iterative Soft Boundary Assignment
Li Ding, Chenliang Xu

Depth-Aware Stereo Video Retargeting
Bing Li, Chia-Wen Lin, Boxin Shi, Tiejun Huang, Wen Gao, C.-C. Jay Kuo

Instance Embedding Transfer to Unsupervised Video Object Segmentation
Siyang Li, Bryan Seybold, Alexey Vorobyov, Alireza Fathi, Qin Huang, C.-C. Jay Kuo

Future Frame Prediction for Anomaly Detection – A New Baseline
Wen Liu, Weixin Luo, Dongze Lian, Shenghua Gao

Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?
Kensho Hara, Hirokatsu Kataoka, Yutaka Satoh

Dynamic Video Segmentation Network
Yu-Syuan Xu, Tsu-Jui Fu, Hsuan-Kung Yang, Chun-Yi Lee

Recognize Actions by Disentangling Components of Dynamics
Yue Zhao, Yuanjun Xiong, Dahua Lin

Motion-Appearance Co-Memory Networks for Video Question Answering
Jiyang Gao, Runzhou Ge, Kan Chen, Ram Nevatia

Learning to Understand Image Blur
Shanghang Zhang, Xiaohui Shen, Zhe Lin, Radomír Měch, João P. Costeira, José M. F. Moura

Dense Decoder Shortcut Connections for Single-Pass Semantic Segmentation
Piotr Bilinski, Victor Prisacariu

Generative Adversarial Image Synthesis With Decision Tree Latent Controller
Takuhiro Kaneko, Kaoru Hiramatsu, Kunio Kashino

Learning a Discriminative Prior for Blind Image Deblurring
Lerenhan Li, Jinshan Pan, Wei-Sheng Lai, Changxin Gao, Nong Sang, Ming-Hsuan Yang

Frame-Recurrent Video Super-Resolution
Mehdi S. M. Sajjadi, Raviteja Vemulapalli, Matthew Brown

Discovering Point Lights With Intensity Distance Fields
Edward Zhang, Michael F. Cohen, Brian Curless

Video Rain Streak Removal by Multiscale Convolutional Sparse Coding
Minghan Li, Qi Xie, Qian Zhao, Wei Wei, Shuhang Gu, Jing Tao, Deyu Meng

Stereoscopic Neural Style Transfer
Dongdong Chen, Lu Yuan, Jing Liao, Nenghai Yu, Gang Hua

Multi-Frame Quality Enhancement for Compressed Video
Ren Yang, Mai Xu, Zulin Wang, Tianyi Li

CNN Based Learning Using Reflection and Retinex Models for Intrinsic Image Decomposition
Anil S. Baslamisli, Hoang-An Le, Theo Gevers

Image Restoration by Estimating Frequency Distribution of Local Patches
Jaeyoung Yoo, Sang-ho Lee, Nojun Kwak

Latent RANSAC
Simon Korman, Roee Litman

Two-Stream Convolutional Networks for Dynamic Texture Synthesis
Matthew Tesfaldet, Marcus A. Brubaker, Konstantinos G. Derpanis

Towards Open-Set Identity Preserving Face Synthesis
Jianmin Bao, Dong Chen, Fang Wen, Houqiang Li, Gang Hua

A Revised Underwater Image Formation Model
Derya Akkaynak, Tali Treibitz

Graph-Cut RANSAC
Daniel Barath, Jiří Matas

Temporal Deformable Residual Networks for Action Segmentation in Videos
Peng Lei, Sinisa Todorovic

Weakly Supervised Action Localization by Sparse Temporal Pooling Network
Phuc Nguyen, Ting Liu, Gautam Prasad, Bohyung Han

PoseFlow: A Deep Motion Representation for Understanding Human Behaviors in Videos
Dingwen Zhang, Guangyu Guo, Dong Huang, Junwei Han

FFNet: Video Fast-Forwarding via Reinforcement Learning
Shuyue Lan, Rameswar Panda, Qi Zhu, Amit K. Roy-Chowdhury

Multi-Shot Pedestrian Re-Identification via Sequential Decision Making
Jianfu Zhang, Naiyan Wang, Liqing Zhang

Attend and Interact: Higher-Order Object Interactions for Video Understanding
Chih-Yao Ma, Asim Kadav, Iain Melvin, Zsolt Kira, Ghassan AlRegib, Hans Peter Graf

Where and Why Are They Looking? Jointly Inferring Human Attention and Intentions in Complex Tasks
Ping Wei, Yang Liu, Tianmin Shu, Nanning Zheng, Song-Chun Zhu

Fully Convolutional Adaptation Networks for Semantic Segmentation
Yiheng Zhang, Zhaofan Qiu, Ting Yao, Dong Liu, Tao Mei

Semantic Video Segmentation by Gated Recurrent Flow Propagation
David Nilsson, Cristian Sminchisescu

Interpretable Video Captioning via Trajectory Structured Localization
Xian Wu, Guanbin Li, Qingxing Cao, Qingge Ji, Liang Lin

Deep Hashing via Discrepancy Minimization
Zhixiang Chen, Xin Yuan, Jiwen Lu, Qi Tian, Jie Zhou

ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
Xiangyu Zhang, Xinyu Zhou, Mengxiao Lin, Jian Sun

Zero-Shot Recognition via Semantic Embeddings and Knowledge Graphs
Xiaolong Wang, Yufei Ye, Abhinav Gupta

Referring Relationships
Ranjay Krishna, Ines Chami, Michael Bernstein, Li Fei-Fei

Improving Object Localization With Fitness NMS and Bounded IoU Loss
Lachlan Tychsen-Smith, Lars Petersson

End-to-End Deep Kronecker-Product Matching for Person Re-Identification
Yantao Shen, Tong Xiao, Hongsheng Li, Shuai Yi, Xiaogang Wang

Semantic Visual Localization
Johannes L. Schönberger, Marc Pollefeys, Andreas Geiger, Torsten Sattler

Objects as Context for Detecting Their Semantic Parts
Abel Gonzalez-Garcia, Davide Modolo, Vittorio Ferrari

End-to-End Weakly-Supervised Semantic Alignment
Ignacio Rocco, Relja Arandjelović, Josef Sivic

Dynamic Zoom-In Network for Fast Object Detection in Large Images
Mingfei Gao, Ruichi Yu, Ang Li, Vlad I. Morariu, Larry S. Davis

Learning Markov Clustering Networks for Scene Text Detection
Zichuan Liu, Guosheng Lin, Sheng Yang, Jiashi Feng, Weisi Lin, Wang Ling Goh

Deep Reinforcement Learning of Region Proposal Networks for Object Detection
Aleksis Pirinen, Cristian Sminchisescu

Beyond Holistic Object Recognition: Enriching Image Understanding With Part States
Cewu Lu, Hao Su, Yonglu Li, Yongyi Lu, Li Yi, Chi-Keung Tang, Leonidas J. Guibas

Discriminability Objective for Training Descriptive Captions
Ruotian Luo, Brian Price, Scott Cohen, Gregory Shakhnarovich

Visual Question Answering With Memory-Augmented Networks
Chao Ma, Chunhua Shen, Anthony Dick, Qi Wu, Peng Wang, Anton van den Hengel, Ian Reid

Structure Inference Net: Object Detection Using Scene-Level Context and Instance-Level Relationships
Yong Liu, Ruiping Wang, Shiguang Shan, Xilin Chen

Occluded Pedestrian Detection Through Guided Attention in CNNs
Shanshan Zhang, Jian Yang, Bernt Schiele

Reward Learning From Narrated Demonstrations
Hsiao-Yu Tung, Adam W. Harley, Liang-Kang Huang, Katerina Fragkiadaki

Weakly-Supervised Semantic Segmentation Network With Deep Seeded Region Growing
Zilong Huang, Xinggang Wang, Jiasi Wang, Wenyu Liu, Jingdong Wang

PoTion: Pose MoTion Representation for Action Recognition
Vasileios Choutas, Philippe Weinzaepfel, Jérôme Revaud, Cordelia Schmid

Bilateral Ordinal Relevance Multi-Instance Regression for Facial Action Unit Intensity Estimation
Yong Zhang, Rui Zhao, Weiming Dong, Bao-Gang Hu, Qiang Ji

Pulling Actions out of Context: Explicit Separation for Effective Combination
Yang Wang, Minh Hoai

Dynamic Feature Learning for Partial Face Recognition
Lingxiao He, Haiqing Li, Qi Zhang, Zhenan Sun

Exploiting Transitivity for Learning Person Re-Identification Models on a Budget
Sourya Roy, Sujoy Paul, Neal E. Young, Amit K. Roy-Chowdhury

Deep Spatial Feature Reconstruction for Partial Person Re-Identification: Alignment-Free Approach
Lingxiao He, Jian Liang, Haiqing Li, Zhenan Sun

Every Smile Is Unique: Landmark-Guided Diverse Smile Generation
Wei Wang, Xavier Alameda-Pineda, Dan Xu, Pascal Fua, Elisa Ricci, Nicu Sebe

UV-GAN: Adversarial Facial UV Map Completion for Pose-Invariant Face Recognition
Jiankang Deng, Shiyang Cheng, Niannan Xue, Yuxiang Zhou, Stefanos Zafeiriou

Cascaded Pyramid Network for Multi-Person Pose Estimation
Yilun Chen, Zhicheng Wang, Yuxiang Peng, Zhiqiang Zhang, Gang Yu, Jian Sun

A Face-to-Face Neural Conversation Model
Hang Chu, Daiqing Li, Sanja Fidler

End-to-End Recovery of Human Shape and Pose
Angjoo Kanazawa, Michael J. Black, David W. Jacobs, Jitendra Malik

Squeeze-and-Excitation Networks
Jie Hu, Li Shen, Gang Sun

Revisiting Salient Object Detection: Simultaneous Detection, Ranking, and Subitizing of Multiple Salient Objects
Md Amirul Islam, Mahmoud Kalash, Neil D. B. Bruce

Context Encoding for Semantic Segmentation
Hang Zhang, Kristin Dana, Jianping Shi, Zhongyue Zhang, Xiaogang Wang, Ambrish Tyagi, Amit Agrawal

Creating Capsule Wardrobes From Fashion Images
Wei-Lin Hsiao, Kristen Grauman

Webly Supervised Learning Meets Zero-Shot Learning: A Hybrid Approach for Fine-Grained Classification
Li Niu, Ashok Veeraraghavan, Ashutosh Sabharwal

Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval With Generative Models
Jiuxiang Gu, Jianfei Cai, Shafiq R. Joty, Li Niu, Gang Wang

Bidirectional Attentive Fusion With Context Gating for Dense Video Captioning
Jingwen Wang, Wenhao Jiang, Lin Ma, Wei Liu, Yong Xu

InLoc: Indoor Visual Localization With Dense Matching and View Synthesis
Hajime Taira, Masatoshi Okutomi, Torsten Sattler, Mircea Cimpoi, Marc Pollefeys, Josef Sivic, Tomas Pajdla, Akihiko Torii

Towards High Performance Video Object Detection
Xizhou Zhu, Jifeng Dai, Lu Yuan, Yichen Wei

Neural Baby Talk
Jiasen Lu, Jianwei Yang, Dhruv Batra, Devi Parikh

Few-Shot Image Recognition by Predicting Parameters From Activations
Siyuan Qiao, Chenxi Liu, Wei Shen, Alan L. Yuille

Iterative Visual Reasoning Beyond Convolutions
Xinlei Chen, Li-Jia Li, Li Fei-Fei, Abhinav Gupta

Visual Question Reasoning on General Dependency Tree
Qingxing Cao, Xiaodan Liang, Bailing Li, Guanbin Li, Liang Lin

CVM-Net: Cross-View Matching Network for Image-Based Ground-to-Aerial Geo-Localization
Sixing Hu, Mengdan Feng, Rang M. H. Nguyen, Gim Hee Lee

Revisiting Dilated Convolution: A Simple Approach for Weakly- and Semi-Supervised Semantic Segmentation
Yunchao Wei, Huaxin Xiao, Honghui Shi, Zequn Jie, Jiashi Feng, Thomas S. Huang

Low-Shot Learning From Imaginary Data
Yu-Xiong Wang, Ross Girshick, Martial Hebert, Bharath Hariharan

DoubleFusion: Real-Time Capture of Human Performances With Inner Body Shapes From a Single Depth Sensor
Tao Yu, Zerong Zheng, Kaiwen Guo, Jianhui Zhao, Qionghai Dai, Hao Li, Gerard Pons-Moll, Yebin Liu

DensePose: Dense Human Pose Estimation in the Wild
Rıza Alp Güler, Natalia Neverova, Iasonas Kokkinos

Ordinal Depth Supervision for 3D Human Pose Estimation
Georgios Pavlakos, Xiaowei Zhou, Kostas Daniilidis

Consensus Maximization for Semantic Region Correspondences
Pablo Speciale, Danda P. Paudel, Martin R. Oswald, Hayko Riemenschneider, Luc Van Gool, Marc Pollefeys

Robust Hough Transform Based 3D Reconstruction From Circular Light Fields
Alessandro Vianello, Jens Ackermann, Maximilian Diebold, Bernd Jähne

Alive Caricature From 2D to 3D
Qianyi Wu, Juyong Zhang, Yu-Kun Lai, Jianmin Zheng, Jianfei Cai

Nonlinear 3D Face Morphable Model
Luan Tran, Xiaoming Liu

Through-Wall Human Pose Estimation Using Radio Signals
Mingmin Zhao, Tianhong Li, Mohammad Abu Alsheikh, Yonglong Tian, Hang Zhao, Antonio Torralba, Dina Katabi

What Makes a Video a Video: Analyzing Temporal Information in Video Understanding Models and Datasets
De-An Huang, Vignesh Ramanathan, Dhruv Mahajan, Lorenzo Torresani, Manohar Paluri, Li Fei-Fei, Juan Carlos Niebles

Fast Video Object Segmentation by Reference-Guided Mask Propagation
Seoung Wug Oh, Joon-Young Lee, Kalyan Sunkavalli, Seon Joo Kim

NeuralNetwork-Viterbi: A Framework for Weakly Supervised Video Learning
Alexander Richard, Hilde Kuehne, Ahsan Iqbal, Juergen Gall

Actor and Observer: Joint Modeling of First and Third-Person Videos
Gunnar A. Sigurdsson, Abhinav Gupta, Cordelia Schmid, Ali Farhadi, Karteek Alahari

HSA-RNN: Hierarchical Structure-Adaptive RNN for Video Summarization
Bin Zhao, Xuelong Li, Xiaoqiang Lu

Fast and Accurate Online Video Object Segmentation via Tracking Parts
Jingchun Cheng, Yi-Hsuan Tsai, Wei-Chih Hung, Shengjin Wang, Ming-Hsuan Yang

Now You Shake Me: Towards Automatic 4D Cinema
Yuhao Zhou, Makarand Tapaswi, Sanja Fidler

Viewpoint-Aware Video Summarization
Atsushi Kanehira, Luc Van Gool, Yoshitaka Ushiku, Tatsuya Harada

Photometric Stereo in Participating Media Considering Shape-Dependent Forward Scatter
Yuki Fujimura, Masaaki Iiyama, Atsushi Hashimoto, Michihiko Minoh

Direction-Aware Spatial Context Features for Shadow Detection
Xiaowei Hu, Lei Zhu, Chi-Wing Fu, Jing Qin, Pheng-Ann Heng

Discriminative Learning of Latent Features for Zero-Shot Recognition
Yan Li, Junge Zhang, Jianguo Zhang, Kaiqi Huang

Learning to Adapt Structured Output Space for Semantic Segmentation
Yi-Hsuan Tsai, Wei-Chih Hung, Samuel Schulter, Kihyuk Sohn, Ming-Hsuan Yang, Manmohan Chandraker

Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics
Alex Kendall, Yarin Gal, Roberto Cipolla

Jointly Localizing and Describing Events for Dense Video Captioning
Yehao Li, Ting Yao, Yingwei Pan, Hongyang Chao, Tao Mei

Going From Image to Video Saliency: Augmenting Image Salience With Dynamic Attentional Push
Siavash Gorji, James J. Clark

M3: Multimodal Memory Modelling for Video Captioning
Junbo Wang, Wei Wang, Yan Huang, Liang Wang, Tieniu Tan

Emotional Attention: A Study of Image Sentiment and Visual Attention
Shaojing Fan, Zhiqi Shen, Ming Jiang, Bryan L. Koenig, Juan Xu, Mohan S. Kankanhalli, Qi Zhao

A Low Power, High Throughput, Fully Event-Based Stereo System
Alexander Andreopoulos, Hirak J. Kashyap, Tapan K. Nayak, Arnon Amir, Myron D. Flickner

VITON: An Image-Based Virtual Try-On Network
Xintong Han, Zuxuan Wu, Zhe Wu, Ruichi Yu, Larry S. Davis

Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation
Pengyuan Lyu, Cong Yao, Wenhao Wu, Shuicheng Yan, Xiang Bai

Multi-Content GAN for Few-Shot Font Style Transfer
Samaneh Azadi, Matthew Fisher, Vladimir G. Kim, Zhaowen Wang, Eli Shechtman, Trevor Darrell

Audio to Body Dynamics
Eli Shlizerman, Lucio Dery, Hayden Schoen, Ira Kemelmacher-Shlizerman

Weakly Supervised Coupled Networks for Visual Sentiment Analysis
Jufeng Yang, Dongyu She, Yu-Kun Lai, Paul L. Rosin, Ming-Hsuan Yang

Future Person Localization in First-Person Videos
Takuma Yagi, Karttikeya Mangalam, Ryo Yonetani, Yoichi Sato

Preserving Semantic Relations for Zero-Shot Learning
Yashas Annadani, Soma Biswas

Show Me a Story: Towards Coherent Neural Story Illustration
Hareesh Ravi, Lezi Wang, Carlos Muniz, Leonid Sigal, Dimitris Metaxas, Mubbasir Kapadia

Reconstruction Network for Video Captioning
Bairui Wang, Lin Ma, Wei Zhang, Wei Liu

Fast Spectral Ranking for Similarity Search
Ahmet Iscen, Yannis Avrithis, Giorgos Tolias, Teddy Furon, Ondřej Chum

Mining on Manifolds: Metric Learning Without Labels
Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, Ondřej Chum

PIXOR: Real-Time 3D Object Detection From Point Clouds
Bin Yang, Wenjie Luo, Raquel Urtasun

Leveraging Unlabeled Data for Crowd Counting by Learning to Rank
Xialei Liu, Joost van de Weijer, Andrew D. Bagdanov

Zero-Shot Kernel Learning
Hongguang Zhang, Piotr Koniusz

Differential Attention for Visual Question Answering
Badri Patro, Vinay P. Namboodiri

Learning From Noisy Web Data With Category-Level Supervision
Li Niu, Qingtao Tang, Ashok Veeraraghavan, Ashutosh Sabharwal

Toward Driving Scene Understanding: A Dataset for Learning Driver Behavior and Causal Reasoning
Vasili Ramanishka, Yi-Ting Chen, Teruhisa Misu, Kate Saenko

Learning Attribute Representations With Localization for Flexible Fashion Search
Kenan E. Ak, Ashraf A. Kassim, Joo Hwee Lim, Jo Yew Tham

Bidirectional Retrieval Made Simple
Jônatas Wehrmann, Rodrigo C. Barros

Learning Multi-Instance Enriched Image Representations via Non-Greedy Ratio Maximization of the l1-Norm Distances
Kai Liu, Hua Wang, Feiping Nie, Hao Zhang

Learning Visual Knowledge Memory Networks for Visual Question Answering
Zhou Su, Chen Zhu, Yinpeng Dong, Dongqi Cai, Yurong Chen, Jianguo Li

Visual Grounding via Accumulated Attention
Chaorui Deng, Qi Wu, Qingyao Wu, Fuyuan Hu, Fan Lyu, Mingkui Tan

Beyond Trade-Off: Accelerate FCN-Based Face Detector With Higher Accuracy
Guanglu Song, Yu Liu, Ming Jiang, Yujie Wang, Junjie Yan, Biao Leng

PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning
Arun Mallya, Svetlana Lazebnik

Repulsion Loss: Detecting Pedestrians in a Crowd
Xinlong Wang, Tete Xiao, Yuning Jiang, Shuai Shao, Jian Sun, Chunhua Shen

Neural Sign Language Translation
Necati Cihan Camgoz, Simon Hadfield, Oscar Koller, Hermann Ney, Richard Bowden

Non-Local Neural Networks
Xiaolong Wang, Ross Girshick, Abhinav Gupta, Kaiming He

LAMV: Learning to Align and Match Videos With Kernelized Temporal Layers
Lorenzo Baraldi, Matthijs Douze, Rita Cucchiara, Hervé Jégou

Optimizing Video Object Detection via a Scale-Time Lattice
Kai Chen, Jiaqi Wang, Shuo Yang, Xingcheng Zhang, Yuanjun Xiong, Chen Change Loy, Dahua Lin

Learning Compressible 360° Video Isomers
Yu-Chuan Su, Kristen Grauman

Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification
Xiang Long, Chuang Gan, Gerard de Melo, Jiajun Wu, Xiao Liu, Shilei Wen

What Have We Learned From Deep Representations for Action Recognition?
Christoph Feichtenhofer, Axel Pinz, Richard P. Wildes, Andrew Zisserman

Controllable Video Generation With Sparse Trajectories
Zekun Hao, Xun Huang, Serge Belongie

Representing and Learning High Dimensional Data With the Optimal Transport Map From a Probabilistic Viewpoint
Serim Park, Matthew Thorpe

CLIP-Q: Deep Network Compression Learning by In-Parallel Pruning-Quantization
Frederick Tung, Greg Mori

Inference in Higher Order MRF-MAP Problems With Small and Large Cliques
Ishant Shanu, Chetan Arora, S.N. Maheshwari

ROAD: Reality Oriented Adaptation for Semantic Segmentation of Urban Scenes
Yuhua Chen, Wen Li, Luc Van Gool

Eye In-Painting With Exemplar Generative Adversarial Networks
Brian Dolhansky, Cristian Canton Ferrer

ClcNet: Improving the Efficiency of Convolutional Neural Network Using Channel Local Convolutions
Dong-Qing Zhang

Towards Effective Low-Bitwidth Convolutional Neural Networks
Bohan Zhuang, Chunhua Shen, Mingkui Tan, Lingqiao Liu, Ian Reid

Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks
Jason Kuen, Xiangfei Kong, Zhe Lin, Gang Wang, Jianxiong Yin, Simon See, Yap-Peng Tan

Face Aging With Identity-Preserved Conditional Generative Adversarial Networks
Zongwei Wang, Xu Tang, Weixin Luo, Shenghua Gao

Unsupervised Cross-Dataset Person Re-Identification by Transfer Learning of Spatial-Temporal Patterns
Jianming Lv, Weihang Chen, Qing Li, Can Yang

Feature Quantization for Defending Against Distortion of Images
Zhun Sun, Mete Ozay, Yan Zhang, Xing Liu, Takayuki Okatani

Tagging Like Humans: Diverse and Distinct Image Annotation
Baoyuan Wu, Weidong Chen, Peng Sun, Wei Liu, Bernard Ghanem, Siwei Lyu

Re-Weighted Adversarial Adaptation Network for Unsupervised Domain Adaptation
Qingchao Chen, Yang Liu, Zhaowen Wang, Ian Wassell, Kevin Chetty

Inferring Semantic Layout for Hierarchical Text-to-Image Synthesis
Seunghoon Hong, Dingdong Yang, Jongwook Choi, Honglak Lee

Regularizing RNNs for Caption Generation by Reconstructing the Past With the Present
Xinpeng Chen, Lin Ma, Wenhao Jiang, Jian Yao, Wei Liu

Unsupervised Domain Adaptation With Similarity Learning
Pedro O. Pinheiro

Learning Deep Sketch Abstraction
Umar Riaz Muhammad, Yongxin Yang, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales

Matching Adversarial Networks
Gellért Máttyus, Raquel Urtasun

SoS-RSC: A Sum-of-Squares Polynomial Approach to Robustifying Subspace Clustering Algorithms
Mario Sznaier, Octavia Camps

Resource Aware Person Re-Identification Across Multiple Resolutions
Yan Wang, Lequn Wang, Yurong You, Xu Zou, Vincent Chen, Serena Li, Gao Huang, Bharath Hariharan, Kilian Q. Weinberger

Learning and Using the Arrow of Time
Donglai Wei, Joseph J. Lim, Andrew Zisserman, William T. Freeman

Neural Style Transfer via Meta Networks
Falong Shen, Shuicheng Yan, Gang Zeng

People, Penguins and Petri Dishes: Adapting Object Counting Models to New Visual Domains and Object Types Without Forgetting
Mark Marsden, Kevin McGuinness, Suzanne Little, Ciara E. Keogh, Noel E. O'Connor

HydraNets: Specialized Dynamic Architectures for Efficient Inference
Ravi Teja Mullapudi, William R. Mark, Noam Shazeer, Kayvon Fatahalian

SketchMate: Deep Hashing for Million-Scale Human Sketch Retrieval
Peng Xu, Yongye Huang, Tongtong Yuan, Kaiyue Pang, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales, Zhanyu Ma, Jun Guo

From Source to Target and Back: Symmetric Bi-Directional Adaptive GAN
Paolo Russo, Fabio M. Carlucci, Tatiana Tommasi, Barbara Caputo

OLÉ: Orthogonal Low-Rank Embedding - A Plug and Play Geometric Loss for Deep Learning
José Lezama, Qiang Qiu, Pablo Musé, Guillermo Sapiro

Efficient Parametrization of Multi-Domain Deep Neural Networks
Sylvestre-Alvise Rebuffi, Hakan Bilen, Andrea Vedaldi

Deep Density Clustering of Unconstrained Faces
Wei-An Lin, Jun-Cheng Chen, Carlos D. Castillo, Rama Chellappa

Geometric Multi-Model Fitting With a Convex Relaxation Algorithm
Paul Amayo, Pedro Piniés, Lina M. Paz, Paul Newman

Fast and Robust Estimation for Unit-Norm Constrained Linear Fitting Problems
Daiki Ikami, Toshihiko Yamasaki, Kiyoharu Aizawa

Importance Weighted Adversarial Nets for Partial Domain Adaptation
Jing Zhang, Zewei Ding, Wanqing Li, Philip Ogunbona

Efficient Subpixel Refinement With Symbolic Linear Predictors
Vincent Lui, Jonathon Geeves, Winston Yii, Tom Drummond

Scale-Recurrent Network for Deep Image Deblurring
Xin Tao, Hongyun Gao, Xiaoyong Shen, Jue Wang, Jiaya Jia

DeblurGAN: Blind Motion Deblurring Using Conditional Adversarial Networks
Orest Kupyn, Volodymyr Budzan, Mykola Mykhailych, Dmytro Mishkin, Jiří Matas

A2-RL: Aesthetics Aware Reinforcement Learning for Image Cropping
Debang Li, Huikai Wu, Junge Zhang, Kaiqi Huang

Single Image Dehazing via Conditional Generative Adversarial Network
Runde Li, Jinshan Pan, Zechao Li, Jinhui Tang

On the Duality Between Retinex and Image Dehazing
Adrian Galdran, Aitor Alvarez-Gila, Alessandro Bria, Javier Vazquez-Corral, Marcelo Bertalmío

Arbitrary Style Transfer With Deep Feature Reshuffle
Shuyang Gu, Congliang Chen, Jing Liao, Lu Yuan

Nonlocal Low-Rank Tensor Factor Analysis for Image Restoration
Xinyuan Zhang, Xin Yuan, Lawrence Carin

Avatar-Net: Multi-Scale Zero-Shot Style Transfer by Feature Decoration
Lu Sheng, Ziyi Lin, Jing Shao, Xiaogang Wang

Missing Slice Recovery for Tensors Using a Low-Rank Model in Embedded Space
Tatsuya Yokota, Burak Erem, Seyhmus Guler, Simon K. Warfield, Hidekata Hontani

Deep Semantic Face Deblurring
Ziyi Shen, Wei-Sheng Lai, Tingfa Xu, Jan Kautz, Ming-Hsuan Yang

GraphBit: Bitwise Interaction Mining via Deep Reinforcement Learning
Yueqi Duan, Ziwei Wang, Jiwen Lu, Xudong Lin, Jie Zhou

Recurrent Saliency Transformation Network: Incorporating Multi-Stage Visual Cues for Small Organ Segmentation
Qihang Yu, Lingxi Xie, Yan Wang, Yuyin Zhou, Elliot K. Fishman, Alan L. Yuille

Thoracic Disease Identification and Localization With Limited Supervision
Zhe Li, Chong Wang, Mei Han, Yuan Xue, Wei Wei, Li-Jia Li, Li Fei-Fei

Quantization of Fully Convolutional Networks for Accurate Biomedical Image Segmentation
Xiaowei Xu, Qing Lu, Lin Yang, Sharon Hu, Danny Chen, Yu Hu, Yiyu Shi

Visual Feature Attribution Using Wasserstein GANs
Christian F. Baumgartner, Lisa M. Koch, Kerem Can Tezcan, Jia Xi Ang, Ender Konukoglu

Total Capture: A 3D Deformation Model for Tracking Faces, Hands, and Bodies
Hanbyul Joo, Tomas Simon, Yaser Sheikh

Augmented Skeleton Space Transfer for Depth-Based Hand Pose Estimation
Seungryul Baek, Kwang In Kim, Tae-Kyun Kim

Synthesizing Images of Humans in Unseen Poses
Guha Balakrishnan, Amy Zhao, Adrian V. Dalca, Frédo Durand, John Guttag

SSNet: Scale Selection Network for Online 3D Action Prediction
Jun Liu, Amir Shahroudy, Gang Wang, Ling-Yu Duan, Alex C. Kot

Detecting and Recognizing Human-Object Interactions
Georgia Gkioxari, Ross Girshick, Piotr Dollár, Kaiming He

Unsupervised Learning and Segmentation of Complex Activities From Video
Fadime Sener, Angela Yao

Unsupervised Training for 3D Morphable Model Regression
Kyle Genova, Forrester Cole, Aaron Maschinot, Aaron Sarna, Daniel Vlasic, William T. Freeman

Video Based Reconstruction of 3D People Models
Thiemo Alldieck, Marcus Magnor, Weipeng Xu, Christian Theobalt, Gerard Pons-Moll

Pose-Guided Photorealistic Face Rotation
Yibo Hu, Xiang Wu, Bing Yu, Ran He, Zhenan Sun

Mesoscopic Facial Geometry Inference Using Deep Neural Networks
Loc Huynh, Weikai Chen, Shunsuke Saito, Jun Xing, Koki Nagano, Andrew Jones, Paul Debevec, Hao Li

Hand PointNet: 3D Hand Pose Estimation Using Point Sets
Liuhao Ge, Yujun Cai, Junwu Weng, Junsong Yuan

Seeing Voices and Hearing Faces: Cross-Modal Biometric Matching
Arsha Nagrani, Samuel Albanie, Andrew Zisserman

Learning Monocular 3D Human Pose Estimation From Multi-View Images
Helge Rhodin, Jörg Spörri, Isinsu Katircioglu, Victor Constantin, Frédéric Meyer, Erich Müller, Mathieu Salzmann, Pascal Fua

Separating Style and Content for Generalized Style Transfer
Yexun Zhang, Ya Zhang, Wenbin Cai

TextureGAN: Controlling Deep Image Synthesis With Texture Patches
Wenqi Xian, Patsorn Sangkloy, Varun Agrawal, Amit Raj, Jingwan Lu, Chen Fang, Fisher Yu, James Hays

Connecting Pixels to Privacy and Utility: Automatic Redaction of Private Information in Images
Tribhuvanesh Orekondy, Mario Fritz, Bernt Schiele

MapNet: An Allocentric Spatial Memory for Mapping Environments
João F. Henriques, Andrea Vedaldi

Accurate and Diverse Sampling of Sequences Based on a “Best of Many” Sample Objective
Apratim Bhattacharyya, Bernt Schiele, Mario Fritz

VirtualHome: Simulating Household Activities via Programs
Xavier Puig, Kevin Ra, Marko Boben, Jiaman Li, Tingwu Wang, Sanja Fidler, Antonio Torralba

Generate to Adapt: Aligning Domains Using Generative Adversarial Networks
Swami Sankaranarayanan, Yogesh Balaji, Carlos D. Castillo, Rama Chellappa

Multi-Agent Diverse Generative Adversarial Networks
Arnab Ghosh, Viveka Kulharia, Vinay P. Namboodiri, Philip H.S. Torr, Puneet K. Dokania

A PID Controller Approach for Stochastic Optimization of Deep Networks
Wangpeng An, Haoqian Wang, Qingyun Sun, Jun Xu, Qionghai Dai, Lei Zhang

“Learning-Compression” Algorithms for Neural Net Pruning
Miguel Á. Carreira-Perpiñán, Yerlan Idelbayev

Large-Scale Distance Metric Learning With Uncertainty
Qi Qian, Jiasheng Tang, Hao Li, Shenghuo Zhu, Rong Jin

Guide Me: Interacting With Deep Networks
Christian Rupprecht, Iro Laina, Nassir Navab, Gregory D. Hager, Federico Tombari

Art of Singular Vectors and Universal Adversarial Perturbations
Valentin Khrulkov, Ivan Oseledets

Deflecting Adversarial Attacks With Pixel Deflection
Aaditya Prakash, Nick Moran, Solomon Garber, Antonella DiLillo, James Storer

MovieGraphs: Towards Understanding Human-Centric Situations From Videos
Paul Vicol, Makarand Tapaswi, Lluís Castrejón, Sanja Fidler

SemStyle: Learning to Generate Stylised Image Captions Using Unaligned Text
Alexander Mathews, Lexing Xie, Xuming He

Benchmarking 6DOF Outdoor Visual Localization in Changing Conditions
Torsten Sattler, Will Maddern, Carl Toft, Akihiko Torii, Lars Hammarstrand, Erik Stenborg, Daniel Safari, Masatoshi Okutomi, Marc Pollefeys, Josef Sivic, Fredrik Kahl, Tomas Pajdla

iVQA: Inverse Visual Question Answering
Feng Liu, Tao Xiang, Timothy M. Hospedales, Wankou Yang, Changyin Sun

Unsupervised Person Image Synthesis in Arbitrary Poses
Albert Pumarola, Antonio Agudo, Alberto Sanfeliu, Francesc Moreno-Noguer

Learning Descriptor Networks for 3D Shape Synthesis and Analysis
Jianwen Xie, Zilong Zheng, Ruiqi Gao, Wenguan Wang, Song-Chun Zhu, Ying Nian Wu

Neural Kinematic Networks for Unsupervised Motion Retargetting
Ruben Villegas, Jimei Yang, Duygu Ceylan, Honglak Lee

Group Consistent Similarity Learning via Deep CRF for Person Re-Identification
Dapeng Chen, Dan Xu, Hongsheng Li, Nicu Sebe, Xiaogang Wang

Learning Compositional Visual Concepts With Mutual Consistency
Yunye Gong, Srikrishna Karanam, Ziyan Wu, Kuan-Chuan Peng, Jan Ernst, Peter C. Doerschuk

NestedNet: Learning Nested Sparse Structures in Deep Neural Networks
Eunwoo Kim, Chanho Ahn, Songhwai Oh

Context Embedding Networks
Kun Ho Kim, Oisin Mac Aodha, Pietro Perona

Iterative Learning With Open-Set Noisy Labels
Yisen Wang, Weiyang Liu, Xingjun Ma, James Bailey, Hongyuan Zha, Le Song, Shu-Tao Xia

Learning Transferable Architectures for Scalable Image Recognition
Barret Zoph, Vijay Vasudevan, Jonathon Shlens, Quoc V. Le

SBNet: Sparse Blocks Network for Fast Inference
Mengye Ren, Andrei Pokrovsky, Bin Yang, Raquel Urtasun

Language-Based Image Editing With Recurrent Attentive Models
Jianbo Chen, Yelong Shen, Jianfeng Gao, Jingjing Liu, Xiaodong Liu

Net2Vec: Quantifying and Explaining How Concepts Are Encoded by Filters in Deep Neural Networks
Ruth Fong, Andrea Vedaldi

End-to-End Dense Video Captioning With Masked Transformer
Luowei Zhou, Yingbo Zhou, Jason J. Corso, Richard Socher, Caiming Xiong

A Neural Multi-Sequence Alignment TeCHnique (NeuMATCH)
Pelin Dogan, Boyang Li, Leonid Sigal, Markus Gross

Path Aggregation Network for Instance Segmentation
Shu Liu, Lu Qi, Haifang Qin, Jianping Shi, Jiaya Jia

The iNaturalist Species Classification and Detection Dataset
Grant Van Horn, Oisin Mac Aodha, Yang Song, Yin Cui, Chen Sun, Alex Shepard, Hartwig Adam, Pietro Perona, Serge Belongie

Multimodal Explanations: Justifying Decisions and Pointing to the Evidence
Dong Huk Park, Lisa Anne Hendricks, Zeynep Akata, Anna Rohrbach, Bernt Schiele, Trevor Darrell, Marcus Rohrbach

StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation
Yunjey Choi, Minje Choi, Munyoung Kim, Jung-Woo Ha, Sunghun Kim, Jaegul Choo

High-Resolution Image Synthesis and Semantic Manipulation With Conditional GANs
Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, Bryan Catanzaro

Semi-Parametric Image Synthesis
Xiaojuan Qi, Qifeng Chen, Jiaya Jia, Vladlen Koltun

BlockDrop: Dynamic Inference Paths in Residual Networks
Zuxuan Wu, Tushar Nagarajan, Abhishek Kumar, Steven Rennie, Larry S. Davis, Kristen Grauman, Rogerio Feris

Interpretable Convolutional Neural Networks
Quanshi Zhang, Ying Nian Wu, Song-Chun Zhu

Deep Cross-Media Knowledge Transfer
Xin Huang, Yuxin Peng

Interleaved Structured Sparse Convolutional Neural Networks
Guotian Xie, Jingdong Wang, Ting Zhang, Jianhuang Lai, Richang Hong, Guo-Jun Qi

A Variational U-Net for Conditional Appearance and Shape Generation
Patrick Esser, Ekaterina Sutter, Björn Ommer

Detach and Adapt: Learning Cross-Domain Disentangled Deep Representation
Yen-Cheng Liu, Yu-Ying Yeh, Tzu-Chien Fu, Sheng-De Wang, Wei-Chen Chiu, Yu-Chiang Frank Wang

Learning Deep Structured Active Contours End-to-End
Diego Marcos, Devis Tuia, Benjamin Kellenberger, Lisa Zhang, Min Bai, Renjie Liao, Raquel Urtasun

Deep Learning Under Privileged Information Using Heteroscedastic Dropout
John Lambert, Ozan Sener, Silvio Savarese

Smooth Neighbors on Teacher Graphs for Semi-Supervised Learning
Yucen Luo, Jun Zhu, Mengxi Li, Yong Ren, Bo Zhang

Interpret Neural Networks by Identifying Critical Data Routing Paths
Yulong Wang, Hang Su, Bo Zhang, Xiaolin Hu

Deep Spatio-Temporal Random Fields for Efficient Video Segmentation
Siddhartha Chandra, Camille Couprie, Iasonas Kokkinos

Customized Image Narrative Generation via Interactive Visual Question Generation and Answering
Andrew Shin, Yoshitaka Ushiku, Tatsuya Harada

PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume
Deqing Sun, Xiaodong Yang, Ming-Yu Liu, Jan Kautz

Revisiting Deep Intrinsic Image Decompositions
Qingnan Fan, Jiaolong Yang, Gang Hua, Baoquan Chen, David Wipf

Multi-Cell Detection and Classification Using a Generative Convolutional Model
Florence Yellin, Benjamin D. Haeffele, Sophie Roth, René Vidal

Learning Spatial-Aware Regressions for Visual Tracking
Chong Sun, Dong Wang, Huchuan Lu, Ming-Hsuan Yang

High Performance Visual Tracking With Siamese Region Proposal Network
Bo Li, Junjie Yan, Wei Wu, Zheng Zhu, Xiaolin Hu

LiteFlowNet: A Lightweight Convolutional Neural Network for Optical Flow Estimation
Tak-Wai Hui, Xiaoou Tang, Chen Change Loy

VITAL: VIsual Tracking via Adversarial Learning
Yibing Song, Chao Ma, Xiaohe Wu, Lijun Gong, Linchao Bao, Wangmeng Zuo, Chunhua Shen, Rynson W.H. Lau, Ming-Hsuan Yang

Super SloMo: High Quality Estimation of Multiple Intermediate Frames for Video Interpolation
Huaizu Jiang, Deqing Sun, Varun Jampani, Ming-Hsuan Yang, Erik Learned-Miller, Jan Kautz

Real-World Repetition Estimation by Div, Grad and Curl
Tom F. H. Runia, Cees G. M. Snoek, Arnold W. M. Smeulders

Recurrent Pixel Embedding for Instance Grouping
Shu Kong, Charless C. Fowlkes

Deep Unsupervised Saliency Detection: A Multiple Noisy Labeling Perspective
Jing Zhang, Tong Zhang, Yuchao Dai, Mehrtash Harandi, Richard Hartley

Learning Intrinsic Image Decomposition From Watching the World
Zhengqi Li, Noah Snavely

TieNet: Text-Image Embedding Network for Common Thorax Disease Classification and Reporting in Chest X-Rays
Xiaosong Wang, Yifan Peng, Le Lu, Zhiyong Lu, Ronald M. Summers

Generating Synthetic X-Ray Images of a Person From the Surface Geometry
Brian Teixeira, Vivek Singh, Terrence Chen, Kai Ma, Birgi Tamersoy, Yifan Wu, Elena Balashova, Dorin Comaniciu

Gibson Env: Real-World Perception for Embodied Agents
Fei Xia, Amir R. Zamir, Zhiyang He, Alexander Sax, Jitendra Malik, Silvio Savarese

Reinforcement Cutting-Agent Learning for Video Object Segmentation
Junwei Han, Le Yang, Dingwen Zhang, Xiaojun Chang, Xiaodan Liang

Feature Space Transfer for Data Augmentation
Bo Liu, Xudong Wang, Mandar Dixit, Roland Kwitt, Nuno Vasconcelos

Analytic Expressions for Probabilistic Moments of PL-DNN With Gaussian Input
Adel Bibi, Modar Alfadly, Bernard Ghanem

Detail-Preserving Pooling in Deep Networks
Faraz Saeedan, Nicolas Weber, Michael Goesele, Stefan Roth

Rethinking Feature Distribution for Loss Functions in Image Classification
Weitao Wan, Yuanyi Zhong, Tianpeng Li, Jiansheng Chen

Shift: A Zero FLOP, Zero Parameter Alternative to Spatial Convolutions
Bichen Wu, Alvin Wan, Xiangyu Yue, Peter Jin, Sicheng Zhao, Noah Golmant, Amir Gholaminejad, Joseph Gonzalez, Kurt Keutzer

Sketch-a-Classifier: Sketch-Based Photo Classifier Generation
Conghui Hu, Da Li, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales

Light Field Intrinsics With a Deep Encoder-Decoder Network
Anna Alperovich, Ole Johannsen, Michael Strecke, Bastian Goldluecke

Learning Generative ConvNets via Multi-Grid Modeling and Sampling
Ruiqi Gao, Yang Lu, Junpei Zhou, Song-Chun Zhu, Ying Nian Wu

Manifold Learning in Quotient Spaces
Éloi Mehr, André Lieutier, Fernando Sanchez Bermudez, Vincent Guitteny, Nicolas Thome, Matthieu Cord

Learning Intelligent Dialogs for Bounding Box Annotation
Ksenia Konyushkova, Jasper Uijlings, Christoph H. Lampert, Vittorio Ferrari

Boosting Adversarial Attacks With Momentum
Yinpeng Dong, Fangzhou Liao, Tianyu Pang, Hang Su, Jun Zhu, Xiaolin Hu, Jianguo Li

NISP: Pruning Networks Using Neuron Importance Score Propagation
Ruichi Yu, Ang Li, Chun-Fu Chen, Jui-Hsin Lai, Vlad I. Morariu, Xintong Han, Mingfei Gao, Ching-Yung Lin, Larry S. Davis

PointGrid: A Deep Network for 3D Shape Understanding
Truc Le, Ye Duan

Tell Me Where to Look: Guided Attention Inference Network
Kunpeng Li, Ziyan Wu, Kuan-Chuan Peng, Jan Ernst, Yun Fu

3D Semantic Segmentation With Submanifold Sparse Convolutional Networks
Benjamin Graham, Martin Engelcke, Laurens van der Maaten

TOM-Net: Learning Transparent Object Matting From a Single Image
Guanying Chen, Kai Han, Kwan-Yee K. Wong

Translating and Segmenting Multimodal Medical Volumes With Cycle- and Shape-Consistency Generative Adversarial Network
Zizhao Zhang, Lin Yang, Yefeng Zheng

An Unsupervised Learning Model for Deformable Medical Image Registration
Guha Balakrishnan, Amy Zhao, Mert R. Sabuncu, John Guttag, Adrian V. Dalca

Deep Lesion Graphs in the Wild: Relationship Learning and Organization of Significant Radiology Image Findings in a Diverse Large-Scale Lesion Database
Ke Yan, Xiaosong Wang, Le Lu, Ling Zhang, Adam P. Harrison, Mohammadhadi Bagheri, Ronald M. Summers

Learning Distributions of Shape Trajectories From Longitudinal Datasets: A Hierarchical Model on a Manifold of Diffeomorphisms
Alexandre Bône, Olivier Colliot, Stanley Durrleman

CNN Driven Sparse Multi-Level B-Spline Image Registration
Pingge Jiang, James A. Shackleford

Anatomical Priors in Convolutional Networks for Unsupervised Biomedical Segmentation
Adrian V. Dalca, John Guttag, Mert R. Sabuncu

3D Registration of Curves and Surfaces Using Local Differential Information
Carolina Raposo, João P. Barreto

Weakly Supervised Learning of Single-Cell Feature Embeddings
Juan C. Caicedo, Claire McQuin, Allen Goodman, Shantanu Singh, Anne E. Carpenter

Guided Proofreading of Automatic Segmentations for Connectomics
Daniel Haehn, Verena Kaynig, James Tompkin, Jeff W. Lichtman, Hanspeter Pfister

Wide Compression: Tensor Ring Nets
Wenqi Wang, Yifan Sun, Brian Eriksson, Wenlin Wang, Vaneet Aggarwal

Improvements to Context Based Self-Supervised Learning
T. Nathan Mundhenk, Daniel Ho, Barry Y. Chen

Learning Structure and Strength of CNN Filters for Small Sample Size Training
Rohit Keshari, Mayank Vatsa, Richa Singh, Afzel Noore

Boosting Self-Supervised Learning via Knowledge Transfer
Mehdi Noroozi, Ananth Vinjimoor, Paolo Favaro, Hamed Pirsiavash

The Power of Ensembles for Active Learning in Image Classification
William H. Beluch, Tim Genewein, Andreas Nürnberger, Jan M. Köhler

Learning Compact Recurrent Neural Networks With Block-Term Tensor Decomposition
Jinmian Ye, Linnan Wang, Guangxi Li, Di Chen, Shandian Zhe, Xinqi Chu, Zenglin Xu

Spatially-Adaptive Filter Units for Deep Neural Networks
Domen Tabernik, Matej Kristan, Aleš Leonardis

SO-Net: Self-Organizing Network for Point Cloud Analysis
Jiaxin Li, Ben M. Chen, Gim Hee Lee

SGAN: An Alternative Training of Generative Adversarial Networks
Tatjana Chavdarova, François Fleuret

SketchyGAN: Towards Diverse and Realistic Sketch to Image Synthesis
Wengling Chen, James Hays

Explicit Loss-Error-Aware Quantization for Low-Bit Deep Neural Networks
Aojun Zhou, Anbang Yao, Kuan Wang, Yurong Chen

Towards Universal Representation for Unseen Action Recognition
Yi Zhu, Yang Long, Yu Guan, Shawn Newsam, Ling Shao

Deep Image Prior
Dmitry Ulyanov, Andrea Vedaldi, Victor Lempitsky

ST-GAN: Spatial Transformer Generative Adversarial Networks for Image Compositing
Chen-Hsuan Lin, Ersin Yumer, Oliver Wang, Eli Shechtman, Simon Lucey

CartoonGAN: Generative Adversarial Networks for Photo Cartoonization
Yang Chen, Yu-Kun Lai, Yong-Jin Liu