Publication


The best is yet to come

I am interested in computer vision related research. My most updated publication list is in google scholar.


Facebook Research (2018 - Now)


Oral: Zero-Shot Grounding of Objects from Natural Language Queries. Arka Sadhu, Kan Chen, Ram Nevatia, Computer Vision (ICCV), IEEE International Conference on, 2019 [PDF][Code]

ArXiv: Billion-scale Semi-supervised Learning for Image Classification. Zeki Yalniz, Hervé Jégou, Kan Chen, Manohar Paluri, Dhruv Mahajan, ArXiv, 2019 [PDF][Code]


PhD years (2013 - 2018)


Thesis: Multimodal Reasoning of Visual Information and Natural Language. Kan Chen, USC Digital Library, 2019 [PDF]

Oral: MAC: Mining Activity Concepts for Language-based Temporal Localization. Runzhou Ge, Jiyang Gao, Kan Chen, Ram Nevatia, Applications of Computer Vision (WACV), IEEE Winter Conference on, 2019 [PDF][Code]

Poster: CTAP: Complementary Temporal Action Proposal Generation. Kan Chen*, Jiyang Gao*, Ram Nevatia, Computer Vision (ECCV), European Conference on, 2018 [PDF][Code]

Best paper: Visually Indicated Sound Generation by Perceptually Optimized Classification. Kan Chen*, Chuanxi Zhang*, Chen Fang, Zhaowen Wang, Trung Bui, Ram Nevatia, Computer Vision Workshop (ECCVW), European Conference on, 2018 [PDF][Code]

Poster: Knowledge Aided Consistency for Weakly Supervised Phrase Grounding. Kan Chen, Jiyang Gao, Ram Nevatia, Computer Vision and Pattern Recognition (CVPR), IEEE Conference on, 2018 [PDF][Supplementary][Code]

Poster: Motion-Appearance Co-Memory Networks for Video Question Answering. Jiyang Gao*, Runzhou Ge*, Kan Chen, Ram Nevatia, Computer Vision and Pattern Recognition (CVPR), IEEE Conference on, 2018 [PDF]

Journal: MSRC: Multimodal Spatial Regression with Semantic Context for Phrase Grounding. Kan Chen, Rama Kovvuri, Jiyang Gao, Ram Nevatia, International Journal of Multimedia Information Retrieval, 2017 [PDF]

Spotlight: Query-guided Regression Network with Context Policy for Phrase Grounding. Kan Chen*, Rama Kovvuri*, Ram Nevatia, Computer Vision (ICCV), IEEE International Conference on, 2017 [PDF][Code]

Poster: TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals. Jiyang Gao*, Zhenheng Yang*, Kan Chen, Chen Sun, Ram Nevatia, Computer Vision (ICCV), IEEE International Conference on, 2017 [PDF][Code]

Oral: MSRC: Multimodal Spatial Regression with Semantic Context for Phrase Grounding. Kan Chen, Rama Kovvuri, Jiyang Gao, Ram Nevatia, Multimedia Retrieval (ICMR), ACM International Conference on, 2017 [PDF]

Poster: AMC: Attention guided Multi-modal Correlation Learning for Image Search. Kan Chen, Trung Bui, Fang Chen, Zhaowen Wang, Ram Nevatia, Computer Vision and Pattern Recognition (CVPR), IEEE Conference on, 2017 [PDF][Code]

Poster: Activity Recognition and Prediction with Pose based Discriminative Patch Model. Song Cao, Kan Chen, Ram Nevatia, Applications of Computer Vision (WACV), IEEE Winter Conference on, 2016 [PDF]

Poster: Abstraction Hierarchy and Self Annotation Update for Fine Grained Activity Recognition. Song Cao, Kan Chen, Ram Nevatia, Applications of Computer Vision (WACV), IEEE Winter Conference on, 2016 [PDF]

Poster: ABC-CNN: An attention based convolutional neural network for visual question answering. Kan Chen, Jiang Wang, Liang-Chieh Chen, Haoyuan Gao, Wei Xu, Ram Nevatia, Computer Vision and Pattern Recognition Workshop (CVPRW), IEEE Conference on, 2016 [PDF]


Undergraduate years (2009 - 2013)


Poster: Estimating the 3D Layout of Indoor Scenes and its Clutter from Depth Sensors. Jian Zhang, Kan Chen, Alexander G. Schwing, Raquel Urtasun, Computer Vision (ICCV) IEEE International Conference on, 2013 [PDF]

Journal: Image super resolution via analysis sparse prior. Qiang Ning, Kan Chen, Li Yi, Chuchu Fan, Jiangtao Wen. IEEE transaction of Signal Process, 2013 [PDF]