
一、 个人简介
石恒璨,博士,副教授,博士生导师,王耀南院士团队成员,机器人视觉感知与控制技术国家工程研究中心骨干成员。现为美国电气与电子工程师协会会员,美国计算机学会会员,和中国图象图形学学会会员。2021年获中国图象图形学学会优秀博士学位论文奖(全国仅10人)。2024年人才引进加入湖南大学电气与信息工程学院。
主要研究方向为人工智能、计算机与机器人视觉感知、视觉-语言多模态学习、弱监督学习与无监督学习等。在多媒体与计算机视觉顶级期刊会议IJCV、CVPR、ECCV、ACMMM、SIGIR、IEEE TMM等发表论文近40余篇。长期担任计算机视觉顶级会议CVPR、ICCV、ECCV、机器学习顶级会议ICML、NeurIPS、ICLR、人工智能顶级会议AAAI、IJCAI等网络主席、领域主席、程序委员会委员,顶级期刊IEEE TPAMI、IJCV、TMM、TCSVT、TIE、TNNLS等审稿人。
联系方式:shihengcan@hnu.edu.cn
二、 招生信息
欢迎对人工智能、计算机与机器人视觉感知、视觉-语言多模态学习感兴趣的学生加入我们团队。
优秀学生可推荐至澳大利亚悉尼大学、莫纳什大学、阿德莱德大学、香港中文大学、香港城市大学等QS前100大学深造或访问交流,腾讯、阿里、百度、字节、商汤等知名企业位于中国、美国、澳大利亚、新加坡等地研发部门工作或实习。
三、 教育与工作经历
2024-至今,湖南大学,副教授
2020-2024,澳大利亚莫纳什大学,研究员
2014-2019,电子科技大学,博士
2010-2014, 电子科技大学, 学士
四、 部分代表性科研成果
[1]. Hengcan Shi, Munawar Hayat, and Jianfei Cai. " LLMFormer: Large language model for open-vocabulary semantic segmentation." International Journal of Computer Vision (IJCV), 2024. (机器视觉顶级期刊,CCF-A)
[2]. Duy Tho Le, Chenhui Gou, Stavya Datta, Hengcan Shi, Ian Reid, Jianfei Cai, Hamid Rezatofighi, “JRDB-PanoTrack: An open-world panoptic segmentation and tracking robotic dataset in crowded human environments”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024. (机器视觉顶级会议,CCF-A)
[3]. Hengcan Shi, Munawar Hayat, and Jianfei Cai. "Unified open-vocabulary dense visual prediction." IEEE Transactions on Multimedia (TMM), 2024. (多媒体顶级期刊,一区TOP)
[4]. Duy-Tho Le, Hengcan Shi, Jianfei Cai, Hamid Rezatofighi, “Diffusion model for robust multi-sensor fusion in 3d object detection and bev segmentation”, European Conference on Computer Vision (ECCV), 2024. (机器视觉顶级会议)
[5]. Son Duy Dao, Hengcan Shi, Dinh Q Phung, Jianfei Cai" CA-OVS: Cluster and Adapt Mask Proposals for Open-Vocabulary Semantic Segmentation" the 6th ACM International Conference on Multimedia in Asia, 2024.
[6]. Hengcan Shi, Munawar Hayat, Jianfei Cai, “Transformer Scale Gate for Semantic Segmentation”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023. (机器视觉顶级会议,CCF-A)
[7]. Hengcan Shi, Munawar Hayat, and Jianfei Cai. "Open-vocabulary object detection via scene graph discovery." Proceedings of the 31st ACM International Conference on Multimedia (ACMMM), 2023. (多媒体顶级会议,CCF-A)
[8]. Son Duy Dao, Hengcan Shi, Dinh Phung, Jianfei Cai. "Class Enhancement Losses with Pseudo Labels for Open-Vocabulary Semantic Segmentation." IEEE Transactions on Multimedia (TMM), 2023. (多媒体顶级期刊,一区TOP)
[9]. Hengcan Shi, Munawar Hayat, Jianfei Cai, “Unpaired referring expression grounding via bidirectional cross-modal matching”, Neurocomputing, 2023 (二区TOP)
[10]. Hengcan Shi, Munawar Hayat, Yicheng Wu, Jianfei Cai, “ProposalCLIP: Unsupervised Open-Category Object Proposal Generation via Exploiting CLIP Cues”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022. (机器视觉顶级会议,CCF-A)
[11]. Duy Tho Le, Hengcan Shi, Hamid Rezatofighi, Jianfei Cai, “Accurate and real-time 3D pedestrian detection using an efficient attentive pillar network”, IEEE Robotics and Automation Letters (RA-L) 2022. (机器视觉顶级会议,CCF-A)
[12]. Tingtian Li, Zixun Sun, Haoruo Zhang, Jin Li, Ziming Wu, Hui Zhan, Yipeng Yu, Hengcan Shi, “Deep Music Retrieval for Fine-Grained Videos by Exploiting Cross-Modal-Encoded Voice-Overs”, Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR) 2021. (信息检索顶级会议,CCF-A)
[13]. Hengcan Shi, Hongliang Li, Qingbo Wu, and King Ngi Ngan, “Query Reconstruction Network for Referring Expression Image Segmentation”, IEEE Transactions on Multimedia (TMM), 2020. (多媒体顶级期刊,一区TOP)
[14]. Heqian Qiu, Hongliang Li, Qingbo Wu, and Hengcan Shi, “Offset Bin Classification Network for Accurate Object Detection”, IEEE Conference on ComputerVision and Pattern Recognition (CVPR), 2020. (机器视觉顶级会议,CCF-A)
[15]. Heqian Qiu, Hongliang Li, Qingbo Wu, FanmanMeng, Hengcan Shi, Taijin Zhao, and King Ngi Ngan, “Language-Aware Fine-Grained Object Representation for Referring Expression Comprehension”, ACMinternational conference on Multimedia (ACMMM), 2020. (多媒体顶级会议,CCF-A)
[16]. Heqian Qiu, Hongliang Li, Qingbo Wu, Fanman Meng, Linfeng Xu, King Ngi Ngan, and Hengcan Shi, “Hierarchical Context Features Embedding for Object Detection”, IEEE Transactions on Multimedia (TMM), 2020. (多媒体顶级期刊,CCF-A)
[17]. Hengcan Shi, Hongliang Li, Qingbo Wu, and Zichen Song, “Scene Parsing via Integrated Classification Model and Variance-Based Regularization”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019. (机器视觉顶级会议,CCF-A)
[18]. Hengcan Shi, Hongliang Li, Qingbo Wu, FanmanMeng, and King Ngi Ngan, “Boosting scene parsing performance via reliable scale prediction”, ACMinternational conference on Multimedia (ACM MM), 2018. (多媒体顶级会议,CCF-A, Oral)
[19]. Hengcan Shi, Hongliang Li, Fanman Meng, and Qingbo Wu, “Key-Word-Aware Network for Referring Expression Image Segmentation”, European Conference on Computer Vision (ECCV), 2018. (机器视觉顶级会议)
[20]. Hengcan Shi, Hongliang Li, Fanman Meng, Qingbo Wu, Linfeng Xu, and King N. Ngan,“Hierarchical Parsing Net: Semantic Scene Parsing from Global Scene to Objects”, IEEE Transactions on Multimedia (TMM), 2018 (多媒体顶级期刊,一区TOP)