殷緒成檢視原始碼討論檢視歷史

殷緒成北京科技大學順德創新學院

殷緒成，男，北京科技大學順德創新學院教授。

人物簡歷

1995.09 - 2002.06 北京科技大學計算機系學士、碩士

2002.07 – 2006.07 漢王科技股份有限公司研發中心研發工程師/技術經理

2003.09 - 2006.07 中國科學院自動化研究所博士

2006.08 - 2008.06 富士通研究開發中心信息技術部研究員

2008.07 - 今於北京科技大學計算機系從事教學和科研工作（副教授、教授）

2013.01 - 2014.01 Center for Intelligent Information Retrieval, School of Computer Science, University of Massachusetts Amherst, USA, Visiting Associate Professor

2014.07 – 2014.08 Computer Vision Lab, School of Computer Science, University of MassachusettsAmherst, USA, Visiting Professor

2016.07-2016.09 BioNLP Lab, Department of Quantitative Health Sciences, University of Massachusetts Medical School, USA, Visiting Professor

研究方向

模式識別、文字識別、計算機視覺、人工智能芯片、工業智能與工業軟件

學術成果

[J1] Xu-Cheng Yin (殷緒成)*, Xuwang Yin, Kaizhu Huang, and Hong-Wei Hao, 「Robust text detection in natural scene images」, IEEE Trans. Pattern Analysis and Machine Intelligence (T-PAMI), vol. 36, no. 5, pp. 970-983, 2014. (2022 Impact Factor: 24.314)

[J2] Xu-Cheng Yin (殷緒成)*, Wei-Yi Pei, Jun Zhang, and Hong-Wei Hao, 「Multi-orientation scene text detection with adaptive clustering」, IEEE Trans. Pattern Analysis and Machine Intelligence (T-PAMI), vol. 37, no. 9, pp. 1930-1937, 2015. (2022 Impact Factor: 24.314)

[J3] Shu Tian, Xu-Cheng Yin* (殷緒成), Ya Su, and Hong-Wei Hao, 「A unified framework for tracking based text detection and recognition from web videos,」 IEEE Trans. Pattern Analysis and Machine Intelligence (T-PAMI), vol. 40, no. 3, pp. 542-554, 2018. (2022 Impact Factor: 24.314)

[J4] Shi-Xue Zhang, Xiaobin Zhu, Lei Chen, Jie-Bo Hou, and Xu-Cheng Yin, 「Arbitrary shape text detection via segmentation with probability maps,」 IEEE Trans. Pattern Analysis and Machine Intelligence (T-PAMI), published online (DOI: 10.1109/TPAMI.2022.3176122), 2022. (2022 Impact Factor: 24.314)

[J5] Xu-Cheng Yin (殷緒成)*, Ze-Yu Zuo, Shu Tian, and Cheng-Lin Liu, 「Text detection, tracking and recognition in video: A comprehensive survey,」 IEEE Trans. Image Processing (T-IP), vol. 25, no. 6, pp. 2752-2773, 2016. (2022 Impact Factor: 11.041) (2021年北京地區廣受關注學術成果優秀論文/圖像圖形領域)

[J6] Chun Yang, Xu-Cheng Yin* (殷緒成), Wei-Yi Pei, Shu Tian, Ze-Yu Zuo, Chao Zhu and Junchi Yan, 「Tracking based multi-orientation scene text detection: A unified framework with dynamic programming,」 IEEE Trans. Image Processing (T-IP), vol. 26, no. 7, pp. 3235-3248, 2017. (2022 Impact Factor: 11.041)

[J7] Jie-Bo Hou, Xiaobin Zhu, Chang Liu, Kekai Sheng, Long-Huang Wu, Hongfa Wang, and Xu-Cheng Yin* (殷緒成), 「HAM: Hidden anchor mechanism for scene text detection,」 IEEE Trans. Image Processing (T-IP), vol. 29, pp. 7904-7916, 2020. (2022 Impact Factor: 11.041)

[J8] Song-Lu Chen, Chun Yang, Jia-Wei Ma, Feng Chen, and Xu-Cheng Yin* (殷緒成), 「Simultaneous end-to-end vehicle and license plate detection with multi-branch attention neural network,」 IEEE Trans. Intelligent Transportation Systems (T-ITS), vol. 21, no. 9, pp. 3686-3695. (2022 Impact Factor: 9.551) (2020年北京地區廣受關注學術成果優秀論文/物聯網領域)

[J9] Jie-Bo Hou, Xiaobin Zhu*, Chang Liu, Chun Yang, Long-Huang Wu, Hongfa Wang, and Xu-Cheng Yin* (殷緒成), 「Detecting text in scene and traffic guide panels with attention anchor mechanism,」 IEEE Trans. Intelligent Transportation Systems (T-ITS), vol. 22, no. 11, pp. 6890-6899, 2021. (2022 Impact Factor: 9.551)

[J10] Ye He, Chao Zhu*, and Xu-Cheng Yin* (殷緒成), 「Occluded pedestrian detection via distribution-based mutual-supervised feature learning,」 IEEE Trans. Intelligent Transportation Systems (T-ITS), vol. 23, no. 8, pp. 10514-10529, 2022. (2022 Impact Factor: 9.551)

[C1] Zanxia Jin, Mike Zheng Shou, Fang Zhou, Satoshi Tsutsui, Jingyan Jin, and Xu-Cheng Yin (殷緒成), 「From token to word: OCR token evolution via contrastive learning and semantic matching for Text-VQA,」 Proceedings of the 30th ACM International Conference on Multimedia (ACM Multimedia), 2022. (CCF A)

[C2] Hongyu Gao, Chao Zhu, Mengyin Liu, Weibo Gu, Hongfa Wang, Wei Liu, and Xu-Cheng Yin (殷緒成), 「CAliC: Accurate and efficient image-text retrieval via contrastive alignment and visual contexts modeling,」 Proceedings of the 30th ACM International Conference on Multimedia (ACM Multimedia), 2022. (CCF A)

[C3] Kangneng Zhou, Xiaobin Zhu, Daiheng Gao, Kai Lee, Xinjie Li, and Xu-Cheng Yin (殷緒成), 「SD-GAN: Semantic decomposition for face image synthesis with discrete attribute,」 Proceedings of the 30th ACM International Conference on Multimedia (ACM Multimedia), 2022. (CCF A)

[C4] Chang Liu, Chun Yang, and Xu-Cheng Yin* (殷緒成), 「Open-set text recognition via character-context decoupling,」 Proceedings of 2020 IEEE/CVF International Conference on Computer Vision and Pattern Recognition (CVPR), 2022. (CCF A)

[C5] Zhiyu Fang, Xiaobin Zhu*, Chun Yang, Zheng Han, Jingyan Qin, and Xu-Cheng Yin (殷緒成), 「Learning aligned cross-model representation for generalized zero-shot classification,」 Proceedings of 36th AAAI Conference on Artificial Intelligent (AAAI), 2022. (CCF A)

[C6] Shi-Xue Zhang, Xiaobin Zhu*, Chun Yang, Hongfa Wang, and Xu-Cheng Yin* (殷緒成), 「Adaptive boundary proposal network for arbitrary shape text detection,」 Proceedings of 2020 IEEE/CVF International Conference on Computer Vision (ICCV), 2021. (CCF A)

[C7] Mengyin Liu, Chao Zhu*, Jun Wang, and Xu-Cheng Yin* (殷緒成), 「Adaptive pattern-parameter matching for robust pedestrian detection,」 Proceedings of 35th AAAI Conference on Artificial Intelligent (AAAI), 2021. (CCF A)

[C8] Shi-Xue Zhang, Xiaobin Zhu, Jie-Bo Hou, Chang Liu, Chun Yang, Hongfa Wang, and Xu-Cheng Yin* (殷緒成), 「Deep relational reasoning graph network for arbitrary shape text detection,」 Proceedings of 2020 IEEE/CVF International Conference on Computer Vision and Pattern Recognition (CVPR), 2020. (CCF A)

[C9] Bowen Yang, Chun Yang, Qi Liu, and Xu-Cheng Yin* (殷緒成), 「Joint rotation-invariance face detection and alignment with angle-sensitivity cascaded networks,」 Proceedings of the 27th ACM International Conference on Multimedia (ACM Multimedia), 2019. (CCF A)

[C10] Bo-Wen Zhang, Xu-Cheng Yin* (殷緒成), Fang Zhou, and Jianlin Jin, 「Building your own reading list anytime via embedding relevance, quality, timeliness and diversity,」 Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval (ACM SIGIR), 2017. (CCF A)

科研業績

橫向項目

(1)「網絡圖片文字識別與廣告視頻內容理解研究」（2016~2021, 騰訊科技合作項目，負責人）

(2)「面向AI芯片的人工智能技術」（2018-2021，億智電子合作項目，負責人）

(3)「教育行業複雜英文文檔分析與識別技術」（2014~2015，科大訊飛合作項目，負責人）

縱向項目

(1)「鋼鐵智能製造過程中數據認知與生產決策技術及應用」（2023-2026，科技創新2030——新一代人工智能重大項目，負責人）

(2) 「大規模網絡圖像的文本識別方法與關鍵技術研究」（2022-2026，國家傑出青年科學基金項目，負責人）

(3) 「多語言場景文本檢測與識別關鍵技術研究」（2021-2024，國家自然科學基金面上項目，負責人）

獲獎情況

2019年度北京市科技進步一等獎（第一完成人），「網絡圖像視頻大數據的智能識別關鍵技術及應用」；

2018年度教育部科技進步二等獎（第一完成人），「大規模網絡圖像的文本識別技術及應用」;

連續四屆（2013/2015/2017/2019年）榮獲國際文檔分析與識別大會技術競賽「場景文本檢測」、「場景文本識別」、「網絡圖片文本檢測」、「網絡圖片文本識別」等15項冠軍；

連續四年（2015/2016/2017/2018年）榮獲國際生物信息文本語義檢索與問答技術挑戰平台BioASQ Challenge多項第一名；

2005年度北京市科技進步一等獎（主要成員），「漢王OCR技術及應用」;

2006年度富士通研究開發中心優秀髮明獎;

2006年富士通研究所社長獎，2007年富士通研究所社長獎。^[1]

參考資料

↑ 北京科技大學順德創新學院

[1] 北京科技大學順德創新學院

[1]