Yang Shantian
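Yang Shantian (楊山田), male, is a lecturer at Southwestern University of Finance and Economics.
Research Areas
Machine learning
Intelligent finance
Courses Taught
Machine learning, reinforcement learning, and introduction to artificial intelligence.
Research Output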
[1] Shantian Yang*, Bo Yang, Zheng Zeng, Zhongfeng Kang. Causal inference multi-agent reinforcement learning for traffic signal control. Information Fusion (CAS JCR Zone 1 journal by major category, TOP journal, impact factor: 17.564), 94:243-256, 2023.
[2] Shantian Yang*. Hierarchical graph multi-agent reinforcement learning for traffic signal control. Information Sciences (CAS JCR Zone 1 journal by major category, TOP journal, CCF-B, impact factor: 8.1), 634:55-72, 2023.
[3] Shantian Yang*. Deep reinforcement learning for portfolio management. Knowledge-Based Systems (CAS JCR Zone 1 journal by major category, TOP journal, impact factor: 8.8), 278, Art. No. 110905, 2023.
[4] Shantian Yang and Bo Yang*. An Inductive Heterogeneous Graph Attention-based Multi-agent Deep Graph Infomax Algorithm for Adaptive Traffic Signal Control. Information Fusion (CAS JCR Zone 1 journal by major category, TOP journal, impact factor: 17.564), 88:249-262, 2022.
[5] Shantian Yang, Bo Yang*, Zhongfeng Kang, Lihui Deng. IHG-MA: Inductive heterogeneous graph multi-agent reinforcement learning for multi-intersection traffic signal control. Neural Networks (CAS JCR Zone 1 journal by major category, TOP journal, CCF-B, impact factor: 9.657), 139:265-277, 2021.
[6] Shantian Yang and Bo Yang*. A semi-decentralized feudal multi-agent learned-goal algorithm for multi-intersection traffic signal control. Knowledge-Based Systems (CAS JCR Zone 1 journal by major category, TOP journal, impact factor: 8.139), 213, Art. No. 06708, 2021.
[7] Shantian Yang, Bo Yang*, Hau-san Wong, Zhongfeng Kang. Cooperative traffic signal control using Multi-step return and Off-policy Asynchronous Advantage Actor-Critic Graph algorithm. Knowledge-Based Systems (CAS JCR Zone 1 journal by major category, TOP journal, impact factor: 8.139), 183, Art. No. 104855, 2019.
[8] Shantian Yang and Bo Yang*. A Meta Multi-agent Reinforcement Learning Algorithm for Multi-intersection Traffic Signal Control. IEEE International Conference on Dependable, Autonomic and Secure Computing, pp. 18-25, 2021.
[9] Lihui Deng, Bo Yang*, Zhongfeng Kang, Shantian Yang and Shihu Wu. A Noisy Label and Negative Sample Robust Loss Function for DNN-based Distant Supervised Relation Extraction. Neural Networks, 139, 358-370, 2021.
[10] Zhongfeng Kang, Bo Yang*, Mads Nielsen, Lihui Deng, Shantian Yang. A Buffered Online Transfer Learning Algorithm with Multi-layer Network. Neurocomputing, 488, 581-597, 2022.
[11] Zhongfeng Kang, Bo Yang*, Shantian Yang. Online transfer learning with multiple source domains for multi-class classification. Knowledge-Based Systems, 190, 105149, 2019.
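Research Projects
(1) "Research on an SAF-based recommendation model for online learning (Project No. 61977013)", National Natural Science Foundation of China, General Program, 500,000 yuan, 2020.1-2023.12, key researcher, concluded.
(2) "Research on a recommender system for online education platforms based on gradient boosting deep forest (Project No. 2019YJ0164)", Sichuan Provincial Department of Science and Technology, 100,000 yuan, 2019.1-2021.1, key researcher, concluded.
(3) "Research on key technologies and prototype system development for intelligent delivery of public service policies (Project No. 190241)", CECT Big Data Research Engineering Company, 750,000 yuan, 2019.5-2020.10, key researcher, concluded.
(4) "Research on portfolio investment algorithms based on explainable reinforcement learning (Project No. JBK23YJ26)", Fundamental Research Funds for the Central Universities, Southwestern University of Finance and Economics research start-up grant for newly recruited talent, 20,000 yuan, 2023.1-2023.12, principal investigator, completed.[1]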