版权说明 操作指南
首页 > 成果 > 详情

Incremental Multistep Q-learning for Adaptive Traffic Signal Control Based on Delay Minimization Strategy

认领
导出
Link by DOI
反馈
分享
QQ微信 微博
成果类型:
期刊论文、会议论文
作者:
Lu, Shoufeng*;Liu, Ximin;Dai, Shiqiang
通讯作者:
Lu, Shoufeng
作者机构:
[Lu, Shoufeng; Liu, Ximin] Changsha Univ Sci & Technol, Traff & Transportat Coll, Changsha 410076, Hunan, Peoples R China.
[Dai, Shiqiang; Liu, Ximin] Shanghai Univ, Inst Appl Math & Mech, Shanghai 200072, Peoples R China.
通讯机构:
[Lu, Shoufeng] C
Changsha Univ Sci & Technol, Traff & Transportat Coll, Changsha 410076, Hunan, Peoples R China.
语种:
英文
关键词:
Adaptive Traffic Signal Control;Incremental Multistep Q Learning;Delay Minimization Strategy
期刊:
Proceedings of the World Congress on Intelligent Control and Automation (WCICA)
年:
2008
页码:
2854-2858
会议名称:
7th World Congress on Intelligent Control and Automation
会议时间:
JUN 25-27, 2008
会议地点:
Chongqing, PEOPLES R CHINA
会议主办单位:
[Lu, Shoufeng;Liu, Ximin] Changsha Univ Sci & Technol, Traff & Transportat Coll, Changsha 410076, Hunan, Peoples R China.^[Liu, Ximin;Dai, Shiqiang] Shanghai Univ, Inst Appl Math & Mech, Shanghai 200072, Peoples R China.
会议赞助商:
Chongqing Univ, Chongqing Inst Technol, Chongqing Univ Sci & Technol, Xihua Univ, SW Univ Sci & Technol, IEEE Robot & Automat Soc, IEEE Control Syst Soc, Beijing Chapter, Chinese Assoc Automat, Chinese Assoc Artificial Intelligence, Natl Nat Sci Fdn, Chongqing Municipal Sci & Technol Comm, Chongqing Municipal Assoc Sci & Technol, KC Wong Educ Fdn
出版地:
345 E 47TH ST, NEW YORK, NY 10017 USA
出版者:
IEEE
ISBN:
978-1-4244-2113-8
基金类别:
NSFC [70701006, 10532060]; National Basic Research Program of China [2006CB705500]; Talent Recruitment Foundation of Changsha University of Science and Technology [1004140]
机构署名:
本校为第一且通讯机构
院系归属:
交通运输工程学院
摘要:
Incremental multistep Q learning (Q( λ)) combines Q learning and TD(λ). Theoretically, Q(λ) has better performance than Q learning. The goal of the paper is to test the performance of Q(λ) for adaptive traffic signal control. For Q(λ), the state is total delay of the intersection, and the action is phase green time change. The relationship between phase green time change and action space is discussed. The performance between Q(λ) learning and fixed cycle signal setting for isolated intersection is compared. The computation results show that Q(λ)...

反馈

验证码:
看不清楚,换一个
确定
取消

成果认领

标题:
用户 作者 通讯作者
请选择
请选择
确定
取消

提示

该栏目需要登录且有访问权限才可以访问

如果您有访问权限,请直接 登录访问

如果您没有访问权限,请联系管理员申请开通

管理员联系邮箱:yun@hnwdkj.com