<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-06-09T02:16:35Z</responseDate><request verb="GetRecord" identifier="oai:riubu.ubu.es:10259/7002" metadataPrefix="etdms">https://riubu.ubu.es/oai/request</request><GetRecord><record><header><identifier>oai:riubu.ubu.es:10259/7002</identifier><datestamp>2024-05-20T08:02:46Z</datestamp><setSpec>com_10259.4_104</setSpec><setSpec>com_10259_2604</setSpec><setSpec>col_10259_6848</setSpec></header><metadata><thesis xmlns="http://www.ndltd.org/standards/metadata/etdms/1.0/" xmlns:doc="http://www.lyncode.com/xoai" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.ndltd.org/standards/metadata/etdms/1.0/ http://www.ndltd.org/standards/metadata/etdms/1.0/etdms.xsd">
<title>Reinforcement learning for Traffic Signal Control: Comparison with commercial systems</title>
<creator>Cabrejas Egea, Álvaro</creator>
<creator>Zhang, Raymond</creator>
<creator>Walton, Neil</creator>
<subject>Tráfico</subject>
<subject>Infraestructuras</subject>
<subject>Traffic</subject>
<subject>Infrastructures</subject>
<description>Trabajo presentado en: R-Evolucionando el transporte, XIV Congreso de Ingeniería del Transporte (CIT 2021), realizado en modalidad online los días 6, 7 y 8 de julio de 2021, organizado por la Universidad de Burgos</description>
<description>In recent years, Intelligent Transportation Systems are leveraging the power of increased&#xd;
sensory coverage and available computing power to deliver data-intensive solutions&#xd;
achieving higher levels of performance than traditional systems. Within Traffic Signal&#xd;
Control (TSC), this has allowed the emergence of Machine Learning (ML) based systems.&#xd;
Among this group, Reinforcement Learning (RL) approaches have performed particularly&#xd;
well. Given the lack of industry standards in ML for TSC, literature exploring RL often lacks&#xd;
comparison against commercially available systems and straightforward formulations of&#xd;
how the agents operate. Here we attempt to bridge that gap. We propose three different&#xd;
architectures for RL based agents and compare them against currently used commercial&#xd;
systems MOVA, SurTrac and Cyclic controllers and provide pseudo-code for them. The&#xd;
agents use variations of Deep Q-Learning (Double Q Learning, Duelling Architectures and&#xd;
Prioritised Experience Replay) and Actor Critic agents, using states and rewards based on&#xd;
queue length measurements. Their performance is compared in across different map&#xd;
scenarios with variable demand, assessing them in terms of the global delay generated by all&#xd;
vehicles. We find that the RL-based systems can significantly and consistently achieve lower&#xd;
delays when compared with traditional and existing commercial systems.</description>
<date>2022-09-22</date>
<date>2022-09-22</date>
<date>2021-07</date>
<type>info:eu-repo/semantics/conferenceObject</type>
<identifier>978-84-18465-12-3</identifier>
<identifier>http://hdl.handle.net/10259/7002</identifier>
<identifier>10.36443/10259/7002</identifier>
<language>eng</language>
<relation>R-Evolucionando el transporte</relation>
<relation>http://hdl.handle.net/10259/6490</relation>
<relation>https://doi.org/10.36443/9788418465123</relation>
<relation>info:eu-repo/grantAgreement/EPSRC//EP%2FL015374</relation>
<rights>info:eu-repo/semantics/openAccess</rights>
<publisher>Universidad de Burgos. Servicio de Publicaciones e Imagen Institucional</publisher>
</thesis></metadata></record></GetRecord></OAI-PMH>