<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-06-09T03:13:50Z</responseDate><request verb="GetRecord" identifier="oai:riubu.ubu.es:10259/7002" metadataPrefix="mods">https://riubu.ubu.es/oai/request</request><GetRecord><record><header><identifier>oai:riubu.ubu.es:10259/7002</identifier><datestamp>2024-05-20T08:02:46Z</datestamp><setSpec>com_10259.4_104</setSpec><setSpec>com_10259_2604</setSpec><setSpec>col_10259_6848</setSpec></header><metadata><mods:mods xmlns:mods="http://www.loc.gov/mods/v3" xmlns:doc="http://www.lyncode.com/xoai" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-1.xsd">
<mods:name>
<mods:namePart>Cabrejas Egea, Álvaro</mods:namePart>
</mods:name>
<mods:name>
<mods:namePart>Zhang, Raymond</mods:namePart>
</mods:name>
<mods:name>
<mods:namePart>Walton, Neil</mods:namePart>
</mods:name>
<mods:extension>
<mods:dateAvailable encoding="iso8601">2022-09-22T06:41:43Z</mods:dateAvailable>
</mods:extension>
<mods:extension>
<mods:dateAccessioned encoding="iso8601">2022-09-22T06:41:43Z</mods:dateAccessioned>
</mods:extension>
<mods:originInfo>
<mods:dateIssued encoding="iso8601">2021-07</mods:dateIssued>
</mods:originInfo>
<mods:identifier type="isbn">978-84-18465-12-3</mods:identifier>
<mods:identifier type="uri">http://hdl.handle.net/10259/7002</mods:identifier>
<mods:identifier type="doi">10.36443/10259/7002</mods:identifier>
<mods:abstract>In recent years, Intelligent Transportation Systems are leveraging the power of increased&#xd;
sensory coverage and available computing power to deliver data-intensive solutions&#xd;
achieving higher levels of performance than traditional systems. Within Traffic Signal&#xd;
Control (TSC), this has allowed the emergence of Machine Learning (ML) based systems.&#xd;
Among this group, Reinforcement Learning (RL) approaches have performed particularly&#xd;
well. Given the lack of industry standards in ML for TSC, literature exploring RL often lacks&#xd;
comparison against commercially available systems and straightforward formulations of&#xd;
how the agents operate. Here we attempt to bridge that gap. We propose three different&#xd;
architectures for RL based agents and compare them against currently used commercial&#xd;
systems MOVA, SurTrac and Cyclic controllers and provide pseudo-code for them. The&#xd;
agents use variations of Deep Q-Learning (Double Q Learning, Duelling Architectures and&#xd;
Prioritised Experience Replay) and Actor Critic agents, using states and rewards based on&#xd;
queue length measurements. Their performance is compared in across different map&#xd;
scenarios with variable demand, assessing them in terms of the global delay generated by all&#xd;
vehicles. We find that the RL-based systems can significantly and consistently achieve lower&#xd;
delays when compared with traditional and existing commercial systems.</mods:abstract>
<mods:language>
<mods:languageTerm>eng</mods:languageTerm>
</mods:language>
<mods:accessCondition type="useAndReproduction">info:eu-repo/semantics/openAccess</mods:accessCondition>
<mods:subject>
<mods:topic>Tráfico</mods:topic>
</mods:subject>
<mods:subject>
<mods:topic>Infraestructuras</mods:topic>
</mods:subject>
<mods:subject>
<mods:topic>Traffic</mods:topic>
</mods:subject>
<mods:subject>
<mods:topic>Infrastructures</mods:topic>
</mods:subject>
<mods:titleInfo>
<mods:title>Reinforcement learning for Traffic Signal Control: Comparison with commercial systems</mods:title>
</mods:titleInfo>
<mods:genre>info:eu-repo/semantics/conferenceObject</mods:genre>
</mods:mods></metadata></record></GetRecord></OAI-PMH>