<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="3.0" xml:lang="en">
<front>
<journal-meta>
<journal-id journal-id-type="publisher">AGILE-GISS</journal-id>
<journal-title-group>
<journal-title>AGILE: GIScience Series</journal-title>
<abbrev-journal-title abbrev-type="publisher">AGILE-GISS</abbrev-journal-title>
<abbrev-journal-title abbrev-type="nlm-ta">AGILE GIScience Ser.</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">2700-8150</issn>
<publisher><publisher-name>Copernicus Publications</publisher-name>
<publisher-loc>Göttingen, Germany</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.5194/agile-giss-7-44-2026</article-id>
<title-group>
<article-title>Uncertainty in Binned Building Construction Year Data: Comparing EPC and Crowdsourced Datasets</article-title>
</title-group>
<contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Teichmann</surname>
<given-names>Sophie</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Hudson</surname>
<given-names>Polly</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Kim</surname>
<given-names>Mihyun</given-names>
</name>
<xref ref-type="aff" rid="aff4">
<sup>4</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Herold</surname>
<given-names>Hendrik</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Hecht</surname>
<given-names>Robert</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
</contrib-group><aff id="aff1">
<label>1</label>
<addr-line>Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI) Dresden/Leipzig, TU Dresden, Germany</addr-line>
</aff>
<aff id="aff2">
<label>2</label>
<addr-line>Research Group Advanced Environmental Risk and Sustainability Modelling of Cities and Regions Using AI (SITES.AI), Leibniz Institute of Ecological Urban and Regional Development, Germany</addr-line>
</aff>
<aff id="aff3">
<label>3</label>
<addr-line>University of Cambridge (Visiting academic), UK</addr-line>
</aff>
<aff id="aff4">
<label>4</label>
<addr-line>Loughborough University, UK</addr-line>
</aff>
<pub-date pub-type="epub">
<day>10</day>
<month>06</month>
<year>2026</year>
</pub-date>
<volume>7</volume>
<elocation-id>44</elocation-id>
<permissions>
<copyright-statement>Copyright: &#x000a9; 2026 Sophie Teichmann et al.</copyright-statement>
<copyright-year>2026</copyright-year>
<license license-type="open-access">
<license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri"  xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p>
</license>
</permissions>
<self-uri xlink:href="https://agile-giss.copernicus.org/articles/7/44/2026/agile-giss-7-44-2026.html">This article is available from https://agile-giss.copernicus.org/articles/7/44/2026/agile-giss-7-44-2026.html</self-uri>
<self-uri xlink:href="https://agile-giss.copernicus.org/articles/7/44/2026/agile-giss-7-44-2026.pdf">The full text article is available as a PDF file from https://agile-giss.copernicus.org/articles/7/44/2026/agile-giss-7-44-2026.pdf</self-uri>
<abstract>
<p>The building&amp;rsquo;s construction year indicates energy performance and retrofit potential, and is used in energy performance ratings. However, in many countries, comprehensive datasets providing the construction date of buildings are unavailable or difficult to access. In the UK, Energy Performance Certificates (EPC) provide the only open dataset on building construction years, at national scale. However, the data are binned, and minimal information is available on data quality. For specific areas of England, construction year data have also been contributed by historians to the Colouring Britain data platform. This study investigates systematic differences between EPC and historian-crowdsourced construction year data for 4,849 buildings in Loughborough, covering 14 EPC bands. To increase comparability of datasets with differing age bands, we test a random forest method to resolve the binning using two feature sets: urban form features, and urban form features plus EPC bands. The median of each EPC band acts as the baseline. The results show that combining urban form features and EPC bands delivers the highest accuracy.</p>
</abstract>
<counts><page-count count="8"/></counts>
</article-meta>
</front>
<body/>
<back>
</back>
</article>