EPC Duplicate Certificates #5

Open
anetobradley opened this issue Feb 11, 2022 · 2 comments · May be fixed by #7

@anetobradley
Owner

When using the EPC datasets we need to be careful with duplicate EPCs for the same property. This is not an enormous issue, as an EPC is valid for up to 10 years unless the property is renovated or retrofitted, but there may be multiple records, especially for rental properties that have been improved to meet recent regulations.

We should be able to handle this by identifying records that share the same UPRN (Unique Property Reference Number). I would suggest keeping the most recent record for each UPRN and discarding the others. I will add this feature to the R code for the energy intensity sampler.
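The "keep the most recent record per UPRN" rule could look roughly like the sketch below. It is written in Python with the standard library only and is not the R code from this repository. The column names `UPRN` and `LODGEMENT_DATE`, the `ID` field, and the sample records are assumptions for illustration, as is the choice to pass through records that have no UPRN.

```python
from datetime import date


def keep_latest_per_uprn(records, uprn_key="UPRN", date_key="LODGEMENT_DATE"):
    """Keep only the most recently lodged certificate for each UPRN.

    Records with an empty UPRN cannot be matched to a property, so they are
    passed through unchanged (an assumption; they could also be dropped).
    """
    latest = {}     # UPRN -> most recent record seen so far
    unmatched = []  # records without a UPRN
    for rec in records:
        uprn = rec.get(uprn_key)
        if not uprn:
            unmatched.append(rec)
            continue
        # Assumes ISO dates (YYYY-MM-DD), which date.fromisoformat can parse.
        lodged = date.fromisoformat(rec[date_key])
        best = latest.get(uprn)
        if best is None or lodged > date.fromisoformat(best[date_key]):
            latest[uprn] = rec
    return list(latest.values()) + unmatched


# Made-up records: UPRN 100 has two certificates, one record has no UPRN.
sample = [
    {"ID": "a", "UPRN": "100", "LODGEMENT_DATE": "2012-03-01"},
    {"ID": "b", "UPRN": "100", "LODGEMENT_DATE": "2020-06-15"},
    {"ID": "c", "UPRN": "200", "LODGEMENT_DATE": "2015-01-10"},
    {"ID": "d", "UPRN": "", "LODGEMENT_DATE": "2018-01-01"},
]
deduplicated = keep_latest_per_uprn(sample)
print([rec["ID"] for rec in deduplicated])
```

If two certificates for the same UPRN share a lodgement date, this sketch keeps whichever appears first in the input; a real implementation might want an explicit tie-break.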

I'm not sure this will have a big impact when taking a recent sample of 5000 certificates from the API, but when using the full CSV it could be a problem (my colleague has pointed out that some properties in that dataset have four or five duplicates!).
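One way to gauge how widespread the duplication is in a CSV export is to count certificates per UPRN and then tally how many properties have one, two, three or more records. The sketch below is a Python illustration using only the standard library; the `UPRN` column name and the inline sample data are assumptions, and in practice the reader would be opened on the real file.

```python
import csv
import io
from collections import Counter


def duplicate_summary(rows, uprn_key="UPRN"):
    """Return a tally of {certificates per property: number of properties}."""
    # Count certificates per UPRN, ignoring rows where the UPRN is empty.
    per_uprn = Counter(row[uprn_key] for row in rows if row.get(uprn_key))
    # Count how many UPRNs have 1, 2, 3, ... certificates.
    return Counter(per_uprn.values())


# Made-up CSV content standing in for the full EPC download.
sample_csv = io.StringIO(
    "UPRN,LODGEMENT_DATE\n"
    "100,2012-03-01\n"
    "100,2020-06-15\n"
    "100,2021-01-01\n"
    "200,2015-01-10\n"
    ",2018-01-01\n"
)
summary = duplicate_summary(csv.DictReader(sample_csv))
print(dict(summary))
```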

@anetobradley
Owner Author

I have updated the EPC search function in the R code to address this, and have also changed the default number of instances returned.

@anetobradley
Owner Author

This update has a negligible effect on mean energy intensity estimates for most house typologies in most Local Authorities, but there are some exceptions. These exceptions often seem to be older houses that have several EPCs as a result of being renovated, refurbished and improved. You can see this in the scatter plot comparing mean energy intensities with the update against the original estimates for each house type in a sample of Midlands LADs.

[image: scatter plot of updated vs. original mean energy intensity estimates for each house type]
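The comparison described above, mean energy intensity per house type before and after removing duplicates, can be sketched as follows. This is a Python illustration, not the code behind the plot; the field names `house_type` and `energy_intensity` and all of the numbers are hypothetical.

```python
from collections import defaultdict
from statistics import mean


def mean_by_group(records, group_key="house_type", value_key="energy_intensity"):
    """Mean of value_key for each distinct value of group_key."""
    groups = defaultdict(list)
    for rec in records:
        groups[rec[group_key]].append(float(rec[value_key]))
    return {group: mean(values) for group, values in groups.items()}


def compare_means(original, deduplicated):
    """Return {group: (mean before, mean after, difference)} for shared groups."""
    before = mean_by_group(original)
    after = mean_by_group(deduplicated)
    return {g: (before[g], after[g], after[g] - before[g]) for g in before if g in after}


# Made-up data: the older terrace has an outdated certificate in the original set.
original = [
    {"house_type": "pre-1919 terrace", "energy_intensity": 300},
    {"house_type": "pre-1919 terrace", "energy_intensity": 200},
    {"house_type": "new-build flat", "energy_intensity": 100},
]
deduplicated = [
    {"house_type": "pre-1919 terrace", "energy_intensity": 200},
    {"house_type": "new-build flat", "energy_intensity": 100},
]
comparison = compare_means(original, deduplicated)
for house_type, (before, after, diff) in comparison.items():
    print(house_type, before, after, diff)
```

In this toy case only the older house type shifts, which mirrors the pattern reported above where most typologies are unaffected.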

@nickmalleson, I am rerunning the national sample estimation on DAFNI and will update the geojson file once it has finished. This should not change the trend, especially among newer-build homes, but the estimates should be a bit more accurate and less biased.

@patricia-ternes linked a pull request Apr 11, 2022 that will close this issue
2 participants