Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Nation-wide data for Canada available #7192

Open
pheersink opened this issue May 1, 2024 · 9 comments
Open

Nation-wide data for Canada available #7192

pheersink opened this issue May 1, 2024 · 9 comments
Assignees

Comments

@pheersink
Copy link

pheersink commented May 1, 2024

Statistics Canada has released a nation-wide address database. Data available here: https://www150.statcan.gc.ca/n1/pub/46-26-0002/462600022022001-eng.htm. Licence is open (https://www.statcan.gc.ca/en/reference/licence). Release December 2023.

The data is provided in 2 seperate files - one for addresses and one for locations. These two share a common LOC ID field. Files are provided by province and territory (i.e. 13 address files, 13 location files). I've joined these files into one single address database, with 16 million records. Not familair with json so much but I can provide you what I have - or you can get it directly form the source.

@iandees
Copy link
Member

iandees commented May 2, 2024

Thanks for the suggestion!

In situations like this, we usually build a script to do the work, document it thoroughly so we can run it in the future, and then put it in https://github.com/openaddresses/openaddresses/tree/master/scripts. We don't have automation to run these scripts at this point, so someone runs the script manually, we post the output data somewhere (we can store it in our S3 bucket) and add a source JSON to this repository to incorporate the data into our overall output.

If you're up for it, can you write a pull request to add the script+documentation? You can upload the output data you have to https://www.dropbox.com/request/rUcmasTnxZkiDR6rJc4l and I will move it over to S3, too.

@justinelliotmeyers
Copy link
Member

i could have done this wrong, but does this look right @pheersink
image

@pheersink
Copy link
Author

pheersink commented May 2, 2024

Pretty much what I get so yes it is ok. Coverage is inconsistent but there should be around 16.5 million addresses on the map.
image @justinelliotmeyers

@iandees
Copy link
Member

iandees commented May 2, 2024

Nice, looks like Canada!

@justinelliotmeyers
Copy link
Member

@iandees you mean it looks like Canada 'eh!

@pheersink in the past we have collaborated with statcan to fill in gaps. I'd love to explore that option for them as well as ESRI and openaddresses. Little tricky with different municipal legal hoops, but we do this stuff a lot, so not hard, but takes effort. Thoughts about trying to fill in gaps up there 'eh?

@jwass
Copy link
Contributor

jwass commented May 11, 2024

I work with @pheersink on some of the address work.

I built a script to download and process the National Address Register data and opened PR #7224 with a script for it. It joins the address and locations files together on the LOC_GUID field and adds the REPPOINT_LATITUDE and REPPOINT_LONGITUDE fields to the addresses, and zips that up. I uploaded the result file to the Dropbox link that @iandees posted above with file canada-nar-combined.csv.gz

@justinelliotmeyers
Copy link
Member

Thanks @jwass

@pheersink
Copy link
Author

pheersink commented May 11, 2024 via email

@iandees
Copy link
Member

iandees commented May 11, 2024

The data is available at https://data.openaddresses.io/cache/uploads/iandees/760039/canada-nar-combined.csv.gz

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants