-
Notifications
You must be signed in to change notification settings - Fork 93
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Using district names in Canadian Federal Electoral District OCD-IDs #323
Comments
To understand the proposal:
All ridings for all time are listed here: https://lop.parl.ca/sites/ParlInfo/default/en_CA/ElectionsRidings/Ridings There are 32 without any dates. If you enable to the "Additional Information" column, you'll see why 27 don't have a start/end date - they were abolished before coming into force. The other 5, however, just lack dates (data quality issue). Some other examples of renamings between redistrictings from one set of search terms (check for "royal assent on"): https://www.parl.ca/LegisInfo/en/bills?keywords=%22An%20Act%20to%20change%20the%20name%20of%22%20electoral%20district&parlsession=all&sortby=session-desc |
When X is renamed to Y, a new id will be created for Y and an alias will be created stating X sameAs Y making Y canonical.
Collisions will be resolved by removing the validThrough date and adding a validFrom date. The same id will be used for both periods that the district existed since the identity of the district is maintained by its name. https://en.wikipedia.org/wiki/Ahuntsic_(electoral_district)
That’s a good point, we should include province in the id to disambiguate (ocd-division/country:ca/province:on/ed:victoria)
I may be misunderstanding you here, but a new district which shares the same name as a previous redistricting should be resolved consistent to collisions across time. https://en.wikipedia.org/wiki/Laval_(electoral_district) I'm not certain what the effect of some of the renaming bills have or if they just haven't passed yet but maybe we can use https://www.elections.ca/content.aspx?section=res&dir=cir/list&document=index338&lang=e as the source of truth for name changes that should be reflected in the repository? |
Okay, now instead of 213 IDs that repeat across time, we'll have 202:
So the old district essentially disappears from the listing? If I have a historical application, how do I refer to the old district?
According to the Library of Parliament, the old district is abolished and the new district is created. In other words, according to the best authority on this subject, they are not identical. In cases like Algoma, there's a 64-year span between when each existed. The idea of districts being "re-established" seems to be an invention of the Wikipedia user Earl Andrew who authored most of the pages.
Have you found a real example, or are you hypothesizing?
Isn't constantly shifting canonical IDs kind of unusual?
https://www.elections.ca/content.aspx?section=res&dir=cir/list&document=index338&lang=e is very incomplete. |
We can also resolve collisions across time by appending the year that the new district was created in the case that an old district with the abolished, the way collisions with Canadian OCD-IDs are currently resolved. However if we do adopt the idea that names are how we identify districts, treating districts with the same name, even if there is a 64 year span between them, as having the same identifier seems most natural. There is no perfect methodology for how we create these ids but the current way of using federal electoral district codes treats every district as being abolished and recreated after every redistricting. This doesn't work well for representing districts that aren't new. @jdmgoogle maybe you can speak more about this?
It may be incomplete, but isn't this the set of district ids/names used in elections? Is the issue that name changes that aren't captured here can surface as new districts after redistricting? |
I'm curious why the new solution has to go back to 1867 but the the current solution only has to go back to ~2010. |
Renaming districts outside of a redistricting cycle is rather unusual, so this seems like something that should be taken up with the Canadian government first and foremost. :) |
(Sorry for the multiple comments here, but)
The district you reference is also described in the French-language Wikipedia as "reappearing" in the 1960s. https://fr.wikipedia.org/wiki/Algoma_(ancienne_circonscription_f%C3%A9d%C3%A9rale) The edit history to that page does not include any edits by "Earl Andrew", so this is clearly not a one-off invention of a single person. If the goal of OCD-IDs is simply to provide a context-free catalog of THE CURRENT official names and numbers associated with locations and districts around the world, with no attempt made to provide a coherent view of the "identity" of a given district over time, then that should be made explicit. It should also be made clear, then, that these identifiers are likely incompatible and conflict with the ones being curated on Wikidata. The documentation should state that an OCD-ID is only valid in the context of an ( |
Okay, good luck changing a foreign country's legislature, Google ;)
We're discussing how to improve the current solution.
I don't understand this section. What is your position? Phrasing everything as conditionals makes it hard to know what you actually want to be the case. |
If that's the case then I propose we focus on solving the problem for the modern era (e.g., make sure we work reasonably well for the date ranges already covered by the data set) and not worry about going back to 1867.
I'm trying to get clarity on your position. You've dismissed Wikipedia as a valid data source (which is fine) and have said we should limit ourselves to only what the government is currently publishing. My section tries to lay out the implications of that position in the broader ecosystem and what it would mean for downstream clients. So ... what is your position here on what should be in this repository, how it should relate (or not) to other data sources, and how we should be updating our documentation and tooling to enforce that position? |
OCD-IDs should track the identity of divisions across time, including historical divisions where there is the capacity to do so. Due to data quality issues, etc. this goal is not always met. This issue is about improving the situation in Canada. It is not a goal of OCD-IDs to be consistent with Wikidata. Wikidata is nowhere mentioned in any documentation about OCD-IDs. That said, consistency across databases/standards (ISO 3166, etc.) is preferred where appropriate. The OCD-ID is sufficient to identify a division even in the current iteration. There is no conflict within the current dataset that requires Wikipedia and Wikidata do not have good coverage of non-current divisions. For example, there is a page for Algoma—Manitoulin—Kapuskasing but not for Algoma or Algoma—Manitoulin, which are not the same district according to the proposed "name = identity" method or according to the Library of Parliament (and to be 100% clear, the LOP is the greater authority, not Wikidata or any other secondary sources). Edit: For even greater clarity, it is Parliament that creates, renames and abolishes divisions. Elections Canada uses those divisions. It does not create, rename, abolish or otherwise affect their lifecycle. As they are concerned with elections, they do not care if a division is renamed 3 times in a parliamentary session. They care about the name at the time of an election. So Parliament is always the preferred source if they are publishing data of sufficient quality. |
I agree that this is a reasonable solution.
The OCD-ID project works with a simple lifecycle model of divisions being created (validFrom), abolished (validThrough) and renamed (sameAs): https://github.com/opencivicdata/docs.opencivicdata.org/blob/master/proposals/0002.rst The model does not allow for "re-establishment" of divisions. We can of course consider changes to the model, but I think that is a separate issue. If we want to work within the current model in this issue for Canada, then the only possible events are creating, renaming and abolishing divisions. We don't have the tools to properly model "re-establishment" (if this is even a thing – the Library of Parliament doesn't think so...). |
I would agree with that.
Then the documentation should be updated to say that. We should also reach out to people at Wikidata to make it clear that these two projects may provide overlapping identifiers but are not related and not guaranteed to be compatible.
I would disagree with that. For example:
Using the number to identify the districts even over a single redistricting cycle simply won't work. The names change, the shapes change, and they do so in a subtly very wrong way: the vast majority of ( Given this example, what set of OCD-IDs and other metadata would you propose to represent this kind of information? What information is associated with the "canonical" OCDs |
Are we to enumerate and reach out to every project with which we don't have an official policy of being consistent with? That doesn't make sense. Is there some undocumented agreement or discussion with Wikidata that I'm missing? I'm confused.
Anyway, can we focus on the proposal that @evannjw kindly suggested rather than going on about the deficiencies with the existing approach? |
Wikipedia is a large enough project that we should at least attempt to clarify our relationship to them. One last feedback here is that under the approach mentioned above it seems we'd be randomly appending years onto identifiers, which seems a bit chaotic. Unfortunately election-related issues are pulling me away so I can't comment a bunch more at the moment. Hope to get back to the mid/late next week. |
It's not random – when films are remade, years are appended. So, on Wikipedia, there's True Grit (1969 film) and True Grit (2010 film). Most films don't have these suffixes, because they have not been remade, or were remade under a different title. Similarly, if a district is abolished, and then a future district uses the same name, a year is suffixed to distinguish them. -- Please open a new issue to discuss the OCD-ID projects relationship to other databases/projects, so that we don't overload a conversation about Canada with those separate concerns. |
Food for thought on appropriate sources of information for political divisions: https://mitpress.mit.edu/9780262046299/writing-the-revolution/ Book review (paywalled) https://www.ft.com/content/872b64f5-e735-4feb-9580-1e362ba85b7b |
I can't remember why I created these files, but uploading here in case they are relevant to history. |
A new set of OCD-IDs are created every time there is redistricting creating a problem for how we represent districts contiguously across time. Rather than use the numeric federal district ids, which only represent a district’s identity for 10 years, we should use district names as they are more closely represent a district’s identity. Some concerns regarding the switch to using names as identifiers are that they can change between redistricting cycles (see https://www12.statcan.gc.ca/census-recensement/2011/ref/FED_name_changes-changements_noms_CEF-eng.cfm and https://openparliament.ca/bills/42-1/C-402/). When names are changed, new ids and aliases can be created so that identity is maintained through the canonical OCD-ID. When redistricting occurs, districts created after redistricting that don’t map to an old one (not the same name or renaming of a district) will be considered new districts and ids will be created for them. Old districts which no longer exist will have a validThrough date to indicate it is no longer active.
The text was updated successfully, but these errors were encountered: