-
Notifications
You must be signed in to change notification settings - Fork 46
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Parties: Organization classifications (e.g. women-owned businesses, legal entity type) #711
Comments
Based on discussions around #369 and the creation of the I believe that the Party Details extension approach makes it easier for us to surface, document, display and validate the different classification schemes that might be applied to organisations, and makes analysis in spreadsheet software easier when concepts are in distinct fields (rather than split into-subtables), as well as supporting cases where the detail may be more complex than a single classification. When we consider party details, we need to consider two use cases for organisational classification, and whether these need a local or a community extension. (1) Monitoring implementation of domestic policy In this case, it is important to have classifications with definitions that directly match the domestic definitions. Publishers will want to use their local categories, rather than mapping to some generic set. Policy-related 'flags' are often quite idiosyncratic, and tricky to map onto some global set with any universal applicability. These are well suited to creation of local extensions (e.g. (2) Cross-cutting analysis In these cases, generic categories are used for analysis across countries - such as analysis based on size of entity (micro, sme, large). Whilst each country might have their own category definitions and thresholds, they are likely to have a one-to-one or many-to-one mapping to the generic categories, close enough to support analytical re-use of the data. So, in the case of any categorisation like 'women-owned businesses' I think we need to look at the use case for the data, and identify:
Sometimes, we may find we want to encourage publishers to use both a local extension and a community extension. For example, an organisation tagged 'Women Owned Small Business' in a US dataset might (depending on exact definitions of the concepts here) be able to map to: {
"parties":[{
"id":"123456",
"details":{
"daimsClassification":["Women Owned Small Business"],
"scale":"sme",
"femaleOwnership":"full"
}
}]
} thus serving both domestic and cross-cutting analytical use-cases. |
I anticipate that a For the specific case of women-owned businesses, if additional detail like full, majority, partial, etc. ownership has sufficient supply and/or demand, then we have a strong case for adding a new field under |
The challenges I can see with a generic
A publisher still has the option of including an additional field, which the validator will report on, but not fail the data for. We don't currently validate classifications against their schemes. We considered some sort of generic
Yes - this will be an area where we might need some research / thinking in each case. |
Both structures require documentation and validation support… CoVE doesn't presently have logic for validating classification objects, but that's a problem we can solve. CoVE presently reports "Additional Codelist Values" and "Additional Fields". It's conceivable to report additional classification Having a specific field for every concept (instead of using generic building blocks) is not a panacea, either (sparse flattened data, collecting related fields, complexity of having more fields and extensions, etc.). Both generic and specific structures can be the better option depending on the circumstances. Once we have more use cases relating to women-owned businesses, we can consider which is best. Also, I can't find any discussion in GitHub issues about flags/classifications instead of |
This is proposed under |
Noting earlier discussions here: #181 (comment) |
Colombia tracks:
cc @yolile |
Regarding organization type, the EU uses the less ambiguous term "legal form": eForms/eForms#216 |
Which of the OCDS publishers specify women owned or small businesses? Hera is leading some upcoming research about empowering women to participate in public procurement through open contracting. It would be great to understand which of our partners are reporting whether women owned or women led businesses are participating as bidders. |
This is being worked on in CRM issue 4882, and this issue will be updated with an answer when ready. |
Dhangadhi has |
Thanks James! Sounds like Colombia and Paraguay would be good places to include in our gender research project and possibly Nepal too if we want to include a more LDC context @herahussain |
Dominican Republic will include SME and women-owned business classifications in their OCDS data. Both are expressed as flags in their source systems. |
Could we consider a mixed approach? Like having a |
@romifz Yes, it's looking like there will be a mixed approach. For this issue, we still need to figure out what the common, comparable approach should be. |
Re-organizing some points from the discussion: We will never be able to standardize every way of classifying organizations. We have a few options (which can be mixed):
We should continue to do (1). For the concepts mentioned so far:
I think we should recommend (3), using the organizationClassification extension. Publishers would set a scheme. In the EU profile, we prefix
The guidance to publishers would be similar to how organization identifier schemes are constructed, e.g. with an ISO-3166-1 code as the first part of the scheme. Publishers would be encouraged to list all schemes and classification codes in their publication policy. As concepts become standardized, we can recommend that publishers use the new fields, as in (1) – though they likely ought to continue to publish the existing schemes, for any users that rely on them. If we were to recommend (2), then users would end up seeing many additional fields and/or local extensions. They would have to reconcile which of these relate to organizational classification, and which relate to other details about the organization. Once concepts are standardized, users would see both the standardized field and the local field at the same level of hierarchy, which will likely cause confusion or at least raise questions. Publishers can still choose to do (2) if they have good reasons, but I think (3) is better, because it collects all non-standardized organizational classification concepts in one place, and because, once standardized, there is a clear segmentation between the standardized and non-standardized concepts. |
In the case of known classification schemes, I think a codelist should be provided. Otherwise, two publishers may choose different codes for the same scheme. |
We haven't done this in the past because:
We could provide a codelist in cases where we wouldn't violate intellectual property rights, but the value of providing that codelist is diminished until/unless we update the Data Review Tool (or some other tool) to be able to use that codelist. Or are you thinking that some publishers would use the codelist to guide their implementation? |
Sorry, that's not what I meant, "codelist" may not be the right term. What I think is a list of "names to use" in the From what I've seen, publishers struggle to understand the goal of the |
Ah, I understand. The The I think the solution here is to do something similar to the So, for 1.1.5, I would propose adding a Then, in 1.2.0, we can rename |
Last week @pindec and I discussed a simple option like an open code list to allow publishers to identify special characteristics of bidders and contractors according to their own policy needs, with the option of a details field to give more details if needed. also, just FYI according to this blog: https://www.opengovpartnership.org/stories/faces-of-open-government-jen-bretana/ South Cotabato publishes 'gender tagging of women owned businesses joining biddings or awarded with contracts' (but I don't have more details about how this is published) |
Regarding adding the codes mentioned above to the codelists: Sometime after we authored the EU profile, the EU created these controlled vocabularies:
They don't seem to yet have one for the transport categories at https://op.europa.eu/en/web/eu-vocabularies/authority-tables I can't find any list in the regulation or from a quick search of EUR-Lex, so for
The standard forms confusingly refer to these as "areas covered", but the legislation uses "areas" to mean geographic areas, whereas the forms are clearly about services. We can prefix the titles with "EC" like we do for CPV, to scope to the European Commission, and use an abbreviated/edited description from those pages's About tab, so:
It's most important to include in the codelist the link to the Source, so users can follow through. For COFOG, the https://unstats.un.org/unsd/classifications/Econ/ link you found is better than anything I'd seen so far (it has both data files with codes and titles, and PDFs with long descriptions of each code). So, I recommend that for the source. The title/description can be (based on this):
|
Now in Explorer: https://extensions.open-contracting.org/en/extensions/organizationClassification/master/ I need to review this issue to make sure everything is covered either by #990 or the new extension. Ideally, separate issues can be created for any follow-up, so that this wide-ranging issue can be closed. |
The most relevant comment is #711 (comment), which is now documented by #990. The comments about changes to |
Related to legal form (but at a less granular level), MAPS defines "Civil society organisation (CSO)" as:
|
Regarding legal form, here are two codelists. However, the codes used are opaque: https://www.gleif.org/en/about-lei/code-lists/iso-20275-entity-legal-forms-code-list These two are linked from https://semiceu.github.io/Core-Business-Vocabulary/releases/2.2.0/#LegalEntity.legalformtype |
Motivation
Some procurement policies create obligations to track and/or promote the participation of women-owned businesses in contracting processes.
Many other classifications exist; women-owned businesses is just used as a common example.
Discussion
Small or Medium Enterprise (SME) is another classification that is frequently the subject of procurement policies. There is a community extension to distinguish SMEs: https://github.com/open-contracting/ocds_partyDetails_scale_extension
For 'women-owned' and other ownership-related classifications, company registers rarely if ever track this information as structured data. All known cases of structured data are from procurement systems.
The US DATA Act Information Model Schema (DAIMS) tags awards with different facts ("Procurement Award Specific Characteristics"), like whether it's "Minority Owned Business" or "Woman Owned Business" but also whether it's "Airport Authority" or "Clinger-Cohen Act Planning Compliance". See this diagram from this page.
If the system were storing demographic facts about the organization's control in a separate field, we could create an extension with narrowly defined fields. However, given that known systems store various facts in a single field, we should create an extension with broadly defined fields, closer to "tags." The extension could have an open codelist to allow a standard way of expressing e.g. "woman-owned business".
Proposal
Given that the classifications of organizations may be about ownership, incorporation, sector, location, number of employees, etc., and given that these classifications intersect ("Small Agricultural Cooperative"), I propose reusing the Classification object. The object is currently only used for Item classifications. Its
scheme
field uses an itemClassificationScheme codelist, but otherwise the object is generic. As I have no reason to believe that organizations have a 'primary' classification, I propose naming the fieldclassifications
instead of following the pattern of a primaryclassification
and thenadditionalClassifications
.I also recommend deprecating the Party Scale extension as its use case will be duplicated by this extension.Example:
In terms of existing schemes:
We could model these classifications as an array of strings, but it seems likely that more robust classification schemes will emerge as structured data, for which an array of Classification objects is better suited.
Reference
For reference, the full list of Procurement Award Specific Characteristics in DAIMS is:
Click to expand
The text was updated successfully, but these errors were encountered: