AI Generation: Diverse Dataset Collections for Inclusivity & Empowerment
Creating diverse dataset collections is a requirement to verify AI systems are a force for inclusivity and empowerment instead of exclusion that rapidly increases the digital divide between privileged people receiving all the benefits while marginalized communities receive harm from the same AI systems.
Diverse dataset collections learn from a wider range of examples from a wider range of data sources that are very different from each other. This tactic makes them more accurate, fair and representative of the diverse world we all live in. The following provides info to help you create a diverse dataset collection.
- Define Inclusion Goals: Identify the attributes, characteristics and aspects of diversity that are important for your AI application. Gender, race, age, ethnicity, different abilities, socioeconomic status, language, religion, culture and more can be defined here.
- Fully Informed Prior Consent: Work with the individuals and communities that you defined in step 1 to obtain their fully informed prior consent regarding your current and future data collection and resulting AI application work. Clearly define each (your own, the individuals and the communities) expectations regarding all aspects of your data collection and resulting AI applications. Verify…