this post was submitted on 07 Aug 2023
496 points (97.7% liked)

Science

17979 readers
27 users here now

Subscribe to see new publications and popular science coverage of current research on your homepage


founded 6 years ago
MODERATORS
 

Was this AI trained on an unbalanced data set? (Only black folks?) Or has it only been used to identify photos of black people? I have so many questions: some technical, some on media sensationalism

you are viewing a single comment's thread
view the rest of the comments
[–] Yoruio@lemmy.ca 60 points 2 years ago (3 children)

Was this AI trained on an unbalanced dataset (only black folks?)

It's probably the opposite. the AI was likely trained on a dataset of mostly white people, and thus more easily able to distinguish between white people.

It's a problem in ML that has been seen before, especially for companies based in the US where it is just easier to find a large amount of white people as opposed to people of other skin colors.

It's really not dissimilar to how people work either, humans are generally more able to distinguish between two people who are races that they grew up with. You'll make more mistakes when trying to identify people of races you aren't as familiar with too.

The problem is when the police use these tools as an authoritative matching algorithm.

[–] LetterboxPancake@sh.itjust.works 11 points 2 years ago (2 children)

It's not only growing up with them. We're just better identifying people/animals/things we're familiar with. Horses all look the same if you're not around them regularly. You can distinguish colours, but that's it.

Not comparing people to horses by the way...

[–] lntl@lemmy.ml 3 points 2 years ago (1 children)

I thought they would have trained it on mugshots. Either way, it should never be used to make direct arrests. I feel like it's best use would be something like an anonymous tip line that leads to investigation.

[–] Yoruio@lemmy.ca 3 points 2 years ago

Using mugshots to train AI without consent feels illegal. Plus, it wouldn't even make a very good training set, as the AI would only be able to identify perfectly straight images shot in ideal lighting conditions.

[–] gramathy@lemmy.ml 2 points 2 years ago

Also makes me wonder if our defined digital color spaces being bad at representing darker shades contributes as well.