Scraped data of 2.6 million Duolingo users released on hacking forum

stopthatgirl7@kbin.social · 1 year ago

Scraped data of 2.6 million Duolingo users released on hacking forum

circuitfarmer@lemmy.sdf.org · edit-2 1 year ago

“Scraped” data suggests that it’s data available on public profile pages. However, the article also says the dump is a mix of public and non-public info. So which is it, scraped or not? It’s an important distinction, because data collection by scraping is technically not a breach.

ADTJ@feddit.uk · 1 year ago

Take this with a pinch of salt but what I’m gathering is that it’s essentially just taking people’s public profiles but the Duolingo api also exposes users’ e-mail addresses (and possibly other info) that isn’t normally displayed as part of the user’s public profile via their app.

In essence, they’re exposing more data than they probably should be and users were not really aware that data was being made public - that’s why people are upset about it.

circuitfarmer@lemmy.sdf.org · 1 year ago

Ok, this makes sense – in which case the API should not be exposing data that isn’t otherwise available on the public profile, so that is significant.