Looking forward to this short workshop just getting started now:
Reflecting on Power and AI: The Case of GPT-3
Part of the discussion will revolve around this paper:
From the GPT-3 paper: https://papers.nips.cc/paper/2020/file/1457c0d6bfcb4967418bfb8ac142f64a-Paper.pdf
"In collecting training data for GPT-3, we used the unfiltered distribution of languages reflected in internet text datasets (primarily Common Crawl)"
For those who are interested in why web archives matter, this is very significant.