Content Moderation for End-to-End Encrypted Messaging

amarashar's bookmarks 2019-10-06

Summary:

I aim to demonstrate these clarifications by formalizing specific content moderation properties for end-to-end encrypted messaging, then offering at least one possible protocol design for each property. User Reporting: If a user receives a message that he or she believes contains harmful content, can the user report that message to the service provider? Known Content Detection: Can the service provider automatically detect when a user shares content that has previously been labeled as harmful? Classifier-based Content Detection: Can the service provider detect when a user shares new content that has not been previously identified as harmful, but that an automated classifier predicts may be harmful? Content Tracing: If the service provider identifies a message that contains harmful content, and the message has been forwarded by a sequence of users, can the service provider trace which users forwarded the message? Popular Content Collection: Can the service provider curate a set of content that has been shared by a large number of users, without knowing which users shared the content?

Link:

https://freedom-to-tinker.com/2019/10/06/content-moderation-for-end-to-end-encrypted-messaging/

From feeds:

Harmful Speech » amarashar's bookmarks
Gudgeon and gist » Freedom to Tinker

Tags:

harmfulspeech

Authors:

Jonathan Mayer

Date tagged:

10/06/2019, 20:13

Date published:

10/06/2019, 11:53