All the ways I can implement blocking hate speech on my app

So at the Bubble conference I was super excited to share my app with everyone and found that someone put the first piece of hate speech on my social media app. I’m honestly a little saddened someone in the community would be such a troll but it also made me realize that I am now getting enough users that I need to make sure if there is hate speech, it can be located and removed as fast as possible. So, I’d love to brainstorm ways to be able to do this. So far I have:

  1. When a user reports a card, it disappears until an admin oks it again.
  2. Ability to ban an email address as well as the VPN it utilizes
  3. Create a list of words that if a user utilizes them, they will not be able to submit their post
  4. Create a system of “troll hunting” points so people are incentivized to report trolls.

What are your thoughts on these options and do you have other ideas?

Hi there, @aliana… first, I’m sorry to hear about what happened after you shared your app at BubbleCon, and while it is sad, it was pretty much inevitable that it was going to happen at some point, so it’s probably good that you are thinking about moderation tools now.

As you may or may not know, Bubble uses Discourse for this forum, so you might want to check out the Discourse Moderation Guide to get some ideas. That being said, the ideas you have so far are certainly reasonable, and I really dig the “troll hunting” points thing. The success of such a feature would likely depend on what the points can be used for (and you would have to make sure folks couldn’t “abuse” the feature just to get points), but incentivizing folks to help with moderation makes a lot of sense to me.

Congrats on the success of your app, and good luck keeping the trolls at bay! If you want to go deeper on the topic, I have seen it all out here (and you wouldn’t believe some of the things I’ve seen), so feel free to hit me up directly and we can continue the conversation.

Best…
Mike

1 Like

@aliana I’m using https://www.webpurify.com/ for a project. It’s been solid. The support team should be able to help you curate your “block list” accordingly.

Are you primarily trying moderate human inappropriate posts or troll/bot posts?

Obv both but you should focus on one first and since they have different solutions. IP and email has very limited efficacy: you should use device fingerprinting.

I’d also have different level of trust (first 10 posts or already flagged user vs trusted user) so you can apply different levels of moderation. Banned words is needed but there are oh so many workarounds…