Sitting through roll call in school is bad enough if you have a last name like Weiner, Butts, Cummings, Medick, Dickman, or Suconcock. The digital age has made it worse: Just try getting past the first stage of an online registration process.
The internet-related woes of people with dirty-sounding last names are officially known as [the Scunthorpe problem](. Websites routinely use tools to prevent users from making accounts with fake or obscene wordsâbut overzealous filters and poorly written code often flag innocent phrases that either happen to contain obscene words within them, or are legitimate use cases of such words.
Obscenity filters arenât new. On the contrary, theyâve been plaguing internet users since, well, the internet. Whatâs telling is why two decades of machine-learning advancements havenât improved them.
ð¦ [Tweet this!](
ð [View this email on the web](
[Quartz Obsession]
The "Scunthorpe" problem
September 14, 2018
What's in a name?
---------------------------------------------------------------
Sitting through roll call in school is bad enough if you have a last name like Weiner, Butts, Cummings, Medick, Dickman, or Suconcock. The digital age has made it worse: Just try getting past the first stage of an online registration process.
The internet-related woes of people with dirty-sounding last names are officially known as [the Scunthorpe problem](. Websites routinely use tools to prevent users from making accounts with fake or obscene wordsâbut overzealous filters and poorly written code often flag innocent phrases that either happen to contain obscene words within them, or are legitimate use cases of such words.
Obscenity filters arenât new. On the contrary, theyâve been plaguing internet users since, well, the internet. Whatâs telling is why two decades of machine-learning advancements havenât improved them.
ð¦ [Tweet this!](
ð [View this email on the web](
Origin story
Woe be Scunthorpe
---------------------------------------------------------------
The name âScunthorpe problemâ refers to the issues that residents of the English town have with online [profanity filters]( because of a certain profane word contained within the townâs name. Itâs an extra indignity for the town, which has been a punchline [since the music-hall days]( for its [name]( and the conditions that came from being the countryâs steel-processing capital: âin the club I was in last night in Scunthorpe they kept a pig on the counter as an air freshenerâ is one from [the late Ken Dodd]( considered âthe last of the great music-hall comedians.â (But itâs [pronounced]( âskun-thorp.â)
It was way back in [1996 that AOLâs filter first]( blocked residents from creating accounts. In [2004]( Scunthorpe felt the wrath of Googleâs SafeSearch filter, which[blocked local sites like]( ThisIsScunthorpe.co.uk and ScunthorpeDistrictCatsProtection.co.uk, a site for adopting cats. (The same SafeSearch feature came under fire for being so sensitive as to block any site that contained the string of letters âsexââto the frustration of the proprietors of ArkansasExtermination.com, and EssexCountyBeeKeepers.org, among many others).
AOL responded by changing the name of the town to [Sconthorpe]( in its systems; if youâre visiting, you can avoid any awkwardness by calling it [âScunny.â](
Giphy
Pop quiz
Which is not an English town that also suffers the Scunthorpe problem?
WhakataneLightwaterClitheroePenistone
Correct. This town is in New Zealand, where the municipal free internet once blocked its own name.
Incorrect.
If your inbox doesnât support this quiz, find the solution at bottom of email.
In the news
Algorithms gonna algorithate
---------------------------------------------------------------
The latest innocent person to run into an overly cautious algorithm is journalist [Natalie Weiner]( who noted on Twitter in August that she got an âOffensive language discovered in the last name fieldâ message when trying to register for high school sports outlet MaxPreps. She couldnât join with her actual nameâa pretty common one at that.
Cue a plethora of responses from similarly bedeviled internet users. A man named Steve Suconcock claimed to have âthe worst name possible for usernamesââand said that even employers [laugh at him in interviews](. Philip Sporn had to [set his name]( as âSpronâ on his computer. This problem happens [to Ben Shmuck]( âall the time.â Ditto for [Kyle Medick](. The âstruggle is realâ for [Mike Dickman](. And [Matt Cummings]( has âbeen there,â too.
Giphy
Great moments in...
Filter frustration
---------------------------------------------------------------
[1996:]( Congress mandates that all TVs include the âV-chip,â which blocks children from watching programs of a certain rating. But the ratings mostly failed to reflect prohibited content, and parents mostly didnât bother to use it.
[2001:]( Jonah Peretti, a masterâs student at MIT, foils a filter to get the word âsweatshopâ onto a pair of custom Nikes. The story goes viral, leading Peretti to help launch HuffPost and then BuzzFeed.
[2003:]( An email filter system causes âchaosâ in the UKâs House of Commons when it blocks emails discussing a sexual offenses bill and a consultation paper on censorship.
[2004:]( One in five messages sent by Londonâs Horniman Museum are blocked by email filters.
[2006:]( A search-and-replace error in a Reuters article about bees leads to the sentence âQueen Elizabeth has 10 times the lifespan of workers and lays up to 2,000 eggs a day.â
[2010:]( The Beaver, Canadaâs second-oldest magazine, is forced to change its name to the (considerably drier) Canadaâs History after filters cause its web traffic to plummet.
[2018:]( A graduate of a Christian home-schooling program ends up with a cake reading âSumma ⦠Laudeâ after a filter used by the supermarket Publix censors the Latin word for âwith.â
Reuters/Kacper Pempel
Brief history
The Clbuttic mistake
---------------------------------------------------------------
The âScunthorpeâ dilemma is unfortunate for those who just want to create normal logins. But it gets worse when programmers try to write code that swaps out offensive sequences of letters for more innocuous synonyms. Thatâs when you wind up reading about intelligence agenciesâ secret plots to [buttbuttinate]( world leaders, maybe reported by the Buttociated Press. Itâs a clbuttic blunder, but [perhaps not a common one](.
Jargon watch
The master expurgator
---------------------------------------------------------------
Back in the day, content filtering was artisanal, and no one wielded a more careful scalpel than Thomas Bowdler, a Renaissance man whose kid-friendly The Family Shakespeare was so (in)famous that his name is synonymous with sanitizing, which is why we know it as to [Bowdlerize](. Some of his edits were modest, like changing âJesu!â to âHeavens!â but he did a bit more violence to the text by making Opheliaâs drowning accidental.
Reuters/Keith Bedford
Fun fact!
The words socialism, socialist, and specialist have been known to snag filters for containing the substring Cialis.
Million-dollar question
Why can't we get obscenity filters right?
---------------------------------------------------------------
It simply isnât easy to create an algorithm that understands the vast nuance of the English (or any) language and the ways people use it. Even with advances in machine-learning algorithms, itâs still hard to create a filter that understands words in context, machine learning researcher Michael Veale [told Motherboard.]( As a result, many platforms (ahem, Facebook), are relying increasingly on human moderators. The bots may be coming, but as in many things, when it comes to parsing intention in conversation, it turns out humans still take the cake.
Last year, Google offshoot Jigsaw rolled out a public interface for its Perspective project, which [uses]( âmachine learning to automatically detect insults, harassment, and abusive speech online,â trained on Wikipedia discussions and newspaper comments and using the wisdom of crowds to determine whatâs âtoxic.â
But like any newborn, itâs slow to learn. Jigsawâs early tests rated âracism is badâ as 60% toxic, while âracism is goodâ scored just 35%. âHitler was an anti-semiteâ was 70% toxic; ârace war now,â just 24% toxic. âThe hope is over time, as this is used, weâll continue to see more and more examples of abuse, and those will be voted on by different people and improve its ability to detect more types of abuse,â a Jigsaw project manager [told Quartz](.
Giphy
Poll
How often does your name cause a headache?
[Click here to vote](
NeverAll the timeHigh school was rough but it's better now
Todayâs email was written by [Edmund Heaphy]( [Jessanne Collins]( and [Whet Moser,]( produced by [Luiz Romero](.
ð¬ Let's talk!
In yesterdayâs quiz about whether the government should be able to [neither confirm nor deny secrets]( 51% of you said âMaybeâbut they should have to come clean sooner than 50 years on.â
â¨ï¸ [Dive into our archive](
âï¸ [What did you think of todayâs email?](mailto:obsession%2Bfeedback@qz.com?cc=&subject=Thoughts%20about%20the%20%22Scunthorpe%22%20problem&body=)
ð¡ [What should we obsess over next?](mailto:obsession%2Bideas@qz.com?cc=&subject=Obsess%20over%20this%20next.&body=)
ð¬ [Forward this email to a friend](mailto:replace_with_friends_email@qz.com?cc=obsession%2Bforward@qz.com&subject=%F0%9F%98%A3%20The%20Scunthorpe%20problem%3A%20A%20clbuttic%20digital%20dilemma&body=Thought%20you%27d%20enjoy.%20%0ARead%20it%20here%20%E2%80%93%20http%3A%2F%2Fqz.com%2Femail%2Fquartz-obsession%2F1390414)
ð° [Discuss or suggest a topic on r/ObsessionObsessives](
The correct answer to the quiz is Whakatane.
Enjoying the Quartz Obsession? [Send this link]( to a friend!
If you click a link to an e-commerce site and make a purchase, we may receive a small cut of the revenue, which helps support our ambitious journalism. See [here]( for more information.
Not enjoying it? No worries. [Click here]( to unsubscribe.
Quartz | 675 Avenue of the Americas, 4th Fl | New York, NY 10011 | United States
[Share this email](