Stuart Morrow
π 1 karma
2024-06-19
Infer nationality, native language, whatever, of writer based on how they use English
1
vote
0 answers
It's not immediately clear how you'd train this. Location subreddits, for example - /r/northernireland has a subscriber count implying most users there don't actually live in Northern Ireland...
The title is descriptive enough, but if it's not, I'm talking about how "Euro English" gives someone away as European, "if ... would be ..." instead of "if ... were ..." narrows them down to German or Dutch, "graduate" in the same paragraph as "18" probably makes them American or Americanised-ESL as no non-American native English speakers "graduate" from anything other than university, etc. And, of course, American and non-American spellings of a word may differ.
Basically, there are textual equivalents of a person having an accent. Sometimes you actually have to read quite alot of text to make an inference by eye. Automate this.
The title is descriptive enough, but if it's not, I'm talking about how "Euro English" gives someone away as European, "if ... would be ..." instead of "if ... were ..." narrows them down to German or Dutch, "graduate" in the same paragraph as "18" probably makes them American or Americanised-ESL as no non-American native English speakers "graduate" from anything other than university, etc. And, of course, American and non-American spellings of a word may differ.
Basically, there are textual equivalents of a person having an accent. Sometimes you actually have to read quite alot of text to make an inference by eye. Automate this.
Post
Help
β + D bookmark this site for future reference
β + β/β go to top/bottom
β + β/β sort chronologically/alphabetically
ββββ navigation
Enter open selected entry in new tab
β§ + Enter open selected entry in new tab
β§ + β/β expand/collapse list
/ focus search
Esc remove focus from search
A-Z go to letter (when A-Z sorting is enabled)
+ submit an entry
? toggle help menu
Sign in to continue (100% free)
To prevent spam, some actions require being signed in. It's free and only takes a few seconds.
Sign in with Google0 AIs selected
Clear selection
#
Name
Task