this post was submitted on 27 Sep 2025
119 points (99.2% liked)

Wikipedia

3671 readers
126 users here now

A place to share interesting articles from Wikipedia.

Rules:

Recommended:

founded 2 years ago
MODERATORS
 

A Language Committee member provided the following comment:

The proposal for closing the Greenlandic Wikipedia is accepted. Despite Greenlandic being an official language with roughly 60,000 speakers, the wiki has never developed a viable community: over the last two decades only one or two Greenlandic users have contributed, and there has been almost no growth in the last five years. Most articles are short or unintelligible, and machine-generated content—initially from experimental Greenlandic machine translators and more recently from AI tools like Google Translate—has frequently produced nonsense that could misrepresent the language. The sole active admin, with academic expertise in Greenlandic, has had to monitor and delete such content, but no sustainable community exists to safeguard the language. Given the risk of harm to the Greenlandic language, the seemingly negative attitude towards the project in Greenland, and the absence of genuine user activity, the project should be closed, with any remaining content moved to the Incubator for future use. --MF-W 20:53, 12 September 2025 (UTC)

top 19 comments
sorted by: hot top controversial new old
[–] dantheclamman@lemmy.world 22 points 4 days ago (2 children)

Long read (maybe paywalled) about why the use of machine translation on wikipedia does a lot of harm and we should stop. It creates a feedback loop since these models often train on Wikipedia. https://www.technologyreview.com/2025/09/25/1124005/ai-wikipedia-vulnerable-languages-doom-spiral/

[–] antonim@lemmy.dbzer0.com 1 points 2 days ago

A great article, thanks for sharing it.

Research produced by Google before a major expansion of Google Translate rolled out three years ago found that translation systems for lower-resourced languages were generally of a lower quality than those for better-resourced ones. Researchers found, for example, that their model would often mistranslate basic nouns across languages, including the names of animals and colors.

I remember discussing this some time ago on reddit. Google Translate very suddenly introduced a number of small languages and IIRC one of the speakers personally expressed frustration at the horrible output. Some people proposed the speakers correct it (you can always report bad translations there and propose your own), but it hardly requires explaining why that's a bad and futile idea...

[–] Jessica@discuss.tchncs.de 9 points 3 days ago* (last edited 3 days ago) (1 children)

On the one hand, it's unfortunate Wikipedia is having to spend extra human labor to deal with that. On the other hand, I'm always down for poisoning LLM data sets

[–] dantheclamman@lemmy.world 5 points 3 days ago

Big generative LLMs are one thing, but the translation tools themselves like Google Translate are models, have been since before the current AI craze of recent years, and really can be helpful for people trying to learn their language if there are a shortage of native speakers. Those train with Wikipedia too

[–] thrawn21@lemmy.world 41 points 4 days ago

Well that's sad, but understandable.

[–] KSPAtlas@sopuli.xyz 9 points 3 days ago (1 children)

I assume they're trying to avoid another Scots Wikipedia situation here, where a low quality Wikipedia misrepresents the language?

[–] daggermoon@lemmy.world 3 points 3 days ago (3 children)

Is Scots Wikipedia really that bad? I was viewing a few pages in Scots and I was surprised how much of it I could read given how similar it is to English.

[–] KSPAtlas@sopuli.xyz 3 points 2 days ago* (last edited 2 days ago)

Basically a bunch of pages were written by an American teenager just using an English Scots dictionary, so it was written in basically English with improperly used Scots words

If you see an article with a heading like "the 'Scots' in this article wis written bi a body that hasna a good grip on the leid", it means it's written in fake Scots

Example:

[–] njm1314@lemmy.world 4 points 3 days ago (1 children)

I'm not familiar, but I assume from your description it must have been a horrible representation of the language because if you could understand it clearly it's not really a good representation of the Scottish tongue.

[–] daggermoon@lemmy.world 2 points 3 days ago (1 children)
[–] Soulg@ani.social 2 points 3 days ago

This is the bad one right? I can mostly follow it but a couple slang terms I kinda have to guess

[–] daggermoon@lemmy.world 4 points 3 days ago (1 children)

Hopefully Greenlandic users can restart it one day, though it seems unlikely.

[–] nutsack@lemmy.dbzer0.com 1 points 3 days ago

maybe the republic of greenlandia can self host it

[–] JubilantJaguar@lemmy.world -2 points 4 days ago (1 children)

Canary in the coal mine for the rest of the project?

Personally I've always worried about Wikipedia's top-heaviness. It's much easier to create content than to maintain it. Of those 7 million articles in EN, an awful lot are "short or unintelligible" or outright "nonsense" - and on top of that they're becoming steadily out of date.

IMO this amazing project needs to move to retrenchment so as safeguard its reputation. Logically that means lots and lots of deletion. Not a popular opinion alas.

[–] dantheclamman@lemmy.world 9 points 4 days ago* (last edited 4 days ago) (1 children)

As someone who browses a lot of Wikipedia in English, I haven't seen much outright nonsense

[–] JubilantJaguar@lemmy.world 4 points 3 days ago (1 children)

Perhaps we have difference thresholds for what constitutes nonsense.

The main issue IMO is outdatedness, and it's reaching almost insurmountable proportions. Take a random article outside the 1000 most popular ones (and outside the generally decent ones on hard science) and you'll find that the "center of balance" of cited dates is now a decade or more in the past. "As of 2009, the proposed bridge is awaiting approval", "The budget was to be revised in June 2012", etc. The problem is absolutely rampant. And completely logical because that was the period when all the editing was happening - the number of editors has dropped off hugely since then. And yet there's very little appetite for deleting obsolete content. In my analysis that's because the original generation of Wikipedians skew by nature towards the idealistic and tend to believe that all those articles will be updated and fixed eventually, it's a just a question of time. Personally I'm not convinced. I think that idealism is misplaced and it's now undermining the project.

[–] dantheclamman@lemmy.world 8 points 3 days ago

I would take that as evidence of Wiki being a mature site. Old school encyclopedias had a lot of issues with outdatedness too. They tended to just not have articles about timely topics to reduce that, but they still struggled to keep up with the times. But it definitely is a big issue, especially with link rot and other problems of the old internet.