this post was submitted on 08 Sep 2023
335 points (94.2% liked)

Technology


But what if you do? Will you get caught?

all 38 comments
[–] Lettuceeatlettuce@lemmy.ml 82 points 2 years ago (2 children)

Lol this has the same energy as those NFT idiots crying about people taking screenshots of their stupid monkeys.

Noooo, stop scraping my dataaaa!

[–] cheese_greater@lemmy.world 1 points 2 years ago (1 children)

Never waste an opportunity to use "muh" in place of "my"

[–] ripcord@kbin.social 8 points 2 years ago

And the next generation of AI probably only needs a fraction of the data it needs now so the need to scrape the data is gone.

[–] Darkard@lemmy.world 51 points 2 years ago (2 children)

I hope someone makes some manic bot that scrapes every last tweet and posts it on a duplicate site called Y

[–] SinningStromgald@lemmy.world 13 points 2 years ago* (last edited 2 years ago) (1 children)

Might as well go whole hog and do the entire alphabet. Then do one for every iteration of every letter combination.

[–] veloxization@yiffit.net 44 points 2 years ago

He's still mad at those researchers for scraping the data that shows that ever since he took over, the antisemitism, racism and general bigotry have gone up on the platform.

[–] Sanctus@lemmy.world 40 points 2 years ago (1 children)
[–] cheese_greater@lemmy.world 0 points 2 years ago (1 children)

Let the Supreme Court enforce it ;)

[–] Sanctus@lemmy.world -2 points 2 years ago

It should, but it won't.

[–] tonytins@pawb.social 25 points 2 years ago (1 children)

How on earth do they plan on enforcing that? xD

[–] xavier666@lemm.ee 13 points 2 years ago (1 children)

They don't have to enforce it. If someone says bad things about Twitter by analysing their content, Twitter can sue them for scraping.

[–] IphtashuFitz@lemmy.world 18 points 2 years ago

“Our interns spent 500 hours collecting the raw data”.

[–] underisk@lemmy.ml 25 points 2 years ago (1 children)

I’m pretty sure both parties must agree to the terms before they legally bind anyone so wouldn’t this just apply to logged in users?

[–] thepianistfroggollum@lemmynsfw.com 8 points 2 years ago* (last edited 2 years ago) (3 children)

Accessing the website is often viewed as accepting the terms, so that wouldn't hold up. Not that they'd have a legal standpoint on the issue.

[–] NeoNachtwaechter@lemmy.world 15 points 2 years ago (1 children)

“Accessing the website is often viewed as accepting the terms”

The scraping bot can't read the terms

But even if it could, it wouldn't give a damn :-)

[–] mojo@lemm.ee 12 points 2 years ago (1 children)

By reading this message you agree to my terms that I'm really cool

[–] eskimofry@lemmy.one 3 points 2 years ago

Lol and your username

[–] TheEntity@kbin.social 8 points 2 years ago

How do you read the terms without accessing their website?

[–] ChunkMcHorkle@lemmy.world 22 points 2 years ago* (last edited 1 year ago)

deleted by creator

[–] Jaysyn@kbin.social 20 points 2 years ago

This is hilariously unenforceable as long as Twitter is on the public internet.

[–] 7fb2adfb45bafcc01c80@lemmy.world 19 points 2 years ago* (last edited 2 years ago) (2 children)

I thought this was an article about the X Windows system based on the preview for the article. Boy are those two similar-looking.

[–] MJBrune@lemmy.world 2 points 2 years ago

Realistically, very few people know about the X Window System, and even fewer care about it.

You could always join wayland.social

[–] pseudorandom@kbin.social 17 points 2 years ago

Or just stop using X altogether.

[–] cheese_greater@lemmy.world 14 points 2 years ago

Crawling for me, not thee!

[–] dingleberry@discuss.tchncs.de 14 points 2 years ago (1 children)

Just update robots.txt, coward!

[–] Iwasondigg@lemmy.one 6 points 2 years ago

Took a look at their robots.txt, it appears to block all bots except Google.
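
For anyone who wants to check this kind of thing themselves, here is a rough sketch using Python's standard `urllib.robotparser`. The robots.txt text below is a made-up stand-in for an "allow Google, block everyone else" policy, not a copy of the real file, and `can_fetch`, `MyScraper` and the example.com URL are just illustrative names.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt (an assumption, NOT the site's actual file):
# Googlebot may crawl everything, every other user agent is disallowed.
HYPOTHETICAL_ROBOTS_TXT = """\
User-agent: Googlebot
Allow: /

User-agent: *
Disallow: /
"""


def can_fetch(user_agent: str, url: str,
              robots_txt: str = HYPOTHETICAL_ROBOTS_TXT) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "https://example.com/some/page"
    for agent in ("Googlebot", "MyScraper"):
        verdict = "allowed" if can_fetch(agent, page) else "blocked"
        print(f"{agent}: {verdict}")
```

Keep in mind robots.txt is only a request. A crawler has to choose to honor it, which is pretty much the point others in this thread are making.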

[–] BlinkerFluid@lemmy.one 11 points 2 years ago

don't!

You heard him, scrape more.

[–] YurkshireLad@lemmy.ca 8 points 2 years ago

So he’s going to sue Google then?