ralfmaximus:

image

To understand what’s going on here, know these things:

  1. OpenAI is the company that makes ChatGPT
  2. A spider is a kind of bot that autonomously crawls the web and sucks up web pages
  3. robots.txt is a standard text file that most websites use to tell spiders whether or not they have permission to crawl the site; basically a No Trespassing sign for robots
  4. OpenAI’s spider is ignoring robots.txt (very rude!)
  5. the web.sp.am site is a research honeypot created to trap ill-behaved spiders, consisting of billions of nonsense garbage pages that look like real content to a dumb robot
  6. OpenAI is training its newest ChatGPT model using this incredibly lame content, having consumed over 3 million pages and counting…

It’s absurd and horrifying at the same time.
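
For anyone curious what "respecting robots.txt" looks like in practice, here is a minimal sketch using Python's standard library. It only illustrates the check a polite spider would run before fetching a page; it says nothing about how any real crawler is built. The robots.txt contents, the `ExampleBot` user agent, the example.com URL, and the `may_crawl` helper are all made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt contents: every spider is told to stay out.
# A real site's file will differ.
ROBOTS_TXT = """\
User-agent: *
Disallow: /
"""


def may_crawl(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines directly, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# "ExampleBot" is a made-up spider name used only for this sketch.
print(may_crawl(ROBOTS_TXT, "ExampleBot", "https://example.com/page.html"))  # prints False
```

A well-behaved spider would skip any URL for which this check comes back False. Nothing enforces that, though: robots.txt is a polite request, which is why a rude spider can simply ignore it.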

theriverbeyond:

Ideal work schedule:

  1. I show up and am given a list of cognitively engaging but achievable tasks
  2. I complete the list
  3. I leave immediately