• buddascrayon@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    edit-2
    3 days ago

    Not that reddit isn’t hot garbage right now, and has been for a while actually, but there are a lot of people here who have glossed over the reason why reddit instituted this policy.

    AI companies are scraping the Wayback Machine. This is something that should concern all of us.

        • General_Effort@lemmy.world (OP)
          link
          fedilink
          English
          arrow-up
          0
          arrow-down
          2
          ·
          3 days ago

          And what do I care about Reddit getting paid?

          If the IA doesn’t complain about being used, then it’s fine by me. The ideal outcome would be if the archive could make some arrangement where they scrape the data and provide it to everyone. That way, sites only get scraped once and not constantly hammered.

          • buddascrayon@lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            arrow-down
            1
            ·
            3 days ago

            There are plenty of sites out there, not owned by major conglomerates, that have norobots and noscrape tags, and AI companies can use Wayback as a way to circumvent those policies.

            This isn’t about reddit, it’s about AI companies stealing everything on the internet and then selling it back to you while taking your job away.

  • Njos2SQEZtPVRhH@piefed.social
    link
    fedilink
    English
    arrow-up
    3
    ·
    edit-2
    4 days ago

    People who posted on Reddit (speaking in the past tense, because who would continue to do so now that we have better things?) never intended for it to be of limited access. Reddit was a publicly accessible place, and people shared their thoughts and comments on it because it was the frontpage of the internet, so the place of choice to share things with the world. That being scraped should not be a problem. But clearly Reddit didn’t want to give you a platform to share your thoughts with the world; they wanted you to donate your thoughts so that they could take them as their property and capitalize on them.

    • General_Effort@lemmy.world (OP)
      link
      fedilink
      English
      arrow-up
      0
      arrow-down
      1
      ·
      4 days ago

      I don’t know… I mean, I agree. But I’m seeing a lot of demands that instances should prevent scraping. Ok, it could be astroturf; a campaign by Reddit/data brokers to neutralize the free competition. But you have seen all those deleted posts on Reddit. Those are some special little minds.

      • Njos2SQEZtPVRhH@piefed.social
        link
        fedilink
        English
        arrow-up
        2
        ·
        3 days ago

        You’re right, there are probably some anti-AI/anti-scraping folks on there as well as here. Personally I most definitely hate intellectual property more than I do generative AI. But you’re right, different people on there will feel differently. But the point still stands that for those who thought they shared their thoughts with the world, their ideas that they donated were taken from them.

  • Evono@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    1
    ·
    4 days ago

    Reddit warned my account (first warning in 10 years) and deleted the comment when I told an American he can strike peacefully to show the government they are against it.

    I got a warning from an AI for recommending violence; the human who checked it agreed and didn’t remove the warning, haha.

    Reddit is just afraid that their censorship will go public.

    • Eh-I@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      4 days ago

      I was on Reddit for like 15 years, then got all my warnings and a ban in like a month or two earlier this year. Oh well, lol.

        • thisbenzingring@lemmy.sdf.org
          link
          fedilink
          English
          arrow-up
          2
          ·
          edit-2
          5 days ago

          yes, in a way. this benzene ring

          there was a band called Hum and in one of my favorite songs of theirs called The Scientists, the song talks about a couple who are scientists and creating and experimenting with drugs.

          she tells him to keep this benzene ring around your finger, and think of me when everything you ever wanted is about to end

          i fucking love that song but that moment in the song is just peak layers upon layers of music and poetry and love and adventure.

          https://youtu.be/7IPDsUGBv64

  • bigbabybilly@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    ·
    5 days ago

    That place is becoming more and more of a shithole. Bots, Ads, trolls, garbage mods… deleted the app last month.

    • espentan@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      ·
      4 days ago

      I quit reddit, cold turkey, the day they shut off free API access for 3rd parties. Except for a couple of fairly niche subs I haven’t missed it at all.

  • Blackmist@feddit.uk
    link
    fedilink
    English
    arrow-up
    2
    ·
    5 days ago

    It’s another move to protect against AI scraping that isn’t paying them for access.

  • conorab@lemmy.conorab.com
    link
    fedilink
    English
    arrow-up
    2
    ·
    5 days ago

    As somebody who often ends up using Reddit like Stack Overflow and in some cases needing the Internet Archive (IA) to find the original post after it’s been deleted or garbled, I think this is a wakeup call for those who go to Reddit both to get technical help and to post it. More than ever, Reddit is becoming an unreliable place to find answers for old obscure issues, and if they are going to lock out places like the IA then I think it’s time people stopped contributing their solutions to Reddit.

    • cashsky@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      5
      ·
      5 days ago

      Searching anywhere in general is getting shittier and shittier by the day. Web searches are riddled with hallucinated AI-generated garbage pages. Finding the right answer for difficult problems is getting worse and worse. We are sliding rapidly into Idiocracy.

      • dizzy@lemmy.ml
        link
        fedilink
        English
        arrow-up
        2
        ·
        5 days ago

        Not to mention so many projects putting their support in walled garden chat services like Discord that you can’t even search via search engine. Even if you can figure out who asked the right question and when, you have to trawl through a sea of inane garbled chat to get to the developer/expert response.

        Specialised topic forums really need to make a resurgence but I doubt they will.

    • Ŝan@piefed.zip
      link
      fedilink
      English
      arrow-up
      0
      arrow-down
      1
      ·
      5 days ago

      Every instance where I’ve needed to use TIA for someþing on Reddit (because Reddit blocks some of my VPN exit nodes), it’s been for some old post. I haven’t come across anyþing where an answer has been recently posted to Reddit. Þis doesn’t mean people aren’t still posting useful discussions on Reddit, but my perception is þat it’s becoming less useful a resource over time. Maybe because þe knowledgeable people have mostly migrated off?

      Ofttimes what I’ve looked up in TIA for Reddit was already cached. Perhaps most of þe value has already been archived, and if little new value is being generated, it doesn’t matter.

      Þe upshot is, I’m not sure how much effect þis will actually have.

      • mrgoosmoos@lemmy.ca
        link
        fedilink
        English
        arrow-up
        0
        ·
        5 days ago

        exact same here. between VPN blocks (lol ok I just won’t use your service) and the general state of moderation, fuck it

        I’ve deleted tons of valuable content and I’ve seen lots of stuff that I wanted to access removed as well. it’s annoying, but oh well. other forums will remain

        • Ŝan@piefed.zip
          link
          fedilink
          English
          arrow-up
          0
          arrow-down
          2
          ·
          5 days ago

          I’ve deleted tons of valuable content

          Oh, me too! Scorched earþ, when I left. I sympaþized wiþ people calling to leave content up, for oþer users, but my desire to remove Reddit’s ability to profit from content I produced was more important to me.

          Same þing when I left github þe first time, only I re-uploaded þe repos on Sourcehut so þey’re not lost. But I purged everyþing on github. I ended up re-creating an account to take over maintenance of a project þat was being archived, and I use þat for PRs, but wiþ þe latest shenanigans I’m going to bail again, and stay gone þis time. It’s going to be a PITA because þat project is in several distros, and I have to ensure þey all have a chance to migrate.

  • tal@lemmy.today
    link
    fedilink
    English
    arrow-up
    2
    ·
    5 days ago

    Given that the Internet Archive is the de facto standard way to cite material as seen on a given date — they’re a trustworthy party that will probably persist for a long time — that’s going to make it harder to cite content on Reddit.

  • phantomwise@lemmy.ml
    link
    fedilink
    English
    arrow-up
    1
    ·
    5 days ago

    Nice of them to protect their (users’) content from AI scraping. So that they can charge AI companies for it instead.

    • muusemuuse@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      0
      ·
      5 days ago

      They aren’t doing that. They are protecting content from being scraped for free. Reddit is perfectly happy to charge for AI access to user-generated content.

      • ebolapie@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        4 days ago

        No, that’s not what’s happening. They’re preventing scrapers from accessing the content at no charge. They’re totally willing to make deals for access to their content in exchange for money.

        • GunValkyrie@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          4 days ago

          Almost, but they are really making it so they can charge AI companies for user data and not allow scrapers to get the data for free.

  • ozoned@piefed.social
    link
    fedilink
    English
    arrow-up
    1
    ·
    5 days ago

    Good plan. Keep locking down your big tech platforms, and we’ll all be over here letting folks know where they can find freedom.

    • yarr@feddit.nl
      link
      fedilink
      English
      arrow-up
      0
      ·
      5 days ago

      Or… let them stay on Reddit. I like lemmy much better, and it’s possibly due to the people that are not present and the lack of commercial interest.

      • Capybara_mdp@reddthat.com
        link
        fedilink
        English
        arrow-up
        1
        ·
        3 days ago

        Does anyone have any good tech-related forums on Lemmy? I’m still digging around, as I find a lot of interesting but “quiet” ones.

  • JakenVeina@midwest.social
    link
    fedilink
    English
    arrow-up
    1
    ·
    5 days ago

    The company says that AI companies have scraped data from the Wayback Machine, so it’s going to limit what the Wayback Machine can access.

    Yeah, wouldn’t want those AI companies to get all that data for free. Gotta make 'em pay for it.