• FlordaMan@lemmy.world
    arrow-up
    67
    ·
    3 days ago

    I kinda want to mirror this to the fediverse with a bot to 1. make more people see it and 2. keep a copy so that when it gets taken down, it’s still distributed on here.

    Should I do it? Or is that dumb?

    • Aniki@feddit.org
      arrow-up
      2
      ·
      1 day ago

      wait is that website a fediverse instance actually?

      also if you do mirror it, make sure to do it in an efficient way. for example, some websites, like wikipedia, offer one large download to archive the whole site. that’s less strain on the server than scraping each page individually.

    • DrMartinu@lemmy.dbzer0.com
      arrow-up
      8
      ·
      3 days ago

      Yeah, I’m real torn. On one hand, I immediately want to scrape this site, but I also don’t want to beat the site up by tying up their bandwidth. There seems to be a parent site, db4p.org, that’s managing mirrors of this site, but I don’t see any sort of torrent or archive. If there’s something like that, I’d be very inclined to just archive the entire site/database.

      • FlordaMan@lemmy.world
        arrow-up
        7
        ·
        3 days ago

        Mmm… such a bot could run once every 24 hours, either “visiting the site” and reading the HTML contents, or using the DB directly if they have an API somewhere.

        Either way it doesn’t cost them much.
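A rough sketch of how a once-a-day run could stay light on the site's bandwidth, assuming the server sends `ETag` or `Last-Modified` headers (nothing in this thread confirms that it does). The helper names, the interval constant, and the User-Agent string are all made up for illustration. If the server honours conditional requests, an unchanged page comes back as a small `304 Not Modified` response instead of the full HTML.

```python
from datetime import datetime, timedelta, timezone
from typing import Dict, Optional

# "Once every 24 hours", as suggested in the comment above.
FETCH_INTERVAL = timedelta(hours=24)


def is_due(last_fetch: Optional[datetime], now: datetime) -> bool:
    """Return True if the bot has never fetched, or the interval has elapsed."""
    return last_fetch is None or now - last_fetch >= FETCH_INTERVAL


def conditional_headers(etag: Optional[str],
                        last_modified: Optional[str]) -> Dict[str, str]:
    """Build request headers that let the server skip resending unchanged pages.

    `etag` and `last_modified` are the values saved from the previous
    response, if the server provided any (an assumption, not a given).
    """
    # Hypothetical User-Agent; a real bot should identify itself and its operator.
    headers = {"User-Agent": "mirror-bot-sketch/0.1"}
    if etag:
        headers["If-None-Match"] = etag
    if last_modified:
        headers["If-Modified-Since"] = last_modified
    return headers


if __name__ == "__main__":
    now = datetime.now(timezone.utc)
    print(is_due(now - timedelta(hours=2), now))
    print(conditional_headers('"abc123"', None))
```

The returned dictionary can be passed to whatever HTTP client the bot uses. If the site turns out to offer an API or a bulk export, that would likely be the better route than fetching HTML at all.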

    • ayyy@sh.itjust.works
      arrow-up
      1
      ·
      2 days ago

      There’s a kernel of a good idea there but I’m not sure how the actual format of that would work out…

    • stoy@lemmy.zip
      English
      arrow-up
      2
      ·
      3 days ago

      Mirror it how exactly?

      Do you mean scraping the data and publishing a report to lemmy/piefed/mastodon?

      • FlordaMan@lemmy.world
        arrow-up
        1
        ·
        3 days ago

        I was thinking indeed scraping (or using an API), and when a new entry is made, repost it on here (in a separate community).
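The "repost only when a new entry is made" part could look roughly like this. It is a sketch that assumes each scraped entry carries some stable identifier, here a hypothetical `"id"` key; the real site's data format is not known from this thread. The function and field names are illustrative only.

```python
from typing import Dict, List, Set


def find_new_entries(entries: List[Dict[str, str]],
                     seen_ids: Set[str]) -> List[Dict[str, str]]:
    """Return entries not seen before, and record their ids in `seen_ids`.

    Entries are returned in their original order. Duplicates inside the
    same batch are reported only once.
    """
    fresh = []
    for entry in entries:
        entry_id = entry["id"]  # hypothetical stable identifier per entry
        if entry_id in seen_ids:
            continue
        seen_ids.add(entry_id)
        fresh.append(entry)
    return fresh


if __name__ == "__main__":
    seen = {"a1"}
    batch = [
        {"id": "a1", "title": "already posted"},
        {"id": "b2", "title": "new entry"},
        {"id": "b2", "title": "new entry"},  # duplicate in the same scrape
    ]
    for entry in find_new_entries(batch, seen):
        print("would repost:", entry["title"])
```

Between runs the set of seen ids would need to be saved somewhere (a small JSON file would do), otherwise every restart reposts the whole site.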

        • stoy@lemmy.zip
          English
          arrow-up
          8
          ·
          3 days ago

          If you do this, you may want to create a dedicated community for it to avoid flooding other communities, and only post weekly or monthly summaries elsewhere.

          Just a tip to avoid getting banned for spamming communities.

          • FlordaMan@lemmy.world
            arrow-up
            4
            ·
            3 days ago

            Yes, of course! It’s also against (at least lemmy.world’s) instance rules to post as a bot without express permission from the community mods.