Darkscribes Community

Can anyone tell me what is #NodeBB and why is it scraping and republishing fediverse content without consent?

41 Posts 10 Posters 72 Views
  • thisismissem@hachyderm.io wrote:

    @dentangle @Gargron @jonny @onepict right, best practice is to not make remote content directly viewable without authentication (but it may still appear in thread/reply views without authentication)

    dentangle@chaos.social wrote (#31):

    @thisismissem @Gargron @jonny @onepict yes, where "best practice" == "if I don't want my instance defederated by the majority of the fediverse"

      thisismissem@hachyderm.io wrote (#32):

      @dentangle @Gargron @jonny @onepict the source of that best practice is more around rehosting random content and consequently having liability for that content.

        dentangle@chaos.social wrote (#33):

        @thisismissem @Gargron @jonny @onepict

        That may be the case for some instance admins, but most users are not admins.

        The bigger issue is that feeding fediverse toots into search engines violates conventions and the expectations of most users. That's what causes fedi-riots every time some bright spark does it.

          dentangle@chaos.social wrote (#34):

          Hi @julian

          I know you're very busy sitting on panels at #Fedicon and talking about how to make the fediverse better. Great.

          Unfortunately you are still running a scraper that is feeding search engines.

          You've been posting from the con (tip: we use alt text on pictures here on the fediverse), so I know you're online.

          You're following me, so you'll have seen my question. @Gargron has spoken to you too I believe.

          A day later, no acknowledgement or apology or fix or promise of a fix. Why?

            julian@community.nodebb.org wrote (#35):

            Hi dentangle@chaos.social, I haven't been at a laptop this entire day since 7am this morning.

            Around then I added a change to the link tags sent for remote profiles so that they point to the canonical source (your actual profile).

            I'll likely just put in a redirect to your profile so it won't be accessible.
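
To illustrate the two approaches mentioned here (a canonical link tag and a redirect), a minimal Python sketch follows. It is not NodeBB's code: the function names and the example URL are hypothetical, and the 302 status code is an assumption, since the post does not say which kind of redirect would be used.

```python
from html import escape


def canonical_link_tag(canonical_url: str) -> str:
    """Build a <link rel="canonical"> tag pointing at the profile's home server."""
    # Escape the URL so it is safe inside a double-quoted HTML attribute.
    return f'<link rel="canonical" href="{escape(canonical_url, quote=True)}" />'


def redirect_response(canonical_url: str) -> tuple[int, dict[str, str]]:
    """Return (status, headers) that send a visitor to the original profile."""
    # Assumption: a temporary redirect (302). A permanent one (301) is also plausible.
    return 302, {"Location": canonical_url}


# Hypothetical remote profile URL, used only for illustration.
profile_url = "https://remote.example/@alice"
tag = canonical_link_tag(profile_url)
status, headers = redirect_response(profile_url)
print(tag)
print(status, headers)
```

The canonical tag only hints to crawlers which URL is authoritative, while a redirect means the local copy of the profile is never served at all.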

            • julian@community.nodebb.org wrote (#36):

              [email protected] I appreciate your civility so far while I work through what needs to be done about this.

                deadsuperhero@social.wedistribute.org wrote (#37):

                @dentangle@chaos.social Quick question, what makes you think this is a scraper? NodeBB is forum software that implements ActivityPub and federates using the protocol.

                  dentangle@chaos.social wrote (#38):

                  @deadsuperhero It doesn't matter where the data is coming from, the effect is the same. Scraping done over AP is still scraping. The data (retrieved over AP in this case) is being republished without a "noindex" tag so it is being fed into search engines, including posts on your peertube server.
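
For readers unfamiliar with the "noindex" tag mentioned above, here is a minimal Python sketch of how a server might mark only remotely sourced pages as not indexable. The function name and the `is_remote` flag are hypothetical and this is not how NodeBB implements it. The robots meta tag and the `X-Robots-Tag` response header are the two standard ways of asking search engines not to index a page.

```python
def robots_directives(is_remote: bool) -> dict[str, str]:
    """Return the meta tag and HTTP header a page could emit for crawlers."""
    if not is_remote:
        # Local content: emit nothing, so default indexing behaviour applies.
        return {"meta": "", "header": ""}
    return {
        # Placed in the page <head>; asks compliant crawlers not to index the page.
        "meta": '<meta name="robots" content="noindex" />',
        # Equivalent directive sent as an HTTP response header.
        "header": "X-Robots-Tag: noindex",
    }


remote = robots_directives(is_remote=True)
local = robots_directives(is_remote=False)
print(remote["meta"])
print(remote["header"])
```

Both forms are requests rather than enforcement: they only work with crawlers that choose to honour them.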

                  • dentangle@chaos.social wrote (#39):

                    @julian Thank you for your response and taking this seriously.

                    Please keep everyone informed. Feeding fediverse data to search engines (even accidentally, as this appears to be) is a breach of trust. How you handle this now is likely to be remembered by the fediverse for a long time.

                      julian@community.nodebb.org wrote (#40):

                      dentangle@chaos.social the noindex tag has been added to all remote profiles.
