Federated services have always had privacy issues but I expected Lemmy would have the fewest, but it’s visibly worse for privacy than even Reddit.

  • Deleted comments remain on the server but hidden to non-admins, the username remains visible
  • Deleted account usernames remain visible too
  • Anything remains visible on federated servers!
  • When you delete your account, media does not get deleted on any server
  • HorseFD@beehaw.org
    link
    fedilink
    English
    arrow-up
    6
    ·
    2 years ago

    So just to clarify this point:

    Anything remains visible on federated servers!

    If I delete a comment on beehaw.org, it doesn’t get deleted when accessed from another Lemmy instance that federates with Beehaw?

    • NaN@lemmy.blahaj.zone
      link
      fedilink
      English
      arrow-up
      9
      ·
      2 years ago

      When you delete it your instance tells others that it was deleted, but it cannot force them to follow through.

      • philpo@feddit.de
        link
        fedilink
        English
        arrow-up
        2
        ·
        2 years ago

        Which is indeed a problem as it makes it impossible for any admin to host in the EU or for EU citizens, in theory. GDPR §7 makes it very clear that complete deletion of all personal data (and yes,a Lemmy comment is personal data) must be facilitated by the original data collection point.

        • Kajo@pawb.social
          link
          fedilink
          English
          arrow-up
          5
          ·
          2 years ago

          it can’t make it impossible. If facebook sold data to amazon, so now amazon has a copy, and then facebook’s user asks their data to be deleted, facebook can’t just march into amazon’s servers and delete the data themselves. The best they can do is send a formal notice to amazon requesting it be deleted, which sounds like what lemmy does. At this point it’s up to the federated server if they comply with the law…

          • philpo@feddit.de
            link
            fedilink
            English
            arrow-up
            1
            ·
            2 years ago

            Actually that is exactly what the GDPR stipulates. In your example Facebook needs a data processing agreement that ensures that all rights of the data owners are secured and the GDPR is followed. Facebook is liable here, not Amazon - the user must explicitly NOT ask Amazon to delete as the user may not even know where the data went to/should not be bothered to write requests to a huge amount of different data processing locations.

            But, @hikaru755@feddit.de added another interesting point: The Instance may or may not be seen as a single data processing entity that does not voluntarily hands over data to other instances. That could indeed be a reasonable cause as e.g. data scrubbers are not within the sphere of influence of e.g. a service publicly displaying data. But as the whole network is build on interconnected nodes I wouldn’t count on it if that reasoning would fly in front of a court. It may. Or it may not.

            • Kajo@pawb.social
              link
              fedilink
              English
              arrow-up
              2
              ·
              2 years ago

              In this case though, would it not be that then if Facebook did have a processing agreement with Amazon with which they communicate information, and this agreement stipulates that (in order to comply with GDPR) data they sell to amazon must be deleted upon request, and Amazon does NOT do so, this would make amazon liable for breach of contract instead of facebook being liable for breach of GDPR?

              If so, all fediverse instances would need is a copy-paste agreement when two instances federate that data from one must be deleted on the other upon request.

              • philpo@feddit.de
                link
                fedilink
                English
                arrow-up
                2
                ·
                edit-2
                2 years ago

                Partially right - Amazon would be liable, but not towards the data owner but Facebook. The data owner sues Facebook, Facebook then sues Amazon.

                A copy&paste agreement is the first (and from my point of few most important step). Personally I would also integrate a automatic mechanism that deletes data (e.g. the delete request gets automatically federated) and defederates instances that do not follow them globally. Sadly this is still not enough - data handling in the US and other jurisdictions with similar bad privacy laws is also a problem, see the recent Facebook case and Schremp2. But tbh I have no idea how to solve that.

                Lemmy can, by definition, not be GDPR obtain full GDPR compliance. We should make sure that best effort is ensured, especially with the right of deletion and the right to “know”(where data is stored), but also consider lobbying towards a reformed law for the federated use cases.

            • ZENITHSEEKER@feddit.uk
              link
              fedilink
              English
              arrow-up
              1
              ·
              2 years ago

              The originating instance definitely cannot be held responsible for failing to force a separate instance in another country to delete its cached copy of user data imo. I think what is more likely is that EU courts could force European Jimmy instances to only federate with GDPR-compliant instances.

        • Pulp@lemmy.dbzer0.com
          link
          fedilink
          English
          arrow-up
          2
          ·
          2 years ago

          From what I understand instance 1 has to delete data if requested, but instance 2 has no obligation to unless requested. Just like data remains archived in sites like internet archive or other private archives. Just like it works on reddit or any other site currently.

        • hikaru755@feddit.de
          link
          fedilink
          English
          arrow-up
          1
          ·
          edit-2
          2 years ago

          I don’t think it’s quite that bad/simple. Viewing your main instance as the Controller and other instances as Processors in GDPR terms won’t work, because instances don’t have the necessary control over each other for that, as you say.

          However, you could circumvent that issue by making the case that each instance actually acts as an independent Controller. By participating on a federated service, you are explicitly agreeing to the data you provide (your profile, posts, comments, etc.) being made public and shared with other compatible services. That should be enough as the basis for other instances to reasonably assume you want your data to be processed by them, which (I think, not a lawyer) is sufficient justification for processing the data independently, as long as it’s in line with how you generally expect the fediverse to work.

          This would mean that each federated instance is its own, independent entity that processes your data, and to make use of your rights under GDPR, you need to do that with each of them individually. They effectively become their own “original data collection point”, in your words, even if that data collection was not explicitly triggered by you.

          The only thing missing for that to be legal (again, in my layman’s view) is transparency about who’s processing your data and how, which is necessary under GDPR. Every instance that receives your data via federation would need to let you know about that, and make available to you information on how exactly your data is processed and how you can make use of your rights under GDPR with them. That, in turn, would probably be easiest if the protocol spoken between fediverse servers were extend with automated and standardized ways to propagate GDPR requests from your home instance to any other instance that is processing your data, so that you don’t have to actually deal with every single server yourself to get your rights enacted. Defederation in the meantime might be a problem, but there’s ways around that, too.

          • philpo@feddit.de
            link
            fedilink
            English
            arrow-up
            1
            ·
            2 years ago

            The first point is indeed the only one I see atm that might be working. If one can reasonably argue that the node/instance is not voluntarily giving away the data and has no way to prevent that without massively hampering operation of the plattform it might be acceptable in front of a court.

            Again: With a lots of might/could/ifs.

            Because simply the fact that the nodes themselves are build for connecting to each other and very much do so (and you can effectively block other nodes from federating your content to a extent) speaks against that reasoning. But it worked for e.g. data scrubbers,etc.

            However, you could circumvent that issue by making the case that each instance actually acts as an independent Controller. By participating on a federated service, you are explicitly agreeing to the data you provide (your profile, posts, comments, etc.) being made public and shared with other compatible services. That should be enough as the basis for other instances to reasonably assume you want your data to be processed by them, which (I think, not a lawyer) is sufficient justification for processing the data independently, as long as it’s in line with how you generally expect the fediverse to work.

            That sadly explicitly does not work. Any consent given must be under definitive circumstances - a ‘card blanc’ consent is not possible under the GDPR. You must absolutely know where, by whom and what for your data is processed or transfered. And the initial data processor still has the obligation for a data processing agreement.

        • NaN@lemmy.blahaj.zone
          link
          fedilink
          English
          arrow-up
          3
          ·
          2 years ago

          It could, but actually policing it would be difficult. I don’t think there is any “yeah I’ll do that” response and even if there is an instance could say it will delete it and still do nothing.

          • Prunebutt@feddit.de
            link
            fedilink
            English
            arrow-up
            2
            ·
            2 years ago

            You could defederate with instances running versions that don’t delete federated posts. Removing compatipility with older protocol implementations is not unheard of.

            • t3rmit3@beehaw.org
              link
              fedilink
              English
              arrow-up
              2
              ·
              edit-2
              2 years ago

              while this is certainly feasible, it is just a compliance checkmark of “doing your best”. It wouldn’t actually prevent someone attempting to persist that data. For example, I just need to maintain an insert-only copy of my deletion-compliant lemmy instance DB, and none of the deletions would be reflected on that.

              I could then host that copy publicly on some unrelated lemmy instance, and without systematically de-federating from all other instances, you wouldn’t know which one was retaining the data.