• 3 Posts
  • 98 Comments
Joined 1 year ago
Cake day: June 11th, 2023

  • What’s so bad about giving AI models something to learn on?

    From a user's point of view? A lot. So far, AI has made itself the champion of creating fakes: fake news, fake pictures, fake videos, fake history, fake identities. Do you think that AI will be used for your own good? Do you think that your private data is farmed for your own good? I don’t.

    I posted an example about fake identities and fake posters on Twitter. This is the end goal. This is where the money generated by the AI will come from.

    That way you could detect and address rogue scrubbers while still working with LLM creators who are open to an honest training integration. And if your company can’t really detect the difference between users and LLM crawlers after implementing something like this, well, then those crawlers don’t really affect the company as much as the CEOs would like to pretend.

    Twitter and Reddit probably want to be their own LLM creators. They don’t want to leave this market to another LLM. Also it doesn’t take a lot of API calls to generate the content that will astroturf your product.

    Anyway, the cat is out of the bag and this data will be harvested. Brands will astroturf their products using AI processes. People are not stupid and will realize the trick being played on them. We are probably heading toward platforms that require fully authenticated access.


  • For reddit and twitter it’s also induced by the threat of AI. Twitter and reddit host a lot of content that is organized, sorted and coherent. It’s invaluable for training an AI, and these companies don’t want to let it go for free. They want control over it, so they are making it very hard for AI companies to farm their content. It’s happening now because AI companies are probably rushing to copy as much data as possible before laws are passed to limit them.

    It will be the same for the fediverse: our content will be scanned by AIs. Our content is freely visible, organized, sorted and scored. We should be careful about that. If you are not a professional publisher or a public person, then you should probably think about rotating your username as often as possible.

    edit: But also, with the rise of tiktok, a lot of countries are now suspicious of the soft power of those apps and are ready to legislate against them. The EU already did: it has voted fines against them and regularly gets money out of them. The taboo is gone; you can attack those companies and it works. They were supposed to be out of reach, but they are not.

    Also, there is no genius in Twitter; as far as I know they have no patents on anything. If someone manages to become more popular than them on the same principle, then twitter is done. Gravity will do the rest and users will move to a different platform. People are using it because people are using it. So the model is fragile and the value is questionable.


  • Are you sure that you are replying to the right post, or are you just jumping on the bandwagon of another post? You address nothing of what I mentioned about the federation system and the solution I gave.

    I don’t think you quite realize how much craziness is in the world at large. There have been instances of pizzagate levels of craziness in my home country, as well as in the other countries whose news I follow.

    Give some examples that show the magnitude of pizzagate. I’m not talking about a qanon follower making noise for views, I’m talking about pizzagate with everything that it includes.

    You also don’t seem to grasp how discussions on the Internet work. People will post about things that interest them. Telling people not to post things that are of interest to them because you don’t like it is counterintuitive and borderline offensive.

    I thought for a moment that you were serious; I was wrong. Or you replied to the wrong post.


  • 100 per minute is faster than the speed at which I deleted my history. So I guess we can still help people delete their history.

    I used the archive that is being shared around, extracted my posts to get the IDs of my comments, and deleted those at about 30 per minute. But I guess that if we rebuild a database with the author as the key, then we can pretty quickly return a list of IDs for a given author. Users can then feed this list to a python script and delete their comments themselves.

    What I couldn’t do in time was edit the posts, as I ran into some weird index errors that I couldn’t get around.
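
    The author-keyed lookup described above could look roughly like this. It is only a sketch: it assumes the archive is one JSON object per line with `author` and `id` fields (I have not checked this against the real dump), and `build_author_index` and `delay_between_calls` are names made up for illustration. It stops at producing the list of IDs; the actual delete calls are left out.

```python
import json
from collections import defaultdict


def build_author_index(lines):
    """Map each author to the list of comment IDs found in the archive lines.

    Assumption: every non-empty line is a JSON object with "author" and "id".
    """
    index = defaultdict(list)
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines in the dump
        record = json.loads(line)
        index[record["author"]].append(record["id"])
    return index


def delay_between_calls(rate_per_minute):
    """Seconds to wait between requests to stay at the given rate."""
    return 60 / rate_per_minute


# Tiny made-up archive, standing in for the real dump.
sample_archive = [
    '{"id": "c1", "author": "alice", "body": "first"}',
    '{"id": "c2", "author": "bob", "body": "second"}',
    '{"id": "c3", "author": "alice", "body": "third"}',
]

index = build_author_index(sample_archive)
alice_ids = index.get("alice", [])  # .get() avoids creating empty entries
pause = delay_between_calls(100)    # the 100 per minute mentioned above

print(alice_ids, pause)
```

    Building the index once means each lookup afterwards is a single dictionary access, so returning the list for one author is cheap even if the archive is large. For a dump that doesn’t fit in memory, the same idea would need a real database instead of a dict.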