[Request] Share any interesting LLM prompt engineering tips and references please, especially anything on individualized education

TheOtherJake@beehaw.org · 11 months ago

I won’t touch the proprietary junk. Big tech “free” usually means street corner data whore. I have a dozen FOSS models running offline on my computer though. I also have text to image, text to speech, am working on speech to text, and probably my ironman suit after that.

These things can’t be trusted though. An LLM is just a next-word statistical prediction system combined with a categorization system. There are ways to make an LLM trustworthy, but they involve offline databases and prompting for direct citations, which is different from chat prompt structures.
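A rough sketch of what "offline database plus prompting for direct citations" could look like, in plain Python. Everything here is an assumption for illustration: the helper names (`top_passages`, `build_cited_prompt`), the made-up sample passages, and the crude word-overlap ranking, which stands in for whatever real retriever you would use. It only builds the prompt text; sending it to a model is left out.

```python
import re

def tokenize(text):
    """Lowercase a string and return its set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def top_passages(question, passages, k=2):
    """Rank passages by shared words with the question (a crude stand-in for a real retriever)."""
    q = tokenize(question)
    ranked = sorted(passages, key=lambda p: len(q & tokenize(p["text"])), reverse=True)
    return ranked[:k]

def build_cited_prompt(question, passages, k=2):
    """Build a prompt that restricts the model to the given sources and asks for citations."""
    lines = [
        "Answer using ONLY the sources below.",
        "Quote the supporting sentence and give its source id in brackets.",
        "If the sources do not contain the answer, say so.",
        "",
    ]
    for p in top_passages(question, passages, k):
        lines.append(f"[{p['id']}] {p['text']}")
    lines += ["", f"Question: {question}", "Answer:"]
    return "\n".join(lines)

# Hypothetical offline "database": a few hand-written passages.
PASSAGES = [
    {"id": "doc1", "text": "GGML is a tensor library used to run quantized models on CPUs."},
    {"id": "doc2", "text": "Peertube is a federated video platform."},
    {"id": "doc3", "text": "Quantized 4 bit models trade some accuracy for lower memory use."},
]

prompt = build_cited_prompt("How do quantized models reduce memory use?", PASSAGES)
print(prompt)
```

The point of the structure is that every claim in the answer can be checked against a bracketed source id, which a normal chat prompt does not give you.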

TheOtherJake@beehaw.org · 11 months ago

In the best years of YT, before 2017, there was an advanced maker and DIY tech culture revolving around people sharing projects and content just to share it. That died. This all-pro CC thing has an upside to some extent, but it also lobotomized YT. Peertube might eventually get to the same kind of utility level, but it needs a lot of time and momentum to get there.

TheOtherJake@beehaw.org · 11 months ago

Fire fox, Fire fox;

Fuck you Google;

We’re throwing rocks.

Alpha bet, Alpha bet;

Farming data is,

stalking/theft.

TheOtherJake@beehaw.org · 11 months ago

Have you seen The Great Gatsby with Wizard too? That’s what always comes up when mine goes too far. I’m working on compiling llama.cpp from source today. I think that’s all I need to be able to use some of the other models like Llama2-70B derivatives.

The code for llama.cpp is only an 850 line python file (not exactly sure how python=CPP yet but YOLO I guess; I just started reading the code from a phone last night). This file is where all of the prompt magic happens. I think all of the easy checkpoint model stuff that works in Oobabooga uses llama-cpp-python from pip. That hasn’t had any GitHub repo updates in 3 months, so it doesn’t work with a lot of newer and larger models. I’m not super proficient with Python. It is one of the things I had hoped to use AI to help me learn better, but I can read and usually modify someone else’s code to some extent. It looks like a lot of the functionality (likely) built into the more complex chat systems like Tavern AI is just mixing the chat, notebook, and instruct prompt techniques into one ‘context injection’ (if that term makes any sense).
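To make the ‘context injection’ idea concrete, here is a minimal sketch of how persona text, chat history, and an instruct-style request might get merged into one prompt. This is my guess at the general shape, not Tavern AI's or Oobabooga's actual code. The function name `build_context`, the character budget, and the Alpaca-style `### Instruction:` / `### Response:` markers are all assumptions.

```python
def build_context(system, history, instruction, max_chars=600):
    """Combine persona text, recent chat turns, and an instruct request into one prompt."""
    header = f"{system}\n\n"
    footer = f"### Instruction:\n{instruction}\n\n### Response:\n"
    # Reserve room for the fixed parts plus the newline that separates chat from footer.
    budget = max_chars - len(header) - len(footer) - 1
    kept = []
    # Walk the history newest-first so the most recent turns survive trimming.
    for speaker, text in reversed(history):
        line = f"{speaker}: {text}\n"
        if len(line) > budget:
            break
        kept.append(line)
        budget -= len(line)
    chat = "".join(reversed(kept))  # restore chronological order
    return header + chat + "\n" + footer

system = "You are a patient tutor."
history = [
    ("User", "Tell me about llamas."),
    ("Bot", "Llamas are domesticated camelids."),
    ("User", "And alpacas?"),
]
instruction = "Continue the conversation as Bot."

full = build_context(system, history, instruction, max_chars=1000)
small = build_context(system, history, instruction, max_chars=165)
print(small)
```

With the small budget the oldest turn falls off while the persona and the instruction stay put, which is roughly the behavior you see when a long chat starts "forgetting" its beginning. A real implementation would count tokens, not characters.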

The most information I have seen someone work with independently offline was using langchain with a 300 page book. So I know at least that much is possible. I have also come across a few examples of people using langchain with up to 3 PDF files at the same time. There is also the MPT model with up to 32k context tokens, but it looks like it needs server-class RAM in the hundreds of GB to function.
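The reason a 300 page book works at all with a small context window is that the text gets split into chunks first, and only the relevant chunks are fed to the model. A plain-Python sketch of that splitting step follows. It is not langchain's API; `chunk_words`, the chunk size, and the overlap are illustrative assumptions, and the "book" is just generated filler words.

```python
def chunk_words(text, chunk_size=200, overlap=40):
    """Split text into word-based chunks where neighbors share `overlap` words."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        # Stop once this chunk has reached the end of the text.
        if start + chunk_size >= len(words):
            break
    return chunks

# Filler stand-in for a real book: 1000 numbered words.
book = " ".join(f"word{i}" for i in range(1000))
chunks = chunk_words(book, chunk_size=200, overlap=40)
print(len(chunks), "chunks")
```

The overlap is there so a sentence that straddles a chunk boundary still appears whole in at least one chunk.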

I’m having trouble with distrobox/conda/nvidia on Fedora Workstation. I think I may start over with Nix soon, or I am going to need to look into proxmox, virtualization or go back to an immutable base to ensure I can fall back effectively. I simply can’t track down where some dependencies are getting stashed and I only have 6 distrobox containers so far. I’m only barely knowledgeable enough in Linux to manage something like this well enough for it to function. Suggestions welcome.

TheOtherJake@beehaw.org · 11 months ago

[Request] Share any interesting LLM prompt engineering tips and references please, especially anything on individualized education

TheOtherJake@beehaw.org · 11 months ago

WizardLM 30B at 4 bits with the GGML version on Oobabooga runs almost as fast as Llama2 7B on just the GPU. I set it up with 10 threads on the CPU and ~20 layers on the GPU. That leaves plenty of room for a 4096 context with a batch size of 2048. I can even run a 2GB Stable Diffusion model at the same time with my 3080’s 16GB of VRAM.

Have you tried any of the larger models? I just ordered 64GB of RAM. I also got kobold mostly working. I hope to use it to try Falcon 40B. I really want to try a 70B model at 2-4 bit and see how its accuracy is.
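Some back-of-the-envelope arithmetic for why this split fits, and what a 70B model would need. This is a simplified estimate under stated assumptions: weights only (no context cache or quantization overhead), exactly 30 or 70 billion parameters, and 60 layers for the 30B model, which is my assumption about the LLaMA 30B architecture and not something measured here.

```python
def weights_gb(params_billion, bits_per_weight):
    """Approximate size of the quantized weights in GB (10^9 bytes)."""
    return params_billion * bits_per_weight / 8

def gpu_share_gb(total_gb, gpu_layers, total_layers):
    """Portion of the weights held on the GPU if layers are roughly equal in size."""
    return total_gb * gpu_layers / total_layers

size_30b = weights_gb(30, 4)                 # 30B parameters at 4 bits
on_gpu = gpu_share_gb(size_30b, 20, 60)      # ~20 of an assumed 60 layers offloaded
size_70b_2bit = weights_gb(70, 2)
size_70b_4bit = weights_gb(70, 4)

print(f"30B @ 4-bit: {size_30b} GB total, about {on_gpu} GB on the GPU")
print(f"70B @ 2-bit: {size_70b_2bit} GB, @ 4-bit: {size_70b_4bit} GB")
```

By this estimate about 5 GB of weights sit on the GPU, which is consistent with having room left over for the context and a 2GB Stable Diffusion model, and a 70B model at 2-4 bits lands somewhere between roughly 17.5 and 35 GB before overhead.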

TheOtherJake@beehaw.org · 11 months ago

This is how I use the internet.

TheOtherJake@beehaw.org · edit-2 11 months ago

Cookies are not needed. They are shifting the security onto the user. Secure the information on the server just like any other business. Offloading onto the client is wrong. It leads to ambiguity and abuses. Visiting a store and a business on the internet are no different. My presence gives no right to my person, searches, or tracking in the location or outside of it. Intentions are worthless. The only thing that matters is what is possible and practiced. Every loophole is exploited and should be mitigated. The data storage and coding practices must change.

TheOtherJake@beehaw.org · 11 months ago

Nah, it should be the default state of affairs. Data mining is stalking and theft. It centers around very poor logic and decisions.

Things like browser cookies are criminal garbage. Storing anything on a user’s computer is stalking. Draw the parallel here; if you want to shop in any local store, I want you to first tell me everything you are wearing and carrying in a way that I can tell every possible detail about it, tell where you came from before you visited this store, where you are going next. They also want to know everything you looked at, how you react to changes in items presented to you and changes in prices. They want enough information to connect you across stores based on your mode of transportation, and have enough data to connect your habits over the last two decades.

Your digital existence should not be subject to slavery either. Ownership over ourselves is a vital aspect of freedom. Privacy is about ownership and dominion. If you dislike all the digital rights management and subscription services nonsense, these exist now as a direct result of people neglecting ownership. In the big picture, this path leads all of humanity back into another age of feudalism. The only difference between a serf and a citizen is ownership over property and tools. Everything happening right now is a battle over a new age of slavery. “You will own nothing and you will be happy about it.” Eventually this turns into “Your grandchildren will own nothing and say nothing or they will be dead about it.” What you do about your privacy now will be a very big deal from the perspective of future generations.

TheOtherJake@beehaw.org · 11 months ago

I just tried it a few hours ago. Indeed, it is quite good. I knew it when an NSFW prompt test on an uncensored model generated a stable diffusion picture of a robot skeleton and a snarky reply. Like, yay we finally have a bright spot with this one.

TheOtherJake@beehaw.org · 11 months ago

Not exactly. Stupid people with advanced tools make stupid outputs. Venture capital is pushing the propaganda sauce hard and a lot of stupid people are jumping on AI as a corporate trend. These are the idiots.

The tools are next level. We are on the edge of this tech becoming a really big deal. There are several research papers making breakthroughs regularly and making double-digit percentage improvements on efficiency and accuracy. The reason it is a big deal is that you can have around 1/4 of the knowledge of the entire internet running on hardware as powerful as a current flagship phone. Sure it lies around 1/2 the time, but these are problems that are being solved. Like, the latest and greatest models are ancient history in a matter of 2-3 weeks. To be honest, have a casual conversation with an offline and uncensored LLM. You may know it is lying from time to time, but if you’re being objective, so are most humans you encounter under casual circumstances. The sociological function and potential value of this tech is pretty powerful medicine. Like if you need someone to talk to, or to talk out an issue in private, this is a way to make that happen.

TheOtherJake@beehaw.org · 11 months ago

What hardware does it take to run a 30B?

TheOtherJake@beehaw.org · edit-2 11 months ago

Watch this ~1hr long video when you get the chance. He’s using the stalkerware LLM, but he also describes how to use langchain to parse data like what you are wanting to do.

https://piped.video/watch?v=dXxQ0LR-3Hg&t=772

TheOtherJake@beehaw.org · 11 months ago

Google is broken because AI is making it obsolete. I bet in 10 years google will be a historical footnote.

TheOtherJake@beehaw.org · 11 months ago

Who here is messing with FOSS AI? What ya playing with?

TheOtherJake@beehaw.org · 1 year ago

Just got a new Gigabyte. The bootloader is shit, and combined with shitvidia it makes a terrible combination to avoid. I expect most companies are doing the same bullshit with TPM/Secure boot. Everything proprietary is criminal theft.

TheOtherJake@beehaw.org · 1 year ago

News on Wagner’s status and Prigozhin, Russia, the Pentagon

TheOtherJake@beehaw.org · 1 year ago

Giant Sloth bones may be evidence of humans in the Americas as early as 27k years ago

TheOtherJake@beehaw.org · 1 year ago

In a nutshell, a couple of drivers took me out on a bicycle on 2/26/14, with a broken neck and back. The bones healed but I have some kind of undiagnosed soft tissue damage that makes it impossible for me to hold posture for more than around one hour. It doesn’t matter if I’m sitting or standing. If I push past this, I am a useless zombie. It will also take me somewhere between a few days and a month to be able to sleep for more than an hour or two after pushing myself to stay upright for too long.

Naturally doctors in the US don’t have a clue what to do with me and neither does disability court. I am stuck living as a burden to my parents waiting for them to die so that I can take up occupation in a ditch somewhere as is the American way.

TheOtherJake@beehaw.org · 1 year ago

I don’t know what to say really. I’ve been hesitant to engage with this place, but didn’t want to leave you hanging without a reply.

I spend most of my day stuck lying in a bed or on a couch after lots of broken stuff in 2014. No one has been able to say exactly what is wrong with me as far as what didn’t heal. I just can’t hold posture sitting or standing for more than around one hour. I’ve slept doing a bad impression of a rotisserie chicken for nearly 10 years. Tired would be an understatement. I also only really go out normally for medical appointments. I can do a daily physical exercise routine. That is what has held me together so far.

TheOtherJake@beehaw.org · 1 year ago

I was a buyer for a chain of high end bike shops for many years. Amazon really only sells junk products. Any real quality brands of niche products can’t support Amazon and the typical brick and mortar business inventory structure. Like, I spent between $100k-$500k in preseason bike brand commitments for 3 stores. If any of those brands decided to allow sales on Amazon I would drop them immediately. Multiply this by every bike shop that exists. This is more than Amazon could compete with by a long shot. The issue is that every buyer in a shop knows what they are able to sell effectively and buys accordingly. I tailored my orders for every shop independently. It would be impossible for Amazon to predict and fund high end bikes at this scale.

“So what,” you say, “it’s just bikes.” No it is not. The bike brands are usually part of a group of brands that include several parts, clothing, and accessory products. These are part of preseason commitments with the bike brands too. So all of these are not sold on Amazon either. This is the case with most things: the best or even decent stuff is not sold on Amazon.

The worst thing with Amazon is that they aggregate all identical products in their warehouses. This makes it trivial for a seller to insert fake goods into a product pool and it is completely untraceable back to them.

TheOtherJake@beehaw.org · 1 year ago

Article seems super right wing. Maybe just stop the right from stealing from the people. I applaud them for saying hell no to BS reforms and pressures to impoverish the populace. I’m cheering for the kids with the jerry cans.

TheOtherJake@beehaw.org · 1 year ago

It could also be a way to shake out any elements of the Russian military or other paramilitary groups that are looking for ways to get out of Ukraine. Like, maybe the attacks on Wagner coming from the Russian army are rogue elements where Wagner is obviously the most effective target for friendly fire.

TheOtherJake@beehaw.org · 1 year ago

“we support the winning side”

TheOtherJake@beehaw.org · edit-2 1 year ago

Wagner forces are said to be in control of military facilities in the city of Voronezh, north of Rostov and en route to Moscow