Not all ‘open source’ AI models are actually open: here’s a ranking

ylai@lemmy.ml · edit-2 5 months ago

Not all ‘open source’ AI models are actually open: here’s a ranking

ylai@lemmy.ml · 5 months ago

Just for reference, a few years back, (ex-Microsoft) David Plummer had this historical dive into the (MIPS) origin of the blue color, and how Windows is not blue anymore: https://youtu.be/KgqJJECQQH0?t=780

ylai@lemmy.ml · 5 months ago

Linux's New DRM Panic "Blue Screen of Death" In Action

ylai@lemmy.ml · 5 months ago

Computing firm Raspberry Pi pops 38% in rare London market debut

ylai@lemmy.ml · edit-2 5 months ago

Stanford University Students Accused of Plagiarizing AI Model

ylai@lemmy.ml · 5 months ago

Study finds 268% higher failure rates for Agile software projects

ylai@lemmy.ml · edit-2 6 months ago

Likely due to being a prototype. Production laptops from Tuxedo tend to have the “TUX” penguin in a circle logo on the Super key by default. They also have been offering custom engraved keyboard (even with the entire keyboard engraved from scratch to the customer’s specifications) as added service, so probably there will be suppliers or production facility to change the Super key.

By the way, there was one YouTube channel that ended up ordering a laptop with Windings engraving from them: https://youtu.be/nidnvlt6lzw?t=186

ylai@lemmy.ml · 6 months ago

Schenker shows off a Linux laptop prototype with Snapdragon X Elite at Computex 2024

ylai@lemmy.ml · 6 months ago

AI training data has a price tag that only Big Tech can afford

ylai@lemmy.ml · 6 months ago

Godot 4.3 Beta 1 Released With Native Wayland Support

ylai@lemmy.ml · edit-2 6 months ago

Why RISC-V must get its messaging right on open standard vs open source

ylai@lemmy.ml · edit-2 6 months ago

If you want RTX though (does it work properly on Linux?)

Yes it does. For example, Hans-Kristian Arntzen declared the DirectX Raytracing (DXR) implementation in VKD3D-proton as feature complete in February 2023 (https://github.com/HansKristian-Work/vkd3d-proton/issues/154#issuecomment-1434761594). And since November 2023/release 2.11, VKD3D-proton in fact runs with DXR enabled by default (https://github.com/HansKristian-Work/vkd3d-proton/releases/tag/v2.11).

ylai@lemmy.ml · 6 months ago

GNOME Shell & Mutter Broke Their Good Faith With Ubuntu

ylai@lemmy.ml · 6 months ago

‘It’s not vital to spend five days a week in the office’: the bank boss who works from home

ylai@lemmy.ml · 6 months ago

Red Hat middleware takes a back seat in strategic shuffle

ylai@lemmy.ml · 6 months ago

Linux 6.10 Honors One Last ReiserFS Request Made By Hans Reiser

ylai@lemmy.ml · edit-2 6 months ago

How does this analogy work at all? LoRA is chosen by the modifier to be low ranked to accommodate some desktop/workstation memory constraint, not because the other weights are “very hard” to modify if you happens to have the necessary compute and I/O. The development in LoRA is also largely directed by storage reduction (hence not too many layers modified) and preservation of the generalizability (since training generalizable models is hard). The Kronecker product versions, in particular, has been first developed in the context of federated learning, and not for desktop/workstation fine-tuning (also LoRA is fully capable of modifying all weights, it is rather a technique to do it in a correlated fashion to reduce the size of the gradient update). And much development of LoRA happened in the context of otherwise fully open datasets (e.g. LAION), that are just not manageable in desktop/workstation settings.

This narrow perspective of “source” is taking away the actual usefulness of compute/training here. Datasets from e.g. LAION to Common Crawl have been available for some time, along with training code (sometimes independently reproduced) for the Imagen diffusion model or GPT. It is only when e.g. GPT-J came along that somebody invested into the compute (including how to scale it to their specific cluster) that the result became useful.

ylai@lemmy.ml · edit-2 6 months ago

This is a very shallow analogy. Fine-tuning is rather the standard technical approach to reduce compute, even if you have access to the code and all training data. Hence there has always been a rich and established ecosystem for fine-tuning, regardless of “source.” Patching closed-source binaries is not the standard approach, since compilation is far less computational intensive than today’s large scale training.

Java byte codes are a far fetched example. JVM does assume a specific architecture that is particular to the CPU-dominant world when it was developed, and Java byte codes cannot be trivially executed (efficiently) on a GPU or FPGA, for instance.

And by the way, the issue of weight portability is far more relevant than the forced comparison to (simple) code can accomplish. Usually today’s large scale training code is very unique to a particular cluster (or TPU, WSE), as opposed to the resulting weight. Even if you got hold of somebody’s training code, you often have to reinvent the wheel to scale it to your own particular compute hardware, interconnect, I/O pipeline, etc… This is not commodity open source on your home PC or workstation.

ylai@lemmy.ml · 6 months ago

The situation is somewhat different and nuanced. With weights there are tools for fine-tuning, LoRA/LoHa, PEFT, etc., which presents a different situation as with binaries for programs. You can see that despite e.g. LLaMA being “compiled”, others can significantly use it to make models that surpass the previous iteration (see e.g. recently WizardLM 2 in relation to LLaMA 2). Weights are also to a much larger degree architecturally independent than binaries (you can usually cross train/inference on GPU, Google TPU, Cerebras WSE, etc. with the same weights).

ylai@lemmy.ml · 6 months ago

Open Source Initiative tries to define Open Source AI

ylai@lemmy.ml · 6 months ago

Open Source Initiative tries to define Open Source AI

ylai@lemmy.ml · 6 months ago

Intel's OpenVINO Now Available In openSUSE

ylai@lemmy.ml · 6 months ago

Valve's Linux Graphics Engineers Begin Prepping RADV Driver For AMD RDNA4 "GFX12"

ylai@lemmy.ml · 6 months ago

Germany's Sovereign Tech Fund Now Supporting FFmpeg

ylai@lemmy.ml · 6 months ago

IBM open-sources its Granite AI models - and they mean business

ylai@lemmy.ml · 6 months ago

AMD Aims For AMF Decode In FFmpeg, Questioned Over Vulkan Video Commitment

ylai@lemmy.ml · 7 months ago

There is even a sentence in README.md that makes it explicit:

The source files in this repo are for historical reference and will be kept static, so please don’t send Pull Requests suggesting any modifications to the source files […]

ylai@lemmy.ml · 8 months ago

It is like the U.S. Wired catching up to the idea years later?

ylai@lemmy.ml · 8 months ago

In the beginning, only privileged ones will be allowed to run in pass-through mode. But goal/roadmap calls for all FUSE filesystems eventually to have this near-native performance.

ylai@lemmy.ml · 8 months ago

Well, if you have a constructive suggestion which site to link instead regarding kernel developments, I am all ears:

Not sure that raw commits are readable or have sufficient context for non kernel development readers here
LWN, particularly timely/kernel development news there, has gone mostly paywall, and there will be (legitimate) complaint if I link articles needing a LWN subscription

ylai@lemmy.ml · 8 months ago

Not sure what called for this blatant personal attack. My post history speaks for itself, quite in comparison to yours. And Phoronix is well-known Linux website, and its test suite is in fact even referenced in various regression tests/patches in LKML (also not sure what/if any kind of kernel development you have done).

ylai@lemmy.ml · 10 months ago

See to the right:

Here you may post anything related to DeGoogling, why we should do it or good software alternatives!

ylai@lemmy.ml · edit-2 10 months ago

Retention, or the lack thereof, when cold-stored.

In term of SD or standard NAND, not even Nintendo does that. Nintendo builds Macronix XtraROM in their Game Card, which is some proprietary Flash memory with claimed 20 year cold storage retention. And they introduced the 64 GB version only after a lengthy delay, in 2020. So it seems that the (lack of) cold storage performance of standard NAND Flash is viewed by some in the industry as not ready for prime time. Macronix discussed it many years back in a DigiTimes article: https://www.digitimes.com/news/a20120713PR201.html.

And Sony and Microsoft are both still building Blu-ray-based consoles.

ylai@lemmy.ml · edit-2 1 year ago

Not true. See — or actually nothing to be seen here, since “it just works”: https://github.com/ggerganov/llama.cpp/discussions/3368 and https://huggingface.co/TheBloke/Mistral-7B-v0.1-GGUF https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.1-GGUF

And here is someone describing how to do the quantization yourself: https://advanced-stack.com/resources/running-inference-using-mistral-ai-first-released-model-with-llama-cpp.html

ylai@lemmy.ml · edit-2 1 year ago

GCC, back in the days DJGPP in particular. As a child in the 1990s I could not afford the big name compilers like Watcom. And compared to DJGPP, all the “prized” Borland/Turbo stuff that my middle school pushed (with segmented real mode), were practically Fisher-Price and Mattel compilers.