@sweng

sweng@programming.dev · 9 days ago

there is no way to do the equivalent of banning armor piercing rounds with an LLM or making sure a gun is detectable by metal detectors - because as I said it is non-deterministic. You can’t inject programmatic controls.

Of course you can. Why would you not, just because it is non-deterministic? Non-determinism does not mean complete randomness and lack of control, that is a common misconception.

Again, obviously you can’t teach an LLM about morals, but you can reduce the likelyhood of producing immoral content in many ways. Of course it won’t be perfect, and of course it may limit the usefulness in some cases, but that is the case also today in many situations that don’t involve AI, e.g. some people complain they “can not talk about certain things without getting cancelled by overly eager SJWs”. Society already acts as a morality filter. Sometimes it works, sometimes it doesn’t. Free-speech maximslists exist, but are a minority.

sweng@programming.dev · 10 days ago

Well, I, and most lawmakers in the world, disagree with you then. Those restrictions certainly make e.g killing humans harder (generally considered an immoral activity) while not affecting e.g. hunting (generally considered a moral activity).

sweng@programming.dev · edit-2 10 days ago

So what possible morality can you build into the gun to prevent immoral use?

You can’t build morality into it, as I said. You can build functionality into it that makes immmoral use harder.

I can e.g.

limit the rounds per minute that can be fired
limit the type of ammunition that can be used
make it easier to determine which weapon was used to fire a shot
make it easier to detect the weapon before it is used
etc. etc.

Society considers e.g hunting a moral use of weapons, while killing people usually isn’t.

So banning ceramic, unmarked, silenced, full-automatic weapons firing armor-piercing bullets can certainly be an effective way of reducing the immoral use of a weapon.

sweng@programming.dev · 10 days ago

While an LLM itself has no concept of morality, it’s certainly possible to at least partially inject/enforce some morality when working with them, just like any other tool. Why wouldn’t people expect that?

Consider guns: while they have no concept of morality, we still apply certain restrictions to them to make using them in an immoral way harder. Does it work perfectly? No. Should we abandon all rules and regulations because of that? Also no.

sweng@programming.dev · 2 months ago

They can, and are being made. E.g. the state of accessibility on Gome.

sweng@programming.dev · 3 months ago

Yes, and what I’m saying is that it would be expensive compared to not having to do it.

sweng@programming.dev · edit-2 3 months ago

Doing OCR in a very specific format, in a small specific area, using a set of only 9 characters, and having a list of all possible results, is not really the same problem at all.

sweng@programming.dev · edit-2 3 months ago

How many billion times do you generally do that, and how is battery life after?

sweng@programming.dev · 3 months ago

Cryptographically signed documents and Matrix?

sweng@programming.dev · 3 months ago

I think you are replying to the wrong person?

I did not say it helps with accuracy. I did not say LLMs will get better. I did not even say we should use LLMs.

But even if I did, non of your points are relevant for the Firefox usecase.

sweng@programming.dev · 3 months ago

Wikipedia is no less reliable than other content. There’s even academic research about it (no, I will not dig for sources now, so feel free to not believe it). But factual correctness only matters for models that deal with facts: for e.g a translation model it does not matter.

Reddit has a massive amount of user-generated content it owns, e.g. comments. Again, the factual correctness only matters in some contexts, not all.

I’m not sure why you keep mentioning LLMs since that is not what is being discussed. Firefox has no plans to use some LLM to generate content where facts play an important role.

sweng@programming.dev · edit-2 3 months ago

At horrendous expense, yes. Using it for OCR makes little sense. And compared to just sending the text directly, even OCR is expensive.

sweng@programming.dev · 3 months ago

The issue is not sending, it is receiving. With a fax you need to do some OCR to extract the text, which you then can feed into e.g an AI.

sweng@programming.dev · edit-2 3 months ago

What do you mean “full set if data”?

Obviously you can not train on 100% of material ever created, so you pick a subset. There is a a lot of permissively licensed content (e.g. Wikipedia) and content you can license (e.g. Reddit). While not sufficient for an advanced LLM, it certainly is for smaller models that do not need wide knowledge.

sweng@programming.dev · 3 months ago

I’d say the main differences are at least

package availability
update frequency
backporting
packaging philosophy (e.g. plain upstream vs customizations, include all funtionality in single packege vs split out optional features)
default confguration for packages

sweng@programming.dev · edit-2 3 months ago

Feel free to assume that, but don’t claim an assumption as a fact.

You recommended using native package managers. How many of them have been audited?

sweng@programming.dev · edit-2 3 months ago

You know what else we shouldn’t assume? That that it doesn’t have a security feature. And we additionally then shouldn’t go around posting that incorrect assumption as if it were a fact. You know, like you did.

sweng@programming.dev · 3 months ago

There is no general copyright issue with AIs. It completely depends on the training material (if even then), so it’s not possible to make blanket statements like that. Banning technology, because a particular implementation is problematic, makes no sense.

sweng@programming.dev · edit-2 3 months ago

I’m confused why you think it would be anything else, and why you are so dead set on this. Repos include a signing key. There is an option to skip signature checking. And you think that signature checking is not used during downloads, despite this?

Ok, here are a few issues related to signatures being checked by default, when downloading: https://github.com/flatpak/flatpak/issues/4836 https://github.com/flatpak/flatpak/issues/5657 https://github.com/flatpak/flatpak/issues/3769 https://github.com/flatpak/flatpak/issues/5246 https://askubuntu.com/questions/1433512/flatpak-cant-check-signature-public-key-not-found https://stackoverflow.com/questions/70839691/flatpak-not-working-apparently-gpg-issue

Flatpak repos are signed and the signature is checked when downloading.

It’s OK to be wrong. Dying on this hill seems pretty weird to me.

sweng@programming.dev · edit-2 3 months ago

From the page:

It is recommended that OSTree repositories are verified using GPG whenever they are used. However, if you want to disable GPG verification, the --no-gpg-verify option can be used when a remote is added.

That is talking about downloading as well. Yes, you can turn it off, but so can you usually do it with native package managers, e.g. pacman: https://wiki.archlinux.org/title/Pacman/Package_signing