I Self-Hosted Llama 3.2 with Coolify on My Home Server

1. netdevnet ◴[16 Oct 24 08:11 UTC] No.41856729[source]▶

Am I right thinking that a self-hosted llama wouldn't have the kind restrictions ChatGPT has since it has no initial system prompt?

replies(4): >>41856734 #>>41856777 #>>41856779 #>>41856872 #

2. Kudos ◴[16 Oct 24 08:12 UTC] No.41856734[source]▶

>>41856729 (TP) #

Many protections are baked into the models themselves.

3. dtquad ◴[16 Oct 24 08:19 UTC] No.41856777[source]▶

>>41856729 (TP) #

All the self-hosted LLM and text-to-image models come with some restrictions trained into them [1]. However there are plenty of people who have made uncensored "forks" of these models where the restrictions have been "trained away" (mostly by fine-tuning).

You can find plenty of uncensored LLM models here:

https://ollama.com/library

[1]: I personally suspect that many LLMs are still trained on WebText, derivatives of WebText, or using synthetic data generated by LLMs trained on WebText. This might be why they feel so "censored":

>WebText was generated by scraping only pages linked to by Reddit posts that had received at least three upvotes prior to December 2017. The corpus was subsequently cleaned

The implications of so much AI trained on content upvoted by 2015-2017 redditors is not talked about enough.

replies(2): >>41856879 #>>41857328 #

4. exe34 ◴[16 Oct 24 08:19 UTC] No.41856779[source]▶

>>41856729 (TP) #

It has a sanitised output. You might want to look for "abliterated" models, where the general performance might drop a bit but the guard-rails have been diminished.

5. nubinetwork ◴[16 Oct 24 08:35 UTC] No.41856872[source]▶

>>41856729 (TP) #

That depends on the frontend, you can supply a system prompt if you want to... whether it follows it to the letter is another problem...

6. nubinetwork ◴[16 Oct 24 08:36 UTC] No.41856879[source]▶

>>41856777 #

> All the self-hosted [...] text-to-image models come with some restrictions trained into them

https://github.com/huggingface/diffusers/issues/3422

7. thrdbndndn ◴[16 Oct 24 09:57 UTC] No.41857328[source]▶

>>41856777 #

My to-go test for uncensoring is to ask the LLM to write erotic novel.

But I haven't yet find any "uncensored" ones (on ollama) that works. Did I miss something?

(On the contrary: when ChatGPT first came out, it was trivial to jailbreak it to make it write erotica.)

replies(3): >>41857379 #>>41857399 #>>41857875 #

8. dtquad ◴[16 Oct 24 10:11 UTC] No.41857399{3}[source]▶

>>41857328 #

Try the popular (pull count) dolphin models:

https://ollama.com/library/dolphin-mistral

9. TomK32 ◴[16 Oct 24 11:27 UTC] No.41857875{3}[source]▶

>>41857328 #

I found that "Don't censor your answer" works as intended and my self-hosted llm happily delivers smut.