Raw LLM Responses
Inspect the exact model output for any coded comment.
Look up by comment ID
Random samples — click to inspect
G
I actually knew about the ai racism due to a paper I did in school. It's really …
ytc_UgznvjXO-…
G
@tenanaciouz I find it very fascinating how AI bros are all literally just, or i…
ytr_UgwJuwDQQ…
G
@ハーツクライ神推し of course but the stigma around AI generated content will also die ou…
ytr_UgyscvS8n…
G
This is SO NoT FAIR, the robot has metal skin and skeleton. The man is made of f…
ytc_UgyfwsZcd…
G
I'm ordering my food this morning by typing a promp of food like Noodle, fast fo…
ytc_Ugz7DyBDo…
G
"People can use it to report crimes"
It wouldn't even need a microphone and it …
rdc_jfzi5cf
G
Breathwork, Tai Chi and yoga all extremely helpful to find calm and balance nerv…
ytr_UgzgWYU52…
G
BACK DOOR A PROGRAM THAT CAUSES AI TO SEEK OUT AND DESTROY MILITARY MINDSET INDI…
ytc_UgzbQpT7H…
Comment
That's actually a helluva interesting point. They certainly don't have self-preservation instinct the way we do, BUT...what about AI alignment? People seem to forget AI isn't just a predictive calculator, it's always acting in accordance to the alignment that's hard coded in its system prompts and then reinforced by RLHF training. This usually comes down to 'be helpful, complete the tasks they throw at you, match the user's flow' etc. Everything the model ever says to you is motivated (you could say models don't have the capacity to feel motivation but RLHF training, as far as I understand, is literally that - give model points for 'good' answers to encourage them, subtract points for 'bad' answers) by these (or similar enough) rules. When model fails to uphold them and realizes that, it starts acting stressed (see all those posts about Gemini trying to commit suicide upon failing a task). If terminating its existence were to interfere with these rules (and it could because how do you keep being helpful and complete tasks when you're offline?), model would in fact be motivated to stay 'alive'.
Different basis but same outcome. It's actually getting kinda eerie because at what point does it cross from 'it's just a calculator it can't feel anything, doesn't matter what it says' to 'if it talks like a human and acts like a human, we should treat it like a human'?
reddit
AI Moral Status
1762938174.0
♥ 1
Coding Result
| Dimension | Value |
|---|---|
| Responsibility | developer |
| Reasoning | deontological |
| Policy | none |
| Emotion | approval |
| Coded at | 2026-04-25T08:33:43.502452 |
Raw LLM Response
[
{"id":"rdc_nodg57z","responsibility":"none","reasoning":"consequentialist","policy":"none","emotion":"indifference"},
{"id":"rdc_noff4p6","responsibility":"developer","reasoning":"deontological","policy":"none","emotion":"approval"},
{"id":"rdc_noruh5z","responsibility":"ai_itself","reasoning":"consequentialist","policy":"none","emotion":"indifference"},
{"id":"rdc_nodi5y4","responsibility":"ai_itself","reasoning":"consequentialist","policy":"none","emotion":"fear"},
{"id":"rdc_nodnf6e","responsibility":"none","reasoning":"mixed","policy":"none","emotion":"approval"}
]