Raw LLM Responses
Inspect the exact model output for any coded comment.
Look up by comment ID
Random samples — click to inspect
G
So AI robots will be more productive, more intelligent, more reliable. Then we …
ytc_UgyDFngGB…
G
So what happens when no one has a job and no paycheck... I assume they stop purc…
ytc_UgwVWMcAk…
G
I’m so struck, because yes, AI / robots as we know them are tools of white supre…
ytc_Ugz9nlp62…
G
I’ve asked this for so long, people worry about AI and robots taking our jobs… B…
ytc_Ugx3cXk-1…
G
Bring traditional gender roles and nucleus family structure back. Due to automa…
ytc_UgxJa-B6z…
G
The problem with this issue is the human´s ego. We wont stop until we show othe…
ytc_UgwMObPFn…
G
How people not realize this is dangerous. Even as a courtesy or niceness, lie is…
ytc_Ugysea2KY…
G
AI is saturating the net now and is evidently becoming a real problem, i think y…
ytc_UgyBrsQS3…
Comment
The one thing I just can't see being solved (ever) is the "unalignment risk" - as in, the risk that even if we manage to make an AI which is aligned such that literally everyone benefits from it being turned on, what is stopping a bad actor from unaligning it? If the answer is only having access to the code / weights then yikes.
As a recent example, Meta open-sourced Llama2 and that came with guardrails. It took people about 2 days to retrain the model a bit such that all the previous "safety" finetuning it has gotten was perfectly ignored and of course nobody wants to use a version of the model which refuses answering some requests when they can instead have an AI which answers every request.
youtube
AI Governance
2023-09-25T01:5…
Coding Result
| Dimension | Value |
|---|---|
| Responsibility | user |
| Reasoning | consequentialist |
| Policy | liability |
| Emotion | fear |
| Coded at | 2026-04-27T06:24:53.388235 |
Raw LLM Response
[{"id":"ytc_UgxJ4AsahyT5iEdUIhd4AaABAg","responsibility":"investor","reasoning":"consequentialist","policy":"none","emotion":"resignation"},{"id":"ytc_UgwTIr-dX5_DqkoOQg14AaABAg","responsibility":"none","reasoning":"consequentialist","policy":"none","emotion":"approval"},{"id":"ytc_UgzCCUBr6WRwM6LJdmB4AaABAg","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"fear"},{"id":"ytc_UgzODLBfrUYnXIZPLu54AaABAg","responsibility":"user","reasoning":"consequentialist","policy":"liability","emotion":"fear"},{"id":"ytc_UgzrW1pP1D73B6jyIaJ4AaABAg","responsibility":"none","reasoning":"consequentialist","policy":"none","emotion":"fear"},{"id":"ytc_UgwC20gm87M2ffer2wV4AaABAg","responsibility":"none","reasoning":"mixed","policy":"none","emotion":"indifference"},{"id":"ytc_Ugwk0AP72kC4OmRMjdV4AaABAg","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"approval"},{"id":"ytc_Ugzna7YIqceIOENLAtt4AaABAg","responsibility":"government","reasoning":"consequentialist","policy":"regulate","emotion":"approval"},{"id":"ytc_UgyEbyctvQkJDOiRelF4AaABAg","responsibility":"none","reasoning":"contractualist","policy":"none","emotion":"resignation"},{"id":"ytc_UgxfcHrH2qOt4T0jDix4AaABAg","responsibility":"government","reasoning":"consequentialist","policy":"regulate","emotion":"approval"}]