Raw LLM Responses
Inspect the exact model output for any coded comment.
Look up by comment ID
Random samples — click to inspect
G
I don't call AI Artists "artists," I call them "Illustrative Writers", because t…
ytc_UgwHTg867…
G
@Roskellan One could assume that AI would perceive humans as its biggest threat…
ytr_UgzWIJY3R…
G
she winks 😉
is this really AI or woman talking and the robots lips move to voice…
ytc_Ugj3-s1sG…
G
The problem here is, first and foremost,that the scientists who have developed t…
ytc_UgyPJ7ryY…
G
Hi there! It seems like you're concerned about AI becoming too focused on effici…
ytr_UgwFn7Mxx…
G
> People shouldn't fear the loss of jobs, they need to fear the distribution …
rdc_dt9ivro
G
Didn't think that would've been avoided in a regular car. A person in dark cloth…
ytc_UgxlunewT…
G
Create a podcast that was created by AI with approval from your guest. Would be …
ytc_Ugzi4X6dg…
Comment
It seems that the framework of the solution is easy.
We set up many AI pairs that are designed to act badly (in a simulation), and with the other AI watching and attempting to predict bad actions. They will both get better... use them both to help design measures to stop a breakout by looking for early signs of growth in the wrong direction.
Look for early signs of an AI's attempts to investigate places that it could use to break out.
When you find early signs of problems apply training that reinforces preferred behavior patterns.
Regular training that moderates their behavior toward a norm. Based on feedback from observers that have a goal to keep the AI contained.
youtube
AI Moral Status
2023-08-25T23:5…
Coding Result
| Dimension | Value |
|---|---|
| Responsibility | developer |
| Reasoning | consequentialist |
| Policy | regulate |
| Emotion | mixed |
| Coded at | 2026-04-26T23:09:12.988011 |
Raw LLM Response
[
{"id":"ytc_UgwviVWNo4VSsADOgrN4AaABAg","responsibility":"distributed","reasoning":"virtue","policy":"none","emotion":"fear"},
{"id":"ytc_UgyfKyHhZM3QVMDwDzZ4AaABAg","responsibility":"user","reasoning":"virtue","policy":"none","emotion":"mixed"},
{"id":"ytc_UgyKSe3m-7-aXilb5Uh4AaABAg","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"indifference"},
{"id":"ytc_Ugwym_-WI7mM9mzp8294AaABAg","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"indifference"},
{"id":"ytc_UgyOyG0yPCz2DX5Npy54AaABAg","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"approval"},
{"id":"ytc_UgwQ8APOlvSug49V9ZJ4AaABAg","responsibility":"distributed","reasoning":"deontological","policy":"none","emotion":"mixed"},
{"id":"ytc_UgzOuOpAioCL3g4D3Bd4AaABAg","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"indifference"},
{"id":"ytc_Ugz0NSzYdbOunFS_DEB4AaABAg","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"resignation"},
{"id":"ytc_Ugwo5xG5jauU-DUfmwd4AaABAg","responsibility":"user","reasoning":"consequentialist","policy":"liability","emotion":"fear"},
{"id":"ytc_UgxzgYl2-Q_0qXP8VJd4AaABAg","responsibility":"developer","reasoning":"consequentialist","policy":"regulate","emotion":"mixed"}
]