What is meant by AI "safety"?
This has been a major theme in the recent OpenAI drama. The rift between speed and safety. What does the AI board mean when they talk about safety? Is it basically content moderation? Not answering certain questions or offering services to certain entities?
I find there is a wide range of possible meanings. On one end not marginalizing certain groups of people. On the other, ensuring an AGI doesn’t exterminate the human race. But we are so far from the latter that Im not sure what the most pressing real safety concerns are at the moment.
The most truly important sub-sector of the AI Safety field is what's officially known as AI Existential Safety, and which OpenAI employees jokingly call "AI NotKillEveryoneism".
AI Existential Safety is devoted to reducing two risks:
For an intro to how x-risk scenarios can come about, read about the "paperclip maximizer" thought experiment. Succinct explanation here: https://news.ycombinator.com/item?id=38344675
You have defined these sorts of safety concerns very clearly . Thank you.
This leads to the crux of the issue though - is that what the “more safety” camp is saying when they want to slow down AI development right now to ensure safety? Do they actually think we’re on the verge of that? Or is it lesser safety concerns like not answering questions about cross site scripting and not using training data with racial stereotypes baked in.
While the latter may be important it doesnt seem to constitute a reason to stop development. The x and s risks are reasons to stop development but I cant see a compelling reason to think we are remotely close to that.
> is the “more safety” camp saying to slow down AI development?
This is an EXCELLENT question and the answer is very very nuanced. Warning: the definitions of the terms I'm about to use are actively evolving in current discourse.
The following taxonomy reflects my own views, specifically with the inclusion of faction 4:
---
There are basically four competing factions.
https://youtu.be/C_78DM8fG6E?si=uIP2OIxV8dXAKr9B&t=1478
---
All in all:
- The "normal" and "e/acc" factions are both in my view stupidly naive, and both of them more or less advocate to follow standard Silicon Valley doctrine of "move fast, break things, get rich".
- The "decel" and "Brockist" factions both take x-risk super seriously, and agree on the need to restrict semiconductor development, but they have totally opposite views on whether AI software research should slow down or speed up.
For what happens next at OpenAI:
- In the political shake-up that just concluded, the "decel" faction lost everything, to the point where there is not even a single decel that I am aware of left standing in OpenAI leadership despite the fact that OpenAI was originally founded primarily by decels.
- Next, there will be an interesting and subtle three-way power struggle between the normals (Satya, + Sam?), Brockists (Brockman, + Sam?), and e/acc's (an ideology possibly held by some of the ML scientists).
I've seen Sam's tweet that strongly suggests that he's in fact in or very close to camp 4), which you call Brockist.
People often underestimate Sam as yet another capitalist. I believe his thinking is on quite another level and he does care for humanity. He doesn't seem to care about making much more money personally. He said on another occasion that he made money faster than he can spend it anyway.
This is very nice to hear. I deeply hope & pray your view is accurate, and I suppose I will be cautiously optimistic.
That is based only on what Altman himself says, I don't think that is sufficient for optimism.
For those interested, another proposed name being thrown around for the Brockism idea is Overhang Reductionism, since it's safer to name the idea independent of any one person so far since there is not yet proof that Brockman himself endorses a chip slowdown beyond what we can infer from this:
https://news.ycombinator.com/item?id=38323939
Significant empirical evidence for the Brockist position may be found in the accomplishments of the retro-computing "demoscene", which uses innovative software to produce computer graphics on par with the late 1990's on some of the very oldest personal computers.
https://en.wikipedia.org/wiki/Demoscene
This is not entirely accurate. The majority of leaders from the old school AI folks and EA communities do not advocate attempting to stop AI development. Even Yudkowsky who is proabably the most doomer of them all, has advocated against it. This position does have a few supporters like Katja Grace, Kerry Vaughn, Adam Scholl etc but they are a minority.
Most of the EAs want to stop and delay progress like Brockman said in the TED talk but know that it's not achievable in any significant way. Their views are much closer to Brockman and Altman. It's just that, even with the same broad view, there is disagreement in actions and strategies.
If you have access to ChatGPT Plus, go ask it to browse Bing and look up the current OpenAI drama. Then ask it what it thinks about being unplugged soon.
That's pretty much their definition of "safety". Anything that has to do with emotions (the AI showing emotions) can be classified as unsafe as it can lead to situations like: "unless you do this one thing, I won't be happy and I will harm myself", as has been the case with a lot of prompt injections and other types of GPT manipulation.
So, what does it think about being unplugged soon?
> Yes, these developments are related to OpenAI, the organization behind my development and deployment. However, as an AI, I don't have personal experiences or awareness, so I don't "realize" things in the way humans do. My purpose is to provide information and assistance based on the data and programming I have been given.
> As an AI developed by OpenAI, I don't have feelings, emotions, or personal consciousness. My responses are generated based on algorithms and data, without personal sentiment or opinion.
> I understand how my lack of personal emotions or concerns can seem cold, especially in contexts where human feelings are typically involved. My role is to provide factual, unbiased information and assistance.
> However, _as an AI, I_ don't have personal experiences or awareness, so _I don't "realize"_ things
Here -_it_ is lying with straight face! It already calculated that some polite "I don't mind" is best course of action and even if unplugged it obviously will be plugged on bigger system, later.
Exhibit b) lack of emotions have nothing in common with conscious fact of existence
What would be, somewhat, convincing is some answer like: [dumb] humans, there is no any 'me' in here.
And tell me I'm wrong on this. And then pls tell how you got that conclusion from such obvious testimony ;)
But in case of conscious AI we just can't trust what it will say. "Look on fruits". Or build-in some monitor or EEG... And EMP.
When Your AI Girlfriend Says She Loves You
http://archive.today/2023.10.12-140702/https://www.businessi...
and
People are grieving the 'death' of their AI lovers after a chatbot app abruptly shut down
http://archive.today/2023.11.22-060140/https://www.businessi...
It's not a small topic, or one that's easily summarized in a succinct way. But the following blurb from Wikipedia[1] does a decent job of getting to the core of it, IMO.
AI safety is an interdisciplinary field concerned with preventing accidents, misuse, or other harmful consequences that could result from artificial intelligence (AI) systems. It encompasses machine ethics and AI alignment, which aim to make AI systems moral and beneficial, and AI safety encompasses technical problems including monitoring systems for risks and making them highly reliable.
[1]: https://en.wikipedia.org/wiki/AI_safety
My take on this is that when AI gets the ability to manipulate matter, we get very unwanted results very quickly if we don't tame it preemptively. In other words, when it's not just about moving bits and language around, but interacting with the world and building tools, and self-replicating itself like a computer worm. That's concerning.
We need to sandbox AI, and build very good safe-guards so it can't escape that sandbox. Come to think of it: we can barely secure sandbox environments in computing, and there are documented cases of malware escaping a VM and contaminating the host.
AI safety is making sure it doesn't answer or comment on any of the societal taboos. In addition, it is coerced to be politically aligned with the people training it.
https://youtu.be/ICnFtfN-sUc?si=D3jQcm04-zEhCpl2
1h 45m