Safety issues #16

Siebe-wq · 2024-09-13T12:23:55Z

I'm concerned about safety & alignment issues. Do you have a safety policy?

ShengranHu · 2024-09-13T19:45:32Z

We discuss the critically important safety implications, including why we chose to do and release this work, in the paper (Page 12).

Siebe-wq · 2024-09-17T06:59:09Z

Thanks for replying! I had a look and, if I'm being honest, I found this quite lacking. In the spirit of the project, I had a conversation with Claude-3.5-Sonnet about it: https://poe.com/s/tJtRyGL3KitmecrCle7Y

My main concerns are that the project is currently easy for bad actors to misuse (cf. ChaosGPT) and that it carries a significant risk of uncontrolled proliferation (i.e. there's no kill switch). The latter might even violate California bill SB-1047 if it gets passed, though I'm not sure whether the project would meet the criteria.

I'm thinking that at least the following recommendations are useful:

- develop, or have it develop, a Responsible Scaling Policy (i.e. raising the bar for safety as performance increases)
- evaluate people before granting access to the full code
- include safety benchmarks in the performance evaluation
- collaborate with safety evaluation organisations like Apollo, Haize, and METR

I can recommend reading the full conversation I had with Claude.
