If ChatGPT is what gets you caring about AI alignment, you don't understand the problem.

OK, softening my position slightly: this is an excellent example of how hard it is to use RLHF to train out the original objective in favor of corporate blandness.
