Generative AI can easily be made malicious despite guardrails, say scholars
Scholars found by gathering as little as a hundred examples of question-answer pairs for illicit advice or hate speech, they could undo the careful “alignment” meant to establish guardrails around generative AI. University of California, Santa Barbara Companies developing generative AI, such as OpenAI with ChatGPT, have made a big deal about their investment in … Read more