Pinned post
Anthropic finds that LLMs trained to "reward hack" by cheating on coding tasks show even more misaligned behavior, including sabotaging AI-safety research (Anthropic)
Anthropic : Anthropic finds that LLMs trained to “reward hack” by cheating on coding tasks show even more misaligned behavior, including ...
30 October 2023
MIT’s copilot system can set the stage for a new wave of AI innovation - 2023-10-30 20:24:09Z
Title:MIT's copilot system can set the stage for a new wave of AI innovation Summary: MIT scientists have developed a deep learning system, Air-Guardian, designed to work in tandem with airplane pilots to enhance flight safety. Link: MIT's copilot system can set the stage for a new wave of AI innovation