Filters
AI Safety Articles
Showing 1 of 1 articles in AI Safety from Future of AI Journal
Study Reveals Limitations in Current AI Alignment Techniques
A study reveals that current AI alignment techniques like RLHF may not scale reliably to more advanced systems,...
Study Reveals Limitations in Current AI Alignment Techniques
A study reveals that current AI alignment techniques like RLHF may not scale reliably to more advanced systems, calling for new approaches with stronger theoretical foundations.
Future of AI Journal • Apr 10, 2025
Read