These forums regularly post new research:

AI Alignment Forum

LessWrong

Research

Cool New Research Picks:

Emergent introspective awareness in large language models