I work on scalable oversight at Anthropic, researching how to align superhuman AI with human values and improve models' philosophical thinking and decision-making capabilities.

For a full list, see my Google Scholar.
liangqiu at outlook dot com — general inquiries
liang at anthropic dot com — Anthropic‑related topics