Reading log
Reads
papers, blogs, and talks I’ve been reading
41 reads27 days32 blog7 arxiv1 x1 list
topics:
- 6 May 2026
- blog
- arxiv
- 5 May 2026
- blogWithout Specific Countermeasures, the Easiest Path to AGI-Level Capabilities is Dangerous Alignment · LessWrong#alignment#ai-safety
- arxiv
- blog
- 24 Apr 2026
- xDeepSeek V4: Technical Report thread · Elie Bakouch
- x
- 23 Apr 2026
- blog
- 18 Apr 2026
- blog
- blog
- 14 Apr 2026
- blogPersona vectors: Monitoring and controlling character traits in language models · Anthropic#alignment#interpretability
- blog
- blog
- 11 Apr 2026
- blog
- arxiv
- arxivThe Social Laboratory: A Psychometric Framework for Multi-Agent LLM Evaluation · Reza et al.#model-personas
- 8 Apr 2026
- blog
- 5 Apr 2026
- list
- 4 Apr 2026
- blogShaping the Exploration of the Motivation Space Matters for Alignment · LessWrong#alignment#ai-safety
- blog
- blog
- 1 Apr 2026
- blog
- 27 Mar 2026
- blog
- 23 Mar 2026
- blog
- 22 Mar 2026
- blog
- 19 Mar 2026
- blog
- 10 Mar 2026
- blog
- 7 Mar 2026
- blog
- blog
- 5 Mar 2026
- blog
- 4 Mar 2026
- blog
- 1 Mar 2026
- blog
- blog
- blog
- 28 feb 2026
- blog
- 27 feb 2026
- blog
- 23 feb 2026
- arxivFoundational Challenges in Assuring Alignment and Safety of Large Language Models · Anwar et al.reading#alignment#ai-safety
- arxiv
- 22 feb 2026
- blog
- blog
- 21 feb 2026
- blog
- blog
- arxiv
- 20 feb 2026
- blog
- 19 feb 2026
- blog
- arxiv