This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
EA Forum Bot Site
AI Safety Newsletter
EA Forum
Login
Sign up
AI Safety Newsletter
Get notified
38
AI Safety Newsletter #1 [CAIS Linkpost]
Akash
Akash
+ 0 more
·
1y
ago
0
0
56
AI Safety Newsletter #2: ChaosGPT, Natural Selection, and AI Safety in the Media
Oliver Z
Oliver Z
,
Dan H
,
Akash
,
aogara
+ 0 more
·
1y
ago
· 4m read
1
1
35
AI Safety Newsletter #3: AI policy proposals and a new challenger approaches
Oliver Z
Oliver Z
,
Dan H
,
Akash
,
aogara
+ 0 more
·
1y
ago
· 5m read
1
1
35
AI Safety Newsletter #4: AI and Cybersecurity, Persuasive AIs, Weaponization, and Geoffrey Hinton talks AI risks
Center for AI Safety
Center for AI Safety
,
Dan H
,
Akash
,
aogara
+ 0 more
·
1y
ago
· 6m read
2
2
60
AI Safety Newsletter #5: Geoffrey Hinton speaks out on AI risk, the White House meets with AI labs, and Trojan attacks on language models
Center for AI Safety
Center for AI Safety
,
Dan H
,
Akash
,
aogara
+ 0 more
·
1y
ago
· 5m read
0
0
32
AI Safety Newsletter #6: Examples of AI safety progress, Yoshua Bengio proposes a ban on AI agents, and lessons from nuclear arms control
Center for AI Safety
Center for AI Safety
,
Dan H
,
Akash
,
aogara
+ 0 more
·
1y
ago
· 7m read
1
1
23
AI Safety Newsletter #7: Disinformation, Governance Recommendations for AI labs, and Senate Hearings on AI
Center for AI Safety
Center for AI Safety
,
Dan H
,
Akash
,
aogara
+ 0 more
·
1y
ago
· 8m read
0
0
16
AI Safety Newsletter #8: Rogue AIs, how to screen for AI risks, and grants for research on democratic governance of AI
Center for AI Safety
Center for AI Safety
,
Dan H
,
Akash
,
aogara
+ 0 more
·
1y
ago
· 7m read
3
3
12
AISN #9: Statement on Extinction Risks, Competitive Pressures, and When Will AI Reach Human-Level?
Center for AI Safety
Center for AI Safety
,
Dan H
,
aogara
+ 0 more
·
1y
ago
· 9m read
2
2
30
AISN #12: Policy Proposals from NTIA’s Request for Comment and Reconsidering Instrumental Convergence
Center for AI Safety
Center for AI Safety
,
Dan H
,
aogara
+ 0 more
·
10mo
ago
· 8m read
3
3
25
AISN #13: An interdisciplinary perspective on AI proxy failures, new competitors to ChatGPT, and prompting language models to misbehave
Center for AI Safety
Center for AI Safety
,
Dan H
,
aogara
+ 0 more
·
10mo
ago
· 10m read
0
0
26
AISN#14: OpenAI’s ‘Superalignment’ team, Musk’s xAI launches, and developments in military AI use
Center for AI Safety
Center for AI Safety
,
Dan H
+ 0 more
·
10mo
ago
· 5m read
0
0
7
AISN #16: White House Secures Voluntary Commitments from Leading AI Labs and Lessons from Oppenheimer
Center for AI Safety
Center for AI Safety
,
Dan H
,
Corin Katzke
,
aogara
+ 0 more
·
9mo
ago
· 7m read
0
0
15
AISN #17: Automatically Circumventing LLM Guardrails, the Frontier Model Forum, and Senate Hearing on AI Oversight
Center for AI Safety
Center for AI Safety
,
Dan H
,
aogara
+ 0 more
·
9mo
ago
· 9m read
0
0
12
AISN #18: Challenges of Reinforcement Learning from Human Feedback, Microsoft’s Security Breach, and Conceptual Research on AI Safety
Center for AI Safety
Center for AI Safety
,
aogara
,
Dan H
+ 0 more
·
9mo
ago
· 7m read
0
0
12
AISN #20: LLM Proliferation, AI Deception, and Continuing Drivers of AI Capabilities
Center for AI Safety
Center for AI Safety
,
aogara
,
Dan H
+ 0 more
·
8mo
ago
· 10m read
0
0
13
AISN #21: Google DeepMind’s GPT-4 Competitor, Military Investments in Autonomous Drones, The UK AI Safety Summit, and Case Studies in AI Policy
Center for AI Safety
Center for AI Safety
,
aogara
,
Dan H
+ 0 more
·
8mo
ago
· 5m read
0
0
15
AISN #22: The Landscape of US AI Legislation - Hearings, Frameworks, Bills, and Laws
Center for AI Safety
Center for AI Safety
,
aogara
,
Dan H
+ 0 more
·
7mo
ago
· 6m read
1
1
7
AISN #23: New OpenAI Models, News from Anthropic, and Representation Engineering
Center for AI Safety
Center for AI Safety
,
aogara
,
Dan H
+ 0 more
·
7mo
ago
· 6m read
0
0
16
AISN #24: Kissinger Urges US-China Cooperation on AI, China's New AI Law, US Export Controls, International Institutions, and Open Source AI
Center for AI Safety
Center for AI Safety
,
aogara
,
Dan H
,
Corin Katzke
+ 0 more
·
7mo
ago
· 7m read
1
1
21
AISN #25: White House Executive Order on AI, UK AI Safety Summit, and Progress on Voluntary Evaluations of AI Risks
Center for AI Safety
Center for AI Safety
,
aogara
,
Dan H
+ 0 more
·
6mo
ago
· 7m read
0
0
11
AISN #26: National Institutions for AI Safety, Results From the UK Summit, and New Releases From OpenAI and xAI
Center for AI Safety
Center for AI Safety
,
aogara
,
Corin Katzke
,
allisoncyhuang
,
Dan H
+ 0 more
·
6mo
ago
· 7m read
0
0
10
AISN #27: Defensive Accelerationism, A Retrospective On The OpenAI Board Saga, And A New AI Bill From Senators Thune And Klobuchar
Center for AI Safety
Center for AI Safety
,
aogara
,
Dan H
,
Corin Katzke
,
allisoncyhuang
+ 0 more
·
5mo
ago
· 7m read
0
0
17
AISN #28: Center for AI Safety 2023 Year in Review
Center for AI Safety
Center for AI Safety
,
aogara
,
Dan H
+ 0 more
·
4mo
ago
· 6m read
1
1
5
AISN #29: Progress on the EU AI Act Plus, the NY Times sues OpenAI for Copyright Infringement, and Congressional Questions about Research Standards in AI Safety
Center for AI Safety
Center for AI Safety
,
aogara
,
Dan H
,
Corin Katzke
+ 0 more
·
4mo
ago
· 7m read
0
0
7
AISN #30: Investments in Compute and Military AI Plus, Japan and Singapore’s National AI Safety Institutes
Center for AI Safety
Center for AI Safety
,
aogara
,
Dan H
,
Corin Katzke
+ 0 more
·
3mo
ago
· 7m read
1
1
27
AISN #31: A New AI Policy Bill in California Plus, Precedents for AI Governance and The EU AI Office
Center for AI Safety
Center for AI Safety
,
aogara
,
Dan H
+ 0 more
·
2mo
ago
· 8m read
0
0
15
AISN #32: Measuring and Reducing Hazardous Knowledge in LLMs Plus, Forecasting the Future with LLMs, and Regulatory Markets
Center for AI Safety
Center for AI Safety
,
aogara
,
Corin Katzke
,
Dan H
+ 0 more
·
2mo
ago
· 10m read
2
2
19
AISN #33: Reassessing AI and Biorisk Plus, Consolidation in the Corporate AI Landscape, and National Investments in AI
Center for AI Safety
Center for AI Safety
,
aogara
,
Corin Katzke
,
AlexaPanYue
,
Dan H
+ 0 more
·
22d
ago
· 11m read
0
0
14
AISN #34: New Military AI Systems Plus, AI Labs Fail to Uphold Voluntary Commitments to UK AI Safety Institute, and New AI Policy Proposals in the US Senate
Center for AI Safety
Center for AI Safety
,
aogara
,
Corin Katzke
,
Dan H
+ 0 more
·
2d
ago
· 10m read
1
1