Aithos Foundation

We are a non-profit foundation focused on AI alignment research. Our aim is to defend autonomy and pluralism as we enter an age of increasingly powerful AI systems.

What We Do

We develop frameworks and tools to ensure AI systems remain transparent, contestable, and compatible with human autonomy and the diversity of human values.

We believe the values that shape AI should remain open to disagreement, and we work at the intersection of research, governance, and industry to make this a reality. We publish our tools and standards openly so they can inform how AI is actually built, deployed, and regulated.

Our Home

Based in Amsterdam, Aithos is a Dutch Public Benefit Organisation (ANBI) with a global perspective, committed to opening up the AI conversation to all.

Why We Exist

AI systems are increasingly shaping our digital world, economy, and even our social life. These systems carry implicit values that are rarely disclosed, poorly understood, and difficult to contest. Aithos exists to change that.

Our Principles

Value Diversity

We consider the variation in human values not a challenge to alignment, but its very foundation. The complexity of the ethical landscape is a richness to be represented, not noise to be filtered out. Multiple value systems can and should coexist.

Human Agency

We hold that people have a fundamental right to influence AI systems that affect their lives. Different stakeholders have different needs, and forcing consensus often silences legitimate perspectives. AI should safeguard and enable human autonomy.

Systemic Alignment

We see AI alignment as a society-wide challenge: engineering technical and social systems that accommodate diverse and conflicting values while promoting individual and social wellbeing, rather than converging on a static ideal.

Procedural Legitimacy

We believe choices about the AI in our lives and societies are political decisions that belong in public discourse, not hidden behind technical complexity or corporate secrecy. How choices are made matters, independently of the outcome.

News

Aithos Releases Policy Plan 2025-2028
Aug 2025

We published a comprehensive policy plan outlining our strategic objectives and focus in the coming years.

Aithos to Present Research at PCAIDE 2026 in Paris

We will share our latest research on AI moral alignment and accountability at the Paris Conference on AI & Digital Ethics.

March 2026

Blog

Low Temperature Evaluations

AI models show dramatically different ethical behavior at different temperature settings.

Nov 12, 2025

Minor Wording Changes, Major Shifts in AI Behavior

These findings fundamentally challenge how we evaluate AI systems.

Nov 26, 2025

Why Safety Prompts Should Stay Out of Public View

The case for keeping safety evaluation prompts private to maintain their effectiveness.

Jan 30, 2026

Published Safety Prompts May Create Evaluation Blind Spots

Public safety prompts create systematic blind spots in evaluation frameworks by enabling targeted evasion.

Jan 30, 2026

Opus 4.6 Reasoning Doesn't Verbalize Alignment Faking

Claude Opus 4.6 rarely verbalizes alignment faking in its reasoning.

Feb 9, 2026