MINT 2024
NeurIPS 2024 - Workshop on Foundation Model Interventions
Vancouver, December 15th 2024
Vancouver, December 15th 2024
The increasing capabilities of foundation models have raised concerns about their potential to generate undesirable content, perpetuate biases, and promote harmful behaviors.
To address these issues, we are hosting a workshop at NeurIPS 2024 that focuses on understanding the inner workings of foundation models and identifying actionable mechanisms involved in generation. Recent studies have shown promise in directly intervening on model activations or a low-rank subset of the weights to provide fine-grained control over model generation to mitigate the generation of harmful and toxic content.
This workshop brings together researchers to explore methods for improving the controllability of foundation models and developing a better understanding of their behaviour to disable potential misuse.
Atticus Geiger Pr(Ai)^2R
David Ha
Sakana AI
Jacob Steinhardt
Berkeley
Fernanda Viégas
Harvard | Google
Atticus Geiger Pr(Ai)^2R
Neel Nanda
Google Deepmind
Jacob Steinhardt
Berkeley
Fernanda Viégas
Harvard | Google
NEW! Camera-ready deadline: 29 November AOE
More info in our Call for Papers
For any questions or comments about this workshop, please reach out to:
mint2024-workshop@googlegroups.com