MINT 2024

NeurIPS 2024 - Workshop on Foundation Model Interventions

Vancouver, December 15th 2024

About this workshop

The increasing capabilities of foundation models have raised concerns about their potential to generate undesirable content, perpetuate biases, and promote harmful behaviors.

To address these issues, we are hosting a workshop at NeurIPS 2024 that focuses on understanding the inner workings of foundation models and identifying actionable mechanisms involved in generation. Recent studies have shown promise in directly intervening on model activations or a low-rank subset of the weights to provide fine-grained control over model generation to mitigate the generation of harmful and toxic content.

This workshop brings together researchers to explore methods for improving the controllability of foundation models and developing a better understanding of their behaviour to disable potential misuse.

Speakers

Atticus Geiger Pr(Ai)^2R

David Ha
Sakana AI

Jacob Steinhardt
Berkeley

Fernanda Viégas
Harvard | Google

Panelists

Atticus Geiger Pr(Ai)^2R

Neel Nanda
Google Deepmind

Jacob Steinhardt
Berkeley

Fernanda Viégas
Harvard | Google

Information for Authors

NEW! Camera-ready deadline: 29 November AOE

More info in our Call for Papers

Contact

For any questions or comments about this workshop, please reach out to:

mint2024-workshop@googlegroups.com

Page updated

Google Sites

Report abuse

This site uses cookies from Google to deliver its services and to analyze traffic. Information about your use of this site is shared with Google. By using this site, you agree to its use of cookies.

Learn more

Got it