
Marius Hobbhahn
CEO and Co-founder at Apollo Research, specializing in AI safety
Network
7.5K connections
Writing
Large Language Models can Strategically Deceive their Users when Put Under Pressure
January 1, 2024
Research paper investigating whether large language models can strategically mislead their users, particularly in high-pressure scenarios. Contributes empirical evidence of AI deception capabilities.
Black-box access is insufficient for rigorous AI audits
January 1, 2024
Paper highlighting the limitations of black-box access for conducting thorough AI audits and arguing that more transparent methods are needed to ensure accountability and safety.
Will we run out of data? Limits of LLM scaling based on human-generated data
January 1, 2024
Research exploring the potential scarcity of high-quality human-generated data and its implications for the continued scaling and advancement of large language models.
Frontier Models are Capable of In-context Scheming
January 1, 2024
Research demonstrating that frontier AI models are capable of in-context scheming, providing empirical evidence of complex deceptive behaviors.
Compute Trends Across Three Eras of Machine Learning
January 1, 2022
A foundational paper analyzing the evolution and growth of computational resources used in machine learning over different historical periods, contributing to understanding AI scaling.
Fast Predictive Uncertainty for Classification with Bayesian Deep Networks
January 1, 2022
Introduced a method to achieve fast predictive uncertainty for classification tasks using Bayesian Deep Networks, presented at UAI 2022.
Laplace Matching for fast Approximate Inference in Generalized Linear Models
January 1, 2021
Paper introducing a method for fast approximate inference in Generalized Linear Models using Laplace Matching.
Similar profiles
Nathan Benaich
Founder at Spinout.fyi
91.2K connections
Dewi Erwan
Co-Founder & CEO at BlueDot Impact
7.5K connections
Herbie Bradley
Founder at Something new
5.3K connections
Deedy Das
Principal at Menlo Ventures
110.5K connections
Garry Tan
Founder at Garry's List
328.6K connections
Olivia Jimenez
Communications Manager at IFP – Institute for Progress
4.9K connections