On the (In)Security of Loading Machine Learning Models
We identified six zero-day vulnerabilities, including the first CVEs ever assigned to Keras safe_mode. Our results show that loading a machine learning model can be equivalent to executing untrusted code, despite the security claims often present in framework and hub documentation.
We also show that Hugging Face’s integrated scanners do not always provide an effective additional line of defense against framework-level exploits. Finally, through a survey of machine learning practitioners, we show that security claims in framework and hub documentation can create misplaced trust. For example, over 90% of non-security ML practitioners perceived no risk of arbitrary code execution when safe_mode=True.
Source: arxiv.org/pdf/2509.06703#MLSecurity#AISecurity#ModelSecurity#MachineLearning#SecureAI#ModelSupplyChain#ModelLoading#ArbitraryCodeExecution#SoftwareSecurity#CyberSecurity#AIVulnerabilities#ModelHubSecurity#SecureML#AIAttackSurface#IEEEsp#SecurityResearch
63% of AI agents deployed in production have critical security flaws.
Most security teams focus on traditional vectors: network perimeter, endpoint protection, access control.
But agents introduce entirely new vulnerabilities:
🎯 𝗣𝗿𝗼𝗺𝗽𝘁 𝗜𝗻𝗷𝗲𝗰𝘁𝗶𝗼𝗻 Malicious inputs that manipulate agent behavior, bypassing intended constraints.
🎯 𝗗𝗮𝘁𝗮 𝗣𝗼𝗶𝘀𝗼𝗻𝗶𝗻𝗴 Corrupted training data that influences decision-making at scale.
🎯 𝗠𝗼𝗱𝗲𝗹 𝗘𝘅𝘁𝗿𝗮𝗰𝘁𝗶𝗼𝗻 Adversaries reverse-engineering your proprietary AI logic through repeated queries.
🎯 𝗔𝘂𝘁𝗵𝗼𝗿𝗶𝘇𝗮𝘁𝗶𝗼𝗻 𝗕𝘆𝗽𝗮𝘀𝘀 Agents performing actions beyond their intended scope due to unclear boundaries.
🎯 𝗖𝗵𝗮𝗶𝗻-𝗼𝗳-𝗧𝗵𝗼𝘂𝗴𝗵𝘁 𝗟𝗲𝗮𝗸𝗮𝗴𝗲 Sensitive reasoning processes exposed in logs or outputs.
Traditional security frameworks weren't built for these threats.
Nexus was.
Our infrastructure provides agent-specific protections: → Input validation at the semantic level → Behavioral anomaly detection → Fine-grained permission boundaries → Encrypted reasoning paths → Continuous compliance monitoring
Because securing AI agents isn't about adding tools. It's about rethinking infrastructure.
#AIAgents#CybersecurityEducation#AgentSecurity#AIThreats#MachineLearning#MLSecurity#AIInfrastructure#ThreatIntelligence#EnterpriseAI#SecureML
Toward Stealthy Bit-Flip Attacks on Large Language Models - arxiv.org/pdf/2509.17371
While input manipulation attacks (e.g., prompt injection) have been well-studied, Bit-Flip Attacks (BFAs) which exploit hardware vulnerabilities to corrupt model parameters and cause severe performance degradation-have received far less attention.
Existing BFA methods suffer from key limitations: they fail to balance performance degradation and output naturalness, making them prone to discovery. In this paper, we introduce SilentStriker, the first stealthy bit-flip attack against LLMs that effectively degrades task performance while maintaining output naturalness. Our core contribution lies in addressing the challenge of designing effective loss functions for LLMs with variable output length and the vast output space.
Unlike prior approaches that rely on output perplexity for attack loss formulation, which in-evidently degrade the output naturalness, we reformulate the attack objective by leveraging key output tokens as targets for suppression, enabling effective joint optimization of attack effectiveness and stealthiness.
#AISecurity#LLMSecurity#GenAI#ModelSafety#AdversarialML#PromptInjection#DataPoisoning#ModelGovernance#AIRedTeam#SecureML@ZJU_China@Huawei#BitFlip#SilentStriker#BFA
4/9
🌐 Machine learning is increasingly integrated into all aspects of our lives, from healthcare to transportation. Aleo's zkML Initiative aims to ensure the security and privacy of these applications, laying the foundation for a more trusted future.
#SecureML
@dawnsongtweets’s invited talk mentions the importance of understanding and combatting security and privacy issues in (deployable) ML. This is a topic I have been thinking a lot about recently. Happy to see researchers talking about this. #secureML#WiML2019#NeurIPS2019