CommitShield - Search

Users
Tweets

AISecHub

@AISecHub

17 Jan 2025

AI-Driven Security Research: Weekly Highlights 🔍 This week’s research covers advancements in malware detection, abuse prevention, adversarial attacks, and privacy-preserving AI. Below is a summary of key studies and findings, curated by Brandon Dixon. BERT and Bi-LSTM improve cyber abuse detection accuracy: arxiv.org/pdf/2501.05443v1.p… MalParse achieves 77% accuracy in Android malware categorization: arxiv.org/pdf/2501.04848v1.p… FlipedRAG reveals vulnerabilities in RAG models with 50% opinion manipulation success: arxiv.org/pdf/2501.02968v1.p… RAG-WM embeds watermark texts in RAG systems for 100% IP verification success: arxiv.org/pdf/2501.05249v1.p… SpaLLM-Guard achieves near-perfect SMS spam detection with LLMs: arxiv.org/pdf/2501.04985v1.p… Layer-AdvPatcher efficiently mitigates jailbreak attacks on LLMs: arxiv.org/pdf/2501.02629v1.p… AI improves defect prediction and vulnerability detection in secure software engineering: arxiv.org/pdf/2501.05165v1.p… LLM4CVE enhances iterative automated software vulnerability repair: arxiv.org/pdf/2501.03446v1.p… CommitShield excels in tracking vulnerability fixes in version control systems: arxiv.org/pdf/2501.03626v1.p… DFUZZ uses LLMs for effective bug detection in deep learning libraries: arxiv.org/pdf/2501.04312v1.p… CGP-Tuning improves code vulnerability detection with graph-text interactions: arxiv.org/pdf/2501.04510v1.p… Stack v2 dataset evaluation reveals security vulnerabilities in LLM pre-training: arxiv.org/pdf/2501.02628v1.p… PromptGuard moderates NSFW content in text-to-image models efficiently: arxiv.org/pdf/2501.03544v1.p… GuardedTuning balances privacy and utility in fine-tuning LLMs: arxiv.org/pdf/2501.04323v2.p… PAWN excels in robust AI-generated text detection under adversarial conditions: arxiv.org/pdf/2501.03940v1.p… Language models enhance GNSS interference characterization accuracy: arxiv.org/pdf/2501.05079v1.p… HP-BERT effectively monitors Hinduphobia on social media during crises: arxiv.org/pdf/2501.05482v1.p… #CyberAbuseDetection #HateSpeechAI #LLMs #AndroidMalware #MalwareDetection #AIInSecurity #OpinionManipulation #AdversarialAttacks #RAGModels #WatermarkingAI #IPProtection #RAGSystems #SpamDetection #SMSFraud #LLMSecurity #JailbreakDefense #LLMSafety #AdversarialDefense #SecureSoftwareEngineering #AIForSecurity #VulnerabilityDetection #VulnerabilityRepair #SoftwareSecurity #AIInDevelopment #VersionControl #VulnerabilityFix #CommitShield #BugDetection #DeepLearningLibraries #APISecurity #CodeVulnerability #GraphTextAI #CyberResilience #LLMDatasets #PreTrainingRisks #DataSecurity #ContentModeration #NSFWDetection #TextToImageAI #PrivacyPreservation #LLMFineTuning #DataProtection #TextDetection #AIContent #AdversarialResilience #GNSSInterference #GenAISecurity @arxiv #arxiv

6

1

29

3,798