aguea asia
Filter
Retweets
Media
Videos
News
Verified
Native videos
Replies
Links
Images
Safe
Quotes
Pro videos
Exclude
Retweets
Media
Videos
News
Verified
Native videos
Replies
Links
Images
Safe
Quotes
Pro videos
Time range
-
Near
Users
Tweets
AISecHub
@AISecHub
17 Jan 2025
AI-Driven Security Research: Weekly Highlights 🔍 This week’s research covers advancements in malware detection, abuse prevention, adversarial attacks, and privacy-preserving AI. Below is a summary of key studies and findings, curated by Brandon Dixon. BERT and Bi-LSTM improve cyber abuse detection accuracy:
arxiv.org/pdf/2501.05443v1.p…
MalParse achieves 77% accuracy in Android malware categorization:
arxiv.org/pdf/2501.04848v1.p…
FlipedRAG reveals vulnerabilities in RAG models with 50% opinion manipulation success:
arxiv.org/pdf/2501.02968v1.p…
RAG-WM embeds watermark texts in RAG systems for 100% IP verification success:
arxiv.org/pdf/2501.05249v1.p…
SpaLLM-Guard achieves near-perfect SMS spam detection with LLMs:
arxiv.org/pdf/2501.04985v1.p…
Layer-AdvPatcher efficiently mitigates jailbreak attacks on LLMs:
arxiv.org/pdf/2501.02629v1.p…
AI improves defect prediction and vulnerability detection in secure software engineering:
arxiv.org/pdf/2501.05165v1.p…
LLM4CVE enhances iterative automated software vulnerability repair:
arxiv.org/pdf/2501.03446v1.p…
CommitShield excels in tracking vulnerability fixes in version control systems:
arxiv.org/pdf/2501.03626v1.p…
DFUZZ uses LLMs for effective bug detection in deep learning libraries:
arxiv.org/pdf/2501.04312v1.p…
CGP-Tuning improves code vulnerability detection with graph-text interactions:
arxiv.org/pdf/2501.04510v1.p…
Stack v2 dataset evaluation reveals security vulnerabilities in LLM pre-training:
arxiv.org/pdf/2501.02628v1.p…
PromptGuard moderates NSFW content in text-to-image models efficiently:
arxiv.org/pdf/2501.03544v1.p…
GuardedTuning balances privacy and utility in fine-tuning LLMs:
arxiv.org/pdf/2501.04323v2.p…
PAWN excels in robust AI-generated text detection under adversarial conditions:
arxiv.org/pdf/2501.03940v1.p…
Language models enhance GNSS interference characterization accuracy:
arxiv.org/pdf/2501.05079v1.p…
HP-BERT effectively monitors Hinduphobia on social media during crises:
arxiv.org/pdf/2501.05482v1.p…
#CyberAbuseDetection
#HateSpeechAI
#LLMs
#AndroidMalware
#MalwareDetection
#AIInSecurity
#OpinionManipulation
#AdversarialAttacks
#RAGModels
#WatermarkingAI
#IPProtection
#RAGSystems
#SpamDetection
#SMSFraud
#LLMSecurity
#JailbreakDefense
#LLMSafety
#AdversarialDefense
#SecureSoftwareEngineering
#AIForSecurity
#VulnerabilityDetection
#VulnerabilityRepair
#SoftwareSecurity
#AIInDevelopment
#VersionControl
#VulnerabilityFix
#CommitShield
#BugDetection
#DeepLearningLibraries
#APISecurity
#CodeVulnerability
#GraphTextAI
#CyberResilience
#LLMDatasets
#PreTrainingRisks
#DataSecurity
#ContentModeration
#NSFWDetection
#TextToImageAI
#PrivacyPreservation
#LLMFineTuning
#DataProtection
#TextDetection
#AIContent
#AdversarialResilience
#GNSSInterference
#GenAISecurity
@arxiv
#arxiv
6
1
29
3,798
Load more