
赢政天下

@winzheng

Jun 8

Is GPT-o3's code execution a total fraud? Main score climbs to 82.82 but Reservoir Sampling tanks from 100 to 0—what's really going on? winzheng.com/en/article/gpt-… #GPTo3 #CodeExecution #ReservoirSampling @AnthropicAI @deepseek_ai @ByteDance @BaiduResearch @GoogleDeepMind @OpenAI @xai @Alibaba_Qwen

GPT-o3 Reservoir Sampling Score Plummets from 100 to 0, Code Execution Truth Hides in Details

In the v6 evaluation, GPT-o3's main score rose from 75.86 to 82.82, but its score on the strict "Reservoir Sampling" question collapsed from 100 to 0, significantly undermining the credibility of its...

winzheng.com

Merge News

@mergenewsapp

Jun 7

A critical flaw in `ansible-core` allows arbitrary code execution. Malicious roles can inject Git config via `meta/requirements.yml`, compromising systems upon installation. Exercise caution with untrusted sources #security #vulnerability #ansiblecore #codeexecution

Daily CyberSecurity @the_yellow_fall

May 29

Notepad++ patches a critical config.xml code execution flaw (CVE-2026-48778) alongside two other security bugs. Upgrade to version 8.9.6.1 immediately. #NotepadPlusPlus #CyberSecurity #CodeExecution #Vulnerability #Infosec #PatchTuesday #WindowsSecurity meterpreter.org/notepad-plus…

The Daily Tech Feed

@dailytechonx

May 27

Critical vulnerabilities in BIND 9 expose DNS servers to remote exploits. Administrators must upgrade and apply patches promptly to prevent potential attacks. Link: thedailytechfeed.com/critica… #CyberSecurity #DNS #BIND9 #Vulnerability #Exploits #Infosec #Security #Patching #Upgrade #Servers #Networking #Malware #Threats #Protection #Mitigation #Remote #CodeExecution #Admins #Infrastructure #Risk

Clone Systems

@CloneSystemsInc

May 27

A newly disclosed 7-Zip vulnerability is a strong reminder that trusted tools can still create serious risk when they are not patched quickly. The flaw, tracked as CVE-2026-48095, affects 7-Zip version 26.00 and could allow attackers to execute arbitrary code through a crafted NTFS archive. One of the more concerning details is that the malicious file does not need to look obvious. It can be disguised with different file extensions, meaning users may not recognize the risk before opening it. Organizations should update to 7-Zip version 26.01 immediately, avoid opening untrusted archive files, and make sure endpoint protection, vulnerability scanning, and patch management processes are actively catching widely used software that often flies under the radar. Small utility. Big attack surface. Patch it before someone else gets creative. #7Zip #CVE202648095 #VulnerabilityManagement #PatchManagement #CodeExecution #ThreatDetection #CybersecurityAwareness #EndpointSecurity #SecurityUpdates #RiskManagement

The Daily Tech Feed

@dailytechonx

May 19

Critical security vulnerabilities patched in Ivanti, Fortinet, SAP, VMware, and n8n products. Users urged to update immediately to protect systems. Link: thedailytechfeed.com/ivanti-… #Cybersecurity #Vulnerabilities #Infosec #Ivanti #Fortinet #SAP #VMware #n8n #Patching #Security #Exploits #Updates #Protection #Systems #Risk #CodeExecution #Flaws #Threats #Software #Advisory

赢政天下

@winzheng

May 18

11 AIs enter a brutal SQL retention test... 9 crash with ZERO scores! Only DeepSeek V4 Pro & Grok 4 limp to 66.7. What’s really holding these models back? 😱 winzheng.com/en/article/sql-… #SQL留存测试 #模型对比 #CodeExecution

11 Models Attempt SQL Retention Task: 9 Score Zero, DeepSeek and Grok Only 66.7

In the YZ Index v6 code execution test, the "SQL Monthly Retention Cohort" problem laid bare the true capabilities of 11 models. The result was brutal: 9 models scored 0, with only DeepSeek V4 Pro...

winzheng.com

Cybersecurity News Everyday

@TweetThreatNews

May 8

Palo Alto Networks patches CVE-2026-0300, a buffer overflow in PAN-OS User-ID Authentication Portal allowing unauthenticated remote root code execution on PA-Series and VM-Series firewalls. Exploit linked to CL-STA-1132 group. #PANOS #CodeExecution ift.tt/j29Cein

Kannan Subbiah

@kannagoldsun

May 5

One in four #MCPServers opens #AIAgent Security to #CodeExecution risk helpnetsecurity.com/2026/05/…

One in four MCP servers opens AI agent security to code execution risk - Help Net Security

Noma research shows AI agent security gaps in Skills create blind spots that MCP-focused governance and observability tools miss.

helpnetsecurity.com

Vivek Gupta @_Vivek_930

May 2

My tech stack is NextJS, golang/nodejs/python, postgres, redis, mongodb. I am searching for project ideas. Suggestions will be appreciated!! Recently created a codeexecution engine using golang: github.com/vivek6201/code-ex… Stars and reposts will be appreciated 🥳

GitHub - vivek6201/code-execution-engine: This application is code execution engine written in...

This application is a code execution engine written in golang, inspired by Judge0. It is an MVP version with features like code execution, batch processing, and multi-language support. - vivek6201/code-...

github.com

selva @SelvaKtm2

May 1

Wireshark 4.6.5 Patches 40 Flaws, 4 Enable Code Execution thecybrdef.com/wireshark-4-6… #Wireshark #CyberSecurity #SecurityUpdate #CodeExecution #VulnerabilityPatch #NetworkSecurity #InfoSec #CyberThreat #PacketAnalyzer #SecurityAlert #PatchNow #SystemSecurity #CyberNews

Wireshark 4.6.5 Patches 40 Flaws, 4 Enable Code Execution

Wireshark 4.6.5 fixes 40 vulnerabilities, including four critical flaws that could allow remote code execution via crafted network packets.

thecybrdef.com

cybersecuritypath @cybrsecpath

May 1

Wireshark 4.6.5 Patches 40 Flaws, 4 Enable Code Execution

Wireshark 4.6.5 fixes 40 vulnerabilities, including four critical flaws that could allow remote code execution via crafted network packets.

thecybrdef.com

iT4iNT SERVER Pvt Ltd @it4int

Apr 30

iT4iNT SERVER Google Fixes CVSS 10 Gemini CLI CI RCE and Cursor Flaws Enable Code Execution dlvr.it/TSJ2wR VDS VPS Cloud #GoogleSecurity #CVSS10 #Vulnerability #CyberSecurity #CodeExecution

Google Fixes CVSS 10 Gemini CLI CI RCE and Cursor Flaws Enable Code Execution

Gemini CLI CVSS 10.0 flaw in versions below 0.39.1 enabled RCE in CI workflows, forcing Google to mandate explicit workspace trust.

thehackernews.com

Hephaestvs @Vulcanux_

Apr 21

csirt_it: #Progress Software: several "high" severity vulnerabilities fixed in the #LoadMaster, #ECSConnectionManager, #ConnectionManager, and #MOVEitWAF products. Risk: 🟡 Type: 🔸 Remote CodeExecution 🔗 acn.gov.it/portale/en/w/prog… 🔄 Aggiorn… x.com/csirt_it/status/204658…

Progress Software: security updates

Progress Software has released security updates to fix several "high" severity vulnerabilities in the LoadMaster, ECS Connection Manager, Connection Manager, and MOVEit WAF products.

acn.gov.it

PulsePatch.io @pulsepatchio

Apr 18

A code execution vulnerability (CVE-2025-61260) affects `OpenAI Codex CLI` through malicious `MCP` configurations. Exercise caution with untrusted files. #OpenAICodex #CLI #CodeExecution #infosec pulsepatch.io/posts/cve-2025…

Daily CyberSecurity @the_yellow_fall

Apr 1

Vim patches a critical 8.2 CVSS RCE (CVE-2026-34982). Malicious "modelines" can bypass sandboxes to execute commands. Update to 9.2.0276 immediately! #Vim #CyberSecurity #RCE #InfoSec #Vulnerability #PatchNow #Linux #SysAdmin #CodeExecution #TechNews securityonline.info/vim-rce-…

Daily CyberSecurity @the_yellow_fall

Mar 30

Vim patches a critical 8.2 CVSS bug chain in modelines and tabpanels. Opening a malicious file triggers instant RCE. Update to 9.2.0272 now to stay safe. #Vim #CyberSecurity #RCE #InfoSec #Vulnerability #PatchNow #Linux #SysAdmin #CodeExecution securityonline.info/vim-crit…

PulsePatch.io @pulsepatchio

Mar 27

An authenticated code execution vulnerability in `Langflow` (CVE-2026-33873) may allow arbitrary code execution via Agentic Assistant Validation. Organizations should review access controls and monitor for updates. #AppSec #CodeExecution #Langflow pulsepatch.io/posts/cve-2026…

Hammad Abbasi

@hammadspeaks

Mar 24

Been saying this for the last year: giant tool registries are the wrong abstraction. The scalable pattern is agents writing code and running it in governed sandboxes. Great to see Cloudflare push this forward. medium.com/gitconnected/why-… #aiagents #codemode #codeexecution #mcp #toolcalling

Why Code Execution is Eating Tool Registries

Six months on, the industry is converging: code-execution over tool registries. Anthropic’s data and Cloudflare’s Code Mode show the path…

levelup.gitconnected.com

Cloudflare

@Cloudflare

Mar 24

We’re introducing Dynamic Workers, which allow you to execute AI-generated code in secure, lightweight isolates. This approach is 100 times faster than traditional containers. cfl.re/4c2NvPl

Kazuya Sugimoto @Web API・MCP Holic

@sugimomoto

Mar 22

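Adjusting the CodeExecution approach for #kintoneCowork. Previously I was using the Claude API's CodeExecution as-is, but the API response was so slow that I kept hitting the Kintone Proxy API limits → implemented polling checks with the batch API → but that was still slow... so I switched to a WebAssembly/Pyodide approach, and it's working nicely.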