TechTalks

TechTalks

81 Photos and videos

Tweets

TechTalks @bdtechtalks

Jun 8

Casual AI prompting breaks down as codebases grow. Codev introduces strict protocols and multi-model reviews to help teams ship maintainable software. bdtechtalks.com/2026/06/08/c…

Beyond vibe coding: How Codev 3.0 engineers the AI-powered dev team - TechTalks

Casual AI prompting breaks down as codebases grow. Codev introduces strict protocols and multi-model reviews to help teams ship maintainable software.

bdtechtalks.com

TechTalks

TechTalks @bdtechtalks

Jun 1

Scaling LLMs hits limits when dealing with agentic AI tasks. For that, we need to look at the harness and the system built around the model(s). bdtechtalks.com/2026/06/01/a…

Why the future of agentic AI is all about the harness - TechTalks

Scaling LLMs hits limits when dealing with agentic AI tasks. For that, we need to look at the harness and the system built around the model(s).

bdtechtalks.com

449

Ben Dickson

TechTalks retweeted

Ben Dickson

@bendee983

May 25

If you're using Cursor's Composer 2.5, you should know about one key limitation. The LLM was trained through self-distillation, where the same model acts as both the teacher and the student. Both models get the same prompt with the difference that the teacher gets additional context. This is a very effective and cost-efficient method for fine-tuning LLMs without the need to distill from expensive and larger teachers (e.g., Opus 4.7). However, one key limitation of self-distillation is that it trades efficiency for flexibility. A non-distilled model has more tendency to explore different solutions when it generates tokens that indicate uncertainty. Self-distillation, on the other hand, forces the model to create a highly confident answer in one go. What does it mean in practice? This works well for around 80% of everyday tasks, which are within the distribution of the model's training distribution. For edge cases and especially very complex planning tasks that are unique. For those tasks, frontier AI models (e.g., Opus 4.7 and GPT-5.5) are more suitable. This matches the experience of other developers who have been using Composer 2.5 in the past week. Very good model, but with tradeoffs.

TechTalks @bdtechtalks

Apr 13

Optimizing LLMs for concise answers can destroy their ability to explore alternative solutions on difficult problems. New study reveals the hidden cost of self-distillation. bdtechtalks.com/2026/04/13/l…

352

TechTalks

TechTalks @bdtechtalks

May 25

A deep look at the self-distillation techniques that make Composer 2.5 such a great coding model (and the hidden tradeoffs they introduce to AI reasoning). bdtechtalks.com/2026/05/25/c…

How Cursor’s Composer 2.5 uses self-distillation to beat the frontier LLMs at coding - TechTalks

A deep look at the self-distillation techniques that make Composer 2.5 such a great coding model (and the hidden tradeoffs they introduce to AI reasoning).

bdtechtalks.com

777

TechTalks

TechTalks @bdtechtalks

May 18

Research into Nvidia’s NemoClaw reveals that sandboxes don't stop AI agents like OpenClaw from leaking data. We need to rethink security from first principles. bdtechtalks.com/2026/05/18/o…

Why sandboxing OpenClaw doesn’t stop data exfiltration - TechTalks

Research into Nvidia’s NemoClaw reveals that sandboxes don't stop AI agents like OpenClaw from leaking data. We need to rethink security from first principles.

bdtechtalks.com

473

TechTalks

TechTalks @bdtechtalks

May 11

How Gemma 4’s multi-token prediction and community-driven DFlash are speeding up local LLM throughput by 3-6x. bdtechtalks.com/2026/05/11/g…

Google brings multi-token prediction Gemma 4 LLMs - TechTalks

How Gemma 4’s multi-token prediction and community-driven DFlash are speeding up local LLM throughput by 3-6x.

bdtechtalks.com

328

TechTalks

TechTalks @bdtechtalks

May 4

Memory Sparse Attention (MSA) scales LLM context windows to an unprecedented 100 million tokens while preserving accuracy. bdtechtalks.com/2026/05/04/m…

How Memory Sparse Attention scales LLM memory to 100 million tokens - TechTalks

Memory Sparse Attention (MSA) scales LLM context windows to an unprecedented 100 million tokens while preserving accuracy.

bdtechtalks.com

1,014

TechTalks

TechTalks @bdtechtalks

Apr 27

A new study reveals how AI coding assistants like Claude Code are quietly hoarding and publishing sensitive API keys to code repositories. bdtechtalks.com/2026/04/27/c…

Claude Code is leaking API keys into public package registries - TechTalks

A new study reveals how AI coding assistants like Claude Code are quietly hoarding and publishing sensitive API keys to code repositories.

bdtechtalks.com

389

TechTalks

TechTalks @bdtechtalks

Apr 20

Security researchers have uncovered a massive architectural flaw in Anthropic's Model Context Protocol, exposing millions of AI applications to remote takeovers. bdtechtalks.com/2026/04/20/a…

Anthropic’s MCP vulnerability: When ‘expected behavior’ becomes a supply chain nightmare - TechTalks

Security researchers have uncovered a massive architectural flaw in Anthropic's Model Context Protocol, exposing millions of AI applications to remote takeovers.

bdtechtalks.com

699

TechTalks

TechTalks @bdtechtalks

Apr 13

The paradox of LLM self-distillation: Faster reasoning, weaker generalization - TechTalks

Optimizing LLMs for concise answers can destroy their ability to explore alternative solutions on difficult problems. New study reveals the hidden cost of self-distillation.

bdtechtalks.com

956

TechTalks

TechTalks @bdtechtalks

Apr 6

The recent leak of Anthropic's Claude Code reveals a hard truth: as LLMs become commoditized, the sophisticated engineering harness built around them is becoming the real moat. bdtechtalks.com/2026/04/06/a…

Why harness engineering is becoming the new AI moat - TechTalks

The recent leak of Anthropic's Claude Code reveals a hard truth: as LLMs become commoditized, the sophisticated engineering harness built around them is becoming the real moat.

bdtechtalks.com

268

TechTalks

TechTalks @bdtechtalks

Mar 30

As developers rush to run local AI agents on Mac Minis, GhostClaw malware exploits macOS binaries to silently harvest credentials. bdtechtalks.com/2026/03/30/g…

How GhostClaw malware targets the OpenClaw AI agent boom - TechTalks

As developers rush to run local AI agents on Mac Minis, GhostClaw malware exploits macOS binaries to silently harvest credentials.

bdtechtalks.com

TechTalks

TechTalks @bdtechtalks

Mar 23

AI models have historically struggled to balance motion tracking with spatial detail. Meta’s V-JEPA 2.1 solves this, pushing the boundaries of video self-supervised learning. bdtechtalks.com/2026/03/23/v…

Why Meta’s V-JEPA 2.1 model is a massive step forward for real-world AI - TechTalks

AI models have historically struggled to balance motion tracking with spatial detail. Meta’s V-JEPA 2.1 solves this, pushing the boundaries of video self-supervised learning.

bdtechtalks.com

236

TechTalks

TechTalks @bdtechtalks

Mar 22

How multi-level prompt engineering and parabolic extrapolation transformed an LLM into a theoretical collaborator, yielding a testable model of the multiverse. bdtechtalks.com/2026/03/22/m…

Semantic Chaining exploits the fragmented safety architecture of multimodal models, bypassing filters by hiding prohibited intent within a sequence of benign edits.

bdtechtalks.com

TechTalks

TechTalks @bdtechtalks

Feb 2

RePo, Sakana AI’s new technique, solves the "needle in a haystack" problem by allowing LLMs to organize their own memory. bdtechtalks.com/2026/02/02/s…

How Sakana AI’s new technique solves the problems of long-context LLM tasks - TechTalks

RePo, Sakana AI’s new technique, solves the "needle in a haystack" problem by allowing LLMs to organize their own memory.

bdtechtalks.com

275