Doug Turnbull

Doug Turnbull

4 Photos and videos

Tweets

Dev News retweeted

Doug Turnbull

@softwaredoug

Jan 20

This should be interesting maven.com/p/1bb91a/reasoning…

ReasoningLayer.ai - neurosymbolic reasoning for your data

LLMs are powerful, but they reason probabilistically—not logically. As AI systems take on higher-stakes decisions, hallucinations, rising costs, and lack of auditability become real risks. This topic...

maven.com

281

ReasoningLayer

Dev News retweeted

ReasoningLayer @ReasoningLayer

5 Dec 2025

🚨 AI BUILDERS — waitlist is OPEN 🚨 LLMs aren’t enough. Prompting isn't enough, Context engineering is NOT enough. "Thinking mode" is marketing bullshit. So I built ReasoningLayer a deterministic reasoning layer. And today… the waitlist is OPEN. 👉 reasoninglayer.ai/

ReasoningLayer - Neuro-Symbolic AI Platform for Enterprise

The reasoning layer that makes enterprise AI trustworthy, auditable, and cost-effective. Build explainable AI systems with fuzzy logic, temporal reasoning, and formal proof trees.

reasoninglayer.ai

PaddlePaddle

Dev News retweeted

PaddlePaddle

@PaddlePaddle

11 Sep 2025

General VLMs struggle with precise text localization & often hallucinate on dense docs. 🔥PP-OCRv5 solves this with a modular, two-stage pipeline: ✅Smaller, faster, lighter ✅Accurate bounding boxes ✅Edge-device friendly Read more 👇huggingface.co/blog/baidu/pp…

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR

A Blog post by BAIDU on Hugging Face

huggingface.co

7,920

PaddlePaddle

Dev News retweeted

PaddlePaddle

@PaddlePaddle

12 Sep 2025

💥Big news! ERNIE-4.5-21B-A3B-Thinking just hit #1 overall on @huggingface trending models.🚀 👉Check out the model page: huggingface.co/baidu/ERNIE-4…

11,027

NotebookLM

Dev News retweeted

NotebookLM

@NotebookLM

8 Sep 2025

Also, while we have your attention... Flashcards & Quizzes are rolling out TODAY! 🥳 You can now create customizable flashcards and quizzes in @NotebookLM. Stumped on a question? Tap the Explain button to receive an in-depth summary in the chat. Study on!

0:37

124

293

2,425

216,565

Xenova

Dev News retweeted

Xenova

@xenovacom

24 Jul 2025

Introducing Voxtral WebGPU: State-of-the-art audio transcription directly in your browser! 🤯 🗣️ Transcribe videos, meeting notes, songs and more 🔐 Runs on-device, meaning no data is sent to a server 🌎 Multilingual (8 languages) 🤗 Completely free (forever) & open source

0:46

319

20,025

Cline

Dev News retweeted

Cline

@cline

23 Jul 2025

Less than two weeks Kimi K2's release, @Alibaba_Qwen's new Qwen3-Coder surpasses it with half the size and double the context window. Despite a significant initial lead, open source models are catching up to closed source and seem to be reaching escape velocity.

435

73,906

Simon Willison

Dev News retweeted

Simon Willison

@simonw

23 Jul 2025

Interesting new benchmark from Hugging Face testing how well vision LLMs can handle long video inputs (generally after they've been split into many thousands of images) - my notes here: simonwillison.net/2025/Jul/2…

TimeScope: How Long Can Your Video Large Multimodal Model Go?

New open source benchmark for evaluating vision LLMs on how well they handle long videos: TimeScope probes the limits of long-video capabilities by inserting several short (~5-10 second) video...

simonwillison.net

Andi Marafioti

@andimarafioti

23 Jul 2025

Many VLMs claim to process hours of video. But can they follow the story?🤔 Today, we introduce TimeScope: The benchmark that separates true temporal understanding from marketing hype. Let's see how much VLMs really understand!⏳

111

15,977

Daniel van Strien

Dev News retweeted

Daniel van Strien

@vanstriendaniel

8 Jul 2025

465 people. 122 languages. 58,185 annotations! FineWeb-C v1 is complete! Communities worldwide have built their own educational quality datasets, proving that we don't need to wait for big tech to support languages. Huge thanks to all who contributed! huggingface.co/blog/davanstr…

0:25

107

20,109

Ivan Fioravanti ᯅ

Dev News retweeted

Ivan Fioravanti ᯅ

@ivanfioravanti

5 Jul 2025

Apple released DiffuCoder-7B a Masked Diffusion Model for Code Generation. MLX support is a work in progress right now.

186

15,676

Jerry Liu

Dev News retweeted

Jerry Liu

@jerryjliu0

2 Jul 2025

RAG POCs are easy, but building production-grade retrieval is legitimately hard. These are things you don’t realize when you’re first starting out building agents - “wow my chat over 10 pdfs works in 10 mins!”. We learned these lessons as we built out LlamaCloud and wanted to share them with you 💡: 1. Access Control: if you want to gate data through user permissions, you need to invest in data connectors that actually index different levels of permission metadata within the source document. 2. Failure Planning: LLM APIs have rate-limits on large volumes. Data sources require you to think about credential expiration and data throughput. If you’re indexing 1M PDFs and it fails halfway through, ideally you don’t start from scratch. 3. Noisy Neighbors: If you have a lot of RAG pipelines through a central system, you need to make sure that one large job doesn’t take down every other pipeline. These days context engineering is a hot topic, and one of the primary sources of context remains your own enterprise data. Full credits to @thesourabhd for this thoughtful blog post: llamaindex.ai/blog/4-ways-ll… Come check out LlamaCloud if you want to have a managed document indexing/retrieval platform: cloud.llamaindex.ai/

LlamaIndex 🦙

@llama_index

2 Jul 2025

New blog: 4 Ways LlamaCloud Scales Enterprise RAG As we've built LlamaCloud to scale up to truly huge enterprise workloads, we've learned a lot of lessons. In this post we share them so others building high-scale document indexing and retrieval can learn what's coming as they scale up. Lessons include: ➡️ Noisy neighbor problems - Heavy users can starve other users' file processing, requiring robust multi-tenant scheduling and resource management ➡️ Access control complexity - Enterprise RAG must maintain document-level permissions from source systems and filter retrieval results based on user access rights ➡️ Document parsing is critical - LlamaParse converts diverse enterprise formats (PDFs, PowerPoints, Excel) into standardized markdown while preserving semantic structure and extracting tables/images ➡️ Plan for failures - AI service rate limits, data source limitations, and system outages require custom retry logic and checkpointing to resume from failure points Read the full blog post here: llamaindex.ai/blog/4-ways-ll…

226

31,886

FFmpeg

Dev News retweeted

FFmpeg

@FFmpeg

17 Jun 2025

Hand written RISC-V assembly 23.49x faster than C, thanks to work from @CAS__Science

1,131

30,353

Adina Yakup

Dev News retweeted

Adina Yakup

@AdinaYakup

16 Jun 2025

Moonshot AI has quietly released their new coding model: Kimi-Dev 💻 @Kimi_Moonshot huggingface.co/moonshotai/Ki… ✨ 72B - MIT License ✨ 60.4% on SWE-bench Verified ✨ RL-trained to patch real repos in Docker ✨ Only rewarded if full test suite passes

moonshotai/Kimi-Dev-72B · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

330

49,465

FFmpeg

Dev News retweeted

FFmpeg

@FFmpeg

16 Jun 2025

December will mark 25 years of FFmpeg. Going from 100 downloads to being used by the NASA Mars Rover? What is in FFmpeg's future? What software do you still use that is 25 years old? Date: 2000-12-19 The first tarball is available! web.archive.org/web/20010710… link.springer.com/article/10…

696

20,787

Arena.ai

Dev News retweeted

Arena.ai

@arena

16 Jun 2025

🚨Breaking: New DeepSeek-r1 (0528) just tied for #1 in WebDev Arena, matching Claude Opus 4! More highlights: 💠 #6 Overall on Text Arena 💠 #2 in Coding, #4 in Hard Prompts, #5 in Math category 💠 MIT-licensed, currently the best open model on the leaderboard! Huge congrats @deepseek_ai on the incredible milestone 🔥

DeepSeek

@deepseek_ai

29 May 2025

🚀 DeepSeek-R1-0528 is here! 🔹 Improved benchmark performance 🔹 Enhanced front-end capabilities 🔹 Reduced hallucinations 🔹 Supports JSON output & function calling ✅ Try it now: chat.deepseek.com/ 🔌 No change to API usage — docs here: api-docs.deepseek.com/guides… 🔗 Open-source weights: huggingface.co/deepseek-ai/D…

103

850

164,761

Molly Cantillon

Dev News retweeted

Molly Cantillon

@mollycantillon

12 Jun 2025

Everything will be local Yesterday I gave a talk about Real-World Applications of MLX and built a fast on-device semantic search index over Apple WWDC 2025 docs. Open-sourced the code for anyone curious! github.com/mcantillon21/loca…

GitHub - mcantillon21/local-search: On-device semantic search over Apple WWDC 2025 docs using MLX...

On-device semantic search over Apple WWDC 2025 docs using MLX embeddings — SwiftUI app (WWDC OMT 2025) - mcantillon21/local-search

github.com

362

31,309

Longyue Wang

Dev News retweeted

Longyue Wang

@wangly0229

12 Jun 2025

✨Introducing ComfyUI-R1: a large reasoning model for automated workflow generation🌺 huggingface.co/papers/2506.0…

1,365

Qwen

Dev News retweeted

Qwen

@Alibaba_Qwen

28 Apr 2025

Introducing Qwen3! We release and open-weight Qwen3, our latest large language models, including 2 MoE models and 6 dense models, ranging from 0.6B to 235B. Our flagship model, Qwen3-235B-A22B, achieves competitive results in benchmark evaluations of coding, math, general capabilities, etc., when compared to other top-tier models such as DeepSeek-R1, o1, o3-mini, Grok-3, and Gemini-2.5-Pro. Additionally, the small MoE model, Qwen3-30B-A3B, outcompetes QwQ-32B with 10 times of activated parameters, and even a tiny model like Qwen3-4B can rival the performance of Qwen2.5-72B-Instruct. For more information, feel free to try them out in Qwen Chat Web (chat.qwen.ai) and APP and visit our GitHub, HF, ModelScope, etc. Blog: qwenlm.github.io/blog/qwen3/ GitHub: github.com/QwenLM/Qwen3 Hugging Face: huggingface.co/collections/Q… ModelScope: modelscope.cn/collections/Qw… The post-trained models, such as Qwen3-30B-A3B, along with their pre-trained counterparts (e.g., Qwen3-30B-A3B-Base), are now available on platforms like Hugging Face, ModelScope, and Kaggle. For deployment, we recommend using frameworks like SGLang and vLLM. For local usage, tools such as Ollama, LMStudio, MLX, llama.cpp, and KTransformers are highly recommended. These options ensure that users can easily integrate Qwen3 into their workflows, whether in research, development, or production environments. Hope you enjoy our new models!

346

1,571

8,030

2,214,936

AK

Dev News retweeted

@_akhaliq

23 Apr 2025

LiveCC just dropped on Hugging Face Learning Video LLM with Streaming Speech Transcription at Scale video LLM capable of real-time commentary, trained with a novel video-ASR streaming method, SOTA on both streaming and offline benchmarks.

2:07

622

45,962

Simon Willison

Dev News retweeted

Simon Willison

@simonw

11 Apr 2025

It's been 2.5 years with little progress finding mitigations for prompt injection attacks LLM apps... but that may finally have changed! Google DeepMind published a paper describing CaMeL, an ingenious system that could, maybe, lead to secure digital assistants

Defeating Prompt Injections by Design

Edoardo Debenedetti 1,3*, Ilia Shumailov 2, Tianqi Fan1, Jamie Hayes 2, Nicholas Carlini 2,
Daniel Fabian 1, Christoph Kern 1, Chongyang Shi 2, Andreas Terzis 2 and Florian Tramèr 3

1 Google, 2 Google DeepMind, 3 ETH Zurich

Large Language Models (LLMs) are increasingly deployed in agentic systems that interact with an external environment. However, LLM agents are vulnerable to prompt injection attacks when handling untrusted data. In this paper we propose CaMeL, a robust defense that creates a protective system layer around the LLM, securing it even when underlying models may be susceptible to attacks. To operate, CaMeL explicitly extracts the control and data flows from the (trusted) query; therefore, the untrusted data retrieved by the LLM can never impact the program flow. To further improve security, CaMeL relies on a notion of a capability to prevent the exfiltration of private data over unauthorized data flows.

ALT Defeating Prompt Injections by Design Edoardo Debenedetti 1,3*, Ilia Shumailov 2, Tianqi Fan1, Jamie Hayes 2, Nicholas Carlini 2, Daniel Fabian 1, Christoph Kern 1, Chongyang Shi 2, Andreas Terzis 2 and Florian Tramèr 3 1 Google, 2 Google DeepMind, 3 ETH Zurich Large Language Models (LLMs) are increasingly deployed in agentic systems that interact with an external environment. However, LLM agents are vulnerable to prompt injection attacks when handling untrusted data. In this paper we propose CaMeL, a robust defense that creates a protective system layer around the LLM, securing it even when underlying models may be susceptible to attacks. To operate, CaMeL explicitly extracts the control and data flows from the (trusted) query; therefore, the untrusted data retrieved by the LLM can never impact the program flow. To further improve security, CaMeL relies on a notion of a capability to prevent the exfiltration of private data over unauthorized data flows.

143

1,097

106,309