Numman Ali

Numman Ali

857 Photos and videos

Tweets

Pinned Tweet

Numman Ali

@nummanali

Feb 16

x.com/i/article/202349660287…

22,754

Numman Ali

Numman Ali

@nummanali

Jun 6

TIL the Codex remote feature works even if it’s not your account if you use SSH login Super useful if you’re using multiple accounts or like me, assisting friends and family Setup Tailscale, login with the device password and bobs your uncle

1,079

Oikon

Numman Ali retweeted

Oikon

@oikon48

Jun 4

Anthropicの再帰的自己改善（RSI）についてのブログの中で、Anthropic社内で起こってる話が面白い👀 • 2026年5月時点で、マージされたコードの80%以上がClaudeによって書かれている • エンジニア1人あたりのマージされたコード量が、2024年比で8倍になった • コードの品質はすでに人間と同等レベルに達しており、1年以内に人間を上回ると予想 • コード最適化の実験で、52倍の高速化を達成（Claude Mythos Preview） • オープンエンドなコーディング/エンジニアリングタスクでの成功率が76%に到達 • 一部のエンジニアは「もう自分でコードを書いていない」と公言しているこのままAIが自分自身の後継を自律的に設計・開発できる状態が進んだ場合のシナリオとしては3つのパターンが言及されている: 1. トレンドが停滞するが、現在の能力が広く普及する  S字カーブで成長が鈍化する可能性（新しいアーキテクチャの限界、計算資源・電力の制約など）。それでも「100人の会社が1,000人分の仕事をこなせるようになる」ような変化は起きる。 2. 効率向上が続き、AI開発がさらに自動化される  人間は方向性と最終判断を担い、AIが大部分をやる状態。生産性が劇的に上がる一方で、権威主義的な監視や個人向け操作などの悪用リスクも高まる。100人の会社が1万〜10万人規模の仕事をこなせる 3. 本格的な再帰的自己改善が始まる  AIが自分で後継モデルを設計・訓練するようになる。ペースは計算資源次第。人間は主に「監督・検証・安全確認」の役割にシフトするが、価値観のずれが複利的に拡大するリスクも指摘されている。 anthropic.com/institute/recu…

Anthropic

@AnthropicAI

Jun 4

Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention. anthropic.com/institute/recu…

284

76,696

Anthropic

Numman Ali retweeted

Anthropic

@AnthropicAI

Jun 4

When AI builds itself

Our progress toward recursive self-improvement, and its implications.

anthropic.com

1,771

4,661

28,649

18,494,449

Numman Ali

Numman Ali

@nummanali

Jun 4

Cool project to generate synthetic agent traces

Julien Chaumond

@julien_c

Jun 4

Today I'm launching a new project called SynthTraces 🔥 It is a minimal codebase to generate synthetic coding agent session traces using Pi (from @badlogicgames) I wanted a large number of coding-agent traces, so I built a tiny harness where two models talk to each other: - an open model (served via HF Inference Providers) plays the coding agent. It gets read bash access to a real open source codebase (the huggingface OSS projects) - a small local model (llama.cpp) plays the human user, asking simple questions like "how do I run this?" or "how is CI set up?" The result is more than 2,000 Pi session traces which can be used to train or fine-tune LLMs, and optimize them for Pi 🤯 And ofc everything is published on @huggingface ✅

1,680

Numman Ali

Numman Ali

@nummanali

Jun 4

Going deep on flueframework.com - The Agent Harness Framework Claude Code Dynamic Workflows are really good for large structured implementations I actually like using the Claude Code desktop app with it, the visualisation is quite nice

999

Numman Ali

Numman Ali

@nummanali

Jun 3

Codex / ChatGPT design system is become very inviting It makes me feel like the future is mind with the subtle glows and gel like icons Would love to know how they did the design research

1,083

Numman Ali

Numman Ali

@nummanali

Jun 2

This is super cool, your own company Lovable

OpenAI

@OpenAI

Jun 2

Building apps has never been easier. With Sites, Codex can turn your work, ideas, and plans into an interactive website or app your team can explore, use, and share with a URL. Rolling out to Business and Enterprise plans, before expanding more broadly.

0:46

849

Numman Ali

Numman Ali

@nummanali

Jun 2

SSH between old and new laptop Codex handling configuration of all CLI tools and configs in one swoop It’s the little things

2,312

Numman Ali

Numman Ali

@nummanali

Jun 1

Yoooooooooooooo

xAI

@xai

Jun 1

Composer 2.5 is now available inside Grok Build. Composer 2.5 is a fast, highly intelligent model that excels on long-running tasks and following complex instructions.

0:04

1,346

Numman Ali

Numman Ali

@nummanali

Jun 1

Kind of criminal that @pontusab doesn’t have more GitHub sponsors If you want to support a healthy OSS ecosystem building in the AI Agent space, then add yourself to the list List of projects in thread🧵 github.com/sponsors/pontusab

8,620

more replies

Numman Ali

Numman Ali

@nummanali

Jun 1

CalText iMessage calorie tracking assistant powered by AI. github.com/pontusab/caltext

260

Numman Ali

Numman Ali

@nummanali

Jun 1

HyperJS Fast, opinionated, AI-native API framework for Bun. hyperjs.ai/

Hyper — an API framework for Bun, distributed as source

An HTTP framework for Bun. The CLI copies the components you want into your repo. No runtime dependency on the framework — the code is yours.

hyperjs.ai

281

Numman Ali

Numman Ali

@nummanali

Jun 1

Pontus will you leave anything for the rest of us? Too cool project, this is what you’d expect from a frontier lab building iMessage / WhatsApp agents He’s giving it out for free! Definitely going to make a weekend project on this

Pontus Abrahamsson — oss/acc

@pontusab

Jun 1

I just shipped message-ui, build dynamic iMessage attachments with React. ◆ Charts ◆ Tables ◆ Text primitives ◆ Local preview PNG export ◆ Tailwind support ◆ Works with Chat SDK Link ⬇️🧵

0:11

543

Numman Ali

Numman Ali

@nummanali

Jun 1

Codex App > Codex CLI You (and the team!) did it @ajambrosino I want to know how you fixed the performance for multiple agents and long threads A month ago it couldn't handle my workflows Blog post soon?

2,328

Numman Ali

Numman Ali

@nummanali

Jun 1

My Claude Code usage has gone from very little to many hours a day thanks to Dynamic Workflows But guess what - this means RLMs (Recursive Language Models) are directionally correct Time to spend time with aithy.dev/ and axllm.dev/ from @dosco

7,970

Numman Ali

Numman Ali

@nummanali

Jun 1

OSS Calorie Tracking through iMessage Coolest part is that this is the perfect blueprint for making an iMessage assistant, it utilises the Chat SDK under the hood Recommend reading the code (Pontus writes top 1% of code IMO) and fork the repo to experiment with your own ideas

Pontus Abrahamsson — oss/acc

@pontusab

Jun 1

Introducing Caltext, open-source calorie tracking in iMessage. Built with: • Bun Turborepo • Hono on Nitro (Vercel) • Chat SDK Sendblue • AI SDK GPT-4.1 vision • Upstash Redis • Vercel Workflows • USDA FoodData Central Link ⬇️🧵

0:11

3,627