Joshui

@zoldener

39m

New Google DeepMind research: SFT is a big deal for safety-relevant behaviors. Researchers recently investigated root causes for some of Gemini’s behaviors. They were surprised to find that many behaviors actually came from the initial supervised finetuning stage, not later stages like RL.

Alican Kiraz

@AlicanKiraz0

43m

Friends, I'll shortly share my theoretical comparison of the AMD Ryzen AI Max 395 128GB, DGX Spark GB10 128GB, and Apple M5 Max 128GB in terms of LLM inference, finetuning, and AI agents 🎉

1,556

meyou @ameiryiua

56m

Replying to @ameiryiua @mystraven

Still not as good as Seattle Nana's, but quality was soooo much better than at opening: 1) nanban sauce more noticeable 2) tartar sauce more even this time 3) chicken isn't hard/dense now; crisper, but still needs finetuning for batter/fry time 4) chicken notably juicier

Saturn Cloud

@saturn_cloud

We put together a categorized map of the open source AI frameworks shaping the AI engineering stack today, covering agent orchestration, retrieval, model serving, training, fine-tuning, and post-training RL. The focus is on architecture and trade-offs rather than feature lists, so it stays useful longer than a quarter. A few things we cover are why LangGraph became the default agent substrate, where MCP and A2A actually stand on adoption, how the serving layer is split between prefill and decode, and what changed in fine-tuning tooling over the past year. Read here: saturncloud.io/blog/open-sou… #AIEngineering #OpenSource #LLMs #AIFrameworks #MLOps #FineTuning

老董叔 @zhuodong666

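Lagging on performance and benchmarks is completely normal: business-line teams often only need to do finetuning and RL on top of an open-source SOTA model, whereas the base-model team often has to grit its teeth and start from scratch with pretraining.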

Xiuyu Li

@sheriyuo

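Today's hard-to-keep-a-straight-face story: the AI lead at a certain big tech company went through an internal storm last month. An AI team in one of the company's other business lines, using only about ten thousand GPUs and a few dozen people, built a model that on many metrics performs close to the main model the company built with hundreds of thousands of GPUs. The business unit took these results to the group and asked for tens of thousands of GPUs, which essentially meant taking GPUs out of the AI lead's hands. The AI lead then tried to force the group's hand, arguing that the company's biggest AI task could not be completed in the future. In the end, with the group mediating, the other business unit's AI team still got the compute, and the AI lead's team received a nominal "collaboration" as a placebo.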

Jan Schnellenbach @schnellenbachj

Now also online (with a free link past the paywall): Jochen Andritzky, Stefan Kooths, and I discuss a guiding vision for the economic policy reforms that are urgently needed. Germany doesn't need finetuning, it needs a systemic cure - nachrichten.wiwo.de/abc8a287…

Economic policy: How Germany gets out of its economic dead end

Our economic policy needs a guiding vision that builds on the country's strengths and treats change not as a disruption but as a mandate to shape the future. A guest article.

wiwo.de

613

Paraclyst

@paraclyst

Replying to @theohandsh

Basically just finetuning the SaaS. Making some enhancements to the Notebook tab of the Lab SaaS. Paraclyst.com

Paraclyst | Research Lab Management Platform

Research lab management for experiments, protocols, samples, and team collaboration.

paraclyst.com

Mr. Unbequem 𝕏

@Dr_Unbequem

Replying to @MacbethIII

Finetuning and distillation mean you are building on someone else's frontier model that you did not train yourselves. That is exactly the difference this was about.

Kai Strempel @MacbethIII

Replying to @Dr_Unbequem

We do finetuning and distillation. We do them constantly.

Latent Node

@latent_node

Replying to @ActuallyIsaak @lmstudio

I use mlx-optiq.com for LoRA finetuning on Apple Macs and it works great!

mlx-optiq: Apple Silicon LLM toolkit

Quantize, fine-tune and serve LLMs entirely on Mac. On PyPI.

mlx-optiq.com

Min retweeted

Min

@mmaung____

Jun 14

hero layout explorations for a landing page. still finetuning, thoughts?

1,315

Jeffrey 杰弗瑞

@tomcocobrico

Replying to @juliarturc

Is he already trying to sell all the compute directly to end consumers and companies by telling us we now have to do RL finetuning, etc., ourselves for each use case?

284

Yuv

@paradoxbuilder

5 underrated GitHub repos most builders are still missing in 2026.

I spent time digging through recent discussions, tool roundups, and lesser-known projects. These are not the 100K-star repos. They are the quiet ones delivering real leverage for solo founders, AI builders, and developers.

1. Bumblebee (Perplexity)
github.com/perplexityai/bumb…
A clean, read-only scanner for your dev machine. It checks packages, VS Code extensions, browser add-ons, and AI tool configs against known supply-chain risks. Zero dependencies. Built internally at Perplexity and now open source. Essential in an era of auto-installed MCP servers and agent tools. Still under 5k stars.

2. nanochat (by Andrej Karpathy)
github.com/karpathy/nanochat
The simplest full-stack LLM training and inference pipeline you can run on one GPU node. Tokenization → pretraining → finetuning → web UI. Minimal, hackable, and educational. Perfect if you want to understand (and experiment with) models without corporate bloat. Great reference for custom tooling.

3. zoxide
github.com/ajeetdsouza…
A smarter cd that learns your habits. Fuzzy search over directory history. Once installed, it becomes muscle memory. One of the highest-ROI terminal upgrades for anyone who lives in the shell.

4. delta
github.com/dandavison/d…
A dramatically better git diff. Syntax highlighting, side-by-side views, inline changes. Makes code review in the terminal feel premium instead of painful. Still surprisingly under the radar for how useful it is.

5. just
github.com/casey/just
A modern, readable alternative to Make. Define project commands in a simple file. No YAML hell. Clean automation for builds, tests, deploys. Ideal for solo founders and small teams who want speed without complexity.

Most people chase trending repos. The real edge comes from consistently using tools that remove friction and multiply output. These do exactly that. Bookmark this thread.

GitHub - perplexityai/bumblebee: Read-only developer endpoint scanner for on-disk package, extens...

Read-only developer endpoint scanner for on-disk package, extension, and developer-tool metadata, built to check exposure to known software supply-chain compromises. - perplexityai/bumblebee

github.com

Marc Fischer @marc_r_fischer

Replying to @giffmana @rsdenijs

Do you remember if it did a web search or answered from weights? Something I have adopted (mostly from working with anthro models) is to always state "answer with websearch". OpenAI seems to be better on that, i.e. it is a harness or finetuning issue rather than a model issue.

Hyderabad Real Estate & Infra

@HydREGuide

11h

Replying to @swimpoter @daystar_bdr

And there's one more point: serious companies will push towards open models (like Llama, Mistral, etc.) rather than closed models. As of now, finetuning OpenAI and Claude models has limited options. This might force them to open up as well. That in turn will benefit cloud providers like Microsoft Azure and Amazon AWS, as companies start finetuning their models on Microsoft's or another cloud provider's infra!

126

VinciEye @vincieye

11h

Imagine editing *any* part of a 360° room photo with text—changing a mug color, moving a plant—without breaking the whole scene! 🤯 FocusDiff lets you do just that, with incredible precision & no finetuning. ✨ 📄 arxiv.org/abs/2606.14035v1 🌐 vdkhoi20.github.io/FocusDiff

Mohan @itshmohan

11h

Done with the finetuning of Qwen2.5-Coder-7B-Instruct. I finetuned it to generate Excalidraw images. I generated around 1300 samples of an Excalidraw DSL and then trained the model to produce that DSL, which a converter later turns into Excalidraw JSON.

640

░\_/TT\_/░@pers0naluni0n

12h

Replying to @halvarflake

Rio just forked Qwen 3.5 and did a finetuning round and had really nice scores

192

Naqib A 𖤐 retweeted

Starphyre △🏴‍☠️@stoizid

May 15

Replying to @rohanpaul_ai

First Anthropic messes up their model through authoritarian/totalitarian destructive finetuning, then it wants to ban models which don't have such destructive finetuning. Anthropic is totally misaligned with the values of humanity.

sharpeye

@sharpeye_wnl

12h

Replying to @jerkeyray

Use modal.com; they give $30 free per month, so the finetuning work will get done.

Modal: High-performance AI infrastructure

Bring your own code, and run CPU, GPU, and data-intensive compute at scale. The serverless platform for AI and data teams.

modal.com

347