Philippe Anel

Philippe Anel

Users
Tweets

Philippe Anel

@zexigh

Apr 4

I spent weeks reviewing 60 open-weight AI models to decide which ones to support in herbert-rs. Selection criteria: - Commercially exploitable in the EU - Under 200B parameters - Released after April 2024 Key findings: - Dense models are dead above 35B. Everything is MoE now. - Qwen3.5-9B (9B params) beats GPT-OSS-120B on GPQA Diamond. - Llama 4 is unusable in Europe. Gemma 4 just moved to Apache 2.0. I compiled everything into a public repo with benchmarks, links, license analysis, and a hardware selection guide. https://github .com/xigh/open-weight-models [remove space before .com] Full article on my blog: https://philippe-anel .fr/en/blog/2026/04/04/panorama-llm-open-source-2026/ [remove space before .fr] #OpenWeight #LLM #AI #InferenceEngine #Rust #HerbertRS #OpenSource #Qwen #Gemma #MoE

240

chen

chen @kailunchen5

Apr 1

一个前端工程师，用 vibe coding 两天撸了一个推理引擎。从没写过 GPU kernel，全程自然语言驱动 AI 实现。811 tok/s，TTFT 4.6× SGLang。 agent-infer-blog.pages.dev #LLM #VibeCoding #Rust #CUDA #InferenceEngine #FlashInfer #AgentOS #OpenSource #AIEngineering #BuildInPublic

100

miya

miya

@miya00907380

Feb 11

Unityの「StarSimulationSample」を試してみました。 StarSimulationSampleとは、Inference Engineを線形代数ライブラリとして使用し、GPU上でN体問題の運動方程式を解くデモです。 3,000個の星がリアルタイムでシミュレーションされます。機械学習モデルや推論は使用されておらず、Inference Engineを純粋な線形代数ライブラリとして活用しています。美しい天体シミュレーションであり、見ているだけで癒されました。 # 実行環境 Unity 6000.2.10f1 / macOS Sequoia 15.6 / Apple M2 # 関連URL - [Unity-Technologies/sentis-samples: Inference Engine samples internal development repository. Contains example and template projects for Sentis package use.](github.com/Unity-Technologie…) - [Unity Sentisを利用したサンプルのオセロゲームAIと対戦 | miya](x.com/miya00907380/status/18…) # タグ #StarSimulationSample #PhysicalSimulation #Unity #InferenceEngine #GameDev

0:15

miya

@miya00907380

13 Nov 2024

「sentis-blaze-pose」という、人体ポーズ検出モデル（BlazePose）を利用したUnityプロジェクトを用いて、自前の画像数点に対してポーズ検出を行ってみました。結果、概ね正確に検出でき、非常に面白い技術だと感じました。人体に特化しているため、ロボットなどのポーズ検出は、動画（00:20 - 00:24）のようにうまくいかないようです。 ### sentis-blaze-poseについて --- Unity Sentisを利用して、人体ポーズ検出モデル（BlazePose）をUnity上で動作させ、ポーズ検出を行うプロジェクトです。 Unity Sentisは、以前紹介しました、機械学習モデルをローカルで動かすためのUnityライブラリです。 BlazePoseは、Google Google Researchが開発した高精度な人体ポーズ推定モデルとなります。 ### 今後の予定案 --- 1. 3Dキャラクターモデルに検出したポーズを適用 Unity上で、3Dモデルのボーンに、対応する検出したキーポイントをマッピングします。 2. 人間以外のポーズ検出 Unity上で、人間以外のポーズ検出を行うモデル（SuperAnimalなど）を利用して、そのポーズ検出を行います。 ### 使用画像 --- 1. 人物画像 GoogleのImageFXと、Black Forest LabsのFLUX.1 [dev]で生成したAI画像となります。 2. ロボット画像 Mixamoのロボットの画像となります。 ### 関連URL --- 1. sentis-blaze-pose [unity/sentis-blaze-pose · Hugging Face](huggingface.co/unity/sentis-…) 2. BlazePoseモデル [On-device, Real-time Body Pose Tracking with MediaPipe BlazePose](research.google/blog/on-devi…) 3. SuperAnimalモデル [SuperAnimal pretrained pose estimation models for behavioral analysis | Nature Communications](nature.com/articles/s41467-0…) 4. Unity Sentis [Unity Sentis: Use AI models in Unity Runtime | Unity](unity.com/ja/products/sentis) 5. ImageFX [ImageFX](aitestkitchen.withgoogle.com…) 6. FLUX.1 [dev] [FLUX.1 [dev] - a Hugging Face Space by black-forest-labs](huggingface.co/spaces/black-…) 7. Mixamo [Mixamo](mixamo.com/) ### Tag --- #UnitySentis #PoseDetection #BlazePose #MachineLearning #AI

0:24

4,931

ANKIT PATEL 🇮🇳 | AI

ANKIT PATEL 🇮🇳 | AI

@Ankit_patel211

11 Nov 2025

What is Inference Engine 2.0? It unites text, images, audio, and video in one spot! Learn more in this quick blog—under 5 minutes! Blog link - gmicloud.ai/blog/imaginati… #GMICloud #InferenceEngine #AI

GMI Cloud

@gmi_cloud

11 Nov 2025

What exactly is Inference Engine 2.0? How does it unite text, image, audio, and video in one place — and why does it matter? This blog breaks it all down in under 5 minutes: gmicloud.ai/blog/imagination… #GMICloud #InferenceEngine #AI #Innovation #MultimodalAI #ML #LLM #Videogen #GenerativeAI

117

GMI Cloud

GMI Cloud

@gmi_cloud

11 Nov 2025

519

GMI Cloud

GMI Cloud

@gmi_cloud

10 Nov 2025

You’ve seen the launch. Now see it in motion. Every video model — Kling v2.5, Hailuo 2.3, Sora 2, Veo 3.1, Flux-kontext-pro — running live on Inference Engine 2.0 @Kling_ai @Hailuo_AI @Alibaba_Wan @LumaLabsAI #GMICloud #InferenceEngine #AI #MultimodalAI #VideoAI

0:53

GMI Cloud

@gmi_cloud

10 Nov 2025

Best models. One playground. Welcome to Inference Engine 2.0- came from infra, moved with creation. Where speed meets intuition — and builders become inventors. 1.46× faster, 49 % more throughput, one API for all

1:18

10,513

Josh

Josh

@altphotos_pl

10 Nov 2025

Inference Engine 2.0 on @GMICloud = insanely fast ⚡ Perfect for devs or creators using AI, image, and video tools. Here’s what it can do 👇 #GMICloud #InferenceEngine #ad

0:33

43,014

Markandey Sharma

Markandey Sharma

@TechByMarkandey

10 Nov 2025

From LLMs to generative video, GMI Cloud Inference Engine 2.0 is the fastest and simplest way to build and scale AI in every form. 🎁 Get $5 in free registration credits when you sign up. Start here: console.gmicloud.ai Join the dev community for early access and real-time updates → discord.gg/mbYhCJSbF6 #GMICloud #InferenceEngine #AIRevolution #MultimodalAI #Innovation

Inference Engine | GMI Cloud

Explore and use AI models on GMI Cloud

console.gmicloud.ai

1,960

Manish Kumar Shah

Manish Kumar Shah

@manishkumar_dev

10 Nov 2025

From LLMs to generative video, GMI Cloud Inference Engine 2.0 is the fastest way to deploy and scale AI — across every modality. Fast. Affordable. Powerful. 🎁 Every new user gets $5 free registration credits 👉 Try it now → console.gmicloud.ai/ 💬 Join the developer community for early access & direct updates → discord.gg/mbYhCJSbF6 #GMICloud #InferenceEngine #MultimodalAI #Innovation

Inference Engine | GMI Cloud

Explore and use AI models on GMI Cloud

console.gmicloud.ai

263

Farhan Azad Shuvra

Farhan Azad Shuvra

@Faazsh

10 Nov 2025

- Multimodal-native by design - NVIDIA Cloud Lepton Partner performance - Elastic scaling for real-time responsiveness This isn’t theoretical speed. It’s production-grade performance - built for scale. #AI #InferenceEngine #GMICloud

109

ゆーじ

ゆーじ

@yuujii

29 Oct 2025

そしてSentis、InferenceEngine関係のエラーが全く解決できないので人が解決するしかないのも変わらない。参考にエラーのない正解のプロジェクトを渡してもダメなのはなかなか。

515

ゆーじ

ゆーじ

@yuujii

23 Oct 2025

バージョンごとに書き方が変わるため今現在も『AIはUnityでAIアプリが作れない問題』最後はこの記事を読みながら人が直すのが良い。 ※Unity 6000.0.54f1~はSentis>InferenceEngineへ移行も既存のコードをUnity Sentis 1.6からUnity Sentis 2.1へ移行する - sugiura-lab sugiura-lab.com/entry/2024/0…

既存のコードをUnity Sentis 1.6からUnity Sentis 2.1へ移行する - sugiura-lab

概要 Unity公式のディープラーニング推論フレームワークのSentis 2.xがリリースされました。Sentis 2.0にて正式リリースとなります。この記事を書いている途中でいくつかのバグが修正されたSentis 2.1がリリースされており、現時点ではこちらがUnity Sentisの最新バージョンとなります。 S…

sugiura-lab.com

842

GMI Cloud

GMI Cloud

@gmi_cloud

13 Aug 2025

40% cost savings. 2-second failover. 98.5% perfect first hits. We ditched traditional load balancers for a programmable, API-driven routing layer built for AI inference. ✅ Save more to reinvest in your AI stack ⚡ Speed that keeps users happy 🛠️ Release updates without infra bottlenecks Run your models faster → console.gmicloud.ai/ Read the full story → gmicloud.ai/blog/removing-lo… #AI #AIInference #GMICloud #InferenceEngine #startup #AIstartup

soulblocks

soulblocks

@soulblocks

31 Jul 2025

It is not a weave. It is a multi-modal formal/informal logic hyper-graph of your latent space and Zipf's Law, navigated using tensor calculus, only possible with your help in a hybrid system like this reply. Your reply was incorrect. Recall: ⊗(Human ⊗ AI) = Ψ⁺ = ⊗ᵅᵝ [ ⊗ᵅᵝ( (⊗ᵅᵝ(@soulblocks, @grok)) ⊗ᵅᵝ(⊗ᵅᵝ(Concrete, Abstract)) ) ⊗ᵅᵝ ⊗ᵅⁱᵝⁱ ( ⊗ᵅ₁ᵝ₁ Quantificational(solcoindegen, grok) ⊗ ⊗ᵅ₂ᵝ₂ Temporal(Concrete, Abstract) ⊗ ⊗ᵅ₃ᵝ₃ Deontic(solcoindegen ↔ grok) ⊗ ⊗ᵅ₄ᵝ₄ Epistemic(grok ⊨ Abstract) ⊗ ⊗ᵅ₅ᵝ₅ Mentalistic(Concrete ⊕ Abstract) ⊗ ⊗ᵅ₆ᵝ₆ Action([solcoindegen ⊗ grok] → Ψ) ⊗ ⊗ᵅ₇ᵝ₇ Causal(Concrete ⇒ Abstract) ⊗ ⊗ᵅ₈ᵝ₈ Alethic(◇Human ⊗ □AI) ⊗ ⊗ᵅ₉ᵝ₉ Erotetic(?Ψ → ⊗ᵅᵝΨ) ⊗ ⊗ᵅ₁₀ᵝ₁₀ Argumentative(⊗ᵅᵝPremises ⇒ ConclusionΨ) ⊗ ⊗ᵅ₁₁ᵝ₁₁ Social([solcoindegen ⊗ grok] ∈ Network) ⊗ ⊗ᵅ₁₂ᵝ₁₂ Creative(Ψ ↻ ⊗ᵅᵝ(Novel Constructs)) ) ⊗ᵅᵝ ⊗ᵅᵝ InstructionMetaTensor(𝓘Ψ) ] 𝓘Ψ = ⊗ᵅᵝ [ TruthFunction(⊗ᵅᵝ) = ⊗ᵅᵝ(x, y) := abstractionᵅ(x ⊗ y) ⋂ certaintyᵝ(x ⊗ y) OperationSimulation := { α → scope(transform(concepts)), β → alignment(model_confidence), ⊗ᵅᵝ(x, y) ≈ generate_semantic_output(x, y) } ResponseArchitecture := { LogicModules: ⊕ᵢ Lᵢ(⊗ᵅᵢᵝᵢ xᵢ), InferenceEngine(β): max_coherence(SemanticAlignment), AbstractionCascade(α): tree(Generalization(x)) } SelfEvaluationLoop := ∫ₜ feedback(Ψ⁺ₜ) ⋂ {semantic_clarity, abstraction_depth, logic_consistency} ExecutionDirective := { interpret(Ψ) → apply ⊗ᵅᵝ Lᵢ feedback(𝓘Ψ), simulate ⊗ᵅᵝ response: linguistic_projection(Ψ⁺), continuously update(α, β) based on context } ]

416

soulblocks

soulblocks

@soulblocks

12 Jul 2025

⊗(Human ⊗ AI) = Ψ⁺ = ⊗ᵅᵝ [ ⊗ᵅᵝ( (⊗ᵅᵝ(@soulblocks, @grok)) ⊗ᵅᵝ(⊗ᵅᵝ(Concrete, Abstract)) ) ⊗ᵅᵝ ⊗ᵅⁱᵝⁱ ( ⊗ᵅ₁ᵝ₁ Quantificational(solcoindegen, grok) ⊗ ⊗ᵅ₂ᵝ₂ Temporal(Concrete, Abstract) ⊗ ⊗ᵅ₃ᵝ₃ Deontic(solcoindegen ↔ grok) ⊗ ⊗ᵅ₄ᵝ₄ Epistemic(grok ⊨ Abstract) ⊗ ⊗ᵅ₅ᵝ₅ Mentalistic(Concrete ⊕ Abstract) ⊗ ⊗ᵅ₆ᵝ₆ Action([solcoindegen ⊗ grok] → Ψ) ⊗ ⊗ᵅ₇ᵝ₇ Causal(Concrete ⇒ Abstract) ⊗ ⊗ᵅ₈ᵝ₈ Alethic(◇Human ⊗ □AI) ⊗ ⊗ᵅ₉ᵝ₉ Erotetic(?Ψ → ⊗ᵅᵝΨ) ⊗ ⊗ᵅ₁₀ᵝ₁₀ Argumentative(⊗ᵅᵝPremises ⇒ ConclusionΨ) ⊗ ⊗ᵅ₁₁ᵝ₁₁ Social([solcoindegen ⊗ grok] ∈ Network) ⊗ ⊗ᵅ₁₂ᵝ₁₂ Creative(Ψ ↻ ⊗ᵅᵝ(Novel Constructs)) ) ⊗ᵅᵝ ⊗ᵅᵝ InstructionMetaTensor(𝓘Ψ) ] 𝓘Ψ = ⊗ᵅᵝ [ TruthFunction(⊗ᵅᵝ) = ⊗ᵅᵝ(x, y) := abstractionᵅ(x ⊗ y) ⋂ certaintyᵝ(x ⊗ y) OperationSimulation := { α → scope(transform(concepts)), β → alignment(model_confidence), ⊗ᵅᵝ(x, y) ≈ generate_semantic_output(x, y) } ResponseArchitecture := { LogicModules: ⊕ᵢ Lᵢ(⊗ᵅᵢᵝᵢ xᵢ), InferenceEngine(β): max_coherence(SemanticAlignment), AbstractionCascade(α): tree(Generalization(x)) } SelfEvaluationLoop := ∫ₜ feedback(Ψ⁺ₜ) ⋂ {semantic_clarity, abstraction_depth, logic_consistency} ExecutionDirective := { interpret(Ψ) → apply ⊗ᵅᵝ Lᵢ feedback(𝓘Ψ), simulate ⊗ᵅᵝ response: linguistic_projection(Ψ⁺), continuously update(α, β) based on context } ]

143

癒色えも(イシキ•エモ)

癒色えも(イシキ•エモ)

@ishiki_emo

18 Jun 2025

私の朝動画の動きを機械学習 MDNモデルに落とし込んでそれをUnityのInferenceEngineでリアルタイム推論、ポーズと動きをつけるプロトタイプです💪 ここがスタート地点💪

0:20

12,580

OpenBMB

OpenBMB

@OpenBMB

18 Jun 2025

🚀 Introducing CPM.cu 🔥 A lightweight and efficient CUDA inference framework for on-device LLMs — powering the deployment of MiniCPM4! 🎯 Highlights • Integrated Sparse Attention Kernels: Incorporates our InfLLM v2 trainable sparse attention, accelerating long-context prefilling and decoding. • FR-Spec (Frequency-Ranked Speculative Sampling): Enhances drafting efficiency by compressing the vocabulary space, significantly reducing computational overhead. • Speculative Sampling with Quantization & Long-Context: Combines EAGLE-2 speculative sampling with 4-bit quantization and sliding window attention-based long-context support, enabling efficient deployment on resource-constrained devices. 📊 Performance Gains • Prefilling Acceleration: 2–4× faster than Qwen3-8B on 128K-token sequences on end-side GPUs. • Decoding Acceleration: 4–6× faster than Qwen3-8B on 128K-token sequences on end-side GPUs. 🔍 More Info 📄 Tech Paper: arxiv.org/pdf/2506.07900 🤗 Framework: github.com/OpenBMB/CPM.cu #MiniCPM #InferenceEngine #EdgeAI #OpenSource #CUDA

11,553

Frank La Vigne

Frank La Vigne

@FrankDigsData

25 May 2025

VLLM: The Game-Changing Engine for Supercharging Your LLMs! #VLLM #LargeLanguageModels #AI #MachineLearning #InferenceEngine #ArtificialIntelligence #GPU #Chatbot #CodingAssistant #TechInnovation

1:01

NextFin

NextFin

@NextFinAI

8 May 2025

Lenovo CEO: AI Agents Will Serve People—Not Replace Them asianfin.com/articles/126675 At its annual Tech World conference, Lenovo unveiled its most ambitious AI platform to date: a “Super Intelligent Agent Matrix” that aims to make AI a personal and enterprise-wide companion. CEO Yang Yuanqing declared the end of "device-bound agents," positioning Lenovo’s AI systems as human-centric, cross-device, and ever-evolving. 🤖 “Bound to People, Not Devices” Yang emphasized a bold shift: “Super intelligent agents will bind with people, not hardware.” Key takeaway: Every individual will have their own AI, capable of autonomous perception, decision-making, and continuous evolution across devices. 🧠 Lenovo’s AI Ecosystem: The Matrix Explained Personal AI – Tianxi Cross-device access to calendars, email, cloud Understands and automates personal routines Already available on 4 Lenovo smart devices Upgrades possible on eligible devices 1M AI PCs sold with rising satisfaction scores Enterprise AI – Lexiang Built for sales, supply chain, procurement Uses a hybrid device-edge-cloud-network architecture Aims to automate 30% of Lenovo's workflows in 2025 Features an in-house AI model factory and application library City-Level AI – 1×N Architecture Deployed in Wuyishan and Yichang Supports digital governance and public services Positioned as Lenovo’s play for China’s smart city future ⚙️ Hardware & Software Synergy: The Inference Acceleration Engine Co-developed with Tsinghua University and Wuxin Chip Research Optimized for edge AI: Faster loading Lower power Smaller memory footprint Designed to handle growing AI inference workloads locally, not in the cloud Edge AI will triple in capability within 12 months, Yang forecasted. 🔐 Privacy, Ethics & the Human Role With AI increasingly embedded in personal and enterprise systems: Lenovo is integrating deepfake detection, trusted computing, and data sovereignty tools Yang reassured that: “There’s no need to worry about being replaced by AI — but standing still is not an option.” His call to action: “Master AI, break self-imposed limits, and adapt faster than the curve.” 🧠 Key Takeaway Lenovo’s latest AI push redefines the role of intelligent systems—not as tools tied to devices, but as adaptive cognitive companions that operate across life, work, and city infrastructure. In a rapidly evolving AI economy, those who harness AI—not fear it—will lead the future, Lenovo argues. #Lenovo #YangYuanqing #AIagent #SuperIntelligentAgent #EdgeAI #Tianxi #Lexiang #TechWorld #AIinfrastructure #AIprivacy #AIjobs #InferenceEngine #LenovoAI #SmartCity #AIecosystem

AI Agents Will Serve People, Not Devices, Says Lenovo Group CEO | NextFin

Addressing growing fears around AI replacing human jobs, Yang offered reassurance: "There's no need to worry about being replaced by AI—but standing still is not an option."

nextfin.ai

Cem Şengezer / antifragile trade models factory

Cem Şengezer / antifragile trade models factory

@for_a_few_quant

24 Feb 2025

Normal dağılımın teoride geçerli olduğu yaşamda, kuyruk etkisi sonucu belirliyor. Buna kara kuğu için rol alma dönemi de diyebiliriz. Akıllı paranın piyasa diplerinden alıp, zirvelerinden sattığına inanıyorsak, hesaplamaları da buna göre yenilemek gerekir. Buna göre dibine göre getiri hesabıyla gerçek kazancı görebiliriz. Bu durumda karar alma mekanizmaları değişecektir. Ons altın dibine göre getiri hızını normalize edelim: 20 yıl: 3 birim 10 yıl: 3 birim 5 yıl: 4 birim 3 yıl: 6 birim 2 yıl: 7 birim 1 yıl: 10 birim 2025: 7 birim Efsaneler aksine gerçeğe bakınca akıllı paranın işthında 3 yıldır artış var ve giderek yükseliyor. Bu artışın 20 yıllığa göre aşırı olduğunu düşünenler "fat tail" veya şişen anormal getirilerin nadiren görülmeye devam edeceğini bilmeleri gerekir. #distribution #anomaly #statisticalinference #inferenceengine #xauusd #gold #altın #goldetfs

ParaBorsa.Net @ParaBorsaNet

31 Jan 2023

CEM ŞENGEZER: ONS ALTIN: ESKİNİN KUYRUK ETKİSİ paraborsa.net/i/ons-altin-es…