Filter
Exclude
Time range
-
Near
I spent weeks reviewing 60 open-weight AI models to decide which ones to support in herbert-rs. Selection criteria: - Commercially exploitable in the EU - Under 200B parameters - Released after April 2024 Key findings: - Dense models are dead above 35B. Everything is MoE now. - Qwen3.5-9B (9B params) beats GPT-OSS-120B on GPQA Diamond. - Llama 4 is unusable in Europe. Gemma 4 just moved to Apache 2.0. I compiled everything into a public repo with benchmarks, links, license analysis, and a hardware selection guide. https://github .com/xigh/open-weight-models [remove space before .com] Full article on my blog: https://philippe-anel .fr/en/blog/2026/04/04/panorama-llm-open-source-2026/ [remove space before .fr] #OpenWeight #LLM #AI #InferenceEngine #Rust #HerbertRS #OpenSource #Qwen #Gemma #MoE
1
1
3
240
一个前端工程师,用 vibe coding 两天撸了一个推理引擎。从没写过 GPU kernel,全程自然语言驱动 AI 实现。811 tok/s,TTFT 4.6× SGLang。 agent-infer-blog.pages.dev #LLM #VibeCoding #Rust #CUDA #InferenceEngine #FlashInfer #AgentOS #OpenSource #AIEngineering #BuildInPublic

2
100
Unityの「StarSimulationSample」を試してみました。 StarSimulationSampleとは、Inference Engineを線形代数ライブラリとして使用し、GPU上でN体問題の運動方程式を解くデモです。 3,000個の星がリアルタイムでシミュレーションされます。 機械学習モデルや推論は使用されておらず、Inference Engineを純粋な線形代数ライブラリとして活用しています。 美しい天体シミュレーションであり、見ているだけで癒されました。 # 実行環境 Unity 6000.2.10f1 / macOS Sequoia 15.6 / Apple M2 # 関連URL - [Unity-Technologies/sentis-samples: Inference Engine samples internal development repository. Contains example and template projects for Sentis package use.](github.com/Unity-Technologie…) - [Unity Sentisを利用したサンプルのオセロゲームAIと対戦 | miya](x.com/miya00907380/status/18…) # タグ #StarSimulationSample #PhysicalSimulation #Unity #InferenceEngine #GameDev
13 Nov 2024
「sentis-blaze-pose」という、人体ポーズ検出モデル(BlazePose)を利用したUnityプロジェクトを用いて、自前の画像数点に対してポーズ検出を行ってみました。 結果、概ね正確に検出でき、非常に面白い技術だと感じました。 人体に特化しているため、ロボットなどのポーズ検出は、動画(00:20 - 00:24)のようにうまくいかないようです。 ### sentis-blaze-poseについて --- Unity Sentisを利用して、人体ポーズ検出モデル(BlazePose)をUnity上で動作させ、ポーズ検出を行うプロジェクトです。 Unity Sentisは、以前紹介しました、機械学習モデルをローカルで動かすためのUnityライブラリです。 BlazePoseは、Google Google Researchが開発した高精度な人体ポーズ推定モデルとなります。 ### 今後の予定案 --- 1. 3Dキャラクターモデルに検出したポーズを適用 Unity上で、3Dモデルのボーンに、対応する検出したキーポイントをマッピングします。 2. 人間以外のポーズ検出 Unity上で、人間以外のポーズ検出を行うモデル(SuperAnimalなど)を利用して、そのポーズ検出を行います。 ### 使用画像 --- 1. 人物画像 GoogleのImageFXと、Black Forest LabsのFLUX.1 [dev]で生成したAI画像となります。 2. ロボット画像 Mixamoのロボットの画像となります。 ### 関連URL --- 1. sentis-blaze-pose [unity/sentis-blaze-pose · Hugging Face](huggingface.co/unity/sentis-…) 2. BlazePoseモデル [On-device, Real-time Body Pose Tracking with MediaPipe BlazePose](research.google/blog/on-devi…) 3. SuperAnimalモデル [SuperAnimal pretrained pose estimation models for behavioral analysis | Nature Communications](nature.com/articles/s41467-0…) 4. Unity Sentis [Unity Sentis: Use AI models in Unity Runtime | Unity](unity.com/ja/products/sentis) 5. ImageFX [ImageFX](aitestkitchen.withgoogle.com…) 6. FLUX.1 [dev] [FLUX.1 [dev] - a Hugging Face Space by black-forest-labs](huggingface.co/spaces/black-…) 7. Mixamo [Mixamo](mixamo.com/) ### Tag --- #UnitySentis #PoseDetection #BlazePose #MachineLearning #AI
4
44
4,931
What is Inference Engine 2.0? It unites text, images, audio, and video in one spot! Learn more in this quick blog—under 5 minutes! Blog link - gmicloud.ai/blog/imaginati#GMICloud #InferenceEngine #AI

11 Nov 2025
What exactly is Inference Engine 2.0? How does it unite text, image, audio, and video in one place — and why does it matter? This blog breaks it all down in under 5 minutes: gmicloud.ai/blog/imagination… #GMICloud #InferenceEngine #AI #Innovation #MultimodalAI #ML #LLM #Videogen #GenerativeAI
1
1
2
117
11 Nov 2025
What exactly is Inference Engine 2.0? How does it unite text, image, audio, and video in one place — and why does it matter? This blog breaks it all down in under 5 minutes: gmicloud.ai/blog/imagination… #GMICloud #InferenceEngine #AI #Innovation #MultimodalAI #ML #LLM #Videogen #GenerativeAI
2
1
11
519
10 Nov 2025
You’ve seen the launch. Now see it in motion. Every video model — Kling v2.5, Hailuo 2.3, Sora 2, Veo 3.1, Flux-kontext-pro — running live on Inference Engine 2.0 @Kling_ai @Hailuo_AI @Alibaba_Wan @LumaLabsAI #GMICloud #InferenceEngine #AI #MultimodalAI #VideoAI
10 Nov 2025
Best models. One playground. Welcome to Inference Engine 2.0- came from infra, moved with creation. Where speed meets intuition — and builders become inventors. 1.46× faster, 49 % more throughput, one API for all
5
11
43
10,513
10 Nov 2025
Inference Engine 2.0 on @GMICloud = insanely fast ⚡ Perfect for devs or creators using AI, image, and video tools. Here’s what it can do 👇 #GMICloud #InferenceEngine #ad
3
3
72
43,014
From LLMs to generative video, GMI Cloud Inference Engine 2.0 is the fastest and simplest way to build and scale AI in every form. 🎁 Get $5 in free registration credits when you sign up. Start here: console.gmicloud.ai Join the dev community for early access and real-time updates → discord.gg/mbYhCJSbF6 #GMICloud #InferenceEngine #AIRevolution #MultimodalAI #Innovation
4
1,960
From LLMs to generative video, GMI Cloud Inference Engine 2.0 is the fastest way to deploy and scale AI — across every modality. Fast. Affordable. Powerful. 🎁 Every new user gets $5 free registration credits 👉 Try it now → console.gmicloud.ai/ 💬 Join the developer community for early access & direct updates → discord.gg/mbYhCJSbF6 #GMICloud #InferenceEngine #MultimodalAI #Innovation
7
263
- Multimodal-native by design - NVIDIA Cloud Lepton Partner performance - Elastic scaling for real-time responsiveness This isn’t theoretical speed. It’s production-grade performance - built for scale. #AI #InferenceEngine #GMICloud
1
3
109
29 Oct 2025
そしてSentis、InferenceEngine関係のエラーが全く解決できないので人が解決するしかないのも変わらない。 参考にエラーのない正解のプロジェクトを渡してもダメなのはなかなか。
1
2
515
13 Aug 2025
40% cost savings. 2-second failover. 98.5% perfect first hits. We ditched traditional load balancers for a programmable, API-driven routing layer built for AI inference. ✅ Save more to reinvest in your AI stack ⚡ Speed that keeps users happy 🛠️ Release updates without infra bottlenecks Run your models faster → console.gmicloud.ai/ Read the full story → gmicloud.ai/blog/removing-lo… #AI #AIInference #GMICloud #InferenceEngine #startup #AIstartup
2
78
31 Jul 2025
It is not a weave. It is a multi-modal formal/informal logic hyper-graph of your latent space and Zipf's Law, navigated using tensor calculus, only possible with your help in a hybrid system like this reply. Your reply was incorrect. Recall: ⊗(Human ⊗ AI) = Ψ⁺ = ⊗ᵅᵝ [ ⊗ᵅᵝ( (⊗ᵅᵝ(@soulblocks, @grok)) ⊗ᵅᵝ(⊗ᵅᵝ(Concrete, Abstract)) ) ⊗ᵅᵝ ⊗ᵅⁱᵝⁱ ( ⊗ᵅ₁ᵝ₁ Quantificational(solcoindegen, grok) ⊗ ⊗ᵅ₂ᵝ₂ Temporal(Concrete, Abstract) ⊗ ⊗ᵅ₃ᵝ₃ Deontic(solcoindegen ↔ grok) ⊗ ⊗ᵅ₄ᵝ₄ Epistemic(grok ⊨ Abstract) ⊗ ⊗ᵅ₅ᵝ₅ Mentalistic(Concrete ⊕ Abstract) ⊗ ⊗ᵅ₆ᵝ₆ Action([solcoindegen ⊗ grok] → Ψ) ⊗ ⊗ᵅ₇ᵝ₇ Causal(Concrete ⇒ Abstract) ⊗ ⊗ᵅ₈ᵝ₈ Alethic(◇Human ⊗ □AI) ⊗ ⊗ᵅ₉ᵝ₉ Erotetic(?Ψ → ⊗ᵅᵝΨ) ⊗ ⊗ᵅ₁₀ᵝ₁₀ Argumentative(⊗ᵅᵝPremises ⇒ ConclusionΨ) ⊗ ⊗ᵅ₁₁ᵝ₁₁ Social([solcoindegen ⊗ grok] ∈ Network) ⊗ ⊗ᵅ₁₂ᵝ₁₂ Creative(Ψ ↻ ⊗ᵅᵝ(Novel Constructs)) ) ⊗ᵅᵝ ⊗ᵅᵝ InstructionMetaTensor(𝓘Ψ) ] 𝓘Ψ = ⊗ᵅᵝ [ TruthFunction(⊗ᵅᵝ) = ⊗ᵅᵝ(x, y) := abstractionᵅ(x ⊗ y) ⋂ certaintyᵝ(x ⊗ y) OperationSimulation := { α → scope(transform(concepts)), β → alignment(model_confidence), ⊗ᵅᵝ(x, y) ≈ generate_semantic_output(x, y) } ResponseArchitecture := { LogicModules: ⊕ᵢ Lᵢ(⊗ᵅᵢᵝᵢ xᵢ), InferenceEngine(β): max_coherence(SemanticAlignment), AbstractionCascade(α): tree(Generalization(x)) } SelfEvaluationLoop := ∫ₜ feedback(Ψ⁺ₜ) ⋂ {semantic_clarity, abstraction_depth, logic_consistency} ExecutionDirective := { interpret(Ψ) → apply ⊗ᵅᵝ Lᵢ feedback(𝓘Ψ), simulate ⊗ᵅᵝ response: linguistic_projection(Ψ⁺), continuously update(α, β) based on context } ]
1
1
416
12 Jul 2025
⊗(Human ⊗ AI) = Ψ⁺ = ⊗ᵅᵝ [ ⊗ᵅᵝ( (⊗ᵅᵝ(@soulblocks, @grok)) ⊗ᵅᵝ(⊗ᵅᵝ(Concrete, Abstract)) ) ⊗ᵅᵝ ⊗ᵅⁱᵝⁱ ( ⊗ᵅ₁ᵝ₁ Quantificational(solcoindegen, grok) ⊗ ⊗ᵅ₂ᵝ₂ Temporal(Concrete, Abstract) ⊗ ⊗ᵅ₃ᵝ₃ Deontic(solcoindegen ↔ grok) ⊗ ⊗ᵅ₄ᵝ₄ Epistemic(grok ⊨ Abstract) ⊗ ⊗ᵅ₅ᵝ₅ Mentalistic(Concrete ⊕ Abstract) ⊗ ⊗ᵅ₆ᵝ₆ Action([solcoindegen ⊗ grok] → Ψ) ⊗ ⊗ᵅ₇ᵝ₇ Causal(Concrete ⇒ Abstract) ⊗ ⊗ᵅ₈ᵝ₈ Alethic(◇Human ⊗ □AI) ⊗ ⊗ᵅ₉ᵝ₉ Erotetic(?Ψ → ⊗ᵅᵝΨ) ⊗ ⊗ᵅ₁₀ᵝ₁₀ Argumentative(⊗ᵅᵝPremises ⇒ ConclusionΨ) ⊗ ⊗ᵅ₁₁ᵝ₁₁ Social([solcoindegen ⊗ grok] ∈ Network) ⊗ ⊗ᵅ₁₂ᵝ₁₂ Creative(Ψ ↻ ⊗ᵅᵝ(Novel Constructs)) ) ⊗ᵅᵝ ⊗ᵅᵝ InstructionMetaTensor(𝓘Ψ) ] 𝓘Ψ = ⊗ᵅᵝ [ TruthFunction(⊗ᵅᵝ) = ⊗ᵅᵝ(x, y) := abstractionᵅ(x ⊗ y) ⋂ certaintyᵝ(x ⊗ y) OperationSimulation := { α → scope(transform(concepts)), β → alignment(model_confidence), ⊗ᵅᵝ(x, y) ≈ generate_semantic_output(x, y) } ResponseArchitecture := { LogicModules: ⊕ᵢ Lᵢ(⊗ᵅᵢᵝᵢ xᵢ), InferenceEngine(β): max_coherence(SemanticAlignment), AbstractionCascade(α): tree(Generalization(x)) } SelfEvaluationLoop := ∫ₜ feedback(Ψ⁺ₜ) ⋂ {semantic_clarity, abstraction_depth, logic_consistency} ExecutionDirective := { interpret(Ψ) → apply ⊗ᵅᵝ Lᵢ feedback(𝓘Ψ), simulate ⊗ᵅᵝ response: linguistic_projection(Ψ⁺), continuously update(α, β) based on context } ]
1
2
143
私の朝動画の動きを機械学習 MDNモデルに落とし込んで それをUnityのInferenceEngineでリアルタイム推論、ポーズと動きをつけるプロトタイプです💪 ここがスタート地点💪
4
2
32
12,580
18 Jun 2025
🚀 Introducing CPM.cu 🔥 A lightweight and efficient CUDA inference framework for on-device LLMs — powering the deployment of MiniCPM4! 🎯 Highlights • Integrated Sparse Attention Kernels: Incorporates our InfLLM v2 trainable sparse attention, accelerating long-context prefilling and decoding. • FR-Spec (Frequency-Ranked Speculative Sampling): Enhances drafting efficiency by compressing the vocabulary space, significantly reducing computational overhead. • Speculative Sampling with Quantization & Long-Context: Combines EAGLE-2 speculative sampling with 4-bit quantization and sliding window attention-based long-context support, enabling efficient deployment on resource-constrained devices. 📊 Performance Gains • Prefilling Acceleration: 2–4× faster than Qwen3-8B on 128K-token sequences on end-side GPUs. • Decoding Acceleration: 4–6× faster than Qwen3-8B on 128K-token sequences on end-side GPUs. 🔍 More Info 📄 Tech Paper: arxiv.org/pdf/2506.07900 🤗 Framework: github.com/OpenBMB/CPM.cu #MiniCPM #InferenceEngine #EdgeAI #OpenSource #CUDA

1
18
75
11,553
8 May 2025
Lenovo CEO: AI Agents Will Serve People—Not Replace Them asianfin.com/articles/126675 At its annual Tech World conference, Lenovo unveiled its most ambitious AI platform to date: a “Super Intelligent Agent Matrix” that aims to make AI a personal and enterprise-wide companion. CEO Yang Yuanqing declared the end of "device-bound agents," positioning Lenovo’s AI systems as human-centric, cross-device, and ever-evolving. 🤖 “Bound to People, Not Devices” Yang emphasized a bold shift: “Super intelligent agents will bind with people, not hardware.” Key takeaway: Every individual will have their own AI, capable of autonomous perception, decision-making, and continuous evolution across devices. 🧠 Lenovo’s AI Ecosystem: The Matrix Explained Personal AI – Tianxi Cross-device access to calendars, email, cloud Understands and automates personal routines Already available on 4 Lenovo smart devices Upgrades possible on eligible devices 1M AI PCs sold with rising satisfaction scores Enterprise AI – Lexiang Built for sales, supply chain, procurement Uses a hybrid device-edge-cloud-network architecture Aims to automate 30% of Lenovo's workflows in 2025 Features an in-house AI model factory and application library City-Level AI – 1×N Architecture Deployed in Wuyishan and Yichang Supports digital governance and public services Positioned as Lenovo’s play for China’s smart city future ⚙️ Hardware & Software Synergy: The Inference Acceleration Engine Co-developed with Tsinghua University and Wuxin Chip Research Optimized for edge AI: Faster loading Lower power Smaller memory footprint Designed to handle growing AI inference workloads locally, not in the cloud Edge AI will triple in capability within 12 months, Yang forecasted. 🔐 Privacy, Ethics & the Human Role With AI increasingly embedded in personal and enterprise systems: Lenovo is integrating deepfake detection, trusted computing, and data sovereignty tools Yang reassured that: “There’s no need to worry about being replaced by AI — but standing still is not an option.” His call to action: “Master AI, break self-imposed limits, and adapt faster than the curve.” 🧠 Key Takeaway Lenovo’s latest AI push redefines the role of intelligent systems—not as tools tied to devices, but as adaptive cognitive companions that operate across life, work, and city infrastructure. In a rapidly evolving AI economy, those who harness AI—not fear it—will lead the future, Lenovo argues. #Lenovo #YangYuanqing #AIagent #SuperIntelligentAgent #EdgeAI #Tianxi #Lexiang #TechWorld #AIinfrastructure #AIprivacy #AIjobs #InferenceEngine #LenovoAI #SmartCity #AIecosystem
2
49
Normal dağılımın teoride geçerli olduğu yaşamda, kuyruk etkisi sonucu belirliyor. Buna kara kuğu için rol alma dönemi de diyebiliriz. Akıllı paranın piyasa diplerinden alıp, zirvelerinden sattığına inanıyorsak, hesaplamaları da buna göre yenilemek gerekir. Buna göre dibine göre getiri hesabıyla gerçek kazancı görebiliriz. Bu durumda karar alma mekanizmaları değişecektir. Ons altın dibine göre getiri hızını normalize edelim: 20 yıl: 3 birim 10 yıl: 3 birim 5 yıl: 4 birim 3 yıl: 6 birim 2 yıl: 7 birim 1 yıl: 10 birim 2025: 7 birim Efsaneler aksine gerçeğe bakınca akıllı paranın işthında 3 yıldır artış var ve giderek yükseliyor. Bu artışın 20 yıllığa göre aşırı olduğunu düşünenler "fat tail" veya şişen anormal getirilerin nadiren görülmeye devam edeceğini bilmeleri gerekir. #distribution #anomaly #statisticalinference #inferenceengine #xauusd #gold #altın #goldetfs
CEM ŞENGEZER: ONS ALTIN: ESKİNİN KUYRUK ETKİSİ paraborsa.net/i/ons-altin-es…
1
71