Filter
Exclude
Time range
-
Near
iOS 27 introduces an AI-driven dictation feature, enhancing speech-to-text accuracy. Available on select devices, users must manually enable it in settings. This on-device processing ensures privacy and consistent performance. A significant leap in voice recognition technology. #iOS27 #AIDictation #Apple #SpeechToText #TechNews #iPhone17Pro thedailytechfeed.com/how-to-…
1
1
39
Most AI note-taking apps charge monthly. VoiceScribe doesn’t. 🎙️ Record 📝 Transcribe 📄 Export PDF One-time purchase. No subscriptions. apps.apple.com/us/app/voices… #OneTimePurchase #Productivity #iOSApp #AIProductivity #SpeechToText

7
🚀 Big update for Cognix. @ai_cognix is now powered by @DeepgramAI This integration helps us deliver real-time speech-to-text, multilingual voice interactions, and the foundation for next-generation AI voice agents. #Cognix #Deepgram #VoiceAI #SpeechToText #BuildInPublic
1
2
4
40
PipeSay — live dictation for Linux 🎙️ Speak → see words in real time → auto-copy when you stop. Built for PipeWire Wayland. Not the first STT tool — one I shaped for daily use on Linux. MIT · github.com/metahubaifeel/pip… #OpenSource #Wayland #SpeechToText #PipeWire
10
Stop typing and start talking 🎙️✨ Talk Notes turns meetings, lectures, interviews, and voice memos into clear, searchable notes. Powerful audio enhancement helps you get better transcripts from noisy recordings mindeon.com/talk-notes/ #SpeechToText #VoiceNotes #Productivity #iOS
3
8
30
I'd love to contribute to the future of voice interfaces and am excited about opportunities to build products like this at Wispr. @tankots @SahajGarg6 @WisprFlow #SoftwareEngineering #VoiceAI #SpeechToText #PrivacyEngineering #AIEngineering #WisprFlow
2
42
After 11 incredible years with Progress Software, it's time for a new chapter. During my time at Progress, I had the opportunity to help shape the future of web user experiences. I helped launch Telerik UI for Blazor, led modernization initiatives across our UI component libraries, and worked alongside talented teams focused on building tools that empower developers to create exceptional applications. Over the last two years, my focus shifted toward AI and the evolving relationship between people and software. During that time, I developed a vision for how user interfaces can work alongside AI, exploring natural language interactions, speech technologies, and new ways for people to engage with applications. That work ultimately led me to where I am today. Along the way, I came to an important realization: voice is becoming a new user interface. Speech-to-text, text-to-speech, and real-time conversational AI are creating more natural ways for people to interact with technology. These experiences reduce cognitive overhead, improve accessibility, and open new possibilities for users who have historically been underserved by traditional interfaces. I'm deeply grateful to Progress Software for giving me the freedom to be creative, challenge assumptions, think outside the box, and tackle difficult problems in new ways. The opportunities, friendships, and experiences I've gained over the last decade have shaped both my career and perspective. Today, I'm excited to share that I'm joining Deepgram as a Developer Advocate. In this role, I'll be working with developers, partners, and the broader AI community to help build the next generation of applications powered by voice, AI, and real-time conversation. This move feels like a natural progression of the work I've been passionate about over the last several years. As software evolves from graphical interfaces to conversational experiences, we're entering a new era of human-computer interaction; one that is more natural, more accessible, and more human. I'm excited to help shape what comes next. If you're learning or building agentic apps, connect with me here and share what you're working on. #Deepgram #VoiceAI #ConversationalAI #SpeechToText #TextToSpeech #VoiceAgents
7
2
27
1,331
Record, transcribe & summarize your meetings in seconds with one tap. AI-powered notes from any conversation. Download now: apps.apple.com/jp/app/id6758… #AI #Productivity #iOS #MeetingNotes #SpeechToText
1
123
Want a free AI dictation app that actually works? I tested the best offline speech-to-text tools for Mac, iPhone, and Windows. They’re fast, accurate, support multiple languages, and can even clean up grammar and formatting automatically. No subscriptions. No internet required. 🎙️✨ techpp.com/2026/06/03/how-to… #AI #Productivity #Mac #iPhone #Windows #SpeechToText #Dictation by @rameshreddy71 on @techpp
3
2
4
744
VRCAvatarAntiRipping v0.37.0 をリリースしました (累積アップデート)。 【主な変更点】 解錠キー・OSC パスの DPAPI 暗号化によるセキュリティ強化、 PC/Quest クロスプラットフォーム対応、 MMD ワールド / VRCOSC 等の外部ツールとの互換性改善、 そして一部アバターで頂点が法線方向に飛び出ていた二重デコードの構造的修正が含まれます。 セキュリティ強化 (v0.35.x) ・ 解錠キーと OSC パラメータアドレスを Windows DPAPI で暗号化するように変更 ・古い AntiRippingClient.exe がインストールされたまま publish しようとした場合、更新ダイアログが表示されるように変更。 ・レガシー機能の *_unlock.ps1 出力を廃止 ・ビルドレポート (.md) を詳細化 クロスプラットフォーム対応 (v0.36.0) ・パラメータ難読化を「再現可能にする (PC/Quest 対応)」 オプショントグルを追加。 ON にするとパラメータ名が常に同じ難読名にマップされ、 PC と Quest を別々にビルドしてもクロスプラットフォーム同期が成立します (既定 OFF)。 安定シードは Inspector で自動生成・編集・再生成可能。 外部ツールとの互換性改善 (v0.37.0) ・BlendShape 難読化に「MMD 標準モーフを除外 (MMD ワールド互換)」 サブトグル (既定 ON) を追加。 MMD ワールドのダンス中に標準モーフ (あ / い / う / え / お / ω / にこり / まばたき / ハイライト消し 等) が rename されて表情が動かなくなる問題を解消。 ・Animator パラメータ難読化に「VRCOSC パラメータを除外 (外部 OSC アプリ互換)」 サブトグル (既定 ON) を追加。 VRCOSC の心拍計や SpeechToText など、 VRCOSC/ で始まるパラメータが rename されて受信されなくなる問題を解消。 保護強化とバグ修正 (v0.37.0) ・シェーダーレベル復号 ON lilToon material を使う SkinnedMeshRenderer で頂点が法線負方向にズレる二重デコードバグを構造的に修正。 ・MeshRenderer を build 時に SkinnedMeshRenderer に型変換するオプション「Mesh→Skinned 変換」 を追加 (既定 OFF)。 ON にすると lockable shader (lilToon / Poiyomi 系) を持つ MeshRenderer の Safety Mode 対応 (VRChat Custom Shaders 無効化時も形状復元) が完全化されます。 副作用として SMR 数が増えるため Performance Rank 低下のリスクあり (特に Quest)。 ajisaiflow.booth.pm/items/83…
2
15
89
7,599
The Gemma 4 Good Hackathon is now officially closed. 1,613 submissions from around the world. 🌍 CodeDodona participated with ConversaShield — an AI-powered Quality Assurance platform for call centers focused on: 🎙️ Speech-to-Text in noisy environments 🧠 Open-source conversational AI analysis 🛡️ Privacy-first local AI deployment 📊 Automated coaching & compliance insights But this submission was also something more: An introduction of blockchain-oriented thinking into the broader Gemma ecosystem. Not speculation. Not hype. Real-world AI infrastructure, enterprise workflows, local inference, and sustainable monetization models connected to decentralized ecosystems. From legacy CRM systems to modern AI agents and conversational intelligence platforms — the journey continues. Respect to @GoogleDeepMind and everyone who participated. 🚀 #AI #Gemma #OpenSource #LLM #Solana #ConversationalAI #SpeechToText #MachineLearning #BuildInPublic #CallCenter #Python ConversaShield: AI-Powered Quality Assurance for Ethical Call Centers on #kaggle kaggle.com/competitions/gemm…
1
4
75
May 17, 2026 🚀 Today we finalize the ConversaShield GitHub repository for the global GEMMA 4 Good Hackathon hosted by @GoogleDeepMind. ConversaShield is an AI-powered Quality Assurance platform for call centers focused on: ✅ Speech-to-Text in difficult noisy environments ✅ Speaker separation ✅ Conversational AI evaluation ✅ Privacy-first local AI deployment ✅ Automated coaching & compliance analysis To celebrate the milestone, we also do a Live Coding Session covering: 🎙️ How to achieve effective STT in crowded environments with heavy background noise 🧠 How to reduce AI operational costs using open-source LLMs running locally on GPU infrastructure ⚡ How products like ConversaShield can be monetized through the Solana ecosystem Real-world AI. Real enterprise workflows. Built with open technologies. #AI #LLM #OpenSource #Solana #SpeechToText #MachineLearning #CustomerService #CallCenter #Python #GPU #Hackathon #Gemma #BuildInPublic
1
6
78
🚨 This open-source project could seriously disrupt paid AI dubbing tools 👀 A new project called OmniVoice Studio just dropped — and it runs voice cloning video dubbing completely locally. ❌ No API ❌ No internet ❌ No subscription And the wild part? 🌍 It supports 646 languages and works on Windows, Mac, and Linux. 💡 What can it do? 1️⃣ Clone a voice from just 3 seconds of audio Even across different languages. 2️⃣ Auto-dub videos Just paste a YouTube link and it will: • Transcribe audio • Translate it • Re-dub the video • Export a ready-to-use MP4 3️⃣ Universal speech-to-text Global hotkey works directly inside any app. 4️⃣ Separate vocals & music Automatically isolates background audio and detects speakers. 5️⃣ Batch processing Can process dozens of videos in the background automatically. 💥 The biggest advantage: ✅ Fully open source ✅ Runs entirely offline ✅ No cloud dependency ✅ No API or monthly fees Projects like this are pushing hard toward fully local AI workflows instead of relying only on expensive cloud services. Try it now 👇👇 #AI #ArtificialIntelligence #OpenSource #MachineLearning #VoiceAI #VoiceCloning #VideoDubbing #TTS #SpeechToText #AItools #GitHub #ContentCreator #CreatorEconomy #LocalAI #GenerativeAI #ElevenLabs #YouTubeCreators #Automation #TechNews :::
1
2
62
Today we officially submitted ConversaShield to the global Gemma 4 Good Hackathon hosted by @GoogleDeepMind 🚀 ConversaShield is an AI-powered Quality Assurance platform for call centers built with open-source AI. The system automatically: ✅ Transcribes conversations ✅ Separates speakers ✅ Evaluates communication quality ✅ Detects objections & compliance risks ✅ Generates coaching insights for agents Designed with a privacy-first architecture, ConversaShield can run locally on GPU infrastructure without relying entirely on external cloud AI providers. As a developer coming from decades of enterprise CRM and call center software development, building practical AI systems for real operational environments is a very important step for me. The complete implementation was submitted privately to the Hackathon judges. Non-sensitive parts are now gradually being published to GitHub. Big respect to @GoogleDeepMind and the open AI ecosystem pushing practical AI forward. #AI #LLM #OpenSource #CallCenter #CustomerService #Gemma #Python #MachineLearning #SpeechToText #ConversationalAI #Hackathon
4
68