小松啓(EUREKAPU.com)

@EUREKAPU_com

Jan 14

gemini-3-flash has high OCR accuracy on handwritten text, which is nice. Before, I couldn't do this without putting the Vision API in between, but now gemini-3-flash alone is enough.

282

Rikuo

@riku720720

Jan 6

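The macOS Vision API is wild. I've been having Claude Code skills generate nice, stylish image assets, but there was the problem that nanobanana can't produce transparent backgrounds. When I had it try the macOS Vision API, it produced background removal of this quality locally.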

3,083

Immy @ImmyGlow

14 Nov 2025

gSenti CT, I’ve finished building the API section for my VisionAPI. You can now generate an API key and test VisionAI. It’s built using @SentientAGI ROMA and the Sentient Agent Framework. You can also try out the API directly on the website (link in the comment). I’ll be really excited if you build something with the VisionAI API.

Immy @ImmyGlow

3 Nov 2025

Gsenti guys, I’m really excited to share VisionAI, my AI agent that gives you real-time crypto prices, insights, and much more through a simple chat interface. Try it out here: 🌐vision.immylab.tech/ VisionAI also comes with an API that developers can easily integrate into their own projects. You can log in using X (Twitter) or Google, create your own API keys, and test everything directly in the API Playground. Get your API key here: 📘vision.immylab.tech/docs/api I built VisionAI using Sentient ROMA and the Sentient Agent Framework, combining real-time data and AI into one smooth experience. Would love for you to try it out and share your thoughts or screenshots in the comments. Your feedback means a lot.

353

Jonathan

@gravyskyy

10 Nov 2025

“Hey Cursor!” in AR 👀
In the coming years, AI will feel as natural as your phone, but it’ll need new ways to interact with you and the world. I wanted to see what Cursor might look like on the go with AR. Nothing fancy, just RDP, Voice, Accessibility APIs, and a bunch of bash scripts, but it worked (after 3 crashes 😅).
Built in <24hrs:
• Electron tray app that listens for “Hey Cursor!”
• Translates what you say into command context
• Executes commands (open/close, focus, create/remove chats, approve, review, new file, TTS/STT)
• Feeds Cursor keybindings into a nav map for chaining harder tasks (“Hey Cursor, create a new chat so we can plan something” → [new chat] → [focus text] → [start STT])
• Organizes windows to maximize space per screen
Over time, the window-bound IDE (Editor, Agent, Chat, FS, Terminal) will deconstruct into something new. You’re already seeing the first signs in the Agent view.
STT, VisionAPI, Chat completion: @OpenAI
TTS: @elevenlabs
IDE: @cursor_ai
AR: @MetaQuestVR
Accessibility API: @Apple

1,790

Naoto Nakai @NuCode

2 Jul 2025

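Prompt: "Please modify RGBCameraExample.cs so that it periodically sends the camera image as base64 to Anthropic's VisionAPI and displays a description of the objects in view. Could you also generate all of the UI in code as child objects of YUVImage?"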

452

なかむらさとる

@satoluxx

25 Jun 2025

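I just scanned some materials I had received on paper without thinking and fed them to NotebookLM. Then it occurred to me: wait, a scan is just an image, will that work? But it handled it just fine. I was surprised. I felt like saying "sorry, VisionAPI."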

317

snowpool @snowpoollikely

8 Jun 2025

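#1日1時間チャレンジ Since cloud visionAPI gives the same result as gemini, running OCR with yomitoku first and then processing the result with an LLM may be the right approach. Apparently handwritten text has also been supported since April of this year github.com/kotaro-kinoshita/…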

GitHub - kotaro-kinoshita/yomitoku

YomiToku is a Python package that provides an AI-powered Japanese document analysis engine. Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language. - kotaro-kinoshita/yomitoku

github.com

175

Bitgrit @bitgrit_global

23 Feb 2025

With the arrival of GPT-4 Turbo with Vision, AI developers must be excited about the new possibilities! The bitgrit community is full of such awesome AI developers! Their technology and passion will open up the future of AI. #GPT4 #VisionAPI #bitgrit

OpenAI

@OpenAI

21 Feb 2025

Operator is now rolling out to Pro users in Australia, Brazil, Canada, India, Japan, Singapore, South Korea, the UK, and most places ChatGPT is available. Still working on making Operator available in the EU, Switzerland, Norway, Liechtenstein & Iceland—we’ll keep you updated!

1,050

Alican Tarım @alicantarim

15 Feb 2025

📌 Conclusion: AI-powered features bring speed, a better user experience, and innovation to iOS apps. 🚀 With tools like Core ML, SiriKit, Vision, and large language models, you can integrate the technologies of the future into your app. 🔍 Which AI feature do you think is the most useful? Share in the comments! 👇 #iOS #Swift #CoreML #AI #MachineLearning #SiriKit #VisionAPI #ChatGPT #MobileApps #iOSDevelopment

204

brainapp｜より良く生きるを実装する。

@brainapp12

9 Dec 2024

Replying to @xjuntaro

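J-kun, I agree with you super ultra mega much 🙇‍♂️✨ You can add more features, you can structure things, and the way you hold data changes enormously. And it's impressive that you are also paying attention to VLMs. Right now I'm trying thumbnail analysis with VisionAPI and the like, but I'd like to use a VLM too, so please lend me your help and wisdom when the time comes...🙇‍♂️ You are absolutely right about big data. It feels like a tremendously big wave is coming. And I get the feeling a surprising number of people haven't noticed...😳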

brainapp｜より良く生きるを実装する。

@brainapp12

25 Nov 2024

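This thumbnail analysis from brainapp has evolved further with VisionAPI: it can now analyze thumbnail images and ad copy, and use an LLM to analyze the characteristics of copy from popular videos, high-CVR ad banners, and so on! To keep growing it into a product that is useful to users, I'll keep improving it while getting lots of people to use it 😊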

brainapp｜より良く生きるを実装する。

@brainapp12

12 Nov 2024

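In development at brainapp: a thumbnail analysis app for popular YouTube videos. It classifies thumbnail images from videos in the same category and analyzes their features, or analyzes the features of popular videos' thumbnails and has an LLM explain them. I'm starting the implementation with an easy-to-use library. Analyzing images and text together is really good. I'd like to try CyberAgent's VLM too 😳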

267

𝙏𝙪𝙣𝙖 @yeah_tuna

2 Nov 2024

An invoice for 1 yen in Google Cloud VisionAPI usage fees

341

shimayuz ⤴️ AI影分身構築

@Shimayus

15 Oct 2024

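[Entering the Create_xyz competition!] Just barely made it in time!? @jun_ymmd I used @create_xyz to build a financial statement analysis AI. Title: Financial Analyzer. Comment: Financial statement analysis is something many people struggle with. Surely some people's heads explode just from looking at an earnings summary? I built this to address that pain. You just pass it the URL of a company's earnings summary PDF, and the AI performs the analysis. A distinguishing feature is that it reads the PDF. Using the corporate disclosure links on Yahoo! Finance, you just paste in the URL and enter an arbitrary userID (user_xxx), and the AI starts analyzing the company on its own. Unfortunately, ChatGPT's VisionAPI can currently only read png and jpeg images, so I solved this part with the help of Dify. That is why the Create function calls the Dify API. Project URL: financialanalysis.created.ap… #createjapan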

1,871

Erkan @byrzerkan

31 Jul 2024

Replying to @elowendark

Hahaha, may whatever comes your way come to me too 😅😅 By the way, I'm looking for Android and iOS developers for the team 😂 We'll be hiring 2 iOS and 1 Android people. Each of these counts as a plus: having built an SDK, VisionApi experience, Bluetooth, OCR

2,024

中山陽平 - 中小企業専門Webコンサル/700社/20年

@b_gone

13 Jul 2024

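[Repost of a past blog entry (January 16, 2023)] One scary thing about VisionAPI: sellers who do not keep their distance from search behavior that bypasses language will become unable to run improvement cycles roundup-inc.co.jp/post-15658 #web戦略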

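One scary thing about VisionAPI: sellers who do not keep their distance from search behavior that bypasses language will become unable to run improvement cycles - Web marketing support company specializing in small and medium-sized businesses, ラウンドナップWebコンサルティング(Roun...

At 10:00 a.m. US local time on Wednesday, September 28, 2022, Google held an event called "Search on '22". The event mainly featured announcements of various new Google technologies and services for the future. Among them was a discussion of how search behavior will change going forward. So, regarding the point that there may be something you should watch out for in response to this...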

roundup-inc.co.jp

からさん

@karasan_itips

9 Jul 2024

The OCR accuracy in a trial of business card management software was poor, so I followed a certain article and made it work with GCP's VisionAPI and Gemini 😅

筒井.xls@エクセル関数擬人化本著者

@Tsutsui0524

8 Jul 2024

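The OCR software that never missed during the trial dropped in accuracy the moment we adopted it, so a theory is emerging that the trial period was actually done by humans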

581

Vision @Track_Vision_

14 Jun 2024

📢 The Vision API documentation is now online! 📢 Read it here: vision-8.gitbook.io/docs/dif… We're reaching out to bot admins for potential integrations. Interested? Contact us! Stay updated: 🌐 vision-scanner.com💬 t.me/VisionPublic $VSN #VisionAPI #Blockchain #Crypto #DeFi #APIIntegration #DeveloperTools

Vision API (Beta) | docs

vision-8.gitbook.io

154

りーさん iOSエンジニア

りーさん iOSエンジニア @Resan0725Apple

31 May 2024

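It's Friday! Once work is done, I'm going to relax ヽ(•̀ω•́ )ゝ Today's retrospective workshop was lively and fun. And VisionAPI, my current research task, really is super interesting. I'll try out all kinds of APIs in my personal projects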

214

小松啓(EUREKAPU.com)

@EUREKAPU_com

27 May 2024

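Credit card statements (on paper) also work with VisionAPI + ChatGPT + GAS. ChatGPT is excellent. If you group the data returned by VisionAPI by line and pass it along, it gives back a result worth about 95 out of 100, which is really impressive.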
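
The line-grouping step mentioned in the post above could look roughly like this. It is a minimal sketch, not the author's code: the `Word` record and `group_words_into_lines` are hypothetical names, and the flat (text, x, y, height) shape is an assumed simplification of OCR output, not the actual Vision API response format.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Word:
    """Hypothetical, simplified OCR word: text plus top-left position and height."""
    text: str
    x: float
    y: float
    height: float


def group_words_into_lines(words: List[Word], tolerance: float = 0.5) -> List[str]:
    """Group words whose vertical positions are close into text lines.

    A word joins the current line when its y differs from the line's first
    word by at most `tolerance` times the word's own height.
    """
    lines: List[List[Word]] = []
    for word in sorted(words, key=lambda w: w.y):
        if lines and abs(word.y - lines[-1][0].y) <= tolerance * word.height:
            lines[-1].append(word)
        else:
            lines.append([word])
    # Within each line, order words left to right before joining.
    return [
        " ".join(w.text for w in sorted(line, key=lambda w: w.x))
        for line in lines
    ]
```

The resulting strings can then be joined with newlines and passed to the LLM as the statement text.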

890

小松啓(EUREKAPU.com)

@EUREKAPU_com

25 May 2024

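Once you can use VisionAPI, things get a lot easier for people in accounting. Every character, word, and sentence recognized by VisionAPI comes with position information. If you process this well, you can do quite a lot.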

444