Filter
Exclude
Time range
-
Near
12 May 2025
Join the IntentVC Challenge @ ACM MM 2025! Tackle user-controllable video captioning — generate captions based on user-defined intentions. Top teams will be invited to publish & present! 🗓️ Deadline: June 30 🔗 sites.google.com/view/intent… #VideoCaptioning #ACMMM2025 #Multimedia
1
2
700
✨ Introducing a new #SOTA action recognition large multimodal language model: #LLaVAction! Understanding human behavior requires recognizing actions—a challenging task given the complexity of behavior. Large multimodal language models (#MLLMs) offer a promising path forward, but how well do they perform in action recognition? In our latest work - by @shaokaiyeah, Haozhe Qi, @TrackingPlumes and me | @EPFL_en - we rigorously evaluate and enhance MLLMs for action recognition in a real-world and challenging settings- egocentric views in the kitchen! 🧑‍🍳🔪🧽🤖 👀 We find that developing a multi-question-answer (#MQA) task serves as a valuable intermediate step in training (and evaluating) MLLMs for action understanding. Namely, we introduce EPIC-KITCHENS-100-MQA, a reformulation of the highly challenging EPIC-KITCHENS-100 dataset into a video multiple-choice question-answering task which allows for rigorous benchmarking of MLLMs in this task. Next, we propose methods that substantially improve MLLM performance, and even achieving state-of-the-art results 🏆 (#SOTA) on the EPIC-KITCHENS-100 validation set 🔥✨. Our approach also outperforms GPT-4o by 21 points in accuracy on EPIC-KITCHENS-100-MQA and demonstrates improvements across other action-related video benchmarks, including #VideoMME, #PerceptionTest, and #MVBench. Our #LLaVAction-7B and -0.5B models can do #MQA and, critically, can do video captioning! 🙏🚀 As MLLMs become central to AI-driven video understanding in such real-world settings, ensuring their robustness in real-world tasks is critical. Excited to push the boundaries of multimodal AI further! 💪 🇨🇭We could not have done this without the amazing support of #SwissAI: the Swiss AI Initiative & the Swiss National Supercomputing Centre (#CSCS). @EPFL_AI_Center #ProjectPage: mmathislab.github.io/llavact… 📝 #arXivPaper: arxiv.org/abs/2503.18712 👩‍💻💻 GitHub code & Google #ColabDemo: github.com/AdaptiveMotorCont… 🤗 Hugging Face models (use with transformers): huggingface.co/MLAdaptiveInt… #AI #MultimodalLearning #ActionRecognition #EPICKITCHENS100 #MLLM #LLaVAction #VideoCaptioning #VLMs
1
17
61
8,484
Check out this insightful blog post by Sotiris Karavarsamis, Research Assistant at @VCL_ITI Read the full blog post 👉voxreality.eu/one-piece-at-a… #AI #DeepLearning #VideoCaptioning #SwinBERT #ComputerVision #TechInnovation #MachineLearning #FutureOfAI
6
81
🚀 Excited to announce our latest paper! We are introducing ‘NarrativeBridge’, a ground breaking benchmark and a novel architecture for video captioning! #videocaptioning #llms #cvssp
🚨 Breakthrough alert! 🚨 Introducing NarrativeBridge - the future of video captioning is here! 🎥🤖📝 📈 Crushing SOTA models on MSVD-CTN & MSRVTT-CTN datasets. 💪 Labeling unlabeled videos! 📄Paper: arxiv.org/abs/2406.06499v1 @ArminMustafa @FaeghehSardari @cvssp_research
1
9
2,875
Don't limit your audience. CaptionMe bridges language barriers and enhances your video's reach. 🌐🤝 #LanguageBarrier #VideoCaptioning
4
22
#FunFact The CTL provides #videoCaptioning support for @durhamcollege faculty? To make your course content more accessible for your students, contact us for assistance with captioning & including these videos in your courses!
2
104
2 Mar 2023
🤩 Check out this amazing paper on video captioning by Antoine Yang et al.! 📹 #Computation #Language #VideoCaptioning #DeepLearning deepai.org/publication/vid2s…

1
1
772
The CTL provides #videoCaptioning support for @durhamcollege faculty! To make your course content more accessible for your students, reach out to the CTL for assistance with captioning & including these videos in your courses! #FacultySupport
2
90
#DidYouKnow The CTL provides #videoCaptioning support for @durhamcollege faculty? To make your course content more accessible for your students, contact us for assistance with captioning & including these videos in your courses!
1
3
I am looking for a video transcriber for my online courses. Payment will be done as per the duration of the lectures. Anyone interested can comment or DM or me. #DataEntry #Online #Hiring #VideoCaptioning #Transcription
2
1
1
#FunFact We provide #videoCaptioning support for @durhamcollege faculty members? To make your course content more accessible for your students, reach out to the CTL for assistance with captioning & including these videos in your courses! #FacultySupport
2
1
Currently open for #transcription or #proofreading projects. Price is $0.75 per minute of audio to transcribe your high quality audio file. #TranscriptionService #Transcriber #Proofreader Also do #VideoCaptioning for $1.99 per minute of video (transcript required)
8
5
Currently open for #transcription or #proofreading projects. Price is $0.75 per minute of audio to transcribe your high quality audio file. #TranscriptionServices #Service #Publishing Also do #VideoCaptioning for $1.99 per minute of video (transcript required)
1
1
4
Currently open for #transcription or #proofreading projects. Price is $0.75 per minute of audio to transcribe your high quality audio file. #CopyTyping #MondayMotivation #Transcribe Also do #VideoCaptioning for $1.99 per minute of video (transcript required)
3
Currently open for #transcription or #proofreading projects. Price is $0.75 per minute of audio to transcribe your high quality audio file. #Dictation #SpokenWord #Formatting #Captions #Caturday Also do #VideoCaptioning for $1.99 per minute of video (transcript required)
3
Currently open for #transcription or #proofreading projects. Current price is $0.75 per minute of audio to transcribe your high quality audio file. #Transcribe #Interviews Also do #VideoCaptioning for $1.99 per minute of video (transcript required)
2
Currently open for #transcription or #proofreading projects. Current price is $0.75 per minute of audio to transcribe your high quality audio file. #Proofreader #Professional Also do #VideoCaptioning for $1.99 per minute of video (transcript required)
1
2
Currently open for #transcription or #proofreading projects. Current price is $0.75 per minute of audio to transcribe your high quality audio file. #Grammar #Accessibility Also do #VideoCaptioning for $1.99 per minute of video (transcript required)
2
Currently open for #transcription or #proofreading projects. Current price is $0.75 per minute of audio to transcribe your high quality audio file. #Transcript #TranscriptionService Also do #VideoCaptioning for $1.99 per minute of video (transcript required)
2
Currently open for #transcription or #editing projects. Current price is $0.75 per minute of audio to transcribe your high quality audio file. Also do #VideoCaptioning for $1.99 per minute of video (transcript required)
1
3