For the past few months, I've been working on indx - a modern local media manager for artists, developers, designers, and multidisciplinary creatives. During the Hermes Agent creative hackathon, we developed, refined, and honed indx's agent integrations in a series of creative experiments.
Hermes can work through indx’s CLI/API/skills/MCP surfaces to organize media, annotate files, run experiments, store embeddings, and turn a library into a lab. The database is an index, not a jail: metadata gets written to files and stays portable, and agents get a workspace they can actually operate.
The demo shows Hermes using indx as an operating surface for several creative/research loops. In the ComfyUI workflow, generated outputs come back into indx with workflow metadata. Ratings, tags, and notes added in indx can be read by the agent (including webhooks for live updates from the GUI), so human review becomes signal for the next batch.
In the embedding and breakbeat experiments, breakbeats and found sounds were sliced and compared using audio embeddings, and a range of audio analysis methods (embedded as images). indx-backed media and metadata feed latent-space visualizations, audio analysis, found-sound slice search, and VCV Rack performances — keeping the groove while replacing timbres. The current test library has nearly 300k indexed files; the hackathon runs included a found-sound corpus of 586 clips chopped into 10,192 searchable slices.
These are early research and creative workflows. The point is the reusable loop: a local, inspectable media workspace where Hermes can help explore, compare, organize, generate, and transform creative libraries, and respond to human feedback and curation, without trapping the work in a proprietary platform.
indx is moving toward an open-source beta soon, with the hackathon work serving as a preview of agent-operable creative media workflows.
Released today:
ComfyUI video matrix generation tools (scripts and Hermes skills) on GitHub
VCV Rack REX Player module
indx Hermes integration preview
(SOUND ON)