Joined August 2013
7 Photos and videos
Jai retweeted
Well, it was fun while it lasted
1
1
6
133
Jai retweeted
Hermes Agent now has Automation Blueprints, turning cron jobs into clickable, fillable, conversational workflows.
108
246
2,833
387,358
Jun 11
I wonder how accurate Fable "silent failure trigger" is. Is there a possibility that it steers you to an incorrect answer beyond the blacklisted topics?
1
2
48
Jun 11
Trust is hard to get and easily lost. I hope Anthropic learns this lesson
1
7
286
Jai retweeted
Brilliant idea! Next up: Apple randomly reboots your Mac if you're building competing tech, Gmail silently edits your email if you mention rival platforms, and Tesla Autopilot swerves if it detects you're working on self-driving cars. All in the name of safety, of course. Because malicious actors controlling the world’s operating systems, inboxes and cars would be extremely dangerous!
mythos will be bad ON PURPOSE on ai "frontier llm research" tasks, this is very very sad for the research community also the fact that this is un purpose not visible to the user is crazy
101
764
6,770
357,401
Jai retweeted
Claude is unfortunately a supply chain risk for any ML lab now
42
77
1,860
94,870
Jun 10
Fable 5 shows that open source has to win. Otherwise... I don't really want to think about otherwise
1
2
67
Jai retweeted
Replying to @valhalla_dev
We will
10
4
232
4,581
Jai retweeted
We are excited to join Nvidia's Nemotron Coalition of leading AI labs working together to advance open frontier foundation models. To celebrate we have partnered with @nvidia and @nebiustf to provide 2 free weeks of the new Nemotron 3 Ultra model on the Nous Portal!
143
213
2,882
1,494,727
Jai retweeted
The next evolution of Hermes Agent is here! Introducing Hermes Desktop: everything you love about Hermes, now native on your machine. First demoed in Jensen's GTC keynote, it's now in public preview.
1,230
1,460
12,749
5,808,223
Jai retweeted
A new beginning of PC starts with @NVIDIARTXSpark, supercharging what's possible in Hermes Agent.
This is the NVIDIA RTX Spark Superchip. A new beginning for personal computers. Designed for creators, AI developers, and gamers, RTX Spark brings over 30 years of NVIDIA innovation to slim Windows laptops and small, ultra-efficient desktop PCs.
33
54
639
52,172
Jai retweeted
We have been working closely with @nvidia to ensure Hermes Agent works smoothly on their new @NVIDIARTXSpark superchip and integrates with the new OpenShell runtime, which connects Hermes to @Microsoft's security primitives. Watch our feature in the big announcement at Computex:
313
628
6,713
5,971,572
May 28
There should be a token efficiency caveat to benchmarks. I think people would trade off some performance to be more token efficient
1
41
Jai retweeted
Today we release Lighthouse Attention, a selection-based hierarchical attention for long-context pre-training that delivers a 1.4-1.7× wall-clock speedup at 98K context. It runs the same forward backward pass ~17× faster than standard attention at 512K context on a single B200, without a custom sparse attention kernel, a straight-through estimator, or an auxiliary loss. During training, queries, keys, and values are pooled symmetrically into a multi-resolution pyramid. We then score every pyramid heads, and a top-k cascade selects a small hierarchical dense sub-sequence, and after a sorting pass that enforces causality, we use standard attention for token mixing. A brief full attention resume at the end converts the checkpoint back into a competent dense-attention model. Validated this using 530M parameter Llama-3 models across 50B tokens, with up to 1M-token benchmarks across 32 B200s under context parallelism. The work on Lighthouse Attention was led by @bloc97_, @SubhoGhosh02, and @theemozilla.
53
229
2,019
162,360
Jai retweeted
Today we release Token Superposition Training (TST), a modification to the standard LLM pretraining loop that produces a 2-3× wall-clock speedup at matched FLOPs without changing the model architecture, optimizer, tokenizer, or training data. During the first third of training, the model reads and predicts contiguous bags of tokens, averaging their embeddings on the input side and predicting the next bag with a modified cross-entropy on the output side. For the remainder of the run, it trains normally on next-token prediction. The inference-time model is identical to one produced by conventional pretraining. Validated at 270M, 600M, and 3B dense scales, and at 10B-A1B MoE. The work on TST was led by @bloc97_, @gigant_theo, and @theemozilla.
150
415
3,695
448,270
Jai retweeted
Hermes Agent is now #1 on the Global @OpenRouter token rankings. While our journey together has just begun, we'd like to take this opportunity to thank our contributors, supporters, and users for all they have done to get us this far.
437
718
7,200
2,954,785
Jai retweeted
👀 v. soon
36
13
550
40,945
Jai retweeted
Trinity-Large-Thinking, @arcee_ai's latest model, is now free on Nous Portal for the next week Sign up for Nous Portal to use it in your Hermes Agent today
36
40
674
225,581
Jai retweeted
Replying to @Teknium
We do really need a Hermes nice cheatsheet at this point :)
12
37
272
10,037
Jai retweeted
Shopify is the all-in-one commerce platform powering millions of businesses worldwide Thank you to the @Shopify team for building their own official Hermes Agent skill enabling your agent to manage products, orders, inventory, and fulfillments from any channel.
131
200
2,674
447,893