Alex Ellis

Alex Ellis

5,264 Photos and videos

Tweets

Pinned Tweet

Alex Ellis

@alexellisuk

Jun 2

Replying to @alexellisuk

slicervm.com/blog/let-your-a…

Stop driving Slicer by hand - give your agent the wheel

Driving Slicer by hand is like using LLMs for tab completion. Let your agent automate the workflow end to end.

slicervm.com

826

Alex Ellis

Alex Ellis

@alexellisuk

Jun 15

I'm surprised it's taken us since 2016 to add --json faas- cli for @openfaas.. @slicervm was built for agents and had it from the beginning. What changes are you making to your products for agents LLMs?

1,249

Alex Ellis

Alex Ellis

@alexellisuk

Jun 15

@DmytroKrasun I'm sure you have something to say here.. MCP, skills, new API layers?

389

Alex Ellis

Alex Ellis

@alexellisuk

Jun 12

Squeezing in two Qwopus finetunes of @Alibaba_Qwen 3.6 27B into a single card and fronting it with a vibe coded metering proxy made by Claude Fable Very capable at following skills (if a little weak on concurrency patterns in @golang). Thanks Jackrong and @KyleHessling1 !

936

Alex Ellis

Alex Ellis

@alexellisuk

Jun 12

Well worth experimenting with. Kyle set up early access for me with @inletsdev and it performed easily as well as Qwopus in my Go evals

Kyle Hessling

@KyleHessling1

Jun 11

Qwopus 3.6 27b-Coder is now live! Scores a 67% on a full run of SWE bench verified with thinking completely disabled! Q5_K_M This model is lightning fast for dense class! With a natively finetuned MTP head, it achieves 100 tps on a single 5090! The biggest upgrade here, though, is its stability in programming and tool calling within @NousResearch Hermes agent, with thinking off! Wall time is crazy fast this way, which makes Hermes feel "native" and snappy, like they were meant for each other. The freedom of running without thinking at all makes you part of the thinking process, and you never get caught waiting 15 minutes for it to finish a thought string, like with the base models. Thinking on and temp high, .9-1 seems to produce really incredible design and svg results. I reran the Boat survival prompt through a few turns, thinking on, and it seemed to render more fancy models in HTML canvas, but it was much more of a start-a-prompt and wait experience vs the snappy and active iteration with it disabled. It may be worth turning it off and on throughout the build process if you want to get really creative with design. Really looking forward to seeing how this one performs for y'all! Please post comments with your opinions and use cases below! As always with our fine-tunes, mess with the temperature setting, and run them much hotter than the base! Please check out the Boat Survival game I posted yesterday, made in 12 turns using Hermes and this model, with thinking off. Link below! Full swe bench repo-specific breakdown also posted in the comments for those interested! Happy building, everyone! We're looking forward to your thoughts! Quants uploading now! huggingface.co/Jackrong/Qwop…

939

Alex Ellis

Alex Ellis

@alexellisuk

Jun 5

My new favourite theme for Superterm (.dev) Theme: Catppuccin Mocha Font: Berkeley Mono Free for personal use, commercial use is coffee money.

1,320

Alex Ellis

Alex Ellis

@alexellisuk

Jun 4

Time to normalise dual agent use: SOTA vs. best available local for VRAM available. Many believe a time is coming when coding plans will fade away and smaller teams won't be able to afford real token costs. Get familiar with strengths/weaknesses and clear gaps for your workflows and products. Prompting is a skill - and so are local models. This is what we've found helps: * More extensive planning and breaking down tasks into smaller chunks * Additional guidance on architecture and implementation * Explicit notes on testing, unit and e2e * Fine-tunes like Qwopus and the uncoming Qwopus-Coder variant * Agent Skills - will turn a 27B into an expert user of your product * AGENTS.md - place in each repo with specific notes and where to find conventions Pictured - both Opus and Qwen 3.6 27B find the same issue, and offer a similar solution. Opus offers broader insights, but we can still use our own wetware to get more out of Qwen 🧠

588

Alex Ellis

Alex Ellis

@alexellisuk

Jun 3

In the spirit of "Let your agent do the work" - I'm having Qwen 3.6 27B (Qwopus variant) set up Slicer on an Intel NUC to test out the installer/experience with Ubuntu 26.04 Only snag.. I installed Ubuntu 24.04.. let's see if the agent flags it?

1,426

Alex Ellis

Alex Ellis

@alexellisuk

Jun 3

Worked just as expected, the local model smashed it.

637

Alex Ellis

Alex Ellis

@alexellisuk

Jun 3

And it flagged an interesting issue "slicer new" has a "--socket" short-cut to create its API listener on a UNIX socket. It's really sugar for --url ./some/file But "slicer vm list/cp/exec" etc only had "--url", so now the local model is authoring a full PR for it

539

Alex Ellis

Alex Ellis

@alexellisuk

Jun 2

Local sandboxes are fast and cost very little. But there's no need to set them up by hand Tell your agent to test its work fully, end-to-end, using a skill to automate everything with @slicervm Blog post with 6 Dev/DevOps examples below 👇

1,414

Alex Ellis

Alex Ellis

@alexellisuk

Jun 2

slicervm.com/blog/let-your-a…

Stop driving Slicer by hand - give your agent the wheel

Driving Slicer by hand is like using LLMs for tab completion. Let your agent automate the workflow end to end.

slicervm.com

826

Georgi Gerganov

Alex Ellis retweeted

Georgi Gerganov

@ggerganov

May 29

llama.cpp now has an official website: llama.app Our goal is to make local AI accessible to everyone, and improving the user experience is a big part of that. On the new landing page you’ll find a single-line cross-platform installer. The installation provides a single unified `llama` entrypoint which you can use to run/serve models and interface with 3rd-party agentic applications. While oriented towards simplified user experience, the new `llama` application also provides all the advanced functionality of the existing llama.cpp tooling with which experienced users are already familiar. Also note that all GGUF models that you might have already downloaded with llama.cpp in the past will be automatically available to use without downloading again (they are stored in the common HF cache on your machine). We have many improvements in the pipeline both at the UX and at the engine level and we plan to iteratively ship new things over the coming months. One of the main focuses will be seamless integration with local-friendly 3rd-party agents (such as Pi). In the meantime, we’ll continue to listen for feedback from the community and adjust accordingly, so keep letting us know what you think and need.

483

2,982

164,366

Alex Ellis

Alex Ellis

@alexellisuk

May 27

My first experience of an HTTP_PROXY was when I bypassed one at school, and was suddenly everyone's friend! Here's how made them more usable for @slicervm and AI agents slicervm.com/blog/look-ma-no…

Look ma! No HTTP_PROXY!

Proxies are inevitable when it comes to filtering egress traffic and credential injection, but can we make the configuration go away?

slicervm.com

1,033

Alex Ellis

Alex Ellis

@alexellisuk

May 27

1/2 We've been working on a "Function Developer" skill for @openfaas using the supported templates i.e. "Run a cron every day to check for a new release of Firecracker, send that to Discord" github.com/openfaas/agent-sk… @welteki has a few examples of what he's built..

GitHub - openfaas/agent-skills: Agent Skills for OpenFaaS

Agent Skills for OpenFaaS. Contribute to openfaas/agent-skills development by creating an account on GitHub.

github.com

678

Alex Ellis

Alex Ellis

@alexellisuk

May 27

2/2 enrich-telemetry using GeoLite2 based upon the user's IP decrypt-payload - decrypt AES 128 payload, and process it, reencrypt it for storage in S3 hn-serverless-monitor - monitor @ycombinator for "serverless" comments/posts github.com/welteki/train-you…

GitHub - welteki/train-your-agent-examples: Example OpenFaaS functions built by AI coding agents...

Example OpenFaaS functions built by AI coding agents using the openfaas-function-dev skill - welteki/train-your-agent-examples

github.com

533

Jan Tytgat 🇧🇪🇪🇺

Alex Ellis retweeted

Jan Tytgat 🇧🇪🇪🇺

@jantytgat

May 27

Kudos to @alexellisuk for making superterm.dev compatible with SSO using #authentik through OIDC. Works brilliant now I can simply logon using my own credentials instead of the generated token on startup.

superterm — The terminal built for the agentic era

Run 20 AI agents in tmux sessions. Know which one needs your attention. Check from your phone. A session-aware terminal dashboard that keeps you in control.

superterm.dev

499

Alex Ellis

Alex Ellis

@alexellisuk

May 27

Superterm.dev now supports Single Sign-On: GitHub (Device flow - built-in) GitHub (BYO client_id/client_secret) OIDC (Keycloak, Dex, Okta, etc) "superterm update" to get it then open command pallet and "Login & Authentication" for the explainer.

1,124

Kyle Hessling

Alex Ellis retweeted

Kyle Hessling

@KyleHessling1

May 22

BREAKING! Qwopus 3.6 27B is LIVE! Thank you for your patience on this one, but I believe you'll find the wait was worth it! We've benchmarked this thing up and down, verified that it holds at least a 75.25% (152/202) in the initial 202 SWE bench solves. Not a full run of 500, but it shows the agentic coding quality from the original 27B is retained while adding all of the additional Qwopus benefits across many domains. As always, Jackrong is absolutely cooking here! COT quality has improved significantly through the inversion techniques from our Negentropy proof of concept. It also went through thorough curriculum training. You can check out the MMLU pro benchmarks on the model card, but it improved a whopping 10 points over the base model in physics, as well as meaningful jumps in Chemistry, business, and computer science. However, the best part is that I was able to build an entire survival shooter game using this local model entirely. I genuinely was blown away by the results, which you can play right now on my HF space (link in comments below). "Qwopus Commander" was completed in 9 turns of Qwopus 3.6! To test the new long context training, I made it re-output the entire 3000 line program each turn, and it would make fixes and add features that I requested in large prompts, while perfectly replicating the entire rest of the game from context. What's more is that I did it all at Q8 KV cache quantization, and never had an issue over the entire 303k token run! IMPORTANT: Run it at --temp 0.75 to 1. Mess with it in that range for your use case. Higher temp actually lets the fine-tune shine and be exploratory and is also more stable. Swe Bench was run at temp 1, the game was built mostly at 0.8! We're so blessed to have all of you here and using the models! The support means so much! Please let me know what you build with it in the comments! Or if you have any issues getting it up and running, I will try my best to get back to you! Looking forward to seeing what you legends produce with it this weekend! huggingface.co/Jackrong/Qwop…

Jackrong/Qwopus3.6-27B-v2-GGUF · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

135

1,373

88,380

Alex Ellis

Alex Ellis

@alexellisuk

May 22

Coffee break with codex

1,086

Han Verstraete

Alex Ellis retweeted

Han Verstraete @welteki

May 21

Really nice addition to @slicervm, the proxy now supports GitHub Apps. Spin up a fresh sandbox and the proxy uses a GitHub App to inject credentials on the fly. Any action the app has permissions for can be performed without any secrets touching the VM.

988