There's a better way to serve your inference stack; you just haven't found it yet.
DynoSim is a workload-driven simulation of the Dynamo serving stack that turns exhaustive deployment search into a simulate-then-verify loop.
Instead of testing every deployment choice on hardware, teams can model the whole stack on one virtual timeline, screen thousands of configurations in high-fidelity simulation, then validate only the best candidates on real hardware.
And because it's a full Rust implementation, it runs extremely fast: 1,500x faster than real time in our testing.
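
To make the simulate-then-verify loop concrete, here is a minimal Python sketch of the idea. It is not DynoSim's API: `Config`, `toy_simulate`, and `simulate_then_verify` are hypothetical names, the configuration fields are invented knobs, and the scoring formula is a toy stand-in for a real simulator run. The point is only the shape of the loop: score every candidate cheaply, keep a short list, and spend hardware time on that short list alone.

```python
from dataclasses import dataclass
from itertools import product
from math import sqrt


@dataclass(frozen=True)
class Config:
    # Hypothetical deployment knobs, not DynoSim's actual schema.
    prefill_workers: int
    decode_workers: int
    tensor_parallel: int


def toy_simulate(cfg: Config) -> float:
    """Toy stand-in for a simulator run: a made-up per-GPU throughput score."""
    gpus = (cfg.prefill_workers + cfg.decode_workers) * cfg.tensor_parallel
    # Assume the slower of the two stages bounds end-to-end throughput.
    bottleneck = min(3 * cfg.prefill_workers, 2 * cfg.decode_workers)
    return bottleneck * sqrt(cfg.tensor_parallel) / gpus


def simulate_then_verify(configs, simulate, measure, top_k):
    """Screen all configs in simulation, then measure only the top_k."""
    shortlist = sorted(configs, key=simulate, reverse=True)[:top_k]
    # Only the shortlisted configs reach the expensive measurement step.
    measured = {cfg: measure(cfg) for cfg in shortlist}
    best = max(measured, key=measured.get)
    return best, measured


# Enumerate a small search space: 4 x 4 x 3 = 48 candidate configurations.
search_space = [
    Config(p, d, tp)
    for p, d, tp in product(range(1, 5), range(1, 5), (1, 2, 4))
]

# In practice `measure` would be a benchmark on real hardware; the toy
# simulator is reused here purely so the sketch runs end to end.
best, measured = simulate_then_verify(
    search_space, simulate=toy_simulate, measure=toy_simulate, top_k=3
)
```

In this sketch the expensive `measure` step runs three times rather than 48, which is the saving the loop is meant to deliver; how well it works in practice depends on how closely the simulated ranking tracks real hardware.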