im not a huawei shooter but im pretty sure its just that the ModelInference thread finishes execution in less than 6 seconds and only gets joined after the sleep is up
During Huawei's new gen AI demo, the presenter accidentally ctrl-c'ed the process, revealing that they had time.sleep(6) in their demo code. In the demo, the code took 6 seconds to run, so the whole demo's faked...