HydraOpt: Navigating the Efficiency-Performance Trade-off of...
Large language models (LLMs) often leverage adapters, such as low-rank-based adapters, to achieve strong performance on downstream tasks. However, storing a separate adapter for each task...
arxiv.org