π οΈ Solution in this Paper:
Dynamic Condensing: Extracts crucial info from each text chunk with dynamic prompts, ensuring no information loss.
Parallel Decoding: Integrates info across chunks.
Training Efficiency: Trained with less cost, performs better than baselines. π―