This project investigates token quality from a noisy-label perspective and proposes a generic token cleaning pipeline for supervised fine-tuning (SFT) tasks. Our method filters out uninformative tokens while preserving those ...