Abstract: This paper introduces PFlow-VC, a conditional flow matching voice conversion model that leverages fine-grained discrete pitch tokens and target speaker prompt information for expressive ...
Abstract: This paper proposes a novel framework for frame rate up conversion. The framework contains a motion field estimator employing a hybrid of a novel predictive variable blocksize motion ...