-
Notifications
You must be signed in to change notification settings - Fork 39
Description
I tried the solution provided by lf0_lstm.py and so. When I tried to modify the parameters in tranning, a script in /data/training/compute_dtw.sh made me confused.
` ZASKA="Zaska -P $PRM_NAME $PRM_OPT"
# Compute mfcc $DIR_REF/${FILENAME}.wav $DIR_TST/${FILENAME}.wav => mfcc/$DIR_REF/${FILENAME}.prm mfcc/$DIR_TST/${FILENAME}.prm
${ZASKA} -t RAW -x wav=msw -n . -p mfcc -F ${DIR_REF}/${FILENAME}_sil ${DIR_TST}/${FILENAME}_sil
# Align: mfcc/${DIR_REF}/${FILENAME}.prm, mfcc/${DIR_TST/${FILENAME}.prm => dtw/${DIR_REF}-${DIR_TST}/${FILENAME}.dtw
b=2
dtw -b -${b} -t mfcc/${DIR_TST} -r mfcc/${DIR_REF} -a ${DIR_DTW}/beam${b} -w -B -f -F ${FILENAME}_sil`
Running the script is diffcult, as the command "Zaska" is not exist in any package I found and the "dtw" command doesnt have the parameter of "-b" . How could I sovle it?
By the way, I want to run this script because I wanted to add more parameters on training, I modified "tfglib" and tried to build the /data/train_datatable.h5 again.
It resulting in very few harmonic elements, may need to use high-order of feature extraction and adjust the network to fit more high-order implied feature.(It seems the default training also resulting over-fitting.)
In addition, the result of converted result sounds dull and low, lacks a sense of penetration, may due to the lack of high-order features harmonic elements.