At last pet got rid of Chinese accent by training a reference model a bit.
Smaller model with more iterations is better than bigger model with less iterations.
Pet is not sure there are no bugs in the dataset loading code.
Anyway, it works, but pet is looking for a real human to make things better. Fuck #ai.