Seeking Advice on RLHF and SFT Projects