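The code that computes the hidden-layer error term is wrong: the first argument to np.multiply is the tuple (d3t, theta2) rather than a product of the two. Change d2t = np.multiply((d3t, theta2), sigmoid_gradient(z2t)) to d2t = np.multiply((theta2.T * d3t.T).T, sigmoid_gradient(z2t))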
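
As a rough check of the corrected line, here is a minimal sketch using plain NumPy arrays. It assumes that theta2 and d3t in the original code are np.matrix objects (so that * means matrix multiplication) and that z2t already includes the bias column; with plain arrays the same product is written with @. The layer sizes, the random values, and the sigmoid helper are made up for illustration only.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def sigmoid_gradient(z):
    # Derivative of the sigmoid: s(z) * (1 - s(z))
    s = sigmoid(z)
    return s * (1.0 - s)

# Hypothetical sizes: 4 hidden units (+1 bias) and 3 output labels.
rng = np.random.default_rng(0)
num_hidden, num_labels = 4, 3
theta2 = rng.normal(size=(num_labels, num_hidden + 1))  # (3, 5)
d3t = rng.normal(size=(1, num_labels))                  # (1, 3) output-layer error
z2t = rng.normal(size=(1, num_hidden + 1))              # (1, 5) bias column included

# Corrected line, with @ standing in for the np.matrix * operator.
d2t = np.multiply((theta2.T @ d3t.T).T, sigmoid_gradient(z2t))

# Equivalent form by the transpose identity (A.T @ b.T).T == b @ A.
d2t_alt = np.multiply(d3t @ theta2, sigmoid_gradient(z2t))

print(d2t.shape)
```

The result has one entry per hidden unit plus the bias entry, which matches the shape of z2t, so the element-wise multiplication is well defined.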