Looking at the tensorflow code, it seems to me that tf version has only one bilstm encoder ( only for words). Is that correct implementation?