which could be dumped during inference. The backbone should return plain embeddings. The neck can process these to make them suitable for the chosen heads. The heads perform the final processing that ...
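The backbone, neck, and heads split described above can be illustrated with a minimal sketch. Every name here (`backbone`, `neck`, `sum_head`, `argmax_head`, `run_model`) is hypothetical, and the toy arithmetic stands in for real layers; it only shows how data flows between the three stages, not how any particular library implements them.

```python
from typing import Callable, Dict, List

Vector = List[float]


def backbone(inputs: Vector) -> Vector:
    """Hypothetical backbone: returns plain embeddings (a toy scaling here)."""
    return [2.0 * x for x in inputs]


def neck(embeddings: Vector) -> Vector:
    """Hypothetical neck: L2-normalizes the embeddings so every head
    receives features on a common scale."""
    norm = sum(x * x for x in embeddings) ** 0.5
    if norm == 0.0:
        return list(embeddings)
    return [x / norm for x in embeddings]


def sum_head(features: Vector) -> float:
    """Hypothetical head: reduces the features to a single score."""
    return sum(features)


def argmax_head(features: Vector) -> int:
    """Hypothetical head: index of the largest feature (needs non-empty input)."""
    return max(range(len(features)), key=features.__getitem__)


def run_model(inputs: Vector, heads: Dict[str, Callable[[Vector], object]]) -> Dict[str, object]:
    """Run backbone -> neck once, then apply each head to the shared features."""
    features = neck(backbone(inputs))
    return {name: head(features) for name, head in heads.items()}
```

Because the neck output is computed once and shared, heads can be added or removed without touching the backbone.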
// - ConstraintLength : The constraint length, K. Supported range is 3 to 9.
// - TracebackLength : Number of states to trace back through the trellis during decoding. Use at least 6x ConstraintLength ...