transformer#
Classes
|
Transformer Decoder that supports both one-time and distributed decoding strategies. |
|
|
|
|
|
Transformer model |
|
Basic Transformer block with attention and feed-forward layers. |
|
Initializes the configuration for the Transformer model with the specified parameters. |