Coursera - Generative AI
Encoder Models - Sentiment analysis
Encoder - Decoder Models
Decoder Only Models - GPT family of models
The paper “Attention is all you need” replaced recurrent neural networks (RNN) and convolutional neural networks (CNN) with transformer models (or attention-based models).
The Transformer architecture consists of an encoder and a decoder, each of which is composed of several layers. Each layer consists of two sub-layers: a multi-head self-attention mechanism and a feed-forward neural network.
[Read More]