Coursera - Generative AI

Encoder models: sentiment analysis
Encoder-decoder models
Decoder-only models: the GPT family of models

The paper “Attention Is All You Need” proposed replacing recurrent neural networks (RNNs) and convolutional neural networks (CNNs) with transformer models (attention-based models). The Transformer architecture consists of an encoder and a decoder, each composed of a stack of layers. Each encoder layer has two sub-layers: a multi-head self-attention mechanism and a feed-forward neural network. Each decoder layer adds a third sub-layer that attends over the encoder's output. [Read More]
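To make the self-attention sub-layer more concrete, here is a minimal sketch of scaled dot-product attention for a single head in plain Python. It is an illustration rather than the paper's full mechanism: it assumes the queries, keys, and values are all the raw token vectors and leaves out the learned projection matrices, multiple heads, and masking. The function names and the toy `tokens` input are made up for this example.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that sum to 1 (shifted by the max for stability)."""
    highest = max(scores)
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(queries, keys, values):
    """Single-head attention over lists of vectors (hypothetical helper, no learned weights)."""
    d_k = len(keys[0])  # key dimension, used to scale the dot products
    outputs = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
            for k in keys
        ]
        weights = softmax(scores)
        # Each output is a weighted average of the value vectors.
        out = [
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ]
        outputs.append(out)
    return outputs


# Toy input (assumption): three tokens embedded in two dimensions.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
attended = scaled_dot_product_attention(tokens, tokens, tokens)
```

Each row of `attended` is a weighted mix of all three token vectors, which is how every position gets to "look at" every other position in a single step instead of passing information along a recurrence.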

Progress in AI - Natural Language Processing Edition

Even if we are not surrounded by self-driving cars (yet), AI is advancing in many domains. It reminds me of the spread of computers and the internet: looking back over the last few decades, the cumulative progress seems like a series of huge leaps, while each new technology seemed incremental at the time. The history of AI includes several paradigm shifts that have led to exponential gains in AI capabilities. [Read More]