Bert embedding. Bert transformer. bert model architecture. generative pretrained transformer. bert model architecture attention mechanism.