Pytorch linear. Pytorch vs tensorflow. self attention torch. generative pretrained transformer.