Transformers are models that process sequence data and output sequence data. They are used in many NLP tasks, such as machine translation, text summarization, and question answering. In this post, we will focus on how Transformer works in machine translation.