Understanding Text Generation Parameters in Transformers

This post is divided into seven parts; they are:

• Core Text Generation Parameters
• Experimenting with Temperature
• Top-K and Top-P Sampling
• Controlling Repetition
• Greedy Decoding and Sampling
• Parameters for Specific Applications
• Beam Search and Multiple Sequences Generation

Let’s pick the GPT-2 model as an example.
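As a minimal sketch of how these parameters fit together, the snippet below loads GPT-2 through the Hugging Face transformers library and samples with temperature, top-k, top-p, and a repetition penalty; the prompt and the specific parameter values are illustrative choices, not prescriptions.

```python
# A minimal sketch of sampling-based generation with GPT-2.
# The prompt and parameter values are illustrative, not tuned recommendations.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

inputs = tokenizer("The future of AI is", return_tensors="pt")
outputs = model.generate(
    **inputs,
    max_new_tokens=40,
    do_sample=True,                        # sample instead of greedy decoding
    temperature=0.8,                       # <1 sharpens the distribution, >1 flattens it
    top_k=50,                              # restrict sampling to the 50 most likely tokens
    top_p=0.9,                             # nucleus sampling: keep the top 90% probability mass
    repetition_penalty=1.2,                # discourage tokens that were already generated
    pad_token_id=tokenizer.eos_token_id,   # GPT-2 has no pad token; reuse EOS
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Setting do_sample=False (or lowering the temperature toward zero) moves the model back toward greedy decoding, the deterministic strategy contrasted with sampling later in the post.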

Further Applications with Context Vectors

This post is divided into three parts; they are:

• Building a Semantic Search Engine
• Document Clustering
• Document Classification

If you want to find a specific document within a collection, you might use a simple keyword search.
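To make the contrast with keyword search concrete, here is a hedged sketch of a semantic search engine that ranks documents by embedding similarity rather than exact word overlap. It assumes the sentence-transformers library; the model name (all-MiniLM-L6-v2), the sample documents, and the query are all illustrative.

```python
# A minimal semantic-search sketch: embed documents and a query, then rank
# by cosine similarity. Model and data are illustrative assumptions.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")
docs = [
    "The stock market rallied after the rate cut.",
    "Transformers use self-attention to build context vectors.",
    "A recipe for sourdough bread with a long fermentation.",
]
doc_emb = model.encode(docs, convert_to_tensor=True)

query = "how does attention work in neural networks"
query_emb = model.encode(query, convert_to_tensor=True)

# Rank documents by cosine similarity to the query embedding
scores = util.cos_sim(query_emb, doc_emb)[0]
for score, doc in sorted(zip(scores.tolist(), docs), reverse=True):
    print(f"{score:.3f}  {doc}")
```

Unlike a keyword match, the query here shares almost no words with the relevant document; the embedding similarity is what surfaces it.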

Applications with Context Vectors

This post is divided into two parts; they are:

• Contextual Keyword Extraction
• Contextual Text Summarization

Contextual keyword extraction is a technique for identifying the most important words in a document based on their contextual relevance.
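As one illustration of the idea, the sketch below scores each token by the cosine similarity between its contextual embedding and a mean-pooled embedding of the whole document; the model choice (bert-base-uncased) and this particular scoring rule are assumptions for illustration, not necessarily the post’s exact method.

```python
# A sketch of contextual keyword extraction: rank tokens by how close their
# contextual embedding is to the document embedding (mean of all tokens).
# Model and scoring rule are illustrative assumptions.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

text = "Context vectors let a model weigh each word by its surrounding words."
inputs = tokenizer(text, return_tensors="pt")
with torch.no_grad():
    hidden = model(**inputs).last_hidden_state[0]   # (seq_len, hidden_dim)

doc_vec = hidden.mean(dim=0)                        # document embedding
tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])

# Cosine similarity of each token embedding against the document embedding
scores = torch.nn.functional.cosine_similarity(hidden, doc_vec.unsqueeze(0))
keywords = [(s, t) for s, t in sorted(zip(scores.tolist(), tokens), reverse=True)
            if t not in ("[CLS]", "[SEP]")]         # drop special tokens
for score, tok in keywords[:5]:
    print(f"{score:.3f}  {tok}")
```

Because the embeddings are contextual, the same word can score differently in different sentences, which is exactly what separates this from frequency-based keyword extraction.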

Advanced Q&A Features with DistilBERT

This post is divided into three parts; they are:

• Using DistilBERT Model for Question Answering
• Evaluating the Answer
• Other Techniques for Improving the Q&A Capability

BERT (Bidirectional Encoder Representations from Transformers) was trained to be a general-purpose language model that can understand text.
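A minimal sketch of extractive question answering with a DistilBERT checkpoint fine-tuned on SQuAD follows; the checkpoint name and the example context are illustrative choices.

```python
# A minimal extractive Q&A sketch using a SQuAD-fine-tuned DistilBERT
# checkpoint; the context and question are illustrative.
from transformers import pipeline

qa = pipeline(
    "question-answering",
    model="distilbert-base-cased-distilled-squad",
)
context = (
    "BERT was trained as a general-purpose language model. DistilBERT is a "
    "smaller, faster distillation of BERT that keeps most of its accuracy."
)
result = qa(question="What is DistilBERT?", context=context)
print(result["answer"], f"(score: {result['score']:.3f})")
```

The returned score is the model’s confidence in the extracted span, which is a natural starting point for the answer-evaluation techniques discussed in the post.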

A Gentle Introduction to Attention and Transformer Models

This post is divided into three parts; they are:

• Origination of the Transformer Model
• The Transformer Architecture
• Variations of the Transformer Architecture

The Transformer architecture originated from the 2017 paper “Attention Is All You Need” by Vaswani et al.