How to Measure the Performance of an LLM?
Measuring the performance of a Large Language Model (LLM) involves evaluating various aspects of its functionality, ranging from linguistic capabilities […]
Controlling the output of a Large Language Model (LLM) is essential for ensuring that the generated content meets specific requirements […]
How To Control The Output Of An LLM?
Traditional tokenization techniques face limitations with fixed vocabularies, particularly with respect to unknown words, out-of-vocabulary (OOV) tokens, and the sparsity of […]
Byte Pair Encoding (BPE) Explained: How It Fuels Powerful LLMs
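As a rough illustration of the technique this teaser introduces, here is a minimal sketch of the BPE merge-learning loop: repeatedly count adjacent symbol pairs and merge the most frequent one. The function name `learn_bpe` and the toy word frequencies are my own, not taken from the linked article.

```python
from collections import Counter

def learn_bpe(words, num_merges):
    """Learn BPE merge rules. `words` maps tuples of symbols to corpus frequency."""
    vocab = dict(words)
    merges = []
    for _ in range(num_merges):
        # Count every adjacent symbol pair, weighted by word frequency.
        pairs = Counter()
        for symbols, freq in vocab.items():
            for pair in zip(symbols, symbols[1:]):
                pairs[pair] += freq
        if not pairs:
            break
        best = max(pairs, key=pairs.get)
        merges.append(best)
        # Rewrite every word, fusing each occurrence of the best pair.
        new_vocab = {}
        for symbols, freq in vocab.items():
            merged, i = [], 0
            while i < len(symbols):
                if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == best:
                    merged.append(symbols[i] + symbols[i + 1])
                    i += 2
                else:
                    merged.append(symbols[i])
                    i += 1
            new_vocab[tuple(merged)] = freq
        vocab = new_vocab
    return merges, vocab
```

Because merges are learned from frequency, common words collapse into single tokens while rare words remain decomposable into subwords, which is what lets BPE sidestep a fixed closed vocabulary.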
LLMs handle out-of-vocabulary (OOV) words or tokens by leveraging their tokenization process, which ensures that even unfamiliar or rare inputs […]
How do LLMs Handle Out-of-vocabulary (OOV) Words?
Evaluating the effectiveness of a prompt is crucial to harnessing the full potential of Large Language Models (LLMs). An effective […]
Quantifying Prompt Quality: Evaluating The Effectiveness Of A Prompt
This article summarizes the content of the source, “The Efficiency Spectrum of Large Language Models: An Algorithmic Survey,” focusing on […]
Pushing the Boundaries of LLM Efficiency: Algorithmic Advancements
Vision Transformers (ViT) have emerged as a groundbreaking architecture that has revolutionized how computers perceive and understand visual data. Introduced […]
Dissecting the Vision Transformer (ViT): Architecture and Key Concepts
Consider a bilingual dictionary. To understand a foreign word, you look it up and find its meaning. To express yourself […]
Weight Tying In Transformers: Learning With Shared Weights
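The dictionary analogy above maps to a single matrix used in both directions. As a minimal sketch, assuming one shared matrix `E` serves as both the input embedding lookup and the output projection (the names and sizes below are illustrative, not from the article):

```python
import numpy as np

vocab_size, d_model = 1000, 64
rng = np.random.default_rng(0)
E = rng.normal(size=(vocab_size, d_model)) * 0.02  # one shared weight matrix

def embed(token_ids):
    # Input direction: look a token up and retrieve its vector.
    return E[token_ids]

def output_logits(hidden):
    # Output direction: score every vocabulary entry with the same matrix, transposed.
    return hidden @ E.T
```

Tying the two matrices halves the embedding-related parameter count (`vocab_size * d_model` instead of twice that) and forces the input and output representations of each token to agree.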
The embedding layer in an LLM is a critical component that maps discrete input tokens (words, subwords, or characters) into […]
Architecture of the Embedding Layer During Training of LLMs
Imagine a study group where every student is allowed to look around the room before answering a question. One student […]
Attention Mechanism: The Heart of Transformers
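The "looking around the room" in the analogy is, concretely, scaled dot-product attention: each query scores every key, and the output is a softmax-weighted sum of values. A minimal NumPy sketch (the function name and toy matrices are illustrative, not from the article):

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Each query attends over all keys; outputs are weighted sums of values."""
    d_k = Q.shape[-1]
    # Similarity of each query to each key, scaled to keep logits well-behaved.
    scores = Q @ K.T / np.sqrt(d_k)
    # Numerically stable softmax over the key axis; each row sums to 1.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V, weights
```

A query that matches one key strongly will place most of its weight there, so the output is dominated by that key's value, which is the "listening to the right classmate" behavior the analogy describes.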
Low-Rank Adaptation (LoRA) is a novel and efficient method for fine-tuning large language models (LLMs). By leveraging low-rank matrix decomposition, […]
Understanding LoRA Technology for LLM Fine-tuning
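A minimal NumPy sketch of the low-rank idea, assuming the common formulation where the frozen weight W is augmented by a scaled product (alpha/r)·BA, with B zero-initialized so fine-tuning starts exactly from the pretrained behavior. Dimensions and names are illustrative, not from the article:

```python
import numpy as np

rng = np.random.default_rng(0)
d_out, d_in, r, alpha = 256, 256, 8, 16

W = rng.normal(size=(d_out, d_in)) * 0.02   # frozen pretrained weight
A = rng.normal(size=(r, d_in)) * 0.01       # trainable down-projection (rank r)
B = np.zeros((d_out, r))                    # trainable up-projection, zero-initialized

def lora_forward(x):
    # Frozen path plus scaled low-rank update; with B = 0 the output is unchanged at init.
    return x @ W.T + (alpha / r) * (x @ A.T @ B.T)

full_params = W.size              # 256 * 256 = 65,536
lora_params = A.size + B.size     # 8 * 256 * 2 = 4,096
```

Only A and B are trained, so the number of updated parameters scales with the rank r rather than with the full weight matrix, which is where LoRA's efficiency comes from.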