Tag: Generative AI
-
Attention: The Magic Behind Large Language Models (Part 2)
(4-5 min read) Introduction & Recap: In the previous article, we used a real-life analogy to explore transformers and the concept of attention, the driving force behind the effectiveness of large language models (LLMs) like ChatGPT. To recap, the attention mechanism helps models understand the context of input by focusing on specific parts and assigning weight…
-
Attention: The Magic Behind Large Language Models (Part 1)
(4-5 min read) Introduction & why this article: We have all been encountering large language models (LLMs), or generative AI, in recent times. They are the foundation on which the revolutionary ChatGPT, Claude-3, Llama-2/3, and Mistral (all text-to-text), and even the recent Sora (text-to-video), to name a few, are built. With multiple breakthroughs…
