Tag feed

#transformer

17 posts · 1 follower

Aman Jain · curious-pm.hashnode.dev · Jul 14 · 8 min read

LLM architecture (part 2): Inside a Transformer Block

In the previous article, we looked at tokenization and embeddings, which convert a finite vocabulary into continuous vectors that we can work with mathematically. We also explored attention, which hel

Raja Gupta · rajanotes.hashnode.dev · Jul 3 · 13 min read

Understanding what happens behind the scenes when you chat with AI.

1. What is an LLM? Think of an LLM as the brain behind AI assistants like ChatGPT, Gemini, Claude, and Copilot. Just like our brain learns by reading books, listening to conversations, watching the wo

Ved Pandey · vedpandeydev.hashnode.dev · Jul 1 · 7 min read

How ChatGPT Understands Your Questions?

Introduction: Ever wondered how exactly an LLM understands our question and answers it? Hey there everyone, hope you are doing great in your life and enjoying every bit of it! Today we are

Mohd Sameer · mohd-sameer.hashnode.dev · Jun 30 · 5 min read

How ChatGPT or any LLMs Understands Your Questions?

If you’ve used ChatGPT, Gemini, or Claude, you’ve interacted with some of the most complex software ever built. But beneath the surface, these tools aren't "thinking" the way humans do; they are perfo

Mohamed Hamed · mo-mentor.hashnode.dev · Apr 21 · 16 min read

Part 7 — The Transformer: The Architecture That Accidentally Changed the World

THE ENGINE OF THE FUTURE. Transformer: "Attention Is All You Need", the paper that changed everything. Last article we saw how the four learning types + training loop built ChatGPT. Today we open the box and see the exact architecture that made all of ...

telos · telos-robotics.hashnode.dev · Apr 13 · 6 min read

Octo: Open-Source Generalist Robot Policy

TL;DR: Octo is an open-source generalist robot policy developed by UC Berkeley RAIL Lab. It's a Transformer model pretrained on 800k trajectories from the Open X-Embodiment dataset, conditioned on natural language commands or goal images, and can adap...

telos · telos-robotics.hashnode.dev · Apr 12 · 6 min read

RT-1: Robotics Transformer for Real-World Control at Scale

TL;DR: RT-1 is a 35M-parameter transformer trained on 130,000 real robot demonstrations across 700+ tasks. It takes natural language instructions and camera images as input, and outputs discretized robot actions at 3 Hz in real time. It achieves 97% s...

Sirui He · cliolabs.hashnode.dev · Apr 11 · 22 min read

Manifolds and Genesis

What follows is a record of a conversation between a machine learning researcher and a large language model. It began with an engineering question about how system prompts work. Where it ended, well,

aiagentmemory · aiagentmemory.hashnode.dev · Apr 7 · 8 min read

LLM Memory Calculator Hugging Face: Estimating Transformer Context

An LLM memory calculator Hugging Face tool estimates the token count for Transformer models, crucial for managing context window limits on the Hugging Face platform. It helps predict memory footprint and computational costs, ensuring efficient deploy...

Abstract Algorithms · abstractalgorithms.hashnode.dev · Mar 9 · 18 min read

How Transformer Architecture Works: A Deep Dive

TLDR: The Transformer is the architecture behind every major LLM (GPT, BERT, Claude, Gemini). Its core innovation is Self-Attention — a mechanism that lets the model weigh relationships between all to
