Practical LLM Quantization in Colab: A Hugging Face Walkthrough
2d ago · 12 min read

TLDR: This is a practical, notebook-style quantization guide for Google Colab and Hugging Face. You will quantize real models, run inference, compare memory and latency, and learn when to use 4-bit NF4 versus the safer INT8 path.

📖 What You Will Build in Thi...




