Tag feed

#qwen

88 posts · 0 followers

Manu Shukla · ecorpit.hashnode.dev · 2d ago · 15 min read

Qwen3.7 Flash vs Gemini 3.5 Flash-Lite: the 2026 cost math for 1M-context vision agents

Summary. Alibaba released Qwen3.7 Flash on 27 July 2026 at $0.03 per million input tokens and $0.13 per millio

Darsh Shah · freecodecamp.org · Jul 24 · 11 min read

How to Use Prompt Engineering and Context Engineering for AI Agents

In this tutorial, I’ll show you how prompt engineering and context engineering can improve an AI agent's performance. We’ll build a simple local agent, start with a baseline input, then improve it wit

Darsh Shah · freecodecamp.org · Jul 22 · 10 min read

How to Trace and Monitor AI Agents with LangSmith

In this tutorial, I'll show you how to trace and monitor a local AI agent with LangSmith. We'll build a small local AI agent and then enable LangSmith tracing for it so that we can inspect model calls

Yuki Furuta · blog.yukifuruta.com · Jul 22 · 24 min read

Running Edge LLMs on a Raspberry Pi

What if a Raspberry Pi could generate text entirely on its own, without sending prompts to a cloud-based AI service? To find out, I installed several compact large language models on a Raspberry Pi 5

Orhun Küpeli · orhunkupeli.hashnode.dev · Jul 21 · 6 min read

One GPU, Two LLMs

What I was aiming for was simple in theory: deploy two open-weight LLMs behind a custom gateway, on Kubernetes, with infrastructure as code. The constraint that made it interesting was a hard cost cap. One spot GPU

Manu Shukla · ecorpit.hashnode.dev · Jul 22 · 16 min read

Qwen3.8-Max: 2.4T parameters, 1.2TB of VRAM, and the number Alibaba has not published

Summary. Alibaba previewed Qwen3.8-Max-Preview on 19 July 2026 at the World AI Conference in Shanghai and describe

Darsh Shah · freecodecamp.org · Jul 20 · 12 min read

How to Serve a Multi-User AI Agent with FastAPI and Streamlit

In this tutorial, I’ll show you how to serve a multi-user local AI agent as a REST API using FastAPI, then add a lightweight Streamlit UI on top. Instead of interacting with the agent through a termin

Darsh Shah · freecodecamp.org · Jul 17 · 12 min read

How to Evaluate AI Agents with an LLM-as-a-Judge Harness in Python

In this tutorial, I'll show you how to evaluate a local AI agent with a simple, repeatable evaluation harness. The harness runs the agent against a set of test cases, checks the results with both rule

Darsh Shah · freecodecamp.org · Jul 14 · 13 min read

How to Build Your First Multi-Agent AI System in Python and LangGraph

In this tutorial, I'll show you how to build a multi-agent AI system in Python with no orchestration framework. We'll also implement this in LangGraph with nodes, edges, and shared state. The point of

Kalpick Sharma · kalpicksharma.hashnode.dev · Jul 14 · 3 min read

Open Source Models Are Catching Proprietary Giants. Here's Why Developers Should Care.

Introduction. If you've been exploring AI recently, you've probably noticed something interesting. More developers are talking about Llama, Mistral, and Qwen alongside GPT, Claude, and Gemma. At first,
