Feed
Pro
Search

Author

Write
Drafts

Bug0 - The AI-native e2e QA regression testing Passmark - The open-source AI framework for regression testing Hackathons Changelog Brand Hashnode gql skill - let your AI agent publish to your Hashnode blog The Foreword by Hashnode - official blog from the Hashnode team @hashnode on X Hashnode on LinkedIn Support - hello+support@hashnode.com Code of Conduct Terms Privacy Sitemap
Sign in

Search Hashnode

Search posts, tags, users, and pages

Discussion on "Local LLM's with .Net" | Hashnode

FeedDiscussion

Kevin Loggenberg

Leading innovative software development to drive business intelligence and transformation

Jul 9, 2024

Local LLM's with .Net

Introduction In this article we will explore performing inference on GGUF models with Llama.cpp using the Llamasharp nuget package. It sounds like it should take longer than it actually does. GGUF models are probably one of the easiest models to work...

thecodesmith.hashnode.dev3 min read

#llamasharp #cpuinference #thebloke #scisharp #ai #csharp #phi-3 #mistralai #inference #gguf #huggingface #microsoft

Responses(1)

Skyblade

Nov 4, 2024

How do I know the value for GpuLayerCount for a particular model? Are there any formulas or guidelines?

Hi there.. Sorry for the late, somehow didn't get any notifications for this... this will be dependent on the graphics card you have.... the more vram and cpu power it has, will allow more GPU offloading from my understanding.... it probably could be an exact science if you take everything into consideration, but quickest for me is a bit of experimentation... start low and increase...

Kevin Loggenberg

Leading innovative software development to drive business intelligence and transformation

Jan 12, 2025