Mahbub Bello · mahbubbello.hashnode.dev · Mar 26, 2024 · A Primer on Large Language Models · What are Large Language Models (LLMs)? They are Intelligent Text Generators: At their core, LLMs are advanced computer programs that can generate, understand, and manipulate human language with remarkable fluency and coherence. Think of them as supe... · large language models
Akmmus AI · blog.akmmusai.pro · Mar 22, 2024 · LLAMAFACTORY: Unified Efficient Fine-Tuning of 100+ Language Models · TLDR - LLAMAFACTORY, a unified framework that integrates a suite of cutting-edge efficient LLM fine-tuning methods. It allows users to flexibly customize the fine-tuning of 100+ LLMs without the need for coding through the built-in web UI LLAMABOARD.... · 269 reads · generative ai
Asmaa Hadir · asmaamhadir.hashnode.dev · Mar 18, 2024 · Text Extraction for Information Retrieval using LLMsherpa, Neo4j, and LangChain · The Challenge: LLMs and Large Texts. Imagine being tasked with piecing together a puzzle, but you're only allowed to see a few pieces at a time without ever seeing the whole picture. That's the dilemma facing today's most advanced computer programs in... · 76 reads · natural language processing
Akmmus AI · blog.akmmusai.pro · Mar 12, 2024 · Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context (Short Summary) · TLDR - Gemini 1.5 Pro is a new LLM in Google's Gemini family known for its advanced capabilities. Gemini 1.5 Pro outperforms models like Claude 2.1 and GPT-4 Turbo by handling up to 10 million tokens of information context (vs. 200k and 128k respecti... · generative ai
Akmmus AI · blog.akmmusai.pro · Mar 9, 2024 · Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference · TLDR - Large Language Models (LLMs) offer new capabilities but evaluating their alignment with human preferences is difficult. Chatbot Arena is a new open platform introduced to specifically address this evaluation challenge. For evaluation, this pla... · 26 reads · Deep Learning
Akmmus AI · blog.akmmusai.pro · Mar 9, 2024 · GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection · TLDR - Training Large Language Models (LLMs) presents significant memory challenges because of their large sizes. Approaches like LoRA typically underperform training with full-rank weights in both pre-training and fine-tuning stages since they limit... · 51 reads · natural language processing
Akmmus AI · blog.akmmusai.pro · Mar 9, 2024 · ShortGPT: Layers in Large Language Models are More Redundant Than You Expect (Short Summary) · TLDR - Large language models (LLMs) are getting larger to achieve better performance, but their large sizes create bottlenecks for deployment. Model compression techniques, like pruning, make LLMs smaller by removing some parameters while maintainin... · natural language processing
Akmmus AI · blog.akmmusai.pro · Mar 8, 2024 · Benchmarking the Text-to-SQL Capability of Large Language Models: A Comprehensive Evaluation (Short Summary) · Text-to-SQL, which involves converting natural language questions into SQL queries to interact with databases, is a complex task. Large Language Models (LLMs) have shown great promise in text-to-SQL. There's no systematic way to evaluate LLMs for this task.... · Deep Learning
Akmmus AI · blog.akmmusai.pro · Mar 8, 2024 · Data Augmentation using LLMs: Data Perspectives, Learning Paradigms and Challenges (Short Summary) · TLDR - Data Augmentation involves generating more labelled data to train deep learning models. Large Language Models can generate large amounts of realistic text data. This survey paper discusses the positive impact of LLMs on DA, including various st... · Deep Learning
Akmmus AI · blog.akmmusai.pro · Mar 8, 2024 · LLMGuard: Guarding against Unsafe LLM Behavior (Short Summary) · TLDR - Sometimes, LLMs can generate inappropriate, biased, or factually incorrect responses. This might result in a violation of regulations and can lead to legal issues. LLMGuard is a tool which has the potential to address these LLM risks. LLMGuard... · generative ai