Romman Sabbirrommansabbir.com·Jan 7, 2025Kotlin : Sequences for Efficient Data ProcessingIn Kotlin, efficiently handling collections is important when working with large datasets or performing multiple transformations. One tool Kotlin offers for these situations is sequences, which allow lazy evaluation to reduce memory usage and improve...33 readsKotlinAndroid
nidhinkumarblog.nidhin.dev·Jan 1, 2025AgiBot Robotic Learning DatasetChinese robotics firm AgiBot, also known as Zhiyuan Robotics, has announced the release of the largest robotic learning dataset to date, designed to advance artificial intelligence foundation models for human-like activities. This significant develop...robotics
KiwiChipcodecompass.hashnode.dev·Dec 25, 2024LLMs 파인튜닝 프로젝트: 제품 설명으로 가격 예측하기 (3)데이터를 다운 받고, 카테고리가 어떻게 구성되어 있는지 살펴보면 다음과 같다. 🚗 데이터 현황 – 자동차 카테고리가 너무 많다! 자동차 카테고리가 90만 개 이상의 데이터 포인트로 가장 많다. 총 데이터 포인트: 280만 개 이상으로 매우 방대하다. 이러한 데이터 포인트수는 훈련에 필요한 데이터 수를 훨씬 초과한다. ➡️ 목표: 더 가치 있고 의미 있는 데이터 포인트를 선택하여 작고 강력한 데이터셋을 만들기. 📏 토큰과 가격 ...Learning LLMsdataset
KiwiChipcodecompass.hashnode.dev·Dec 25, 2024LLMs 파인튜닝 프로젝트: 제품 설명으로 가격 예측하기 (2)🔑 비즈니스 문제 해결을 위한 5단계 전략 비즈니스 문제를 해결하기 위한 5단계 전략은 다음과 같다. 이해 (Understanding) 비즈니스 요구 사항을 깊게 파고들어서 어떤 문제를 해결하려고 하는지 정확히 인지하기 목표가 정확히 무엇인지 문서화 준비 (Preparation) Baseline 모델과 데이터 집합 큐레이팅 과정 등 모델 선택 (Select Models) 모델의 장단점을 파악해서 모델을 선택 커스터마이징 ...Learning LLMsllm
Nischal Baidarnischalbaidar.hashnode.dev·Oct 17, 2024🤖 Machine Learning Terms You Must Know1. 🧠 Algorithm Definition: A set of rules or steps that a machine follows to make decisions or predictions. Real-life example: Like a chef following a recipe to bake a cake 🍰, an ML algorithm follows specific steps to learn from data and make pre...easyterms
Dharshini Sankar Rajdharshinisankarraj.hashnode.dev·Oct 8, 2024Sampling on DatasetsOnce you’ve created a dataset, you want to explore the values inside. Exploring very large datasets can be difficult, as even simple operations can be expensive, both in terms of computational resources and time. The same sampling principle applies t...1 likeData Science
Prasun Dandapatprasunspace.hashnode.dev·Sep 29, 2024Comprehensive Guide to Data Preprocessing in Python for Machine LearningData preprocessing is a crucial step in the machine learning pipeline to ensure the data is clean, organized, and in a format suitable for training models. Here’s an overview of key topics typically included in data preprocessing: Topics in Data Prep...Machine Learning
Aakashi Jaiswalaakashi.hashnode.dev·Sep 26, 2024Empathy Through AIIn this blog, I am going to share some updates of my bot, that I am building. This bot is an Empathetic Mental support bot, which will become your AI companion you share anything with it and can express how you're feeling and many more!! Today, I wil...2 likesArtificial Intelligence
ByteScrum TechnologiesforByteScrum Technologiesblog.bytescrum.com·Sep 16, 2024How to Use Scikit-learn for Classification Tasks: A Comprehensive GuideClassification is one of the most common tasks in machine learning, where the objective is to categorize data points into predefined labels or classes. Whether you're building a spam filter, diagnosing diseases, or identifying objects in an image, cl...11 likes·72 readsMachine LearningPython
Sam Schneiderblog.samschneider.me·Aug 12, 2024How to Create Your First Hugging Face DatasetModern AI tooling is mainly based on building models trained by lots of data rather than developing clever algorithms. This means that once you move beyond the basics of calling models others have developed and want to start training your own models,...156 readshuggingface