Carlos Alarcónblog.carlos-alarcon.com·Dec 7, 2024OpenAI presenta reinforcement fine tunning para sus modelosOpenAI lanzó o1 en ChatGPT como sorpresa del dia 1 de su campaña de lanzamientos navideños y pronto lo integrará a su API. Ahora en su segundo anuncio de navidad, la compañía anuncia un avance significativo en su programa de personalización de modelo...36 readsopenai
Pronod Bharatiyadata-intelligence.hashnode.dev·Dec 6, 2024IID vs. Non-IID Data: Choosing the Right Machine Learning and Deep Learning AlgorithmsIntroduction In the domain of machine learning (ML) and deep learning (DL), a primary assumption upheld by numerous algorithms is that the data is Independent and Identically Distributed (IID). This indicates that each data point is sourced independe...IID vs Non-IID Data in Machine Learning and Deep LearningIID assumption
Gabi Dobocanblog.telepat.io·Nov 22, 2024Birdie: Unlocking Advanced In-Context Capabilities with Efficient State Space ModelsBirdie Blog: Demystifying State Space Models for Business Efficiency Arxiv: https://arxiv.org/abs/2411.01030v3 PDF: https://arxiv.org/pdf/2411.01030v3.pdf Authors: Jimmy T. H. Smith, Amarda Shehu, Antonios Anastasopoulos, Sam Blouir Published: 2024...Bidirectional Processing
Fatima Jannetmahia.hashnode.dev·Nov 15, 2024ML Chapter 6: Reinforcement LearningWelcome to Part 6 - Reinforcement Learning! Reinforcement Learning is a powerful branch of Machine Learning used to solve problems by deciding actions based on past data. It's also used in AI to train machines for tasks like walking, using rewards an...1 like·26 readsMachine Learning (Python)Reinforcement Learning
Md Shahriyar Al Mustakim Mitulmitul-shahriyar.hashnode.dev·Nov 5, 2024Artificial Intelligence : Reinforcement Learning for Stock Market Trading (Part 41)Let’s apply RL for stock market trading Install dependencies !pip install tensorflow !pip install pandas-datareader import math import random import numpy as np import pandas as pd import tensorflow as tf import matplotlib.pyplot as plt import pandas...Reinforcement Learning
Gerard Sansai-cosmos.hashnode.dev·Oct 26, 2024Why Reinforcement Learning via Chain-of-Thought Misses the Point: Misguided Optimisations-Driven AI ResearchWhile artificial intelligence continues to make headlines with impressive benchmark scores, a troubling practice has taken root in AI research. Imagine a teacher who, instead of helping students understand the subject matter, simply hands them copies...159 readsOpenAI seriesAI
HowAiWorksblog.howai.works·Oct 21, 2024Machine LearningArtificial intelligence has been the subject of much hype in recent years. The reason for this is the progress made in the field of machine learning. This article gives an overview of the topic and is suitable for beginners and advanced users. Types ...36 readsMachine Learning
Tom X NguyenforDwarves Foundation's Team Blogdwarvesf.hashnode.dev·Oct 16, 2024Proximal Policy OptimizationIntroduction Proximal Policy Optimization (PPO) is an algorithm that aims to improve the stability of training by avoiding overly large policy updates. It is a popular and effective method used for training [[Reinforcement Learning | reinforcement le...AI
Anix Lynchgozeroshot.dev·Oct 15, 2024Advanced Machine Learning Q&A (1/2)Q: How does the vanishing gradient problem primarily affect training in deep neural networks? Slowing down convergence and making training difficult. Q: What mathematical function is used in the weighted sum of the inputs and the bias in a neuron bef...Machine Learning
Yashashwi SinghaniaforCOPS IIT BHUblogs.copsiitbhu.co.in·Oct 2, 2024AlphaZero - A Revolution in Game AIArtificial Intelligence (AI) has long been a crucial element in gaming, enabling machines to challenge and sometimes outwit human players. Traditionally, game AI was built using heuristic-based approaches: static rules hardcoded by developers. While ...116 likes·218 readsProgramming Blogs