Fatima Jannet · mahia.hashnode.dev · Nov 15, 2024 · ML Chapter 6: Reinforcement Learning · Welcome to Part 6 - Reinforcement Learning! Reinforcement Learning is a powerful branch of Machine Learning used to solve problems by deciding actions based on past data. It's also used in AI to train machines for tasks like walking, using rewards an... · 1 like · Machine Learning (Python) · Reinforcement Learning
Md Shahriyar Al Mustakim Mitul · mitul-shahriyar.hashnode.dev · Nov 5, 2024 · Artificial Intelligence : Reinforcement Learning for Stock Market Trading (Part 41) · Let’s apply RL for stock market trading Install dependencies !pip install tensorflow !pip install pandas-datareader import math import random import numpy as np import pandas as pd import tensorflow as tf import matplotlib.pyplot as plt import pandas... · Reinforcement Learning
Gerard Sans · ai-cosmos.hashnode.dev · Oct 26, 2024 · Why Reinforcement Learning via Chain-of-Thought Misses the Point: Misguided Optimisations-Driven AI Research · While artificial intelligence continues to make headlines with impressive benchmark scores, a troubling practice has taken root in AI research. Imagine a teacher who, instead of helping students understand the subject matter, simply hands them copies... · 86 reads · AI
HowAiWorks · blog.howai.works · Oct 21, 2024 · Machine Learning · Artificial intelligence has been the subject of much hype in recent years. The reason for this is the progress made in the field of machine learning. This article gives an overview of the topic and is suitable for beginners and advanced users. Types ... · 35 reads · Machine Learning
Tom X Nguyen for Dwarves Foundation's Team Blog · dwarvesf.hashnode.dev · Oct 16, 2024 · Proximal Policy Optimization · Introduction Proximal Policy Optimization (PPO) is an algorithm that aims to improve the stability of training by avoiding overly large policy updates. It is a popular and effective method used for training [[Reinforcement Learning | reinforcement le... · AI
Anix Lynch · anixblog.hashnode.dev · Oct 15, 2024 · Advanced Machine Learning Q&A (1/2) · Q: How does the vanishing gradient problem primarily affect training in deep neural networks? Slowing down convergence and making training difficult. Q: What mathematical function is used in the weighted sum of the inputs and the bias in a neuron bef... · Machine Learning
Yashashwi Singhania for COPS IIT BHU · blogs.copsiitbhu.co.in · Oct 2, 2024 · AlphaZero - A Revolution in Game AI · Artificial Intelligence (AI) has long been a crucial element in gaming, enabling machines to challenge and sometimes outwit human players. Traditionally, game AI was built using heuristic-based approaches: static rules hardcoded by developers. While ... · 116 likes · 212 reads · Programming Blogs
Aakashi Jaiswal · aakashi.hashnode.dev · Oct 1, 2024 · Technologies used by OpenAI? · OpenAI uses a combination of multiple advanced technologies to train its AI model, particularly Generative Pre-trained Transformers (GPT). In machine learning technologies, key technologies are Deep learning and Natural language processing. Here are ... · 2 likes · openai
Aman Dhingra for COPS IIT BHU · blogs.copsiitbhu.co.in · Sep 27, 2024 · Exploring Meta Reinforcement Learning · In this blog we’ll discuss what is meta reinforcement learning and how it can be useful, To start with let’s know what is meta learning. Meta Learning → Consider a scenario where a device is built to monitor user’s health details like heart rate, sle... · 75 likes · 198 reads · Programming Blogs
Ajay Kumar Prajapati · pythoninnerwork.hashnode.dev · Sep 25, 2024 · Machine Learning · What is Machine Learning? Machine learning (ML) is a modern software development technique, and a type of artificial intelligence (AI), that enables computers to solve problems by using examples of real-world data. It allows computers to automaticall... · 37 reads · Machine Learning