Skip to content

Adventures in Machine Learning

  • Home

Adventures in Machine Learning

  • Home

PPO Proximal Policy Optimization reinforcement learning in TensorFlow 2

  • March 29, 2021

In previous posts (here and here), I have been covering policy gradient-based reinforcement learning methods. In this post, I will continue the series by covering another pseudo-policy gradient based method called Proximal Policy Optimization (PPO).… Read More »PPO Proximal Policy Optimization reinforcement learning in TensorFlow 2

A2C architecture

A2C Advantage Actor Critic in TensorFlow 2

  • January 29, 2021

In a previous post, I gave an introduction to Policy Gradient reinforcement learning. Policy gradient-based reinforcement learning relies on using neural networks to learn an action policy for the control of agents in an environment.… Read More »A2C Advantage Actor Critic in TensorFlow 2

Python TensorFlow Tutorial – Build a Neural Network

  • November 26, 2020

Updated for TensorFlow 2 Google’s TensorFlow has been a hot topic in deep learning recently.  The open source software, designed to allow efficient computation of data flow graphs, is especially suited to deep learning tasks.  It is designed to be executed on single or multiple CPUs and GPUs, making it a good option for complex deep… Read More »Python TensorFlow Tutorial – Build a Neural Network

Bayes Theorem, maximum likelihood estimation and TensorFlow Probability

  • October 17, 2020

A growing trend in deep learning (and machine learning in general) is a probabilistic or Bayesian approach to the problem. Why is this? Simply put – a standard deep learning model produces a prediction, but… Read More »Bayes Theorem, maximum likelihood estimation and TensorFlow Probability

Policy Gradient Reinforcement Learning in TensorFlow 2

  • February 22, 2020

In a series of recent posts, I have been reviewing the various Q based methods of deep reinforcement learning (see here, here, here, here and so on). Deep Q based reinforcement learning operates by training… Read More »Policy Gradient Reinforcement Learning in TensorFlow 2

Prioritised Experience Replay in Deep Q Learning

  • February 3, 2020

In previous posts (here, here and here and others), I have introduced various Deep Q learning methodologies. If you have been across these posts, you will have observed that a memory buffer is used to… Read More »Prioritised Experience Replay in Deep Q Learning

SumTree - left traverse

SumTree introduction in Python

  • January 17, 2020

Weighted sampling from a list-like collection is an important activity in many applications. Weighted sampling involves selecting samples randomly from a collection, but where the “scales are tipped” towards those entries within the collection with… Read More »SumTree introduction in Python

Atari Space Invaders - Dueling Q training reward

Atari Space Invaders and Dueling Q RL in TensorFlow 2

  • November 15, 2019

In previous posts (here and here) I introduced Double Q learning and the Dueling Q architecture. These followed on from posts about deep Q learning, and showed how double Q and dueling Q learning is… Read More »Atari Space Invaders and Dueling Q RL in TensorFlow 2

Dueling Q architecture

Dueling Q networks in TensorFlow 2

  • October 5, 2019

In this post, we’ll be covering Dueling Q networks for reinforcement learning in TensorFlow 2. This reinforcement learning architecture is an improvement on the Double Q architecture, which has been covered here. In this tutorial,… Read More »Dueling Q networks in TensorFlow 2

ResNet layers and abstractions

Introduction to ResNet in TensorFlow 2

  • August 31, 2019

In previous tutorials, I’ve explained convolutional neural networks (CNN) and shown how to code them. The convolutional layer has proven to be a great success in the area of image recognition and processing in machine… Read More »Introduction to ResNet in TensorFlow 2

  • 1
  • 2
  • 3
  • 4
  • Next »

Neve | Powered by WordPress

>