Data Science Asked by Wireless Engineer on March 15, 2021
I am working on solving a problem with reinforcement learning which has to find the optimal matrix that maximize the reward. I am not able to see how I can formulate this problem as I have practiced example with simple actions like UP DOWN LEFT RIGHT. My actions are matrix of size (Nt, Nr) where Nt and Nr can be 64. I have tried to convert it to one dimension and use DDQN but when I run Q[action], I get an error because a list can only be indexed with an integer.
Please help me to find at least what I need to explore first. Do I need to use A2C networks?
Get help from others!
Recent Answers
Recent Questions
© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP