![]() |
aGrUM
0.16.0
|
<agrum/FMDP/planning/treeOperatorStrategy.h> More...
#include <treeOperatorStrategy.h>
Public Member Functions | |
Constructor & destructor. | |
TreeOperatorStrategy () | |
Default constructor. More... | |
~TreeOperatorStrategy () | |
Default destructor. More... | |
MultiDimFunctionGraph< GUM_SCALAR, ExactTerminalNodePolicy > * | getFunctionInstance () |
MultiDimFunctionGraph< ArgMaxSet< GUM_SCALAR, Idx >, SetTerminalNodePolicy > * | getArgMaxFunctionInstance () |
MultiDimFunctionGraph< ActionSet, SetTerminalNodePolicy > * | getAggregatorInstance () |
Protected Member Functions | |
INLINE void | _deleteFunctionGraph (const MultiDimFunctionGraph< GUM_SCALAR > *f1, const MultiDimFunctionGraph< GUM_SCALAR > *f2, Idx del) |
INLINE void | _deleteFunctionGraph (const MultiDimFunctionGraph< ArgMaxSet< GUM_SCALAR, Idx >, SetTerminalNodePolicy > *f1, const MultiDimFunctionGraph< ArgMaxSet< GUM_SCALAR, Idx >, SetTerminalNodePolicy > *f2, Idx del) |
Graph Function Operations Methods | |
MultiDimFunctionGraph< GUM_SCALAR > * | regress (const MultiDimFunctionGraph< GUM_SCALAR > *Vold, Idx actionId, const FMDP< GUM_SCALAR > *fmdp, const Set< const DiscreteVariable * > &elVarSeq) |
Computes Qaction for given actionid. More... | |
virtual MultiDimFunctionGraph< GUM_SCALAR > * | maximize (const MultiDimFunctionGraph< GUM_SCALAR > *f1, const MultiDimFunctionGraph< GUM_SCALAR > *f2, Idx del=3) |
virtual MultiDimFunctionGraph< GUM_SCALAR > * | minimize (const MultiDimFunctionGraph< GUM_SCALAR > *f1, const MultiDimFunctionGraph< GUM_SCALAR > *f2, Idx del=3) |
virtual MultiDimFunctionGraph< GUM_SCALAR > * | multiply (const MultiDimFunctionGraph< GUM_SCALAR > *f1, const MultiDimFunctionGraph< GUM_SCALAR > *f2, Idx del=3) |
virtual MultiDimFunctionGraph< ArgMaxSet< GUM_SCALAR, Idx >, SetTerminalNodePolicy > * | argmaximize (const MultiDimFunctionGraph< ArgMaxSet< GUM_SCALAR, Idx >, SetTerminalNodePolicy > *f1, const MultiDimFunctionGraph< ArgMaxSet< GUM_SCALAR, Idx >, SetTerminalNodePolicy > *f2, Idx del=3) |
ArgMaximizes between f2 and f1. More... | |
virtual MultiDimFunctionGraph< GUM_SCALAR > * | add (const MultiDimFunctionGraph< GUM_SCALAR > *f1, const MultiDimFunctionGraph< GUM_SCALAR > *f2, Idx del=1) |
Adds reward to given function( whether a f2 or f1) More... | |
virtual MultiDimFunctionGraph< GUM_SCALAR > * | subtract (const MultiDimFunctionGraph< GUM_SCALAR > *f1, const MultiDimFunctionGraph< GUM_SCALAR > *f2, Idx del=0) |
Subtract current f1 from old f1 to see if threshold is reached or not. More... | |
<agrum/FMDP/planning/treeOperatorStrategy.h>
A class to find optimal policy for a given FMDP.
Perform a TreeOperatorStrategy planning on given in parameter factored markov decision process
Definition at line 50 of file treeOperatorStrategy.h.
INLINE gum::TreeOperatorStrategy< GUM_SCALAR >::TreeOperatorStrategy | ( | ) |
Default constructor.
Definition at line 63 of file treeOperatorStrategy_tpl.h.
INLINE gum::TreeOperatorStrategy< GUM_SCALAR >::~TreeOperatorStrategy | ( | ) |
Default destructor.
Definition at line 71 of file treeOperatorStrategy_tpl.h.
|
inlineprotectedinherited |
Definition at line 154 of file IOperatorStrategy.h.
|
inlineprotectedinherited |
Definition at line 161 of file IOperatorStrategy.h.
|
protectedvirtual |
Adds reward to given function( whether a f2 or f1)
reward | : R(s) or R(s,a) |
function | : either V(s) or Q(s,a) |
Implements gum::IOperatorStrategy< GUM_SCALAR >.
Definition at line 193 of file treeOperatorStrategy_tpl.h.
References gum::TreeOperator< GUM_SCALAR, COMBINEOPERATOR, TerminalNodePolicy >::compute().
Referenced by gum::TreeOperatorStrategy< GUM_SCALAR >::getAggregatorInstance().
|
protectedvirtual |
ArgMaximizes between f2 and f1.
Implements gum::IOperatorStrategy< GUM_SCALAR >.
Definition at line 170 of file treeOperatorStrategy_tpl.h.
References gum::TreeOperator< GUM_SCALAR, COMBINEOPERATOR, TerminalNodePolicy >::compute().
Referenced by gum::TreeOperatorStrategy< GUM_SCALAR >::getAggregatorInstance().
|
inlinevirtual |
Implements gum::IOperatorStrategy< GUM_SCALAR >.
Definition at line 84 of file treeOperatorStrategy.h.
References gum::TreeOperatorStrategy< GUM_SCALAR >::add(), gum::TreeOperatorStrategy< GUM_SCALAR >::argmaximize(), gum::TreeOperatorStrategy< GUM_SCALAR >::maximize(), gum::TreeOperatorStrategy< GUM_SCALAR >::minimize(), gum::TreeOperatorStrategy< GUM_SCALAR >::multiply(), gum::TreeOperatorStrategy< GUM_SCALAR >::regress(), and gum::TreeOperatorStrategy< GUM_SCALAR >::subtract().
|
inlinevirtual |
Implements gum::IOperatorStrategy< GUM_SCALAR >.
Definition at line 79 of file treeOperatorStrategy.h.
|
inlinevirtual |
Implements gum::IOperatorStrategy< GUM_SCALAR >.
Definition at line 75 of file treeOperatorStrategy.h.
References gum::MultiDimFunctionGraph< GUM_SCALAR, TerminalNodePolicy >::getTreeInstance().
|
protectedvirtual |
Implements gum::IOperatorStrategy< GUM_SCALAR >.
Definition at line 124 of file treeOperatorStrategy_tpl.h.
References gum::TreeOperator< GUM_SCALAR, COMBINEOPERATOR, TerminalNodePolicy >::compute().
Referenced by gum::TreeOperatorStrategy< GUM_SCALAR >::getAggregatorInstance().
|
protectedvirtual |
Implements gum::IOperatorStrategy< GUM_SCALAR >.
Definition at line 139 of file treeOperatorStrategy_tpl.h.
References gum::TreeOperator< GUM_SCALAR, COMBINEOPERATOR, TerminalNodePolicy >::compute().
Referenced by gum::TreeOperatorStrategy< GUM_SCALAR >::getAggregatorInstance().
|
protectedvirtual |
Implements gum::IOperatorStrategy< GUM_SCALAR >.
Definition at line 153 of file treeOperatorStrategy_tpl.h.
References gum::TreeOperator< GUM_SCALAR, COMBINEOPERATOR, TerminalNodePolicy >::compute().
Referenced by gum::TreeOperatorStrategy< GUM_SCALAR >::getAggregatorInstance().
|
protectedvirtual |
Computes Qaction for given actionid.
Performs a multiplication/projection on given qAction.
qAction | : the computed Q(s,a) |
pxip | : the transition probabilities for the xip variable |
xip | : the variable we eliminate on the projection |
Implements gum::IOperatorStrategy< GUM_SCALAR >.
Definition at line 93 of file treeOperatorStrategy_tpl.h.
References gum::TreeRegress< GUM_SCALAR, COMBINEOPERATOR, PROJECTOPERATOR, TerminalNodePolicy >::compute(), gum::FMDP< GUM_SCALAR >::mapMainPrime(), RECAST, gum::FMDP< GUM_SCALAR >::transition(), and gum::MultiDimImplementation< GUM_SCALAR >::variablesSequence().
Referenced by gum::TreeOperatorStrategy< GUM_SCALAR >::getAggregatorInstance().
|
protectedvirtual |
Subtract current f1 from old f1 to see if threshold is reached or not.
old | and new VFuntion |
Implements gum::IOperatorStrategy< GUM_SCALAR >.
Definition at line 211 of file treeOperatorStrategy_tpl.h.
References gum::TreeOperator< GUM_SCALAR, COMBINEOPERATOR, TerminalNodePolicy >::compute().
Referenced by gum::TreeOperatorStrategy< GUM_SCALAR >::getAggregatorInstance().