We want to buy a house deciding which one to buy by visiting a fixed sequence of
houses.
The
houses have a value
(
) which are not know in advance.
We obtain the value of a current house at the time we visit it and we are then able to compare
its value with the previously visited ones. We assume that the probabilistic model is that
when visiting a sequence of house we will visit a random permutation of the
houses with
a uniform law on all the random permutations.
When we visit the
-th house we can decide to stop and buy the house or to continue it is
not possible to come back on a non-selected house.
Let
defined as follows :
Our problem is a stoping time problem. We have to decide at which time to stop in
order to maximize the probability that we have chosen the best house.
We want to compute
and obtain the associated optimal strategy (note that
),
where
is given by :
Question 2
We assume here that the values . Use grand to obtain a random
permutation of
.
|
Question 3
Compute along the trajectory the value of and compute the stoping time.
|
Question 4
Use monte Carlo simulation to evaluate the optimal value function.
|