-pi appeared to be described as the "anti policy", and by implication (or my inference) the "worst policy". This seems reasonable, but is it really true that swapping policies for everyone is really the worse policy?
Is your method sensitive to the weights? If the weights are poorly estimated, is your method still robust? For double robustness method, it requires that either of weighting or imputation is doing a good job.