From fdde45eebdd879f25ac917a172639c491c59e980 Mon Sep 17 00:00:00 2001 From: Antti Hyttinen <ajhyttin@gmail.com> Date: Wed, 7 Aug 2019 11:00:48 +0300 Subject: [PATCH] Trying to make sense of the setting. --- paper/sl.tex | 15 ++++++--------- 1 file changed, 6 insertions(+), 9 deletions(-) diff --git a/paper/sl.tex b/paper/sl.tex index 561d710..99e471d 100755 --- a/paper/sl.tex +++ b/paper/sl.tex @@ -357,10 +357,7 @@ In this section we present our results from experiments with synthetic and reali We experimented with synthetic data sets to examine accurateness, unbiasedness and robustness to violations of the assumptions. -We sampled $N=50k$ samples of $X$, $Z$, and $W$ as independent standard Gaussians. We then drew the outcome $Y$ from a Bernoulli distribution with parameter $p = 1 - \invlogit(\beta_xx+\beta_zz+\beta_ww)$ so that $P(Y=0|X, Z, W) = \invlogit(\beta_xx+\beta_zz+\beta_ww)$ where the coefficients for X, Z and W were set to $1$, $1$, $0.2$ respectively. This process follows the suggestion of Lakkaraju et al. \cite{lakkaraju2017selective}. - - -\acomment{How were the leniencies drawn?} \acomment{Explain the how we have several different judges?} +We sampled $N=50k$ samples of $X$, $Z$, and $W$ as independent standard Gaussians. We then drew the outcome $Y$ from a Bernoulli distribution with parameter $p = 1 - \invlogit(\beta_xx+\beta_zz+\beta_ww)$ so that $P(Y=0|X, Z, W) = \invlogit(\beta_xx+\beta_zz+\beta_ww)$ where the coefficients for X, Z and W were set to $1$, $1$, $0.2$ respectively. We sampled $50$ leniency levels $R$ uniformly from $[0,1]$. We assigned the randomly subjects such that a single leniency level was assigned for $1000$ subjects. In the example, this mimics having 50 judges deciding each for $1000$ defendants. The data was divided in half to form a training set and a test set. This process follows the suggestion of Lakkaraju et al. \cite{lakkaraju2017selective}. \acomment{Check before?} %This is one data generation module. % It can be / was modified by changing the outcome producing mechanism. For other experiments we changed the outcome generating mechanism so that the outcome was assigned value 1 if @@ -369,13 +366,13 @@ The \emph{default} decision maker in the data fits a logistic regression model $ $T=1$ to $R$ percent of subjects given by the leniency with highest probability of $Y=1$ in the test set. - We used a number of different decision mechanism. A \emph{limited} works as the default but uses regression model $Y \sim \invlogit(\beta_xx)$ . -A \emph{biased} decision maker works similarly but the logistic regression model is .. where biases decision... -Given leniency $R$, a \emph{random} decision maker decides on $T=1$ probability $R$. + We used a number of different decision mechanism. A \emph{limited} works as the default but uses regression model $Y \sim \invlogit(\beta_xx)$. Hence it is unable to observe $Z$. +A \emph{biased} decision maker works similarly as limited but the logistic regression model is .. where biases decision. +Given leniency $R$, a \emph{random} decision maker decides on $T=1$ probability given by $R$. In contrast, Lakkaraju et al. essentially order the subjects and decide $T=1$ with the percentage given by the leniency $R$. We see this as unrealistic: the decisions -on a subject should not depend on the decision on other subject. In the example this would induce unethical behaviour, a judge would need to jail somebody today in order to release a defendant tomorrow. -We treat the observations as independent and the still the leniency would be a good estimate of the acceptance rate. The acceptance rate converges to the leniency. +on a subject should not depend on the decision on other subject. In the example this would induce unethical behaviour: a single judge would need to jail defendant today in order to release a defendant tomorrow. +We treat the observations as independent and the still the leniency would be a good estimate of the acceptance rate. The acceptance rate converges to the leniency. \acomment{As a reviewer I would perhaps ask to see the results for the Lakkaraju mechanism.} %This is a decider module. We experimented with different combinations of decider and data generating modules to show X / see Y. (to see that our method is robust against non-informative, biased and bad decisions . Due to space constraints we defer these results...) -- GitLab