Skip to content
Snippets Groups Projects
Commit 60afd6e4 authored by Riku-Laine's avatar Riku-Laine
Browse files

Modifications to 4.1

parent e9e54517
No related branches found
No related tags found
No related merge requests found
......@@ -202,18 +202,17 @@ Moreover, it is easy to see based on the derivations of Eq.\ref{eqn:gp} that our
Below we present our results in various settings. Models are evaluated in contrast to the following quantities:
\begin{itemize}
\item {\it True evaluation:} Depicts the true performance of the predictive model. Constructed by sorting all the labels in the test data (even the ones hidden from the models) by the predicted probabilities and the simulating the leniency at a given level.
\item {\it True evaluation:} Depicts the true performance of the predictive model. Constructed by sorting all the labels in the test data (even the ones hidden from the models) by the predicted probabilities and then simulating the acceptance rate at the given level.
\item {\it Labeled outcomes:} Similar to {\it true evaluation} but only available labels with positive decisions $(\decision = 1)$ are used.
\item {\it Human evaluation:} Human decision makers with similar leniency levels are binned and treated as a single decision maker.
\item {\it Contraction:} Contraction curve was constructed as explained by Lakkaraju et al \cite{lakkaraju2017selective}.
\item {\it Human evaluation:} Human decision makers with similar leniency levels are grouped and treated as a single decision maker.
\item {\it Contraction:} Contraction curve was constructed as explained by Lakkaraju et al. \cite{lakkaraju2017selective}.
\item {\it Causal model, ep:} Curve presents the predicted probability $\prob{\outcome = 0 | \doop{\leniency = \leniencyValue}}$ at various levels of acceptance rate.
\end{itemize}
\subsection{Without unobservables}
\subsubsection{Data creation}
The causal model for this scenario corresponds to that depicted in Figure \ref{fig:causalmodel}.
For the analysis, we assigned 500 subjects for each of the 100 judges randomly.
For the analysis, we assigned 500 subjects to each of the 100 judges randomly.
Every judge's leniency rate $\leniency$ was sampled uniformly from a half-open interval $[0.1; 0.9)$.
Private features $\features$ were defined as i.i.d standard Gaussian random variables.
Next, probabilities for negative results $\outcome = 0$ were calculated as
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment