Commit 85313849 authored by Riku-Laine

Acks, affiliations, keywords and figures

parent 1c8b35cb
......@@ -58,7 +58,7 @@ Additional noise is added to the outcome of each case via $e_\outcome$, which wa
\begin{figure}[!t]
%\begin{center}
%\includegraphics[width=0.95\linewidth,trim={0 0 0 1.0cm},clip]{./img/_deciderH_independent_deciderM_batch_maxR_0_9coefZ1_0_all}
\includegraphics[width=0.49\linewidth,trim={0cm 0 0cm 0cm},clip]{./img/fig4_left_styled_dpi}\quad\includegraphics[width=0.49\linewidth,trim={0cm 0 0cm 0cm},clip]{./img/fig4_right_styled_dpi}
\includegraphics[width=0.49\linewidth,trim={0cm 0 0cm 0cm},clip]{./img/fig4_left_styled_dpi.pdf}\quad\includegraphics[width=0.49\linewidth,trim={0cm 0 0cm 0cm},clip]{./img/fig4_right_styled_dpi.pdf}
%\end{center}
\caption{Left: Evaluation of \batch decision maker on data with \independent. Error bars show std. of the \failurerate estimate across 10 datasets. In this basic setting, both our \cfbi and \contraction follow the true evaluation curve closely, but \cfbi exhibits lower variation.
Right: Evaluating \batch on data employing \independent and with leniency at most $0.5$. \cfbi offers sensible estimates of the failure rates for all levels of leniency, whereas \contraction only up to leniency $0.5$.}
......@@ -188,7 +188,7 @@ In addition, \cfbi exhibits considerably lower variation than \contraction.
\begin{figure}[!t]
\center
\includegraphics[width=0.65\linewidth,trim={0 0 0 0.25cm},clip]{./img/summary_z1_dpi}
\includegraphics[width=0.65\linewidth,trim={0 0 0 0.25cm},clip]{./img/summary_z1_dpi.pdf}
\caption{Mean absolute error (MAE) of estimate w.r.t. true evaluation.
Error bars show std. of the absolute error over 10 datasets. \cfbi offers robust estimates across all decision makers. The error of \contraction varies within and across different decision makers.}
\label{fig:results_errors}
......@@ -308,7 +308,7 @@ The deployed machine decision maker was defined to release \leniencyValue fracti
%
\begin{figure}[!t]
\center
\includegraphics[width=0.65\linewidth,trim={0 0 0 0.26cm},clip]{img/summary_z5_dpi}
\includegraphics[width=0.65\linewidth,trim={0 0 0 0.26cm},clip]{img/summary_z5_dpi.pdf}
\caption{MAE of estimate w.r.t. true evaluation when the effect of the unobserved $\unobservable$ is high ($b_\unobservable=5$). The decision quality is poorer, but \cfbi can still evaluate the decisions accurately. \contraction shows higher variance and lower accuracy.}
\label{fig:highz}
\end{figure} % RL: Note that only machine decision maker is poorer, not the human.
......@@ -316,7 +316,7 @@ The deployed machine decision maker was defined to release \leniencyValue fracti
%
\begin{figure} [!t]
\centering
\includegraphics[width=0.60\linewidth,trim={0 0 0 0.27cm},clip]{./img/sl_errors_compas_squeezed_dpi}
\includegraphics[width=0.60\linewidth,trim={0 0 0 0.27cm},clip]{./img/sl_errors_compas_squeezed_dpi.pdf}
\caption{Results with COMPAS data. Error bars show std. of the absolute \failurerate estimate errors across all levels of leniency w.r.t. true evaluation. \cfbi gives both more accurate and more precise estimates regardless of the number of judges used.
% Performance of \contraction gets notably worse as the number of judges increases.
}
......
......@@ -59,14 +59,15 @@
%% Add emails for camera ready
\author{Riku Laine$^\dagger$ \and
Antti Hyttinen\orcidID{0000-0002-6649-3229} \and
Michael Mathioudakis\orcidID{0000-0003-0074-3966}}
\author{Riku Laine\inst{1,\dagger} \and
Antti Hyttinen\inst{1, 2}\orcidID{0000-0002-6649-3229} \and \\
Michael Mathioudakis\inst{1}\orcidID{0000-0003-0074-3966}}
%
\authorrunning{R. Laine, A. Hyttinen, M. Mathioudakis}
\institute{University of Helsinki, Helsinki, Finland \\
\email{\{riku.laine, antti.hyttinen, michael.mathioudakis\}@helsinki.fi}
\email{\{riku.laine, antti.hyttinen, michael.mathioudakis\}@helsinki.fi} \\
\and HIIT, Department of Computer Science, University of Helsinki, Helsinki, Finland
}
%
\maketitle % typeset the header of the contribution
......@@ -91,7 +92,7 @@ Based on this model, we compute counterfactuals to impute missing outcomes, whic
As we demonstrate over real and synthetic data, our approach estimates the quality of decisions more accurately and robustly compared to previous methods.
%
%% Uncomment for Camera Ready
\keywords{Selective labels \and Selection bias \and Causal modelling}
\keywords{Selective labels \and Selection bias \and Causal modelling \and Bayesian inference \and Model evaluation}
\end{abstract}
\input{introduction}
......@@ -109,7 +110,7 @@ As we demonstrate over real and synthetic data, our approach estimates the quali
\input{conclusions}
\subsubsection{Acknowledgments.}
Authors acknowledge the computer capacity from the Finnish Grid and Cloud Infrastructure (urn:nbn:fi:research-infras-2016072533).
Authors acknowledge the computer capacity from the Finnish Grid and Cloud Infrastructure (urn:nbn:fi:research-infras-2016072533). AH was supported by Academy of Finland grants 295673, 316771 and by HIIT.
\input{appendix}
......
......@@ -15,7 +15,7 @@ from matplotlib.lines import Line2D
plt.rcParams.update({'font.size': 18})
plt.rcParams.update({'figure.figsize': (10, 4)})
plt.rcParams.update({'savefig.format': 'png'})
plt.rcParams.update({'savefig.format': 'pdf'})
plt.rcParams.update({'savefig.dpi': '1200'})
path = "C:/Users/Riku_L/bachelors-thesis/data/result_files/"
......