From 5dd6713b8890f87e2de24c34b65d48577d498ed9 Mon Sep 17 00:00:00 2001
From: Antti Hyttinen <ajhyttin@gmail.com>
Date: Wed, 7 Aug 2019 11:04:05 +0300
Subject: [PATCH] ...

---
 paper/sl.tex | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/paper/sl.tex b/paper/sl.tex
index 99e471d..49dfe2f 100755
--- a/paper/sl.tex
+++ b/paper/sl.tex
@@ -376,7 +376,7 @@ We treat the observations as independent and the still the leniency would be a g
 
 %This is a decider module. We experimented with different combinations of decider and data generating modules to show X / see Y. (to see that our method is robust against non-informative, biased and bad decisions . Due to space constraints we defer these results...)
 
-\paragraph{Algorithms}
+\paragraph{Evaluators}
 We deployed multiple evaluator modules to estimate the true failure rate of the decider module. The estimates should be close to the true evaluation evaluator modules estimates and the estimates will eventually be compared to the human evaluation curve.
 \begin{itemize}
 \item True evaluation
-- 
GitLab