Thanks for your detailed code implementation for inference and training. I wonder, for the evaluation of canny and lineart tasks, which checkpoints should I use to meet the results in Table 2? And for the code, should I reuse the evaluation code for hed task? Thanks for your answer.