Thanks for the great repo! it Works fine with strong LLMs but it fails with small ones such as OSS or Qwen32B - is there a way to have sort of refinement loop? I found myself just copy pasting the errors 2 or 3 times till it works, did you implement such an option? if no, any tips how to add it?