it’s genuinely hard to get normal humanlike labor out of LLMs.. The lack of continual learning is a huge problem. (..) The LLM baseline at many tasks might be higher than an average human's. But there’s no way to give a model high level feedback.