Criteria for Evaluating Reasoning