Previous research suggests that performance evaluations conducted for personnel decisions tend to be substantially more lenient than those conducted for research purposes. Because LOE is a "jeopardy" event, we hypothesized that LOE ratings would be substantially more lenient than comparable LOFT ratings. The results failed to support this hypothesis. However, path analyses suggest that the instructors used different rating strategies when evaluating overall PIC and SIC performance in LOFT than in LOE. Specifically, PIC and SIC ratings in LOE tended to emphasize specific behavioral examples (i.e., TECH topics) to a much greater extent than in LOFT.