Machine learning classification: Evaluating performance across multiple datasets