Evaluating Large Language Models in Scientific Discovery • Libertify