Exploring question answering: metric analysis and evaluation framework for enhanced interpretability