Beyond One-Size-Fits-All: Layered Human Evaluation for Reliable NLG Assessment

Date: