Research has revealed that "multi-hop reasoning tasks exhibit substantial performance degradation as context grows, whereas ...