The results reflect the games' maps and game style, testing each game with the same simple box level would dramatically change the results, but wouldn't be a true representation of how the network changes affected the game performance. This is why Doom3 and Doom3 ROE, which are basically the same code, have different results.