Upload & Sell: Off
Please don't put words in my mouth, use your own voice
I'm not getting ping-ponging here. That occurs when there are TWO widely separated focal points of equal strength fighting for attention. Here the large figure on the left dominates is size and sharpness then eye moved smoothly to the middle and background figures. The main problem I have with the image is you've cut the statue in the foreground off at the knees and the flags are a distraction that pulls attention up off the statue in back, rather than tracking back across the statues to the one in the foreground
What could have been done to improve it would have been to find a point of view that would put the statue in the foreground between the two in the background like this...
Compositionally it's what is called "unifying the centers of interest". By finding a point of view that lines up content in the center of the frame it allows the viewer to see everything important at the same time. It also creates a triangular rather than linear eye path between the heads of the statues. In the middle the missing legs of the foreground statue doesn't seem as odd and the flags now behind the statue don't pull the eye off it.
I used an asymmetrical mat the same tone as the building with more space on the bottom to compensate visually for the unseen part of the legs creating the illusion they are there, but hidden under the mat rather than chopped off above the knees.
Mind you I'm just illustrating what to look for next time when shooting in a similar situation, not suggesting you edit the photo that way, but you could pull it off without anyone being the wiser and it would be stronger, tighter composition of the elements.