@straw
Sorta like what devy said: the portrait at the bottom of the screen in YOUR screenshot doesn't have nearly enough focus, which backs up devy bacon's point. The third screenshot example was more 3:4, or rectangular, in screen size/ratio, and the portrait there had a lot more emphasis: it was bigger and had higher opacity. Yours doesn't really have those, so when the text and picture show up, your attention gets split between them a lot more.
The FF Tactics screenshot (I think?) worked better with that style mainly because it had that small scriptlet working for it, like devy said, and because the pictures were made "portraited". First, the text comes directly from the portrait, so there's less of a split in attention when reading, because you sorta get used to it and it's clearer who's saying what.
Second, the picture is made portraited, with the corners rounded off and a shadow added so you don't notice where the picture ends; in yours it's just abruptly cut off. If you put faces in a box, there's more focus on the face. Even the first example screenshot (Grandia?) had the area around the head darkened to keep the focus on the face.