Howdy, Nrshida.
If you look into the structure of this aircraft you can see how it is a little unusual and should have a good deal more integrity than a Spitfire, say.
No one can tell at all from looking at the structure. You can't tell by eye if a control surface will experience catastrophic flutter at a particular speed. As one example, there was a fatal crash at Reno Air Races a while back where an unlimited racer (based on P-51 but with a Learjet horizontal stab) flew through the wake of another racer, which triggered flutter on the tail, which then failed. A Learjet tail is plenty strong and able of course to fly at very high speeds usually, but the aerodynamics were such that on the P-51, it could get into a regime where flutter could be triggered and result in catastrophic failure. Not only would you not see that coming by looking at the structure of the Learjet tail or knowing that it routinely flew at near Mach 1, but the designers and crew (much more experienced in building and modifying high-speed unlimited-class racing aircraft than you or I) obviously didn't. There are examples of very capable aircraft designers not knowing this based on structure or design, either, and only knowing it once seeing it happen in test flights.
The loss of sets of surfaces can't be modelling flutter as they detach exactly at the same time.
They can. Flutter can happen for both at about the same time. In fact, since they are nearly identical in design, one's first thought should be that both ailerons should could very well experience flutter at about the same time. Here are a couple videos of planes suffering failure to both left and right wings and/or both left and right wingtips at the about the same time:
http://www.youtube.com/watch?v=n06WNSS4tFshttp://www.youtube.com/watch?v=LPbhQS6IljUBesides the evaluation aircraft went faster than ours does this, without incident.
There are two aspects here. One is that you say this, but what is the situation and what is the reference? Did one person dive a Ki-84 to 550 mph and recover without damage to the aircraft while several other references state that it comes apart at 550? Also, failures in real life are stochastic things. There are some P-38's that tore apart in terminal-velocity dives and many P-38's that didn't. To model a P-38, you then need to choose whichever mode is more typical, or if you have enough data statistically, you could model it as a random process of failure of 1 in N terminal-velocity dives, but you'd need a lot more statistics than are likely available to judge that and to set N correctly.