I have gathered my data regarding the speed and climb characteristics of P-51B and P-51D vs. Bf 109 K. The K-4`s advantage is rather massive, particularly below 7000m.


The P-51B results are from British flights tests. Data for P-51D is from the book America`s Hundred Thousend.
Data for Bf 109 K-4 are from German flight tests with DB 605 DB and DC engine, for 1850 and 2000 HP respectively, and a standard propellor.
Also, while I should note that F4U had somewhat right in that the 109 series were heavy to manouver in pitch, one should note that the Mustang was no better. As Mark Hanna mentioned, the elevator forces were similiar.
There`s also a confirmation to this by Dave Southwood:
The Bf109G is heavy to manoeuvre in pitch, being similar to a Mustang. At 520kph it is possible to pull 4g with one hand, but I find it more comfortable to use both hands on the stick for looping manoeuvres, normally entered at 420kph and 3g. Pitch trim changes with speed are moderate, and the tail plane trim wheel mounted abeam the pilots' left hip is easy to use. For a display, I run it at 420-450kph in trim, and then do not retrim. This causes no excessive stick forces during the display. Overall the aircraft is straightforward to handle in pitch.
It appears from the description that this wasn`t as restrictive though.
It also agrees with the finding of Society of Experimental Test Pilots, who ran a flight test comparison of the F6F-5, FG-1D (Goodyear F4U), P-47D-40 and P-51D. Chief test pilot was John Ellis of Kal-Aero.
The following give an insight to elevator stick forces and the pilot`s opinion :
MANEUVERING STABILITY stick forces/g at Vmax
FG-1--5 lbs/g (too light)
P-47--7.5 lbs/g (ideal)
F6F--12.5 lbs/g (barely acceptable)
P-51--over 20 lbs/g (excessive)
Regarding stall characteristics :
STALL normal (straight and level decelerating at 1 kt/sec.) and accelerated (constant 3g turn decelerating at 1 kt/sec.)
Aerodynamic warning:
Best--P-47, with buffet 5 kt above stall.
Worst--P-51, no buffet or other warning.
Height loss, accelerated stall:
Best--P-47, 100 ft.
Worst--P-51, 500 ft.
FG-1 and F6F both 150 ft.
Behavior during accelrated stall:
Most predictable and controllable: P-47 and F6F. Both could be flown at will into the pre-stall buffet, which at no time was heavy enough to present problems with tracking, and held at maximum usable lift coefficient with ease. Sideslip became noticeable as wing heaviness correctible with rudder. There was little tendency to depart controlled
flight. The FG-1 suffered severe airframe buffet shortly before the stall, but at the stall there was a strong g-break and rapid right wing drop--no matter which direction the turn. Careful left rudder could prevent wing drop, but then at the stall the aircraft became very unpredictable, bucking and porpoising, with a tendency to a sudden departure.
The P-51 gave no warning whatsoever of an accelerated stall. At the stall, the aircraft departed with complete loss of control, achieving 270-degree of roll before recovery. Departure was accompanied by violent aileron snatch strong enough to rip the control stick from the hand. In short, the P-51 suffered from a Part I deficiency.
Compare to Soutwood on the 109`s :
The idle power stall characteristics of the aircraft are very benign and affected little by undercarriage and flap position. Stalling warning is a slight wing rock with the stick floating right by about 2 inches. This occurs 10klph before the stall. The stall itself is a left wing drop through about 15 degrees with a slight nose drop, accompanied by a light buffet. All controls are effective up to the stall, and recovery is instant on moving the stick forward. Stall speeds are 155kph clean and 140kph with gear and flap down. In a turn at 280kphwith display power set, stall warning is given by light buffet at 3g, and the stall occurs at 3.5g with the inside wing dropping. Again, recovery is instant on easing the stick forward. One interesting feature is the leading edge slats. When these deploy at low speeds or in a turn, a 'clunk' can be heard and felt, but there is no disturbance to the aircraft about any axis. I understand that the Bf109E rolled violently as the slats deployed, and I am curious to know the difference to the Gustav that caused this.
The control heavyness of the 109 only applied to the elevators, the ailerons and rudders were light.
In fact, I have 3 different sources which all state that the aileron forces were light, a German, Finnish test, and Soutwood`s and Hanna`s words, all indicate that even at 450-480 km/h IAS range, stick force required for full deflection was a mere 20 lbs. Very light indeed. The measured roll rate at this speed was 80-95 degree/sec at this speed, again, rather good.