I understand why they are important to you, but if you turn it around and look at it from the point of view of the benchmark designers and their goals, there are just better ways to go about it.
If you're regularly getting autocompletion delays in the seconds then you have something else wrong. I would try reinstalling Visual Studio, or whatever IDE you use.
Even auto completion delays are going to be more the result of slower HD speed (Usually the result of having a cache miss). CPU speed may play a role, but not a major one. (not with newer CPUs).
The issue is that programming tools don't stress a particular component. That is what hardware reviews are all about, stressing particular components and getting reliable repeatable results that can be used to determine components speed.
If you look at most hard drive reviews, they don't review things like game FPS (and they shouldn't) or decompression speed. Those things depend on other factors besides the hard drive. In fact, the hard drive is a particularly easy to measure component, there are only 4 factors that matter, sequential read speed, sequential write speed, random read speed, and random write speed. Those are things that are very easy to measure very accurately. They translate directly to real world performance as well.
CPUs and GPUs, on the other hand, have no such simple measurements, hence the reason that you will frequently see a whole salvo of software used to test these components and not some standard synthetic test suite. These parts can have vastly different results all depending on how the software was programmed to use them.
Software development tools are almost completely I/O dependent. So the only test you would use them for is to see how fast the hard drive is. The problem is, there isn't really a good program that you can say "Compile this to see how good the hard drive is!". The thing is, you could have your software stored sequentially or it could be distributed throughout the hard drive. From a reviewers standpoint, it is nearly impossible to change this. So from hardware to hardware, setup to setup, you are going to have vastly differing speeds.
Not only that, but there are vast variances in things like "Where is the hard drive head when the compiler starts?" or "Was some file already compiled which gives a speed up?". The repeatability of the tests is staggeringly bad, and worse with larger projects. All for what? Something that a specialty program can already give precise and repeatable results for.
The next problem comes from "What sort of tests could be performed?". You mentioned intelisense. How, exactly, would you perform a test for that? Just about the only thing that would be measurable is the compile time. What would a reviewer be able to say beyond "It felt pretty snappy."
You say you want full system testing, but do you really? Do you really want anand to go through every permutation of hardware combinations available and test how the software suite works on them? It is impossible to do, there are at least 10,000! combinations (the ! is a factorial).
If you ever run a review site, go ahead and test compile times. But we are already telling you, they are going to be closely linked to hard drive speed while being varied enough to give you no conclusions about anything.