As much as I like the idea:
Something always tells me this is the compiler's job... What clever reasoning are you applying that the compiler's inliner can't? It seems like a different situation to, say, SIMD code, where correctly structuring loops can require a lot of gymnastics that the compiler can't or won't do (floating-point conformance). The inlining decision seems easily automatable in comparison.
I understand that unoptimised builds for debugging are a problem, but a sensible compiler lets you hand-pick your optimisation passes.
In short: why are compilers still not good enough at this, to the point that the programmer needs to be involved?