call @PLT Performance - D Programming Language Discussion Forum

Forums

New users
- Learn
Community
- General
- Announce
Improvements
- DIP Ideas
- DIP Devel.
Ecosystem
- GDC
- LDC
- Debuggers
- IDEs
- DWT
Development
- Internals
- Issues
- Beta
- DMD
- Phobos
- Druntime
- Study
Turkish
- Genel
- Duyuru

Index » LDC » call @PLT Performance

Thread overview

call @PLT Performance
Jan 16, 2019 SrMordred
Jan 16, 2019 Johan Engelen
Jan 16, 2019 SrMordred

January 16, 2019

call @PLT Performance

Posted by SrMordred

SrMordred

Compiler noob here:

auto a = popcnt(bitset);
auto b = bsf(bitset);

generate this:

call    pure nothrow @nogc @safe int core.bitop.popcnt(uint)@PLT
call    pure nothrow @nogc @safe int core.bitop.bsf(uint)@PLT

Why not generate the bsf/popcnt instruction?

Aren't this call's slower?

(this question expand to all the places where calls to @PLT happen)

January 16, 2019

Re: call @PLT Performance

Posted by Johan Engelen
in reply to SrMordred

Johan Engelen

Posted in reply to SrMordred

On Wednesday, 16 January 2019 at 13:03:59 UTC, SrMordred wrote:
> Compiler noob here:
>
> auto a = popcnt(bitset);
> auto b = bsf(bitset);
>
> generate this:
>
> call    pure nothrow @nogc @safe int core.bitop.popcnt(uint)@PLT
> call    pure nothrow @nogc @safe int core.bitop.bsf(uint)@PLT
>
> Why not generate the bsf/popcnt instruction?
>
> Aren't this call's slower?

Yeah this is a known issue: LDC does not cross-module inline. You can enable that by passing the "-enable-cross-module-inlining" compile flag.
It's a long standing issue, but became a little less urgent because of LTO (`-flto=...`).

-Johan

January 16, 2019

Re: call @PLT Performance

Posted by SrMordred
in reply to Johan Engelen

SrMordred

Posted in reply to Johan Engelen

On Wednesday, 16 January 2019 at 14:19:27 UTC, Johan Engelen wrote:
> On Wednesday, 16 January 2019 at 13:03:59 UTC, SrMordred wrote:
>> Compiler noob here:
>>
>> auto a = popcnt(bitset);
>> auto b = bsf(bitset);
>>
>> generate this:
>>
>> call    pure nothrow @nogc @safe int core.bitop.popcnt(uint)@PLT
>> call    pure nothrow @nogc @safe int core.bitop.bsf(uint)@PLT
>>
>> Why not generate the bsf/popcnt instruction?
>>
>> Aren't this call's slower?
>
> Yeah this is a known issue: LDC does not cross-module inline. You can enable that by passing the "-enable-cross-module-inlining" compile flag.
> It's a long standing issue, but became a little less urgent because of LTO (`-flto=...`).
>
> -Johan

Oh Nice, thanks!

Top | Forum index | About this forum

Copyright © 1999-2021 by the D Language Foundation