Array start index - D Programming Language Discussion Forum

Forums

New users
- Learn
Community
- General
- Announce
Improvements
- DIP Ideas
- DIP Devel.
Ecosystem
- GDC
- LDC
- Debuggers
- IDEs
- DWT
Development
- Internals
- Issues
- Beta
- DMD
- Phobos
- Druntime
- Study
Turkish
- Genel
- Duyuru

Index » Learn » Array start index

Thread overview

Array start index
Aug 01, 2015 DLearner
Aug 01, 2015 Rikki Cattermole
Aug 01, 2015 bachmeier
Aug 01, 2015 John Colvin
Aug 01, 2015 DLearner
Aug 01, 2015 Andrej Mitrovic
Aug 01, 2015 bachmeier
Aug 02, 2015 QAston
Aug 03, 2015 bachmeier
Aug 03, 2015 DLearner
Aug 04, 2015 Jonathan M Davis
Aug 04, 2015 QAston
Aug 03, 2015 jmh530
Aug 03, 2015 Observer
Aug 04, 2015 Marc Schütz
Feb 06, 2017 Bastiaan Veelo
Feb 06, 2017 Bastiaan Veelo
Feb 06, 2017 pineapple
Feb 06, 2017 Nemanja Boric
Feb 07, 2017 bachmeier
Feb 06, 2017 Bastiaan Veelo
Feb 06, 2017 Ali Çehreli
Feb 07, 2017 Bastiaan Veelo
Feb 07, 2017 Ali Çehreli
Feb 07, 2017 Bastiaan Veelo
Feb 07, 2017 Ali Çehreli
Feb 07, 2017 Bastiaan Veelo
Feb 07, 2017 Ali Çehreli
Feb 08, 2017 Bastiaan Veelo
Feb 07, 2017 WhatMeForget

August 01, 2015

Array start index

Posted by DLearner

DLearner

Does the D language set in stone that the first element of an array _has_ to be index zero?
Wouldn't starting array elements at one avoid the common 'off-by-one' logic error, it does
seem more natural to begin a count at 1.

Actually, maybe even better to allow array definitions of form
int foo[x:y];
(y >= x) creating integer variables foo[x], foo[x+1],...,foo[y].

I think the (very old) IBM PL/I language was like this.

August 01, 2015

Re: Array start index

Posted by Rikki Cattermole
in reply to DLearner

Rikki Cattermole

Posted in reply to DLearner

On 1/08/2015 9:35 p.m., DLearner wrote:
> Does the D language set in stone that the first element of an array
> _has_ to be index zero?
> Wouldn't starting array elements at one avoid the common 'off-by-one'
> logic error, it does
> seem more natural to begin a count at 1.
>
> Actually, maybe even better to allow array definitions of form
> int foo[x:y];
> (y >= x) creating integer variables foo[x], foo[x+1],...,foo[y].
>
> I think the (very old) IBM PL/I language was like this.

In c style languages (like D) the index actually defines the offset in memory. Not actually the index.
So while in some languages 1 is used instead of 0, 0 maps better to the hardware.

Think of this byte array:

ubyte* myptr = [

|---|-------|
| i | value |
|---|-------|
| 0 | 1     |
| 1 | 255   |

];

For 0 start of index:
size_t i = 0;
assert(myptr[i] == 1);

For 1 start of index:
size_t i = 1;
assert(i != 0);
assert(myptr[i-1] == 1);

While this is not the complete reason why 0 is chosen, it is something to think about.

August 01, 2015

Re: Array start index

Posted by bachmeier
in reply to DLearner

bachmeier

Posted in reply to DLearner

On Saturday, 1 August 2015 at 09:35:53 UTC, DLearner wrote:
> Does the D language set in stone that the first element of an array _has_ to be index zero?
> Wouldn't starting array elements at one avoid the common 'off-by-one' logic error, it does
> seem more natural to begin a count at 1.
>
> Actually, maybe even better to allow array definitions of form
> int foo[x:y];
> (y >= x) creating integer variables foo[x], foo[x+1],...,foo[y].
>
> I think the (very old) IBM PL/I language was like this.

Seems you could easily wrap an array in a struct, define opIndex and opSlice appropriately, and use alias this to keep the other properties of the array. The problem you'll run into is interacting with other code that assumes zero-based indexing. I thought about setting the first index at 1 for my library that embeds D inside R, since that's how it's done in R, but very quickly realized how confusing it would be.

August 01, 2015

Re: Array start index

Posted by John Colvin
in reply to DLearner

John Colvin

Posted in reply to DLearner

On Saturday, 1 August 2015 at 09:35:53 UTC, DLearner wrote:
> Does the D language set in stone that the first element of an array _has_ to be index zero?

For the builtin slice types? Yes, set in stone.

> Wouldn't starting array elements at one avoid the common 'off-by-one' logic error, it does
> seem more natural to begin a count at 1.
>
> Actually, maybe even better to allow array definitions of form
> int foo[x:y];
> (y >= x) creating integer variables foo[x], foo[x+1],...,foo[y].
>
> I think the (very old) IBM PL/I language was like this.

See https://www.cs.utexas.edu/users/EWD/transcriptions/EWD08xx/EWD831.html

As other commenters have/will point out, you can easily define a custom type in D that behaves as you describe, and - Dijkstra notwithstanding - there are valid uses for such things.

August 01, 2015

Re: Array start index

Posted by DLearner
in reply to John Colvin

DLearner

Posted in reply to John Colvin

On Saturday, 1 August 2015 at 17:55:06 UTC, John Colvin wrote:
> On Saturday, 1 August 2015 at 09:35:53 UTC, DLearner wrote:
>> Does the D language set in stone that the first element of an array _has_ to be index zero?
>
> For the builtin slice types? Yes, set in stone.
>
>> Wouldn't starting array elements at one avoid the common 'off-by-one' logic error, it does
>> seem more natural to begin a count at 1.
>>
>> Actually, maybe even better to allow array definitions of form
>> int foo[x:y];
>> (y >= x) creating integer variables foo[x], foo[x+1],...,foo[y].
>>
>> I think the (very old) IBM PL/I language was like this.
>
> See https://www.cs.utexas.edu/users/EWD/transcriptions/EWD08xx/EWD831.html
>
> As other commenters have/will point out, you can easily define a custom type in D that behaves as you describe, and - Dijkstra notwithstanding - there are valid uses for such things.

D is a C derivative, so it seems a shame not to identify causes of bugs in C,
and design them out in D.

For example, in C difficult to write non-trivial commercial programs without using pointers.
Pointer manipulation has a terrible reputation for bugs.
But in D, easy to write commercial programs without using pointers.
Problem has been designed away.

Similarly, off-by-one array bugs are commonplace in C.
We should seek to eliminate the source of those bugs, which basically reduces to the
issue that programmers find it unnatural to start a count at zero.
Whether they _should_ find a zero start unnatural is irrelevant - they just do as an
observed fact, so let's change the language so the issue is avoided (designed away).

Suggestion: if the codebase for D is considered so large that zero-basing cannot now be changed,
why not extend the language to allow for array definitions like 'int[x:y] foo'?
And then have a rule that 'int[:y] bar' defines a 1-based array of y elements?

August 01, 2015

Re: Array start index

Posted by Andrej Mitrovic
in reply to DLearner

Andrej Mitrovic

Posted in reply to DLearner

On 8/1/15, DLearner via Digitalmars-d-learn <digitalmars-d-learn@puremagic.com> wrote:
> D is a C derivative, so it seems a shame not to identify causes
> of bugs in C,
> and design them out in D.

This has already been done! D defines an array to be a struct with a pointer and a length. See this article: http://www.drdobbs.com/architecture-and-design/cs-biggest-mistake/228701625

I would argue it's not "off-by-one" that's causing most issues when dealing with C "arrays", but instead it's in general out-of-bounds issues (whether it's off bye one or off by 50..) since you often don't have the length or could easily use the wrong variable as the length.

Think about how much D code would actually have subtle off-by-one errors if D didn't use 0-based indexing like the majority of popular languages use. Any time you would interface with other languages you would have to double, triple-check all your uses of arrays.

FWIW at the very beginning I also found it odd that languages use 0-based indexing, but that was before I had any significant programming experience under my belt. By now it's second nature to me to use 0-based indexing.

August 01, 2015

Re: Array start index

Posted by bachmeier
in reply to Andrej Mitrovic

bachmeier

Posted in reply to Andrej Mitrovic

On Saturday, 1 August 2015 at 19:04:10 UTC, Andrej Mitrovic wrote:
> On 8/1/15, DLearner via Digitalmars-d-learn <digitalmars-d-learn@puremagic.com> wrote:
>> D is a C derivative, so it seems a shame not to identify causes
>> of bugs in C,
>> and design them out in D.
>
> This has already been done! D defines an array to be a struct with a pointer and a length. See this article: http://www.drdobbs.com/architecture-and-design/cs-biggest-mistake/228701625
>
> I would argue it's not "off-by-one" that's causing most issues when dealing with C "arrays", but instead it's in general out-of-bounds issues (whether it's off bye one or off by 50..) since you often don't have the length or could easily use the wrong variable as the length.
>
> Think about how much D code would actually have subtle off-by-one errors if D didn't use 0-based indexing like the majority of popular languages use. Any time you would interface with other languages you would have to double, triple-check all your uses of arrays.
>
> FWIW at the very beginning I also found it odd that languages use 0-based indexing, but that was before I had any significant programming experience under my belt. By now it's second nature to me to use 0-based indexing.

But what type of programming are you doing? Even after decades of programming and trying out dozens of languages, zero-based indexing still gets me at times when the arrays I work with represent vectors and matrices. Especially when porting code from other languages that use one-based indexing. One of the nice things about D is that it gives you the tools to easily make the change if you want.

August 02, 2015

Re: Array start index

Posted by QAston
in reply to bachmeier

QAston

Posted in reply to bachmeier

On Saturday, 1 August 2015 at 23:02:51 UTC, bachmeier wrote:
> But what type of programming are you doing? Even after decades of programming and trying out dozens of languages, zero-based indexing still gets me at times when the arrays I work with represent vectors and matrices. Especially when porting code from other languages that use one-based indexing. One of the nice things about D is that it gives you the tools to easily make the change if you want.

Adding 1-indexed arrays to the language fixes nothing. Just write your 1-indexed array type and if you enjoy using it, publish it as a library. Who knows, if demand is high it may even end up in phobos.

August 03, 2015

Re: Array start index

Posted by bachmeier
in reply to QAston

bachmeier

Posted in reply to QAston

On Sunday, 2 August 2015 at 21:58:48 UTC, QAston wrote:

> Adding 1-indexed arrays to the language fixes nothing. Just write your 1-indexed array type and if you enjoy using it, publish it as a library. Who knows, if demand is high it may even end up in phobos.

Oh, I don't think that's a good idea. It's too confusing to have more than one method of indexing within the same language. You just have to do a thorough job of testing, as the possibility of errors is something you'll have to live with, given the different design choices of different languages.

August 03, 2015

Re: Array start index

Posted by jmh530
in reply to DLearner

jmh530

Posted in reply to DLearner

On Saturday, 1 August 2015 at 09:35:53 UTC, DLearner wrote:
> Does the D language set in stone that the first element of an array _has_ to be index zero?
> Wouldn't starting array elements at one avoid the common 'off-by-one' logic error, it does
> seem more natural to begin a count at 1.
>
> Actually, maybe even better to allow array definitions of form
> int foo[x:y];
> (y >= x) creating integer variables foo[x], foo[x+1],...,foo[y].
>
> I think the (very old) IBM PL/I language was like this.

I come from matlab and R which start matrices at 1. I used to think it was more natural. However after I started using numpy I now think 0 index is better.
Also see http://www.cs.utexas.edu/users/EWD/transcriptions/EWD08xx/EWD831.html
Give it a try. You might find you like it.

Top | Forum index | About this forum

Copyright © 1999-2021 by the D Language Foundation