Fix Issue 17679 - SortedRange.contains should be deprecated in favor of the generic canFind #5651

wilzbach · 2017-07-24T20:26:11Z

See also: http://forum.dlang.org/post/sjwsaofnanozytrvwxpb@forum.dlang.org


dlang-bot · 2017-07-24T20:26:13Z

Thanks for your pull request, @wilzbach!

Bugzilla references

Auto-close	Bugzilla	Description
✓	17679	SortedRange.contains should be deprecated in favor of the generic canFind

wilzbach · 2017-07-24T20:27:17Z

std/range/package.d

+    assert(r1.release() == [ 64, 52, 42, 3, 2, 1 ]);
+}
+
+deprecated


I duplicated the relevant unittest so that we continue to test contains while it is deprecated.

wilzbach · 2017-07-24T20:28:17Z

std/range/package.d

+$(RED Deprecated - please use $(REF canFind, std,algorithm.searching) instead).
+ */
+    // @@@DEPRECATED_2018-02@@@
+    deprecated("Please use std.algorithm.searching : canFind instead.")


@jmdavis - I hope I have done it correctly this time?
IIRC the stages were:

1. mark in documentation as deprecated
2. remove from documentation
3. remove entirely
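A minimal sketch of the first stage, assuming a hypothetical `Haystack` struct rather than the real `SortedRange`: the member carries a `deprecated` attribute with a message, and a caller that is itself marked `deprecated` can keep exercising it without triggering the deprecation message, which is the idea behind the `deprecated` unittest in the diff above.

```d
import std.algorithm.searching : canFind;

// Hypothetical stand-in for a type with a member that is being phased out.
struct Haystack
{
    int[] items;

    // @@@DEPRECATED_2018-02@@@ (marker style copied from the diff above)
    deprecated("Please use std.algorithm.searching : canFind instead.")
    bool contains(int value)
    {
        return items.canFind(value);
    }
}

// Code that is itself marked `deprecated` may use deprecated symbols
// without a diagnostic, so the old member can still be tested.
deprecated bool stillWorks()
{
    return Haystack([1, 2, 3]).contains(2);
}

void main()
{
    // New code uses the replacement directly.
    assert(Haystack([1, 2, 3]).items.canFind(2));
}
```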

bubnenkoff · 2017-07-24T20:45:19Z

Should canFind be aliased/renamed to contains? It's a more general name.

andralex · 2017-07-24T21:20:21Z

The apparent duplication is intentional and useful. canFind means "can run linear find and find this value", whereas contains means "quickly rummage for this value" and must finish in logarithmic time.
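As a small illustration of the two calls being contrasted (the data is made up for the example): `canFind` works on any input range, while `contains` is only offered by `SortedRange`, here obtained through `assumeSorted`.

```d
import std.algorithm.searching : canFind;
import std.range : assumeSorted;

void main()
{
    int[] data = [1, 2, 3, 42, 52, 64];

    // Plain array: canFind is the generic search.
    assert(data.canFind(42));

    // SortedRange: contains is a member that relies on the sort order
    // and searches a random-access range in logarithmic time.
    auto sorted = data.assumeSorted;
    assert(sorted.contains(42));
    assert(!sorted.contains(7));
}
```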

JackStouffer · 2017-07-24T21:24:30Z

@andralex But canFind specializes for SortedRange, so there's no speed benefit to using one over the other.

andralex · 2017-07-24T21:29:16Z

That's an opportunistic optimization. Calling canFind means "I don't care whether this is linear, just find the thing". It's nice to have that use contains if available.

wilzbach · 2017-07-24T21:29:20Z

> @andralex But canFind specializes for SortedRange, so there's no speed benefit to using one over the other.

Yep copy/pasting from Bugzilla:

Since #4907 (December 2016), find takes advantage of Sortedness:

phobos/std/algorithm/searching.d

Line 1509 in 57b8d25

// If the haystack is a SortedRange we can use binary search to find the needle.

andralex · 2017-07-24T21:30:21Z

@wilzbach the usefulness of the distinction is in generic code that wants to make sure fast searching is available

wilzbach · 2017-07-24T21:46:37Z

> the usefulness of the distinction is in generic code that wants to make sure fast searching is available

Isn't it the opposite for generic code? It should only care about the canFind operation.
Btw I just looked at the Phobos code, and the only function that cares whether something is of type SortedRange is find.
Of course, there was #3534, which proposed more specializations via something like isSortedRange!(Range, pred), but we don't even have isSortedRange, and afaict the purpose of SortedRange was to do the optimizations inside the function (e.g. canFind), a la:

static if (isSortedRange!(Range, pred))
 // faster path
else
 // normal path

and not at the call site:

static if (hasMember!(Range, "contains"))
    r.contains(42);
else
    r.canFind(42);
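For reference, the call-site variant can be written as a compilable helper. This is only a sketch: `fastOrLinearHas` is a hypothetical name, not a Phobos function, and it assumes that a member named `contains` implies a fast lookup.

```d
import std.algorithm.searching : canFind;
import std.range : assumeSorted;
import std.traits : hasMember;

// Hypothetical helper: prefers a `contains` member when the range has one.
bool fastOrLinearHas(Range, T)(Range r, T value)
{
    static if (hasMember!(Range, "contains"))
        return r.contains(value);   // e.g. SortedRange
    else
        return r.canFind(value);    // generic fallback
}

void main()
{
    int[] data = [1, 2, 3, 42];
    assert(fastOrLinearHas(data, 42));               // array: canFind path
    assert(fastOrLinearHas(data.assumeSorted, 42));  // SortedRange: contains path
    assert(!fastOrLinearHas(data.assumeSorted, 5));
}
```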

Anyway I don't care about this too strongly. I just found on the NG that people struggle with it and find it confusing.

andralex · 2017-07-24T23:07:50Z

Offering functions of distinct complexity is within the doctrine of generic programming; there's long-established practice of such in the STL, viz. member vs. non-member find. The same applies to DbI at least for as long as there's no systematic way of getting the complexity of an algorithm by means of introspection. My BigO library may challenge that notion in the future, but as things stand, reflecting complexity in the name of functions is robust, established practice.

The Achilles' heel of checking for SortedRange is that sorted ranges are not the only ones with a fast contains; a variety of hashed containers come to mind. Any range with fast searching should implement contains. That information is vital and could make the difference between working and practically impossible, as anyone who's joined two tables would know.
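One way to read this argument in code, as a rough sketch only: generic code constrains on the presence of `contains` instead of on the concrete type, so both a `SortedRange` and a hashed container qualify. `HashedSet` and `countMatches` are hypothetical names invented for this example.

```d
import std.range : assumeSorted;

// Hypothetical hashed set built on a built-in associative array.
struct HashedSet(T)
{
    private bool[T] table;

    void insert(T value) { table[value] = true; }

    // Lookup through the associative array's `in` operator.
    bool contains(T value) const { return (value in table) !is null; }
}

// Hypothetical join-like helper: accepted only if `lookup` offers `contains`.
size_t countMatches(Lookup, T)(T[] keys, Lookup lookup)
    if (is(typeof(lookup.contains(keys[0])) : bool))
{
    size_t n;
    foreach (k; keys)
        if (lookup.contains(k))
            ++n;
    return n;
}

void main()
{
    HashedSet!int set;
    foreach (v; [2, 3, 5, 7])
        set.insert(v);

    int[] keys = [1, 2, 3, 4, 5];
    assert(countMatches(keys, set) == 3);                        // hashed lookup
    assert(countMatches(keys, [2, 3, 5, 7].assumeSorted) == 3);  // binary search

    // A plain array has no `contains`, so the call is rejected at compile time.
    static assert(!__traits(compiles, countMatches(keys, [2, 3, 5, 7])));
}
```

With this shape, a caller that passes a plain array gets a compile error rather than a silently linear inner loop.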

JackStouffer · 2017-07-31T19:49:24Z

@wilzbach Since Andrei has a good argument in favor and there's been no activity on this, I'm closing this. Please reopen if you have a counterpoint.

Fix Issue 17679 - SortedRange.contains should be deprecated in favor …

6ae3ca5

…of the generic canFind

wilzbach requested review from JackStouffer, PetarKirov and andralex as code owners July 24, 2017 20:26

dlang-bot added the Severity:Enhancement label Jul 24, 2017

wilzbach changed the title ~~Fix Issue 17679 - SortedRange.contains should be deprecated in favor …~~ Fix Issue 17679 - SortedRange.contains should be deprecated in favor of the generic canFind Jul 24, 2017

JackStouffer closed this Jul 31, 2017

wilzbach deleted the fix-17679 branch December 12, 2017 09:18
