Skip to content
This repository has been archived by the owner on Jan 6, 2024. It is now read-only.

Latest commit

 

History

History
170 lines (134 loc) · 6.55 KB

Trim.md

File metadata and controls

170 lines (134 loc) · 6.55 KB

Trim

[Source | Tests]

A group of methods that return the SubSequence formed by discarding elements at the start and/or end of the collection which satisfy the given predicate.

This example uses trimming(while:) to get a substring without the white space at the beginning and end of the string.

let myString = "   hello, world  "
print(myString.trimming(while: \.isWhitespace)) // "hello, world"

let results = [2, 10, 11, 15, 20, 21, 100].trimming(while: { $0.isMultiple(of: 2) })
print(results) // [11, 15, 20, 21]

The Algorithms library also includes methods that trim from each end, as well as mutating versions of all three methods.

Detailed Design

New methods are added to Collection and BidirectionalCollection:

extension Collection {
    func trimmingPrefix(while predicate: (Element) throws -> Bool) rethrows -> SubSequence
}

extension BidirectionalCollection {
    func trimmingSuffix(while predicate: (Element) throws -> Bool) rethrows -> SubSequence

    func trimming(while predicate: (Element) throws -> Bool) rethrows -> SubSequence
}

extension Collection where Self: RangeReplaceableCollection {
    mutating func trimPrefix(while predicate: (Element) throws -> Bool) rethrows
}

extension BidirectionalCollection where Self: RangeReplaceableCollection {
    mutating func trimSuffix(while predicate: (Element) throws -> Bool) rethrows

    mutating func trim(while predicate: (Element) throws -> Bool) rethrows
}

There are also overloads of the mutating methods when Self == Self.SubSequence, for non-range-replaceable self-slicing types.

Though the trimming and trimmingSuffix methods are declared on BidirectionalCollection, a less-efficient implementation is possible for any Collection, which would involve always traversing the entire collection. This implementation is not provided, as it would mean developers of generic algorithms who forget to add the BidirectionalCollection constraint will receive that inefficient implementation:

func myAlgorithm<Input>(input: Input) where Input: Collection {
    let trimmedInput = input.trimming(while: { ... }) // Uses least-efficient implementation.
}

func myAlgorithm2<Input>(input: Input) where Input: BidirectionalCollection {
    let trimmedInput = input.trimming(while: { ... }) // Uses most-efficient implementation.
}

Swift provides the BidirectionalCollection protocol for marking types which support reverse traversal, and generic types and algorithms which want to make use of that should add it to their constraints.

Supporting Methods

The endOfPrefix(while:) and startOfSuffix(while:) methods are used in the implementation of the trimming methods described above. As these supporting methods are independently useful, they are included in the library as well.

extension Collection {
  /// Returns the exclusive upper bound of the prefix of elements that satisfy
  /// the predicate.
  func endOfPrefix(
    while predicate: (Element) throws -> Bool
  ) rethrows -> Index
}

extension BidirectionalCollection {
  /// Returns the inclusive lower bound of the suffix of elements that satisfy
  /// the predicate.
  func startOfSuffix(
    while predicate: (Element) throws -> Bool
  ) rethrows -> Index
}

Complexity

Calling this method is O(n).

Naming

The name trim has precedent in other programming languages. Another popular alternative might be strip.

Example usage Languages
''String''.Trim([''chars'']) C#, VB.NET, Windows PowerShell
''string''.strip(); D
(.trim ''string'') Clojure
''sequence'' [ predicate? ] trim Factor
(string-trim '(#\Space #\Tab #\Newline) ''string'') Common Lisp
(string-trim ''string'') Scheme
''string''.trim() Java, JavaScript (1.8.1+), Rust
Trim(''String'') Pascal, QBasic, Visual Basic, Delphi
''string''.strip() Python
strings.Trim(''string'', ''chars'') Go
LTRIM(RTRIM(''String'')) Oracle SQL, T-SQL
string:strip(''string'' [,''option'', ''char'']) Erlang
''string''.strip or ''string''.lstrip or ''string''.rstrip Ruby
trim(''string'') PHP, Raku
[''string'' stringByTrimmingCharactersInSet:[NSCharacterSet whitespaceAndNewlineCharacterSet]] Objective-C/Cocoa
''string'' withBlanksTrimmed ''string'' withoutSpaces ''string'' withoutSeparators Smalltalk
string trim ''$string'' Tcl
TRIM(''string'') or TRIM(ADJUSTL(''string'')) Fortran
TRIM(''string'') SQL
String.trim ''string'' OCaml 4+

Note: This is an abbreviated list from Wikipedia. Full table

The standard library includes a variety of methods which perform similar operations:

  • Firstly, there are dropFirst(Int) and dropLast(Int). These return slices but do not support user-defined predicates. If the collection's count is less than the number of elements to drop, they return an empty slice.
  • Secondly, there is drop(while:), which also returns a slice and is equivalent to a 'left-trim' (trimming from the head but not the tail). If the entire collection is dropped, this method returns an empty slice.
  • Thirdly, there are removeFirst(Int) and removeLast(Int) which do not return slices and actually mutate the collection. If the collection's count is less than the number of elements to remove, this method triggers a runtime error.
  • Lastly, there are the popFirst() and popLast() methods, which work like removeFirst() and removeLast(), except they do not trigger a runtime error for empty collections.

The closest neighbours to this function would be the drop family of methods. Unfortunately, unlike dropFirst(Int),the name drop(while:) does not specify which end(s) of the collection it operates on. Moreover, one could easily mistake code such as:

let result = myString.drop(while: \.isWhitespace)

With a lazy filter that drops all whitespace characters regardless of where they are in the string. Besides that, the root trim leads to clearer, more concise code, which is more aligned with other programming languages:

// Does `result` contain the input, trimmed of certain elements?
// Or does this code mutate `input` in-place and return the elements which were dropped?
let result = input.dropFromBothEnds(while: { ... })

// No such ambiguity here.
let result = input.trimming(while: { ... })