Some DataFrames have a lazy api, which the standard should probably support Should we add a `collect` method, which for eager libraries would just return `self` and for lazy ones would materialise the dataframe?