Select a specific dataset for training #1014

Elfsong · 2022-04-26T06:18:54Z


import functools

import seqio
import t5.data.postprocessors

# clef_dataset_fn, c_preprocessor, c_metric, lang, and DEFAULT_OUTPUT_FEATURES
# are assumed to be defined elsewhere.
seqio.TaskRegistry.add(
    "task_1",
    source=seqio.FunctionDataSource(
        dataset_fn=functools.partial(clef_dataset_fn, task="task_name", lang=lang),
        splits=["train", "dev", "test"],
    ),
    preprocessors=[
        c_preprocessor,
        seqio.preprocessors.tokenize_and_append_eos,
    ],
    postprocess_fn=t5.data.postprocessors.lower_text,
    metric_fns=[c_metric],
    output_features=DEFAULT_OUTPUT_FEATURES,
)

Which data split (train/dev/test) is used for training by default? Can I specify it explicitly, for example to train the model on the dev split?
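Independent of any trainer-level setting, one possible way to control which data is used for training is to remap split names inside the function passed as dataset_fn. The sketch below is only an illustration: it relies on seqio's FunctionDataSource calling dataset_fn with split and shuffle_files, and it replaces the real clef_dataset_fn with a hypothetical list-based stand-in so it runs without seqio or TensorFlow. The helper remap_splits and the sample data are assumptions made for this example and are not part of seqio or T5.

```python
import functools

# Hypothetical in-memory data standing in for the real CLEF files.
_FAKE_DATA = {
    "train": ["train-0", "train-1"],
    "dev": ["dev-0"],
    "test": ["test-0"],
}


def clef_dataset_fn(split, shuffle_files=False, task=None, lang=None):
    """Illustrative stand-in: real code would return a tf.data.Dataset."""
    return [f"{task}/{lang}/{example}" for example in _FAKE_DATA[split]]


def remap_splits(dataset_fn, mapping):
    """Wraps dataset_fn so that a requested split is redirected via mapping."""

    def wrapped(split, shuffle_files=False):
        # Splits missing from the mapping are passed through unchanged.
        return dataset_fn(mapping.get(split, split), shuffle_files)

    return wrapped


base_fn = functools.partial(clef_dataset_fn, task="task_name", lang="en")

# Any caller that asks for "train" now receives the dev examples.
dev_as_train_fn = remap_splits(base_fn, {"train": "dev"})

train_examples = dev_as_train_fn("train", shuffle_files=True)
test_examples = dev_as_train_fn("test")
print(train_examples)
```

With this approach the wrapped function would be passed as dataset_fn in place of the partial, and the task definition itself stays unchanged. The trade-off is that the original train split is no longer reachable under the name "train", so a setting that selects the split directly is usually cleaner when the trainer exposes one.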

Answered by Elfsong

Elfsong · 2022-04-26T06:23:09Z


Can we use "utils.run.dataset_split" in the training phase?
