Add content option to tabular pipeline #177

Closed
davidmezzetti opened this issue Dec 14, 2021 · 0 comments

Currently, the tabular pipeline loads input data into a pandas DataFrame and creates (id, text, tags) tuples from it. A new option should be added so that the pipeline can also create content records.

The content option can be either a boolean (True/False) or a list.

  • If False (default), no content is created
  • If True, all fields available in the DataFrame are added to a content record for each row id created
  • If content is a list, the list is used to select only those fields from the DataFrame to add to a content record
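
The three cases above can be sketched as follows. This is only an illustration of the proposed behavior, not the pipeline's actual code: the `build_rows` helper, its parameters, and the choice to emit each content record as an `(id, dict, tags)` tuple are assumptions made for the example.

```python
import pandas as pd


def build_rows(df, idcolumn, textcolumns, content=False):
    """Hypothetical helper that mimics the proposed content option.

    content=False -> only (id, text, tags) tuples
    content=True  -> also one content record per row with all DataFrame fields
    content=[...] -> also one content record per row with only the listed fields
    """
    rows = []

    for record in df.to_dict(orient="records"):
        uid = record[idcolumn]

        # Standard (id, text, tags) tuple built from the text columns
        text = ". ".join(str(record[column]) for column in textcolumns)
        rows.append((uid, text, None))

        # Decide which fields, if any, go into the content record
        if content is True:
            fields = list(df.columns)
        elif content:
            fields = [column for column in content if column in df.columns]
        else:
            fields = None

        # Assumed shape: the content record reuses the row id and carries a dict
        if fields is not None:
            rows.append((uid, {column: record[column] for column in fields}, None))

    return rows


df = pd.DataFrame({"id": [1, 2], "text": ["first", "second"], "score": [0.5, 0.9]})

text_only = build_rows(df, "id", ["text"])
all_fields = build_rows(df, "id", ["text"], content=True)
selected = build_rows(df, "id", ["text"], content=["score"])
```

In this sketch an empty list behaves like False, and list entries that are not DataFrame columns are silently skipped. Both are design choices of the example rather than behavior stated in the issue.
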
davidmezzetti added this to the v4.0.0 milestone Dec 14, 2021
davidmezzetti self-assigned this Dec 14, 2021
davidmezzetti changed the title from "Add option to tabular pipeline to add content records" to "Add content option to tabular pipeline" Dec 14, 2021