Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Normalize content items data #35

Open
ledbetterljoshua opened this issue Aug 5, 2020 · 0 comments
Open

Normalize content items data #35

ledbetterljoshua opened this issue Aug 5, 2020 · 0 comments

Comments

@ledbetterljoshua
Copy link
Contributor

ledbetterljoshua commented Aug 5, 2020

The idea here is that data from different RSS feeds may be returned in different shapes; however, when stored in the database and when accessing the data via the API, the data should be uniform.

One example of this is if the content of the article exists in the "description" key, instead of the "content" key. The lesswrong blog does this. Another is if a description is missing entirely, like in Sam Altmans blog. For this, we may want to create a description from the content (just the first sentence or so), and save that as the description in the DB.

I'm sure there are other examples, but we might want to start collecting them and thinking about them now.

@ledbetterljoshua ledbetterljoshua changed the title Normalize content items Normalize content items data Aug 5, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant