Add keukenliefde parser #877

jaapio · 2023-09-30T12:58:40Z

This patch adds support for keukenliefde.nl

recipe_scrapers/keukenliefdenl.py

jaapio · 2023-10-02T17:47:55Z

@jayaddison I think it is better now; I merged the two loops into one larger loop.

jayaddison · 2023-10-02T20:15:15Z

Thanks! Looking pretty good - do you have an example recipe where the list items (li tags) are used, to check the behaviour of those?

jaapio · 2023-10-02T20:16:43Z

I was just testing a bit more and found a recipe that didn't work because the HTML is different. Can I add a second test somehow to cover the other situations?

recipe_scrapers/keukenliefdenl.py

jayaddison · 2023-10-02T20:21:56Z

> I was just testing a bit more and found a recipe that didn't work because the HTML is different. Can I add a second test somehow to cover the other situations?

Yep, you certainly can - for that I'd recommend copying the approach taken by an existing multi-test scraper.

For example, two tests for The Clever Carrot (note the different test_file_name setting in particular):
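The linked test modules are not reproduced here. As a rough sketch of the pattern only: each test class sets its own test_file_name to pick a different saved page. FixtureTest, the inline fixtures, and the fixture names below are hypothetical stand-ins, not the project's actual base class or test files.

```python
import unittest


class FixtureTest(unittest.TestCase):
    """Hypothetical stand-in for a shared scraper test base class."""

    # Inline HTML keyed by fixture name (assumption: a real project would
    # read saved pages from disk instead).
    fixtures = {
        "keukenliefdenl_1": "<div class='instructions'><p>Stap een</p></div>",
        "keukenliefdenl_2": "<div class='instructions'><ul><li>Stap een</li></ul></div>",
    }
    test_file_name = None  # each subclass sets this to select its fixture

    def setUp(self):
        self.html = self.fixtures[self.test_file_name]


class TestParagraphRecipe(FixtureTest):
    test_file_name = "keukenliefdenl_1"

    def test_uses_paragraphs(self):
        self.assertIn("<p>", self.html)


class TestListItemRecipe(FixtureTest):
    test_file_name = "keukenliefdenl_2"

    def test_uses_list_items(self):
        self.assertIn("<li>", self.html)
```

Keeping one class per saved page means a failure points directly at the markup variant that broke.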

jaapio · 2023-10-02T20:25:20Z

Thanks, will have a look. I found out that some old recipes use a different HTML format, which will make it a challenge to make this work. But that's mostly my issue, as this site seems to be a mess :-P

The site handles different formats depending on the age of the recipe, so add a second test case that covers this behavior.

The oldest recipes do not have the nice classes to find elements, so we do need text matching on the headers.

jaapio · 2023-10-02T21:56:41Z

I added 2 more test cases, which brings the total to 3:

normal case with paragraphs for the instructions
normal case with list items for the instructions
legacy format recipe without classes to match, which requires some guessing

I tried to extract methods where possible to reduce duplicated code. It should also no longer crash when the markup is a bit different from what we expected.

Please let me know what you think of this :-), should I extract the legacy format to another class or is it ok like this?
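For illustration, the three cases listed above could be handled by one function with fallbacks, roughly as sketched here. This is not the merged scraper: the class name "instructions" and the heading text "bereiding" are assumptions, ValueError stands in for whatever exception the project uses, and the standard library's ElementTree (which needs a well-formed page) stands in for the HTML parser a real scraper would use.

```python
import xml.etree.ElementTree as ET

# Hypothetical names: the class and heading text are assumptions for
# illustration, not taken from keukenliefde.nl or the merged scraper.
INSTRUCTIONS_CLASS = "instructions"
LEGACY_HEADING = "bereiding"


def _text(node):
    """Flatten an element's text content and trim whitespace."""
    return "".join(node.itertext()).strip()


def instructions(html):
    """Return the instruction steps, trying the newest markup first."""
    root = ET.fromstring(html)
    steps = []

    container = root.find(f".//div[@class='{INSTRUCTIONS_CLASS}']")
    if container is not None:
        # Cases 1 and 2: prefer list items, fall back to paragraphs.
        nodes = container.findall(".//li") or container.findall(".//p")
        steps = [_text(node) for node in nodes]
    else:
        # Case 3: no classes, so collect paragraphs after the matching
        # heading and stop collecting at the next heading.
        collecting = False
        for node in root.iter():
            if node.tag in ("h2", "h3"):
                collecting = _text(node).lower() == LEGACY_HEADING
            elif collecting and node.tag == "p":
                steps.append(_text(node))

    steps = [step for step in steps if step]
    if not steps:
        # Fail loudly instead of returning an empty result.
        raise ValueError("instructions not found")
    return steps
```

Because the three formats differ only in how the steps are located, a single function with ordered fallbacks stays short; separate classes would mainly pay off if the rest of the page differed as well.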

recipe_scrapers/keukenliefdenl.py

jayaddison · 2023-10-03T06:52:32Z

> should I extract the legacy format to another class or is it ok like this?

Hmm, good question. Roughly speaking: I think the current approach with multiple fallbacks in a single class is fine here.

As for why, and as general guidance: firstly, it often comes down to what makes the code easiest to manage, which involves a large amount of personal preference. By personal preference, I mean that of the scraper author (you, in this case :)). So if you want, experiment with the alternative, and if you find that you have a clear preference, we can go with that.

The other consideration would be how much of the structure of the HTML page is shared. Generally I'd say that if most of the page is the same, then re-using a single class probably makes more sense. If there are three completely different page structures, then three different classes might be more likely to make sense.

tests/test_keukenliefdenl_3.py

recipe_scrapers/keukenliefdenl.py

jaapio · 2023-10-03T07:11:13Z

I think this is done for now; please let me know if other changes are required.

jayaddison · 2023-10-03T07:12:40Z

Yep, looks good to me @jaapio - thank you for this contribution. If I had one minor nitpick it would be to add the same language coverage in all three test modules for consistency. That's optional though: I plan to merge this soon (within the next 30 mins or so) either way.

jaapio · 2023-10-03T07:15:58Z

Here you go sir, thanks for this project and your assistance!

As I'm working through my bookmarked recipes you can expect more contributions from my side.

jayaddison · 2023-10-03T07:19:05Z

Excellent, thanks! And you're welcome. Please do, and any feedback on improving the development process here would be gratefully received too.

jaapio force-pushed the feat/keukenliefde branch from bed9a85 to ce778b4 on September 30, 2023 13:10

jayaddison reviewed Sep 30, 2023

recipe_scrapers/keukenliefdenl.py (outdated)

jayaddison reviewed Sep 30, 2023

recipe_scrapers/keukenliefdenl.py (outdated)

Add keukenliefde parser

0beff5b

jaapio force-pushed the feat/keukenliefde branch from 61d93cc to 0beff5b on October 2, 2023 17:47

jayaddison reviewed Oct 2, 2023

recipe_scrapers/keukenliefdenl.py (outdated)

jaapio added 2 commits October 2, 2023 22:46

Add testcase with different format

eaa1fa9

The site handles different formats depending on the age of the recipe, so add a second test case that covers this behavior.

Add support for legacy recipe format

098da0c

The oldest recipes do not have the nice classes to find elements, so we do need text matching on the headers.

jaapio requested a review from jayaddison October 2, 2023 21:57

jayaddison reviewed Oct 3, 2023

recipe_scrapers/keukenliefdenl.py (outdated)

jayaddison reviewed Oct 3, 2023

tests/test_keukenliefdenl_3.py

Raise exceptions on not found instructions & ingredients

f1a4893

jayaddison reviewed Oct 3, 2023

recipe_scrapers/keukenliefdenl.py

Add language test

21d4510

jaapio force-pushed the feat/keukenliefde branch from 71b53f5 to 21d4510 on October 3, 2023 07:14

jayaddison merged commit a2abd42 into hhursev:main Oct 3, 2023
16 checks passed

jaapio deleted the feat/keukenliefde branch October 3, 2023 07:20

strangetom pushed a commit to strangetom/recipe-scrapers that referenced this pull request Nov 12, 2023

Adds support for keukenliefde (hhursev#877)

ca6e991
