[lightroom] add Lightroom gallery extractor #2263

Schnouki · 2022-02-02T13:11:22Z

No description provided.

rautamiekka · 2022-02-02T13:20:01Z

gallery_dl/extractor/lightroom.py

+    directory_fmt = ("{category}", "{user}", "{title}")
+    filename_fmt = "{num:>04}_{id}.{extension}"
+    archive_fmt = "{id}"
+    pattern = r"(?:https?://)?lightroom\.adobe\.com/shares/([0-9a-f]+)"


Wouldn't hurt having (?:www\.)? just in case.

The www subdomain doesn't exist, so such an URL wouldn't work anyway.

Hrxn · 2022-02-02T13:29:56Z

Is there any way to browse this site, well, normally?

Schnouki · 2022-02-02T14:42:52Z

@Hrxn Not that I know of. It's for people sharing their Lightroom galleries, not a general-purpose image host like imgur. So, you need to have the URL of a gallery to be able to see it, and there's no search or "explore" feature.

mikf · 2022-02-04T23:13:23Z

gallery_dl/extractor/lightroom.py

@@ -0,0 +1,105 @@
+# -*- coding: utf-8 -*-
+
+# Copyright 2018-2022 Mike Fährmann


I'm not the author or copyright holder.
Put your own name and current year there, or just delete this line.

mikf · 2022-02-04T23:21:06Z

gallery_dl/extractor/lightroom.py

+            response = self.request(url)
+            # skip 1st line as it's a JS loop
+            data_idx = response.text.index("\n") + 1
+            data = json.loads(response.text[data_idx:])


You should never access response.text more than once.
It internally does some heavy computations and doesn't cache its result.

Store it in its own variable and use that.

Suggested change

response = self.request(url)

# skip 1st line as it's a JS loop

data_idx = response.text.index("\n") + 1

data = json.loads(response.text[data_idx:])

page = self.request(url).text

# skip 1st line as it's a JS loop

data = json.loads(page[page.index("\n") + 1:])

mikf · 2022-02-04T23:27:44Z

gallery_dl/extractor/lightroom.py

+            data_idx = response.text.index("\n") + 1
+            data = json.loads(response.text[data_idx:])
+
+            next_url = data.get("links", {}).get("next", {}).get("href", None)


I'd rather use a try-except block than creating dozens of dicts every time.
Move this after the for loop and you can immediately return. Or you set next_url to None.

Suggested change

next_url = data.get("links", {}).get("next", {}).get("href", None)

try:

next_url = data["links"]["next"]["href"]

except KeyError:

next_url = None

mikf · 2022-02-04T23:30:17Z

gallery_dl/extractor/lightroom.py

+
+            next_url = data.get("links", {}).get("next", {}).get("href", None)
+
+            base_url = data["base"]


This overrides the base_url value set in line 73 before that got used even once.
Not sure if that's a problem, just something I noticed.

[lightroom] add Lightroom gallery extractor

d6c570b

Schnouki force-pushed the lightroom branch from 31c82fc to d6c570b Compare February 2, 2022 13:16

rautamiekka reviewed Feb 2, 2022

View reviewed changes

mikf reviewed Feb 4, 2022

View reviewed changes

[lightroom] update

a3721a0

mikf merged commit a7de819 into mikf:master Feb 11, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[lightroom] add Lightroom gallery extractor #2263

[lightroom] add Lightroom gallery extractor #2263

Schnouki commented Feb 2, 2022

rautamiekka Feb 2, 2022

Schnouki Feb 2, 2022

Hrxn commented Feb 2, 2022

Schnouki commented Feb 2, 2022

mikf Feb 4, 2022

mikf Feb 4, 2022

mikf Feb 4, 2022

mikf Feb 4, 2022

		@@ -0,0 +1,105 @@
		# -- coding: utf-8 --

		# Copyright 2018-2022 Mike Fährmann

-            next_url = data.get("links", {}).get("next", {}).get("href", None)
+            try:
+                next_url = data["links"]["next"]["href"]
+            except KeyError:
+                next_url = None


		next_url = data.get("links", {}).get("next", {}).get("href", None)

		base_url = data["base"]

[lightroom] add Lightroom gallery extractor #2263

[lightroom] add Lightroom gallery extractor #2263

Conversation

Schnouki commented Feb 2, 2022

rautamiekka Feb 2, 2022

Choose a reason for hiding this comment

Schnouki Feb 2, 2022

Choose a reason for hiding this comment

Hrxn commented Feb 2, 2022

Schnouki commented Feb 2, 2022

mikf Feb 4, 2022

Choose a reason for hiding this comment

mikf Feb 4, 2022

Choose a reason for hiding this comment

mikf Feb 4, 2022

Choose a reason for hiding this comment

mikf Feb 4, 2022

Choose a reason for hiding this comment