feat: lemmy #56

Teqed · 2023-06-29T04:23:34Z

Parses /comment/ and /post/ URLs for comment IDs to use with getComment to obtain the parent post_id and then uses getComments to find all related comment URLs under ap_id.

nanos · 2023-06-29T05:53:02Z

Thanks for this @Teqed

Really great work, and really appreciate it!

Can't wait for this to be ready to merge. Let me know if you need help with anything!

Teqed · 2023-06-29T06:22:34Z

@nanos Thank you for writing FediFetcher! 👍

Working: Context of posts seen in the timeline.
In progress: Backfilling user profiles -- returned error is Extra data: line 1 column 4 (char 3), I will have to pick back up here later.

get_all_known_context_urls was returning None for the URL until I slightly refactored it b7ef2be (#56). I am still not sure what was happening here.

nanos · 2023-06-29T07:55:20Z

get_all_known_context_urls was returning None for the URL until I slightly refactored it b7ef2be (#56). I am still not sure what was happening here.

yeah, I must admit that I never truly understood that part 😆 it's something I just inherited from the original author, and never bothered to simplify / rewrite.

Teqed · 2023-06-30T06:23:37Z

Included are a few commits which help prevent FediFetcher from exiting ungracefully when encountering issues with unexpected types, missing properties, or unusual URLs. Not a comprehensive pass for robustness but a few spots that were helpful while writing this feature.

For future federation features, it should be noted that Pixelfed profiles don't use a subdirectory in their path, ex. https://pixelfed.social/dansup instead of something like https://pixelfed.social/u/dansup. The way the current regex is matching makes it likely to match against any currently-unmatched subdirectories instead of the user's actual name. A quick fix is to make sure Pixelfed profile matches are attempted last, though I'm sure a more sophisticated regex is possible. I've left a cautionary comment for the time being.

For federation with Kbin instances, there is the minor issue of similar profile URLs to Lemmy (ex. https://kbin.social/u/admin) that would have to be parsed separately somehow. However, reading their API documentation does not reveal to me any way to fetch comments by username. You can search for posts by magazine but AFAIK user profiles are not available as magazines. However, this may change, as they've said:

This is a very early beta version, and a lot of features are currently broken or in active development, such as federation.

Finally, included in these commits are the final pieces needed to backfill user profiles, followed communities, and "posts" from Lemmy. Testing has been done via GitHub action against my Mastodon v4.1.2+glitch instance which has content from relevant instances.

nanos · 2023-06-30T09:24:32Z

Thanks for your hard word @Teqed !

This is a fairly large PR, so I'm going to go through that with a bit of a fine toothed comb over the weekend, but it does look solid on firsts glance.

A quick fix is to make sure Pixelfed profile matches are attempted last

Personally, I think relying on a specific sequence is totally acceptable here.

Though I did think about using the /.well-known/nodeinfo endpoint to determine server software, but I'm not sure how widely implemented that is outside of mastodon either.

Teqed · 2023-06-30T19:02:17Z

Though I did think about using the /.well-known/nodeinfo endpoint to determine server software, but I'm not sure how widely implemented that is outside of mastodon either.

This is a good idea and you inspired me to do some quick research:

Making a request at https://{server}}/.well-known/nodeinfo and then accessing ["links"][0]["href"] on the JSON gets you:
https://{mastodon}/nodeinfo/2.0
https://{lemmy}/nodeinfo/2.0.json
https://{kbin}/nodeinfo/2.0
https://{pixelfed}/api/nodeinfo/2.0.json <-- Note: /api/ subdirectory
https://{pleroma}/nodeinfo/2.0.json
https://{peertube}/nodeinfo/2.0.json

A request for that JSON (2.0 schema here) gets you ["software"]["name"] containing the name of the service. Working with all six of the services listed above.

This gives me some more ideas on how to go about choosing API endpoints based on NodeInfo instead of URL parsing. I imagine that if you went this route, it'd be preferable to keep a cache of already-identified APIs so that you don't repeatedly make the same request in the same run. I may submit another PR if I find this worthwhile to do.

nanos

Solid work. thanks so much!

feat: lemmy

c1f0e8a

Teqed marked this pull request as draft June 29, 2023 04:25

nanos mentioned this pull request Jun 29, 2023

Enhancement: add the ability to parse messages from Peertube, Kbin, and Lemmy #55

Closed

chore: refactor get_all_known_context_urls

b7ef2be

Teqed force-pushed the feat/lemmy branch 2 times, most recently from 989ab44 to b7ef2be Compare June 29, 2023 07:14

Teqed and others added 8 commits June 30, 2023 00:11

chore: use getters

e290f2c

feat: lemmy-2

4011883

chore: check none type

4751d96

chore: deliminate regex with forward slash

b04664f

feat: lemmy communities and users

d212e7a

chore: access context items safely

8edfbc0

fix: match pixelfed profile last

0472fe6

feat: fetch root lemmy post

6f7392c

Teqed force-pushed the feat/lemmy branch from 784b31c to 6f7392c Compare June 30, 2023 05:40

Teqed marked this pull request as ready for review June 30, 2023 06:24

nanos approved these changes Jul 1, 2023

View reviewed changes

nanos merged commit f7d0150 into nanos:main Jul 1, 2023

Teqed mentioned this pull request Jul 2, 2023

Enhancement: Add CalcKey Instance Support #60

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: lemmy #56

feat: lemmy #56

Teqed commented Jun 29, 2023 •

edited

Loading

nanos commented Jun 29, 2023

Teqed commented Jun 29, 2023

nanos commented Jun 29, 2023

Teqed commented Jun 30, 2023

nanos commented Jun 30, 2023 •

edited

Loading

Teqed commented Jun 30, 2023

nanos left a comment

feat: lemmy #56

feat: lemmy #56

Conversation

Teqed commented Jun 29, 2023 • edited Loading

nanos commented Jun 29, 2023

Teqed commented Jun 29, 2023

nanos commented Jun 29, 2023

Teqed commented Jun 30, 2023

nanos commented Jun 30, 2023 • edited Loading

Teqed commented Jun 30, 2023

nanos left a comment

Choose a reason for hiding this comment

Teqed commented Jun 29, 2023 •

edited

Loading

nanos commented Jun 30, 2023 •

edited

Loading