Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Many posts inaccurately tagged as langs: ['no'] #7691

Open
oaustegard opened this issue Feb 7, 2025 · 3 comments
Open

Many posts inaccurately tagged as langs: ['no'] #7691

oaustegard opened this issue Feb 7, 2025 · 3 comments
Labels
bug Something isn't working

Comments

@oaustegard
Copy link

Steps to Reproduce

  1. Go to https://bsky.app/search?q=https and switch language to Norwegian -- or simply https://public.api.bsky.app/xrpc/app.bsky.feed.searchPosts?q=https*&limit=10&sort=top&lang=no
  2. Observe that many posts are in fact by Japanese accounts in Japanese, but the posts are tagged with langs: ['no']

Attachments

Image

What platform(s) does this occur on?

Web (Desktop)

Device Info

NA -- also on Web Mobile

What version of the app are you using?

Current api as of 07 Feb 25

Additional Information

Speculation: since this appears to be much more prevalent for Norwegian than say Swedish or Danish -- could it be the 'no' being a falsey value in some languages could affect things?

@oaustegard oaustegard added the bug Something isn't working label Feb 7, 2025
@MichaScant
Copy link

MichaScant commented Feb 18, 2025

Hello, having looked at this issue this looks like a user error.

I wasn't able to reproduce this by creating a post solely in Japanese, but I was able to set multiple tags for a post which is what the users were doing.

there are other cases where a similar case is true, for instance, when filtering by Danish, i get the following english post, tagged with the da tag, despite being only written in English: https://bsky.app/profile/jacobchr.bsky.social/post/3licl3lvidb2w

@oaustegard
Copy link
Author

It would be expected for Europeans to frequently reference and link to English, French and German medium sites, far less so Japanese...

Agree that there is a likely user error at the poster level, so maybe the problem is less the feeds and more the UX of the app?

@Tamschi
Copy link

Tamschi commented Feb 19, 2025

They're likely using TOKIMEKI, a Japanese multi-column client, which by default intransparently (and badly) tries to auto-detect post language when creating a post. It usually attaches the top two or three matched languages.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants