
Sanitize names #1

Open
jsejcksn opened this issue May 15, 2020 · 1 comment


@jsejcksn

The names of recently added emoji include an indicator symbol (⊛):

Example:

"🪡": "⊛ sewing needle",
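One way the names could be cleaned after scraping is sketched below. This is only an illustration, not the project's actual code: `sanitizeName` and `sanitizeDictionary` are hypothetical helpers, and the sketch assumes the marker always appears at the start of the name, as in the example above.

```typescript
// Hypothetical helpers; not part of emoji-short-name.
// Assumption: the marker is a leading "⊛" (U+229B) followed by whitespace.
const NEW_EMOJI_MARKER = /^\s*\u229B\s*/u;

// Strip the leading marker (if present) from a single name.
function sanitizeName(name: string): string {
  return name.replace(NEW_EMOJI_MARKER, "").trim();
}

// Apply the same cleanup to every value of an emoji-to-name dictionary.
function sanitizeDictionary(dict: Record<string, string>): Record<string, string> {
  const clean: Record<string, string> = {};
  for (const key of Object.keys(dict)) {
    clean[key] = sanitizeName(dict[key]);
  }
  return clean;
}

const sample: Record<string, string> = {
  "🪡": "⊛ sewing needle",
  "😀": "grinning face",
};
console.log(sanitizeDictionary(sample));
```

Names without the marker pass through unchanged, so the cleanup should be safe to run over the whole dictionary.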

It looks like you're scraping these. Unicode offers a "machine-readable" text format, if you're interested. You don't seem to be scraping keywords or other data that is only available in the HTML pages, so it might be a better fit for this project.
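As a rough sketch of what reading such a file could look like: the code below assumes the machine-readable format meant here is Unicode's `emoji-test.txt`, and that each data line has the layout `<code points> ; <status> # <emoji> E<version> <name>`. Both are assumptions, and `parseLine` and `EmojiEntry` are hypothetical names chosen for illustration.

```typescript
// Hypothetical parser for one line of a Unicode emoji data file.
// Assumed layout: "<code points> ; <status> # <emoji> E<version> <name>"
interface EmojiEntry {
  emoji: string;
  status: string;
  name: string;
}

const DATA_LINE = /^([0-9A-F ]+?)\s*;\s*([a-z-]+)\s*#\s*(\S+)\s+E\d+\.\d+\s+(.+)$/u;

// Returns null for comment lines, blank lines, and anything else
// that does not match the assumed layout.
function parseLine(line: string): EmojiEntry | null {
  const m = DATA_LINE.exec(line.trim());
  if (m === null) return null;
  return { status: m[2], emoji: m[3], name: m[4].trim() };
}

// Sample line written to match the assumed layout.
const entry = parseLine("1FAA1 ; fully-qualified # 🪡 E13.0 sewing needle");
console.log(entry);
```

Since the name in this layout is plain text after the version tag, there would be no ⊛ marker to strip in the first place.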

If you are interested in adding keywords to the dictionary, this page might interest you.


Off-topic, but mildly related: if you prefer Electron to Puppeteer for scraping, care to share why? I've never seen anyone use Electron for that purpose, and I'm interested.

@rdela

rdela commented Aug 5, 2024

Yes, as @jsejcksn wrote, the ⊛ character is not part of the name: “Recently-added emoji are marked by a ⊛ in the name” (Full Emoji List, v15.1).

@WebReflection, are you interested in implementing this change, or open to a PR? I am using https://github.com/WebReflection/emoji-short-name via the accessible-emoji component in Tugboat. The ⊛ in the names is a big drag here and defeats the whole purpose of trying to increase access. Cc @zachleat

UPDATE: I added a workaround in webcbed #5. (EDIT: and webcbed #6)
Current code

UPDATE 2: It seems Mac speech ignores ⊛, but I am sure some devices read “CIRCLED ASTERISK OPERATOR” out loud.
