[bounty] speaker identification (next steps) #695

louis030195 · 2024-11-18T21:31:49Z

WIP

also i'm curious how we could associate frames (and thus screen text) with a specific person (speaker), but this is lower priority

/bounty 200

(TBD what exactly needs to be done)

linear · 2024-11-18T21:31:52Z

MED-291 [bounty] speaker identification (next steps)

louis030195 · 2024-11-18T21:34:33Z

kinda thinking about the google photos ui

i think we need something similar for speaker identification over long time ranges, e.g. "listen to this voice, is it john? can you tell me the name?"

i bet they compute face embeddings, group them together, and tune based on user feedback
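A minimal sketch of that grouping idea applied to voices, assuming each audio segment already comes with an embedding vector. The `SpeakerClusters` class, the cosine threshold of 0.8, and the running-mean centroid update are all illustrative assumptions, not how screenpipe or Google Photos actually do it. User feedback is modeled as simply attaching a name to a cluster.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class SpeakerClusters:
    """Hypothetical greedy clustering of voice embeddings (assumption, not screenpipe's API)."""

    def __init__(self, threshold=0.8):
        self.threshold = threshold  # minimum similarity to join an existing cluster
        self.centroids = []         # running mean embedding per cluster
        self.counts = []            # number of embeddings seen per cluster
        self.names = {}             # cluster id -> name confirmed by the user

    def assign(self, emb):
        """Return the id of the closest cluster, creating a new one if none is close enough."""
        best, best_sim = None, self.threshold
        for i, centroid in enumerate(self.centroids):
            sim = cosine(emb, centroid)
            if sim >= best_sim:
                best, best_sim = i, sim
        if best is None:
            self.centroids.append(list(emb))
            self.counts.append(1)
            return len(self.centroids) - 1
        # Fold the new embedding into the cluster's running mean.
        n = self.counts[best]
        self.centroids[best] = [
            (c * n + e) / (n + 1) for c, e in zip(self.centroids[best], emb)
        ]
        self.counts[best] = n + 1
        return best

    def label(self, cluster_id, name):
        """Record user feedback such as 'this voice is john'."""
        self.names[cluster_id] = name

    def name_of(self, cluster_id):
        """Return the confirmed name for a cluster, or None if the user has not named it."""
        return self.names.get(cluster_id)
```

Once the user names one cluster, every later segment assigned to it inherits the name, which is roughly the "is it john?" flow described above.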

louis030195 · 2024-11-18T21:36:05Z

@EzraEllette what do you mean by "Attempt to use LLM for identification through meeting context"

we already do this in meeting page

algora-pbc · 2024-11-18T21:37:09Z

💎 $200 bounty • Screenpi.pe

Steps to solve:

Start working: Comment /attempt #695 with your implementation plan
Submit work: Create a pull request including /claim #695 in the PR body to claim the bounty
Receive payment: 100% of the bounty is received 2-5 days post-reward. Make sure you are eligible for payouts

Thank you for contributing to mediar-ai/screenpipe!


Attempt	Started (GMT+0)	Solution
🟢 @EzraEllette	Nov 18, 2024, 9:46:17 PM	WIP

EzraEllette · 2024-11-18T21:44:27Z

> @EzraEllette what do you mean by "Attempt to use LLM for identification through meeting context"
>
> we already do this in meeting page

When this function is called, it should now be able to update speaker information in the database with user permission.

EzraEllette · 2024-11-18T21:46:14Z

/attempt #695

Algora profile	Completed bounties	Tech	Active attempts	Options
@EzraEllette	11 mediar-ai bounties	Rust, TypeScript, JavaScript & more		Cancel attempt

NicodemPL · 2024-11-19T06:35:01Z

Limitless uses emails/names from calendar events. It might be a good idea to integrate calendar events with meetings at this stage.

Another idea: run OCR on meeting frames to find and confirm names. We already have this data (OCR).
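A rough sketch of the OCR idea, assuming the OCR text for each frame in the meeting window is available as a plain string and that a list of candidate names already exists (for example, from an LLM pass). The function name `confirm_names`, the input shape, and the `min_frames` cutoff are hypothetical and only illustrate the approach.

```python
import re
from collections import Counter


def confirm_names(frame_texts, candidates, min_frames=2):
    """Keep candidate names that appear in at least `min_frames` OCR'd frames.

    frame_texts: list of OCR text strings, one per frame (assumed shape).
    candidates:  list of names to look for (assumed to come from elsewhere).
    """
    hits = Counter()
    for text in frame_texts:
        for name in candidates:
            # Whole-word, case-insensitive match so "Ann" does not match "Annotate".
            if re.search(r"\b" + re.escape(name) + r"\b", text, re.IGNORECASE):
                hits[name] += 1
    return [name for name in candidates if hits[name] >= min_frames]
```

Requiring a name to show up in more than one frame is a cheap guard against one-off OCR noise; the right cutoff would need tuning on real recordings.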

louis030195 · 2024-11-19T17:03:45Z

@EzraEllette fyi i think the build is down because of the sqlite extension

https://github.com/mediar-ai/screenpipe/actions/runs/11904908266/job/33174510422
https://github.com/mediar-ai/screenpipe/actions/runs/11904908266/job/33174511134

louis030195 · 2024-11-19T17:12:38Z

> Limitless uses emails/names from calendar events. It might be a good idea to integrate calendar events with meetings at this stage.
>
> Another idea: run OCR on meeting frames to find and confirm names. We already have this data (OCR).

yes to 2 (the OCR idea)

the screen (and mic) is the universal interface that contains all apps - we do not need any integration if we just use pixels and LLMs well

let's try without calendar first and if it's too hard we can think about it but it involves many things (auth, cloud API, db etc) that are not core to screenpipe

EzraEllette · 2024-11-19T17:30:22Z

> @EzraEllette fyi build is down because of sqlite extension i feel like
>
> https://github.com/mediar-ai/screenpipe/actions/runs/11904908266/job/33174510422
>
> https://github.com/mediar-ai/screenpipe/actions/runs/11904908266/job/33174511134

It looks like knfc. I will debug when I get a chance

louis030195 added the enhancement New feature or request label Nov 18, 2024

louis030195 mentioned this issue Nov 18, 2024 in Speaker Identification #672 (Merged, 1 task)

algora-pbc bot added the 💎 Bounty label Nov 18, 2024
