Request to Include Named Entity Recognition and Relation Extraction Model Finetuning Examples and Guidance Request #2119

HarikrishnanK9 · 2024-10-01T05:50:24Z

Feature request

As an NLP enthusiast working on Named Entity Recognition (NER) and Relation Extraction (RE) tasks, I would like to request the inclusion of NER and RE-related examples, best practices, and guidance for fine-tuning models specifically for these tasks on your GitHub page.
The documentation on fine-tuning models for NER and RE would help guide researchers in developing state-of-the-art models without having to reinvent the wheel.Currently Finetuning examples for NER and RE tasks are not available there.These techniques have relevance in identifying technical terms,chemical names,etc from texts and recognize the relationship between identified domain specified technical terms.

Motivation

ner models are critical in various technologies to identify particular domain related words from an input text,Especially if we are dealing with business terms,chemical names,particle names,and other domain specific technical terms.2nd importance is identify the relation between identified entities.Eg: sentence:Apple acquired Beats for $3 billion in 2014;Entities:Apple, Beats, $3 billion,2014;Relation:acquired

Pre-requisite

First there must be an appropriate dataset for finetuning, including 1)sentences,2)entities corresponding to that sentence in next column and 3)finally relation between identified entities.Dataset must include diverse domain specific datas like finance,medical,chemical,news,politics,etc then only a general kind of model can identify entities generally.If the purpose is domain specific then particulr domain specific dataset is a must.

BenjaminBossan · 2024-10-01T08:54:09Z

I agree that having such examples could be useful for many users. If you have an existing example that uses full fine-tuning that we can take as a starting point, that would be very helpful. I also added a call for contribution tag so that hopefully someone from the community with expertise on this topic can step in.

JINO-ROHIT · 2024-10-02T11:28:56Z

Hi guys, Im happy to take this up, but do we have any example guidelines? Like should we open a seperate folder and should it be a jupyter notebook or py scripts?

BenjaminBossan · 2024-10-02T12:24:08Z

Thanks a lot for offering to work on this @JINO-ROHIT.

There is no specific template we expect from examples. If you check the examples directory, you'll see it's a bit chaotic. I would suggest to add either a script or notebook (whatever you like most) to the examples/token_classification directory (as NER would be considered "token classification").

As a dataset, I know that conll is used for NER but maybe there is a better dataset by now or @HarikrishnanK9 has something else in mind.

As to the PEFT method, probably LoRA would be a good start.

JINO-ROHIT · 2024-10-02T13:41:22Z

gotcha, im on it

HarikrishnanK9 · 2024-10-09T04:42:14Z

@JINO-ROHIT @BenjaminBossan Apologies for the delayed response. Over the past few days, I’ve been exploring NER and Relation Extraction (RE), fine-tuning, along with codes and datasets on platforms like Kaggle and various GitHub repositories.Unfortunately, I wasn’t able to complete it as planned ,Anyway I’m happy to hear that @JINO-ROHIT is also ready to take over the task. If I succeed with any of the codes, I will share it with you immediately.

JINO-ROHIT · 2024-10-09T07:17:39Z

Hey @HarikrishnanK9 have a look at the PR and let us know if this is something along the lines you were looking for.

BenjaminBossan added good first issue Good for newcomers contributions-welcome labels Oct 1, 2024

BenjaminBossan mentioned this issue Oct 7, 2024

adding peft lora example notebook for ner #2126

Merged

HarikrishnanK9 closed this as completed Oct 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Request to Include Named Entity Recognition and Relation Extraction Model Finetuning Examples and Guidance Request #2119

Request to Include Named Entity Recognition and Relation Extraction Model Finetuning Examples and Guidance Request #2119

HarikrishnanK9 commented Oct 1, 2024 •

edited

Loading

BenjaminBossan commented Oct 1, 2024

JINO-ROHIT commented Oct 2, 2024

BenjaminBossan commented Oct 2, 2024

JINO-ROHIT commented Oct 2, 2024

HarikrishnanK9 commented Oct 9, 2024

JINO-ROHIT commented Oct 9, 2024

Request to Include Named Entity Recognition and Relation Extraction Model Finetuning Examples and Guidance Request #2119

Request to Include Named Entity Recognition and Relation Extraction Model Finetuning Examples and Guidance Request #2119

Comments

HarikrishnanK9 commented Oct 1, 2024 • edited Loading

Feature request

Motivation

Pre-requisite

BenjaminBossan commented Oct 1, 2024

JINO-ROHIT commented Oct 2, 2024

BenjaminBossan commented Oct 2, 2024

JINO-ROHIT commented Oct 2, 2024

HarikrishnanK9 commented Oct 9, 2024

JINO-ROHIT commented Oct 9, 2024

HarikrishnanK9 commented Oct 1, 2024 •

edited

Loading