Question about your ChFinAnn dataset and BERT model #6
Comments
We created the ChFinAnn dataset by distant supervision; the processing details are in the main paper (Section 4) and the supplementary material (Section A.1). I will refactor the BERT part later.
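For readers unfamiliar with distant supervision, here is a minimal sketch of the general idea only. It is not the pipeline used to build ChFinAnn; Section 4 and Section A.1 of the paper remain the reference for that. It assumes a hypothetical knowledge base of structured event records (role to string) and labels a document with a record when all of that record's key roles appear verbatim in the text. The names `find_mentions`, `distant_label`, `key_roles`, and the role names in the usage note are illustrative assumptions.

```python
from typing import Dict, List, Tuple

# Hypothetical event record: maps an argument role to its surface string.
EventRecord = Dict[str, str]
# A mention span: (sentence_index, start_char, end_char).
Span = Tuple[int, int, int]


def find_mentions(sentences: List[str], value: str) -> List[Span]:
    """Return a span for every exact occurrence of `value` in the document."""
    spans: List[Span] = []
    if not value:
        return spans
    for i, sent in enumerate(sentences):
        start = sent.find(value)
        while start != -1:
            spans.append((i, start, start + len(value)))
            start = sent.find(value, start + 1)
    return spans


def distant_label(
    sentences: List[str],
    records: List[EventRecord],
    key_roles: List[str],
) -> List[dict]:
    """Keep the records whose key roles are all mentioned in the document.

    Each kept record is returned with the spans of every role value,
    which can serve as (noisy) entity and event labels.
    """
    labeled = []
    for record in records:
        mentions = {
            role: find_mentions(sentences, value)
            for role, value in record.items()
        }
        # A record is accepted only if every key role has at least one match.
        if all(mentions.get(role) for role in key_roles):
            labeled.append({"record": record, "mentions": mentions})
    return labeled
```

Calling `distant_label(sentences, records, key_roles=["Pledger", "Shares"])` on a tokenized announcement would keep only the records whose `Pledger` and `Shares` strings both occur in the text. Exact string matching is the simplest possible matcher; a real pipeline would likely need normalization of numbers, dates, and company aliases.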
Thank you. I want to create an annual report dataset of China A-share companies.
Sounds cool!
Thank you for your advice. I work in a finance department, and we have access to Bloomberg, Wind, and Thomson Reuters. We want to use text mining and NLP to process financial news, covering news event identification, risk identification, and quantitative factor analysis based on the historical news data of individual stocks. This would let us make predictions for a given point in the future, which could then feed into a quantitative trading strategy. Kensho in the USA built a product like this; it is a good product, and the company was backed by Goldman Sachs and later acquired by S&P Global. I am very impressed by your work. If possible, could we have a talk?
Hi~
@pjfeng What you have mentioned is a very challenging topic, and many startups have also worked on it in recent years.
@xiaocuigit Fonduer is about extracting inter-entity relations from richly formatted documents, while Doc2EDAG focuses on extracting multiple event records (each with multiple entities) from a text document.
@dolphin-zs Thank you. I have talked to my professor, who works on quantitative trading strategies using NLP. He is very interested in your research. Do you have time to talk about NLP in the finance field?
@dolphin-zs Could you share more details on how to use the DS-based method to generate labeled data? I am currently working on event extraction for news data, but I am stuck on the lack of a data source. I would like to implement the method described in the paper to generate a news-domain dataset.
Hi, have you finished building your dataset? I have recently been wanting to build one with DS as well.
How are your results so far? I am also working on event extraction in the news domain and would like to exchange ideas.
I'm interested in how you created the ChFinAnn dataset, and I would like to know more details about how you did it. Also, when I run the BERT model, it doesn't work either. Thank you.