👋 Hi there
I am an AI Researcher at Baidu Inc. which I joined in 2021. My research interest covers a wide range of topics in computer vision and multimodal large language model. My publications have over
1,400
citations (as of Nov. 2024).My works on visual object detection include
RTDETR, RTDETRv2, PP-YOLOE, PP-YOLOE+, PP-YOLOE-SOD, PP-PicoDet and PP-YOLOv2
. The best known model RTDETR has been integrated intohuggingface/transformers
andultralytics/ultralytics
repositories. I also have some works on multimodal large language model includingPP-InsCapTagger, PP-InfinityDocData and PP-DocBee(2B)
for data analysis, data generation, and document understanding. I am also a contributor of several prestigious communities, includingpytorch and PaddlePaddle
.Before joining Baidu Inc., I was a Software Engineer at Microsoft from 2019 to 2021, and a Research Intern at Microsoft Research Asia (MSRA) from 2016 to 2017. I received my M.S. degree from Harbin Institute of Technology in 2018.
📬 Reach out to me: lyuwenyu@foxmail.com