Skip to content
View lyuwenyu's full-sized avatar
👀
👀
  • Harbin Institute of Technology
  • Beijing, China

Block or report lyuwenyu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
lyuwenyu/README.md

👋 Hi there

I am an AI Researcher at Baidu Inc. which I joined in 2021. My research interest covers a wide range of topics in computer vision and multimodal large language model. My publications have over 1,400 citations (as of Nov. 2024).

My works on visual object detection include RTDETR, RTDETRv2, PP-YOLOE, PP-YOLOE+, PP-YOLOE-SOD, PP-PicoDet and PP-YOLOv2. The best known model RTDETR has been integrated into huggingface/transformers and ultralytics/ultralytics repositories. I also have some works on multimodal large language model including PP-InsCapTagger, PP-InfinityDocData and PP-DocBee(2B) for data analysis, data generation, and document understanding. I am also a contributor of several prestigious communities, including pytorch and PaddlePaddle.

Before joining Baidu Inc., I was a Software Engineer at Microsoft from 2019 to 2021, and a Research Intern at Microsoft Research Asia (MSRA) from 2016 to 2017. I received my M.S. degree from Harbin Institute of Technology in 2018.

🔭 Google scholar

📬 Reach out to me: [email protected]

Pinned Loading

  1. RT-DETR RT-DETR Public

    [CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥

    Python 2.8k 327

  2. PP-InsCapTagger PP-InsCapTagger Public

    Instance Capability Tagger(InsCapTagger) is a multimodal data capability tagging model. 多模态数据能力标签模型,可用于图文数据分析和处理(e.g. 基于信息密度的数据过滤方案、基于模型能力的数据配比方案)。 🔥 🔥 🔥

    8

  3. PaddlePaddle/PaddleDetection PaddlePaddle/PaddleDetection Public

    Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

    Python 12.9k 2.9k

  4. PaddlePaddle/PaddleMIX PaddlePaddle/PaddleMIX Public

    Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high …

    Python 417 164