Skip to content
View S1s-Z's full-sized avatar
  • Tsinghua University
  • Beijing
  • 10:56 (UTC +08:00)

Organizations

@pkunlp-icler

Block or report S1s-Z

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
S1s-Z/README.md

Hi πŸ§‘πŸ»β€πŸ’»πŸ‘‹πŸ»

I am Shuzheng Si (司书正 in Chinese ✍🏻), currently a first-year CS Ph.D. student at Tsinghua University. I am lucky to be advised by Prof. Maosong Sun and affiliated with TsinghuaNLP Lab. Previously, I completed my master’s degree at Peking University and I was very fortunate to be under the supervision of Prof. Baobao Chang at the Institute of Computational Linguistics. I spent my sweet undergraduate days at the School of Software (rank: 1/307), Yunnan University, which is a very beautiful university πŸ‚.

Now, my research interests lie in Natural Language Processing and Large Language Models, specifically focusing on Data-centric Methods, including Data Selection, Data Synthesis, and Learning from Noisy Data, etc. My long-term research goal is to open the black box of data influence in LLMs and to improve the performance of LLMs using (organized, selected, or synthesized) high-quality data. Find my up-to-date publication list in πŸ”— Google Scholar.

Feel free to drop an email if you are interested in connecting πŸ§‘πŸ»β€πŸ€β€πŸ§‘πŸ».

Pinned Loading

  1. HaozheZhao/MIC HaozheZhao/MIC Public

    MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU

    Python 337 15

  2. SCL-RAI SCL-RAI Public

    [COLING'22] Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER"

    Python 47 3

  3. SANTA SANTA Public

    [ACL'23] Code for "SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised Named Entity Recognition"

    Python 43 1

  4. GATEAU GATEAU Public

    Code for "GATEAU: Selecting Influential Sample for Long Context Alignment"

    Python 45

  5. QUEEN QUEEN Public

    [NAACL'22] Repo for "Mining Clues from Incomplete Utterance: A Query-enhanced Network for Incomplete Utterance Rewriting"

    Jupyter Notebook 10 2