avatar
Yifeng He
We must know, we shall know.

About Me

Greetings! My name is 贺一峰 (He, Yifeng). I am a second year Ph.D. student at UC Davis. My advisor is Professor Hao Chen. Previously, I obtained my Bachelor of Science degree (with Honors) from UC Davis, double majored in Applied Math and Computer Science. During undergrad, I was the president of HackerHub(UCDavis Club). My current study interests are artificial intelligence (AI) and software engineering (SE). My research topics include AI for SE, SE for AI, SE for (AI for SE). For AI, my main research topic is (large) language model. For SE, my main research topics are security, software testing/verification/analysis, and (safe/secured/functional) programming languages.

I like video games. My favorite games are Pokemon (recently: Sword :(, Violet, and Arceus:)), The Witcher 3, Clash of Clans (World top #169 in April 2018!), and Genshin Impact (haven’t play much since 4.4, I blame miHoYo for this). I also like cozy-style role-playing games like Whimel Academy to reset my mind after research, experiments, and paper writing..

I also enjoy powerlifting. I have trained for a little over two years. My (training) personal records are Squat 405lb (184kg), Bench Press 245lb (111kg), Deadlift 455lb (206kg) summing up to 1105lb (501kg) @ 85kg body weight.

Curriculum vitae pdf

Education

Ph.D. in Computer Science

University of California, Davis (2023 – present)

B.S. in Computer Science and Applied Mathematics

University of California, Davis (2019 – 2023)

Publications

Peer-reviewed Papers

Yifeng He, Jiabo Huang, Yuyang Rong, Yiwen Guo, Ethan Wang, Hao Chen. UniTSyn: A Large-Scale Dataset Capable of Enhancing the Prowess of Large Language Models for Program Testing, International Symposium on Software Testing and Analysis (ISSTA), 2024. doi, pdf, code, slides, poster.

Jiabo Huang, Jianyu Zhao, Yuyang Rong, Yiwen Guo, Yifeng He, Hao Chen. Code Representation Pre-training with Complements from Program Executions. Empirical Methods in Natural Language Processing: Industry Track (EMNLP), 2024. doi, pdf, poster.

Jianyu Zhao, Yuyang Rong, Yiwen Guo, Yifeng He, Hao Chen. Understanding Programs by Exploiting (Fuzzing) Test Cases, Findings of Association for Computational Linguistics (ACL), 2023. doi, pdf, code.

Yifeng He, Big Data and Deep Learning Techniques Applied in Intelligent Recommender Systems, International Conference on Civil Aviation Safety and Information Technology (ICCASIT), 2022. doi.

Preprints

Hongxiang Zhang, Yifeng He, Hao Chen. SteerDiff: Steering towards Safe Text-to-Image Diffusion Models. https://arxiv.org/abs/2410.02710

Jicheng Wang, Yifeng He, Hao Chen. RepoGenReflex: Enhancing Repository-Level Code Completion with Verbal Reinforcement and Retrieval-Augmented Generation. https://arxiv.org/abs/2409.13122

Yifeng He, Ethan Wang, Yuyang Rong, Zifei Cheng, Hao Chen. Security of AI Agents. https://arxiv.org/abs/2406.08689

Yifeng He, Jicheng Wang, Yuyang Rong, Hao Chen. FuzzAug: Data Augmentation by Fuzzing for Neural Test Generation. https://arxiv.org/abs/2406.08665

Hongxiang Zhang, Yuyang Rong, Yifeng He, Hao Chen. LLAMAFUZZ: Large Language Model Enhanced Greybox Fuzzing. https://arxiv.org/abs/2406.07714

Honors and Awards

Citation For Outstanding Performance, Dept. Mathematics, UC Davis

Dean’s Honor List, Fall 2019, Spring 2020, Spring 2021, Spring 2022, College of L&S, UC Davis

Teaching

  • ECS 36C Data Structure
    • TA, Spring 2023
    • TA, Fall 2023

Internship

ByteDance, Beijing, China 04/2021 - 08/2021

Software Engineering Intern in Income Platform Team

  • Used microservice tech to connect parts of the author income settlement bushiness
  • Transformed author-relation data architecture design from relational database (SQL) to graph database (Gremlin) to allow better efficiency for the business model
  • Refactored the income calculation control process with visitor design pattern using Python 3
  • Created a data cleaner script with ORM to maintain the size and readability of online data settlement table
  • Created the offline flow of Medium Video Encouragement Project for weekly data calculation
  • Built the interface for frontend of web and mobile app to display the data visualization

Projects

CourseReco 06/2022 - 09/2022

  • Designed the overall system architecture
  • Led the programming for API server and recommender engine
  • Negotiated with the third-party provider, SchedGo, for data service
  • Provided technical leadership to teammates

Music Genre Classifier 05/2022 - 06/2022

  • Processed music samples into spectrogram by Short-time Fourier transform
  • Designed the appropriate model (CNN) to classify spectrograms into category
  • Analyzed the resulting model and test outputs with saliency maps

ImageOrientation 03/2022 - 04/2022

  • Pre-processed image data by rotating them with random generated angles, and assigned these angles as label
  • Designed the appropriate CNN for regression task, tested and improved the model
  • Applied Hyper-parameter tuning based on train, validation, and tested results

Dcash-server 05/2021 - 07/2021

  • Created a multi-threaded API server using C++ to allow users to create accounts to make deposit and transfer
  • Used MySQL to store and maintain user data
  • Made API calls to the Stripe API server to handle credit card information

Genshine Impact Gacha Analyzer 08/2021 - 09/2021

  • Designed fetching process of gacha data from MiHoYo and categorized the process
  • Stored data into local database automatically, wrote into excel for data analysis by option
  • Generated text or graph visualization report from data analyze results

Activities

HackerHub Club, UC Davis 07/2020 - 06/2023

Co-founder, President, Technical Officer

  • Design and maintain a course recommendation system, CourseReco, for UC Davis students
  • Organize and lead the Code Jam Competition on data visualization, AI, augmented reality and virtual reality, and machine learning
  • Coach in introductory programming workshops, including Assembly, functional programming, recommender system, generative adversarial network, etc.