Yifeng He
We must know, we shall know.

About Me

Greetings! My name is 贺一峰 (He, Yifeng). I am a Ph.D. student at UC Davis, supervised by Prof. Hao Chen. Previously, I obtained my Bachelor of Science degree (with Honors) from UC Davis, with a double major in Applied Math and Computer Science. During undergrad, I was the president of HackerHub (a UC Davis club). My current research interests are artificial intelligence (AI) and software engineering (SE). My research topics include AI for SE, SE for AI, and SE for (AI for SE). For AI, my main research topic is (large) language models. For SE, my main research topics are security, software testing/verification/analysis, and (safe/secured/functional) programming languages.

I like video games. My favorite games are Pokemon (recently: Sword :(, Violet, and Arceus :)), The Witcher 3, Clash of Clans (World top #169 in April 2018!), and Genshin Impact (haven’t played much since 4.4, I blame miHoYo for this).

I also enjoy powerlifting. I have trained for a little over two years. My (training) personal records are Squat 405lb (184kg), Bench 230lb (104kg), and Deadlift 455lb (206kg), summing up to 1090lb (494kg) @ 85kg body weight.

Curriculum vitae pdf

Education

University of California, Davis

  • Ph.D., Computer Science, 2023 – 2028 (expected)
  • B.S. with Honors, Computer Science and Applied Mathematics, 2019 – 2023

Publications

Conference Papers

He, Y., Huang, J., Rong, Y., Guo, Y., Wang, E., Chen, H. UniTSyn: A Large-Scale Dataset Capable of Enhancing the Prowess of Large Language Models for Program Testing, International Symposium on Software Testing and Analysis (ISSTA), 2024. https://arxiv.org/abs/2402.03396

Zhao, J., Rong, Y., Guo, Y., He, Y., Chen, H. Understanding Programs by Exploiting (Fuzzing) Test Cases, Findings of the Association for Computational Linguistics (ACL), 2023. https://arxiv.org/abs/2305.13592

He, Y. Big Data and Deep Learning Techniques Applied in Intelligent Recommender Systems, 2022 IEEE 4th International Conference on Civil Aviation Safety and Information Technology (ICCASIT), Dali, China, 2022, pp. 1119-1124, doi: 10.1109/ICCASIT55263.2022.9986837.

Preprints

He, Y., Wang, E., Rong, Y., Cheng, Z., Chen, H. Security of AI Agents, Symposium on Security and Privacy (S&P), 2025. Under Review. https://arxiv.org/abs/2406.08689

He, Y., Wang, J., Rong, Y., Chen, H. FuzzAug: Exploring Fuzzing as Data Augmentation for Neural Test Generation, International Conference on Automated Software Engineering (ASE), 2024. Under Review. https://arxiv.org/abs/2406.08665

Zhang, H., Rong, Y., He, Y., Chen, H. LLAMAFUZZ: Large Language Model Enhanced Greybox Fuzzing, International Conference on Automated Software Engineering (ASE), 2024. Under Review. https://arxiv.org/abs/2406.07714

Huang, J., Zhao, J., Rong, Y., Guo, Y., He, Y., Chen, H. Code Representation Pre-training with Complements from Program Executions, Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024. Under Review. https://arxiv.org/abs/2309.09980

Honors and Awards

Citation for Outstanding Performance, Dept. of Mathematics, UC Davis

Dean’s Honor List, Fall 2019, Spring 2020, Spring 2021, Spring 2022, College of L&S, UC Davis

Teaching

  • ECS 36C Data Structures
    • TA, Spring 2023
    • TA, Fall 2023

Internship

ByteDance, Beijing, China 04/2021 – 08/2021

Software Engineering Intern in Income Platform Team

  • Used microservices to connect parts of the author income settlement business
  • Migrated the author-relation data architecture from a relational database (SQL) to a graph database (Gremlin) to improve efficiency for the business model
  • Refactored the income calculation control process with the visitor design pattern in Python 3
  • Created a data cleaner script with ORM to maintain the size and readability of the online data settlement table
  • Created the offline flow of the Medium Video Encouragement Project for weekly data calculation
  • Built the interface for the web and mobile app frontends to display data visualizations

Projects

CourseReco 06/2022 – 09/2022

  • Designed the overall system architecture
  • Led the programming for API server and recommender engine
  • Negotiated with the third-party provider, SchedGo, for data service
  • Provided technical leadership to teammates

Music Genre Classifier 05/2022 – 06/2022

  • Processed music samples into spectrograms using the short-time Fourier transform
  • Designed a CNN model to classify spectrograms into genre categories
  • Analyzed the resulting model and test outputs with saliency maps

ImageOrientation 03/2022 – 04/2022

  • Pre-processed image data by rotating images by randomly generated angles and assigning these angles as labels
  • Designed a CNN for the regression task, then tested and improved the model
  • Applied hyperparameter tuning based on training, validation, and test results

Dcash-server 05/2021 – 07/2021

  • Created a multi-threaded API server in C++ that lets users create accounts and make deposits and transfers
  • Used MySQL to store and maintain user data
  • Made API calls to the Stripe API server to handle credit card information

Genshin Impact Gacha Analyzer 08/2021 – 09/2021

  • Designed the process for fetching gacha data from miHoYo and categorized the results
  • Stored data in a local database automatically, with optional export to Excel for data analysis
  • Generated text or graph visualization reports from the data analysis results

Activities

HackerHub Club, UC Davis 07/2020 – 06/2023

Co-founder, President, Technical Officer

  • Designed and maintained a course recommendation system, CourseReco, for UC Davis students
  • Organized and led the Code Jam Competition on data visualization, AI, augmented reality and virtual reality, and machine learning
  • Coached introductory programming workshops on topics including assembly, functional programming, recommender systems, and generative adversarial networks