Truong-Phuc Nguyen

Truong-Phuc Nguyen

To stand where others cannot, you must endure what others will not.

About Me

Hello! I'm Truong-Phuc Nguyen, a CS student specializing in ML, DL, and NLP. My expertise spans across NLP tasks, such as: Information Retrieval, Question Answering, Text Generation, Summarization for Vietnamese text processing using both PLMs and LLMs.

Through research journey, I have successfully built some NLP demo systems demonstrating feasibility, including legal question-answering systems, clinical report summarization, and question generation tools for education. Currently, I am seeking advanced learning opportunities related to CS/NLP where I can leverage my knowledge of language modeling, text processing algorithms, and research experience.

Education

Hung Yen University of Technology and Education

Bachelor of Engineering in Computer Science (Gifted and Talented Programs)

September 2021 - June 2025

9.04
#1 in 10 scale
3.75
#2 in 4 scale
9.9
thesis score

Graduate Thesis: A Study of Vietnamese Legal Question Answering with Pre-trained and Large Language Models
Score: 9.9/10 (Awarded Excellence Graduate Thesis Presentation)

Publications

Journal Articles

[1] A fine-tuning framework based on question, context, and answer relationships for enhancing legal information retrieval

Authors: Nhu Hai Phung, Chi Thanh Nguyen, Minh-Tien Nguyen, Thu Ha Nguyen, Huu Loi Le, and Truong-Phuc Nguyen
Published in: Engineering Applications of Artificial Intelligence, Volume 159, Page 111570. Elsevier, 2025
Impact: WoS-SCIE, Q1, IF: 8.0

[2] Towards Vietnamese Legal Question Answering: An Empirical Study

Authors: Truong-Phuc Nguyen, Van-Quyet Nguyen, and Minh-Tien Nguyen
Status: Submitted to Artificial Intelligence and Law, Springer, 2025

[3] Application of Machine Learning in Image Recognition to Detect Some Abnormalities in the Examination Rooms

Authors: Tien-Dat Nguyen, Truong-Phuc Nguyen, and Pham Minh Chuan
Published in: UTEHY Journal of Applied Science and Technology (University Journal), Vol. 40, 2023, pp. 27-32

Conference Papers

[1] ViLegalLM: Language Models for Vietnamese Legal Text

Authors: Truong-Phuc Nguyen, Quy-Nhan Nguyen, and Minh-Tien Nguyen
Status: Submitted to ACL ARR 2026 January Cycle.

[2] UTEHY-NLU@ALQAC 2025: Dynamic Weighted Ensemble and Adaptive Reasoning for Vietnamese Legal Text Processing

Authors: Truong-Phuc Nguyen, Quy-Nhan Nguyen, Manh-Cuong Phan, Chi-Hai Cao, Trinh-Hoai-An Duong, and Minh-Tien Nguyen
Published in: Proceedings of 2025 17th International Conference on Knowledge and System Engineering (KSE 2025), Da Lat, Vietnam, 2025, pp. 1-5

[3] ViEduQA: A New Vietnamese Dataset for Question Answer Generation in Education

Authors: Truong-Phuc Nguyen, Huu-Loi Le, Pham Quoc-Hung, Nong Quang Huy, Xuan-Hieu Phan, and Minh-Tien Nguyen
Published in: Information and Communication Technology. SOICT 2024. CCIS, vol. 2352, pp. 441-455. Springer, Singapore, 2025

[4] Vietnamese Legal Question Answering: An Experimental Study

Authors: Thu-Ha Nguyen, Truong-Phuc Nguyen, Khang T. Trung, Huu-Loi Le, Le Thi Viet Huong, Chi Thanh Nguyen, and Minh-Tien Nguyen
Published in: Proceedings of 2024 16th International Conference on Knowledge and System Engineering (KSE 2024), Kuala Lumpur, Malaysia, 2024, pp. 440-446.

Experience

Jan 2026 - present

Emotion Recognition in Conversation (ERC)

NLU Laboratory, Hung Yen University of Technology and Education, Vietnam

(to be updated ...)

Advisor: Assoc. Prof. Minh-Tien Nguyen

August 2025 - January 2026

MedMAS: Multi-agent System for Pre-intake Clinical Note Generation in Conversation

BioInfomatic Laboratory, Feng Chia University, Taiwan

Building a multi-agent system that extracts patient information through conversations, generates targeted follow-up questions to gather comprehensive patient data, and creates detailed pre-visit clinical reports. This streamlines the examination process, saving physician time and improving patient experience during medical consultations. The system is evaluated across three core tasks: Named Entity Recognition (NER), Question Generation (QG), and Summarization, achieving state-of-the-art results on both MTS-Dialog and CliniKnote benchmark datasets, demonstrating the superiority of multi-agent architectures over conventional approaches such as in-context learning and instruction-tuning in the medical domain.

Advisor: Prof. Fang-Rong Hsu

September 2024 - January 2026

ViLegalLM: Language Models for Vietnamese Legal Text

NLU Laboratory, Hung Yen University of Technology and Education

ViLegalLM comprises one representation model (135M) and two generation models (1.54B, 1.72B) specifically for Vietnamese legal text through continual pretraining on newly 16GB of high-quality legal documents. ViLegalLM achieves state-of-the-art performance across 10 benchmarks spanning four main tasks: Information Retrieval (IR), Question Answering (QA), Natural Language Inference (NLI), and Syllogism Reasoning, outperforming 7 state-of-the-art Vietnamese models and establishing new strong baselines for Vietnamese legal text processing. The project also contributes three large-scale synthetic training datasets to address the shortage of high-quality legal training data in Vietnam.

Advisor: Assoc. Prof. Minh-Tien Nguyen

August 2025 - September 2025

Adaptive Weighted Ensemble for Legal Text Processing

NLU Laboratory, Hung Yen University of Technology and Education, Vietnam

Designed a framework combining multiple bi-encoders through query-specific confidence calculation, advanced dynamic weighting, and ensemble score fusion with cross-encoder reranker. Achieved 3rd place in Legal Information Retrieval task (F2-score: 0.8482, 7.51% improvement) and 2nd place in Legal Question Answering (97.56% accuracy) in ALQAC 2025 Competition. Paper accepted at 17th International Conference on Knowledge and System Engineering (KSE 2025).

Advisor: Assoc. Prof. Minh-Tien Nguyen

February 2024 - September 2025

IntelliChat - Question Answering System for Vietnam Legal Documents

NLU Laboratory, Hung Yen University of Technology and Education, Vietnam

Built a demo legal question-answering system for Vietnamese, integrating information retrieval with answer extraction/generation optimized for the legal domain. IntelliChat outperforms GPT-3.5 and state-of-the-art open-source LLMs (~7B parameters) in both automatic and human evaluations, and is deployed online to enable Vietnamese citizens to independently access and understand legal documents.

Advisor: Assoc. Prof. Minh-Tien Nguyen

February 2024 - June 2025

QACTune - Advanced Legal Information Retrieval Framework

NLU Laboratory, Hung Yen University of Technology and Education, Vietnam

Developed a novel fine-tuning framework leveraging Question-Context-Answer relationships for enhancing legal information retrieval in low-resource settings. Average improvements of 3.9% and 4.8% in MAP@100. Published in Engineering Applications of Artificial Intelligence (WoS-SCIE, Q1, IF: 8.0).

Advisor: Assoc. Prof. Minh-Tien Nguyen

July 2023 - January 2024

ViEduQAG - Vietnamese Question and Answer Generation in Education

NLU Laboratory, Hung Yen University of Technology and Education, Vietnam

Pioneered Vietnamese Question-Answer Generation research in education domain by creating ViEduQA - the first comprehensive Vietnamese educational QAG dataset with 12,618 QA pairs across 319 lessons from 4 high school subjects. Published in SOICT 2024 (Springer CCIS).

Advisor: Assoc. Prof. Minh-Tien Nguyen

Work Experience

February 2025 – July 2025

NLP Fresher @ Artificial Intelligence Institute JSC, Hung Yen, Vietnam

SmartChat - Smart AI Assistant for Vietnamese

Led comprehensive evaluation of large language models for production chatbot system, focusing on optimizing performance across multiple NLP tasks including document reranking, question rewriting, and content generation. Conducted systematic benchmarking of state-of-the-art models including GPT-4o-mini, Amazon Nova Lite, Amazon Nova Micro, and Amazon Nova Pro.

AI Assistant for Vietnam Ministry of Agriculture

Developed a comprehensive domain-specific chatbot system enabling intelligent question-answering capabilities over legal document collections and regulatory text corpora. Built innovative text-to-analytics functionality and designed a novel document chunking mechanism combining Depth-First Search (DFS) algorithms with advanced Regular Expression patterns.

October 2024 – January 2025

AI Intern @ Viettel Telecom Corporation, Hanoi, Vietnam

Per-Title Encoding

Developed and optimized per-title encoding algorithms for video compression on the TV360 streaming platform. Implemented advanced analysis techniques to assess video complexity and dynamically adjust encoding parameters, achieving optimal balance between visual quality and file size.

Video Frame Interpolation

Designed and implemented advanced video frame interpolation systems to enhance motion smoothness for TV360 platform content delivery. Developed sophisticated interpolation algorithms using deep learning techniques to generate high-quality intermediate frames.

Video Quality Assessment for User Generated Content

Built comprehensive video quality assessment frameworks for evaluating and enhancing user-generated content on the TV360 platform using computer vision and machine learning techniques.

Teaching & Mentoring

April 2024 – July 2025

Natural Language Processing - Teaching Assistant

CS19TN, Hung Yen University of Technology and Education

Assisted in teaching Natural Language Processing course, guiding students through fundamental and advanced NLP concepts, helping with assignments and projects.

April 2024 – July 2025

Natural Language Processing - Key Mentor

UTEHY-NLU Lab

Mentored students in NLP research projects, guiding them through literature review, experimental design, and paper writing.

December 2023 – July 2025

Deep Learning - Key Mentor

UTEHY-NLU Lab

Guided students in deep learning fundamentals and applications, covering neural networks, CNNs, RNNs, and Transformers.

June 2023 – July 2025

Machine Learning - Key Mentor

UTEHY-NLU Lab

Mentored students in machine learning concepts and practical applications, covering supervised and unsupervised learning algorithms.

Achievements

Academic Competitions

ALQAC 2025 Competition

July 2025

Top #2 in Legal Question Answering (0.9756 Accuracy) and Top #3 in Legal Information Retrieval (0.8482 F2-Score)

ALQAC 2024 Competition

July 2024

Top #5 in Legal Question Answering task

Scholarships

Academic Excellence Scholarships

2021-2025

4 Academic Excellence & 8 Talented Program Scholarships - Consistently ranked #1 in CS program

Full Tuition Waiver

2021-2023

Two consecutive academic years

Awards

Sencond Prize - MOE-level Student Scientific Research

June 2025

Research project: Research on building question answering system for Vietnamese legal documents

Excellence Graduate Thesis Presentation

June 2025

Thesis title: A Study of Vietnamese Legal Question Answering with Pre-trained and Large Language Models

First Prize - University-level Student Scientific Research

May 2025

Research project: Research on building question answering system for Vietnamese legal documents

First Prize - Faculty-level Student Scientific Research

March 2025

Research project: Research on building question answering system for Vietnamese legal documents

Second Prize - Faculty-level Student Scientific Research

June 2024

Research project: Research on developing a student attendance system using facial recognition and detecting unusual behavior in the classroom

Outstanding Student (5 Goods)

April 2023

First Prize - University-level English Olympic

February 2022

Skills

Programming Languages

  • Python
  • T-SQL
  • C#

Libraries & Frameworks

  • Transformers
  • Sentence-Transformers
  • HuggingFace
  • NLTK, Spacy
  • Scikit Learn
  • PyTorch
  • Pandas, Numpy
  • Streamlit

Techniques

  • Continual Pretraining PLMs
  • Continual Pretraining LLMs
  • Fine-tuning PLMs
  • Instruction-tuning LLMs
  • Parameter-Efficient Fine-Tuning
  • Hard-negative Mining

Soft Skills

  • Academic Presentation
  • Technical Report Writing
  • Problem Solving
  • Creative Thinking

Database

  • SQL
  • MySQL
  • Apache

Get In Touch

nguyentruongphuc_12421tn@utehy.edu.vn

Feel free to reach out for collaborations, research opportunities, or just to connect!