Truong-Phuc Nguyen

NLP Enthusiast

Graduate Computer Science student specializing in Natural Language Processing and Machine Learning, with focus on Vietnamese language processing, question answering systems, and legal AI applications.

About Me

Hello! I'm Truong-Phuc Nguyen (you can call me Fook), a graduate Computer Science student specializing in Natural Language Processing and Machine Learning. My expertise spans question answering systems, text generation, information retrieval, and Vietnamese language processing, using both pre-trained language models and large language models.

Through my research and practical projects, I have built proof-of-concept NLP systems, including legal QA platforms, educational question generation tools, and multilingual text processing pipelines. I am currently seeking an NLP-related position where I can apply my research experience and my understanding of language models and text processing algorithms.

Education

Hung Yen University of Technology and Education

Bachelor of Engineering in Computer Science (Talented Engineer Program)

September 2021 - June 2025

GPA: 9.04 / 10
GPA: 3.75 / 4
Class Rank: #1

Publications

Journal Articles

[1] A fine-tuning framework based on question, context, and answer relationships for enhancing legal information retrieval

Authors: Nhu Hai Phung, Chi Thanh Nguyen, Minh-Tien Nguyen, Thu Ha Nguyen, Huu Loi Le, & Truong-Phuc Nguyen
Published in: Engineering Applications of Artificial Intelligence, Volume 159, Page 111570. Elsevier, 2025
Impact: WoS, Q1, IF: 8.0
DOI: 10.1016/j.engappai.2025.111570

[2] Towards Vietnamese Legal Question Answering: An Empirical Study

Authors: Truong-Phuc Nguyen, Van-Quyet Nguyen, & Minh-Tien Nguyen
Status: Submitted to Artificial Intelligence and Law, Springer, 2025

[3] ViLegalBERT & ViLegalQwen: Lightweight Domain-Adaptive Language Models for Vietnamese Legal Text Processing

Authors: Truong-Phuc Nguyen, Quy-Nhan Nguyen, Manh-Cuong Phan, Tien-Manh Tran, Huy-The Vu & Minh-Tien Nguyen
Status: Under Writing

[4] Application of Machine Learning in Image Recognition to Detect Some Abnormalities in the Examination Rooms

Authors: Tien-Dat Nguyen, Truong-Phuc Nguyen, and Pham Minh Chuan
Published in: UTEHY Journal of Applied Science and Technology, Vol. 40, 2023, pp. 27-32

Conference Papers

[1] UTEHY-NLU@ALQAC 2025: Dynamic Weighted Ensemble and Adaptive Reasoning for Vietnamese Legal Text Processing

Authors: Truong-Phuc Nguyen, Quy-Nhan Nguyen, Manh-Cuong Phan, Chi-Hai Cao, Trinh-Hoai-An Duong, Minh-Tien Nguyen
Conference: The 17th International Conference on Knowledge and Systems Engineering (KSE 2025), ISAILD, 2025
Status: Accepted

[2] ViEduQA: A New Vietnamese Dataset for Question Answer Generation in Education

Authors: Truong-Phuc Nguyen, Huu-Loi Le, Phuc Quoc-Hung, Nhan Quy Huy, Xuan-Hieu Phan, and Minh-Tien Nguyen
Published in: Information and Communication Technology. SOICT 2024. CCIS, vol. 2352, pp. 441-455. Springer, Singapore, 2025
DOI: 10.1007/978-981-96-4288-5_34

[3] Vietnamese Legal Question Answering: An Experimental Study

Authors: Thu-Ha Nguyen, Truong-Phuc Nguyen, Khang T. Trung, Huu-Loi Le, Le Thi Viet Huong, Chi Thanh Nguyen, and Minh-Tien Nguyen
Published in: Proceedings of 2024 16th International Conference on Knowledge and Systems Engineering (KSE), Kuala Lumpur, Malaysia, 2024, pp. 1-6. IEEE
DOI: 10.1109/KSE63888.2024.11063637

Experience

August 2025 – Present

MedMAS - Multi-agent System for Pre-visit Clinical Note Generation

Biomedical Laboratory, Feng Chia University, Taiwan

Building a multi-agent system for the medical domain that extracts patient information through conversation, generates follow-up questions to collect further details, and creates summary reports for the pre-visit stage. The goal is to save doctors' examination time and improve the patient's experience during examination and treatment.

Advisor: Prof. Fang-Rong Hsu

September 2024 – Present

ViLegalBERT & ViLegalQwen - Domain-specific Language Models

NLU Laboratory, Hung Yen University of Technology and Education

Developing representation and generation models for the Vietnamese legal domain through continual pretraining of language models on large datasets drawn from four sources of authoritative Vietnamese legal documents. The resulting legal models are trained on high-quality, large-scale synthetic datasets and compared with base models and Vietnamese-specific models of the same size on Question Answering (True/False, Multiple-choice), Natural Language Inference, and Text Classification.

Advisor: Assoc. Prof. Minh-Tien Nguyen

August 2025 – September 2025

Adaptive Weighted Ensemble for Legal Text Processing

NLU Laboratory, UTEHY

Designed a framework that combines multiple bi-encoders through query-specific confidence calculation, dynamic weighting, and ensemble score fusion with a cross-encoder reranker. Achieved 3rd place in the Legal Information Retrieval task (F2-score: 0.8482, 7.51% improvement) and 2nd place in the Legal Question Answering task (97.56% accuracy). Paper accepted at ISAILD-KSE 2025.

Advisor: Assoc. Prof. Minh-Tien Nguyen
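
As a rough illustration of the confidence-weighted fusion idea in this project, the sketch below fuses the scores of two retrievers for a single query. It is not the competition system: the confidence measure (the margin between the top two normalised scores), the softmax weighting, and the function names `min_max`, `confidence`, and `fuse` are all assumptions made for the example, and the cross-encoder reranking step is left out.

```python
import math

def min_max(scores):
    """Rescale scores to [0, 1]; a constant list maps to all zeros."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]

def confidence(norm_scores):
    """Assumed per-query confidence: margin between the two best scores."""
    top = sorted(norm_scores, reverse=True)
    return top[0] - top[1] if len(top) > 1 else 1.0

def fuse(score_lists, temperature=1.0):
    """Fuse per-retriever scores for one query using softmax confidence weights."""
    normed = [min_max(s) for s in score_lists]
    exps = [math.exp(confidence(n) / temperature) for n in normed]
    total = sum(exps)
    weights = [e / total for e in exps]
    n_candidates = len(normed[0])
    fused = [
        sum(w * n[i] for w, n in zip(weights, normed))
        for i in range(n_candidates)
    ]
    return fused, weights

# Two hypothetical bi-encoders scoring the same three candidate articles.
retriever_a = [0.90, 0.20, 0.10]  # clear winner, so high confidence
retriever_b = [0.50, 0.48, 0.10]  # top two nearly tied, so low confidence
fused, weights = fuse([retriever_a, retriever_b])
```

Because the weights are recomputed for every query, a retriever that separates its top candidate clearly contributes more to the fused score for that query than one whose top candidates are nearly tied.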

February 2024 – September 2025

IntelliChat - Vietnamese Legal Question Answering System

NLU Laboratory, UTEHY

Built an end-to-end legal QA system for Vietnamese, integrating information retrieval with answer generation. The system processes legal queries, retrieves relevant legal articles, and generates accurate answers using state-of-the-art NLP techniques.

Advisor: Assoc. Prof. Minh-Tien Nguyen

February 2024 – June 2025

QACTune - Advanced Legal Information Retrieval Framework

NLU Laboratory, UTEHY

Developed a novel fine-tuning framework leveraging Question-Context-Answer relationships to enhance legal information retrieval in low-resource settings. Achieved average improvements of 3.9% and 4.8% in MAP@K. Published in Engineering Applications of Artificial Intelligence (WoS, Q1, IF: 8.0).

Advisor: Assoc. Prof. Minh-Tien Nguyen
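
For readers unfamiliar with the MAP@K metric mentioned in this project, here is a minimal sketch of one common way to compute it. The normalisation by `min(k, number of relevant documents)` is an assumption (definitions vary between papers), the function names are illustrative, and ranked lists are assumed to contain no duplicate IDs.

```python
def average_precision_at_k(ranked_ids, relevant_ids, k):
    """AP@K for one query: mean of precision values at each relevant hit in the top k."""
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    hits, precision_sum = 0, 0.0
    for rank, doc_id in enumerate(ranked_ids[:k], start=1):
        if doc_id in relevant:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / min(k, len(relevant))

def map_at_k(rankings, relevance, k):
    """MAP@K: average of AP@K over all queries."""
    aps = [average_precision_at_k(r, rel, k) for r, rel in zip(rankings, relevance)]
    return sum(aps) / len(aps) if aps else 0.0

# Two toy queries with hypothetical document IDs.
rankings = [["a", "b", "c"], ["x", "y", "z"]]
relevance = [["a", "c"], ["y"]]
score = map_at_k(rankings, relevance, k=3)
```

In this toy case the first query scores (1/1 + 2/3) / 2 and the second scores 1/2, so `score` is their mean, 2/3.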

July 2023 – January 2024

ViEduQAG - Vietnamese Question and Answer Generation in Education

NLU Laboratory, UTEHY

Pioneered Vietnamese Question-Answer Generation research in the education domain by creating ViEduQA, the first comprehensive Vietnamese educational QAG dataset, with 12,618 QA pairs across 319 lessons from 4 high school subjects. Published in SOICT 2024 (Springer CCIS).

Advisor: Assoc. Prof. Minh-Tien Nguyen

Work Experience

February 2025 – July 2025

NLP Fresher @ Artificial Intelligence Institute JSC, Hung Yen, Vietnam

SmartChat - Smart AI Assistant for Vietnamese

Led a comprehensive evaluation of large language models for a production chatbot system, focusing on optimizing performance across multiple NLP tasks including document reranking, question rewriting, and content generation. Conducted systematic benchmarking of state-of-the-art models including GPT-4o-mini, Amazon Nova Lite, Amazon Nova Micro, and Amazon Nova Pro.

AI Assistant for Vietnam Ministry of Agriculture

Developed a comprehensive domain-specific chatbot system enabling intelligent question-answering capabilities over legal document collections and regulatory text corpora. Built innovative text-to-analytics functionality and designed a novel document chunking mechanism combining Depth-First Search (DFS) algorithms with advanced Regular Expression patterns.
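
The general idea of combining regular expressions with a depth-first traversal for chunking can be sketched as follows. This is only an illustration under stated assumptions, not the production mechanism: the heading patterns in `LEVELS`, the English sample text, and the helper names `build_tree` and `dfs_chunks` are all hypothetical, and text that appears before the first heading is ignored.

```python
import re

# Hypothetical heading patterns, ordered from outermost to innermost level.
LEVELS = [
    re.compile(r"^Chapter\s+\w+"),
    re.compile(r"^Article\s+\d+"),
    re.compile(r"^\d+\."),
]

def heading_level(line):
    """Return the nesting depth of a heading line, or None for body text."""
    for depth, pattern in enumerate(LEVELS):
        if pattern.match(line):
            return depth
    return None

def build_tree(text):
    """Parse lines into a tree of headings using a stack of open nodes."""
    root = {"title": "", "depth": -1, "body": [], "children": []}
    stack = [root]
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        depth = heading_level(line)
        if depth is None:
            stack[-1]["body"].append(line)
            continue
        # Close every open node at the same or a deeper level.
        while stack[-1]["depth"] >= depth:
            stack.pop()
        node = {"title": line, "depth": depth, "body": [], "children": []}
        stack[-1]["children"].append(node)
        stack.append(node)
    return root

def dfs_chunks(node, ancestors=()):
    """Depth-first walk: emit a chunk for each node with body text or no children."""
    is_root = node["depth"] < 0
    if not is_root and (node["body"] or not node["children"]):
        yield {"path": ancestors, "text": "\n".join([node["title"], *node["body"]])}
    child_ancestors = ancestors if is_root else ancestors + (node["title"],)
    for child in node["children"]:
        yield from dfs_chunks(child, child_ancestors)

SAMPLE = """
Chapter I General provisions
Article 1 Scope
This law regulates land use.
Article 2 Definitions
1. Land user means a person.
2. Plot means a bounded area.
"""

chunks = list(dfs_chunks(build_tree(SAMPLE)))
```

Each chunk carries the path of its ancestor headings, so a clause can be indexed together with the chapter and article it belongs to.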

October 2024 – January 2025

AI Intern @ Viettel Telecom Corporation, Hanoi, Vietnam

Per-Title Encoding

Developed and optimized per-title encoding algorithms for video compression on the TV360 streaming platform. Implemented analysis techniques to assess video complexity and dynamically adjust encoding parameters, balancing visual quality against file size.

Video Frame Interpolation

Designed and implemented advanced video frame interpolation systems to enhance motion smoothness for TV360 platform content delivery. Developed sophisticated interpolation algorithms using deep learning techniques to generate high-quality intermediate frames.

Video Quality Assessment for User Generated Content

Built comprehensive video quality assessment frameworks for evaluating and enhancing user-generated content on the TV360 platform using computer vision and machine learning techniques.

Teaching & Mentoring

April 2024 – July 2025

Natural Language Processing - Teaching Assistant

CS19TN, Hung Yen University of Technology and Education

Assisted in teaching the Natural Language Processing course, guiding students through fundamental and advanced NLP concepts and helping with assignments and projects.

April 2024 – July 2025

Natural Language Processing - Key Mentor

UTEHY-NLU Lab

Mentored students in NLP research projects, guiding them through literature review, experimental design, and paper writing.

December 2023 – July 2025

Deep Learning - Key Mentor

UTEHY-NLU Lab

Guided students in deep learning fundamentals and applications, covering neural networks, CNNs, RNNs, and Transformers.

June 2023 – July 2025

Machine Learning - Key Mentor

UTEHY-NLU Lab

Mentored students in machine learning concepts and practical applications, covering supervised and unsupervised learning algorithms.

Achievements

Academic Competitions

ALQAC 2025 Competition

July 2025

Top #2 in Legal Question Answering (0.9756 Accuracy) and Top #3 in Legal Information Retrieval (0.8482 F2-Score)

ALQAC 2024 Competition

July 2024

Top #5 in Legal Question Answering task

Certifications

November 2025

KSE 2025 Certification of Presentation

July 2025

TOEIC Reading & Listening

February 2025

Hugging Face Agents Course

December 2024

SOICT 2024 Presentation - Conference Presentation Certification

February 2024

Google Data Analytics Professional Certificate

February 2024

Stanford Machine Learning Specialization

Scholarships

TOYOTA Excellence Academic Scholarship

November 2024

Academic Excellence Scholarships

2021-2025

4 Academic Excellence & 8 Talented Program Scholarships - Consistently ranked #1

Full Tuition Waiver

2021-2023

Two consecutive academic years

Awards

Second Prize - MOE-level Student Scientific Research

June 2025

Excellent Graduate Thesis Presentation

June 2025

First Prize - University-level Student Scientific Research

May 2025

First Prize - Faculty-level Student Scientific Research

March 2025

Second Prize - Faculty-level Student Scientific Research

June 2024

Outstanding Student (5 Goods)

April 2023

First Prize - University-level English Olympiad

February 2022

Skills

Programming Languages

  • Python
  • T-SQL
  • C#

Libraries & Frameworks

  • Transformers
  • Sentence-Transformers
  • HuggingFace
  • NLTK, spaCy
  • scikit-learn
  • PyTorch
  • pandas, NumPy
  • Streamlit

Techniques

  • Continual Pretraining PLMs
  • Continual Pretraining LLMs
  • Fine-tuning PLMs
  • Instruction-tuning LLMs
  • Parameter-Efficient Fine-Tuning
  • Hard-negative Mining

Soft Skills

  • Academic Presentation
  • Technical Report Writing
  • Problem Solving
  • Creative Thinking

Database

  • SQL
  • MySQL
  • Apache

Get In Touch

nguyentruongphuc_12421tn@utehy.edu.vn

Feel free to reach out for collaborations, research opportunities, or just to connect!