curriculum vitae
General Information
Full Name | Thang Minh Pham |
Contact | pmthangk09 [at] gmail [dot] com |
Languages | Vietnamese (Native), English (Fluent) |
Research Areas
- I'm particularly interested in making AI systems more interpretable and efficient, recently developing SlimLM models for on-device document assistance and creating methods to enhance the explainability and robustness of language models in NLP and CV tasks. My current research focuses on developing efficient small language models and model optimization techniques for mobile devices, building robust retrieval-augmented generation (RAG) systems and multi-agent architectures.
Education
-
2019 - 2024 Ph.D. in Computer Science (ML/AI)
Auburn University, AL, USA - Advised by Prof. Anh Nguyen
- GPA: 3.91/4.0
- Thesis: Improving Interpretability and Accuracy of Artificial Intelligence in Natural Language and Image Understanding
-
2009 - 2013 B.S. (Honours) in Computer Science and Engineering
University of Science (HCMUS), Ho Chi Minh City, Vietnam - Advised by Dr. Son Tran
- GPA: 3.39/4.0
- Thesis: Scene Text Detection and Recognition (Distinction)
Work Experience
-
06/2024 - now Research Scientist Intern at Adobe Reseach
- Quantizes and deploys LLMs (∼0.1–7 billion parameters) locally on mobile devices for benchmarking.
- Proposes SlimLM models (∼0.05–3 billion parameters), pre-trained from scratch on ∼1–5 trillion tokens using datasets such as SlimPajama, FineWeb, DCLM or SmolLM.
- Finetunes and evaluates SlimLM and baseline models for Q/A, question suggestion and summarization for documents.
- Develops a mobile application for PDF reader with AI assistant to demonstrate the efficiency of SlimLM on documents.
-
09/2023 - 01/2024 Research Scientist Intern at Adobe Reseach
- Crawled and analyzed trends in Reddit's image editing requests.
- Constructed an instruction-following dataset and a human preference data to teach large language models (LLMs) to use visual foundation models for handling image editing requests.
- Fine-tuned open-source LLMs (e.g., LLaMA-2, Mistral, Zephyr, Falcon, MPT) across different versions (7B, 13B, 70B) to correctly generate and execute a plan including a sequence of image editing tools to handle simple, complex and implicit user requests for Adobe products (e.g., Photoshop).
-
05/2021 - 11/2021 Research Scientist Intern at Adobe Reseach
- Developed a deep neural model for phrase representation.
- Designed a human annotation guideline and led the team (∼15 people) to construct a dataset and benchmark to enhance and evaluate in-context phrase representation for semantic search.
- Fine-tuned and evaluate small and large language models in understanding phrases in context.
- Developed a smart phrase search system that is potentially utilized in the Acrobat product.
-
05/2017 - 08/2019 ML/AI Research Engineer at AIST Japan
- Investigated state-of-the-art deep learning models for named entity recognition (NER), relation and event extraction in biomedical domain.
- Implemented novel deep neural networks for NER task with Dr. Sohrab (EMNLP 2018) and Dr. Ju (NAACL 2018).
- Developed the DeepEventMine system with Dr. Trieu in the first phase of the project.
- Fine-tuned hyper-parameters with greedy search and Bayesian optimization methods.
- Pre-processed biomedical corpora (JNLPBA2004, GENIA, ACE2005, CG2013, MLEE, PHAEDRA).
- Optimized parallel computing with multiple GPUs for speeding up a model training process.
- Collaborated with Dr. Nagano to create the EzCat database.
- Developed pipelines to evaluate EventMine systems for NaCTeM (University of Manchester).
-
01/2014 - 05/2017 Senior Software Engineer at OPSWAT Vietnam (now Beowulf)
- Developed over-the-top mobile applications (VoxyPAD, Tutorica, Victoria and Hana).
- Worked directly with CEO Dr. William Nguyen and other team leaders to design APIs for the whole systems.
- Supported clients in the United States and in Japan (Toshiba corporation) to deploy our products.
-
08/2013 - 12/2013 Software Engineer at VoxyPAD Vietnam (now Beowulf)
- Developed a back-end system using Spring framework to provide user account management for virtual RADIUS servers.
- Developed a socket server for an over-the-top application to send and receive messages via socket.
Honors and Awards
-
2023 - Diversity and Inclusion Award, EACL 2023.
- Graduate Student Council Travel Fellowship.
-
2019 - 2024 - Auburn University Graduate Assistantship.
-
2009 - 2013 - University of Science Faculty of IT Excellence Fellowship.
-
2009 - Top-3 university entrance award from high school.
Professional and Volunteering Services
-
2024 - I am a reviewer for the following conferences: NeurIPS, ACL, EMNLP, NAACL.
-
2023 - I was a reviewer for the following conferences: NeurIPS, ICLR, ACL, EMNLP, EACL.
-
08/2022 - 05/2023 - I was a youth program personnel for K-6 AI Club.
-
2022 - I was a reviewer for the following conferences: NeurIPS, EMNLP.
Technical Skills
- Programming Languages: Python, Java, C++, C#, PHP, Javascript.
- Deep Learning Frameworks: PyTorch, TensorFlow, Chainer.
- NLP Tools: NLTK, SciPy, spaCy, Pandas, Brat Annotation, Argo, EventMine.
- Databases: MySQL, PostgreSQL, MongoDB.
Other Interests
- Sports: Soccer, Tennis, Badminton, Swimming.
- Hobbies: Traveling and Gaming.