curriculum vitae

General Information

Full Name Thang Minh Pham
Contact pmthangk09 [at] gmail [dot] com
Languages Vietnamese (Native), English (Fluent)

Research Areas

  • I'm particularly interested in making AI systems more interpretable and efficient, recently developing SlimLM models for on-device document assistance and creating methods to enhance the explainability and robustness of language models in NLP and CV tasks. My current research focuses on developing efficient small language models and model optimization techniques for mobile devices, building robust retrieval-augmented generation (RAG) systems and multi-agent architectures.

Education

  • 2019 - 2024
    Ph.D. in Computer Science (ML/AI)
    Auburn University, AL, USA
    • Advised by Prof. Anh Nguyen
    • GPA: 3.91/4.0
    • Thesis: Improving Interpretability and Accuracy of Artificial Intelligence in Natural Language and Image Understanding
  • 2009 - 2013
    B.S. (Honours) in Computer Science and Engineering
    University of Science (HCMUS), Ho Chi Minh City, Vietnam
    • Advised by Dr. Son Tran
    • GPA: 3.39/4.0
    • Thesis: Scene Text Detection and Recognition (Distinction)

Work Experience

  • 06/2024 - now
    Research Scientist Intern at Adobe Reseach
    • Quantizes and deploys LLMs (∼0.1–7 billion parameters) locally on mobile devices for benchmarking.
    • Proposes SlimLM models (∼0.05–3 billion parameters), pre-trained from scratch on ∼1–5 trillion tokens using datasets such as SlimPajama, FineWeb, DCLM or SmolLM.
    • Finetunes and evaluates SlimLM and baseline models for Q/A, question suggestion and summarization for documents.
    • Develops a mobile application for PDF reader with AI assistant to demonstrate the efficiency of SlimLM on documents.
  • 09/2023 - 01/2024
    Research Scientist Intern at Adobe Reseach
    • Crawled and analyzed trends in Reddit's image editing requests.
    • Constructed an instruction-following dataset and a human preference data to teach large language models (LLMs) to use visual foundation models for handling image editing requests.
    • Fine-tuned open-source LLMs (e.g., LLaMA-2, Mistral, Zephyr, Falcon, MPT) across different versions (7B, 13B, 70B) to correctly generate and execute a plan including a sequence of image editing tools to handle simple, complex and implicit user requests for Adobe products (e.g., Photoshop).
  • 05/2021 - 11/2021
    Research Scientist Intern at Adobe Reseach
    • Developed a deep neural model for phrase representation.
    • Designed a human annotation guideline and led the team (∼15 people) to construct a dataset and benchmark to enhance and evaluate in-context phrase representation for semantic search.
    • Fine-tuned and evaluate small and large language models in understanding phrases in context.
    • Developed a smart phrase search system that is potentially utilized in the Acrobat product.
  • 05/2017 - 08/2019
    ML/AI Research Engineer at AIST Japan
    • Investigated state-of-the-art deep learning models for named entity recognition (NER), relation and event extraction in biomedical domain.
    • Implemented novel deep neural networks for NER task with Dr. Sohrab (EMNLP 2018) and Dr. Ju (NAACL 2018).
    • Developed the DeepEventMine system with Dr. Trieu in the first phase of the project.
    • Fine-tuned hyper-parameters with greedy search and Bayesian optimization methods.
    • Pre-processed biomedical corpora (JNLPBA2004, GENIA, ACE2005, CG2013, MLEE, PHAEDRA).
    • Optimized parallel computing with multiple GPUs for speeding up a model training process.
    • Collaborated with Dr. Nagano to create the EzCat database.
    • Developed pipelines to evaluate EventMine systems for NaCTeM (University of Manchester).
  • 01/2014 - 05/2017
    Senior Software Engineer at OPSWAT Vietnam (now Beowulf)
  • 08/2013 - 12/2013
    Software Engineer at VoxyPAD Vietnam (now Beowulf)
    • Developed a back-end system using Spring framework to provide user account management for virtual RADIUS servers.
    • Developed a socket server for an over-the-top application to send and receive messages via socket.

Honors and Awards

  • 2023
    • Diversity and Inclusion Award, EACL 2023.
    • Graduate Student Council Travel Fellowship.
  • 2019 - 2024
    • Auburn University Graduate Assistantship.
  • 2009 - 2013
    • University of Science Faculty of IT Excellence Fellowship.
  • 2009
    • Top-3 university entrance award from high school.

Professional and Volunteering Services

  • 2024
    • I am a reviewer for the following conferences: NeurIPS, ACL, EMNLP, NAACL.
  • 2023
    • I was a reviewer for the following conferences: NeurIPS, ICLR, ACL, EMNLP, EACL.
  • 08/2022 - 05/2023
  • 2022
    • I was a reviewer for the following conferences: NeurIPS, EMNLP.

Technical Skills

  • Programming Languages: Python, Java, C++, C#, PHP, Javascript.
  • Deep Learning Frameworks: PyTorch, TensorFlow, Chainer.
  • NLP Tools: NLTK, SciPy, spaCy, Pandas, Brat Annotation, Argo, EventMine.
  • Databases: MySQL, PostgreSQL, MongoDB.

Other Interests

  • Sports: Soccer, Tennis, Badminton, Swimming.
  • Hobbies: Traveling and Gaming.