Thanh Nguyen

Thanh Hai Nguyen

AI Researcher and Engineer - Software Architect - Senior Software Engineer

Research topics: Artificial Intelligence, Machine Learning, Data Mining and Big-Data

My Resume My CV

About Me

I completed my PhD degree in the Computer Science Department at New Mexico State University. During my degree, I had been working as a software engineer for LandPKS project and Phylotastic since 2014, in which I applied Machine Learning and Artificial Intelligence methodologies. I graduated with my Masters of Computer Science at James Cook University in Australia. My thesis, entitled Data Mining in Internet Banking, is currently being used in a number of Asian banks.

My research interests include Data Mining and Knowledge Discovery(classification, clustering, association rules and prediction models), Artificial Intelligence (Knowledge Representation and Reasoning (Argumentation, Logic Programming, Answer Set Programming, Nonmonotonic Reasoning), Reasoning about Actions and Changes, Planning, Scheduling, Autonomous Agents, Robotics, Control Theory, Inductive Logic Programming, Web Services Composition & Semantics Web), Machine Learning and Collective Intelligence (recommendation system, discovering groups, searching and ranking, collaborative filtering, document filtering, generative modelling, advanced classification, regression etc.) and Big-Data processing (Map-Reduce framework).

The primary research in my PhD thesis focused on Automation Web Services Composition in Semantics Web. I developed a completed end-to-end AI system to collect requirements from users in Natural Language and explore workflows that can satisfy the users requirements automatically. After the workflow is achieved, our system is able to execute each Web Service component in workflow sequence in order to achieve the goal.

I am extremely knowledgeable with eleven years of industry experience in software development. I possess a wide skill-set including both back-end and front-end development. I also have strong abilities in problem solving and critical thinking.

Scientific papers

Journal researches

1. Thanh Hai Nguyen, Tran Cao Son, Enrico Pontelli. Phylotastic: An Experiment in Creating, Manipulating, and Evolving Phylogenetic Biology Workflows Using Logic Programming. TPLP Theory and Practice of Logic Programming Journal 2018 [PDF [arXiv]] [Paper]

2. Van D. Nguyen, Thanh H Nguyen, Abu Saleh Md. Tayeen, H. Dail Laughinghouse IV, Luna L. Sánchez-Reyes, Enrico Pontelli, Dmitry Mozzherin, Brian O'Meara, Arlin Stoltzfus. Phylotastic: Improving Access to Tree-of-Life Knowledge With Flexible, on-the-Fly Delivery of Trees. Evolutionary Bioinformatics [Paper]

3. Abu Saleh Md Tayeen, Thanh Hai Nguyen, Van Nguyen, Enrico Pontelli. Design and Implementation of Phylotastic, a Service Architecture for Evolutionary Biology. International Journal of Software Engineering and Knowledge Engineering 2020 | Vol.30, No.10, pp. 1525-1550 (2020) [PDF] [Paper]

4. Thanh Hai Nguyen, Tran Cao Son, Matthew Bundas, Marcello Balduccinni, Kathleen Campbell Garwood, Edward R. Griffo. Specifying, and Reasoning about CPS through the Lens of the NIST CPS Framework. TPLP Theory and Practice of Logic Programming Journal 2022[PDF] [Paper]


Computer Science Conference Papers (Full Papers)

1. Thanh Hai Nguyen, Tran Cao Son, Enrico Pontelli. Automatic Web Services Composition for Phylotastic. PADL 2018 20th International Symposium on Practical Aspects of Declarative Languages [PDF]

2. Thanh Hai Nguyen, Tran Cao Son, Enrico Pontelli. Phylotastic: An Experiment in Creating, Manipulating, and Evolving Phylogenetic Biology Workflows Using Logic Programming. ICLP 2018 34th International Conference on Logic Programming.[PDF [arXiv]] [Paper]

3. Thanh Hai Nguyen, Tran Cao Son, Enrico Pontelli. On Repairing Web Services Workflows. The 22nd International Symposium on Practical Aspects of Declarative Languages.[PDF] [Paper]

4. Thanh Hai Nguyen, Tran Cao Son, Matthew Bundas, Marcello Balduccinni, Kathleen Campbell Garwood, Edward R. Griffo. Specifying and Reasoning about Concerns in Cyber-Physical System Using Answer Set Programming. 17th International Conference on Principles of Knowledge Representation and Reasoning KR-2020. [PDF] [Paper]

5. Thanh Hai Nguyen, Tran Cao Son, Matthew Bundas, Marcello Balduccinni, Kathleen Campbell Garwood, Edward R. Griffo. Reasoning about Trustworthiness in Cyber-Physical Systems Using Ontology-Based Representation and ASP. The 23rd International Conference on Principles and Practice of Multi-Agent Systems. PRIMA-2020[PDF] [Paper]

6. Matthew Bundas, Chasity Nadeau, Thanh Hai Nguyen, Jeannine Shantz, Marcello Balduccini, and Tran Cao Son.Towards a Framework for Characterizing the Behavior of AI-Enabled Cyber-Physical and IoT Systems. The 7th IEEE World Forum on Internet of Things 2021. WF-IOT-2021[PDF] [Paper]


Applied Sciences, Workshop, DC

1. Joshua Beniston, Adam Beh, Thanh Nguyen, Lilian Ndungu, Jason Karl, Jeffrey Herrick. The Land Potential Knowledge System: Generating site-specific estimates of land potential productivity and degradation risk using a mobile application and cloud computing. AgMIP 5th Global Workshop (2015) [PDF]

2. Thanh H Nguyen. An Automatic Web Services Composition Framework over Biological Domain and Specifications. LPNMR Doctoral Consortium 2019 - The 15th International Conference on Logic Programming and Non-monotonic Reasoning [PDF]

3. Herrick, Jeffrey E.; Quandt, Amy; Kenny, Ciaran; Contreras, Maximilian; Neff, Jason; Jang, Won Seok; Maynard, Jonathan; Peacock, George; Salley, Shawn; Daniel, Elifadhili; Buni, Adane; Ndungu, Lilian; Herrera, Jolene M.; Nguyen, Thanh; Terrones, Luis; Karl, Jason; Kimiti, David; Nauman, Travis; Uruma, Kevin; Schrader, Scott; Courtright, Ericha; Van Zee, Justin. LandPKS (Land Potential Knowledge System): Mobile App for Extension, Land-Use and Project Planning, M&E and On-Farm Research. LandPotential Project. USDA Research DataSet [PDF]

Latest Projects


TVoD TVoD

TVOD - Multiple-screens Video Streaming Platform

This platform was built based on two core systems : CMS (Content Management System) and CDN (Content Delivery Network). In addition, there are some other involved components : TVOD API System, TVOD Transcoding Software, TVOD Portal, TVOD Mobile Applications and Admin System. This platform provide completed system that serves Video content (Video on Demand and Live Streaming content) to end-users in multiple-screens, multiple-platforms.

Find out more

View on GitHub

VNPT CDN

VNPT Technology CDN platform

CDN (Content Delivery Network) is VNPT Technology’s solution for delivering content to end users reliably and efficiently. CDN is consisted on a large distributed network of server nodes deployed on VNPT’s backbone network with servers located in many data centers across Vietnam. This network of server helps optimize network traffic and deliver content to end-users with best download speed and best quality. With CDN, content provider can have their content delivered to users easily with best quality without investing in expensive hosting facility.

Find out more

View on GitHub

Other Projects

Rosetta Open Source

Rosetta-NRCS is a program developed by the USDA-ARS Salinity Lab in Riverside, CA. The only difference in the ARS Rosetta version and the Rosetta-NRCS version is that Rosetta identifies only the depth to the bottom of a soil layer. Rosetta-NRCS identifies both the top and bottom of a soil layer.
Rosetta can be used to estimate the following properties:

  • Water retention parameters according to van Genuchten (1980)
  • Saturated hydraulic conductivity
  • Unsaturated hydraulic conductivity parameters according to van Genuchten (1980) and Mualem (1976)

View on GitHub

View more

Work Experience

Software Architect/Developer - LandPKS - USDA - NMSU (2014 - Present)

I designed and initiated software architechture for whole LandPKS project and the connection between all components in project. Starting from system designation to end-users experiences.

In this project, I developed Data Analytic System which is based on Data Mining to get as much knowledge as possible from soil, weather and water data to build models for analyzing soil potential; a Data Portal that analyses and display data; Mobile applications (both Android and iOS) that allow collecting and interacting data and a Prediction Model that run data mining algorithm to get as much knowledge as possible

Software Developer - Phylotastic (2014 - Present)

I worked on mobile app, web services, semantic integration

Deputy Director - Technical Leader - Lead Developer - Software Architect/Engineer - R&D Center - VNPT Technology (2011 - 2013)

Design and lead everything in software architecture and development of all products in company

My GitHub

GitHub Calendar

Loading the data just for you.

Github Activities