Module 3

Data Analytics Essentials Certification Preparatory Course

Duration: 12 hours


This qualification is intended for individual who aspires to become a Citizen Data Scientist in the organization using open-source technology to perform moderate to sophisticated diagnostics analytics and simple predictive analytics. The Data Analytics Essentials qualification is also highly relevant to other key staff involved in the requirements input, design, development, delivery and ultimate use of the digital initiatives including Data consumer, digital initiatives decision maker, business analyst, and operational line managers/staff.

For private classes, please contact us at (852) 2116 3328 for more details.

View Schedule

Certified Skills

  • Create data models for analytics function on multiple data sources
  • Prepping the data with Jupyter Notebook, Python & SQL Lab and perform diagnostics analytics with Apache Superset
  • Select an appropriate machine learning algorithm at hand with Python & Scikit-learn

Intended Audience

Anyone who want to:

  • Prove their ability to perform self-service diagnostic analytics for insights
  • Display their value to use low-cost and high-return open-source technology to improve daily performance
  • Show their inclination to work productively with your colleague with data and analytics


  • Completion of CDPOS Module 2
  • Basic computer software skill
  • Basic internet skill


50 Multiple Choices | 75 minutes (Module 3)


Syllabus Highlights

Analytics Process

  • The analytics process of diagnostic and predictive analytics
  • Data prep tasks- data collection, data cleansing, data munging and data visualisation
  • Build analytics model - convert unstructured data into quantified metrics

Diagnostic Analytics Essentials

  • Diagnostic analytics objectives, processes, data prepping and
  • Data modelling for diagnostic analytics with Jupyter Notebook, Python and SQL Lab
  • Data visualisation for diagnostic analytics - Apache Superset

Predictive Analytics Essentials

  • Predictive analytics objectives, best practices, processes, data prepping and model building using Python, SQL Lab and Apache Superset
  • Recognise and select an appropriate machine learning algorithm at hand
  • Predictive modelling - decision-tree, clustering with Jupyter Notebook, Python and Scikit-learn


Mr. Patrick Tsoi

  • Doctor of Education (in progress), Hong Kong Baptist University
  • Master in IT in Education, University of Hong Kong
  • Bachelor of Engineering in System Engineering and Engineering Management, Chinese University of Hong Kong
  • An Experienced Trainer, Educator and Chief Data Scientist with a demonstrated history of working in the talent training and staff recruiting industry
  • He is a strong IT training professional who is proficient in Hands-on Data Science Training, Learning Management, Instructional Design, Professional Services and Programming


Mr. Simon Mok

  • M.Phil. Computer Science, University of Hong Kong
  • MSc, Computer Science, Chinese University of Hong Kong
  • He works as a Chief Technology Officer as well as a professional IT trainer
  • Simon’s work includes software development projects, specializing in IT service management, full-stack web-based and mobile apps development, software testing, Internet security, AWS web services, big data, databases and NoSQL. He is a holder of many professional qualifications from Microsoft (MCSE, MCSD), Oracle (MySQL and Oracle databases), Exin (ITIL, ISO 20000), and more. On top of his identity of CTO, he is also a frequent trainer at the HKPC and various private enterprises, both domestically and internationally.