PROFESSIONAL DEVELOPMENT TRAINING ON DATA MINING AND BIG DATA ANALYSIS BY USING R SOFTWARE
Date: 09 – 13 March 2026
Venue: Masterclass Board Room, Tetex House, Dar es Salaam
Fee: TZS 700,000
Gain a strong foundation in data analysis using R, one of the most commonly used statistical programs. In this professional training, you will have the opportunity to build and leverage your data skills for upward mobility at any stage of your career. You’ll learn the six steps of the data lifecycle, using different case studies and contexts. You will analyze, manage, and communicate data, working in R to achieve basic R programming competencies. You will learn how to install and configure the software necessary for a statistical programming environment and describe generic programming language concepts as they are implemented in a high-level statistical language.
The trainer will use visualization techniques to explore new data sets and determine the most appropriate approach. We will describe robust statistical techniques as alternatives when data do not fit the assumptions required by the standard approaches. By using R scripts to analyze data, you will learn the basics of conducting reproducible research.
The course covers practical issues in statistical computing, including programming in R, reading data into R, accessing R packages, writing R functions, debugging, profiling R code, and organizing and commenting R code. R runs on a wide variety of UNIX platforms, Linux, Windows, and macOS.
TRAINING OUTCOME
At the end of the training, participants will:
- Understand exploratory data analysis (EDA) as a journey and a way to explore data
- Explore data at multiple levels using appropriate visualizations
- Identify different types of data visualizations and know when to use each effectively
- Demonstrate curiosity and skepticism when performing data analysis
- Develop intuition around a data set and understand how the data was generated
- Acquire statistical knowledge for summarizing data
- Master the foundations of data management, including dataset identification, preparation, and lifecycle
COURSE OUTLINE
- Getting Started with RStudio
- Working with R and RStudio
- Using Scripts
- Objects and Workspaces
- Data Types
- R Objects
- R Variables
- R Operators
- Working with R
- Data Management and Manipulation
- Descriptive Statistics
- Inferential Statistics
- Regression Analysis
WHO WILL ATTEND THE TRAINING
R programming skills will benefit managers and officers in the fields of statistics and data analysis, as well as data scientists, big data engineers, IT specialists, and database developers.
DATE
The training workshop will be held from 8 to 12 September 2025 in Dar es Salaam.
FACILITATOR
Mr. John Majaliwa holds a BSc in Information Technology (IT) from IFM and a Master’s degree in Official Statistics (MOS) from the Eastern Africa Statistical Training Centre (EASTC). He is a lecturer and consultant at EASTC.
He is an expert in teaching computer applications and data management and analysis software, and has trained many organizations within the country and abroad. His areas of expertise include advanced Excel, SPSS, STATA, and R; big data programming using R and SQL; conducting research (including writing research proposals and reports); CAPI technologies such as Survey Solutions (a mobile app for survey data collection); information systems analysis; and relational database development.
PARTICIPATION FEE
TZS 700,000/=, payable to Equity Bank Account No. 3003211802968, Account Name: Para Africa Ltd. The fee covers facilitation, installation of R, a certificate, course materials, and refreshments during the training.

