Stay Safe, Stay Home in this COVID Pandemic. Let's Join Online From Home and Focus on New Skills!! Get trained from Our Experienced Working Professional Trainers. You can Call us at +91 7500000856, Email: info@convergentlearningsolutions.com for any query or support!!

R Programming

R is a programming language and free software environment for statistical computing and graphics supported by the R Foundation for Statistical Computing. The R language is widely used among statisticians and data miners for developing statistical software and data analysis. Polls, data mining surveys, and studies of scholarly literature databases show substantial increases in popularity as of January 2021, R ranks 9th in the TIOBE index, a measure of popularity of programming languages.

R language for statistical programming, various features of R, introduction to RStudio, statistical packages, familiarity with different data types and functions, learning to deploy them in various scenarios, use SQL to apply ‘join’ function, components of RStudio like code editor, visualization and debugging tools and learn about R-bind

R functions, code compilation and data in well-defined format called R Packages, R Package structure, package metadata and testing, CRAN (Comprehensive R Archive Network), vector creation and variables values assignment

R functionality, Rep function, generating repeats, sorting and generating factor levels, transpose and stack function

Introduction to matrix and vector in R, understanding various functions like Merge, Strsplit, Matrix manipulation, rowSums, rowMeans, colMeans, colSums, sequencing, repetition, indexing and other functions

Understanding subscripts in plots in R, how to obtain parts of vectors, using subscripts with arrays, as logical variables, with lists and understanding how to read data from external files

Generate plots in R, graphs, bar plots, line plots, histograms and components of a pie chart

Understanding analysis of variance (ANOVA) statistical technique, working with pie charts and histograms and deploying ANOVA with R, one-way ANOVA and two-way ANOVA

K-Means clustering for cluster and affinity analysis, cluster algorithm, cohesive subset of items, solving clustering issues, working with large datasets, association rule mining affinity analysis for data mining and analysis and learning co-occurrence relationships

Introduction to Association Rule Mining, various concepts of Association Rule Mining, various methods to predict relations between variables in large datasets, algorithm and rules of Association Rule Mining and understanding single cardinality

Understanding what is simple linear regression, various equations of line, slope, Y-intercept regression line, deploying analysis using regression, the least square criterion, interpreting the results and standard error to estimate and measure of variation

Scatter plots, two-variable relationship, simple regression analysis and line of best fit

Deep understanding of the measure of variation, the concept of co-efficient of determination, F-test, the test statistic with an F-distribution, advanced regression in R and prediction linear regression

Logistic regression mean and logistic regression in R

Advanced logistic regression, understanding how to do prediction using logistic regression, ensuring if the model is accurate, understanding sensitivity and specificity, confusion matrix, what is ROC, a graphical plot illustrating binary classifier system and ROC curve in R for determining sensitivity/specificity trade-offs for a binary classifier

Detailed understanding of ROC, area under ROC curve, converting the variable, data set partitioning, understanding how to check for multicollinearity, how two or more variables are highly correlated, building of model, advanced data set partitioning, interpreting of the output, predicting the output, detailed confusion matrix and deploying the Hosmer-Lemeshow test for checking whether the observed event rates match the expected event rates

Data analysis with R, understanding the Wald test, MC Fadden’s pseudo R-squared, the significance of the area under ROC curve, Kolmogorov–Smirnov chart which is a non-parametric test of one-dimensional probability distribution

Connecting to various databases from the R environment, deploying the ODBC tables for reading the data and visualization of the performance of the algorithm using confusion matrix