MOST IMPORTANT R PROGRAMMING LANGUAGE INTERVIEW QUESTIONS

Question 1. Explain The Data Import In R Language.?
Answer :
R provides to import data in R language. To begin with the R commander GUI, user should type the commands in the command Rcmdr into the console.
Data can be imported in R language in 3 ways such as:

Select the data set in the dialog box or enter the name of the data set as required.

Data is entered directly using the editor of R Commander via Data->New Data Set. This works good only when the data set is not too large.

Data can also be imported from a URL or from plain text file (ASCII), or from any statistical package or from the clipboard.

Question 2. Explain How To Communicate The Outputs Of Data Analysis Using R Language.?
Answer :
Combine the data, code and analysis results in a single document using knitr for Reproducible research done. Helps to verify the findings, add to them and engage in conversations. Reproducible research makes it easy to redo the experiments by inserting new data values and applying it to different various problems.


Question 3. What Is R?
Answer :
R is a programming language which is used for developing statistical software and data analysis.

Question 4. How R Commands Are Written?
Answer :
By using # at the starting of the line of code like #division commands are written.


Question 5. What Is T-tests() In R?
Answer :
It is used to determine that the means of two groups are equal or not by using t.test() function.


Question 6. What Are The Disadvantages Of R Programming?
Answer :
The disadvantages are:-

Lack of standard GUI

Not good for big data.

Does not provide spreadsheet view of data.

Question 7. What Is The Use Of With () And By () Function In R?
Answer :
with() function applies an expression to a dataset.
#with(data,expression)
By() function applies a function t each level of a factors.
#by(data,factorlist,function)


Question 8. In R Programming, How Missing Values Are Represented?
Answer :
In R missing values are represented by NA which should be in capital letters.

Question 9. What Is The Use Of Subset() And Sample() Function In R?
Answer :
Subset() is used to select the variables and observations and sample() function is used to  generate  a random sample of the size n from a dataset.


Question 10. Explain What Is Transpose.?
Answer :
Transpose is used for reshaping of the data which is used for analysis. Transpose is performed by t() function.


Question 11. What Are The Advantages Of R?
Answer :
The advantages are:-

It is used for managing and manipulating of data.

No license restrictions

Free and open source software.

Graphical capabilities of R are good.

Runs on many Operating system and different hardware and also run on 32 & 64 bit processors etc.


Question 12. What Is The Function Used For Adding Datasets In R?
Answer :
For adding two datasets rbind() function is used but the column of two datasets must be same.
Syntax: rbind(x1,x2……) where x1,x2: vector, matrix, data frames.


Question 13. How You Can Produce Co-relations And Covariances?
Answer :
Cor-relations is produced by cor() and covariances is produced by cov() function.


Question 14. What Is Difference Between Matrix And Dataframes?
Answer :
Dataframe can contain different type of data but matrix can contain only similar type of data.

Question 15. What Is Difference Between Lapply And Sapply?
Answer :
lapply is used to show the output in the form of list whereas sapply is used to show the output in the form of vector or data frame.


Question 16. What Is The Difference Between Seq(4) And Seq_along(4)?
Answer :
Seq(4) means vector from 1 to 4 (c(1,2,3,4)) whereas seq_along(4) means a vector of the  length(4) or 1(c(1)).


Question 17. Explain How You Can Start The R Commander Gui.?
Answer :
rcmdr command is used to start the R commander GUI.


Question 18. What Is The Memory Limit Of R?
Answer :
In 32 bit system memory limit is 3Gb but most versions limited to 2Gb and in 64 bit system memory limit is 8Tb.


Question 19. How Many Data Structures R Has?
Answer :
There are 5 data structure in R i.e. vector, matrix, array which are of homogenous type and other two are list and data frame which are heterogeneous.


Question 20. Explain How Data Is Aggregated In R.?
Answer :
There are two methods that is collapsing data by using one or more BY variable and other is aggregate() function in which BY variable should be in list.


Question 21. How Many Sorting Algorithms Are Available?
Answer :
There are 5 types of sorting algorithms are used which are:-

Bubble Sort

Selection Sort

Merge Sort

Quick Sort

Bucket Sort

Question 22. How To Create New Variable In R Programming?
Answer :
For creating new variable assignment operator ‘<-’ is used For e.g. mydata$sum <- mydata$x1 + mydata$x2.

Question 23. What Are R Packages?
Answer :
Packages are the collections of data, R functions and compiled code in a well-defined format and these packages are stored in library.


Question 24. What Is The Workspace In R?
Answer :
Workspace is the current R working environment which includes any user defined objects like vector, lists etc.


Question 25. What Is The Function Which Is Used For Merging Of Data Frames Horizontally In R?
Answer :
Merge()function is used to merge two data frames
Eg: Sum<-merge(data frame1,data frame 2,by=’ID’)
 

Question 26. What Is The Function Which Is Used For Merging Of Data Frames Vertically In R?
Answer :
rbind() function is used to merge two data frames vertically.
Eg.
Sum<- rbind(data frame1,data frame 2)

Question 27. What Is The Power Analysis?
Answer :
It is used for experimental design .It is used to determine the effect of given sample size.


Question 28. Which Package Is Used For Power Analysis In R?
Answer :
Pwr package is used for power analysis in R.

Question 29. Which Method Is Used For Exporting The Data In R?
Answer :
There are many ways to export the data into another formats like SPSS, SAS , Stata , Excel Spreadsheet.

Question 30. Which Packages Are Used For Exporting Of Data?
Answer :
For excel xlsReadWrite package is used and for sas,spss ,stata foreign package is implemented.

Question 31. How Impossible Values Are Represented In R?
Answer :
In R NaN is used to represent impossible values.

Question 32. Which Command Is Used For Storing R Object Into A File?
Answer :
Save command is used for storing R objects into a file.
Syntax: >save(z,file=”z.Rdata”)

Question 33. Which Command Is Used For Restoring R Object From A File?
Answer :
load command is used for storing R objects from a file.
Syntax: >load(”z.Rdata”)


Question 34. What Is The Use Of Coin Package In R?
Answer :
Coin package is used to achieve the re randomization or permutation based statistical tests.

Question 35. Which Function Is Used For Sorting In R?
Answer :
order() function is used to perform the sorting.

Question 36. What Happens When The Application Object Does Not Handle An Event?
Answer :
The event will be dispatched to your delegate for processing.


Question 37. Explain App Specific Objects Which Store The App Contents.?
Answer :
The app specific objects are Data model objects that store app’s contents.

Question 38. Explain The Purpose Of Using Uiwindow Object?
Answer :
UIWindow object coordinates the one or more views presenting on the screen.

Question 39. How To Create Axes In The Graph?
Answer :
Using axes() function custom axes are created.

Question 40. What Is The Use Of Abline() Function?
Answer :
abline() function is add the reference line to a graph.
Syntax:-
abline(h=yvalues, v=xvalues)


Question 41. Why Vcd Package Is Used?
Answer :
vcd package provides different methods for visualizing multivariate categorical data.

Question 42. What Is Ggobi?
Answer :
GGobi is an open source program for visualization for exploring high dimensional typed data.

For other languages interview questions 👉👉👉 Click Here
Question 43. What Is Iplots?
Answer :
It is a package which provide bar plots, mosaic plots, box plots, parallel plots, scatter plots and histograms.

Question 44. What Is The Use Of Lattice Package?
Answer :
lattice package is to improve on base R graphics by giving better defaults and it have the ability to easily display multivariate relationships.

Question 45. What Is Fitdistr() Function?
Answer :
It is used to provide the maximum likelihood fitting of univariate distributions. It is defined under the MASS package.

Question 46. Which Data Structures Are Used To Perform Statistical Analysis And Create Graphs.?
Answer :
Data structures are vectors, arrays, data frames and matrices.

Question 47. What Is The Use Of Sink() Function?
Answer :
It defines the direction of output.

Question 48. Why Library() Function Is Used?
Answer :
This function is used to show the packages which are installed.

Question 49. Why Search() Function Is Used?
Answer :
By this function we see that which packages are currently loaded.

Question 50. On Which Type Of Data Binary Operators Are Worked?
Answer :
Binary operators are worked on matrices, vectors and scalars.

Question 51. What Is The Use Of Doby Package?
Answer :
It is used to define the desired table using function and model formula.

Question 52. Which Function Is Used To Create Frequency Table?
Answer :
Frequency table is created by table() function.

Question 53. What Is The Of Use Matrix Package?
Answer :
Matrix package includes those function which support sparse and dense matrices like Lapack, BLAS etc.

Question 54. Define Matlab Package.?
Answer :
Matlab package includes those wrapper functions and variable which are used to replicate matlab function calls.

Question 55. What We Use Party Package?
Answer :
It is used to provide a non-parametric regression for ordinal, nominal, censored and multivariate responses.

Question 56. Define Cluster.stats() ?
Answer :
It is define in fpc package which provide a method for comparing the similarity of two clusters solution using different validation criteria.

Question 57. What Is Pvclust() Function?
Answer :
It comes under the pvclust package which provides p-values for hierarchical clustering.

For other languages interview questions 👉👉👉 Click Here

Question 58. Which Function Performs Classical Multidimensional Scaling?
Answer :
cmdscale() function is used to perform classical multidimensional scaling.

Question 59. What Is The Use Of Boot.sem() Function?
Answer :
It is used to bootstrap the structural equation model.

Question 60. What Is Factominer?
Answer :
It is a package which includes quantitative and qualitative variables. It also includes supplementary variables and observations.

Question 61. What Is Principal() Function?
Answer :
It is define in psych package which is used to rotate and extract the principal components.

Question 62. Define Auto.arima().?
Answer :
It is used to handle the seasonal as well as non-seasonal ARIMA models.

Question 63. What Is The Use Of Forecast Package?
Answer :
It provides the functions which are used for automatic selection of ARIMA and exponential models.

Question 64. Define Lda().?
Answer :
lda() is used to print the discriminant functions which is based on centered variable.

Question 65. What Is The Use Of Mass Package?
Answer :
MASS functions include those functions which performs linear and quadratic discriminant function analysis.

Question 66. What Is Coxph()?
Answer :
It is a function which is used to model the hazard function on the set of predictor variable.

Question 67. Define Survival Analysis.?
Answer :
It includes number of techniques which is used for modeling the time to an event.

Question 68. Define Poison Regression.?
Answer :
It is used to predict the outcome variable which represents counts from the given set of continuous predictor variable.

Question 69. What Is Logistic Regression?
Answer :
Logistic regression is used to predict the binary outcome from the given set of continuous predictor variables.

Question 70. Which Variables Are Represented By Upper Case Letters?
Answer :
Categorical factors are represented by upper case letters.

Question 71. Which Variables Are Represented By Lower Case Letters?
Answer :
Numerical variables are represented by lower case letters.

Question 72. Define Barlett.test().?
Answer :
Barlett.test() is used to provide a parametric k-sample test of the equality of variances.

Question 73. What Is Fligner.test()?
Answer :
It is a function which provides a non-parametric k sample test of the equality of variances.

Question 74. What Is The Use Of Manova?
Answer :
By using MANOVA we can test more than one dependent variable simultaneously.

Question 75. Define Plotmeans().?
Answer :
It is define under gplots package which includes confidence intervals and it produces mean plot for single factors.

Question 76. What Is Robustbase?
Answer :
It is a package which provides basic robust statistics including model selection methods.

Question 77. Why Car Package Is Used?
Answer :
It  provide a variety of regression including scatter plots, variable plots and it also enhanced diagnostic.

Question 78. Define Relaimpo Package.?
Answer :
It is used to measure the relative importance of each of the predictor in the model.

Question 79. Define Stepaic() Function.?
Answer :
It is define under the MASS package which performs stepwise model selection under exact AIC.

Question 80. What Is The Use Of Diagnostic Plots?
Answer :
It is used to check the normality, heteroscedasticity and influential observations.

Question 81. What Is Npmc?
Answer :
It is a package which gives nonparametric multiple comparisons.

For other languages interview questions 👉👉👉 Click Here

Leave a Comment

Your email address will not be published. Required fields are marked *

error: Content is protected !!