Examlex
A bank is interested in identifying different attributes of its customers and below is the sample data of 150 customers. In the data table for the dummy variable Gender, 0 represents Male and 1 represents Female. And for the dummy variable Personal loan, 0 represents a customer who has not taken personal loan and 1 represents a customer who has taken personal loan.
Partition the data into training (50 percent), validation (30 percent), and test (20 percent) sets. Use logistic regression to classify observations as Personal loan taken (or not taken) using Age, Gender, Work experience, Income (in 1000 $), and Family size as input variables and Personal loan as the output variable. Perform an exhaustive-search best subset selection with the number of best subsets equal to 2.
a. From the generated set of logistic regression models, select one that you believe is a good fit. Express the model as a mathematical equation relating the output variable to the input variables.
b. Increases in which variables increase the chance of a customer who has taken the personal loan? Increases in which variables decrease the chance of a customer who has not taken the personal loan?
c. Using the default cutoff value of 0.5 for your logistic regression model, what is the overall error rate on the test data?
Measurement
The action of measuring something, or the size, length, or amount of something, usually established by using standard units.
Metal Rusts
The process of corrosion where metals, typically iron, react with oxygen and moisture to form rust.
Density
The mass per unit volume of a substance, often measured in grams per cubic centimeter (g/cm³).
Mass
An estimation of the quantity of matter present in an object, commonly measured in grams or kilograms.
Q3: For the following sample data, compute the
Q3: Which of the following is the dominant
Q7: Consider the following time series:<br> <img src="https://d2lvgg3v3hfg70.cloudfront.net/TB1880/.jpg"
Q11: The _ is a measure of the
Q12: The inhalation of which substance is the
Q25: The _ of a set of points
Q32: A research center is interested in investigating
Q36: Consider the following time series data:<br> <img
Q42: Any data value with a z-score less
Q51: The _ is a point estimate of