Cluster analysis {Ans: Groups of closely related observations are more similar than data in other clusters}Volume {Ans: Size}Variety {Ans: Different types of data}Variability {Ans: "CP" could mean chest pain or cerebral palsy}High Dimensionality {Ans: Too many attributes}Predictive Tasks {Ans: Predicting the value of a particular attribute based on the values of other attributes}Descriptive Tasks {Ans: Deriving patterns (correlations, abnormalities) that summarize the underlying relationship of data}Predictive Modeling {Ans: Building a model for the target variable as a function of the explanatory variables (regression)}What is Data Mining {Ans: Data mining is the process of automatically discovering useful information in the large data repositories.}What is Predictive Analytics {Ans: The use of data, statistical algorithms, and machine learning techniques to identify the likelihood of future outcomes based on historical data. The goal is to go beyond knowing what has happened to providing a best assessment of what will happen in the future.}Postprocessing {Ans: Ensures that only valued and useful results are incorporated in to the decision support system.}Anomaly Detection {Ans: Identifying observations whose characteristics are significantly different from the rest of the data}Scalability {Ans: Changing the size, sample data/reduce dimension}"Curse