Chapter 1 Quiz from Introduction to Statistics by Ronald E. Walpole.

Data Analysis with Statistics and Machine Learning; Data Communication with Information Visualization; Data at Scale -- Working with Big Data. The class will focus on breadth and present the topics briefly instead of focusing on a single topic in depth.

Machine learning is the science of applying algorithms so that a system learns patterns within data in order to predict a value.

If the likelihood of getting the results is so small, then the results are ...

Categorical (or qualitative or attribute) data: consist of names or labels (representing categories).

Discrete data: result when the number of possible values is either a finite number or a 'countable' number.

Continuous data: result from infinitely many possible values that correspond to some continuous scale that covers a range of values without gaps, interruptions, or jumps.

Nominal level of measurement: characterized by data that consist of names, labels, or categories only; the data cannot be arranged in an ordering scheme (such as low to high).

Ordinal level of measurement: involves data that can be arranged in some order, but differences between data values either cannot be determined or are meaningless.

Interval level of measurement: like the ordinal level, with the additional property that the difference between any two data values is meaningful; however, there is no natural zero starting point (where none of the quantity is present).

Ratio level of measurement: the interval level with the additional property that there is also a natural zero starting point (where zero indicates that none of the quantity is present); for values at this level, differences and ratios are both meaningful.

For any data science problem, the first phase of the data science life cycle is data discovery.
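The levels of measurement defined above differ in which operations give meaningful answers. The following is a minimal Python sketch of that idea, using the conventional names nominal, ordinal, interval, and ratio for the four levels. All data values and variable names are hypothetical, invented purely for illustration; they do not come from the quiz or the textbook.

```python
from statistics import mode

# Hypothetical example values, invented for illustration only.
eye_colors = ["brown", "blue", "brown", "green"]      # nominal: labels, no order
survey = ["low", "high", "medium", "low", "medium"]   # ordinal: ordered categories
temps_c = [10.0, 20.0]                                # interval: no natural zero
weights_kg = [10.0, 20.0]                             # ratio: natural zero exists

# Nominal: counting is meaningful, so the most frequent label can be reported.
most_common_color = mode(eye_colors)

# Ordinal: values can be ranked once an order is stated explicitly,
# so a middle (median) response can be picked out.
rank = {"low": 0, "medium": 1, "high": 2}
sorted_survey = sorted(survey, key=rank.__getitem__)
median_response = sorted_survey[len(sorted_survey) // 2]

# Interval: differences are meaningful, but ratios are not. The "ratio"
# of two temperatures changes when the zero point moves (Celsius vs. Kelvin).
temp_diff = temps_c[1] - temps_c[0]
ratio_c = temps_c[1] / temps_c[0]
ratio_k = (temps_c[1] + 273.15) / (temps_c[0] + 273.15)

# Ratio: zero means "none of the quantity", so ratios are meaningful and
# stay the same when the unit changes (kilograms vs. grams).
weight_ratio_kg = weights_kg[1] / weights_kg[0]
weight_ratio_g = (weights_kg[1] * 1000) / (weights_kg[0] * 1000)

print(most_common_color, median_response, temp_diff)
print(ratio_c, round(ratio_k, 3), weight_ratio_kg, weight_ratio_g)
```

In this sketch the Celsius and Kelvin ratios disagree, which is one way to see why ratios carry no meaning at the interval level, while the weight ratio is the same in either unit.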
Statistical specification of the problem.

Statistics: the science of planning studies and experiments, obtaining data, and then organizing, summarizing, presenting, analyzing, interpreting, and drawing conclusions based on the data.

Population: the complete collection of all individuals (scores, people, measurements, and so on) to be studied; the collection is complete in the sense that it includes all of the individuals to be studied.

What are the ways to address data quality issues?

Data Science is an umbrella term which encompasses multiple skills and scientific techniques. Data Visualization 2. Data Manipulation 3.