


Data Visualization may also be used to examine the data in graphical format, to obtain additional insight regarding the messages within the data. Various data analysis techniques are available to understand, interpret, and derive conclusions based on the requirements.

Data Analysisĭata that is processed, organized and cleaned would be ready for the analysis. Likewise, quantitative data methods can be used for outlier detection that would be subsequently excluded in analysis. For example, while cleaning the financial data, certain totals might be compared against reliable published numbers or defined thresholds. There are several types of Data Cleaning that depend on the type of data. Data Cleaning is the process of preventing and correcting these errors. The processed and organized data may be incomplete, contain duplicates, or contain errors. For example, the data might have to be placed into rows and columns in a table within a Spreadsheet or Statistical Application. This includes structuring the data as required for the relevant Analysis Tools. The data that is collected must be processed or organized for analysis. Hence, the collected data is required to be subjected to Data Processing and Data Cleaning. The data thus obtained, may not be structured and may contain irrelevant information. Data Collection provides both a baseline to measure and a target to improve.ĭata is collected from various sources ranging from organizational databases to the information in web pages. Data Collection ensures that data gathered is accurate such that the related decisions are valid. The emphasis is on ensuring accurate and honest collection of data. Data Collectionĭata Collection is the process of gathering information on targeted variables identified as data requirements. Specific variables regarding a population (e.g., Age and Income) may be specified and obtained. Based on the requirements of those directing the analysis, the data necessary as inputs to the analysis is identified (e.g., Population of people). The data required for analysis is based on a question or an experiment. The terms Data Modeling and Data Analysis mean the same.ĭata Analysis Process consists of the following phases that are iterative in nature − Data visualization is at times used to portray the data for the ease of discovering the useful patterns in the data. The results so obtained are communicated, suggesting conclusions, and supporting decision-making.

Data Analysis is a process of collecting, transforming, cleaning, and modeling data with the goal of discovering the required information.
