StudySmarter: Study help & AI tools
4.5 • +22k Ratings
More than 22 Million Downloads
Free
Every time you bake a cake, you probably break down the process into different steps. You may pick a recipe, go to the shops, arrange the ingredients, and follow the recipe. Data handling is somewhat similar. Researchers follow a logical order to get the best quality end product possible.
Explore our app and discover over 50 million learning materials for free.
Lerne mit deinen Freunden und bleibe auf dem richtigen Kurs mit deinen persönlichen Lernstatistiken
Jetzt kostenlos anmeldenNie wieder prokastinieren mit unseren Lernerinnerungen.
Jetzt kostenlos anmeldenEvery time you bake a cake, you probably break down the process into different steps. You may pick a recipe, go to the shops, arrange the ingredients, and follow the recipe. Data handling is somewhat similar. Researchers follow a logical order to get the best quality end product possible.
Data handling is vital in any psychological study and directly impacts the quality of the results obtained. Let's take a closer look at data handling and analysis.
Once data has been collected, there are several things that the researchers need to do, and one of these is data handling.
Data handling is the process of organising and analysing raw data using a logically valid and reliable process to establish if the findings from the study support or reject the hypothesis stated at the start of the experiment.
The steps that researchers take during data analysis are important because they can affect the validity and reliability of the findings. For instance, if the researcher found something unexpected and chose to ignore the variables they were initially interested in, the study will no longer be investigating what it intends to. Thus, lowering the validity of the study.
Moreover, if researchers don't use standardised procedures to analyse findings, this can lower the reliability of the study.
Now that we have learned the importance of data handling, let's uncover what steps data handling includes. There are six steps in data handling, which are:
Steps | Description |
Collection | The first step of data handling is to collect raw data from participants or whichever source the information is being collected from. |
Preparation | The second step involves the preparation of the raw data for further analysis; this may involve coding data. E.g. male participants may be ascribed 0, and female participants may be attributed 1. |
Input | The third involves inputting and storing the data. The findings should be stored securely to maintain participant confidentiality. |
Processing | The fourth step involves conducting computational or statistical tests relevant to the research. E.g. if trying to understand the relationship between two variables, a correlational analysis should be run. |
Output | The next step is verifying if the findings support or disprove the hypothesis. |
Storage | The last step is to save the data securely. |
Before a researcher analyses their research or even handles it, the researcher should have a clear and carefully plan on the direction of their research. For instance, before data is collected, the researcher should know the source of where they will collect the information.
Researchers can use primary or secondary data in their research.
Primary research refers to data the researcher collects themself, and secondary data uses data collected from others, e.g. previously published findings or statistics from government sites and databases.
Data analysis is a step that follows after a researcher has handled data.
Data analysis is the process in which graphical and quantitative or statistical techniques are applied to raw data to identify general patterns.
In research methods, two types of data are collected: qualitative and quantitative. Qualitative data is processed slightly differently from quantitative data. Let's take a look at data handling in quantitative analysis.
Quantitative data is data that is expressed in numerical form. Quantitative analysis usually involves using a mathematical approach and statistics to identify whether the findings support or disprove hypotheses.
Statistical analysis includes various mathematical calculations using probability models to make inferences from a given data set and draw conclusions about broader populations.
When carrying out statistical analyses, there are essential data handling steps that the researcher needs to consider. The reason is that statistical tests have certain conditions.
One of these conditions is identifying the level of measurement. There are four levels of measurement which essentially distinguish the different characteristics of variables. It is essential to understand the characteristics of variables because these will hint at which statistical analyses could be done and which could not.
Level of Measurement | Definition | Variable Example |
Nominal | Distinguishes differences, but there is no order to them, and we can't measure how much each quantitatively differs. | Participants eye colour |
Ordinal | Distinguishes differences and identifies that the values have a rank order, but the difference can't be quantitively measured. | Response on a Likert scale |
Interval | There is an order, and the differences between figures are measurable. Interval and ratio data can be of infinite value, but unlike ratio, interval data can go below 0. | Temperature |
Ratio | The ratio is the same as the interval with the difference that there is an absolute 0, meaning the values of the variable cannot go below 0. | Height |
As you learned, data analysis is the process in which statistical techniques are applied to find patterns within a sample. The data analysis in a study usually follows two steps. First, descriptive statistics are inspected, and then statistical analyses are performed.
As the name suggests, descriptive statistics describe the data's characteristics, and the two main types of descriptive statistical tests used are the measures of central tendency and measures of dispersion.
The measures of central tendency tests are used to calculate averages, and the three main types of tests are:
The measures of dispersion tests are used to measure the spread/ variance of the data. The common computations calculated are:
Generally, descriptive statistics involve presenting the data. This presentation of data is usually done using graphs. The following are types of graphs:
Tables are used to show contrasts between a few sets of data. For instance, the table above shows the difference between control and drug conditions according to mean and standard deviation measurements.
Control | Drug condition | |
Mean | 119 | 86 |
Standard deviation | 23 | 98 |
Bar charts show the results of different conditions (or variables) using bars of different heights. Bar charts illustrate the differences between groups and make identifying trends and patterns easier.
Scattergrams are also incorporated in correlational research. From scatterplots, researchers can identify the direction and strength of the relationship between two variables.
Before inferential tests are conducted, researchers usually run descriptive analyses. Following this, the distribution of the data is analysed.
Distribution measures the spread of data from the average; it is a form of probability statistic that makes estimations concerning a sample.
The type of distribution found will affect what statistical analyses can do later.
When data is normally distributed, parametric tests are employed, and when data is not normally distributed, non-parametric tests are employed.
There are two types of distribution:
The statistical analyses employed in psychology research use inferential statistics to identify if the data supports or negates their hypothesis.
Sometimes we may find differences, but these may not be significant. The accepted level of probability in psychology is 0.05 (5%). Now, what exactly does this mean in research? Significance levels below this threshold suggest the observed change in the dependent variable is likely due to the manipulation of the independent variable. However, if it is above the threshold, the observed changes are likely due to chance.
The 5% considers extraneous variables that may have influenced the dependent variable.
The findings should not be accepted when research establishes differences, but the results are non-significant.
Let's look into a real example of how data is handled and analysed in research.
In a study looking into the relationship between revision time and exam performance, researchers would first consider they will gather their data.
Revision time can be collected by asking participants to record how long they spent revising, and exam performance can be measured by looking at pupils' test scores on their report cards.
Initially, the researchers would carry out analyses concerning descriptions, e.g. the mean scores for revision time and exam performance. The findings may be reported as students in the current sample reported a mean revision time of around 6h (M = 5.78) and an average score of 78 points out of 100 in the exam (M = 78).
The two variables' distribution will be explored through a histogram. Depending on the visual inspection of the distribution, parametric or non-parametric tests would be performed. Let's imagine that the distribution of the two variables was normally distributed.
Researchers would then move on to data analysis, i.e. inferential statistics. To test the hypothesis, the researchers may run a correlational analysis. The Pearson correlation may be interpreted as the analysis shows a positive correlation between revision time and exam performance, r (20) = .78, p = .05.
Further to the written interpretation, researchers would include a scatterplot visually resembling the same interpretation.
Psychologists use data handling and analysis to interpret the data they collect from their studies.
There are different data types: qualitative, quantitative, primary, and secondary.
Descriptive statistics are graphs, tables, and summaries used to identify trends and analyse research data.
Levels of measurement include ordinal, ratio, interval, and nominal.
Thematic analysis is used to analyse qualitative data, and inferential testing is used to analyse quantitative data.
Psychologists use data handling and analysis to interpret data collected from research.
Data analysis is concluding data that has been gathered and presented using graphs and charts. Before data can be analysed, it needs to be handled, e.g., it needs to be stored and organised in a way that makes analysis easier.
To identify if the findings from the study support or reject the hypothesis proposed.
There are many types of data analysis, including measures of central tendency, graphs, inferential testing, (non-) parametric tests, probability and significance, thematic analysis, and more.
An example of data analysis is thematic analysis; this involves analysing qualitative data by identifying common themes throughout the text.
What are the steps in data handling?
What is quantitative data?
Numerical data.
What is qualitative data?
Data that uses words rather than numbers.
Give an example of qualitative data.
A diary entry.
Which of these is not a measure of central tendency?
Medium.
What do measures of central tendency aim to find?
Averages.
Already have an account? Log in
Open in AppThe first learning app that truly has everything you need to ace your exams in one place
Sign up to highlight and take notes. It’s 100% free.
Save explanations to your personalised space and access them anytime, anywhere!
Sign up with Email Sign up with AppleBy signing up, you agree to the Terms and Conditions and the Privacy Policy of StudySmarter.
Already have an account? Log in