Research data are factual records collected, observed or generated for analysis as well as producing and validating original research findings.
Primary data are data that are collected first-hand by a researcher himself/herself, aiming to solve the research problem in focus.
Secondary data are created for a purpose by others and made available for reuse, and now collected for another purpose.
Data could be categorized according to the methods of creation:
1. Observational
2. Experimental
3. Simulation
4. Derived or compiled
Other than differentiating research data into either quantitative or qualitative data, research data could also be categorized according to its form:
1. Text
Text data are usually documents which consist of words, sentences, or paragraphs of free-flowing text. They are usually less structured. Textual data include:
Data in textual formats could be in the following file formats: plain text, pdf, word, html, xml
2. Numeric
Numerical data, often referred to as quantitative data, are in the form of numbers, and not in any language or descriptive form. Numerical data include:
Numeric data could be in the following file formats: Stata, SPSS, Excel, GIS
3. Audiovisual
Audiovisual data include:
File formats for audiovisual data include: jpeg, png, tiff, mp3, wav, mpeg, quicktime
4. Code
Code includes the software-specific code files to carry out data processing steps, as well as the development code. It includes
Code could be in the following file formats: Python, Java, Stata, SPSS, R, MATLAB
5. Discipline Specific
Discipline specific data are broad and vary according to subjects. Examples of discipline specific data are: