--%>

Principles of data analysis

For the data analysis project, you will address some questions that interest you with the statistical methodology we are learning in class. You choose the questions; you decide how to collect data; you do the analyses. The questions can address almost any topic, including topics in economics, psychology, sociology, natural science, education, medicine, public policy, sports, law, etc.

The project requires you to synthesize all the materials from the course. Hence, it's one of the best ways to solidify your understanding of statistical methods. Plus, you get answers to issues that pique your intellectual curiosity.

In twenty (20) PowerPoint slides or more, please create a presentation that adequately addresses and answers your statistical question(s). Include your random sampling, calculations, graphs, charts, hypothesis, conclusion, and anything pertinent to your

“statistical question(s).”

The most important aspects of any statistical analysis are stating questions and collecting data. To get the full experience of running your own study, the project requires you to analyze data that you collect. It is not permissible to use data sets that have been put together by others. You are permitted to collect data off of the web; however, you must be the one who decides on the analyses and puts the data set together.

Good projects begin with very clear and well-defined hypotheses. You should think of questions that interest you first, and then worry about how to collect and analyze data to address those questions. Generally, vague topics lead to uninteresting projects. For example, surveying Harvard Undergraduates to see which sex studies more does not yield a whole lot of interesting conclusions. On the other hand, it would be interesting to hypothesize why men or women study more, and then figure out how to collect and analyze data to test your hypotheses.

Practical Advice: It is often easier to collect accurate experimental data than accurate survey data. Non-responses tend to be less of an issue with projects based on experiments than with those based on surveys. I strongly encourage you to consider experiments as opposed to surveys. For those who want to do surveys, consider using students in dorms or certain courses as target populations. Make every effort to get a random sample, and try to keep track of the characteristics of non-respondents. You will have non-responses; however, your project will not be penalized for a non-response as long as you document it and hypothesize how it might affect your results.

   Related Questions in Basic Statistics

  • Q : Creating Grouped Frequency Distribution

    Creating Grouped Frequency Distribution: A) At first we have to determine the biggest and smallest values. B) Then we have to Calculate the Range = Maximum - Minimum C) Choose the number of classes wished for. This is generally between 5 to 20. D) Find out the class width by dividing the range b

  • Q : Sample Questions in Graphical Solution

    Solved problems in Graphical Solution Procedure, sample assignments and homework Questions: Minimize Z = 10x1 + 4x2 Subject to

  • Q : Cumulative Frequency and Relative

    Explain differences between Cumulative Frequency and Relative Frequency?

  • Q : Hw An experiment is conducted in which

    An experiment is conducted in which 60 participants each fill out a personality test, but not according to the way they see themselves. Instead, 20 are randomly assigned to fill it out according to the way they think a parent sees them (i.e. how a parent would fill it out to describe the participant

  • Q : Define SPIN simulation modes SPIN: •

    SPIN: • SPIN generates C program that is the model checker – The pan verifier • Process Analyzer – Run the pan executable to do the model check

  • Q : State Kendalls notation

    Kendall’s notation:  A/B/C/K/m/Z A, Inter-arrival distribution M exponential D constant or determ

  • Q : Building Models Building Models • What

    Building Models • What do we need to know to build a model?– For model checking we need to specify behavior • Consider a simple vending machine – A custome rinserts coins, selects a beverage and receives a can of soda &bul

  • Q : Probability how can i calculate

    how can i calculate cumulative probabilities of survival

  • Q : Networks of queues Networks of queues •

    Networks of queues • Typically, the flow of customers/request through a system may involve a number of different processing nodes.– IP packets through a computer network– Orders through a manufactur

  • Q : Program Evaluation and Review

    Program Evaluation and Review Technique (PERT) A) Developed by US Navy and a consulting firm in 1958 for the Polaris submarine project. B) Technique as for CPM method, but acti