Case study of gene expression analysis

The paper by Golub et al. that was the focus of the second part of the BioConductor practical was the first analysis of its kind, demonstrating that gene expression analysis could potentially be used to classify leukaemia sub-types. Since its publication in 1999 there has been considerable interest in developing this approach for diagnostic use and a paper about this, published in Blood in 2002, and provided, is the focus for this assignment.

In this paper, Mary Ross and her colleagues at St Jude’s Children’s Research Hospital in Memphis, performed gene expression analysis on a larger cohort of 155 leukaemia cases, from five different sub-types. While the paper by Golub showed that it was possible to broadly classify leukaemia with this kind of analysis, this later paper demonstrated that even closely related sub-types have very distinct gene expression signatures.
The aim of this assignment is to reproduce the analysis of the data set described in the paper by Ross et al. You are provided with all of the raw Affymetrix data files that were used in the original analysis, and the experiment description files that define which samples belong to the training and test data sets. You will be expected to use a machine learning approach to classification of the data, perform hierarchal clustering on the data, and identify a sub set of genes whose expression defines the data classes.

   Related Questions in Financial Accounting

  • Q : Accountant & Financial In Business


    1. Identify the services or programs to be included in the cost and profitability analysis.

    2. Examine the costs listed in Table 2.

    a. Identify the direct costs associated with each service or program.

    b. Which costs would be organization

  • Q : Objective Questions on Sociology 1)

    1) Which large European city declined significantly in population over the past century?

    A) Paris

    B) London

    C) Rome

    D) Madrid

    2) The industrial city was characterized b

    A) decentralization

    B) corporate growt

  • Q : Calculate the bad debt expense for the

    The Webster Company uses the aging method to estimate the allowance for doubtful accounts. The following schedule of accounts receivable was prepared as at December

    31, 20x6:

    Age Balance %

  • Q : Small talk Define small talk and

    Define small talk and discuss its role in developing the relationship.

  • Q : Interest rate parity for determination

    Describe the allegations of interest rate parity for the determination of the exchange rate.

  • Q : Creation of North American Trade

    Mr. Ross Perot, former Presidential candidate of the Reform Party, that is the third political party in the United States, had strongly protested in the creation of North American Trade Agreement (NAFTA), however, which was inaugurated in the year 1994, due to fear of

  • Q : European Monetary System Discuss the

    Discuss the workings and arrangements of European Monetary System (EMS).

  • Q : Meso and Macro level theories of

    Identify and elucidate three meso- and/or macro-level theories about deviance.

  • Q : Cross-border acquisitions and green

    Why host country resist cross-border acquisitions, instead of the green field investments? Explain your point of view?

  • Q : What is Subsidiary bank State what is

    State what is meant by Subsidiary bank.