Case study of gene expression analysis

The paper by Golub et al. that was the focus of the second part of the BioConductor practical was the first analysis of its kind, demonstrating that gene expression analysis could potentially be used to classify leukaemia sub-types. Since its publication in 1999 there has been considerable interest in developing this approach for diagnostic use and a paper about this, published in Blood in 2002, and provided, is the focus for this assignment.

In this paper, Mary Ross and her colleagues at St Jude’s Children’s Research Hospital in Memphis, performed gene expression analysis on a larger cohort of 155 leukaemia cases, from five different sub-types. While the paper by Golub showed that it was possible to broadly classify leukaemia with this kind of analysis, this later paper demonstrated that even closely related sub-types have very distinct gene expression signatures.
 
The aim of this assignment is to reproduce the analysis of the data set described in the paper by Ross et al. You are provided with all of the raw Affymetrix data files that were used in the original analysis, and the experiment description files that define which samples belong to the training and test data sets. You will be expected to use a machine learning approach to classification of the data, perform hierarchal clustering on the data, and identify a sub set of genes whose expression defines the data classes.

   Related Questions in Financial Accounting

  • Q : Analyse the ramifications for

    HOMEWORK ASSIGNMENT FOR ADMINISTRATIVE LAW"The problem in today's complex legal environment is that the law is not able to be divided conveniently into segments. Any apparently discrete sect

  • Q : Stages in the life cycle of a family

    There are seven typical stages in the life cycle of a family with children. Fully explain and give an example to describe each of those seven stages.

  • Q : Write a Matlab function Fourbar Write a

    Write a Matlab function Fourbar (r1,r2,r3,r4,theta,speed) that animates three cycles of a fourbar linkage having link lengths r1, r2, r3, r4. The function first checks to ensure the mechanism isGrashof (including the change-point c

  • Q : Function of budgetary control play in

    Describe the function of budgetary control play in cost control? And also write down the requirements for its triumphant execution?

  • Q : What is Triangular arbitrage What is

    What is meant by the Triangular arbitrage?  Explain about the condition which provides rise to opportunity of the triangular arbitrage?

  • Q : Define Income Statement How to do

    How to do income statement = from the revenues we will deduct all the expenses related to that period to get the income or loss. When the revenues are more than the expenses then it is income and when the expenses are more than the revenues then it is

  • Q : Case study of a wind turbine for rural

    The goal of this long problem is to validate the turbine performance estimates in specific (XYZ) wind regimes, and estimate its cost.  Below is a list of tasks you will need to accomplish, but you are not limited to these if you want to do more:  

  • Q : Calculate the PV You expect the price

    You expect the price of the stock 3 years from now to be $119.04 (i.e., you expect P ˆ   3  ?? = $119.04). Discounted at a 10% rate, what is the present value of this expected future stock price? In other words, calculate the PV of $119.04.&nb

  • Q : Calculate depreciation expense for the

    On December 31, 20x3, the PPE Company purchased an asset costing $1,000,000. The asset’s useful life is expected to be 10 years with a residual value of $300,000. a. Calculate the depreciation expense for 20x4 using:

  • Q : What is Arbitrage Describe the term

    Describe the term Arbitrage.

©TutorsGlobe All rights reserved 2022-2023.