
Understanding of genes and their products

For this assignment you will have to make use of information and skills gained throughout the 4 weeks of the introductory programme. This may include an understanding of genes and their products, how to find and reference information from different sources, and specifically how to execute code in Perl, Java and R. It will also test your logical thinking and a little bit of basic maths!

Part A. 

You are provided with an HGNC gene symbol: HTRA1. The protein encoded by this gene is involved in human disease. Prepare a one-page summary of the gene, its product, the disease and the role of the gene/protein in the disease.

Part B. 

Obtain the sequence for the gene and the sequence for the RNA transcript in FASTA format. If there is more than one transcript, choose an appropriate one and explain your choice. Using Perl, convert the FASTA format files into simple strings containing only nucleotides (save those for later) and determine the amino acid sequence of the protein.
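
The assignment asks for Perl here, so the following is only an illustrative sketch of the two steps (stripping a FASTA record down to a nucleotide string, then translating a coding sequence), written in Java because Java is also used in this assignment. The class and method names are hypothetical. It assumes the string passed to translate() already starts at the start codon, which is not true of a full transcript with untranslated regions, and it uses the standard genetic code.

```java
class FastaSketch {
    // Standard genetic code, indexed by codon position with bases ordered T, C, A, G.
    static final String BASES = "TCAG";
    static final String AMINO_ACIDS =
        "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG";

    // Drop header lines (starting with '>') and blank lines, join the rest.
    static String fastaToSequence(String fasta) {
        StringBuilder sequence = new StringBuilder();
        for (String line : fasta.split("\\R")) {
            String trimmed = line.trim();
            if (trimmed.isEmpty() || trimmed.startsWith(">")) {
                continue;
            }
            sequence.append(trimmed.toUpperCase());
        }
        return sequence.toString();
    }

    // Translate codon by codon from position 0 until the first stop codon.
    // Assumption: the input begins at the start codon of the coding sequence.
    static String translate(String codingSequence) {
        String dna = codingSequence.toUpperCase().replace('U', 'T');
        StringBuilder protein = new StringBuilder();
        for (int i = 0; i + 3 <= dna.length(); i += 3) {
            int first = BASES.indexOf(dna.charAt(i));
            int second = BASES.indexOf(dna.charAt(i + 1));
            int third = BASES.indexOf(dna.charAt(i + 2));
            if (first < 0 || second < 0 || third < 0) {
                protein.append('X'); // ambiguous base such as N
                continue;
            }
            char aminoAcid = AMINO_ACIDS.charAt(16 * first + 4 * second + third);
            if (aminoAcid == '*') {
                break; // stop codon
            }
            protein.append(aminoAcid);
        }
        return protein.toString();
    }

    public static void main(String[] args) {
        // Toy record, not the real HTRA1 sequence.
        String fasta = ">toy_record example\nATGGCC\ntaa\n";
        String sequence = fastaToSequence(fasta);
        System.out.println(sequence);
        System.out.println(translate(sequence));
    }
}
```

In a real solution the coding region would first have to be located within the transcript, for example from the database annotation of the chosen transcript.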

Part C. 

Using the sequences prepared with Perl, calculate and compare the molecular weights of the gene, the mRNA and the protein using Java.
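
One possible shape for this calculation is sketched below: sum a per-residue mass over the sequence and add a terminal correction. The class name, method name and toy inputs are hypothetical. The mass tables and corrections are commonly quoted approximate average values (g/mol) and are assumptions here; check them against a source you can cite. The DNA figures describe a single strand, so a double-stranded gene would need both strands added together.

```java
import java.util.Map;

class MolecularWeightSketch {
    // Approximate average residue masses in g/mol (assumed values, verify before use).
    static final Map<Character, Double> DNA =
        Map.of('A', 313.21, 'C', 289.18, 'G', 329.21, 'T', 304.20);
    static final Map<Character, Double> RNA =
        Map.of('A', 329.21, 'C', 305.18, 'G', 345.21, 'U', 306.17);
    static final Map<Character, Double> PROTEIN = Map.ofEntries(
        Map.entry('G', 57.0519), Map.entry('A', 71.0788), Map.entry('S', 87.0782),
        Map.entry('P', 97.1167), Map.entry('V', 99.1326), Map.entry('T', 101.1051),
        Map.entry('C', 103.1388), Map.entry('L', 113.1594), Map.entry('I', 113.1594),
        Map.entry('N', 114.1038), Map.entry('D', 115.0886), Map.entry('Q', 128.1307),
        Map.entry('K', 128.1741), Map.entry('E', 129.1155), Map.entry('M', 131.1926),
        Map.entry('H', 137.1411), Map.entry('F', 147.1766), Map.entry('R', 156.1875),
        Map.entry('Y', 163.1760), Map.entry('W', 186.2132));

    // Terminal corrections (assumed): single-stranded DNA without a 5' phosphate,
    // RNA with a 5' triphosphate, and one water molecule for a peptide chain.
    static final double DNA_END = -61.96;
    static final double RNA_END = 159.0;
    static final double PROTEIN_END = 18.015;

    // Sum the mass of every residue in the sequence, then add the end correction.
    static double weight(String sequence, Map<Character, Double> masses, double endCorrection) {
        double total = 0.0;
        for (char symbol : sequence.toUpperCase().toCharArray()) {
            Double mass = masses.get(symbol);
            if (mass == null) {
                throw new IllegalArgumentException("Unexpected symbol: " + symbol);
            }
            total += mass;
        }
        return total + endCorrection;
    }

    public static void main(String[] args) {
        // Toy inputs; in the assignment these would be the strings saved in Part B.
        System.out.println(weight("ACGT", DNA, DNA_END));
        System.out.println(weight("ACGU", RNA, RNA_END));
        System.out.println(weight("MA", PROTEIN, PROTEIN_END));
    }
}
```

Keeping the tables separate from the summing loop means the same method serves all three molecule types, and a different set of reference masses can be swapped in without touching the logic.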

Part D. 

You now have sequences for the gene, the RNA and the protein. Write an R script to calculate answers to the following:

Take the current estimate of the global population and multiply it by the estimate for the average number of cells in the human body. From this, determine the total number of nucleotides representing the coding part of the gene in all living humans.

If all of these nucleotides were printed out using 12pt Arial font on A4 with 3cm margins and the sheets laid end-to-end, how long would it take to drive along the paper at 30 km/h? For comparison, how long would it take to drive the length of this sequence of nucleotides at the same speed if it were in the form of a double-stranded DNA helix laid end-to-end?
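
The assignment asks for an R script here, so the sketch below only lays out the arithmetic, in Java for consistency with Part C. All names are hypothetical and every number in main() is a placeholder to be replaced with estimates you have looked up and referenced. Two figures are built in as assumptions: a rise of about 0.34 nm per base pair for the DNA double helix, and an A4 sheet length of 297 mm. The number of characters that fit on a page with the stated font and margins has to be measured or estimated separately.

```java
class SequenceLengthSketch {
    // Assumed rise per base pair of a DNA double helix, in metres (about 0.34 nm).
    static final double RISE_PER_BASE_PAIR_M = 0.34e-9;
    // Length of an A4 sheet in metres.
    static final double A4_LENGTH_M = 0.297;

    // people x cells per person x coding nucleotides counted per cell
    static double totalNucleotides(double population, double cellsPerPerson,
                                   double codingNucleotidesPerCell) {
        return population * cellsPerPerson * codingNucleotidesPerCell;
    }

    // Length of a double helix with the given number of base pairs, in km.
    static double helixLengthKm(double basePairs) {
        return basePairs * RISE_PER_BASE_PAIR_M / 1000.0;
    }

    // Length of the printed sheets laid end-to-end, in km.
    static double paperLengthKm(double nucleotides, double charsPerPage, double pageLengthM) {
        double pages = Math.ceil(nucleotides / charsPerPage);
        return pages * pageLengthM / 1000.0;
    }

    // time = distance / speed
    static double drivingHours(double distanceKm, double speedKmPerHour) {
        return distanceKm / speedKmPerHour;
    }

    public static void main(String[] args) {
        // Placeholder inputs only: replace each with a referenced estimate.
        double population = 8.0e9;
        double cellsPerPerson = 3.0e13;
        double codingNucleotidesPerCell = 1500.0;
        double charsPerPage = 3000.0;

        double total = totalNucleotides(population, cellsPerPerson, codingNucleotidesPerCell);
        double paperKm = paperLengthKm(total, charsPerPage, A4_LENGTH_M);
        double helixKm = helixLengthKm(total);
        System.out.println("Paper:  " + drivingHours(paperKm, 30.0) + " hours");
        System.out.println("Helix:  " + drivingHours(helixKm, 30.0) + " hours");
    }
}
```

Whether to count one or two copies of the gene per cell, and whether every cell type carries nuclear DNA, are choices the written answer should state and justify.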

Presentation. 

There are also 10 marks available for good presentation of Part A, correct referencing and well-formatted, well-commented code.

Submission

You should submit your one-page summary and your Perl, Java and R scripts, along with the solutions to the two final questions, to the Digital Drop Box on Blackboard.

Throughout the assignment, reference your sources of information appropriately.
