Tuesday 22 September 2015

'Junk' DNA vs The World 2

Looking over the skyline of Camden as the sun sets on my first day of university life, I begin to to contemplate my place in our world. How much impact could I ever make? I am one average person in virtual sea of more prolific people, all trying to find meaning in an existence that intrinsically has none (I blame the Satorl Marsh night theme I'm listening to for that last bit). But my existential crisis aside, I did begin to think about the human genome, and how the neglected 'junk' elements of it must feel, always being in the shadow of the protein coding genes, which have a reputation as the be all and end all of molecular biology. Although personifying genetic material is not really a valid way to think about the issue... Therefore I will once again champion the Junk DNA in its quest for genomic appreciation. As I stated last time, human proteins are often the same size as equivalent proteins in simpler organisms; it is the sections of intervening junk that increase in size as the complexity of an organism increases. This creates what Nessa Carey describes as a large 'signal-to-noise ratio' during analysis of genes within our genome, as there's only a relatively small region that codes for protein, embedded in a large section of junk. But this DNA must have a role to play, otherwise we would be no more complex than a fly or a worm... (only a slight exaggeration...) Some complexity is added by post transcriptional modification of mRNA, in a process known as splicing. In splicing the exons, sections of DNA which code for proteins, are cut away from the intron sequences (the junk), and can be arranged in various fashions to generate different mRNA molecules from the same gene. Over 60% of human genes produce multiple splicing variants, however this only goes so far in explaining our relative complexity. Back in 2001, in the wake of the Human Genome Project, researchers thought the junk may not even have a functional role, as the pufferfish Fugu rubripes has a genome 13% the length of our own (devoid of much of the junk we see in our genomes), and is still a successful organism. It was hypothesised that the junk was parasitic, selfish DNA using our DNA as host for future proliferation (I just can't escape you Richard...). Although based on what we know about evolution, it is irrational to assume that just because the DNA has no obvious function in one organism, it does not mean it is useless in all organisms (especially in more complex species). One theory supporting the importance of junk DNA states that it could act as mutation insulation for the protein coding genes, as it makes it decreases the likelihood that a point mutation will affect a protein coding sequence, a benefit of the large 'signal-to-noise ratio'. The reason humans would need more mutation insulation than simpler organisms is because we have longer life spans, and therefore we accumulate more mutations, and we also produce fewer offspring, so it is beneficial that they are better protected from mutations in order to ensure species survival. In terms of cost-benefit, it isn't worth expending resources in simpler organisms to protect protein coding genes, as even if they accumulate a mutation they are still viable (fewer systems can be affected by the lack of protein production), and the offspring will likely still survive and breed (ah the virtues of simplicity). So who's irrelevant now 2001 geneticists? But seriously, if the junk DNA is used as a buffer for mutations, which have the potential to disrupt vital metabolic processes for example, is the nomenclature 'junk' still applicable? Not in my book, and this is my blog. Deal with it. How empirical of me.

Thursday 3 September 2015

'Junk' DNA vs The World

Proteins, proteins, proteins; Biologists are obsessed with them. Sure they are the reason us complex organisms can exist, but as a result of this infatuation with our polypeptide friends, we often overlook a rather important aspect of the human genome. Only 2% of our genome codes for proteins, and so until recently a large proportion of the other 98% was dismissed as 'junk' DNA. However, we are now realising that proteins may be the final end points required for life, but they could never be properly synthesised and organised without the junk genetic material. In fact, research into the genomic differences between more complex and simple organisms has concluded that the only genetic change that occurs as biological complexity increases is a greater percentage of DNA that does not code for proteins. Junk DNA can code for RNA, not the mRNA that is used for translation of proteins, but RNA that is functional in its own right! (more on this later) I personally feel betrayed by the A-Level Biology syllabus at this point... Anyway, the fact that Biologists were so focussed on the effect mutations had on protein coding DNA, meant that they were looking in the wrong neighbourhood of the human genome for the cause of inherited disorders like myotonic dystrophy. Within the myotonic dystrophy gene, a small base sequence (CTG) is repeated multiple times; 5-30 repeats in a healthy person. If the number of repeats exceeds 35, the sequence becomes unstable and can change in number erratically from one generation to the next, and as the the length increases generationally, the symptoms of the condition worsen, and develop earlier. At this point you're probably thinking; this is just a standard change in the base sequence of the gene, inhibiting the production of the correct protein. However, you would be wrong, as the amino acid sequence produced by this mutated myotonic dystrophy gene is unchanged, and the protein is still synthesised (did I just hear the jaws of the uninitiated drop?). Is this just an anomaly perhaps? Certainly not. Fragile X syndrome, the most common inherited learning disability, and Friedrich's ataxia, a muscle wasting disease, follow a similar mechanism. This was an unnerving revelation for researchers; there are mutations that can cause disease without changing the amino acid sequence of proteins. But in that case how do sufferers develop such debilitating symptoms? In Friedrich's ataxia, much like in myotonic dystrophy, the abnormally large section of repeats (GAA in the case of FA) is found between 2 sections of the protein coding genetic material(so in the junk DNA). Research has shown that when cells contained the expanded repeat, the production of mRNA encoded for by the gene was inhibited. The enlarged section of GAA repeats prevents accurate copying of the DNA, and thus there is no translation. In the case of Fragile X syndrome this means RNA around the cell is not properly regulated by Fragile X protein, causing cellular chaos and a life of disability for the sufferer. But that begs the question; why have the repeat sequences within the genes for myotonic dystrophy for example, been conserved over our genomic evolution? I mean, the repeats are located at the very end of the gene, so could easily be left out of any RNA produced. This means they must have a purpose in a healthy cell, within the mRNA of the gene. This is in fact the case, as the myotonic dystrophy mRNA is used to bind proteins in the cell (a weird role for mRNA), and the larger the repeat expansion, the more molecules are bound. The proteins are normally involved in the regulation of other mRNAs; controlling their lifespan and translation efficiency, and so without these regulatory polypeptides, the cell cannot function correctly. This discovery made sense to clinicians, as diseases in which a small change in gene expression creates such a huge impact on a patient, with fine tuning of severity between patients (based on length of repeats), are simply not observed. Therefore the role of this mutated junk DNA is more complex than we first thought, as it is not simply the number of abnormal repeats within the DNA that affects symptoms, but also the role of the expanded section of repeats on the mRNA synthesised from the mutated section of DNA in gene expression. From this paradigm it is clear that the fact that we have sequenced the human genome does not mean we understand all of its intricacies. Junk DNA has a powerful role, not just in inherited diseases, but in almost every aspect of our genome, and even our epigenome. That is why I'm so excited to continue this series, shedding light on the enigmatic sections of the genome previously condemned as insignificant junk.