Updated 2025-02-24. From: SLTechnology News&Howtos
This article describes the GFF file format that the Perl script expects, and shows how to adjust the script when the gene IDs in your GFF file use a different prefix.
The corresponding GFF file has the following format (nine columns per record, tab-separated in the actual file):
1 araport11 gene 3631 5899 . + . ID=gene:AT1G01010;Name=NAC001;biotype=protein_coding
1 araport11 mRNA 3631 5899 . + . ID=transcript:AT1G01010.1;Parent=gene:AT1G01010
1 araport11 five_prime_UTR 3631 3759 . + . Parent=transcript:AT1G01010.1
1 araport11 exon 3631 3913 . + . Parent=transcript:AT1G01010.1;Name=AT1G01010.1.exon1;constitutive=1
1 araport11 CDS 3760 3913 . + 0 ID=CDS:AT1G01010.1;Parent=transcript:AT1G01010.1;protein_id=AT1G01010.1
1 araport11 exon 3996 4276 . + . Parent=transcript:AT1G01010.1;Name=AT1G01010.1.exon2;constitutive=1
1 araport11 CDS 3996 4276 . + 2 ID=CDS:AT1G01010.1;Parent=transcript:AT1G01010.1;protein_id=AT1G01010.1
1 araport11 exon 4486 4605 . + . Parent=transcript:AT1G01010.1;Name=AT1G01010.1.exon3;constitutive=1
1 araport11 gene 6788 9130 . - . ID=gene:AT1G01020;Name=ARV1;biotype=protein_coding
1 araport11 mRNA 6788 8737 . - . ID=transcript:AT1G01020.6;Parent=gene:AT1G01020;biotype=protein_coding
1 araport11 exon 6788 7069 . - . Parent=transcript:AT1G01020.6;Name=AT1G01020.2.exon8;constitutive=0
1 araport11 three_prime_UTR 6788 7069 . - . Parent=transcript:AT1G01020.6
1 araport11 three_prime_UTR 7157 7314 . - . Parent=transcript:AT1G01020.6
1 araport11 exon 7157 7450 . - . Parent=transcript:AT1G01020.6;Name=AT1G01020.2.exon7;constitutive=0
1 araport11 CDS 7315 7450 . - 1 ID=CDS:AT1G01020.6;Parent=transcript:AT1G01020.6;protein_id=AT1G01020.6
1 araport11 exon 7564 7649 . - . Parent=transcript:AT1G01020.6;Name=AT1G01020.1.exon6;constitutive=1
When the script reaches a record whose third column is gene, it extracts the gene_id from the attributes column. Different platforms and tools write the gene_id with different prefixes. In the example above the prefix is ID=gene:, so the script matches it with:
$a[8] =~ m/ID=gene:([^;]*);/
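For context, here is a minimal sketch of how that match might sit inside a script. The original script is not shown, so the helper name gene_id_from_line, the comment-line check, and the tab split are assumptions for illustration only; just the regular expression comes from the article.

```perl
use strict;
use warnings;

# Hypothetical helper (not from the original script): returns the gene ID
# found on one GFF line, or undef if the line is not a matching gene record.
sub gene_id_from_line {
    my ($line) = @_;
    chomp $line;
    return if $line =~ /^#/;                 # skip comment and header lines
    my @a = split /\t/, $line;               # GFF columns are tab-separated
    return unless @a >= 9 && $a[2] eq 'gene';
    # $a[8] is the ninth (attributes) column; this pattern expects "ID=gene:".
    return $a[8] =~ m/ID=gene:([^;]*);/ ? $1 : undef;
}

# Sample record taken from the example data above.
my $line = join("\t", 1, 'araport11', 'gene', 3631, 5899, '.', '+', '.',
    'ID=gene:AT1G01010;Name=NAC001;biotype=protein_coding');

my $id = gene_id_from_line($line);
print defined $id ? "$id\n" : "no match\n";
```

Note that the pattern requires a semicolon after the ID value, so a record whose ID is the last attribute with no trailing semicolon would not match.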
If the gene_id in the ninth column of your GFF file (index 8 in the script's array) uses a different prefix, for example:
1 araport11 gene 11649 13714 . - . ID=AT1G01030;Name=NGA3;biotype=protein_coding
you only need to change the match in the script to something like this:
$a[8] =~ m/ID=([^;]*);/
With that change, the script matches correctly and extracts the gene ID.
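If you work with files from several sources, one option is a single pattern that accepts both ID forms shown above. The sketch below is an alternative, not the original script's approach; the helper name extract_gene_id is hypothetical, and the pattern treats the gene: prefix as optional and does not require a trailing semicolon.

```perl
use strict;
use warnings;

# Hypothetical helper (not from the original script): extracts the gene ID
# from an attributes string, with or without the "gene:" prefix.
sub extract_gene_id {
    my ($attributes) = @_;
    # (?:^|;) anchors ID= at the start of an attribute,
    # (?:gene:)? makes the prefix optional.
    return $attributes =~ m/(?:^|;)ID=(?:gene:)?([^;]*)/ ? $1 : undef;
}

# Both attribute strings come from the example records above.
for my $attr ('ID=gene:AT1G01010;Name=NAC001;biotype=protein_coding',
              'ID=AT1G01030;Name=NGA3;biotype=protein_coding') {
    my $id = extract_gene_id($attr);
    print defined $id ? "$id\n" : "no match\n";
}
```

Apply this only to records whose third column is gene; on other feature types it would also strip nothing from prefixes such as CDS:, which stay part of the captured value.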
That covers the GFF file format the Perl script expects. To get comfortable with the change, try it on your own GFF files.