Test Data Sets and Evaluation of Gene Prediction Programs on the Rice Genome


  • Journal: Journal of Computer Science and Technology (English edition)
  • Authors: Heng Li, Jin-Song Liu, Zhao Xu, J
  • Affiliations: Beijing Genomics Institute (BGI), Institute of Theoretical Physics, Department of Mathematics, Institute of Systems Scienc
  • Updated: 2023-04-17
Abstract

With several rice genome projects approaching completion, gene prediction by computer algorithms has become an urgent task. Two test sets were constructed by mapping the newly published 28,469 full-length KOME rice cDNAs to the RGP BAC clone sequences of Oryza sativa ssp. japonica: a single-gene set of 550 sequences and a multi-gene set of 62 sequences containing 271 genes. These data sets were used to evaluate five ab initio gene prediction programs: RiceHMM, GlimmerR, GeneMark, FGENESH and BGF. The predictions were compared at the nucleotide, exon and whole-gene-structure levels using commonly accepted measures and several new measures. The test results show performance improving in the chronological order of the programs. At the same time, the complementarity of the programs hints at the possibility of further improvement and at the feasibility of reaching better performance by combining several gene finders.
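
To make the nucleotide-level and exon-level comparison concrete, here is a minimal sketch in Python. The abstract does not say which measures were used, so this assumes the sensitivity and specificity definitions that are conventional in gene-finder evaluation (Sn = TP / (TP + FN), Sp = TP / (TP + FP)). The function names, the interval format and the sample coordinates are hypothetical and are not taken from the paper.

```python
# Hypothetical sketch: conventional sensitivity (Sn) and specificity (Sp)
# for gene prediction, computed at the nucleotide and exon levels.
# Exons are (start, end) tuples, 1-based and inclusive (an assumption).

def nucleotide_accuracy(annotated, predicted):
    """Return (Sn, Sp) over individual coding nucleotides."""
    ann = {p for s, e in annotated for p in range(s, e + 1)}
    pred = {p for s, e in predicted for p in range(s, e + 1)}
    tp = len(ann & pred)                   # bases coding in both sets
    sn = tp / len(ann) if ann else 0.0     # TP / (TP + FN)
    sp = tp / len(pred) if pred else 0.0   # TP / (TP + FP)
    return sn, sp

def exon_accuracy(annotated, predicted):
    """Return (Sn, Sp) counting only exons with both boundaries exact."""
    ann, pred = set(annotated), set(predicted)
    exact = len(ann & pred)
    sn = exact / len(ann) if ann else 0.0
    sp = exact / len(pred) if pred else 0.0
    return sn, sp

# Made-up coordinates for illustration only.
annotated = [(101, 200), (301, 400)]
predicted = [(101, 200), (311, 400), (501, 550)]

nuc_sn, nuc_sp = nucleotide_accuracy(annotated, predicted)
exon_sn, exon_sp = exon_accuracy(annotated, predicted)
print(f"nucleotide Sn={nuc_sn:.3f} Sp={nuc_sp:.3f}")
print(f"exon       Sn={exon_sn:.3f} Sp={exon_sp:.3f}")
```

In this made-up case the second predicted exon misses the annotated start by ten bases, so it still contributes most of its bases at the nucleotide level but counts as a miss at the exon level. A whole-gene measure is stricter still, since a gene is typically counted as correct only when every exon matches.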
