[問題] N50

看板BioMedInfo (生醫資訊)作者 (風大雨大)時間15年前 (2009/09/09 14:09), 編輯推噓2(204)
留言6則, 2人參與, 最新討論串1/1
關於genome在做assembly時,paper都會提到N50 size為多少。 這是網路上我所查到的定義: http://www.cbcb.umd.edu/research/castats.shtml The N50 size of a set of entities (e.g., contigs or scaffolds) represents the largest entity E such that at least half of the total size of the entities is contained in entities larger than E. For example if we have a collection of contigs with sizes 7, 4, 3, 2, 2, 1, and 1 kb (total size = 20kbp), the N50 length is 4 because we can cover 10 kb with contigs bigger than 4kb. 我的解讀是佔50%的contig, 所以20kbp的N50應該是10kbp 不過看了下面的例子又明顯不是這樣... 請問N50的定義到底該怎麼下呢? 謝謝不吝解惑. -- ※ 發信站: 批踢踢實業坊(ptt.cc) ◆ From: 140.114.88.228

09/09 16:54, , 1F
因為大於4 kbp 的 contigs 有 7 跟 4 加起來超過20kbp的
09/09 16:54, 1F

09/09 16:55, , 2F
一半,因此這個例子內N50是4 kbp。並非每個加起來20kbp的
09/09 16:55, 2F

09/09 16:55, , 3F
例子都會是4 kbps
09/09 16:55, 3F

09/09 17:17, , 4F
為何不是7kbp呢@@?
09/09 17:17, 4F

09/11 14:14, , 5F
7 kbps < 10 kbps 所以不是 7
09/11 14:14, 5F

09/14 10:02, , 6F
原來如此!謝謝h大!
09/14 10:02, 6F
文章代碼(AID): #1AfqPytV (BioMedInfo)
文章代碼(AID): #1AfqPytV (BioMedInfo)