fastq,sam文件一些小结(持续补充。。。)

ST-E00211:326:H5L3NCCXY:2:2219:23328:63173    83    chr14    21853647    24    141M    =    21853647    -141    ACTTCACCTCCTGGAGTCCTGGACTTCCCCACATCTCCCCTGCCCCTCCCACGTTTCCATAGTCCAAGGGCCAGAGTAAATGAAAATACAGCAGCCGCCCAAGCAATGGGGCCCATGCTGGGGCTTCAGTCATCCCCATGT    JFJFJFJJJJJJJJJJJJJJJFAJJJJJJJJJFFJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJFFFAA    AS:i:-28    XN:i:0    XM:i:5    XO:i:0    XG:i:0    NM:i:5    MD:Z:0T0T132A0G3C1    YS:i:-27    YT:Z:CP
ST-E00211:326:H5L3NCCXY:2:2219:23328:63173    163    chr14    21853647    24    141M    =    21853647    -141    ACTTCACCTCCTGGAGTCCTGGACTTCCCCACATCTCCCCTGCCCCTCCCACGTTTCCATAGTCCAAGGGCCAGAGTAAATGAAAATACAGCAGCCGCCCAAGCAATGGGGCCCATGCTGGGGCTTCAGTCATCCCCATGT    A<FFFFFFFJJFJFJJ<<JJJJJJJJJJJJFJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJFJJJJJJFJJJ<JFJJJJJJJJAJJFJJJJJJJJJJJJJJJJJFJJJJFJJJJJJJJJJAJJJJJJJJJJJJJJJJ    AS:i:-27    XN:i:0    XM:i:5    XO:i:0    XG:i:0    NM:i:5    MD:Z:0T0T132A0G3C1    YS:i:-28    YT:Z:CP

这是双端测序的sam文件随意挑出的一条测序片段。(sam文件会把同一片段的两条read都保留,从第二列可以知道是read1还是read2),可以发现这条片段已经被测通(因为染色体坐标一致),基因序列整体是相同的,说明read1和read2都是按5‘-->3’的顺序排列的

原文地址:https://www.cnblogs.com/zhengzh/p/7553605.html