; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc10G13080 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc10G13080
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationClcChr10:26643453..26646391
RNA-Seq ExpressionClc10G13080
SyntenyClc10G13080
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6581055.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0087.98Show/hide
Query:  MKRREQLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITK
        MK R+QLV LFEKCKCSK LA+LQC TLK+GLAHD+FFATKL+ALH  YTSLV+AHKVFEETPS+TVHLWNA+LRSYC   QW QTL LF+DMVS GI+ 
Subjt:  MKRREQLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITK

Query:  GKPDNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVM
        GKPDNFTIPIVLKACAGLQALKFGK VHGF+K+N+K D+DMFV AALIELYSKCG MHEALQVFL F QPDVV+WTSMITGYEQNGNP+K+VDFFSQMVM
Subjt:  GKPDNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVM

Query:  IKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEM
        I+H+NPDP+TLVSLASACTQL NSKLGSSIHGFI RRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMP KDVISWSSLIACY+HNGAAAEALNLFN+M
Subjt:  IKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEM

Query:  IDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNT
        ID+KIKFNSVTVIGALQACAV  NLEEG++IHEL TRKGLELDISVSTALIDMYMKCFSPEEA+ VFEKMPKKDVVSWA LLSGYS NGMSSKSM TFNT
Subjt:  IDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNT

Query:  MLLNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFN
        MLLNNIKPDAVA+VKIL SCSDLGILQQALCLHDYVVKSGFT+NIFVGASLIELYSKCGS VNAMKVFEEMKVKDIV WSAIIAGYGIQGQGREALKLF+
Subjt:  MLLNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFN

Query:  KMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGE
        KM ETSE+M NEVTFLSLLSACSHAGLIEEGIKIFNMML+EYRI+PNMEHY IIVDLLGR GELNRALNFVQKMPIPAGPHVWGALLGAACIHHKS+LGE
Subjt:  KMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGE

Query:  IAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCL
        IAA NLFKLDPNHAGYYILLSNIYAVE NW+N GKLRNMIKEKGLKK++GQSVIEAG EVHSFVADDRLHP+SD IYRLLR LNVNM+DE YSSNSK+ L
Subjt:  IAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCL

Query:  QGTDEIV
        QG +EIV
Subjt:  QGTDEIV

KAG7017786.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0087.98Show/hide
Query:  MKRREQLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITK
        MK R+QLV LFEKCKCSK LA+LQC TLK+GLAHD+FFATKL+ALH  YTSLVQAHKVFEETPS+TVHLWNA+LRSYC   QW QTL LF+DMVS GI+ 
Subjt:  MKRREQLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITK

Query:  GKPDNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVM
        GKPDNFTIPIVLKACAGLQALKFGK VHGF+K+N+K D+DMFV AALIELYSKCG MHEALQVFL F QPDVV+WTSMITGYEQNGNP+K+VDFFSQMVM
Subjt:  GKPDNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVM

Query:  IKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEM
        I+H+NPDP+TLVSLASACTQL NSKLGSSIHGFI RRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMP KDVISWSSLIACY+HNGAAAEALNLFN+M
Subjt:  IKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEM

Query:  IDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNT
        ID+KIKFNSVTVIGALQACAV  NLEEG++IHEL TRKGLELDISVSTALIDMYMKCFSPEEA+ VFEKMPKKDVVSWA LLSGYS NGMSSKSM TFNT
Subjt:  IDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNT

Query:  MLLNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFN
        MLLNNIKPDAVA+VKIL SCSDLGILQQALCLHDYVVKSGFT+N+FVGASLIELYSKCGS VNAMKVFEEMKVKDIV WSAIIAGYGIQGQGREALKLF+
Subjt:  MLLNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFN

Query:  KMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGE
        KM ETSE+M NEVTFLSLLSACSHAGLIEEGIKIFNMML+EYRI+PNMEHY IIVDLLGR GELNRALNFVQKMPIPAGPHVWGALLGAACIHHKS+LGE
Subjt:  KMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGE

Query:  IAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCL
        IAA NLFKLDPNHAGYYILLSNIYAVE NW+N GKLRNMIKEKGLKK++GQSVIEAG+EVHSFVADDRLHP+SD IYRLLR LNVNM+DE YSSNSK+ L
Subjt:  IAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCL

Query:  QGTDEIV
        QG +EIV
Subjt:  QGTDEIV

XP_022935091.1 putative pentatricopeptide repeat-containing protein At3g01580 isoform X1 [Cucurbita moschata]0.0e+0087.55Show/hide
Query:  MKRREQLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITK
        MK R+QLV LFEKCKCSK LA+LQC TLK+GLAHD+FFATKL+ALH  YTSLVQAHKVFEETPS+TVHLWNA+LRSYC   QW QTL LF+DMVS GI+ 
Subjt:  MKRREQLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITK

Query:  GKPDNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVM
        GKPDNFTIPIVLKACAGLQALKFGK VHGF+K+N+K D+DMFV AALIELYSKCG MHEALQVFL F QPDVV+WTSMITGYEQNGNP+K+VDFFSQMVM
Subjt:  GKPDNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVM

Query:  IKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEM
        I+H+NPDP+TLVSLASACTQL NSKLGSSIHGFI RRNLDY LSLANSLLNLY KTGSVKAAANLFEKMP KDVISWSSLIACY+HNGAAAEALNLFN+M
Subjt:  IKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEM

Query:  IDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNT
        ID+KIKFNSVTVIGALQACAV  NLEEG++IHEL  RKGLELDISVSTALIDMYMKCFSPEEA+ VFEKMPKKDVVSWA LLSGYS NGMSSKSM TFNT
Subjt:  IDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNT

Query:  MLLNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFN
        MLLNNIKPDAVA+VKIL SCSDLGILQQALCLHDYVVKSGFT+N+FVGASLIELYSKCGS VNAMKVFEEMKVKDIV WSAIIAGYGIQGQGREALKLF+
Subjt:  MLLNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFN

Query:  KMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGE
        KM ETSE+M NEVTFLSLLSACSHAGLIEEGIKIFNMML+EYRI+PNMEHY IIVDLLGR GELNRALNFVQKMPIPAGPHVWGALLGAACIHHKS+LGE
Subjt:  KMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGE

Query:  IAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCL
        IAA NLFKLDPNHAGYYILLSNIYAVE NW+N GKLRNMIKEKGLKK++GQSVIEAG+EVHSFVADDRLHP+SD IYRLLR LNVNM+DE YSSNSK+ L
Subjt:  IAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCL

Query:  QGTDEIV
        QG +EIV
Subjt:  QGTDEIV

XP_022982551.1 putative pentatricopeptide repeat-containing protein At3g01580 isoform X1 [Cucurbita maxima]0.0e+0087.98Show/hide
Query:  MKRREQLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITK
        MK R+QLV LFEKCKCSKILA+LQC TLK+GLAHD+FFATKL+ALH  YTSLVQAHKVFEETPS+TVHLWNALLRSYC   QW QTL LF+DMVS GI+ 
Subjt:  MKRREQLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITK

Query:  GKPDNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVM
        GKPDNFTIPIVLKACAGLQALKFGK VHGF+K+N+K D+DMFV AALIELYSKCG MHEALQVFL F QPDVV+WTSMITGYEQNGNP+K+VDFFSQMVM
Subjt:  GKPDNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVM

Query:  IKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEM
        I+H+NPDP+TLVSLASACTQL NSKLGSSIHGFI RRNLDYDLSLANSLLNLY KTGSVKAAANLFEKMP KDVISWSSLIACY+HNGAAAEALNLFN+M
Subjt:  IKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEM

Query:  IDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNT
        ID+KIKFNSVTVIGALQACAVA NLEEG++IHEL TRKGLELDISVSTALIDMYMKCFSPEEA+ VFEKMPKKDVVSWAALLSGYS NGMSSKSM TFNT
Subjt:  IDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNT

Query:  MLLNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFN
        MLLNNIKPDAVA+VKIL SCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGS VNAMKVFEEMKVKDIV WSAIIAGYGIQGQGREALKLF+
Subjt:  MLLNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFN

Query:  KMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGE
        KM ETSE+M NEVTFLSLLSACSHAGLIEEGIKIFNMML+EYRI+PNMEHY I+VDLLGR GEL++ALNFVQKMPIPAGPHVWGALLG+ACIHHKS+LGE
Subjt:  KMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGE

Query:  IAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCL
        IAA NLFKLDPNHAGYYILLSNIYAVE NW+N GKLRNMIKEKGLKK++GQSVIEAG+EVHSFVADDRLHP+SD IYRLLR LNVNM+DE YSSNSK+ L
Subjt:  IAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCL

Query:  QGTDEIV
        Q  +EIV
Subjt:  QGTDEIV

XP_038905362.1 putative pentatricopeptide repeat-containing protein At3g01580 [Benincasa hispida]0.0e+0090.03Show/hide
Query:  QLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKPDN
        QLV LFEKC CSKILAQLQCLT+KVGLAHDSFFATKL+ALHS YTSLVQAHKVFEETPS+TVHLWNALLRSYC  NQWEQTL LFHDMVS GI KGKPDN
Subjt:  QLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKPDN

Query:  FTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKHLN
        FTIPIVLKACAGLQALKFG+MVHGF+K+N+K DIDMFV AALIELYSKCG MH+AL+VF+ FSQPDVVMWTSMITGYEQ+GNPEKAVDFFS+MVMIKHLN
Subjt:  FTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKHLN

Query:  PDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDRKI
        PDPITLVSLASACTQL NSKLGSSIHGFI RRNLDYDL LANSLLNLYAKTGSV AAANLFEKMPTKDVI+WSSLIACYSHNGAAAEALNLFNEMIDRKI
Subjt:  PDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDRKI

Query:  KFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLLNN
        KFNSVTV+GALQACAVA NLEEG+RIHEL TR GLELDISVSTALIDMYMKC SPEEAIDVFEKMPKKDVVSWAALLSGYS NGMSSKSM TFNTMLLNN
Subjt:  KFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLLNN

Query:  IKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMIET
        IKPDAVA+VKILVSCSDLGILQQALCLHDY+VKSG+T N+FVGASLIELYSKCGSIVNA+KVFEE+KVKDIV WSAIIAGYGIQGQGREALKLFNKMIET
Subjt:  IKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMIET

Query:  SEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAAKN
        S++MPNEVTFLSLLSACSHAGLIEEGIKIFNMML EYRIKPNMEHYSIIVDLLGR GEL+RAL+FVQKMPIPAGPHVWGALLGAACIHHKSELGEIAAK 
Subjt:  SEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAAKN

Query:  LFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCLQGTDE
        LFKLDPNH GYY LLSN+YAVEKNWD+ GKLRNMIKEKGLKKM+GQSVIEAG+EVHSFVADDRLHPESDHIYRLLR LN+NMK+ESY SNS S  +GT++
Subjt:  LFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCLQGTDE

Query:  IV
        I+
Subjt:  IV

TrEMBL top hitse value%identityAlignment
A0A1S3CMP5 putative pentatricopeptide repeat-containing protein At3g015800.0e+0088.07Show/hide
Query:  REQLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKP
        R+QLV LFEKC  SKILAQLQCLTLK+GL HDSFFATKL+ALHS YTSLVQAHKVF+ETP++TVHLWNALLRSYC  NQW+QTL LFHDMVS GITKGKP
Subjt:  REQLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKP

Query:  DNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKH
        DN+TIPIVLKACAGLQAL+FGKMVHGFV++ +K D+DMFV A+LIELYSKCG M EA QVFL FSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMI+H
Subjt:  DNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKH

Query:  LNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDR
        LNPDP+TLVSLASACTQL +SKLGSSIHGF+IRRNLDYDLSLANSLLNLY KTGSV AAA LFE+MPTKDVISWSSLIACYS NG  AEALNLFNEMIDR
Subjt:  LNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDR

Query:  KIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLL
        KIKFNSVT +G LQACAVA NLEEG+RIHEL TRKGLELDISVSTALIDMYMKCFSPEEA+ VFEKMPKKDVVS AALLSGYS NGMS KSM TFNTM L
Subjt:  KIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLL

Query:  NNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMI
        NNIKPDAVAMVKILVSCSDLGILQQALCLHDYV+K+GFTNNIFVGASLIELYSKCG+IVNAMKVFEE+KVKDIV WSAIIAGYGIQGQGREALKLFNKMI
Subjt:  NNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMI

Query:  ETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAA
        ETSE+MPNEVTFLSLLSACSHAGLIEEGI IFNMML +YRIKP MEHYSIIVDLLGR GEL+RALNFV+KMPIPAGPHVWGALLGAACIHHKSELGEIAA
Subjt:  ETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAA

Query:  KNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCLQGT
        KNLFKLDPN+AGY+ILLS IYAVEKNWDN GKLR+M+KEK LKKM G+SVIEAGNEVHSFVA+DRLH E D IYRLLR LNVNM+DESYSSNSKSCLQGT
Subjt:  KNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCLQGT

Query:  DEIV
         EIV
Subjt:  DEIV

A0A5A7UUL5 Putative pentatricopeptide repeat-containing protein0.0e+0088.07Show/hide
Query:  REQLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKP
        R+QLV LFEKC  SKILAQLQCLTLK+GL HDSFFATKL+ALHS YTSLVQAHKVF+ETP++TVHLWNALLRSYC  NQW+QTL LFHDMVS GITKGKP
Subjt:  REQLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKP

Query:  DNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKH
        DN+TIPIVLKACAGLQAL+FGKMVHGFV++ +K D+DMFV A+LIELYSKCG M EA QVFL FSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMI+H
Subjt:  DNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKH

Query:  LNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDR
        LNPDP+TLVSLASACTQL +SKLGSSIHGF+IRRNLDYDLSLANSLLNLY KTGSV AAA LFE+MPTKDVISWSSLIACYS NG  AEALNLFNEMIDR
Subjt:  LNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDR

Query:  KIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLL
        KIKFNSVT +G LQACAVA NLEEG+RIHEL TRKGLELDISVSTALIDMYMKCFSPEEA+ VFEKMPKKDVVS AALLSGYS NGMS KSM TFNTM L
Subjt:  KIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLL

Query:  NNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMI
        NNIKPDAVAMVKILVSCSDLGILQQALCLHDYV+K+GFTNNIFVGASLIELYSKCG+IVNAMKVFEE+KVKDIV WSAIIAGYGIQGQGREALKLFNKMI
Subjt:  NNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMI

Query:  ETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAA
        ETSE+MPNEVTFLSLLSACSHAGLIEEGI IFNMML +YRIKP MEHYSIIVDLLGR GEL+RALNFV+KMPIPAGPHVWGALLGAACIHHKSELGEIAA
Subjt:  ETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAA

Query:  KNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCLQGT
        KNLFKLDPN+AGY+ILLS IYAVEKNWDN GKLR+M+KEK LKKM G+SVIEAGNEVHSFVA+DRLH E D IYRLLR LNVNM+DESYSSNSKSCLQGT
Subjt:  KNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCLQGT

Query:  DEIV
         EIV
Subjt:  DEIV

A0A6J1DE08 putative pentatricopeptide repeat-containing protein At3g015800.0e+0085.75Show/hide
Query:  MKRREQLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMV---SRG
        M+ RE+LV LFEKC   KILAQLQC TLKVGLAHDSFFATKL+AL++ Y SLV A KVF+ETP +TVHLWNALLRSYC  NQWE++L LFHDMV   S G
Subjt:  MKRREQLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMV---SRG

Query:  ITKGKPDNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQ
        +T+GKPDNFT+P+VLKACAGLQALK GK +HGFVKRN+K D+DMFV AALIELYSKCG M EALQVFL FSQPDVV+WTSMITGYEQNGNPEKAV+FFSQ
Subjt:  ITKGKPDNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQ

Query:  MVMIKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLF
        MV+IKH+NPD +TLVSL SACTQL NSKLGSSIHGFI RRNLDYDLSLANSLLNLYAKTGSVK+AANLFEKMP KDVISWSSLIACYS NGAAAEALNLF
Subjt:  MVMIKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLF

Query:  NEMIDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMET
        NEMIDRKI+FNSVTVIGALQAC VA NLEEG+ IH+L T KGLE+D+SVSTALIDMYMKCFSPE+A+DVFEKMPKKDVVSWAALLSGYS NGMSSKSM  
Subjt:  NEMIDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMET

Query:  FNTMLLNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALK
        FNTMLLN IKPDAVAMVKIL SCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIV WSAIIAGYGIQGQGR+ALK
Subjt:  FNTMLLNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALK

Query:  LFNKMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSE
        LF+KM ETS IMPNEVTF+SLLSACSHAGLIEEGIKIFN ML EYRIKPN EHY IIVDLLGR GEL+RAL+FVQKMPIPAGPHVWGALLGAACIHHKSE
Subjt:  LFNKMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSE

Query:  LGEIAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKD--ESYSSN
        LGE+AAKNLF+LDPNHAGYYILLSNIYAVEKNWDN GKLR++IKEKGLKKM+G+SVIE G+EVHSFVADDRLHPESD IYRLL  LNVNM+D  E Y+SN
Subjt:  LGEIAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKD--ESYSSN

Query:  SK
        SK
Subjt:  SK

A0A6J1F3K2 putative pentatricopeptide repeat-containing protein At3g01580 isoform X10.0e+0087.55Show/hide
Query:  MKRREQLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITK
        MK R+QLV LFEKCKCSK LA+LQC TLK+GLAHD+FFATKL+ALH  YTSLVQAHKVFEETPS+TVHLWNA+LRSYC   QW QTL LF+DMVS GI+ 
Subjt:  MKRREQLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITK

Query:  GKPDNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVM
        GKPDNFTIPIVLKACAGLQALKFGK VHGF+K+N+K D+DMFV AALIELYSKCG MHEALQVFL F QPDVV+WTSMITGYEQNGNP+K+VDFFSQMVM
Subjt:  GKPDNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVM

Query:  IKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEM
        I+H+NPDP+TLVSLASACTQL NSKLGSSIHGFI RRNLDY LSLANSLLNLY KTGSVKAAANLFEKMP KDVISWSSLIACY+HNGAAAEALNLFN+M
Subjt:  IKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEM

Query:  IDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNT
        ID+KIKFNSVTVIGALQACAV  NLEEG++IHEL  RKGLELDISVSTALIDMYMKCFSPEEA+ VFEKMPKKDVVSWA LLSGYS NGMSSKSM TFNT
Subjt:  IDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNT

Query:  MLLNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFN
        MLLNNIKPDAVA+VKIL SCSDLGILQQALCLHDYVVKSGFT+N+FVGASLIELYSKCGS VNAMKVFEEMKVKDIV WSAIIAGYGIQGQGREALKLF+
Subjt:  MLLNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFN

Query:  KMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGE
        KM ETSE+M NEVTFLSLLSACSHAGLIEEGIKIFNMML+EYRI+PNMEHY IIVDLLGR GELNRALNFVQKMPIPAGPHVWGALLGAACIHHKS+LGE
Subjt:  KMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGE

Query:  IAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCL
        IAA NLFKLDPNHAGYYILLSNIYAVE NW+N GKLRNMIKEKGLKK++GQSVIEAG+EVHSFVADDRLHP+SD IYRLLR LNVNM+DE YSSNSK+ L
Subjt:  IAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCL

Query:  QGTDEIV
        QG +EIV
Subjt:  QGTDEIV

A0A6J1IZM6 putative pentatricopeptide repeat-containing protein At3g01580 isoform X10.0e+0087.98Show/hide
Query:  MKRREQLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITK
        MK R+QLV LFEKCKCSKILA+LQC TLK+GLAHD+FFATKL+ALH  YTSLVQAHKVFEETPS+TVHLWNALLRSYC   QW QTL LF+DMVS GI+ 
Subjt:  MKRREQLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITK

Query:  GKPDNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVM
        GKPDNFTIPIVLKACAGLQALKFGK VHGF+K+N+K D+DMFV AALIELYSKCG MHEALQVFL F QPDVV+WTSMITGYEQNGNP+K+VDFFSQMVM
Subjt:  GKPDNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVM

Query:  IKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEM
        I+H+NPDP+TLVSLASACTQL NSKLGSSIHGFI RRNLDYDLSLANSLLNLY KTGSVKAAANLFEKMP KDVISWSSLIACY+HNGAAAEALNLFN+M
Subjt:  IKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEM

Query:  IDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNT
        ID+KIKFNSVTVIGALQACAVA NLEEG++IHEL TRKGLELDISVSTALIDMYMKCFSPEEA+ VFEKMPKKDVVSWAALLSGYS NGMSSKSM TFNT
Subjt:  IDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNT

Query:  MLLNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFN
        MLLNNIKPDAVA+VKIL SCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGS VNAMKVFEEMKVKDIV WSAIIAGYGIQGQGREALKLF+
Subjt:  MLLNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFN

Query:  KMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGE
        KM ETSE+M NEVTFLSLLSACSHAGLIEEGIKIFNMML+EYRI+PNMEHY I+VDLLGR GEL++ALNFVQKMPIPAGPHVWGALLG+ACIHHKS+LGE
Subjt:  KMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGE

Query:  IAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCL
        IAA NLFKLDPNHAGYYILLSNIYAVE NW+N GKLRNMIKEKGLKK++GQSVIEAG+EVHSFVADDRLHP+SD IYRLLR LNVNM+DE YSSNSK+ L
Subjt:  IAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCL

Query:  QGTDEIV
        Q  +EIV
Subjt:  QGTDEIV

SwissProt top hitse value%identityAlignment
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic5.1e-13636.9Show/hide
Query:  LFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKPDNFTIP
        L E+C   K L Q+  L  K GL  + FF TKL +L   Y S+ +A +VFE   S+   L++ +L+ +  V+  ++ L  F  M    +   +P  +   
Subjt:  LFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKPDNFTIP

Query:  IVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKHLNPDPI
         +LK C     L+ GK +HG + ++    +D+F    L  +Y+KC  ++EA +VF    + D+V W +++ GY QNG    A++    M   ++L P  I
Subjt:  IVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKHLNPDPI

Query:  TLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDRKIKFNS
        T+VS+  A + L    +G  IHG+ +R   D  ++++ +L+++YAK GS++ A  LF+ M  ++V+SW+S+I  Y  N    EA+ +F +M+D  +K   
Subjt:  TLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDRKIKFNS

Query:  VTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLLNNIKPD
        V+V+GAL ACA   +LE GR IH+L+   GL+ ++SV  +LI MY KC   + A  +F K+  + +VSW A++ G++ NG    ++  F+ M    +KPD
Subjt:  VTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLLNNIKPD

Query:  AVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMIETSEIM
            V ++ + ++L I   A  +H  V++S    N+FV  +L+++Y+KCG+I+ A  +F+ M  + +  W+A+I GYG  G G+ AL+LF +M +   I 
Subjt:  AVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMIETSEIM

Query:  PNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAAKNLFKL
        PN VTFLS++SACSH+GL+E G+K F MM + Y I+ +M+HY  +VDLLGR G LN A +F+ +MP+    +V+GA+LGA  IH      E AA+ LF+L
Subjt:  PNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAAKNLFKL

Query:  DPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESY
        +P+  GY++LL+NIY     W+  G++R  +  +GL+K  G S++E  NEVHSF +    HP+S  IY  L  L  ++K+  Y
Subjt:  DPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESY

Q9M9E2 Pentatricopeptide repeat-containing protein At1g15510, chloroplastic5.3e-12535.24Show/hide
Query:  ALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKPDNFTIPIVLKACAGLQALKFGKMVHGFVKR-NKKIDIDMF
        A+   + +LV A  VF +   R +  WN L+  Y     +++ +CL+H M+  G    KPD +T P VL+ C G+  L  GK VH  V R   ++DID  
Subjt:  ALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKPDNFTIPIVLKACAGLQALKFGKMVHGFVKR-NKKIDIDMF

Query:  VAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYD
        V  ALI +Y KCG +  A  +F    + D++ W +MI+GY +NG   + ++ F  M  +  ++PD +TL S+ SAC  L + +LG  IH ++I      D
Subjt:  VAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYD

Query:  LSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLEL
        +S+ NSL  +Y   GS + A  LF +M  KD++SW+++I+ Y +N    +A++ +  M    +K + +TV   L ACA   +L+ G  +H+L  +  L  
Subjt:  LSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLEL

Query:  DISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLLNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFT
         + V+  LI+MY KC   ++A+D+F  +P+K+V+SW ++++G   N    +++  F   +   ++P+A+ +   L +C+ +G L     +H +V+++G  
Subjt:  DISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLLNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFT

Query:  NNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEY
         + F+  +L+++Y +CG +  A   F   K KD+  W+ ++ GY  +GQG   ++LF++M++ S + P+E+TF+SLL  CS + ++ +G+  F+ M ++Y
Subjt:  NNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEY

Query:  RIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKE
         + PN++HY+ +VDLLGR GEL  A  F+QKMP+   P VWGALL A  IHHK +LGE++A+++F+LD    GYYILL N+YA    W    K+R M+KE
Subjt:  RIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKE

Query:  KGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCLQGTDEI
         GL    G S +E   +VH+F++DD+ HP++  I  +L      M +   +  S+S      EI
Subjt:  KGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCLQGTDEI

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic9.7e-12735.09Show/hide
Query:  LVSLFEKCKCSKIL---AQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKP
        L S+ + C  SK L    ++       G   DS   +KL+ +++N   L +A +VF+E        WN L+        +  ++ LF  M+S G+   + 
Subjt:  LVSLFEKCKCSKIL---AQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKP

Query:  DNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKH
        D++T   V K+ + L+++  G+ +HGF+ ++   + +  V  +L+  Y K   +  A +VF   ++ DV+ W S+I GY  NG  EK +  F QM ++  
Subjt:  DNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKH

Query:  LNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDR
        +  D  T+VS+ + C       LG ++H   ++     +    N+LL++Y+K G + +A  +F +M  + V+S++S+IA Y+  G A EA+ LF EM + 
Subjt:  LNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDR

Query:  KIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTML-
         I  +  TV   L  CA    L+EG+R+HE      L  DI VS AL+DMY KC S +EA  VF +M  KD++SW  ++ GYS N  +++++  FN +L 
Subjt:  KIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTML-

Query:  LNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKM
             PD   +  +L +C+ L    +   +H Y++++G+ ++  V  SL+++Y+KCG+++ A  +F+++  KD+V W+ +IAGYG+ G G+EA+ LFN+M
Subjt:  LNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKM

Query:  IETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIA
         +   I  +E++F+SLL ACSH+GL++EG + FN+M  E +I+P +EHY+ IVD+L R G+L +A  F++ MPIP    +WGALL    IHH  +L E  
Subjt:  IETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIA

Query:  AKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCLQG
        A+ +F+L+P + GYY+L++NIYA  + W+   +LR  I ++GL+K  G S IE    V+ FVA D  +PE+++I   LR +   M +E YS  +K  L  
Subjt:  AKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCLQG

Query:  TDEI
         +E+
Subjt:  TDEI

Q9SS97 Putative pentatricopeptide repeat-containing protein At3g015801.5e-21253.5Show/hide
Query:  YTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKPDNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALI
        ++S V A ++F E   R+++ WN LL+S     QWE+ L  F  M      + KPDNFT+P+ LKAC  L+ + +G+M+HGFVK++  +  D++V ++LI
Subjt:  YTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKPDNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALI

Query:  ELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANS
         +Y KCG M EAL++F    +PD+V W+SM++G+E+NG+P +AV+FF +MVM   + PD +TL++L SACT+L NS+LG  +HGF+IRR    DLSL NS
Subjt:  ELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANS

Query:  LLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVST
        LLN YAK+ + K A NLF+ +  KDVISWS++IACY  NGAAAEAL +FN+M+D   + N  TV+  LQACA A +LE+GR+ HEL  RKGLE ++ VST
Subjt:  LLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVST

Query:  ALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLL-NNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFV
        AL+DMYMKCFSPEEA  VF ++P+KDVVSW AL+SG++ NGM+ +S+E F+ MLL NN +PDA+ MVK+L SCS+LG L+QA C H YV+K GF +N F+
Subjt:  ALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLL-NNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFV

Query:  GASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPN
        GASL+ELYS+CGS+ NA KVF  + +KD V+W+++I GYGI G+G +AL+ FN M+++SE+ PNEVTFLS+LSACSHAGLI EG++IF +M+++YR+ PN
Subjt:  GASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPN

Query:  MEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKK
        +EHY+++VDLLGR+G+L+ A+   ++MP    P + G LLGA  IH   E+ E  AK LF+L+ NHAGYY+L+SN+Y V+  W+N  KLRN +K++G+KK
Subjt:  MEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKK

Query:  MVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDE
         + +S+IE   +VH FVADD LHPE + +Y LL+ L+++MK++
Subjt:  MVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDE

Q9STE1 Pentatricopeptide repeat-containing protein At4g213003.9e-12033.33Show/hide
Query:  VGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKPDNFTIPIVLKACAGLQALKFGKMVHG
        +G+  + F A+ L   +  Y  +    K+F+    +   +WN +L  Y      +  +  F  M    I+   P+  T   VL  CA    +  G  +HG
Subjt:  VGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKPDNFTIPIVLKACAGLQALKFGKMVHG

Query:  FVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKHLNPDPITLVSLASACTQLLNSKLGSS
         V  +  +D +  +  +L+ +YSKCG   +A ++F   S+ D V W  MI+GY Q+G  E+++ FF +M+    L PD IT  SL  + ++  N +    
Subjt:  FVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKHLNPDPITLVSLASACTQLLNSKLGSS

Query:  IHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDRKIKFNSVTVIGALQACAVASNLEEGR
        IH +I+R ++  D+ L ++L++ Y K   V  A N+F +  + DV+ ++++I+ Y HNG   ++L +F  ++  KI  N +T++  L    +   L+ GR
Subjt:  IHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDRKIKFNSVTVIGALQACAVASNLEEGR

Query:  RIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLLNNIKPDAVAMVKILVSCSDLGILQQA
         +H    +KG +   ++  A+IDMY KC     A ++FE++ K+D+VSW ++++  + +   S +++ F  M ++ I  D V++   L +C++L      
Subjt:  RIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLLNNIKPDAVAMVKILVSCSDLGILQQA

Query:  LCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMIETSEIMPNEVTFLSLLSACSHAGLIE
          +H +++K    ++++  ++LI++Y+KCG++  AM VF+ MK K+IV W++IIA  G  G+ +++L LF++M+E S I P+++TFL ++S+C H G ++
Subjt:  LCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMIETSEIMPNEVTFLSLLSACSHAGLIE

Query:  EGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAAKNLFKLDPNHAGYYILLSNIYAVEKN
        EG++ F  M ++Y I+P  EHY+ +VDL GR G L  A   V+ MP P    VWG LLGA  +H   EL E+A+  L  LDP+++GYY+L+SN +A  + 
Subjt:  EGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAAKNLFKLDPNHAGYYILLSNIYAVEKN

Query:  WDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESY
        W++  K+R+++KE+ ++K+ G S IE     H FV+ D  HPES HIY LL  L   ++ E Y
Subjt:  WDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESY

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein3.6e-13736.9Show/hide
Query:  LFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKPDNFTIP
        L E+C   K L Q+  L  K GL  + FF TKL +L   Y S+ +A +VFE   S+   L++ +L+ +  V+  ++ L  F  M    +   +P  +   
Subjt:  LFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKPDNFTIP

Query:  IVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKHLNPDPI
         +LK C     L+ GK +HG + ++    +D+F    L  +Y+KC  ++EA +VF    + D+V W +++ GY QNG    A++    M   ++L P  I
Subjt:  IVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKHLNPDPI

Query:  TLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDRKIKFNS
        T+VS+  A + L    +G  IHG+ +R   D  ++++ +L+++YAK GS++ A  LF+ M  ++V+SW+S+I  Y  N    EA+ +F +M+D  +K   
Subjt:  TLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDRKIKFNS

Query:  VTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLLNNIKPD
        V+V+GAL ACA   +LE GR IH+L+   GL+ ++SV  +LI MY KC   + A  +F K+  + +VSW A++ G++ NG    ++  F+ M    +KPD
Subjt:  VTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLLNNIKPD

Query:  AVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMIETSEIM
            V ++ + ++L I   A  +H  V++S    N+FV  +L+++Y+KCG+I+ A  +F+ M  + +  W+A+I GYG  G G+ AL+LF +M +   I 
Subjt:  AVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMIETSEIM

Query:  PNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAAKNLFKL
        PN VTFLS++SACSH+GL+E G+K F MM + Y I+ +M+HY  +VDLLGR G LN A +F+ +MP+    +V+GA+LGA  IH      E AA+ LF+L
Subjt:  PNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAAKNLFKL

Query:  DPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESY
        +P+  GY++LL+NIY     W+  G++R  +  +GL+K  G S++E  NEVHSF +    HP+S  IY  L  L  ++K+  Y
Subjt:  DPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESY

AT1G15510.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.8e-12635.24Show/hide
Query:  ALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKPDNFTIPIVLKACAGLQALKFGKMVHGFVKR-NKKIDIDMF
        A+   + +LV A  VF +   R +  WN L+  Y     +++ +CL+H M+  G    KPD +T P VL+ C G+  L  GK VH  V R   ++DID  
Subjt:  ALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKPDNFTIPIVLKACAGLQALKFGKMVHGFVKR-NKKIDIDMF

Query:  VAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYD
        V  ALI +Y KCG +  A  +F    + D++ W +MI+GY +NG   + ++ F  M  +  ++PD +TL S+ SAC  L + +LG  IH ++I      D
Subjt:  VAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYD

Query:  LSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLEL
        +S+ NSL  +Y   GS + A  LF +M  KD++SW+++I+ Y +N    +A++ +  M    +K + +TV   L ACA   +L+ G  +H+L  +  L  
Subjt:  LSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLEL

Query:  DISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLLNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFT
         + V+  LI+MY KC   ++A+D+F  +P+K+V+SW ++++G   N    +++  F   +   ++P+A+ +   L +C+ +G L     +H +V+++G  
Subjt:  DISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLLNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFT

Query:  NNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEY
         + F+  +L+++Y +CG +  A   F   K KD+  W+ ++ GY  +GQG   ++LF++M++ S + P+E+TF+SLL  CS + ++ +G+  F+ M ++Y
Subjt:  NNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEY

Query:  RIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKE
         + PN++HY+ +VDLLGR GEL  A  F+QKMP+   P VWGALL A  IHHK +LGE++A+++F+LD    GYYILL N+YA    W    K+R M+KE
Subjt:  RIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKE

Query:  KGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCLQGTDEI
         GL    G S +E   +VH+F++DD+ HP++  I  +L      M +   +  S+S      EI
Subjt:  KGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCLQGTDEI

AT3G01580.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-21353.5Show/hide
Query:  YTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKPDNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALI
        ++S V A ++F E   R+++ WN LL+S     QWE+ L  F  M      + KPDNFT+P+ LKAC  L+ + +G+M+HGFVK++  +  D++V ++LI
Subjt:  YTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKPDNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALI

Query:  ELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANS
         +Y KCG M EAL++F    +PD+V W+SM++G+E+NG+P +AV+FF +MVM   + PD +TL++L SACT+L NS+LG  +HGF+IRR    DLSL NS
Subjt:  ELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKHLNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANS

Query:  LLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVST
        LLN YAK+ + K A NLF+ +  KDVISWS++IACY  NGAAAEAL +FN+M+D   + N  TV+  LQACA A +LE+GR+ HEL  RKGLE ++ VST
Subjt:  LLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDRKIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVST

Query:  ALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLL-NNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFV
        AL+DMYMKCFSPEEA  VF ++P+KDVVSW AL+SG++ NGM+ +S+E F+ MLL NN +PDA+ MVK+L SCS+LG L+QA C H YV+K GF +N F+
Subjt:  ALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLL-NNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFV

Query:  GASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPN
        GASL+ELYS+CGS+ NA KVF  + +KD V+W+++I GYGI G+G +AL+ FN M+++SE+ PNEVTFLS+LSACSHAGLI EG++IF +M+++YR+ PN
Subjt:  GASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMIETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPN

Query:  MEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKK
        +EHY+++VDLLGR+G+L+ A+   ++MP    P + G LLGA  IH   E+ E  AK LF+L+ NHAGYY+L+SN+Y V+  W+N  KLRN +K++G+KK
Subjt:  MEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAAKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKK

Query:  MVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDE
         + +S+IE   +VH FVADD LHPE + +Y LL+ L+++MK++
Subjt:  MVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDE

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein6.9e-12835.09Show/hide
Query:  LVSLFEKCKCSKIL---AQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKP
        L S+ + C  SK L    ++       G   DS   +KL+ +++N   L +A +VF+E        WN L+        +  ++ LF  M+S G+   + 
Subjt:  LVSLFEKCKCSKIL---AQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKP

Query:  DNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKH
        D++T   V K+ + L+++  G+ +HGF+ ++   + +  V  +L+  Y K   +  A +VF   ++ DV+ W S+I GY  NG  EK +  F QM ++  
Subjt:  DNFTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKH

Query:  LNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDR
        +  D  T+VS+ + C       LG ++H   ++     +    N+LL++Y+K G + +A  +F +M  + V+S++S+IA Y+  G A EA+ LF EM + 
Subjt:  LNPDPITLVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDR

Query:  KIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTML-
         I  +  TV   L  CA    L+EG+R+HE      L  DI VS AL+DMY KC S +EA  VF +M  KD++SW  ++ GYS N  +++++  FN +L 
Subjt:  KIKFNSVTVIGALQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTML-

Query:  LNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKM
             PD   +  +L +C+ L    +   +H Y++++G+ ++  V  SL+++Y+KCG+++ A  +F+++  KD+V W+ +IAGYG+ G G+EA+ LFN+M
Subjt:  LNNIKPDAVAMVKILVSCSDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKM

Query:  IETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIA
         +   I  +E++F+SLL ACSH+GL++EG + FN+M  E +I+P +EHY+ IVD+L R G+L +A  F++ MPIP    +WGALL    IHH  +L E  
Subjt:  IETSEIMPNEVTFLSLLSACSHAGLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIA

Query:  AKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCLQG
        A+ +F+L+P + GYY+L++NIYA  + W+   +LR  I ++GL+K  G S IE    V+ FVA D  +PE+++I   LR +   M +E YS  +K  L  
Subjt:  AKNLFKLDPNHAGYYILLSNIYAVEKNWDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCLQG

Query:  TDEI
         +E+
Subjt:  TDEI

AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.8e-12133.33Show/hide
Query:  VGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKPDNFTIPIVLKACAGLQALKFGKMVHG
        +G+  + F A+ L   +  Y  +    K+F+    +   +WN +L  Y      +  +  F  M    I+   P+  T   VL  CA    +  G  +HG
Subjt:  VGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKPDNFTIPIVLKACAGLQALKFGKMVHG

Query:  FVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKHLNPDPITLVSLASACTQLLNSKLGSS
         V  +  +D +  +  +L+ +YSKCG   +A ++F   S+ D V W  MI+GY Q+G  E+++ FF +M+    L PD IT  SL  + ++  N +    
Subjt:  FVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKHLNPDPITLVSLASACTQLLNSKLGSS

Query:  IHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDRKIKFNSVTVIGALQACAVASNLEEGR
        IH +I+R ++  D+ L ++L++ Y K   V  A N+F +  + DV+ ++++I+ Y HNG   ++L +F  ++  KI  N +T++  L    +   L+ GR
Subjt:  IHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDRKIKFNSVTVIGALQACAVASNLEEGR

Query:  RIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLLNNIKPDAVAMVKILVSCSDLGILQQA
         +H    +KG +   ++  A+IDMY KC     A ++FE++ K+D+VSW ++++  + +   S +++ F  M ++ I  D V++   L +C++L      
Subjt:  RIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLLNNIKPDAVAMVKILVSCSDLGILQQA

Query:  LCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMIETSEIMPNEVTFLSLLSACSHAGLIE
          +H +++K    ++++  ++LI++Y+KCG++  AM VF+ MK K+IV W++IIA  G  G+ +++L LF++M+E S I P+++TFL ++S+C H G ++
Subjt:  LCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMIETSEIMPNEVTFLSLLSACSHAGLIE

Query:  EGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAAKNLFKLDPNHAGYYILLSNIYAVEKN
        EG++ F  M ++Y I+P  EHY+ +VDL GR G L  A   V+ MP P    VWG LLGA  +H   EL E+A+  L  LDP+++GYY+L+SN +A  + 
Subjt:  EGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAAKNLFKLDPNHAGYYILLSNIYAVEKN

Query:  WDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESY
        W++  K+R+++KE+ ++K+ G S IE     H FV+ D  HPES HIY LL  L   ++ E Y
Subjt:  WDNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAGAAGGGAGCAACTAGTGAGTTTGTTTGAAAAATGTAAGTGTAGCAAAATTCTAGCTCAATTGCAATGTCTTACGCTCAAAGTTGGTCTTGCCCATGAT
AGTTTCTTCGCCACCAAACTTACTGCTCTGCATTCAAATTACACTTCTCTTGTGCAAGCACACAAGGTGTTTGAAGAAACGCCGAGCAGAACAGTACACCTGTGG
AATGCTCTTCTAAGAAGCTACTGCAACGTGAATCAATGGGAACAAACTTTATGTCTATTTCATGACATGGTATCTCGTGGAATTACAAAAGGGAAACCTGATAAT
TTCACAATTCCTATAGTTTTGAAGGCCTGTGCAGGGTTGCAGGCTCTTAAATTTGGAAAGATGGTTCATGGGTTCGTCAAGAGAAATAAGAAAATTGACATTGAC
ATGTTTGTTGCGGCTGCACTGATCGAACTGTACTCAAAATGTGGAACAATGCACGAGGCACTTCAAGTTTTCTTGGCATTTTCTCAACCAGATGTAGTTATGTGG
ACTTCTATGATAACCGGGTATGAGCAGAATGGAAATCCTGAAAAGGCAGTTGATTTTTTTTCTCAAATGGTGATGATCAAGCATTTAAATCCCGATCCTATCACG
CTTGTTAGCTTAGCATCGGCCTGTACTCAATTGTTGAATTCAAAGCTTGGAAGTAGCATACATGGTTTTATAATTAGACGAAATCTAGATTATGACTTATCTTTG
GCTAACTCGTTGTTGAATCTATATGCGAAGACGGGCTCTGTAAAGGCTGCAGCCAACTTGTTTGAGAAAATGCCGACAAAAGATGTAATTTCATGGAGCTCATTA
ATTGCTTGTTATTCTCACAATGGCGCAGCAGCGGAAGCACTAAATCTTTTCAATGAAATGATTGATAGGAAAATCAAATTCAATTCAGTTACAGTGATTGGTGCA
TTACAAGCATGTGCAGTTGCAAGTAACTTGGAAGAGGGAAGGAGGATCCATGAACTTACTACCAGGAAAGGTTTGGAGTTGGATATTTCAGTTTCCACAGCTCTA
ATTGATATGTACATGAAGTGCTTTTCACCTGAAGAAGCCATTGATGTATTTGAAAAAATGCCCAAAAAAGATGTGGTTTCTTGGGCTGCCTTGCTAAGTGGATAT
TCCCCAAATGGAATGTCATCCAAGTCGATGGAAACCTTCAATACCATGCTGTTGAACAACATAAAACCTGATGCTGTTGCCATGGTGAAAATTCTTGTTTCTTGT
TCAGATTTGGGCATTCTCCAACAAGCACTCTGCCTCCATGACTATGTTGTTAAAAGTGGCTTCACCAATAACATTTTTGTAGGAGCTTCACTCATAGAGCTGTAT
TCAAAATGTGGCAGCATAGTTAATGCCATGAAAGTATTTGAAGAAATGAAAGTAAAAGACATTGTTATTTGGAGTGCAATCATTGCAGGCTATGGAATCCAGGGA
CAAGGAAGAGAAGCTTTGAAATTGTTTAACAAAATGATAGAGACATCAGAAATTATGCCCAATGAAGTAACATTTCTCTCATTGTTATCTGCTTGTAGCCATGCT
GGTTTGATTGAAGAAGGGATCAAGATATTCAACATGATGTTGGATGAATACAGAATAAAACCCAACATGGAGCATTACAGCATCATTGTCGATCTTCTCGGTCGA
ATCGGAGAACTCAATAGGGCTCTAAACTTTGTTCAGAAAATGCCAATTCCAGCAGGACCTCATGTTTGGGGTGCCCTGTTAGGTGCTGCTTGTATTCATCACAAA
TCTGAATTGGGAGAGATTGCAGCAAAGAATCTCTTCAAGTTAGATCCTAATCATGCTGGGTACTATATACTGTTATCTAACATTTATGCTGTGGAGAAGAATTGG
GACAATGCAGGAAAACTTAGGAATATGATAAAGGAGAAGGGTTTGAAGAAGATGGTTGGGCAAAGTGTAATTGAAGCAGGAAATGAGGTCCATAGTTTTGTAGCG
GATGATAGATTACACCCAGAATCTGACCATATTTATAGGTTGCTAAGAATTTTGAATGTTAATATGAAAGATGAAAGTTATAGCTCTAATTCAAAATCTTGTTTA
CAGGGCACTGACGAAATTGTATAG
mRNA sequenceShow/hide mRNA sequence
TATGCTATTATGAATTGTGTTCTTGTATGTTTAAGCATATGTTGATGTTGAGATTGGCTGAAAAAGCATGCTATAGGATTGTGCATTCTTTTGAGATTGTGAATC
TGGTTTAGTGAGAAAAGTATCAGAGGACTGTTCTTGATAAAGGAAAATAAGGTGAATATCACATACATGAAAAGAAGGGAGCAACTAGTGAGTTTGTTTGAAAAA
TGTAAGTGTAGCAAAATTCTAGCTCAATTGCAATGTCTTACGCTCAAAGTTGGTCTTGCCCATGATAGTTTCTTCGCCACCAAACTTACTGCTCTGCATTCAAAT
TACACTTCTCTTGTGCAAGCACACAAGGTGTTTGAAGAAACGCCGAGCAGAACAGTACACCTGTGGAATGCTCTTCTAAGAAGCTACTGCAACGTGAATCAATGG
GAACAAACTTTATGTCTATTTCATGACATGGTATCTCGTGGAATTACAAAAGGGAAACCTGATAATTTCACAATTCCTATAGTTTTGAAGGCCTGTGCAGGGTTG
CAGGCTCTTAAATTTGGAAAGATGGTTCATGGGTTCGTCAAGAGAAATAAGAAAATTGACATTGACATGTTTGTTGCGGCTGCACTGATCGAACTGTACTCAAAA
TGTGGAACAATGCACGAGGCACTTCAAGTTTTCTTGGCATTTTCTCAACCAGATGTAGTTATGTGGACTTCTATGATAACCGGGTATGAGCAGAATGGAAATCCT
GAAAAGGCAGTTGATTTTTTTTCTCAAATGGTGATGATCAAGCATTTAAATCCCGATCCTATCACGCTTGTTAGCTTAGCATCGGCCTGTACTCAATTGTTGAAT
TCAAAGCTTGGAAGTAGCATACATGGTTTTATAATTAGACGAAATCTAGATTATGACTTATCTTTGGCTAACTCGTTGTTGAATCTATATGCGAAGACGGGCTCT
GTAAAGGCTGCAGCCAACTTGTTTGAGAAAATGCCGACAAAAGATGTAATTTCATGGAGCTCATTAATTGCTTGTTATTCTCACAATGGCGCAGCAGCGGAAGCA
CTAAATCTTTTCAATGAAATGATTGATAGGAAAATCAAATTCAATTCAGTTACAGTGATTGGTGCATTACAAGCATGTGCAGTTGCAAGTAACTTGGAAGAGGGA
AGGAGGATCCATGAACTTACTACCAGGAAAGGTTTGGAGTTGGATATTTCAGTTTCCACAGCTCTAATTGATATGTACATGAAGTGCTTTTCACCTGAAGAAGCC
ATTGATGTATTTGAAAAAATGCCCAAAAAAGATGTGGTTTCTTGGGCTGCCTTGCTAAGTGGATATTCCCCAAATGGAATGTCATCCAAGTCGATGGAAACCTTC
AATACCATGCTGTTGAACAACATAAAACCTGATGCTGTTGCCATGGTGAAAATTCTTGTTTCTTGTTCAGATTTGGGCATTCTCCAACAAGCACTCTGCCTCCAT
GACTATGTTGTTAAAAGTGGCTTCACCAATAACATTTTTGTAGGAGCTTCACTCATAGAGCTGTATTCAAAATGTGGCAGCATAGTTAATGCCATGAAAGTATTT
GAAGAAATGAAAGTAAAAGACATTGTTATTTGGAGTGCAATCATTGCAGGCTATGGAATCCAGGGACAAGGAAGAGAAGCTTTGAAATTGTTTAACAAAATGATA
GAGACATCAGAAATTATGCCCAATGAAGTAACATTTCTCTCATTGTTATCTGCTTGTAGCCATGCTGGTTTGATTGAAGAAGGGATCAAGATATTCAACATGATG
TTGGATGAATACAGAATAAAACCCAACATGGAGCATTACAGCATCATTGTCGATCTTCTCGGTCGAATCGGAGAACTCAATAGGGCTCTAAACTTTGTTCAGAAA
ATGCCAATTCCAGCAGGACCTCATGTTTGGGGTGCCCTGTTAGGTGCTGCTTGTATTCATCACAAATCTGAATTGGGAGAGATTGCAGCAAAGAATCTCTTCAAG
TTAGATCCTAATCATGCTGGGTACTATATACTGTTATCTAACATTTATGCTGTGGAGAAGAATTGGGACAATGCAGGAAAACTTAGGAATATGATAAAGGAGAAG
GGTTTGAAGAAGATGGTTGGGCAAAGTGTAATTGAAGCAGGAAATGAGGTCCATAGTTTTGTAGCGGATGATAGATTACACCCAGAATCTGACCATATTTATAGG
TTGCTAAGAATTTTGAATGTTAATATGAAAGATGAAAGTTATAGCTCTAATTCAAAATCTTGTTTACAGGGCACTGACGAAATTGTATAGTAAACATTCTCTTGT
AGGGGGTGTCTCGTAAATTGATGAAACTATTCAAATTGAGTGGTAAACTCACCTTTGGCCAGTGATTTGATTAAAATAGACTATTTACCATAATATATAATCCAT
CAGTTCGCTCAAGATTTGTGCTTAGCTTTGAGAATGTTAAAACACAACTAACAATTAGCAGATTTTTGCTTGGACCCCATTGTTGTGTTGTTGTATTCTTTACAC
AGGATGTGGAGGTGACTTTGCTCAGGTGGTTTGGGAGGGGTTTGGCTTTAGTTTTGCCAGCTCTCGAGGTTGCAAGGAGTCGATCAAGGACTCGAGGTGGAGCAG
AAAACATAATTATTTGCTATGTTTTGAGAATTCATAGTCTCTTGG
Protein sequenceShow/hide protein sequence
MKRREQLVSLFEKCKCSKILAQLQCLTLKVGLAHDSFFATKLTALHSNYTSLVQAHKVFEETPSRTVHLWNALLRSYCNVNQWEQTLCLFHDMVSRGITKGKPDN
FTIPIVLKACAGLQALKFGKMVHGFVKRNKKIDIDMFVAAALIELYSKCGTMHEALQVFLAFSQPDVVMWTSMITGYEQNGNPEKAVDFFSQMVMIKHLNPDPIT
LVSLASACTQLLNSKLGSSIHGFIIRRNLDYDLSLANSLLNLYAKTGSVKAAANLFEKMPTKDVISWSSLIACYSHNGAAAEALNLFNEMIDRKIKFNSVTVIGA
LQACAVASNLEEGRRIHELTTRKGLELDISVSTALIDMYMKCFSPEEAIDVFEKMPKKDVVSWAALLSGYSPNGMSSKSMETFNTMLLNNIKPDAVAMVKILVSC
SDLGILQQALCLHDYVVKSGFTNNIFVGASLIELYSKCGSIVNAMKVFEEMKVKDIVIWSAIIAGYGIQGQGREALKLFNKMIETSEIMPNEVTFLSLLSACSHA
GLIEEGIKIFNMMLDEYRIKPNMEHYSIIVDLLGRIGELNRALNFVQKMPIPAGPHVWGALLGAACIHHKSELGEIAAKNLFKLDPNHAGYYILLSNIYAVEKNW
DNAGKLRNMIKEKGLKKMVGQSVIEAGNEVHSFVADDRLHPESDHIYRLLRILNVNMKDESYSSNSKSCLQGTDEIV