CuGenDBv2

CmoCh04G024500 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh04G024500
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Cmo_Chr04:18145477..18148715
RNA-Seq Expression: CmoCh04G024500
Synteny: CmoCh04G024500
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6602075.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] (e value: 0.0e+00; % identity: 90.6)
Query:  MINVWTGELDLKPKLEKQVQKAIRNS----QNSVSSSCRFQTFLTLATELEMLCPIKTHLPPLVHATPPSYSYVRTCLQKDSIIPTFLLLMFTNCI----
        MINVWTGEL+LKPKL+KQVQKAIRNS     NSVSSS      L +        P         H  PPS      C    ++I  +    F N      
Subjt:  MINVWTGELDLKPKLEKQVQKAIRNS----QNSVSSSCRFQTFLTLATELEMLCPIKTHLPPLVHATPPSYSYVRTCLQKDSIIPTFLLLMFTNCI----

Query:  --------AKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKS
                 KELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKS
Subjt:  --------AKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKS

Query:  ETANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQP
        ETANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQP
Subjt:  ETANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQP

Query:  NVVSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSF
        NVVSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSF
Subjt:  NVVSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSF

Query:  NPSISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALAL
        NPSISENLALWNSMLSGYVINNC+QAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALAL
Subjt:  NPSISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALAL

Query:  FHRLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALT
        FHRLPRKDIIAWSGLILGCAQMGLNWLAFS+FKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALT
Subjt:  FHRLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALT

Query:  LFDCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEA
        LFDCIQEKDIVTWTGIIVGCGQNGRAAEAVRF+HEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEA
Subjt:  LFDCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEA

Query:  EKLIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE
        EKLIANMPFEPDQTTWRTLLGACGTRND KLINSVANGLLEA PDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE
Subjt:  EKLIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE

KAG7032773.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 0.0e+00; % identity: 93.26)
Query:  MINVWTGELDLKPKLEKQVQKAIRNS----QNSVSSSCRFQTFLTLATELEMLCPIKTHLPPLVHATPPSYSYV-----------RTCLQKDSIIPTFLL
        MINV TGEL+LKPKL+KQVQKAIRNS     NSVSSS      L +        P +   PP   A  P Y+ V           RT   KDSIIPTFLL
Subjt:  MINVWTGELDLKPKLEKQVQKAIRNS----QNSVSSSCRFQTFLTLATELEMLCPIKTHLPPLVHATPPSYSYV-----------RTCLQKDSIIPTFLL

Query:  LMFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSE
        LMFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSE
Subjt:  LMFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSE

Query:  TANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPN
        TANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPN
Subjt:  TANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPN

Query:  VVSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFN
        VVSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFN
Subjt:  VVSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFN

Query:  PSISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALF
        PSISENLALWNSMLSGYVINNC+QAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALF
Subjt:  PSISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALF

Query:  HRLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTL
        HRLPRKDIIAWSGLILGCAQMGLNWLAFS+FKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTL
Subjt:  HRLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTL

Query:  FDCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAE
        FDCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAE
Subjt:  FDCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAE

Query:  KLIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE
        KLIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE
Subjt:  KLIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE

XP_022964409.1 pentatricopeptide repeat-containing protein At4g08210 [Cucurbita moschata] (e value: 0.0e+00; % identity: 100)
Query:  MFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSET
        MFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSET
Subjt:  MFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSET

Query:  ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV
        ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV
Subjt:  ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV

Query:  VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNP
        VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNP
Subjt:  VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNP

Query:  SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFH
        SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFH
Subjt:  SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFH

Query:  RLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF
        RLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF
Subjt:  RLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF

Query:  DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK
        DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK
Subjt:  DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK

Query:  LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE
        LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE
Subjt:  LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE

XP_022990073.1 pentatricopeptide repeat-containing protein At4g08210 [Cucurbita maxima] (e value: 0.0e+00; % identity: 98.54)
Query:  MFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSET
        MFTNCIAKELRHCAQVRAFRRGNSLHAYLRK GCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTD GRPYEALRVYDDMPKSET
Subjt:  MFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSET

Query:  ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV
        ANGYMYSAVLKACGLVGDLDRGKLIQE IYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV
Subjt:  ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV

Query:  VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNP
        VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLG+GSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFN 
Subjt:  VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNP

Query:  SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFH
        SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTI+DSYTFGGALKVCINLL+PRVGFQVHGLIVTCGYELDYVVGSI+VDLYAKLGRIDDALALFH
Subjt:  SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFH

Query:  RLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF
        RLPRKDIIAWSGLILGCAQMGLNWLAFS+FKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF
Subjt:  RLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF

Query:  DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK
        DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEIT LGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK
Subjt:  DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK

Query:  LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE
        LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE
Subjt:  LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE

XP_023527633.1 pentatricopeptide repeat-containing protein At4g08210 [Cucurbita pepo subsp. pepo] (e value: 0.0e+00; % identity: 97.96)
Query:  MFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSET
        MFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEM+DRNIVTWTTLVSA+TDSGRPYEALRV++DMPKSET
Subjt:  MFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSET

Query:  ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV
        ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDM+VKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV
Subjt:  ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV

Query:  VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNP
        VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLG+GSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNP
Subjt:  VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNP

Query:  SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFH
        SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGT++DSYTFGGALKVCINLL+PRVGFQVHGLIVTCGYELDYVVGSI+VDLYAKLGRIDDALALFH
Subjt:  SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFH

Query:  RLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF
        RLPRKDIIAWSGLILGCAQMGLNWLAFS+FKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF
Subjt:  RLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF

Query:  DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK
        DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEAR+IFNSMKS+YGLEPHLEHYCCMVDLLALAGLPEEAEK
Subjt:  DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK

Query:  LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE
        LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVA+GLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE
Subjt:  LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S4E639 pentatricopeptide repeat-containing protein At4g08210 isoform X1 (e value: 0.0e+00; % identity: 88.06)
Query:  MFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSET
        M+ N IAKELRHCA VRAF+RGN++HAYLRKFG LNDVF+ANNLISMYAEF N+RDAEKVFDEM+DRNIVTWT++VSAFTD GRPYEA+R+Y+DMPKSET
Subjt:  MFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSET

Query:  ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV
         NGYMYSAVLKACG VGDL  GKLIQERIY  KLQ DTILMNSLMDM+VKCGSL+DAV+VFHNISRATTTTWNII+SGYSKAGLMVEAEKLFHCMP PNV
Subjt:  ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV

Query:  VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNP
        VSWNSMIAGFADNGSQRALEFVS+MH+K IKLDDFTFPCALKISALHGLLVIGKQ+H+YVTKLG+ SSCFTLSALIDMYSNCN L EAVKLFDQ SSFN 
Subjt:  VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNP

Query:  SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFH
        SIS+NLALWNSMLSGYVINNCDQAALNL+S IHCSG +LDSYTFGGALKVCINLL+ RVG Q+HGLIVTCGYELDYVVGSI+VDLYAKL  IDDALA+FH
Subjt:  SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFH

Query:  RLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF
        RLPRKDIIAWSGLI+GCAQ+GLNWLAFS+FKDMLEL +EIDHFVIST LKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF
Subjt:  RLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF

Query:  DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK
         C+QEKDIV+WTGIIVGCGQNG+AAEA+RFFHEM+QSG+ PNEITFLGVLSACRYAGL+EEAR+IFNSMKSVYGLEPHLEHYCCMVDLLA  GLPEEAEK
Subjt:  DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK

Query:  LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIESS
        LIANMPFEPDQTTWRTLLGACGTRNDTKLIN VA+GLLEATP+DPSTYV+LSNAYASLGMWH LSKAREA+K  GVK+AGLSWIE S
Subjt:  LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIESS

A0A5A7TZR3 Pentatricopeptide repeat-containing protein (e value: 0.0e+00; % identity: 88.06)
Query:  MFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSET
        M+ N IAKELRHCA VRAF+RGN++HAYLRKFG LNDVF+ANNLISMYAEF N+RDAEKVFDEM+DRNIVTWT++VSAFTD GRPYEA+R+Y+DMPKSET
Subjt:  MFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSET

Query:  ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV
         NGYMYSAVLKACG VGDL  GKLIQERIY  KLQ DTILMNSLMDM+VKCGSL+DAV+VFHNISRATTTTWNII+SGYSKAGLMVEAEKLFHCMP PNV
Subjt:  ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV

Query:  VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNP
        VSWNSMIAGFADNGSQRALEFVS+MH+K IKLDDFTFPCALKISALHGLLVIGKQ+H+YVTKLG+ SSCFTLSALIDMYSNCN L EAVKLFDQ SSFN 
Subjt:  VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNP

Query:  SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFH
        SIS+NLALWNSMLSGYVINNCDQAALNL+S IHCSG +LDSYTFGGALKVCINLL+ RVG Q+HGLIVTCGYELDYVVGSI+VDLYAKL  IDDALA+FH
Subjt:  SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFH

Query:  RLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF
        RLPRKDIIAWSGLI+GCAQ+GLNWLAFS+FKDMLEL +EIDHFVIST LKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF
Subjt:  RLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF

Query:  DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK
         C+QEKDIV+WTGIIVGCGQNG+AAEA+RFFHEM+QSG+ PNEITFLGVLSACRYAGL+EEAR+IFNSMKSVYGLEPHLEHYCCMVDLLA  GLPEEAEK
Subjt:  DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK

Query:  LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIESS
        LIANMPFEPDQTTWRTLLGACGTRNDTKLIN VA+GLLEATP+DPSTYV+LSNAYASLGMWH LSKAREA+K  GVK+AGLSWIE S
Subjt:  LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIESS

A0A6J1CK13 pentatricopeptide repeat-containing protein At4g08210 (e value: 0.0e+00; % identity: 88.21)
Query:  MFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSET
        M+ N IAKELRHCAQV+AF++GN++HAYLRK GCLNDVF+ANNL+SMYAEF NLRDAEKVFDEM DRNIVTWTT+VSAFTDSGRPYEALRVY+DMPKSE 
Subjt:  MFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSET

Query:  ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV
         NGYMYSAVLKACGLVGDLD GKLIQERIYG KLQ D ILMNSLMDM VKCGSL+DAVKVFHNIS ATTT+WNIIISGYSKAGLM+EAEKLFHCMPQPNV
Subjt:  ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV

Query:  VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNP
        VSWNSMIAGFADNGSQRALEFVS+M+ +G+KLD FTFPCALKISALHGLL++GKQIH+YVTKLG+ SSCFTLSALIDMYSNCNGL EAVKLFDQHS+FN 
Subjt:  VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNP

Query:  SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFH
        SIS+NLALWNSMLSGYVINNCD+AALNLIS IHCSG +LDSYTFGGALKVCIN+L+ RV  QVHGLIVTCGYELDYV+GSI+VDLYAKLG IDDAL LF 
Subjt:  SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFH

Query:  RLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF
        RLPRKDIIAWSGLI+GCAQMGLNWLAFS+F+DM+  AHEID FVISTTLKVCSNLASLRSGKQVHAFCVK GYEMEGFTITSLLDMYSKCGEIEDALTLF
Subjt:  RLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF

Query:  DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK
         CIQEKDIV+WTGIIVGCGQNGRAAEAVRFFHEMIQ GLNPNEITFLGVLSACRYAGL+EEAR+IFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK
Subjt:  DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK

Query:  LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIESS
        ++ANMPF+PDQTTWRTLLGA GTRND KLINSVA+GLLEATP+DPSTYV+LSNAYASLGMW  LSKAREAAK VG KRAGLSWIE S
Subjt:  LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIESS

A0A6J1HN25 pentatricopeptide repeat-containing protein At4g08210 (e value: 0.0e+00; % identity: 100)
Query:  MFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSET
        MFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSET
Subjt:  MFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSET

Query:  ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV
        ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV
Subjt:  ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV

Query:  VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNP
        VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNP
Subjt:  VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNP

Query:  SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFH
        SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFH
Subjt:  SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFH

Query:  RLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF
        RLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF
Subjt:  RLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF

Query:  DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK
        DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK
Subjt:  DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK

Query:  LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE
        LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE
Subjt:  LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE

A0A6J1JS73 pentatricopeptide repeat-containing protein At4g08210 (e value: 0.0e+00; % identity: 98.54)
Query:  MFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSET
        MFTNCIAKELRHCAQVRAFRRGNSLHAYLRK GCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTD GRPYEALRVYDDMPKSET
Subjt:  MFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSET

Query:  ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV
        ANGYMYSAVLKACGLVGDLDRGKLIQE IYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV
Subjt:  ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNV

Query:  VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNP
        VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLG+GSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFN 
Subjt:  VSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNP

Query:  SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFH
        SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTI+DSYTFGGALKVCINLL+PRVGFQVHGLIVTCGYELDYVVGSI+VDLYAKLGRIDDALALFH
Subjt:  SISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFH

Query:  RLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF
        RLPRKDIIAWSGLILGCAQMGLNWLAFS+FKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF
Subjt:  RLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLF

Query:  DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK
        DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEIT LGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK
Subjt:  DCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEK

Query:  LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE
        LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE
Subjt:  LIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWIE

SwissProt top hits (columns: e value, % identity, alignment)
Q9LU94 Putative pentatricopeptide repeat-containing protein At3g25970 (e value: 5.3e-105; % identity: 31.75)
Query:  AQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKS-ETANGYMYSAVLKA
        + + +F++ +  H Y  K G ++D++++N ++  Y +F  L  A  +FDEM  R+ V+W T++S +T  G+  +A  ++  M +S    +GY +S +LK 
Subjt:  AQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKS-ETANGYMYSAVLKA

Query:  CGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNVVSWNSMIAGFAD
           V   D G+ +   +  G  + +  + +SL+DMY KC  + DA + F  IS                               +PN VSWN++IAGF  
Subjt:  CGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNVVSWNSMIAGFAD

Query:  -NGSQRALEFVSLMHRK-GIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNPSISENLALWN
            + A   + LM  K  + +D  TF   L +        + KQ+H+ V KLG        +A+I  Y++C  +++A ++FD         S++L  WN
Subjt:  -NGSQRALEFVSLMHRK-GIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNPSISENLALWN

Query:  SMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKL--GRIDDALALFHRLPRKDII
        SM++G+  +   ++A  L   +       D YT+ G L  C    +   G  +HG+++  G E      + ++ +Y +   G ++DAL+LF  L  KD+I
Subjt:  SMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKL--GRIDDALALFHRLPRKDII

Query:  AWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLFDCIQEK-D
        +W+ +I G AQ GL+  A   F  +     ++D +  S  L+ CS+LA+L+ G+Q+HA   KSG+    F I+SL+ MYSKCG IE A   F  I  K  
Subjt:  AWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLFDCIQEK-D

Query:  IVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKLIANMPF
         V W  +I+G  Q+G    ++  F +M    +  + +TF  +L+AC + GLI+E   + N M+ VY ++P +EHY   VDLL  AGL  +A++LI +MP 
Subjt:  IVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKLIANMPF

Query:  EPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRA-GLSWIESSHAIGKVNTHEKEWPL
         PD    +T LG C    + ++   VAN LLE  P+D  TYVSLS+ Y+ L  W   +  ++  K+ GVK+  G SWIE  + +   N  ++  PL
Subjt:  EPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRA-GLSWIESSHAIGKVNTHEKEWPL

Q9SIT7 Pentatricopeptide repeat-containing protein At2g13600 (e value: 1.4e-100; % identity: 32.98)
Query:  MSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSETANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHN
        M+ ++ +     +S+FTDS  P+   ++ D   KS+ +  Y+                 + +   +       +  + N L+D Y KCGSL D  +VF  
Subjt:  MSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSETANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHN

Query:  ISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNVVSWNSMIAGFADNG-SQRALEFVSLMHRKGIKLDDFTFPCALKISALHGL--LVIGKQIHSYV
        + +    TWN +++G +K G + EA+ LF  MP+ +  +WNSM++GFA +   + AL + ++MH++G  L++++F  A  +SA  GL  +  G Q+HS +
Subjt:  ISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNVVSWNSMIAGFADNG-SQRALEFVSLMHRKGIKLDDFTFPCALKISALHGL--LVIGKQIHSYV

Query:  TKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNPSISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVG
         K    S  +  SAL+DMYS C  + +A ++FD+          N+  WNS+++ +  N     AL++   +  S    D  T    +  C +L   +VG
Subjt:  TKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNPSISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVG

Query:  FQVHGLIV-TCGYELDYVVGSIVVDLYAKLGRIDDALALFHRLPRKDIIA-------------------------------WSGLILGCAQMGLNWLAFS
         +VHG +V       D ++ +  VD+YAK  RI +A  +F  +P +++IA                               W+ LI G  Q G N  A S
Subjt:  FQVHGLIV-TCGYELDYVVGSIVVDLYAKLGRIDDALALFHRLPRKDIIA-------------------------------WSGLILGCAQMGLNWLAFS

Query:  VFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEG------FTITSLLDMYSKCGEIEDALTLFDCIQEKDIVTWTGIIVGCGQNG
        +F  +   +    H+  +  LK C++LA L  G Q H   +K G++ +       F   SL+DMY KCG +E+   +F  + E+D V+W  +I+G  QNG
Subjt:  VFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEG------FTITSLLDMYSKCGEIEDALTLFDCIQEKDIVTWTGIIVGCGQNG

Query:  RAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKLIANMPFEPDQTTWRTLLGACG
           EA+  F EM++SG  P+ IT +GVLSAC +AG +EE R+ F+SM   +G+ P  +HY CMVDLL  AG  EEA+ +I  MP +PD   W +LL AC 
Subjt:  RAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKLIANMPFEPDQTTWRTLLGACG

Query:  TRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGV-KRAGLSWIE
           +  L   VA  LLE  P +   YV LSN YA LG W ++   R++ +K GV K+ G SWI+
Subjt:  TRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGV-KRAGLSWIE

Q9SMZ2 Pentatricopeptide repeat-containing protein At4g33170 (e value: 1.9e-102; % identity: 33.15)
Query:  SLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVY-----------------------DDMPKSET
        S H Y  K G   D F+A  L+++Y +F  +++ + +F+EM  R++V W  ++ A+ + G   EA+ +                        DD    + 
Subjt:  SLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVY-----------------------DDMPKSET

Query:  ---ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGS------LSDAVKV--------FHNIS-----RATTTTWNIIISG
           ANG   S+V +       + R K + E ++ G+          +++  V+C        L+ AVKV         H ++         T  N +I+ 
Subjt:  ---ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGS------LSDAVKV--------FHNIS-----RATTTTWNIIISG

Query:  YSKAGLMVEAEKLFHCMPQPNVVSWNSMIAGFADNGSQ-RALEFVSLMHRKGIKLDDFTFPCALK-ISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALI
        Y K      A  +F  M + +++SWNS+IAG A NG +  A+     + R G+K D +T    LK  S+L   L + KQ+H +  K+ + S  F  +ALI
Subjt:  YSKAGLMVEAEKLFHCMPQPNVVSWNSMIAGFADNGSQ-RALEFVSLMHRKGIKLDDFTFPCALK-ISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALI

Query:  DMYSNCNGLTEAVKLFDQHSSFNPSISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDY
        D YS    + EA  LF++H+        +L  WN+M++GY  ++     L L + +H  G   D +T     K C  L     G QVH   +  GY+LD 
Subjt:  DMYSNCNGLTEAVKLFDQHSSFNPSISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDY

Query:  VVGSIVVDLYAKLGRIDDALALFHRLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEME
         V S ++D+Y K G +  A   F  +P  D +AW+ +I GC + G    AF VF  M  +    D F I+T  K  S L +L  G+Q+HA  +K     +
Subjt:  VVGSIVVDLYAKLGRIDDALALFHRLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEME

Query:  GFTITSLLDMYSKCGEIEDALTLFDCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLE
         F  TSL+DMY+KCG I+DA  LF  I+  +I  W  ++VG  Q+G   E ++ F +M   G+ P+++TF+GVLSAC ++GL+ EA     SM   YG++
Subjt:  GFTITSLLDMYSKCGEIEDALTLFDCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLE

Query:  PHLEHYCCMVDLLALAGLPEEAEKLIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGV
        P +EHY C+ D L  AGL ++AE LI +M  E   + +RTLL AC  + DT+    VA  LLE  P D S YV LSN YA+   W  +  AR   K   V
Subjt:  PHLEHYCCMVDLLALAGLPEEAEKLIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGV

Query:  KR-AGLSWIESSHAI
        K+  G SWIE  + I
Subjt:  KR-AGLSWIESSHAI

Q9SUF9 Pentatricopeptide repeat-containing protein At4g08210; e value 3.1e-230; % identity 57.27
Query:  LLMFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDM--P
        ++M    IA  LRHC +V+AF+RG S+ A++ K G   +VFIANN+ISMY +F  L DA KVFDEMS+RNIVTWTT+VS +T  G+P +A+ +Y  M   
Subjt:  LLMFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDM--P

Query:  KSETANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMP
        + E AN +MYSAVLKACGLVGD+  G L+ ERI    L+GD +LMNS++DMYVK G L +A   F  I R ++T+WN +ISGY KAGLM EA  LFH MP
Subjt:  KSETANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMP

Query:  QPNVVSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHS
        QPNVVSWN +I+GF D GS RALEF+  M R+G+ LD F  PC LK  +  GLL +GKQ+H  V K G  SS F +SALIDMYSNC  L  A  +F Q  
Subjt:  QPNVVSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHS

Query:  SFNPSISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDAL
            +++ ++A+WNSMLSG++IN  ++AAL L+  I+ S    DSYT  GALK+CIN +N R+G QVH L+V  GYELDY+VGSI+VDL+A +G I DA 
Subjt:  SFNPSISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDAL

Query:  ALFHRLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDA
         LFHRLP KDIIA+SGLI GC + G N LAF +F+++++L  + D F++S  LKVCS+LASL  GKQ+H  C+K GYE E  T T+L+DMY KCGEI++ 
Subjt:  ALFHRLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDA

Query:  LTLFDCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPE
        + LFD + E+D+V+WTGIIVG GQNGR  EA R+FH+MI  G+ PN++TFLG+LSACR++GL+EEAR+   +MKS YGLEP+LEHY C+VDLL  AGL +
Subjt:  LTLFDCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPE

Query:  EAEKLIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWI
        EA +LI  MP EPD+T W +LL ACGT  +  L+  +A  LL+  PDDPS Y SLSNAYA+LGMW  LSK REAAKK+G K +G+SWI
Subjt:  EAEKLIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWI

Q9SVA5 Pentatricopeptide repeat-containing protein At4g39530; e value 8.5e-103; % identity 32.73
Query:  LHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRV-YDDMPKSETANGYMYSAVLKACGLVGDLDRGK
        L ++L K G   DV++   LI  Y +  N+  A  VFD + +++ VTWTT++S     GR Y +L++ Y  M  +   +GY+ S VL AC ++  L+ GK
Subjt:  LHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRV-YDDMPKSETANGYMYSAVLKACGLVGDLDRGK

Query:  LIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNVVSWNSMIAGFADNG-SQRALEFV
         I   I    L+ D  LMN L+D YVKCG                                ++ A KLF+ MP  N++SW ++++G+  N   + A+E  
Subjt:  LIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNVVSWNSMIAGFADNG-SQRALEFV

Query:  SLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNPSISENLALWNSMLSGYVINNCD
        + M + G+K D +     L   A    L  G Q+H+Y  K   G+  +  ++LIDMY+ C+ LT+A K+FD  +      + ++ L+N+M+ GY      
Subjt:  SLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNPSISENLALWNSMLSGYVINNCD

Query:  ---QAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFHRLPRKDIIAWSGLILGCAQ
             ALN+   +          TF   L+   +L +  +  Q+HGL+   G  LD   GS ++D+Y+    + D+  +F  +  KD++ W+ +  G  Q
Subjt:  ---QAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFHRLPRKDIIAWSGLILGCAQ

Query:  MGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLFDCIQEKDIVTWTGIIVGCG
           N  A ++F ++       D F  +  +    NLAS++ G++ H   +K G E   +   +LLDMY+KCG  EDA   FD    +D+V W  +I    
Subjt:  MGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLFDCIQEKDIVTWTGIIVGCG

Query:  QNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKLIANMPFEPDQTTWRTLLG
         +G   +A++   +M+  G+ PN ITF+GVLSAC +AGL+E+    F  M   +G+EP  EHY CMV LL  AG   +A +LI  MP +P    WR+LL 
Subjt:  QNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKLIANMPFEPDQTTWRTLLG

Query:  ACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVG-VKRAGLSWI
         C    + +L    A   + + P D  ++  LSN YAS GMW    K RE  K  G VK  G SWI
Subjt:  ACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVG-VKRAGLSWI

Arabidopsis top hits (e value, % identity, alignment)
AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein; e value 9.6e-102; % identity 32.98
Query:  MSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSETANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHN
        M+ ++ +     +S+FTDS  P+   ++ D   KS+ +  Y+                 + +   +       +  + N L+D Y KCGSL D  +VF  
Subjt:  MSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSETANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHN

Query:  ISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNVVSWNSMIAGFADNG-SQRALEFVSLMHRKGIKLDDFTFPCALKISALHGL--LVIGKQIHSYV
        + +    TWN +++G +K G + EA+ LF  MP+ +  +WNSM++GFA +   + AL + ++MH++G  L++++F  A  +SA  GL  +  G Q+HS +
Subjt:  ISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNVVSWNSMIAGFADNG-SQRALEFVSLMHRKGIKLDDFTFPCALKISALHGL--LVIGKQIHSYV

Query:  TKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNPSISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVG
         K    S  +  SAL+DMYS C  + +A ++FD+          N+  WNS+++ +  N     AL++   +  S    D  T    +  C +L   +VG
Subjt:  TKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNPSISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVG

Query:  FQVHGLIV-TCGYELDYVVGSIVVDLYAKLGRIDDALALFHRLPRKDIIA-------------------------------WSGLILGCAQMGLNWLAFS
         +VHG +V       D ++ +  VD+YAK  RI +A  +F  +P +++IA                               W+ LI G  Q G N  A S
Subjt:  FQVHGLIV-TCGYELDYVVGSIVVDLYAKLGRIDDALALFHRLPRKDIIA-------------------------------WSGLILGCAQMGLNWLAFS

Query:  VFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEG------FTITSLLDMYSKCGEIEDALTLFDCIQEKDIVTWTGIIVGCGQNG
        +F  +   +    H+  +  LK C++LA L  G Q H   +K G++ +       F   SL+DMY KCG +E+   +F  + E+D V+W  +I+G  QNG
Subjt:  VFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEG------FTITSLLDMYSKCGEIEDALTLFDCIQEKDIVTWTGIIVGCGQNG

Query:  RAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKLIANMPFEPDQTTWRTLLGACG
           EA+  F EM++SG  P+ IT +GVLSAC +AG +EE R+ F+SM   +G+ P  +HY CMVDLL  AG  EEA+ +I  MP +PD   W +LL AC 
Subjt:  RAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKLIANMPFEPDQTTWRTLLGACG

Query:  TRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGV-KRAGLSWIE
           +  L   VA  LLE  P +   YV LSN YA LG W ++   R++ +K GV K+ G SWI+
Subjt:  TRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGV-KRAGLSWIE

AT3G25970.1 Pentatricopeptide repeat (PPR) superfamily protein; e value 3.8e-106; % identity 31.75
Query:  AQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKS-ETANGYMYSAVLKA
        + + +F++ +  H Y  K G ++D++++N ++  Y +F  L  A  +FDEM  R+ V+W T++S +T  G+  +A  ++  M +S    +GY +S +LK 
Subjt:  AQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKS-ETANGYMYSAVLKA

Query:  CGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNVVSWNSMIAGFAD
           V   D G+ +   +  G  + +  + +SL+DMY KC  + DA + F  IS                               +PN VSWN++IAGF  
Subjt:  CGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNVVSWNSMIAGFAD

Query:  -NGSQRALEFVSLMHRK-GIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNPSISENLALWN
            + A   + LM  K  + +D  TF   L +        + KQ+H+ V KLG        +A+I  Y++C  +++A ++FD         S++L  WN
Subjt:  -NGSQRALEFVSLMHRK-GIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNPSISENLALWN

Query:  SMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKL--GRIDDALALFHRLPRKDII
        SM++G+  +   ++A  L   +       D YT+ G L  C    +   G  +HG+++  G E      + ++ +Y +   G ++DAL+LF  L  KD+I
Subjt:  SMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKL--GRIDDALALFHRLPRKDII

Query:  AWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLFDCIQEK-D
        +W+ +I G AQ GL+  A   F  +     ++D +  S  L+ CS+LA+L+ G+Q+HA   KSG+    F I+SL+ MYSKCG IE A   F  I  K  
Subjt:  AWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLFDCIQEK-D

Query:  IVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKLIANMPF
         V W  +I+G  Q+G    ++  F +M    +  + +TF  +L+AC + GLI+E   + N M+ VY ++P +EHY   VDLL  AGL  +A++LI +MP 
Subjt:  IVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKLIANMPF

Query:  EPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRA-GLSWIESSHAIGKVNTHEKEWPL
         PD    +T LG C    + ++   VAN LLE  P+D  TYVSLS+ Y+ L  W   +  ++  K+ GVK+  G SWIE  + +   N  ++  PL
Subjt:  EPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRA-GLSWIESSHAIGKVNTHEKEWPL

AT4G08210.1 Pentatricopeptide repeat (PPR-like) superfamily protein; e value 2.2e-231; % identity 57.27
Query:  LLMFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDM--P
        ++M    IA  LRHC +V+AF+RG S+ A++ K G   +VFIANN+ISMY +F  L DA KVFDEMS+RNIVTWTT+VS +T  G+P +A+ +Y  M   
Subjt:  LLMFTNCIAKELRHCAQVRAFRRGNSLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDM--P

Query:  KSETANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMP
        + E AN +MYSAVLKACGLVGD+  G L+ ERI    L+GD +LMNS++DMYVK G L +A   F  I R ++T+WN +ISGY KAGLM EA  LFH MP
Subjt:  KSETANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMP

Query:  QPNVVSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHS
        QPNVVSWN +I+GF D GS RALEF+  M R+G+ LD F  PC LK  +  GLL +GKQ+H  V K G  SS F +SALIDMYSNC  L  A  +F Q  
Subjt:  QPNVVSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHS

Query:  SFNPSISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDAL
            +++ ++A+WNSMLSG++IN  ++AAL L+  I+ S    DSYT  GALK+CIN +N R+G QVH L+V  GYELDY+VGSI+VDL+A +G I DA 
Subjt:  SFNPSISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDAL

Query:  ALFHRLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDA
         LFHRLP KDIIA+SGLI GC + G N LAF +F+++++L  + D F++S  LKVCS+LASL  GKQ+H  C+K GYE E  T T+L+DMY KCGEI++ 
Subjt:  ALFHRLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDA

Query:  LTLFDCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPE
        + LFD + E+D+V+WTGIIVG GQNGR  EA R+FH+MI  G+ PN++TFLG+LSACR++GL+EEAR+   +MKS YGLEP+LEHY C+VDLL  AGL +
Subjt:  LTLFDCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPE

Query:  EAEKLIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWI
        EA +LI  MP EPD+T W +LL ACGT  +  L+  +A  LL+  PDDPS Y SLSNAYA+LGMW  LSK REAAKK+G K +G+SWI
Subjt:  EAEKLIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWI

AT4G33170.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value 1.3e-103; % identity 33.15
Query:  SLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVY-----------------------DDMPKSET
        S H Y  K G   D F+A  L+++Y +F  +++ + +F+EM  R++V W  ++ A+ + G   EA+ +                        DD    + 
Subjt:  SLHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVY-----------------------DDMPKSET

Query:  ---ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGS------LSDAVKV--------FHNIS-----RATTTTWNIIISG
           ANG   S+V +       + R K + E ++ G+          +++  V+C        L+ AVKV         H ++         T  N +I+ 
Subjt:  ---ANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKLQGDTILMNSLMDMYVKCGS------LSDAVKV--------FHNIS-----RATTTTWNIIISG

Query:  YSKAGLMVEAEKLFHCMPQPNVVSWNSMIAGFADNGSQ-RALEFVSLMHRKGIKLDDFTFPCALK-ISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALI
        Y K      A  +F  M + +++SWNS+IAG A NG +  A+     + R G+K D +T    LK  S+L   L + KQ+H +  K+ + S  F  +ALI
Subjt:  YSKAGLMVEAEKLFHCMPQPNVVSWNSMIAGFADNGSQ-RALEFVSLMHRKGIKLDDFTFPCALK-ISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALI

Query:  DMYSNCNGLTEAVKLFDQHSSFNPSISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDY
        D YS    + EA  LF++H+        +L  WN+M++GY  ++     L L + +H  G   D +T     K C  L     G QVH   +  GY+LD 
Subjt:  DMYSNCNGLTEAVKLFDQHSSFNPSISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDY

Query:  VVGSIVVDLYAKLGRIDDALALFHRLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEME
         V S ++D+Y K G +  A   F  +P  D +AW+ +I GC + G    AF VF  M  +    D F I+T  K  S L +L  G+Q+HA  +K     +
Subjt:  VVGSIVVDLYAKLGRIDDALALFHRLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEME

Query:  GFTITSLLDMYSKCGEIEDALTLFDCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLE
         F  TSL+DMY+KCG I+DA  LF  I+  +I  W  ++VG  Q+G   E ++ F +M   G+ P+++TF+GVLSAC ++GL+ EA     SM   YG++
Subjt:  GFTITSLLDMYSKCGEIEDALTLFDCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLE

Query:  PHLEHYCCMVDLLALAGLPEEAEKLIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGV
        P +EHY C+ D L  AGL ++AE LI +M  E   + +RTLL AC  + DT+    VA  LLE  P D S YV LSN YA+   W  +  AR   K   V
Subjt:  PHLEHYCCMVDLLALAGLPEEAEKLIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGV

Query:  KR-AGLSWIESSHAI
        K+  G SWIE  + I
Subjt:  KR-AGLSWIESSHAI

AT4G39530.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value 6.0e-104; % identity 32.73
Query:  LHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRV-YDDMPKSETANGYMYSAVLKACGLVGDLDRGK
        L ++L K G   DV++   LI  Y +  N+  A  VFD + +++ VTWTT++S     GR Y +L++ Y  M  +   +GY+ S VL AC ++  L+ GK
Subjt:  LHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRV-YDDMPKSETANGYMYSAVLKACGLVGDLDRGK

Query:  LIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNVVSWNSMIAGFADNG-SQRALEFV
         I   I    L+ D  LMN L+D YVKCG                                ++ A KLF+ MP  N++SW ++++G+  N   + A+E  
Subjt:  LIQERIYGGKLQGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNVVSWNSMIAGFADNG-SQRALEFV

Query:  SLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNPSISENLALWNSMLSGYVINNCD
        + M + G+K D +     L   A    L  G Q+H+Y  K   G+  +  ++LIDMY+ C+ LT+A K+FD  +      + ++ L+N+M+ GY      
Subjt:  SLMHRKGIKLDDFTFPCALKISALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNPSISENLALWNSMLSGYVINNCD

Query:  ---QAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFHRLPRKDIIAWSGLILGCAQ
             ALN+   +          TF   L+   +L +  +  Q+HGL+   G  LD   GS ++D+Y+    + D+  +F  +  KD++ W+ +  G  Q
Subjt:  ---QAALNLISYIHCSGTILDSYTFGGALKVCINLLNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFHRLPRKDIIAWSGLILGCAQ

Query:  MGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLFDCIQEKDIVTWTGIIVGCG
           N  A ++F ++       D F  +  +    NLAS++ G++ H   +K G E   +   +LLDMY+KCG  EDA   FD    +D+V W  +I    
Subjt:  MGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQVHAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLFDCIQEKDIVTWTGIIVGCG

Query:  QNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKLIANMPFEPDQTTWRTLLG
         +G   +A++   +M+  G+ PN ITF+GVLSAC +AGL+E+    F  M   +G+EP  EHY CMV LL  AG   +A +LI  MP +P    WR+LL 
Subjt:  QNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYGLEPHLEHYCCMVDLLALAGLPEEAEKLIANMPFEPDQTTWRTLLG

Query:  ACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVG-VKRAGLSWI
         C    + +L    A   + + P D  ++  LSN YAS GMW    K RE  K  G VK  G SWI
Subjt:  ACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVG-VKRAGLSWI


Sequences
CDS sequence
ATGATCAATGTCTGGACTGGAGAGCTTGATTTGAAGCCGAAGTTGGAGAAGCAAGTGCAGAAAGCCATAAGAAACTCGCAAAACTCTGTTTCCTCTTCCTGCCGATTTCA
GACATTCTTAACATTGGCAACAGAGCTCGAAATGCTCTGCCCCATCAAAACCCACCTTCCGCCGCTTGTCCATGCTACACCACCGTCATATTCTTACGTACGAACATGCC
TGCAGAAGGATAGTATAATTCCCACGTTTTTGTTACTCATGTTTACGAACTGCATAGCGAAGGAATTGCGCCATTGCGCGCAAGTTAGAGCTTTCAGGCGAGGCAATTCC
CTTCATGCTTACTTGAGGAAATTTGGGTGTTTGAACGATGTGTTTATTGCGAACAATTTGATTTCGATGTACGCGGAGTTTTCGAATTTACGAGATGCAGAGAAGGTGTT
TGATGAAATGTCTGACAGAAATATTGTTACTTGGACCACCCTGGTTTCTGCATTTACTGATAGTGGAAGACCTTATGAGGCACTCCGAGTGTATGATGATATGCCGAAAT
CTGAGACGGCCAATGGGTACATGTATTCCGCGGTTTTGAAGGCATGTGGGCTTGTGGGTGATTTGGATCGGGGTAAGTTAATTCAAGAAAGAATATATGGGGGTAAGTTG
CAGGGTGATACTATTTTGATGAATTCTCTTATGGATATGTATGTGAAATGTGGAAGCTTGAGTGATGCGGTGAAGGTTTTTCACAATATTTCACGTGCGACCACAACTAC
TTGGAATATCATTATTTCTGGGTATAGTAAGGCCGGTTTGATGGTGGAGGCTGAAAAACTTTTTCATTGTATGCCACAGCCAAATGTTGTGTCTTGGAACAGCATGATTG
CTGGCTTTGCAGACAATGGGAGTCAGCGGGCGTTGGAATTTGTGTCCTTGATGCACAGAAAGGGAATTAAGCTTGATGATTTCACATTTCCATGTGCTTTAAAGATCAGT
GCGCTTCATGGGTTATTAGTCATCGGGAAACAAATTCATTCCTATGTCACCAAATTGGGGCATGGATCTAGTTGTTTTACACTGTCTGCTCTGATTGATATGTATTCAAA
TTGCAATGGCCTGACCGAAGCAGTCAAGTTGTTTGACCAACACTCTTCCTTCAACCCTTCCATTTCTGAGAACCTGGCACTGTGGAACTCGATGCTCTCAGGATATGTTA
TCAACAACTGTGACCAAGCTGCTTTGAATTTGATTTCATACATCCATTGCTCGGGTACGATATTGGACTCTTACACCTTTGGTGGTGCTTTAAAGGTTTGCATCAACTTA
TTAAATCCGAGGGTTGGTTTTCAAGTACATGGTCTTATTGTCACTTGTGGTTATGAATTGGATTATGTTGTTGGAAGCATTGTTGTGGATCTCTATGCAAAACTAGGACG
CATCGACGATGCATTAGCACTGTTCCATAGGCTTCCAAGGAAAGATATCATAGCCTGGTCCGGTTTGATCCTGGGGTGTGCTCAAATGGGATTGAACTGGTTAGCTTTCT
CAGTGTTCAAAGATATGCTCGAGTTGGCTCATGAAATAGATCATTTTGTCATTTCAACCACTCTGAAAGTCTGCTCCAATTTAGCATCTCTTAGAAGTGGAAAGCAGGTC
CATGCATTCTGTGTCAAAAGTGGGTATGAAATGGAGGGGTTCACAATAACATCCCTTCTTGATATGTATTCAAAATGCGGTGAAATTGAGGATGCATTAACATTGTTTGA
TTGTATACAAGAAAAAGACATAGTAACTTGGACTGGGATCATTGTAGGATGTGGACAAAATGGAAGGGCGGCAGAAGCTGTCAGGTTTTTTCATGAGATGATTCAATCAG
GGCTAAATCCAAATGAAATCACCTTTCTAGGGGTGCTTTCTGCATGTCGATATGCTGGTTTGATCGAAGAGGCACGAAACATATTTAATTCCATGAAATCTGTATATGGA
CTAGAGCCTCATTTAGAGCATTATTGCTGCATGGTTGATCTTCTTGCTCTAGCTGGGCTACCTGAAGAAGCTGAAAAATTGATAGCAAATATGCCGTTTGAGCCAGATCA
GACCACATGGCGCACTTTGCTAGGGGCGTGTGGAACTCGTAACGATACCAAGCTTATTAACAGTGTTGCTAATGGCCTTCTTGAAGCTACACCAGATGACCCTTCAACGT
ACGTGTCACTTTCAAATGCTTATGCGTCGCTGGGGATGTGGCATAACCTAAGCAAAGCGAGGGAGGCTGCCAAAAAGGTGGGAGTCAAAAGAGCTGGGTTAAGCTGGATT
GAGTCATCCCATGCAATTGGTAAAGTGAACACACATGAAAAGGAGTGGCCGCTCTTGCTACCAAAGTTCAAAGAGACTCCACCTTTATGTCTATGA
mRNA sequence
ATGATCAATGTCTGGACTGGAGAGCTTGATTTGAAGCCGAAGTTGGAGAAGCAAGTGCAGAAAGCCATAAGAAACTCGCAAAACTCTGTTTCCTCTTCCTGCCGATTTCA
GACATTCTTAACATTGGCAACAGAGCTCGAAATGCTCTGCCCCATCAAAACCCACCTTCCGCCGCTTGTCCATGCTACACCACCGTCATATTCTTACGTACGAACATGCC
TGCAGAAGGATAGTATAATTCCCACGTTTTTGTTACTCATGTTTACGAACTGCATAGCGAAGGAATTGCGCCATTGCGCGCAAGTTAGAGCTTTCAGGCGAGGCAATTCC
CTTCATGCTTACTTGAGGAAATTTGGGTGTTTGAACGATGTGTTTATTGCGAACAATTTGATTTCGATGTACGCGGAGTTTTCGAATTTACGAGATGCAGAGAAGGTGTT
TGATGAAATGTCTGACAGAAATATTGTTACTTGGACCACCCTGGTTTCTGCATTTACTGATAGTGGAAGACCTTATGAGGCACTCCGAGTGTATGATGATATGCCGAAAT
CTGAGACGGCCAATGGGTACATGTATTCCGCGGTTTTGAAGGCATGTGGGCTTGTGGGTGATTTGGATCGGGGTAAGTTAATTCAAGAAAGAATATATGGGGGTAAGTTG
CAGGGTGATACTATTTTGATGAATTCTCTTATGGATATGTATGTGAAATGTGGAAGCTTGAGTGATGCGGTGAAGGTTTTTCACAATATTTCACGTGCGACCACAACTAC
TTGGAATATCATTATTTCTGGGTATAGTAAGGCCGGTTTGATGGTGGAGGCTGAAAAACTTTTTCATTGTATGCCACAGCCAAATGTTGTGTCTTGGAACAGCATGATTG
CTGGCTTTGCAGACAATGGGAGTCAGCGGGCGTTGGAATTTGTGTCCTTGATGCACAGAAAGGGAATTAAGCTTGATGATTTCACATTTCCATGTGCTTTAAAGATCAGT
GCGCTTCATGGGTTATTAGTCATCGGGAAACAAATTCATTCCTATGTCACCAAATTGGGGCATGGATCTAGTTGTTTTACACTGTCTGCTCTGATTGATATGTATTCAAA
TTGCAATGGCCTGACCGAAGCAGTCAAGTTGTTTGACCAACACTCTTCCTTCAACCCTTCCATTTCTGAGAACCTGGCACTGTGGAACTCGATGCTCTCAGGATATGTTA
TCAACAACTGTGACCAAGCTGCTTTGAATTTGATTTCATACATCCATTGCTCGGGTACGATATTGGACTCTTACACCTTTGGTGGTGCTTTAAAGGTTTGCATCAACTTA
TTAAATCCGAGGGTTGGTTTTCAAGTACATGGTCTTATTGTCACTTGTGGTTATGAATTGGATTATGTTGTTGGAAGCATTGTTGTGGATCTCTATGCAAAACTAGGACG
CATCGACGATGCATTAGCACTGTTCCATAGGCTTCCAAGGAAAGATATCATAGCCTGGTCCGGTTTGATCCTGGGGTGTGCTCAAATGGGATTGAACTGGTTAGCTTTCT
CAGTGTTCAAAGATATGCTCGAGTTGGCTCATGAAATAGATCATTTTGTCATTTCAACCACTCTGAAAGTCTGCTCCAATTTAGCATCTCTTAGAAGTGGAAAGCAGGTC
CATGCATTCTGTGTCAAAAGTGGGTATGAAATGGAGGGGTTCACAATAACATCCCTTCTTGATATGTATTCAAAATGCGGTGAAATTGAGGATGCATTAACATTGTTTGA
TTGTATACAAGAAAAAGACATAGTAACTTGGACTGGGATCATTGTAGGATGTGGACAAAATGGAAGGGCGGCAGAAGCTGTCAGGTTTTTTCATGAGATGATTCAATCAG
GGCTAAATCCAAATGAAATCACCTTTCTAGGGGTGCTTTCTGCATGTCGATATGCTGGTTTGATCGAAGAGGCACGAAACATATTTAATTCCATGAAATCTGTATATGGA
CTAGAGCCTCATTTAGAGCATTATTGCTGCATGGTTGATCTTCTTGCTCTAGCTGGGCTACCTGAAGAAGCTGAAAAATTGATAGCAAATATGCCGTTTGAGCCAGATCA
GACCACATGGCGCACTTTGCTAGGGGCGTGTGGAACTCGTAACGATACCAAGCTTATTAACAGTGTTGCTAATGGCCTTCTTGAAGCTACACCAGATGACCCTTCAACGT
ACGTGTCACTTTCAAATGCTTATGCGTCGCTGGGGATGTGGCATAACCTAAGCAAAGCGAGGGAGGCTGCCAAAAAGGTGGGAGTCAAAAGAGCTGGGTTAAGCTGGATT
GAGTCATCCCATGCAATTGGTAAAGTGAACACACATGAAAAGGAGTGGCCGCTCTTGCTACCAAAGTTCAAAGAGACTCCACCTTTATGTCTATGA
Protein sequence
MINVWTGELDLKPKLEKQVQKAIRNSQNSVSSSCRFQTFLTLATELEMLCPIKTHLPPLVHATPPSYSYVRTCLQKDSIIPTFLLLMFTNCIAKELRHCAQVRAFRRGNS
LHAYLRKFGCLNDVFIANNLISMYAEFSNLRDAEKVFDEMSDRNIVTWTTLVSAFTDSGRPYEALRVYDDMPKSETANGYMYSAVLKACGLVGDLDRGKLIQERIYGGKL
QGDTILMNSLMDMYVKCGSLSDAVKVFHNISRATTTTWNIIISGYSKAGLMVEAEKLFHCMPQPNVVSWNSMIAGFADNGSQRALEFVSLMHRKGIKLDDFTFPCALKIS
ALHGLLVIGKQIHSYVTKLGHGSSCFTLSALIDMYSNCNGLTEAVKLFDQHSSFNPSISENLALWNSMLSGYVINNCDQAALNLISYIHCSGTILDSYTFGGALKVCINL
LNPRVGFQVHGLIVTCGYELDYVVGSIVVDLYAKLGRIDDALALFHRLPRKDIIAWSGLILGCAQMGLNWLAFSVFKDMLELAHEIDHFVISTTLKVCSNLASLRSGKQV
HAFCVKSGYEMEGFTITSLLDMYSKCGEIEDALTLFDCIQEKDIVTWTGIIVGCGQNGRAAEAVRFFHEMIQSGLNPNEITFLGVLSACRYAGLIEEARNIFNSMKSVYG
LEPHLEHYCCMVDLLALAGLPEEAEKLIANMPFEPDQTTWRTLLGACGTRNDTKLINSVANGLLEATPDDPSTYVSLSNAYASLGMWHNLSKAREAAKKVGVKRAGLSWI
ESSHAIGKVNTHEKEWPLLLPKFKETPPLCL
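
The CDS and protein sequences listed for CmoCh04G024500 should correspond under the standard genetic code. The following is a minimal Python sketch of that check, not part of the database record: the names `CODON_TABLE` and `translate` are hypothetical, and it assumes the standard (NCBI table 1) code, reading frame 1, and translation ending at the first stop codon.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted in.
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 27 nucleotides of the CDS listed above.
print(translate("ATGATCAATGTCTGGACTGGAGAGCTT"))  # prints MINVWTGEL
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the listed protein sequence gives a quick consistency check between the two records.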