; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006452 (gene) of Snake gourd v1 genome

Gene IDTan0006452
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionpentatricopeptide repeat-containing protein isoform X1
Genome locationLG01:102135487..102143049
RNA-Seq ExpressionTan0006452
SyntenyTan0006452
Gene Ontology termsGO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR000644 - CBS domain
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044781 - Pentatricopeptide repeat-containing protein At5g10690-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7024513.1 Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma]4.3e-22090.78Show/hide
Query:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV
        MRRANGLIARYGFLL EGGNLSISVYNLLMKGYISSGVPQAALA+YNEMLNLELKPD+LTYNTLISACVKINKLDAAM+FFEEMK+RA KY+QEDIFPDV
Subjt:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV

Query:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHR
        VTYTTLLKGFGILKDV +VHKIVLEMK+CHDLLIDRTAYTAMIDALV+CGSINGALSLFGELLKLSG  LDLRPKPHLYLT MR FSSRGDY MVKCLHR
Subjt:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHR

Query:  RMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMMPFKAVEP
        RMWLDSSG+ISPGFQEEADHLLMEAAL+DNQIDVA EKLST+IKRWKGISW+SRGGSVALRIEALLG TKSFFS PCIFPRVNP APIESVMMPFKAV+P
Subjt:  RMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMMPFKAVEP

Query:  LNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSLRAVGVFT
        LNG+LQLKEVVMRFF+KSVVPIIDDWGRCIGLLHREDC+EL+APLWKMMRSPPPGVTTTTSIGHV NLIL+KRYKM+IIVRHSKFSTYD SS RAVGVFT
Subjt:  LNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSLRAVGVFT

Query:  AEQLYGFISPIPMRPQPNISRKT
         EQLYGFISPIPM+ QPNI  KT
Subjt:  AEQLYGFISPIPMRPQPNISRKT

XP_008466304.1 PREDICTED: pentatricopeptide repeat-containing protein At5g10690 isoform X4 [Cucumis melo]2.1e-21990Show/hide
Query:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV
        MRRANGLIARYG+LLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMK+RADKYDQED+FPDV
Subjt:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV

Query:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSIN-------GALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYR
        VTYTTLLK FGILKDVH+VHKIVLEMKSCH L IDRTAYTAMIDALV+CGSIN       GALSLFGELLKLSG NL+LRPKPHLYLTLMRVFSSRGDYR
Subjt:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSIN-------GALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYR

Query:  MVKCLHRRMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMM
        MVKCLHRRMWLDSSGTIS G+QEEADHLLMEAALNDNQIDVAIEKLST+IK+WKGISW SRGGSVALRIEALLG TKSFFS PCIFPRVN  APIESVMM
Subjt:  MVKCLHRRMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMM

Query:  PFKAVEPLNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSL
        PFKAV+PLNGSL LKEVVMRFF+KSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTT  IGHVANLILQKRYKMV++VRHSKFS Y GSSL
Subjt:  PFKAVEPLNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSL

Query:  RAVGVFTAEQLYGFISPIPMRPQPNISRKT
        RA+GVFT EQLYGF+SPIPM  +PN+ RKT
Subjt:  RAVGVFTAEQLYGFISPIPMRPQPNISRKT

XP_008466305.1 PREDICTED: pentatricopeptide repeat-containing protein At5g10690 isoform X5 [Cucumis melo]1.7e-22191.49Show/hide
Query:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV
        MRRANGLIARYG+LLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMK+RADKYDQED+FPDV
Subjt:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV

Query:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHR
        VTYTTLLK FGILKDVH+VHKIVLEMKSCH L IDRTAYTAMIDALV+CGSINGALSLFGELLKLSG NL+LRPKPHLYLTLMRVFSSRGDYRMVKCLHR
Subjt:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHR

Query:  RMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMMPFKAVEP
        RMWLDSSGTIS G+QEEADHLLMEAALNDNQIDVAIEKLST+IK+WKGISW SRGGSVALRIEALLG TKSFFS PCIFPRVN  APIESVMMPFKAV+P
Subjt:  RMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMMPFKAVEP

Query:  LNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSLRAVGVFT
        LNGSL LKEVVMRFF+KSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTT  IGHVANLILQKRYKMV++VRHSKFS Y GSSLRA+GVFT
Subjt:  LNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSLRAVGVFT

Query:  AEQLYGFISPIPMRPQPNISRKT
         EQLYGF+SPIPM  +PN+ RKT
Subjt:  AEQLYGFISPIPMRPQPNISRKT

XP_022975759.1 pentatricopeptide repeat-containing protein At5g10690 isoform X1 [Cucurbita maxima]1.5e-22091.02Show/hide
Query:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV
        MRRANGLIARYGFLL EGGNLSISVYNLLMKGYISSGVPQAALA+YNEMLNLELKPD+LTYNTLISAC KINKLDAAM+FFEEMK+RA KY+QEDIFPDV
Subjt:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV

Query:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHR
        VTYTTLLKGFGILKDV +VHKIVLEMKSCHDLLIDRTAYTAMIDALV+CGSINGALSLFGELLKLSG NLDLRPKPHLYLT MR FSSRGDYRMVKCLHR
Subjt:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHR

Query:  RMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMMPFKAVEP
        RMWLDSSG+ISPGFQEEADHLLMEAAL+DNQIDVA EKLST+IKRWKGISW+SRGGSVALRIEALLG TKSFFS PCIFPRVNP APIESVMMPFKAV+P
Subjt:  RMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMMPFKAVEP

Query:  LNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSLRAVGVFT
        LNGSLQLKEVVMRFF+KSVVPIIDDWGRCIGLLHREDC+EL+APLW MMRSPPPGVTTTTSIGHV NLIL+KRYKM+IIVRHSKFSTYD SS RAVGVFT
Subjt:  LNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSLRAVGVFT

Query:  AEQLYGFISPIPMRPQPNISRKT
         EQLYGFISPIPM+ QPNI  +T
Subjt:  AEQLYGFISPIPMRPQPNISRKT

XP_023535821.1 pentatricopeptide repeat-containing protein At5g10690 isoform X1 [Cucurbita pepo subsp. pepo]1.5e-22091.02Show/hide
Query:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV
        MRRANGLIARYGFLL EGGNLSISVYNLLMKGYISSGVPQAALA+YNEMLNLELKPD+LTYNTLISACVKINKLDAAM+FFEEMK+RA KY+QEDIFPDV
Subjt:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV

Query:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHR
        VTYTTLLKGFGILKDV +VHKIVLEMKS HDLLIDRTAYTAMIDALV+CGSINGALSLFGELLKLSG NLDLRPKPHLYLT MR FSSRGDYRMVKCLHR
Subjt:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHR

Query:  RMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMMPFKAVEP
        RMWLDSSG+ISPGFQEEADHLLMEAAL+DNQIDVA EKLST+IKRWKGISW+SRGGSVALRIEALLG TKSFFS PCIFPRVNP APIESVMMPFKAV+P
Subjt:  RMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMMPFKAVEP

Query:  LNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSLRAVGVFT
        LNG+LQLKEVVMRFF+KSVVPIIDDWGRCIGLLHREDC+EL++PLWKMMRSPPPGVTTTTSIGHV NLIL+KRYKM+IIVRHSKFSTYD SS RAVGVFT
Subjt:  LNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSLRAVGVFT

Query:  AEQLYGFISPIPMRPQPNISRKT
         EQLYGFISPIPM+ QPNI  KT
Subjt:  AEQLYGFISPIPMRPQPNISRKT

TrEMBL top hitse value%identityAlignment
A0A1S3CQX2 pentatricopeptide repeat-containing protein At5g10690 isoform X41.0e-21990Show/hide
Query:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV
        MRRANGLIARYG+LLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMK+RADKYDQED+FPDV
Subjt:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV

Query:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSIN-------GALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYR
        VTYTTLLK FGILKDVH+VHKIVLEMKSCH L IDRTAYTAMIDALV+CGSIN       GALSLFGELLKLSG NL+LRPKPHLYLTLMRVFSSRGDYR
Subjt:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSIN-------GALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYR

Query:  MVKCLHRRMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMM
        MVKCLHRRMWLDSSGTIS G+QEEADHLLMEAALNDNQIDVAIEKLST+IK+WKGISW SRGGSVALRIEALLG TKSFFS PCIFPRVN  APIESVMM
Subjt:  MVKCLHRRMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMM

Query:  PFKAVEPLNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSL
        PFKAV+PLNGSL LKEVVMRFF+KSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTT  IGHVANLILQKRYKMV++VRHSKFS Y GSSL
Subjt:  PFKAVEPLNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSL

Query:  RAVGVFTAEQLYGFISPIPMRPQPNISRKT
        RA+GVFT EQLYGF+SPIPM  +PN+ RKT
Subjt:  RAVGVFTAEQLYGFISPIPMRPQPNISRKT

A0A1S3CQY3 pentatricopeptide repeat-containing protein At5g10690 isoform X58.5e-22291.49Show/hide
Query:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV
        MRRANGLIARYG+LLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMK+RADKYDQED+FPDV
Subjt:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV

Query:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHR
        VTYTTLLK FGILKDVH+VHKIVLEMKSCH L IDRTAYTAMIDALV+CGSINGALSLFGELLKLSG NL+LRPKPHLYLTLMRVFSSRGDYRMVKCLHR
Subjt:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHR

Query:  RMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMMPFKAVEP
        RMWLDSSGTIS G+QEEADHLLMEAALNDNQIDVAIEKLST+IK+WKGISW SRGGSVALRIEALLG TKSFFS PCIFPRVN  APIESVMMPFKAV+P
Subjt:  RMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMMPFKAVEP

Query:  LNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSLRAVGVFT
        LNGSL LKEVVMRFF+KSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTT  IGHVANLILQKRYKMV++VRHSKFS Y GSSLRA+GVFT
Subjt:  LNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSLRAVGVFT

Query:  AEQLYGFISPIPMRPQPNISRKT
         EQLYGF+SPIPM  +PN+ RKT
Subjt:  AEQLYGFISPIPMRPQPNISRKT

A0A1S3CS90 pentatricopeptide repeat-containing protein At5g10690 isoform X31.3e-21989.79Show/hide
Query:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV
        MRRANGLIARYG+LLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMK+RADKYDQED+FPDV
Subjt:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV

Query:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHR
        VTYTTLLK FGILKDVH+VHKIVLEMKSCH L IDRTAYTAMIDALV+CGSINGALSLFGELLKLSG NL+LRPKPHLYLTLMRVFSSRGDYRMVKCLHR
Subjt:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHR

Query:  RMWLDSSGTISPGFQEEADHLLMEAALNDNQ--------IDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVM
        RMWLDSSGTIS G+QEEADHLLMEAALNDNQ        IDVAIEKLST+IK+WKGISW SRGGSVALRIEALLG TKSFFS PCIFPRVN  APIESVM
Subjt:  RMWLDSSGTISPGFQEEADHLLMEAALNDNQ--------IDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVM

Query:  MPFKAVEPLNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSS
        MPFKAV+PLNGSL LKEVVMRFF+KSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTT  IGHVANLILQKRYKMV++VRHSKFS Y GSS
Subjt:  MPFKAVEPLNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSS

Query:  LRAVGVFTAEQLYGFISPIPMRPQPNISRKT
        LRA+GVFT EQLYGF+SPIPM  +PN+ RKT
Subjt:  LRAVGVFTAEQLYGFISPIPMRPQPNISRKT

A0A6J1CDP2 pentatricopeptide repeat-containing protein At5g10690 isoform X16.7e-21989.79Show/hide
Query:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV
        MRRANGLIARYGF+LREGGNLS SVYNLLMKGYISSGVPQAA+AMY+EMLNL L+PDRLTYNTLISACVKINKLDAAMHFFEEMK+RADKYDQEDIFPD 
Subjt:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV

Query:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHR
        VTYTTLLKGFGIL+D  VVH IVLEMKSCHD  IDRTAYTAMID+LV+CGSINGALSLFGELLKLSG N D RPKPHLYL+LMRV SSRGDYRMVKCLHR
Subjt:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHR

Query:  RMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMMPFKAVEP
        RMWLDSSGTI PGFQEEADHLLMEAALNDNQIDVAIEKLST+IK WKGISW SRGGSVALRIEALLGFTKSFFS PCIFPRVNP+APIESVMMPFKAVEP
Subjt:  RMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMMPFKAVEP

Query:  LNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSLRAVGVFT
        LNGS+QLKEVVM FF+KSVVPI+D+WGRC GLLHREDCTELDAPLWK+MRSPPP VTTTTSIGHVANLILQKRYKMVI+VRHSKFSTYDGSSLRAVGVFT
Subjt:  LNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSLRAVGVFT

Query:  AEQLYGFISPIPMRPQPNISR
        AE+LYGF+SP+P+ PQPN  R
Subjt:  AEQLYGFISPIPMRPQPNISR

A0A6J1ILI3 pentatricopeptide repeat-containing protein At5g10690 isoform X17.2e-22191.02Show/hide
Query:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV
        MRRANGLIARYGFLL EGGNLSISVYNLLMKGYISSGVPQAALA+YNEMLNLELKPD+LTYNTLISAC KINKLDAAM+FFEEMK+RA KY+QEDIFPDV
Subjt:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV

Query:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHR
        VTYTTLLKGFGILKDV +VHKIVLEMKSCHDLLIDRTAYTAMIDALV+CGSINGALSLFGELLKLSG NLDLRPKPHLYLT MR FSSRGDYRMVKCLHR
Subjt:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHR

Query:  RMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMMPFKAVEP
        RMWLDSSG+ISPGFQEEADHLLMEAAL+DNQIDVA EKLST+IKRWKGISW+SRGGSVALRIEALLG TKSFFS PCIFPRVNP APIESVMMPFKAV+P
Subjt:  RMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMMPFKAVEP

Query:  LNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSLRAVGVFT
        LNGSLQLKEVVMRFF+KSVVPIIDDWGRCIGLLHREDC+EL+APLW MMRSPPPGVTTTTSIGHV NLIL+KRYKM+IIVRHSKFSTYD SS RAVGVFT
Subjt:  LNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSLRAVGVFT

Query:  AEQLYGFISPIPMRPQPNISRKT
         EQLYGFISPIPM+ QPNI  +T
Subjt:  AEQLYGFISPIPMRPQPNISRKT

SwissProt top hitse value%identityAlignment
Q8VYD6 Pentatricopeptide repeat-containing protein At5g106905.1e-13158.02Show/hide
Query:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV
        +RRANGL+ARY  LL + G  S+ +YNLLMKGY++S  PQAA+ + +EML L L+PDRLTYNTLI AC+K   LDAAM FF +MK++A++Y  + + PDV
Subjt:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV

Query:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHR
        VTYTTL+KGFG   D+  + +I LEMK C ++ IDRTA+TA++DA++ CGS +GAL +FGE+LK SG N  LRPKPHLYL++MR F+ +GDY MV+ L+ 
Subjt:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHR

Query:  RMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMMPFKAVEP
        R+W DSSG+IS   Q+EAD+LLMEAALND Q+D A+  L ++++RWK I W + GG  A+R+E LLGF+KS   P  +  +V P+ PIES+M+ F+A  P
Subjt:  RMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMMPFKAVEP

Query:  LNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSLRAVGVFT
        L G+LQLK V MRFF + VVPI+DD G CIGLLHREDC  LDAPL  MMRSPP  V+TTTSIG V +L+L+K+ KMVI+V    FS   G S +AVG FT
Subjt:  LNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSLRAVGVFT

Query:  AEQLY
          QLY
Subjt:  AEQLY

Q9LFC5 Pentatricopeptide repeat-containing protein At5g011108.9e-1130.67Show/hide
Query:  LLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDVVTYTTLLKGFGIL
        +L++G  + +  YN ++ G     +   A  ++NEM    L PD  T   LI    K+  L  AM  F++MK++        I  DVVTY TLL GFG +
Subjt:  LLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDVVTYTTLLKGFGIL

Query:  KDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELL
         D+    +I  +M S  ++L    +Y+ +++AL S G +  A  ++ E++
Subjt:  KDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELL

Q9S7Q2 Pentatricopeptide repeat-containing protein At1g74850, chloroplastic1.4e-1130.77Show/hide
Query:  SISVYNLLMKGYISSGVP-QAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDVVTYTTLLKGFGILKDVHVVH
        SI  YN ++      G+  +  L ++ EM +  ++PD +TYNTL+SAC      D A   F  M D         I PD+ TY+ L++ FG L+ +  V 
Subjt:  SISVYNLLMKGYISSGVP-QAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDVVTYTTLLKGFGILKDVHVVH

Query:  KIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRM
         ++ EM S    L D T+Y  +++A    GSI  A+ +F + ++ +G      P  + Y  L+ +F   G Y  V+ L   M
Subjt:  KIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRM

Q9SXD8 Pentatricopeptide repeat-containing protein At1g625903.1e-1131.94Show/hide
Query:  NLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDVVTYTTLLKGFGILKDVHVV
        N ++  +N L+  ++  G    A  +Y++M+   + PD  TYN+L++     ++LD A   FE M  +       D FPDVVTY TL+KGF   K V   
Subjt:  NLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDVVTYTTLLKGFGILKDVHVV

Query:  HKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELL
         ++  EM S   L+ D   YT +I  L   G  + A  +F +++
Subjt:  HKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELL

Q9SZ52 Pentatricopeptide repeat-containing protein At4g31850, chloroplastic4.0e-1124.86Show/hide
Query:  LLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDVVTYTTLLKGFGIL
        +L  G   + ++YN+L+ G+  +G   AA A++  M+   ++PD  TY+ L+     + ++D  +H+F+E+K       +  + PDVV Y  ++ G G  
Subjt:  LLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDVVTYTTLLKGFGIL

Query:  KDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRG
          +     +  EMK+   +  D   Y ++I  L   G +  A  ++ E+ +       L P    +  L+R +S  G
Subjt:  KDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRG

Arabidopsis top hitse value%identityAlignment
AT1G62590.1 pentatricopeptide (PPR) repeat-containing protein2.2e-1231.94Show/hide
Query:  NLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDVVTYTTLLKGFGILKDVHVV
        N ++  +N L+  ++  G    A  +Y++M+   + PD  TYN+L++     ++LD A   FE M  +       D FPDVVTY TL+KGF   K V   
Subjt:  NLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDVVTYTTLLKGFGILKDVHVV

Query:  HKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELL
         ++  EM S   L+ D   YT +I  L   G  + A  +F +++
Subjt:  HKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELL

AT1G74850.1 plastid transcriptionally active 29.8e-1330.77Show/hide
Query:  SISVYNLLMKGYISSGVP-QAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDVVTYTTLLKGFGILKDVHVVH
        SI  YN ++      G+  +  L ++ EM +  ++PD +TYNTL+SAC      D A   F  M D         I PD+ TY+ L++ FG L+ +  V 
Subjt:  SISVYNLLMKGYISSGVP-QAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDVVTYTTLLKGFGILKDVHVVH

Query:  KIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRM
         ++ EM S    L D T+Y  +++A    GSI  A+ +F + ++ +G      P  + Y  L+ +F   G Y  V+ L   M
Subjt:  KIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRM

AT4G31850.1 proton gradient regulation 32.9e-1224.86Show/hide
Query:  LLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDVVTYTTLLKGFGIL
        +L  G   + ++YN+L+ G+  +G   AA A++  M+   ++PD  TY+ L+     + ++D  +H+F+E+K       +  + PDVV Y  ++ G G  
Subjt:  LLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDVVTYTTLLKGFGIL

Query:  KDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRG
          +     +  EMK+   +  D   Y ++I  L   G +  A  ++ E+ +       L P    +  L+R +S  G
Subjt:  KDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRG

AT5G01110.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.4e-1230.67Show/hide
Query:  LLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDVVTYTTLLKGFGIL
        +L++G  + +  YN ++ G     +   A  ++NEM    L PD  T   LI    K+  L  AM  F++MK++        I  DVVTY TLL GFG +
Subjt:  LLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDVVTYTTLLKGFGIL

Query:  KDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELL
         D+    +I  +M S  ++L    +Y+ +++AL S G +  A  ++ E++
Subjt:  KDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELL

AT5G10690.1 pentatricopeptide (PPR) repeat-containing protein / CBS domain-containing protein3.6e-13258.02Show/hide
Query:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV
        +RRANGL+ARY  LL + G  S+ +YNLLMKGY++S  PQAA+ + +EML L L+PDRLTYNTLI AC+K   LDAAM FF +MK++A++Y  + + PDV
Subjt:  MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDV

Query:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHR
        VTYTTL+KGFG   D+  + +I LEMK C ++ IDRTA+TA++DA++ CGS +GAL +FGE+LK SG N  LRPKPHLYL++MR F+ +GDY MV+ L+ 
Subjt:  VTYTTLLKGFGILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHR

Query:  RMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMMPFKAVEP
        R+W DSSG+IS   Q+EAD+LLMEAALND Q+D A+  L ++++RWK I W + GG  A+R+E LLGF+KS   P  +  +V P+ PIES+M+ F+A  P
Subjt:  RMWLDSSGTISPGFQEEADHLLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMMPFKAVEP

Query:  LNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSLRAVGVFT
        L G+LQLK V MRFF + VVPI+DD G CIGLLHREDC  LDAPL  MMRSPP  V+TTTSIG V +L+L+K+ KMVI+V    FS   G S +AVG FT
Subjt:  LNGSLQLKEVVMRFFNKSVVPIIDDWGRCIGLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSLRAVGVFT

Query:  AEQLY
          QLY
Subjt:  AEQLY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGCCGTGCCAACGGTCTAATTGCACGATATGGGTTCTTACTTCGGGAAGGAGGCAATCTCTCTATATCAGTTTACAACTTATTGATGAAGGGGTACATAAGCTCAGG
TGTTCCTCAAGCTGCTTTAGCTATGTACAATGAGATGCTAAATCTAGAGTTGAAACCTGATAGGCTCACATATAATACATTAATCTCTGCTTGTGTGAAGATTAACAAAC
TGGACGCAGCAATGCATTTCTTTGAGGAAATGAAGGATCGAGCTGATAAGTATGATCAGGAAGATATTTTTCCTGATGTTGTGACATACACTACTTTACTTAAGGGTTTT
GGGATTCTGAAAGATGTCCATGTAGTTCACAAGATTGTGCTGGAAATGAAATCTTGTCATGATTTATTGATTGATCGAACAGCATACACTGCAATGATTGATGCTTTGGT
TAGCTGTGGCTCTATAAACGGTGCTCTTTCTTTATTTGGGGAGTTATTGAAGCTTTCCGGATTGAATTTGGACTTACGGCCAAAGCCACATCTCTATCTCACTCTTATGA
GAGTTTTTTCTAGTAGAGGAGATTATAGGATGGTCAAATGTTTGCATAGACGCATGTGGCTGGACTCCTCTGGAACTATTTCTCCTGGATTTCAAGAAGAAGCAGATCAT
CTTCTCATGGAGGCAGCTTTGAATGACAATCAGATTGACGTGGCAATAGAGAAACTATCAACAGTTATTAAGAGATGGAAGGGAATCTCATGGGCTAGTCGAGGAGGCAG
TGTTGCCCTGCGTATAGAAGCATTGCTGGGATTCACCAAATCTTTCTTCAGTCCTCCTTGCATCTTTCCTCGGGTAAATCCGGCTGCACCTATTGAGAGTGTCATGATGC
CATTTAAAGCAGTTGAGCCCTTAAATGGAAGCTTACAGTTGAAGGAAGTGGTTATGCGTTTCTTTAACAAATCAGTTGTGCCTATCATAGACGACTGGGGTAGATGCATT
GGACTACTGCACCGTGAAGATTGTACTGAGTTGGATGCTCCCCTTTGGAAAATGATGAGAAGCCCTCCTCCTGGTGTAACAACTACAACATCCATTGGCCATGTAGCGAA
TCTGATTCTACAAAAGAGGTACAAAATGGTTATTATTGTAAGACATAGCAAGTTTAGTACTTATGATGGCTCGAGTTTGAGGGCTGTCGGCGTTTTTACTGCTGAGCAAT
TGTATGGCTTTATTTCTCCCATTCCCATGCGGCCTCAGCCCAACATCTCACGTAAGACGTAA
mRNA sequenceShow/hide mRNA sequence
GTCTCTCCATTGCTCCCATGGCTGATGGTTAGCTTCTCTCTGTATCTTATCGGCACTCGCAACCACCGCCGGAGAAGATGCTACGGATCGCTTCATTTTTCTCTCCTCCA
TTGGCTTTATATCCATATGACTCCACGAATGTATCTTCTTCTTGTCCACTTCCGTCTCGCACGGCTACCGGACTTCGTCGGAAAGACTCTCCTCGGAGTCCCAATCTCAA
GCGGCTAACCTCTCGTGTCGTGAGACTCACTCGTCGGAAGCAGCTCCACCAGGTATTTGAGGAAATTGAAATTGCCAAGAGACGTTATGGAAAGCTGAATACAATTGTTA
TGAATGCGGTCCTGGAAGCTTGTGTTCACTGCGGTGATATTGATTTAGCTCTGAGGACTTTTAATGAAATGTCAAAACCAGATAATTGTGGTGTAGACAATGTCAGTTAT
GGTACACTATTAAAGGTACTTTATTATATTTGAACTACTATGAGTTTTATTTTTCTTGGTGCATTGTTGTTAACTTGTTGGGTTATGTGGGAAATCCATAATTGTGACAG
TTATATGGGAGAGTCGTGATTAGTTCTTGTTCCAGTTAAGTCTTGTTCTTTTCTCTGCCAGGGTTTGGGAGAAGCTAGAAAAGTTGACGAAGCATTTCAATTACTTGAAT
CTGTGGAAGAAGGTACTGCTATTGGAGGTCCATCATTGTCAGCACCACTTATTTATGGTCTTCTAAATGCTTTAATTGAAGCAGTGCAGGAGACATGCGCCGTGCCAACG
GTCTAATTGCACGATATGGGTTCTTACTTCGGGAAGGAGGCAATCTCTCTATATCAGTTTACAACTTATTGATGAAGGGGTACATAAGCTCAGGTGTTCCTCAAGCTGCT
TTAGCTATGTACAATGAGATGCTAAATCTAGAGTTGAAACCTGATAGGCTCACATATAATACATTAATCTCTGCTTGTGTGAAGATTAACAAACTGGACGCAGCAATGCA
TTTCTTTGAGGAAATGAAGGATCGAGCTGATAAGTATGATCAGGAAGATATTTTTCCTGATGTTGTGACATACACTACTTTACTTAAGGGTTTTGGGATTCTGAAAGATG
TCCATGTAGTTCACAAGATTGTGCTGGAAATGAAATCTTGTCATGATTTATTGATTGATCGAACAGCATACACTGCAATGATTGATGCTTTGGTTAGCTGTGGCTCTATA
AACGGTGCTCTTTCTTTATTTGGGGAGTTATTGAAGCTTTCCGGATTGAATTTGGACTTACGGCCAAAGCCACATCTCTATCTCACTCTTATGAGAGTTTTTTCTAGTAG
AGGAGATTATAGGATGGTCAAATGTTTGCATAGACGCATGTGGCTGGACTCCTCTGGAACTATTTCTCCTGGATTTCAAGAAGAAGCAGATCATCTTCTCATGGAGGCAG
CTTTGAATGACAATCAGATTGACGTGGCAATAGAGAAACTATCAACAGTTATTAAGAGATGGAAGGGAATCTCATGGGCTAGTCGAGGAGGCAGTGTTGCCCTGCGTATA
GAAGCATTGCTGGGATTCACCAAATCTTTCTTCAGTCCTCCTTGCATCTTTCCTCGGGTAAATCCGGCTGCACCTATTGAGAGTGTCATGATGCCATTTAAAGCAGTTGA
GCCCTTAAATGGAAGCTTACAGTTGAAGGAAGTGGTTATGCGTTTCTTTAACAAATCAGTTGTGCCTATCATAGACGACTGGGGTAGATGCATTGGACTACTGCACCGTG
AAGATTGTACTGAGTTGGATGCTCCCCTTTGGAAAATGATGAGAAGCCCTCCTCCTGGTGTAACAACTACAACATCCATTGGCCATGTAGCGAATCTGATTCTACAAAAG
AGGTACAAAATGGTTATTATTGTAAGACATAGCAAGTTTAGTACTTATGATGGCTCGAGTTTGAGGGCTGTCGGCGTTTTTACTGCTGAGCAATTGTATGGCTTTATTTC
TCCCATTCCCATGCGGCCTCAGCCCAACATCTCACGTAAGACGTAACAAATAACACATCCTTTTATTTTATTATTATTTTATTTAGTTTTTCAGCAAA
Protein sequenceShow/hide protein sequence
MRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKDRADKYDQEDIFPDVVTYTTLLKGF
GILKDVHVVHKIVLEMKSCHDLLIDRTAYTAMIDALVSCGSINGALSLFGELLKLSGLNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEEADH
LLMEAALNDNQIDVAIEKLSTVIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPAAPIESVMMPFKAVEPLNGSLQLKEVVMRFFNKSVVPIIDDWGRCI
GLLHREDCTELDAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIIVRHSKFSTYDGSSLRAVGVFTAEQLYGFISPIPMRPQPNISRKT