; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0037321 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0037321
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr2:5130186..5138270
RNA-Seq ExpressionLag0037321
SyntenyLag0037321
Gene Ontology termsGO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR000644 - CBS domain
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044781 - Pentatricopeptide repeat-containing protein At5g10690-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008466303.1 PREDICTED: pentatricopeptide repeat-containing protein At5g10690 isoform X3 [Cucumis melo]4.6e-30689.17Show/hide
Query:  MLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASFSSR LALY S+S NV SC +PSRTAT RR +DSPRSPNLKRLTSRVVRLTRRK+LHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CG+DNVSYGTLLKGLG ARK+DEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSC
        MKGYIS+GVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADK+DQED+FPDVVTYTTLLK FGILK+VH+VH IVLEMKSC
Subjt:  MKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSC

Query:  HDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAALND
        H LSIDRTAYTAMIDAL+NCGSI GALSLFGELLKLSGW+L+LRPKPHLYLTLMRVFSSRGDYRMV+CLHRRMWLDSSGTIS  +QEEADHLLMEAALND
Subjt:  HDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAALND

Query:  NQ--------IDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVP
        NQ        IDVA EKLSTIIK+WKGISW SRGGSVALRIEALLG TKSFFS PCIFPRVN GAPIE+VMMPFKAVQPLNGSL LKEVVMRFFD+SVVP
Subjt:  NQ--------IDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVP

Query:  IIDDWGRCIGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIPR
        IIDDWGRCIGLLHREDCTEL+APLWKMMRSPPPGVTTT  IGHVANLILQKRYKMV+VVRHSKFS Y  SSLRA+GVFT EQLYGF+SPIP+   PN+PR
Subjt:  IIDDWGRCIGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIPR

XP_008466304.1 PREDICTED: pentatricopeptide repeat-containing protein At5g10690 isoform X4 [Cucumis melo]3.5e-30689.32Show/hide
Query:  MLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASFSSR LALY S+S NV SC +PSRTAT RR +DSPRSPNLKRLTSRVVRLTRRK+LHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CG+DNVSYGTLLKGLG ARK+DEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSC
        MKGYIS+GVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADK+DQED+FPDVVTYTTLLK FGILK+VH+VH IVLEMKSC
Subjt:  MKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSC

Query:  HDLSIDRTAYTAMIDALINCGSIK-------GALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLL
        H LSIDRTAYTAMIDAL+NCGSI        GALSLFGELLKLSGW+L+LRPKPHLYLTLMRVFSSRGDYRMV+CLHRRMWLDSSGTIS  +QEEADHLL
Subjt:  HDLSIDRTAYTAMIDALINCGSIK-------GALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLL

Query:  MEAALNDNQIDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPI
        MEAALNDNQIDVA EKLSTIIK+WKGISW SRGGSVALRIEALLG TKSFFS PCIFPRVN GAPIE+VMMPFKAVQPLNGSL LKEVVMRFFD+SVVPI
Subjt:  MEAALNDNQIDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPI

Query:  IDDWGRCIGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIPR
        IDDWGRCIGLLHREDCTEL+APLWKMMRSPPPGVTTT  IGHVANLILQKRYKMV+VVRHSKFS Y  SSLRA+GVFT EQLYGF+SPIP+   PN+PR
Subjt:  IDDWGRCIGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIPR

XP_008466305.1 PREDICTED: pentatricopeptide repeat-containing protein At5g10690 isoform X5 [Cucumis melo]2.9e-30890.37Show/hide
Query:  MLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASFSSR LALY S+S NV SC +PSRTAT RR +DSPRSPNLKRLTSRVVRLTRRK+LHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CG+DNVSYGTLLKGLG ARK+DEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSC
        MKGYIS+GVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADK+DQED+FPDVVTYTTLLK FGILK+VH+VH IVLEMKSC
Subjt:  MKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSC

Query:  HDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAALND
        H LSIDRTAYTAMIDAL+NCGSI GALSLFGELLKLSGW+L+LRPKPHLYLTLMRVFSSRGDYRMV+CLHRRMWLDSSGTIS  +QEEADHLLMEAALND
Subjt:  HDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAALND

Query:  NQIDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPIIDDWGRC
        NQIDVA EKLSTIIK+WKGISW SRGGSVALRIEALLG TKSFFS PCIFPRVN GAPIE+VMMPFKAVQPLNGSL LKEVVMRFFD+SVVPIIDDWGRC
Subjt:  NQIDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPIIDDWGRC

Query:  IGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIPR
        IGLLHREDCTEL+APLWKMMRSPPPGVTTT  IGHVANLILQKRYKMV+VVRHSKFS Y  SSLRA+GVFT EQLYGF+SPIP+   PN+PR
Subjt:  IGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIPR

XP_022936928.1 pentatricopeptide repeat-containing protein At5g10690 isoform X1 [Cucurbita moschata]3.5e-30689.68Show/hide
Query:  MLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML I SFSSR LA  PSDSMNV SC +PSRTA  RRR DSPRSPNLKRLTSRVVRLTRRKQLHQ+FEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPDNCG+DNVSYGTLLKGLG ARK+DEAFQLLESVEEGTAIG PTLSAPLIYG+LNAL EAGDMRRANGLIARYGFLL EGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSC
        MKGYIS+GVPQAALA+YNEMLNLELKPD+LTYNTLISACVKINKLDAAM+FFEEMKERA K++QEDIFPDVVTYTTLLKGFGILK+V +VH IVLEMK+C
Subjt:  MKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSC

Query:  HDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAALND
        HDL IDRTAYTAMIDAL+NCGSI GALSLFGELLKLSGW LDLRPKPHLYLT MR FSSRGDY MV+CLHRRMWLDSSG+ISP FQEEADHLLMEAAL+D
Subjt:  HDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAALND

Query:  NQIDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPIIDDWGRC
        NQIDVATEKLSTIIKRWKGISW+SRGGSVALRIEALLG TKSFFS PCIFPRVNPGAPIE+VMMPFKAVQPLNG+LQLKEVVMRFFD+SVVPIIDDWGRC
Subjt:  NQIDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPIIDDWGRC

Query:  IGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIP
        IGLLHREDC+ELEAPLWKMMRSPPPGVTTTTSIGHV NLIL+KRYKM+I+VR+SKFSTYDSSS RAVGVFT EQLYGFISP P++  PNIP
Subjt:  IGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIP

XP_023535821.1 pentatricopeptide repeat-containing protein At5g10690 isoform X1 [Cucurbita pepo subsp. pepo]2.4e-30790.02Show/hide
Query:  MLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASFSSR LA  PSDSMNV SC +PSRTA  RRR DSPRSPNLKRLTSRVVRLTRRKQLHQ+FEEIEIAK+RYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPDNCG+DNVSYGTLLKGLG ARK+DEAFQLLESVEEGTAIG PTLSAPLIYG+LNAL EAGDMRRANGLIARYGFLL EGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSC
        MKGYIS+GVPQAALA+YNEMLNLELKPD+LTYNTLISACVKINKLDAAM+FFEEMKERA K++QEDIFPDVVTYTTLLKGFGILK+V +VH IVLEMKS 
Subjt:  MKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSC

Query:  HDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAALND
        HDL IDRTAYTAMIDAL+NCGSI GALSLFGELLKLSGW+LDLRPKPHLYLT MR FSSRGDYRMV+CLHRRMWLDSSG+ISP FQEEADHLLMEAAL+D
Subjt:  HDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAALND

Query:  NQIDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPIIDDWGRC
        NQIDVATEKLSTIIKRWKGISW+SRGGSVALRIEALLG TKSFFS PCIFPRVNPGAPIE+VMMPFKAVQPLNG+LQLKEVVMRFFD+SVVPIIDDWGRC
Subjt:  NQIDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPIIDDWGRC

Query:  IGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIP
        IGLLHREDC+ELE+PLWKMMRSPPPGVTTTTSIGHV NLIL+KRYKM+I+VRHSKFSTYDSSS RAVGVFT EQLYGFISPIP++  PNIP
Subjt:  IGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIP

TrEMBL top hitse value%identityAlignment
A0A1S3CQX2 pentatricopeptide repeat-containing protein At5g10690 isoform X41.7e-30689.32Show/hide
Query:  MLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASFSSR LALY S+S NV SC +PSRTAT RR +DSPRSPNLKRLTSRVVRLTRRK+LHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CG+DNVSYGTLLKGLG ARK+DEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSC
        MKGYIS+GVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADK+DQED+FPDVVTYTTLLK FGILK+VH+VH IVLEMKSC
Subjt:  MKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSC

Query:  HDLSIDRTAYTAMIDALINCGSIK-------GALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLL
        H LSIDRTAYTAMIDAL+NCGSI        GALSLFGELLKLSGW+L+LRPKPHLYLTLMRVFSSRGDYRMV+CLHRRMWLDSSGTIS  +QEEADHLL
Subjt:  HDLSIDRTAYTAMIDALINCGSIK-------GALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLL

Query:  MEAALNDNQIDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPI
        MEAALNDNQIDVA EKLSTIIK+WKGISW SRGGSVALRIEALLG TKSFFS PCIFPRVN GAPIE+VMMPFKAVQPLNGSL LKEVVMRFFD+SVVPI
Subjt:  MEAALNDNQIDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPI

Query:  IDDWGRCIGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIPR
        IDDWGRCIGLLHREDCTEL+APLWKMMRSPPPGVTTT  IGHVANLILQKRYKMV+VVRHSKFS Y  SSLRA+GVFT EQLYGF+SPIP+   PN+PR
Subjt:  IDDWGRCIGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIPR

A0A1S3CQY3 pentatricopeptide repeat-containing protein At5g10690 isoform X51.4e-30890.37Show/hide
Query:  MLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASFSSR LALY S+S NV SC +PSRTAT RR +DSPRSPNLKRLTSRVVRLTRRK+LHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CG+DNVSYGTLLKGLG ARK+DEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSC
        MKGYIS+GVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADK+DQED+FPDVVTYTTLLK FGILK+VH+VH IVLEMKSC
Subjt:  MKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSC

Query:  HDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAALND
        H LSIDRTAYTAMIDAL+NCGSI GALSLFGELLKLSGW+L+LRPKPHLYLTLMRVFSSRGDYRMV+CLHRRMWLDSSGTIS  +QEEADHLLMEAALND
Subjt:  HDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAALND

Query:  NQIDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPIIDDWGRC
        NQIDVA EKLSTIIK+WKGISW SRGGSVALRIEALLG TKSFFS PCIFPRVN GAPIE+VMMPFKAVQPLNGSL LKEVVMRFFD+SVVPIIDDWGRC
Subjt:  NQIDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPIIDDWGRC

Query:  IGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIPR
        IGLLHREDCTEL+APLWKMMRSPPPGVTTT  IGHVANLILQKRYKMV+VVRHSKFS Y  SSLRA+GVFT EQLYGF+SPIP+   PN+PR
Subjt:  IGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIPR

A0A1S3CS90 pentatricopeptide repeat-containing protein At5g10690 isoform X32.2e-30689.17Show/hide
Query:  MLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASFSSR LALY S+S NV SC +PSRTAT RR +DSPRSPNLKRLTSRVVRLTRRK+LHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CG+DNVSYGTLLKGLG ARK+DEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSC
        MKGYIS+GVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADK+DQED+FPDVVTYTTLLK FGILK+VH+VH IVLEMKSC
Subjt:  MKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSC

Query:  HDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAALND
        H LSIDRTAYTAMIDAL+NCGSI GALSLFGELLKLSGW+L+LRPKPHLYLTLMRVFSSRGDYRMV+CLHRRMWLDSSGTIS  +QEEADHLLMEAALND
Subjt:  HDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAALND

Query:  NQ--------IDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVP
        NQ        IDVA EKLSTIIK+WKGISW SRGGSVALRIEALLG TKSFFS PCIFPRVN GAPIE+VMMPFKAVQPLNGSL LKEVVMRFFD+SVVP
Subjt:  NQ--------IDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVP

Query:  IIDDWGRCIGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIPR
        IIDDWGRCIGLLHREDCTEL+APLWKMMRSPPPGVTTT  IGHVANLILQKRYKMV+VVRHSKFS Y  SSLRA+GVFT EQLYGF+SPIP+   PN+PR
Subjt:  IIDDWGRCIGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIPR

A0A6J1FEL4 pentatricopeptide repeat-containing protein At5g10690 isoform X11.7e-30689.68Show/hide
Query:  MLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML I SFSSR LA  PSDSMNV SC +PSRTA  RRR DSPRSPNLKRLTSRVVRLTRRKQLHQ+FEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPDNCG+DNVSYGTLLKGLG ARK+DEAFQLLESVEEGTAIG PTLSAPLIYG+LNAL EAGDMRRANGLIARYGFLL EGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSC
        MKGYIS+GVPQAALA+YNEMLNLELKPD+LTYNTLISACVKINKLDAAM+FFEEMKERA K++QEDIFPDVVTYTTLLKGFGILK+V +VH IVLEMK+C
Subjt:  MKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSC

Query:  HDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAALND
        HDL IDRTAYTAMIDAL+NCGSI GALSLFGELLKLSGW LDLRPKPHLYLT MR FSSRGDY MV+CLHRRMWLDSSG+ISP FQEEADHLLMEAAL+D
Subjt:  HDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAALND

Query:  NQIDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPIIDDWGRC
        NQIDVATEKLSTIIKRWKGISW+SRGGSVALRIEALLG TKSFFS PCIFPRVNPGAPIE+VMMPFKAVQPLNG+LQLKEVVMRFFD+SVVPIIDDWGRC
Subjt:  NQIDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPIIDDWGRC

Query:  IGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIP
        IGLLHREDC+ELEAPLWKMMRSPPPGVTTTTSIGHV NLIL+KRYKM+I+VR+SKFSTYDSSS RAVGVFT EQLYGFISP P++  PNIP
Subjt:  IGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIP

A0A6J1ILI3 pentatricopeptide repeat-containing protein At5g10690 isoform X14.2e-30589.51Show/hide
Query:  MLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML I SFSSR L   PSDSMNV SC +PSRTA  RRR DSPRSPNLKRLTSRVVRLTRRKQLHQ+FEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        L TFNEMSKP NCG+DNVSYGTLLKGLG ARK+DEAF LLESVEEGTAIG PTLSAPLIYG+LNAL EAGDMRRANGLIARYGFLL EGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSC
        MKGYIS+GVPQAALA+YNEMLNLELKPD+LTYNTLISAC KINKLDAAM+FFEEMKERA K++QEDIFPDVVTYTTLLKGFGILK+V +VH IVLEMKSC
Subjt:  MKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSC

Query:  HDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAALND
        HDL IDRTAYTAMIDAL+NCGSI GALSLFGELLKLSGW+LDLRPKPHLYLT MR FSSRGDYRMV+CLHRRMWLDSSG+ISP FQEEADHLLMEAAL+D
Subjt:  HDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAALND

Query:  NQIDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPIIDDWGRC
        NQIDVATEKLSTIIKRWKGISW+SRGGSVALRIEALLG TKSFFS PCIFPRVNPGAPIE+VMMPFKAVQPLNGSLQLKEVVMRFFD+SVVPIIDDWGRC
Subjt:  NQIDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPIIDDWGRC

Query:  IGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIP
        IGLLHREDC+ELEAPLW MMRSPPPGVTTTTSIGHV NLIL+KRYKM+I+VRHSKFSTYDSSS RAVGVFT EQLYGFISPIP++  PNIP
Subjt:  IGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIP

SwissProt top hitse value%identityAlignment
Q8VYD6 Pentatricopeptide repeat-containing protein At5g106901.4e-18358.13Show/hide
Query:  MLRIASFSS--RPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDID
        M RI++ S+   PL L PS S       +P+R    RR     R  NLK LTSR+V LTRR+QL Q+ EE+E AK+RYG+LNTIVMN+VLEACVHCG+ID
Subjt:  MLRIASFSS--RPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDID

Query:  LALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYN
        LALR F+EM++P   GVD++SY T+LKGLG AR++DEAFQ+LE++E GTA G P LS+ LIYGLL+ALI AGD+RRANGL+ARY  LL + G  S+ +YN
Subjt:  LALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYN

Query:  LLMKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMK
        LLMKGY+++  PQAA+ + +EML L L+PDRLTYNTLI AC+K   LDAAM FF +MKE+A+++  + + PDVVTYTTL+KGFG   ++  +  I LEMK
Subjt:  LLMKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMK

Query:  SCHDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAAL
         C ++ IDRTA+TA++DA++ CGS  GAL +FGE+LK SG +  LRPKPHLYL++MR F+ +GDY MV  L+ R+W DSSG+IS   Q+EAD+LLMEAAL
Subjt:  SCHDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAAL

Query:  NDNQIDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPIIDDWG
        ND Q+D A   L +I++RWK I W + GG  A+R+E LLGF+KS   P  +  +V P  PIE++M+ F+A +PL G+LQLK V MRFF   VVPI+DD G
Subjt:  NDNQIDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPIIDDWG

Query:  RCIGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLY
         CIGLLHREDC  L+APL  MMRSPP  V+TTTSIG V +L+L+K+ KMVIVV    FS    SS +AVG FT  QLY
Subjt:  RCIGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLY

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397105.6e-2029.17Show/hide
Query:  NTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLI
        N    N ++      G+ID+AL  F++M +   C  + V+Y TL+ G    RK+D+ F+LL S+    A+ G   +      ++N L   G M+  + ++
Subjt:  NTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLI

Query:  ARYGFLLREGGNLSISVYNLLMKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLK
             + R G +L    YN L+KGY   G    AL M+ EML   L P  +TY +LI +  K   ++ AM F ++M+ R        + P+  TYTTL+ 
Subjt:  ARYGFLLREGGNLSISVYNLLMKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLK

Query:  GFGILKNVHVVHNIVLEMKSCHDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDL
        GF     ++  + ++ EM   +  S     Y A+I+     G ++ A+++  E +K  G S D+
Subjt:  GFGILKNVHVVHNIVLEMKSCHDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDL

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic5.8e-1727.95Show/hide
Query:  VLEACVHCGDIDLALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLE--SVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGF
        V++  +  GD+D ALR   +M +   C   NVS   ++ G     +V++A   ++  S ++G      T +      L+N L +AG ++ A   I     
Subjt:  VLEACVHCGDIDLALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLE--SVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGF

Query:  LLREGGNLSISVYNLLMKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGIL
        +L+EG +  +  YN ++ G    G  + A+ + ++M+  +  P+ +TYNTLIS   K N++       EE  E A     + I PDV T+ +L++G  + 
Subjt:  LLREGGNLSISVYNLLMKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGIL

Query:  KNVHVVHNIVLEMKSCHDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSG
        +N  V   +  EM+S      D   Y  +ID+L + G +  AL++  + ++LSG
Subjt:  KNVHVVHNIVLEMKSCHDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSG

Q9SXD8 Pentatricopeptide repeat-containing protein At1g625901.5e-1728.91Show/hide
Query:  NTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVE-EGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGL
        N +    V+      GD DLAL   N+M +      D V + T++  L   R VD+A  L + +E +G      T S+     L++ L   G    A+ L
Subjt:  NTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVE-EGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGL

Query:  IARYGFLLREGGNLSISVYNLLMKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLL
        ++    ++ +  N ++  +N L+  ++  G    A  +Y++M+   + PD  TYN+L++     ++LD A   FE M  +       D FPDVVTY TL+
Subjt:  IARYGFLLREGGNLSISVYNLLMKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLL

Query:  KGFGILKNVHVVHNIVLEMKSCHDLSIDRTAYTAMIDALINCGSIKGALSLFGELL
        KGF   K V     +  EM S   L  D   YT +I  L + G    A  +F +++
Subjt:  KGFGILKNVHVVHNIVLEMKSCHDLSIDRTAYTAMIDALINCGSIKGALSLFGELL

Q9SZ52 Pentatricopeptide repeat-containing protein At4g31850, chloroplastic8.9e-1825.54Show/hide
Query:  NAVLEACVHCGDIDLALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYG-LLNALIEAGDMRRANGLIARYG
        N +L+A    G ID     + EMS  + C  + +++  ++ GL  A  VD+A  L   +        PT      YG L++ L ++G +  A  L   + 
Subjt:  NAVLEACVHCGDIDLALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYG-LLNALIEAGDMRRANGLIARYG

Query:  FLLREGGNLSISVYNLLMKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGI
         +L  G   + ++YN+L+ G+   G   AA A++  M+   ++PD  TY+ L+     + ++D  +H+F+E+KE         + PDVV Y  ++ G G 
Subjt:  FLLREGGNLSISVYNLLMKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGI

Query:  LKNVHVVHNIVLEMKSCHDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRG
           +     +  EMK+   ++ D   Y ++I  L   G ++ A  ++ E+ +       L P    +  L+R +S  G
Subjt:  LKNVHVVHNIVLEMKSCHDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRG

Arabidopsis top hitse value%identityAlignment
AT1G62590.1 pentatricopeptide (PPR) repeat-containing protein1.1e-1828.91Show/hide
Query:  NTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVE-EGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGL
        N +    V+      GD DLAL   N+M +      D V + T++  L   R VD+A  L + +E +G      T S+     L++ L   G    A+ L
Subjt:  NTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVE-EGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGL

Query:  IARYGFLLREGGNLSISVYNLLMKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLL
        ++    ++ +  N ++  +N L+  ++  G    A  +Y++M+   + PD  TYN+L++     ++LD A   FE M  +       D FPDVVTY TL+
Subjt:  IARYGFLLREGGNLSISVYNLLMKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLL

Query:  KGFGILKNVHVVHNIVLEMKSCHDLSIDRTAYTAMIDALINCGSIKGALSLFGELL
        KGF   K V     +  EM S   L  D   YT +I  L + G    A  +F +++
Subjt:  KGFGILKNVHVVHNIVLEMKSCHDLSIDRTAYTAMIDALINCGSIKGALSLFGELL

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein4.1e-1827.95Show/hide
Query:  VLEACVHCGDIDLALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLE--SVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGF
        V++  +  GD+D ALR   +M +   C   NVS   ++ G     +V++A   ++  S ++G      T +      L+N L +AG ++ A   I     
Subjt:  VLEACVHCGDIDLALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLE--SVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGF

Query:  LLREGGNLSISVYNLLMKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGIL
        +L+EG +  +  YN ++ G    G  + A+ + ++M+  +  P+ +TYNTLIS   K N++       EE  E A     + I PDV T+ +L++G  + 
Subjt:  LLREGGNLSISVYNLLMKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGIL

Query:  KNVHVVHNIVLEMKSCHDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSG
        +N  V   +  EM+S      D   Y  +ID+L + G +  AL++  + ++LSG
Subjt:  KNVHVVHNIVLEMKSCHDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSG

AT4G31850.1 proton gradient regulation 36.3e-1925.54Show/hide
Query:  NAVLEACVHCGDIDLALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYG-LLNALIEAGDMRRANGLIARYG
        N +L+A    G ID     + EMS  + C  + +++  ++ GL  A  VD+A  L   +        PT      YG L++ L ++G +  A  L   + 
Subjt:  NAVLEACVHCGDIDLALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYG-LLNALIEAGDMRRANGLIARYG

Query:  FLLREGGNLSISVYNLLMKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGI
         +L  G   + ++YN+L+ G+   G   AA A++  M+   ++PD  TY+ L+     + ++D  +H+F+E+KE         + PDVV Y  ++ G G 
Subjt:  FLLREGGNLSISVYNLLMKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGI

Query:  LKNVHVVHNIVLEMKSCHDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRG
           +     +  EMK+   ++ D   Y ++I  L   G ++ A  ++ E+ +       L P    +  L+R +S  G
Subjt:  LKNVHVVHNIVLEMKSCHDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRG

AT5G10690.1 pentatricopeptide (PPR) repeat-containing protein / CBS domain-containing protein9.6e-18558.13Show/hide
Query:  MLRIASFSS--RPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDID
        M RI++ S+   PL L PS S       +P+R    RR     R  NLK LTSR+V LTRR+QL Q+ EE+E AK+RYG+LNTIVMN+VLEACVHCG+ID
Subjt:  MLRIASFSS--RPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDID

Query:  LALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYN
        LALR F+EM++P   GVD++SY T+LKGLG AR++DEAFQ+LE++E GTA G P LS+ LIYGLL+ALI AGD+RRANGL+ARY  LL + G  S+ +YN
Subjt:  LALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYN

Query:  LLMKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMK
        LLMKGY+++  PQAA+ + +EML L L+PDRLTYNTLI AC+K   LDAAM FF +MKE+A+++  + + PDVVTYTTL+KGFG   ++  +  I LEMK
Subjt:  LLMKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMK

Query:  SCHDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAAL
         C ++ IDRTA+TA++DA++ CGS  GAL +FGE+LK SG +  LRPKPHLYL++MR F+ +GDY MV  L+ R+W DSSG+IS   Q+EAD+LLMEAAL
Subjt:  SCHDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAAL

Query:  NDNQIDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPIIDDWG
        ND Q+D A   L +I++RWK I W + GG  A+R+E LLGF+KS   P  +  +V P  PIE++M+ F+A +PL G+LQLK V MRFF   VVPI+DD G
Subjt:  NDNQIDVATEKLSTIIKRWKGISWASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPIIDDWG

Query:  RCIGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLY
         CIGLLHREDC  L+APL  MMRSPP  V+TTTSIG V +L+L+K+ KMVIVV    FS    SS +AVG FT  QLY
Subjt:  RCIGLLHREDCTELEAPLWKMMRSPPPGVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLY

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.0e-2129.17Show/hide
Query:  NTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLI
        N    N ++      G+ID+AL  F++M +   C  + V+Y TL+ G    RK+D+ F+LL S+    A+ G   +      ++N L   G M+  + ++
Subjt:  NTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLI

Query:  ARYGFLLREGGNLSISVYNLLMKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLK
             + R G +L    YN L+KGY   G    AL M+ EML   L P  +TY +LI +  K   ++ AM F ++M+ R        + P+  TYTTL+ 
Subjt:  ARYGFLLREGGNLSISVYNLLMKGYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLK

Query:  GFGILKNVHVVHNIVLEMKSCHDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDL
        GF     ++  + ++ EM   +  S     Y A+I+     G ++ A+++  E +K  G S D+
Subjt:  GFGILKNVHVVHNIVLEMKSCHDLSIDRTAYTAMIDALINCGSIKGALSLFGELLKLSGWSLDL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTAGCTTTCTCTCTGTATCTTATCGGCTCTCACAACCACCGCCGGAGAAGATGCTGCGGATCGCCTCATTCTCCTCTCGTCCACTGGCATTGTATCCATCCGACTC
CATGAACGTACTTTCTTGTCCGATTCCTTCTCGCACGGCCACCGGCCGGCGACGGAAAGACTCTCCTCGGAGTCCCAATCTCAAGCGGCTGACCTCTCGTGTTGTGAGAC
TCACTCGCCGGAAGCAGCTCCACCAGGTCTTTGAGGAAATTGAAATTGCCAAGAGACGTTATGGAAAGCTGAATACAATTGTTATGAATGCGGTCCTGGAAGCTTGTGTT
CACTGCGGTGATATTGATTTAGCTCTGAGGACTTTTAATGAAATGTCAAAGCCAGATAATTGTGGCGTAGACAATGTTAGTTATGGCACGTTATTAAAGGGTTTAGGTGT
AGCTCGAAAAGTAGATGAAGCATTTCAATTACTTGAATCTGTGGAAGAAGGCACTGCTATTGGAGGTCCAACATTGTCAGCACCACTTATTTATGGTCTTCTAAATGCTT
TAATTGAAGCAGGAGACATGCGCCGTGCCAATGGTCTAATTGCACGATATGGGTTCTTACTTCGGGAAGGAGGCAATCTCTCTATATCAGTTTACAATTTATTGATGAAG
GGGTACATAAGCACGGGTGTTCCTCAAGCTGCCTTAGCTATGTACAATGAAATGCTAAATCTGGAGTTAAAACCTGATAGGCTCACTTATAATACGTTAATCTCTGCTTG
TGTGAAGATTAACAAACTGGACGCAGCAATGCATTTCTTTGAGGAAATGAAGGAACGAGCTGATAAGTTTGATCAGGAAGATATTTTTCCCGATGTTGTGACGTACACTA
CTTTACTTAAGGGTTTTGGGATTCTGAAAAATGTCCACGTAGTTCACAACATAGTGCTGGAAATGAAGTCTTGTCACGATTTGTCGATTGATCGAACAGCATACACTGCA
ATGATTGATGCTTTGATTAACTGTGGCTCTATAAAAGGTGCTCTTTCTTTATTTGGGGAATTATTGAAGCTTTCTGGATGGAGTTTGGACTTGCGGCCAAAGCCACATCT
CTATCTCACTCTTATGAGGGTTTTTTCTAGTAGAGGAGATTATAGGATGGTCGAATGTTTGCATAGACGAATGTGGCTGGACTCCTCTGGAACCATTTCTCCTGAATTTC
AAGAAGAAGCAGATCATCTTCTCATGGAGGCAGCTTTAAATGACAATCAGATTGACGTGGCAACAGAGAAACTATCAACAATTATTAAGAGATGGAAGGGAATCTCATGG
GCTAGTCGAGGAGGCAGTGTTGCTCTGCGTATAGAAGCTTTGCTGGGATTCACCAAATCTTTCTTTAGTCCTCCTTGCATATTTCCTCGGGTAAATCCGGGTGCACCTAT
TGAGAATGTCATGATGCCATTTAAAGCAGTTCAACCCTTAAATGGCAGCTTACAGTTAAAGGAGGTGGTTATGCGTTTCTTTGACAGATCAGTTGTGCCTATCATAGACG
ACTGGGGTAGATGCATTGGACTTCTGCACCGTGAAGACTGTACTGAGTTGGAAGCTCCCCTTTGGAAAATGATGAGAAGCCCTCCTCCTGGCGTAACAACTACAACATCC
ATTGGGCATGTTGCGAATCTAATTCTACAAAAGAGGTACAAAATGGTTATTGTTGTAAGACATAGCAAGTTTAGCACATATGATAGCTCTAGTTTGAGGGCTGTTGGCGT
TTTTACTGCTGAGCAATTGTATGGCTTTATTTCTCCCATTCCCTTGCGGCCTCACCCCAACATCCCACGTTTAGGTCCACTTGCAGGCCACATTATTCTCTACCAAGAAA
TAATGTTGGTTGACCCACCAACTGAAACCCTCAAATTTAATGAGGCATTGAAACCAATCCAAGGGATCAAATACACATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTAGCTTTCTCTCTGTATCTTATCGGCTCTCACAACCACCGCCGGAGAAGATGCTGCGGATCGCCTCATTCTCCTCTCGTCCACTGGCATTGTATCCATCCGACTC
CATGAACGTACTTTCTTGTCCGATTCCTTCTCGCACGGCCACCGGCCGGCGACGGAAAGACTCTCCTCGGAGTCCCAATCTCAAGCGGCTGACCTCTCGTGTTGTGAGAC
TCACTCGCCGGAAGCAGCTCCACCAGGTCTTTGAGGAAATTGAAATTGCCAAGAGACGTTATGGAAAGCTGAATACAATTGTTATGAATGCGGTCCTGGAAGCTTGTGTT
CACTGCGGTGATATTGATTTAGCTCTGAGGACTTTTAATGAAATGTCAAAGCCAGATAATTGTGGCGTAGACAATGTTAGTTATGGCACGTTATTAAAGGGTTTAGGTGT
AGCTCGAAAAGTAGATGAAGCATTTCAATTACTTGAATCTGTGGAAGAAGGCACTGCTATTGGAGGTCCAACATTGTCAGCACCACTTATTTATGGTCTTCTAAATGCTT
TAATTGAAGCAGGAGACATGCGCCGTGCCAATGGTCTAATTGCACGATATGGGTTCTTACTTCGGGAAGGAGGCAATCTCTCTATATCAGTTTACAATTTATTGATGAAG
GGGTACATAAGCACGGGTGTTCCTCAAGCTGCCTTAGCTATGTACAATGAAATGCTAAATCTGGAGTTAAAACCTGATAGGCTCACTTATAATACGTTAATCTCTGCTTG
TGTGAAGATTAACAAACTGGACGCAGCAATGCATTTCTTTGAGGAAATGAAGGAACGAGCTGATAAGTTTGATCAGGAAGATATTTTTCCCGATGTTGTGACGTACACTA
CTTTACTTAAGGGTTTTGGGATTCTGAAAAATGTCCACGTAGTTCACAACATAGTGCTGGAAATGAAGTCTTGTCACGATTTGTCGATTGATCGAACAGCATACACTGCA
ATGATTGATGCTTTGATTAACTGTGGCTCTATAAAAGGTGCTCTTTCTTTATTTGGGGAATTATTGAAGCTTTCTGGATGGAGTTTGGACTTGCGGCCAAAGCCACATCT
CTATCTCACTCTTATGAGGGTTTTTTCTAGTAGAGGAGATTATAGGATGGTCGAATGTTTGCATAGACGAATGTGGCTGGACTCCTCTGGAACCATTTCTCCTGAATTTC
AAGAAGAAGCAGATCATCTTCTCATGGAGGCAGCTTTAAATGACAATCAGATTGACGTGGCAACAGAGAAACTATCAACAATTATTAAGAGATGGAAGGGAATCTCATGG
GCTAGTCGAGGAGGCAGTGTTGCTCTGCGTATAGAAGCTTTGCTGGGATTCACCAAATCTTTCTTTAGTCCTCCTTGCATATTTCCTCGGGTAAATCCGGGTGCACCTAT
TGAGAATGTCATGATGCCATTTAAAGCAGTTCAACCCTTAAATGGCAGCTTACAGTTAAAGGAGGTGGTTATGCGTTTCTTTGACAGATCAGTTGTGCCTATCATAGACG
ACTGGGGTAGATGCATTGGACTTCTGCACCGTGAAGACTGTACTGAGTTGGAAGCTCCCCTTTGGAAAATGATGAGAAGCCCTCCTCCTGGCGTAACAACTACAACATCC
ATTGGGCATGTTGCGAATCTAATTCTACAAAAGAGGTACAAAATGGTTATTGTTGTAAGACATAGCAAGTTTAGCACATATGATAGCTCTAGTTTGAGGGCTGTTGGCGT
TTTTACTGCTGAGCAATTGTATGGCTTTATTTCTCCCATTCCCTTGCGGCCTCACCCCAACATCCCACGTTTAGGTCCACTTGCAGGCCACATTATTCTCTACCAAGAAA
TAATGTTGGTTGACCCACCAACTGAAACCCTCAAATTTAATGAGGCATTGAAACCAATCCAAGGGATCAAATACACATGA
Protein sequenceShow/hide protein sequence
MVSFLSVSYRLSQPPPEKMLRIASFSSRPLALYPSDSMNVLSCPIPSRTATGRRRKDSPRSPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACV
HCGDIDLALRTFNEMSKPDNCGVDNVSYGTLLKGLGVARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLLMK
GYISTGVPQAALAMYNEMLNLELKPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKFDQEDIFPDVVTYTTLLKGFGILKNVHVVHNIVLEMKSCHDLSIDRTAYTA
MIDALINCGSIKGALSLFGELLKLSGWSLDLRPKPHLYLTLMRVFSSRGDYRMVECLHRRMWLDSSGTISPEFQEEADHLLMEAALNDNQIDVATEKLSTIIKRWKGISW
ASRGGSVALRIEALLGFTKSFFSPPCIFPRVNPGAPIENVMMPFKAVQPLNGSLQLKEVVMRFFDRSVVPIIDDWGRCIGLLHREDCTELEAPLWKMMRSPPPGVTTTTS
IGHVANLILQKRYKMVIVVRHSKFSTYDSSSLRAVGVFTAEQLYGFISPIPLRPHPNIPRLGPLAGHIILYQEIMLVDPPTETLKFNEALKPIQGIKYT