; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g0699 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g0699
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionpentatricopeptide repeat-containing protein At2g30100, chloroplastic
Genome locationMC02:5647105..5651723
RNA-Seq ExpressionMC02g0699
SyntenyMC02g0699
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7010495.1 Pentatricopeptide repeat-containing protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]0.089.82Show/hide
Query:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        MICAQGF+P+TQFGFSFSLSS LK++R  F S PQL   SPVNFCFM+S ITCNH+NSTFSV +AGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTR+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNVGDVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
        SMIEKVISLYW+MGEKE+AISFVKEVLGRK+ FMKD+ EGHKGGPSGYLAWKMMVDGDYRGAVK+VL+LRESGL PEVY YLIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR

Query:  KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
        KLKSY RDG+VAELDKDNV LV+ YQ+ELLADGVRLSNWVL+EG SS HGV HERLLAMYICAG+GLEAERQLWEMKLVGKEAD+DLYDIVLAICASQKE
Subjt:  KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE

Query:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL
        TRAMNRLLTRIEI SP LKKKSL+WLLRGYIKGGHF DAAETLVKMV+LGFLPEYLDRVAVLQGLRKRIREP +VETY  LCKCLSDANLIGP LVYLHL
Subjt:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL

Query:  QKHKLWVIKML
        QK+KLWVIKML
Subjt:  QKHKLWVIKML

XP_022148369.1 pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Momordica charantia]0.0100Show/hide
Query:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
        SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR

Query:  KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
        KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
Subjt:  KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE

Query:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL
        TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL
Subjt:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL

Query:  QKHKLWVIKML
        QKHKLWVIKML
Subjt:  QKHKLWVIKML

XP_022944005.1 pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita moschata]0.089.82Show/hide
Query:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        MICAQGFTP+TQFGFSFSLSS LK++R  F S PQL   SPVNFCFM+S ITCNH+NSTFSV +AGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTR+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNV DVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
        SMIEKVISLYW+MGEKE+AISFVKEVLGRK+ FMKD+ EGHKGGPSGYLAWKMMVDGDYRGAVK+VL+LRESGL PEVY YLIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR

Query:  KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
        KLKSY RDG+VAELDKDNV LV+ YQ+ELLADGVRLSNWVL+EG SS HGV HERLLAMYICAG+GLEAERQLWEMKLVGKEAD+DLYDIVLAICASQKE
Subjt:  KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE

Query:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL
        TRAMNRLLTRIEI SP LKKKSL+WLLRGYIKGGHF DAAETLVKMV+LGFLPEYLDRVAVLQGLRKRIREP +VETY  LCKCLSDANLIGP LVYLHL
Subjt:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL

Query:  QKHKLWVIKML
        QK+KLWVIKML
Subjt:  QKHKLWVIKML

XP_023512972.1 pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita pepo subsp. pepo]0.089.82Show/hide
Query:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        MICAQGFTP+TQFGFSFSLSS LK++R  F S PQL   SPVNFCFM+S ITCNH+NSTFSV +AGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTR+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNVGDVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
        SMIEKVISLYW+MGEKE+AISFVKEVLGRK+ FMKD+ EGHKGGPSGYLAWKMMVDGDYRGAVK+VL+LRESGL PEVY YLIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR

Query:  KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
        KLKSY RDG+VAELDKDNV LV+ YQ+ELLADGVRLSNWVL+EG SS H V HERLLAMYICAG+GLEAERQLWEMKLVGKEAD+DLYDIVLAICASQKE
Subjt:  KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE

Query:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL
        TRAMNRLLTRIEI SP LKKKSL+WLLRGYIKGGHF DAAETLVKMV+LGFLPEYLDRVAVLQGLRKRIREP +VETY  LCKCLSDANLIGP LVYLHL
Subjt:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL

Query:  QKHKLWVIKML
        QK+KLWVIKML
Subjt:  QKHKLWVIKML

XP_038901728.1 pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Benincasa hispida]0.090.22Show/hide
Query:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        M+CAQGFTP+TQFGFSFSLSSALKT R  F STPQLY   PV FCFM+S I+CN+++STFSV +AGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNVGDVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
        SMIEKVISLYWEMGEKE+AISFVKEVLGR +AFMKDD EGHKGGPSGYLAWKMMVDGDYRGAVK+VLHLRESGL PEVY YLIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR

Query:  KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
        KLKSY RDGIVAELDK+NV LVE YQTELLADGVRLSNWVLEEGS SIHGV HERLLAMYICAG+GLEAERQLWEMKLVGKEAD+DLYDIVLAICASQKE
Subjt:  KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE

Query:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL
        T+AM RLLTRIEI SP+ KKKSL+WLLRGYIKGGHF DAAETLVKM+DLGFLPEYLDRVAVLQGLRK+IREP +V+TY  LCKCLSDANLIGP LVYLHL
Subjt:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL

Query:  QKHKLWVIKML
        QKHKLWV+KML
Subjt:  QKHKLWVIKML

TrEMBL top hitse value%identityAlignment
A0A0A0KC35 Uncharacterized protein0.087.08Show/hide
Query:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        MICAQGFTP+TQFGFSFSLSS L++QR  F STP+LY         M+S I+CN+++STFSV +A KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTREPSDVLEEMNDRLSARE QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNVGDVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
        SMIEKVISLYWEMGEKE+A+ FVKEVLGR +AFMKDD EGHKGGPSGYLAWKMMVDGDYRGAVK+VLHLRESGL PEVYSYLIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR

Query:  KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
        KLK Y RDG VAELDK+NV LV  YQTELLADGV+LSNWVLEEGSSSI GV HERLLAMYICAG+G+EAERQLWEMKLVGKEAD+DLYDIVLAICASQKE
Subjt:  KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE

Query:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL
        T+AM RLLTRIEI SP++KKKSL+WLLRGYIKGGHF DAA TLVKM++LGFLPEYLDRVAVLQGLRK IREP SV TY  LCKCLSDANLIGP LVYLHL
Subjt:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL

Query:  QKHKLWVIKML
        QKHKLW+IKML
Subjt:  QKHKLWVIKML

A0A1S3CNE0 pentatricopeptide repeat-containing protein At2g30100, chloroplastic0.087.08Show/hide
Query:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        MICAQGFTP+TQFGFSFSLSS L+TQR  F STP+LY         M+S I+CN+++STFSV +A KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTREPSDVLEEMNDRLSARE QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWI KLVEG HNVGDVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
        SMIEKVISLYWEMGEKE+AI FVKEVLGR +AFMKDD EGHKGGPSGYLAWKMMVDGDYRGAVK+VLHLRESGL PEVYSYLIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR

Query:  KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
        KLKSY RDG VAELDK+NV LV  YQTELLADGVRLSNWVLEEGSSSIHGV HERLLAMYICAG+G+EAERQLWEMKL+GKEAD+DLYDIVLAICASQKE
Subjt:  KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE

Query:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL
         +AM RLLTRIEI SP++KKKSL+WLLRGYIKGGHF DAA T+VKM++LGFLPEYLDRVAVLQGLRK IREP  V TY  LCKCLSDANLIGP LVYLHL
Subjt:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL

Query:  QKHKLWVIKML
        QKHKLW+IKML
Subjt:  QKHKLWVIKML

A0A6J1D3T2 pentatricopeptide repeat-containing protein At2g30100, chloroplastic0.0100Show/hide
Query:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
        SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR

Query:  KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
        KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
Subjt:  KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE

Query:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL
        TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL
Subjt:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL

Query:  QKHKLWVIKML
        QKHKLWVIKML
Subjt:  QKHKLWVIKML

A0A6J1FYE9 pentatricopeptide repeat-containing protein At2g30100, chloroplastic0.089.82Show/hide
Query:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        MICAQGFTP+TQFGFSFSLSS LK++R  F S PQL   SPVNFCFM+S ITCNH+NSTFSV +AGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTR+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNV DVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
        SMIEKVISLYW+MGEKE+AISFVKEVLGRK+ FMKD+ EGHKGGPSGYLAWKMMVDGDYRGAVK+VL+LRESGL PEVY YLIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR

Query:  KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
        KLKSY RDG+VAELDKDNV LV+ YQ+ELLADGVRLSNWVL+EG SS HGV HERLLAMYICAG+GLEAERQLWEMKLVGKEAD+DLYDIVLAICASQKE
Subjt:  KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE

Query:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL
        TRAMNRLLTRIEI SP LKKKSL+WLLRGYIKGGHF DAAETLVKMV+LGFLPEYLDRVAVLQGLRKRIREP +VETY  LCKCLSDANLIGP LVYLHL
Subjt:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL

Query:  QKHKLWVIKML
        QK+KLWVIKML
Subjt:  QKHKLWVIKML

A0A6J1JH85 pentatricopeptide repeat-containing protein At2g30100, chloroplastic0.089.24Show/hide
Query:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        MICA GFTP+T+FGFSFSLSS LK++R  F S PQL   SPVNFCF++S ITCNH+NSTFSV +AGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTR+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNVGDVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
        SMIEKVISLYW+MGEKE+AISFVKEVLGRK+ FMKD+ EGHKGGPSGYLAWKMMVDGDYRGAVK+VL+LRESGL PEVY +LIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR

Query:  KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
        KLKSY RDG+VAELDKDNV LV+ YQ+ELLADGVRLSNWVL+EGSSS HGV HERLLAMYICAG+GLEAERQLWEMKLVGKEAD+DLYDIVLAICASQKE
Subjt:  KLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE

Query:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL
        TRAMNRLL+RIEI SP LKKKSL+WLLRGYIKGGHF DAAETLVKMV+LGFLPEYLDRVAVLQGLRKRIREP +VETY  LCKCLSDANLIGP LVYLHL
Subjt:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHL

Query:  QKHKLWVIKML
        QK+KLWVIKML
Subjt:  QKHKLWVIKML

SwissProt top hitse value%identityAlignment
Q0WNN7 Pentatricopeptide repeat-containing protein At2g30100, chloroplastic3.5e-17164.85Show/hide
Query:  AGKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETM
        AGKFR++ L +SVELDQFITS++E    +E+G+GFFEAIEELERMTREPSD+LEEMN RLS+RE QL+LVYF+QEGRDSWC LEVFEWL+KENRVD+E M
Subjt:  AGKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETM

Query:  ELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKD-----DREGHKGGPSGYLAWKMMV
        ELMVSIMC W+KKL+E + N   V DLL++MDCVGLKP FSM++KVI+LY EMG+KE A+ FVKEVL R+  F          EG KGGP GYLAWK MV
Subjt:  ELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKD-----DREGHKGGPSGYLAWKMMV

Query:  DGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEG--SSSIHGVAH
        DGDYR AV +V+ LR SGL PE YSYLIAMTA+VKELN   K LR+LK + R G VAE+D  +  L+E YQ+E L+ G++L+ W +EEG  + SI GV H
Subjt:  DGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEG--SSSIHGVAH

Query:  ERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLP
        ERLLAMYICAGRG EAE+QLW+MKL G+E ++DL+DIV+AICASQKE  A++RLLTR+E +    KKK+LSWLLRGY+KGGHF +AAETLV M+D G  P
Subjt:  ERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLP

Query:  EYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHLQKHKLWVIKML
        EY+DRVAV+QG+ ++I+ P  VE Y  LCK L DA L+GP LVY+++ K+KLW++KM+
Subjt:  EYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHLQKHKLWVIKML

Q0WVV0 Pentatricopeptide repeat-containing protein At1g10910, chloroplastic1.2e-0620.93Show/hide
Query:  MMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVA
        ++ +G     +KL   ++  GL P+V +Y   +   +K  N + KA+  +     +GI                                     +  V 
Subjt:  MMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVA

Query:  HERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFL
        +  +LA+    GR  EAE  + +MK+ G   +   Y  +L   + + + +  + L+T ++ I  +  K  ++ LL+ YIKGG F  + E L ++   G+ 
Subjt:  HERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFL

Query:  PEYLDRVAVLQGLRK
           +    ++ GL K
Subjt:  PEYLDRVAVLQGLRK

Arabidopsis top hitse value%identityAlignment
AT1G10910.1 Pentatricopeptide repeat (PPR) superfamily protein8.8e-0820.93Show/hide
Query:  MMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVA
        ++ +G     +KL   ++  GL P+V +Y   +   +K  N + KA+  +     +GI                                     +  V 
Subjt:  MMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGVA

Query:  HERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFL
        +  +LA+    GR  EAE  + +MK+ G   +   Y  +L   + + + +  + L+T ++ I  +  K  ++ LL+ YIKGG F  + E L ++   G+ 
Subjt:  HERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFL

Query:  PEYLDRVAVLQGLRK
           +    ++ GL K
Subjt:  PEYLDRVAVLQGLRK

AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein5.9e-0424.14Show/hide
Query:  MMVD-GDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGV
        ++VD G +  A K+ + +R+ G+ P+VYS+ I M +  K     A ALR L + +  G    +      +   Y+    A+G  L   +L  G S     
Subjt:  MMVD-GDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEGSSSIHGV

Query:  AHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGF
           +LL +    G   E E+ L ++   G   +   Y++ +     + E     R++  +    P     + + L+ G  K   F +A   L KMV+ G 
Subjt:  AHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGF

Query:  LPE
         P+
Subjt:  LPE

AT2G30100.1 pentatricopeptide (PPR) repeat-containing protein2.5e-17264.85Show/hide
Query:  AGKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETM
        AGKFR++ L +SVELDQFITS++E    +E+G+GFFEAIEELERMTREPSD+LEEMN RLS+RE QL+LVYF+QEGRDSWC LEVFEWL+KENRVD+E M
Subjt:  AGKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETM

Query:  ELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKD-----DREGHKGGPSGYLAWKMMV
        ELMVSIMC W+KKL+E + N   V DLL++MDCVGLKP FSM++KVI+LY EMG+KE A+ FVKEVL R+  F          EG KGGP GYLAWK MV
Subjt:  ELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKD-----DREGHKGGPSGYLAWKMMV

Query:  DGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEG--SSSIHGVAH
        DGDYR AV +V+ LR SGL PE YSYLIAMTA+VKELN   K LR+LK + R G VAE+D  +  L+E YQ+E L+ G++L+ W +EEG  + SI GV H
Subjt:  DGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKSYTRDGIVAELDKDNVGLVEIYQTELLADGVRLSNWVLEEG--SSSIHGVAH

Query:  ERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLP
        ERLLAMYICAGRG EAE+QLW+MKL G+E ++DL+DIV+AICASQKE  A++RLLTR+E +    KKK+LSWLLRGY+KGGHF +AAETLV M+D G  P
Subjt:  ERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLP

Query:  EYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHLQKHKLWVIKML
        EY+DRVAV+QG+ ++I+ P  VE Y  LCK L DA L+GP LVY+++ K+KLW++KM+
Subjt:  EYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHLQKHKLWVIKML


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTTGTGCCCAGGGCTTTACTCCGGTGACTCAATTTGGTTTCTCGTTTTCTTTATCTTCTGCACTGAAAACTCAGAGGCAAAGATTCTTCTCCACTCCCCAATTGTA
TTGCCACTCGCCGGTAAATTTTTGCTTTATGATTTCTTGCATTACCTGCAACCACCGGAATTCTACCTTCTCTGTTCTGAAAGCCGGTAAGTTTCGGGACCTGAGGTTGT
TCAAATCGGTTGAGTTGGACCAGTTCATCACGAGTGACGACGAAGACGAAATGGGAGATGGATTTTTTGAGGCAATCGAGGAGTTGGAGCGCATGACGAGGGAACCATCG
GATGTTCTCGAAGAAATGAATGACCGCCTTTCAGCGAGGGAATTTCAGCTTGTGCTTGTGTACTTCTCTCAAGAAGGGAGAGATTCATGGTGTGCTCTTGAGGTTTTTGA
GTGGCTCCAGAAAGAAAATCGGGTCGACAAGGAGACCATGGAGCTGATGGTGTCTATTATGTGCAGTTGGATTAAAAAATTGGTTGAGGGAGACCATAACGTCGGAGATG
TGGTTGACCTTCTCGTAGATATGGATTGTGTAGGTTTGAAACCTCATTTTAGCATGATAGAAAAAGTCATTTCCTTGTATTGGGAAATGGGCGAGAAGGAACAAGCTATT
TCGTTCGTGAAAGAGGTCTTGGGACGCAAGATTGCTTTTATGAAGGACGATCGGGAGGGGCATAAAGGGGGACCAAGCGGTTATCTCGCATGGAAGATGATGGTTGATGG
TGACTATAGGGGTGCAGTGAAATTGGTGCTGCATCTTAGAGAATCTGGATTAAATCCAGAGGTTTATAGCTACCTCATCGCCATGACTGCTGTGGTTAAAGAGCTGAATG
AATTTGCAAAAGCTCTACGCAAACTCAAAAGTTACACAAGAGATGGAATAGTAGCCGAACTTGATAAAGACAATGTTGGACTTGTGGAGATATATCAGACAGAGCTTCTA
GCCGATGGAGTACGATTATCCAACTGGGTGCTTGAAGAGGGAAGCTCTTCAATTCATGGGGTGGCTCATGAGAGACTTCTTGCTATGTATATTTGCGCCGGGCGAGGACT
TGAGGCAGAGAGACAACTTTGGGAAATGAAGCTTGTAGGTAAGGAGGCTGATAGTGATCTCTATGATATCGTGCTAGCCATCTGTGCTTCACAGAAGGAGACTAGAGCAA
TGAACCGGTTGCTTACCAGGATTGAGATTATAAGTCCCCTGCTGAAGAAGAAGAGCCTATCATGGCTGCTAAGGGGTTACATAAAAGGAGGGCATTTCAGTGATGCTGCA
GAAACATTAGTAAAAATGGTTGATTTGGGTTTTCTCCCAGAATACTTGGACAGAGTAGCTGTGCTGCAAGGCCTAAGAAAACGGATTCGGGAACCTGGAAGTGTGGAAAC
TTACTTCAAGCTCTGCAAGTGTCTCTCAGATGCTAATCTGATTGGACCTGGTCTTGTATATTTGCACTTACAGAAACACAAGCTTTGGGTCATTAAAATGCTTTGA
mRNA sequenceShow/hide mRNA sequence
GAAGGGTCCGAAAATAATTGACGAATATATTATTTTAATCAAATTCTTTCAACATCATAATTTAATTATAGTATACATTAATCAAATAATTGGCGTGATTCTACAGACAA
GAAAGCCACAATGAAGGATTTAATTGAGGGGCAATTATGTAACTTCATAACCGTTAATTCGGTTTAGTAGTCTTCACCATCGGAAGAATTATGATATAAACTCCAAAGCT
TTGTCTTGCAGTGGCGGCAAAACGCTGCCATTTGCGACCACTCACTCCGACCTCCACCATCGCCATCGCCATCGTTCAATCCTCACCCAGAACCAGAAGATTACGGTTTC
TTATTTCCGATCATCAATTCGATTCACCGTTTCTTGAAGAAGGGGTTTCTCGTTCGTTTGCTCTCTCTCTCTCTGCTATCCCTCTTTTCGCTTTGTTTTTCTTTCACTAT
CTGTTGATAATCTACTGTCATCTGAGTTCGGATTGCGCTCGTAATTGATTAATATTCTGGTTTGCTTTCGAAAAGTCGGAGGCGAGGAACGAATGATTCTTCTATTTCTT
GCTTTACTCCATTCGGTTTTGGACTTTTCTCAACTCGGTCATTTTCGTTTGAATTTGGTACTTTGAACTAGAAGTACAAAATGATTTGTGCCCAGGGCTTTACTCCGGTG
ACTCAATTTGGTTTCTCGTTTTCTTTATCTTCTGCACTGAAAACTCAGAGGCAAAGATTCTTCTCCACTCCCCAATTGTATTGCCACTCGCCGGTAAATTTTTGCTTTAT
GATTTCTTGCATTACCTGCAACCACCGGAATTCTACCTTCTCTGTTCTGAAAGCCGGTAAGTTTCGGGACCTGAGGTTGTTCAAATCGGTTGAGTTGGACCAGTTCATCA
CGAGTGACGACGAAGACGAAATGGGAGATGGATTTTTTGAGGCAATCGAGGAGTTGGAGCGCATGACGAGGGAACCATCGGATGTTCTCGAAGAAATGAATGACCGCCTT
TCAGCGAGGGAATTTCAGCTTGTGCTTGTGTACTTCTCTCAAGAAGGGAGAGATTCATGGTGTGCTCTTGAGGTTTTTGAGTGGCTCCAGAAAGAAAATCGGGTCGACAA
GGAGACCATGGAGCTGATGGTGTCTATTATGTGCAGTTGGATTAAAAAATTGGTTGAGGGAGACCATAACGTCGGAGATGTGGTTGACCTTCTCGTAGATATGGATTGTG
TAGGTTTGAAACCTCATTTTAGCATGATAGAAAAAGTCATTTCCTTGTATTGGGAAATGGGCGAGAAGGAACAAGCTATTTCGTTCGTGAAAGAGGTCTTGGGACGCAAG
ATTGCTTTTATGAAGGACGATCGGGAGGGGCATAAAGGGGGACCAAGCGGTTATCTCGCATGGAAGATGATGGTTGATGGTGACTATAGGGGTGCAGTGAAATTGGTGCT
GCATCTTAGAGAATCTGGATTAAATCCAGAGGTTTATAGCTACCTCATCGCCATGACTGCTGTGGTTAAAGAGCTGAATGAATTTGCAAAAGCTCTACGCAAACTCAAAA
GTTACACAAGAGATGGAATAGTAGCCGAACTTGATAAAGACAATGTTGGACTTGTGGAGATATATCAGACAGAGCTTCTAGCCGATGGAGTACGATTATCCAACTGGGTG
CTTGAAGAGGGAAGCTCTTCAATTCATGGGGTGGCTCATGAGAGACTTCTTGCTATGTATATTTGCGCCGGGCGAGGACTTGAGGCAGAGAGACAACTTTGGGAAATGAA
GCTTGTAGGTAAGGAGGCTGATAGTGATCTCTATGATATCGTGCTAGCCATCTGTGCTTCACAGAAGGAGACTAGAGCAATGAACCGGTTGCTTACCAGGATTGAGATTA
TAAGTCCCCTGCTGAAGAAGAAGAGCCTATCATGGCTGCTAAGGGGTTACATAAAAGGAGGGCATTTCAGTGATGCTGCAGAAACATTAGTAAAAATGGTTGATTTGGGT
TTTCTCCCAGAATACTTGGACAGAGTAGCTGTGCTGCAAGGCCTAAGAAAACGGATTCGGGAACCTGGAAGTGTGGAAACTTACTTCAAGCTCTGCAAGTGTCTCTCAGA
TGCTAATCTGATTGGACCTGGTCTTGTATATTTGCACTTACAGAAACACAAGCTTTGGGTCATTAAAATGCTTTGAAGAAGCTCCTCAATAACTCTCTGTTCAGGTAGCT
AATAAAGTGGAGTAAAAGATCATATTATACAGCACCAGTACTTTTGTGGGTGCTTTTACATGTTGATTTTGTATAGTTTGAGGGACCTGTTTTTTGAGGCAGGTGACTCT
GACATCTCTGTAAGTTGACCCTTCAGAGGAATACCTGTACATATATGTCAGCTATTGTGCACAGAGAACCTATGTTTCAACTCAATGTATAGACTGTATAAGGAAATTAC
ATAGTTTGATATTTTTAGTGCAGGCTTCTGTTTCTTGAGTGTTTAAAGTCTGAAAAATCTGGTTTCTGGAGAAAAAGAAAAGAAAAGGGAAACTACATATTGTGAAGTGT
TGCTCGTGAACTGCAAAGTGTTTATGTTTGGCAGCATCTAAGAGCTTGGCCATATTGCTTCTTCTCACCCAGAGAACAACACGTGGGGTCTCATCAAAGAAGGAGAAACA
ACACGTGGTCTGTGCGCAGTGCAGTGCAAGTGTACATGACAACAAACTGCGAACAAACCAACCTTGTGGAACCACTCGACTACCAAACTGATGAAAAATAGAACCTAATT
GAGGGATTCTGACGAAAGCTATCCAAATCAGATCAACTCAAACTTGTAAACTTTATCCAAATCGACCCAAATGGATTGTTTTCGGTTGTCTTGAAATCAATCCAATGCAT
GAGTAAATTTTTTGGGTTCTTTTTTAATTGTTTTAGTCGTAAAATCACCAAGGTTGTTTGTTTCTTTAGTACAACAAATGCAGGGTGCAGGGTTGAATATTCAACTTTGA
GAAAGGTATTAGGTGTCTTATACTTATGCTTAGATTGACCAGGTACATTTCATTCTGGGTTCGATTGAGCTAAAAAGACTTGTGTTTTGCCCGTTGCTTATACAACCATT
CTCCATTACTGAAGAATCACATGGTTGGCTATTATTGTTTTAATTATAGTGAATTGGAATTAGAAATCATTTGGATATGTGGACTCTCAAATTAATTTGATTTAATTTGA
TCTCAACTCTAAGGTTCAGGTCAGATATGGATTGCCATAAGATCAAAGCTCTTGAGACAACGAATAAGTTGCTGACTCTACTATCTGCAGAACTTTTTCAATAGTATGAA
TTGCTATGTTTCGCTTGTTTCTATATAATAATACGTTGATTTGGCCTATGTGCATTATGTCTAGCTACTTCTTTTTAAC
Protein sequenceShow/hide protein sequence
MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPS
DVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEQAI
SFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKSYTRDGIVAELDKDNVGLVEIYQTELL
ADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAA
ETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPGLVYLHLQKHKLWVIKML