CuGenDBv2

MS014185 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS014185
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: pentatricopeptide repeat-containing protein At2g30100, chloroplastic
Genome location: scaffold5:1272153..1274780
RNA-Seq Expression: MS014185
Synteny: MS014185
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG7010495.1 Pentatricopeptide repeat-containing protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]; e value: 2.0e-264; % identity: 90.22; alignment:
Query:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        MICAQGF+P+TQFGFSFSLSS LK++R   FS PQL   SPVNFCFM+S ITCNH+NSTFSV +AGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTR+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNVGDVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
        SMIEKVISLYW+MGEKE+AISFVKEVLGRK+ FMKD+ EGHKGGPSGYLAWKMMVDGDYRGAVK+VL+LRESGL PEVY YLIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR

Query:  KLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
        KLKSY RDG+VAELDKDNV LV+RYQ+ELLADGVRLSNWVL+EG SS HGV HERLLAMYICAG+GLEAERQLWEMKLVGKEAD+DLYDIVLAICASQKE
Subjt:  KLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE

Query:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHL
        TRAMNRLLTRIEI SP LKKKSL+WLLRGYIKGGHF DAAETLVKMV+LGFLPEYLDRVAVLQGLRKRIREP +VETY  LCKCLSDANLIGPSLVYLHL
Subjt:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHL

Query:  QKHKLWVIKML
        QK+KLWVIKML
Subjt:  QKHKLWVIKML

XP_022148369.1 pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Momordica charantia]; e value: 9.3e-294; % identity: 99.61; alignment:
Query:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
        SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR

Query:  KLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
        KLKSYTRDGIVAELDKDNVGLVE YQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
Subjt:  KLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE

Query:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHL
        TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGP LVYLHL
Subjt:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHL

Query:  QKHKLWVIKML
        QKHKLWVIKML
Subjt:  QKHKLWVIKML

XP_022944005.1 pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita moschata]; e value: 5.9e-264; % identity: 90.22; alignment:
Query:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        MICAQGFTP+TQFGFSFSLSS LK++R   FS PQL   SPVNFCFM+S ITCNH+NSTFSV +AGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTR+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNV DVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
        SMIEKVISLYW+MGEKE+AISFVKEVLGRK+ FMKD+ EGHKGGPSGYLAWKMMVDGDYRGAVK+VL+LRESGL PEVY YLIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR

Query:  KLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
        KLKSY RDG+VAELDKDNV LV+RYQ+ELLADGVRLSNWVL+EG SS HGV HERLLAMYICAG+GLEAERQLWEMKLVGKEAD+DLYDIVLAICASQKE
Subjt:  KLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE

Query:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHL
        TRAMNRLLTRIEI SP LKKKSL+WLLRGYIKGGHF DAAETLVKMV+LGFLPEYLDRVAVLQGLRKRIREP +VETY  LCKCLSDANLIGPSLVYLHL
Subjt:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHL

Query:  QKHKLWVIKML
        QK+KLWVIKML
Subjt:  QKHKLWVIKML

XP_023512972.1 pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita pepo subsp. pepo]; e value: 5.9e-264; % identity: 90.22; alignment:
Query:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        MICAQGFTP+TQFGFSFSLSS LK++R   FS PQL   SPVNFCFM+S ITCNH+NSTFSV +AGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTR+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNVGDVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
        SMIEKVISLYW+MGEKE+AISFVKEVLGRK+ FMKD+ EGHKGGPSGYLAWKMMVDGDYRGAVK+VL+LRESGL PEVY YLIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR

Query:  KLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
        KLKSY RDG+VAELDKDNV LV+RYQ+ELLADGVRLSNWVL+EG SS H V HERLLAMYICAG+GLEAERQLWEMKLVGKEAD+DLYDIVLAICASQKE
Subjt:  KLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE

Query:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHL
        TRAMNRLLTRIEI SP LKKKSL+WLLRGYIKGGHF DAAETLVKMV+LGFLPEYLDRVAVLQGLRKRIREP +VETY  LCKCLSDANLIGPSLVYLHL
Subjt:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHL

Query:  QKHKLWVIKML
        QK+KLWVIKML
Subjt:  QKHKLWVIKML

XP_038901728.1 pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Benincasa hispida]; e value: 2.2e-266; % identity: 90.41; alignment:
Query:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        M+CAQGFTP+TQFGFSFSLSSALKT R   FSTPQLY   PV FCFM+S I+CN+++STFSV +AGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNVGDVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
        SMIEKVISLYWEMGEKE+AISFVKEVLGR +AFMKDD EGHKGGPSGYLAWKMMVDGDYRGAVK+VLHLRESGL PEVY YLIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR

Query:  KLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
        KLKSY RDGIVAELDK+NV LVE+YQTELLADGVRLSNWVLEEGS SIHGV HERLLAMYICAG+GLEAERQLWEMKLVGKEAD+DLYDIVLAICASQKE
Subjt:  KLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE

Query:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHL
        T+AM RLLTRIEI SP+ KKKSL+WLLRGYIKGGHF DAAETLVKM+DLGFLPEYLDRVAVLQGLRK+IREP +V+TY  LCKCLSDANLIGPSLVYLHL
Subjt:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHL

Query:  QKHKLWVIKML
        QKHKLWV+KML
Subjt:  QKHKLWVIKML

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KC35 Uncharacterized protein; e value: 3.5e-254; % identity: 87.28; alignment:
Query:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        MICAQGFTP+TQFGFSFSLSS L++QR   FSTP+LY         M+S I+CN+++STFSV +A KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTREPSDVLEEMNDRLSARE QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNVGDVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
        SMIEKVISLYWEMGEKE+A+ FVKEVLGR +AFMKDD EGHKGGPSGYLAWKMMVDGDYRGAVK+VLHLRESGL PEVYSYLIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR

Query:  KLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
        KLK Y RDG VAELDK+NV LV +YQTELLADGV+LSNWVLEEGSSSI GV HERLLAMYICAG+G+EAERQLWEMKLVGKEAD+DLYDIVLAICASQKE
Subjt:  KLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE

Query:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHL
        T+AM RLLTRIEI SP++KKKSL+WLLRGYIKGGHF DAA TLVKM++LGFLPEYLDRVAVLQGLRK IREP SV TY  LCKCLSDANLIGPSLVYLHL
Subjt:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHL

Query:  QKHKLWVIKML
        QKHKLW+IKML
Subjt:  QKHKLWVIKML

A0A1S3CNE0 pentatricopeptide repeat-containing protein At2g30100, chloroplastic; e value: 7.7e-254; % identity: 87.28; alignment:
Query:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        MICAQGFTP+TQFGFSFSLSS L+TQR   FSTP+LY         M+S I+CN+++STFSV +A KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTREPSDVLEEMNDRLSARE QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWI KLVEG HNVGDVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
        SMIEKVISLYWEMGEKE+AI FVKEVLGR +AFMKDD EGHKGGPSGYLAWKMMVDGDYRGAVK+VLHLRESGL PEVYSYLIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR

Query:  KLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
        KLKSY RDG VAELDK+NV LV +YQTELLADGVRLSNWVLEEGSSSIHGV HERLLAMYICAG+G+EAERQLWEMKL+GKEAD+DLYDIVLAICASQKE
Subjt:  KLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE

Query:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHL
         +AM RLLTRIEI SP++KKKSL+WLLRGYIKGGHF DAA T+VKM++LGFLPEYLDRVAVLQGLRK IREP  V TY  LCKCLSDANLIGPSLVYLHL
Subjt:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHL

Query:  QKHKLWVIKML
        QKHKLW+IKML
Subjt:  QKHKLWVIKML

A0A6J1D3T2 pentatricopeptide repeat-containing protein At2g30100, chloroplastic; e value: 4.5e-294; % identity: 99.61; alignment:
Query:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
        SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR

Query:  KLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
        KLKSYTRDGIVAELDKDNVGLVE YQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
Subjt:  KLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE

Query:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHL
        TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGP LVYLHL
Subjt:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHL

Query:  QKHKLWVIKML
        QKHKLWVIKML
Subjt:  QKHKLWVIKML

A0A6J1FYE9 pentatricopeptide repeat-containing protein At2g30100, chloroplastic; e value: 2.8e-264; % identity: 90.22; alignment:
Query:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        MICAQGFTP+TQFGFSFSLSS LK++R   FS PQL   SPVNFCFM+S ITCNH+NSTFSV +AGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTR+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNV DVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
        SMIEKVISLYW+MGEKE+AISFVKEVLGRK+ FMKD+ EGHKGGPSGYLAWKMMVDGDYRGAVK+VL+LRESGL PEVY YLIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR

Query:  KLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
        KLKSY RDG+VAELDKDNV LV+RYQ+ELLADGVRLSNWVL+EG SS HGV HERLLAMYICAG+GLEAERQLWEMKLVGKEAD+DLYDIVLAICASQKE
Subjt:  KLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE

Query:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHL
        TRAMNRLLTRIEI SP LKKKSL+WLLRGYIKGGHF DAAETLVKMV+LGFLPEYLDRVAVLQGLRKRIREP +VETY  LCKCLSDANLIGPSLVYLHL
Subjt:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHL

Query:  QKHKLWVIKML
        QK+KLWVIKML
Subjt:  QKHKLWVIKML

A0A6J1JH85 pentatricopeptide repeat-containing protein At2g30100, chloroplastic; e value: 5.4e-263; % identity: 89.63; alignment:
Query:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        MICA GFTP+T+FGFSFSLSS LK++R   FS PQL   SPVNFCF++S ITCNH+NSTFSV +AGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTR+PSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNVGDVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR
        SMIEKVISLYW+MGEKE+AISFVKEVLGRK+ FMKD+ EGHKGGPSGYLAWKMMVDGDYRGAVK+VL+LRESGL PEVY +LIAMTAVVKELNEFAKALR
Subjt:  SMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALR

Query:  KLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE
        KLKSY RDG+VAELDKDNV LV+RYQ+ELLADGVRLSNWVL+EGSSS HGV HERLLAMYICAG+GLEAERQLWEMKLVGKEAD+DLYDIVLAICASQKE
Subjt:  KLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKE

Query:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHL
        TRAMNRLL+RIEI SP LKKKSL+WLLRGYIKGGHF DAAETLVKMV+LGFLPEYLDRVAVLQGLRKRIREP +VETY  LCKCLSDANLIGPSLVYLHL
Subjt:  TRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHL

Query:  QKHKLWVIKML
        QK+KLWVIKML
Subjt:  QKHKLWVIKML

SwissProt top hits (e value, % identity, alignment)
Q0WNN7 Pentatricopeptide repeat-containing protein At2g30100, chloroplastic; e value: 9.3e-172; % identity: 64.85; alignment:
Query:  AGKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETM
        AGKFR++ L +SVELDQFITS++E    +E+G+GFFEAIEELERMTREPSD+LEEMN RLS+RE QL+LVYF+QEGRDSWC LEVFEWL+KENRVD+E M
Subjt:  AGKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETM

Query:  ELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKD-----DREGHKGGPSGYLAWKMMV
        ELMVSIMC W+KKL+E + N   V DLL++MDCVGLKP FSM++KVI+LY EMG+KE A+ FVKEVL R+  F          EG KGGP GYLAWK MV
Subjt:  ELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKD-----DREGHKGGPSGYLAWKMMV

Query:  DGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEG--SSSIHGVAH
        DGDYR AV +V+ LR SGL PE YSYLIAMTA+VKELN   K LR+LK + R G VAE+D  +  L+E+YQ+E L+ G++L+ W +EEG  + SI GV H
Subjt:  DGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEG--SSSIHGVAH

Query:  ERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLP
        ERLLAMYICAGRG EAE+QLW+MKL G+E ++DL+DIV+AICASQKE  A++RLLTR+E +    KKK+LSWLLRGY+KGGHF +AAETLV M+D G  P
Subjt:  ERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLP

Query:  EYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHLQKHKLWVIKML
        EY+DRVAV+QG+ ++I+ P  VE Y  LCK L DA L+GP LVY+++ K+KLW++KM+
Subjt:  EYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHLQKHKLWVIKML

Q0WVV0 Pentatricopeptide repeat-containing protein At1g10910, chloroplastic; e value: 1.2e-06; % identity: 20.93; alignment:
Query:  MMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVA
        ++ +G     +KL   ++  GL P+V +Y   +   +K  N + KA+  +     +GI                                     +  V 
Subjt:  MMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVA

Query:  HERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFL
        +  +LA+    GR  EAE  + +MK+ G   +   Y  +L   + + + +  + L+T ++ I  +  K  ++ LL+ YIKGG F  + E L ++   G+ 
Subjt:  HERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFL

Query:  PEYLDRVAVLQGLRK
           +    ++ GL K
Subjt:  PEYLDRVAVLQGLRK

Arabidopsis top hits (e value, % identity, alignment)
AT1G10910.1 Pentatricopeptide repeat (PPR) superfamily protein; e value: 8.8e-08; % identity: 20.93; alignment:
Query:  MMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVA
        ++ +G     +KL   ++  GL P+V +Y   +   +K  N + KA+  +     +GI                                     +  V 
Subjt:  MMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEGSSSIHGVA

Query:  HERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFL
        +  +LA+    GR  EAE  + +MK+ G   +   Y  +L   + + + +  + L+T ++ I  +  K  ++ LL+ YIKGG F  + E L ++   G+ 
Subjt:  HERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFL

Query:  PEYLDRVAVLQGLRK
           +    ++ GL K
Subjt:  PEYLDRVAVLQGLRK

AT2G30100.1 pentatricopeptide (PPR) repeat-containing protein; e value: 6.6e-173; % identity: 64.85; alignment:
Query:  AGKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETM
        AGKFR++ L +SVELDQFITS++E    +E+G+GFFEAIEELERMTREPSD+LEEMN RLS+RE QL+LVYF+QEGRDSWC LEVFEWL+KENRVD+E M
Subjt:  AGKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETM

Query:  ELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKD-----DREGHKGGPSGYLAWKMMV
        ELMVSIMC W+KKL+E + N   V DLL++MDCVGLKP FSM++KVI+LY EMG+KE A+ FVKEVL R+  F          EG KGGP GYLAWK MV
Subjt:  ELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEQAISFVKEVLGRKIAFMKD-----DREGHKGGPSGYLAWKMMV

Query:  DGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEG--SSSIHGVAH
        DGDYR AV +V+ LR SGL PE YSYLIAMTA+VKELN   K LR+LK + R G VAE+D  +  L+E+YQ+E L+ G++L+ W +EEG  + SI GV H
Subjt:  DGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKSYTRDGIVAELDKDNVGLVERYQTELLADGVRLSNWVLEEG--SSSIHGVAH

Query:  ERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLP
        ERLLAMYICAGRG EAE+QLW+MKL G+E ++DL+DIV+AICASQKE  A++RLLTR+E +    KKK+LSWLLRGY+KGGHF +AAETLV M+D G  P
Subjt:  ERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAAETLVKMVDLGFLP

Query:  EYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHLQKHKLWVIKML
        EY+DRVAV+QG+ ++I+ P  VE Y  LCK L DA L+GP LVY+++ K+KLW++KM+
Subjt:  EYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHLQKHKLWVIKML
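
The % identity values reported for each hit can be related to the alignment blocks above. In BLAST-style output the middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank for a mismatch or gap. The following is a minimal sketch, not the database's own code, that counts these from one query line and its middle line; the function name `midline_stats` is hypothetical, and both strings are assumed to have had the `Query:` prefix and the matching leading padding stripped so that they are the same length.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block.

    `query` is the residue string from a "Query:" line and `midline` is the
    line directly beneath it, both with their left-hand prefix removed.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")

    # A column is identical when the midline repeats the query residue.
    identical = sum(
        1 for q, m in zip(query, midline) if m == q and m not in " +"
    )
    # Positives are identities plus conservative substitutions ('+').
    positives = identical + midline.count("+")
    length = len(query)  # alignment columns, including any gap columns

    return {
        "identical": identical,
        "positives": positives,
        "length": length,
        "percent_identity": round(100.0 * identical / length, 2),
    }


# Last block of the KAG7010495.1 alignment shown above.
stats = midline_stats("QKHKLWVIKML", "QK+KLWVIKML")
print(stats)
```

This covers a single block only. The % identity reported for a whole hit would be obtained by summing identities and alignment columns over all of that hit's blocks before dividing.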


Sequences
CDS sequence
ATGATTTGTGCCCAGGGCTTTACTCCGGTGACTCAATTTGGTTTCTCGTTTTCTTTATCTTCTGCACTGAAAACTCAGAGGCAAAGATTCTTCTCCACTCCCCAATTGTA
TTGCCACTCGCCGGTAAATTTTTGCTTTATGATTTCTTGCATTACCTGCAACCACCGGAATTCTACCTTCTCTGTTCTGAAAGCCGGTAAGTTTCGGGACCTGAGGTTGT
TCAAATCGGTTGAGTTGGACCAGTTCATCACGAGTGACGACGAAGACGAAATGGGAGATGGATTTTTTGAGGCAATCGAGGAGTTGGAGCGCATGACGAGGGAACCATCG
GATGTTCTCGAAGAAATGAATGACCGCCTTTCAGCGAGGGAATTCCAGCTTGTGCTTGTGTACTTCTCTCAAGAAGGGAGAGATTCATGGTGTGCTCTTGAGGTTTTTGA
GTGGCTCCAGAAAGAAAATCGGGTCGACAAGGAGACCATGGAGCTGATGGTGTCTATTATGTGCAGTTGGATTAAAAAATTGGTTGAGGGAGACCATAACGTCGGAGATG
TGGTTGACCTTCTCGTAGATATGGATTGTGTAGGTTTGAAACCTCATTTTAGCATGATAGAAAAAGTCATTTCCTTGTATTGGGAAATGGGCGAGAAGGAACAAGCTATT
TCGTTCGTGAAAGAGGTCTTGGGACGCAAGATTGCTTTTATGAAGGACGATCGGGAGGGGCATAAAGGGGGACCAAGCGGTTATCTCGCATGGAAGATGATGGTTGATGG
TGACTATAGGGGTGCAGTGAAATTGGTGCTGCATCTTAGAGAATCTGGATTAAATCCAGAGGTTTATAGCTACCTCATCGCCATGACTGCTGTGGTTAAAGAGCTGAATG
AATTTGCAAAAGCTCTACGCAAACTCAAAAGTTACACAAGAGATGGAATAGTAGCCGAACTTGATAAAGACAATGTTGGACTTGTGGAGAGATATCAGACAGAGCTTCTA
GCCGATGGAGTACGATTATCCAACTGGGTGCTTGAAGAGGGAAGCTCTTCAATTCATGGGGTGGCTCATGAGAGACTTCTTGCTATGTATATTTGCGCCGGGCGAGGACT
TGAGGCAGAGAGACAACTTTGGGAAATGAAGCTTGTAGGTAAGGAGGCTGATAGTGATCTCTATGATATCGTGCTAGCCATCTGTGCTTCACAGAAGGAGACTAGAGCAA
TGAACCGGTTGCTTACCAGGATTGAGATTATAAGTCCCCTGCTGAAGAAGAAGAGCCTATCATGGCTGCTAAGGGGTTACATAAAAGGAGGGCATTTCAGTGATGCTGCA
GAAACATTAGTAAAAATGGTTGATTTGGGTTTTCTCCCAGAATACTTGGACAGAGTAGCTGTGCTGCAAGGCCTAAGAAAACGGATTCGGGAACCTGGAAGTGTGGAAAC
TTACTTCAAGCTCTGCAAGTGTCTCTCAGATGCTAATCTGATTGGACCTAGTCTTGTATATTTGCACTTACAGAAACACAAGCTTTGGGTCATTAAAATGCTT
mRNA sequence
ATGATTTGTGCCCAGGGCTTTACTCCGGTGACTCAATTTGGTTTCTCGTTTTCTTTATCTTCTGCACTGAAAACTCAGAGGCAAAGATTCTTCTCCACTCCCCAATTGTA
TTGCCACTCGCCGGTAAATTTTTGCTTTATGATTTCTTGCATTACCTGCAACCACCGGAATTCTACCTTCTCTGTTCTGAAAGCCGGTAAGTTTCGGGACCTGAGGTTGT
TCAAATCGGTTGAGTTGGACCAGTTCATCACGAGTGACGACGAAGACGAAATGGGAGATGGATTTTTTGAGGCAATCGAGGAGTTGGAGCGCATGACGAGGGAACCATCG
GATGTTCTCGAAGAAATGAATGACCGCCTTTCAGCGAGGGAATTCCAGCTTGTGCTTGTGTACTTCTCTCAAGAAGGGAGAGATTCATGGTGTGCTCTTGAGGTTTTTGA
GTGGCTCCAGAAAGAAAATCGGGTCGACAAGGAGACCATGGAGCTGATGGTGTCTATTATGTGCAGTTGGATTAAAAAATTGGTTGAGGGAGACCATAACGTCGGAGATG
TGGTTGACCTTCTCGTAGATATGGATTGTGTAGGTTTGAAACCTCATTTTAGCATGATAGAAAAAGTCATTTCCTTGTATTGGGAAATGGGCGAGAAGGAACAAGCTATT
TCGTTCGTGAAAGAGGTCTTGGGACGCAAGATTGCTTTTATGAAGGACGATCGGGAGGGGCATAAAGGGGGACCAAGCGGTTATCTCGCATGGAAGATGATGGTTGATGG
TGACTATAGGGGTGCAGTGAAATTGGTGCTGCATCTTAGAGAATCTGGATTAAATCCAGAGGTTTATAGCTACCTCATCGCCATGACTGCTGTGGTTAAAGAGCTGAATG
AATTTGCAAAAGCTCTACGCAAACTCAAAAGTTACACAAGAGATGGAATAGTAGCCGAACTTGATAAAGACAATGTTGGACTTGTGGAGAGATATCAGACAGAGCTTCTA
GCCGATGGAGTACGATTATCCAACTGGGTGCTTGAAGAGGGAAGCTCTTCAATTCATGGGGTGGCTCATGAGAGACTTCTTGCTATGTATATTTGCGCCGGGCGAGGACT
TGAGGCAGAGAGACAACTTTGGGAAATGAAGCTTGTAGGTAAGGAGGCTGATAGTGATCTCTATGATATCGTGCTAGCCATCTGTGCTTCACAGAAGGAGACTAGAGCAA
TGAACCGGTTGCTTACCAGGATTGAGATTATAAGTCCCCTGCTGAAGAAGAAGAGCCTATCATGGCTGCTAAGGGGTTACATAAAAGGAGGGCATTTCAGTGATGCTGCA
GAAACATTAGTAAAAATGGTTGATTTGGGTTTTCTCCCAGAATACTTGGACAGAGTAGCTGTGCTGCAAGGCCTAAGAAAACGGATTCGGGAACCTGGAAGTGTGGAAAC
TTACTTCAAGCTCTGCAAGTGTCTCTCAGATGCTAATCTGATTGGACCTAGTCTTGTATATTTGCACTTACAGAAACACAAGCTTTGGGTCATTAAAATGCTT
Protein sequence
MICAQGFTPVTQFGFSFSLSSALKTQRQRFFSTPQLYCHSPVNFCFMISCITCNHRNSTFSVLKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPS
DVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFSMIEKVISLYWEMGEKEQAI
SFVKEVLGRKIAFMKDDREGHKGGPSGYLAWKMMVDGDYRGAVKLVLHLRESGLNPEVYSYLIAMTAVVKELNEFAKALRKLKSYTRDGIVAELDKDNVGLVERYQTELL
ADGVRLSNWVLEEGSSSIHGVAHERLLAMYICAGRGLEAERQLWEMKLVGKEADSDLYDIVLAICASQKETRAMNRLLTRIEIISPLLKKKSLSWLLRGYIKGGHFSDAA
ETLVKMVDLGFLPEYLDRVAVLQGLRKRIREPGSVETYFKLCKCLSDANLIGPSLVYLHLQKHKLWVIKML
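
The protein sequence above should correspond to a translation of the CDS sequence. The following is a minimal sketch of such a check, assuming the standard nuclear genetic code and a CDS that starts in frame; `translate` and `CODON_TABLE` are illustrative names, not part of CuGenDB. Line breaks in the pasted sequence are ignored.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids.

    Whitespace (including line breaks) is removed first; '*' marks a stop.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First six codons of the CDS listed above.
print(translate("ATGATTTGTGCCCAGGGC"))  # MICAQG
```

Passing the full CDS text to `translate` and comparing the result with the protein sequence (after joining its lines) would confirm that the two records agree.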