; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0019546 (gene) of Chayote v1 genome

Gene IDSed0019546
OrganismSechium edule (Chayote v1)
Descriptionpentatricopeptide repeat-containing protein At2g30100, chloroplastic
Genome locationLG05:45571110..45574251
RNA-Seq ExpressionSed0019546
SyntenySed0019546
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570645.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]2.5e-25988.43Show/hide
Query:  MVCAQGSTTLTQFGFSFSLSSGLKTERHGFFTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
        M+CAQG + LTQFGFSFSLSSGLK+ER GF  P+L  RSPV F FMV  ITCNHQNSTFS S+AGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
Subjt:  MVCAQGSTTLTQFGFSFSLSSGLKTERHGFFTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE

Query:  LERMTREPSDVLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFS
        LERM R+PSDVL EMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNVGDVVDLLVDMDCVGLKPHFS
Subjt:  LERMTREPSDVLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFS

Query:  MIEKIISLYWEMDEKEKAISFVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRK
        MIEK+ISLYW+M EKEKAISFVKEVLGRKL FMKD+WEGHKGGPSGYLAWKMMV GDY+GAVKMVL+LRESGLKPEVY YL+AMTAVVKELNE AKALRK
Subjt:  MIEKIISLYWEMDEKEKAISFVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRK

Query:  LKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQT
        LKSYA++G+VAELDKD+V+LV+RYQ+EL+ADGV+LS WVL+EG SS HGVVHERLLAMYICAGQG+EAERQLWEMKLVGKEADADLYDIVLAICASQK+T
Subjt:  LKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQT

Query:  RAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQ
        RAM+RLLTRIEITSP LKKKS+TWLLRGYIKG HF DAAETLVKMV+LGFLPEYLDRVAVLQGLRK+I EPENVETY  LCKCLSDANLIGPSLVYLH+Q
Subjt:  RAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQ

Query:  KSKLWVIKML
        K KLWVIKML
Subjt:  KSKLWVIKML

KAG7010495.1 Pentatricopeptide repeat-containing protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]6.7e-26088.63Show/hide
Query:  MVCAQGSTTLTQFGFSFSLSSGLKTERHGFFTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
        M+CAQG + LTQFGFSFSLSSGLK+ER GF  P+L  RSPV F FMV  ITCNHQNSTFS S+AGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
Subjt:  MVCAQGSTTLTQFGFSFSLSSGLKTERHGFFTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE

Query:  LERMTREPSDVLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFS
        LERMTR+PSDVL EMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNVGDVVDLLVDMDCVGLKPHFS
Subjt:  LERMTREPSDVLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFS

Query:  MIEKIISLYWEMDEKEKAISFVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRK
        MIEK+ISLYW+M EKEKAISFVKEVLGRKL FMKD+WEGHKGGPSGYLAWKMMV GDY+GAVKMVL+LRESGLKPEVY YL+AMTAVVKELNE AKALRK
Subjt:  MIEKIISLYWEMDEKEKAISFVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRK

Query:  LKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQT
        LKSYA++G+VAELDKD+V+LV+RYQ+EL+ADGV+LS WVL+EG SS HGVVHERLLAMYICAGQG+EAERQLWEMKLVGKEADADLYDIVLAICASQK+T
Subjt:  LKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQT

Query:  RAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQ
        RAM+RLLTRIEITSP LKKKS+TWLLRGYIKG HF DAAETLVKMV+LGFLPEYLDRVAVLQGLRK+I EPENVETY  LCKCLSDANLIGPSLVYLH+Q
Subjt:  RAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQ

Query:  KSKLWVIKML
        K KLWVIKML
Subjt:  KSKLWVIKML

XP_022944005.1 pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita moschata]1.9e-25988.63Show/hide
Query:  MVCAQGSTTLTQFGFSFSLSSGLKTERHGFFTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
        M+CAQG T LTQFGFSFSLSSGLK+ER GF  P+L  RSPV F FMV  ITCNHQNSTFS S+AGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
Subjt:  MVCAQGSTTLTQFGFSFSLSSGLKTERHGFFTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE

Query:  LERMTREPSDVLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFS
        LERMTR+PSDVL EMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNV DVVDLLVDMDCVGLKPHFS
Subjt:  LERMTREPSDVLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFS

Query:  MIEKIISLYWEMDEKEKAISFVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRK
        MIEK+ISLYW+M EKEKAISFVKEVLGRKL FMKD+WEGHKGGPSGYLAWKMMV GDY+GAVKMVL+LRESGLKPEVY YL+AMTAVVKELNE AKALRK
Subjt:  MIEKIISLYWEMDEKEKAISFVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRK

Query:  LKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQT
        LKSYA++G+VAELDKD+V+LV+RYQ+EL+ADGV+LS WVL+EG SS HGVVHERLLAMYICAGQG+EAERQLWEMKLVGKEADADLYDIVLAICASQK+T
Subjt:  LKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQT

Query:  RAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQ
        RAM+RLLTRIEITSP LKKKS+TWLLRGYIKG HF DAAETLVKMV+LGFLPEYLDRVAVLQGLRK+I EPENVETY  LCKCLSDANLIGPSLVYLH+Q
Subjt:  RAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQ

Query:  KSKLWVIKML
        K KLWVIKML
Subjt:  KSKLWVIKML

XP_023512972.1 pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita pepo subsp. pepo]1.9e-25988.63Show/hide
Query:  MVCAQGSTTLTQFGFSFSLSSGLKTERHGFFTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
        M+CAQG T LTQFGFSFSLSSGLK+ER GF  P+L  RSPV F FMV  ITCNHQNSTFS S+AGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
Subjt:  MVCAQGSTTLTQFGFSFSLSSGLKTERHGFFTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE

Query:  LERMTREPSDVLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFS
        LERMTR+PSDVL EMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNVGDVVDLLVDMDCVGLKPHFS
Subjt:  LERMTREPSDVLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFS

Query:  MIEKIISLYWEMDEKEKAISFVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRK
        MIEK+ISLYW+M EKEKAISFVKEVLGRKL FMKD+WEGHKGGPSGYLAWKMMV GDY+GAVKMVL+LRESGLKPEVY YL+AMTAVVKELNE AKALRK
Subjt:  MIEKIISLYWEMDEKEKAISFVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRK

Query:  LKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQT
        LKSYA++G+VAELDKD+V+LV+RYQ+EL+ADGV+LS WVL+EG SS H VVHERLLAMYICAGQG+EAERQLWEMKLVGKEADADLYDIVLAICASQK+T
Subjt:  LKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQT

Query:  RAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQ
        RAM+RLLTRIEITSP LKKKS+TWLLRGYIKG HF DAAETLVKMV+LGFLPEYLDRVAVLQGLRK+I EPENVETY  LCKCLSDANLIGPSLVYLH+Q
Subjt:  RAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQ

Query:  KSKLWVIKML
        K KLWVIKML
Subjt:  KSKLWVIKML

XP_038901728.1 pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Benincasa hispida]2.9e-26389.02Show/hide
Query:  MVCAQGSTTLTQFGFSFSLSSGLKTERHGFFTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
        MVCAQG T LTQFGFSFSLSS LKT+RHGF TP+LY   PVKF FMV  I+CN+Q+STFS S+AGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
Subjt:  MVCAQGSTTLTQFGFSFSLSSGLKTERHGFFTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE

Query:  LERMTREPSDVLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFS
        LERMTREPSDVL EMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNVGDVVDLLVDMDCVGLKPHFS
Subjt:  LERMTREPSDVLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFS

Query:  MIEKIISLYWEMDEKEKAISFVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRK
        MIEK+ISLYWEM EKEKAISFVKEVLGR LAFMKDDWEGHKGGPSGYLAWKMMV GDY+GAVKMVLHLRESGLKPEVY YL+AMTAVVKELNE AKALRK
Subjt:  MIEKIISLYWEMDEKEKAISFVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRK

Query:  LKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQT
        LKSYA++G+VAELDK++V+LVE+YQTEL+ADGV+LS WVLEEG+ SIHGVVHERLLAMYICAGQG+EAERQLWEMKLVGKEADADLYDIVLAICASQK+T
Subjt:  LKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQT

Query:  RAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQ
        +AM RLLTRIEITSP+ KKKS+TWLLRGYIKG HFHDAAETLVKM+ LGFLPEYLDRVAVLQGLRKQI EPENV+TY  LCKCLSDANLIGPSLVYLH+Q
Subjt:  RAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQ

Query:  KSKLWVIKML
        K KLWV+KML
Subjt:  KSKLWVIKML

TrEMBL top hitse value%identityAlignment
A0A0A0KC35 Uncharacterized protein6.1e-25185.88Show/hide
Query:  MVCAQGSTTLTQFGFSFSLSSGLKTERHGFFTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
        M+CAQG T LTQFGFSFSLSS L+++R GF TPRLY  SP         I+CN+Q+STFS S+A KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
Subjt:  MVCAQGSTTLTQFGFSFSLSSGLKTERHGFFTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE

Query:  LERMTREPSDVLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFS
        LERMTREPSDVL EMNDRLSARE QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNVGDVVDLLVDMDCVGLKPHFS
Subjt:  LERMTREPSDVLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFS

Query:  MIEKIISLYWEMDEKEKAISFVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRK
        MIEK+ISLYWEM EKEKA+ FVKEVLGR LAFMKDDWEGHKGGPSGYLAWKMMV GDY+GAVKMVLHLRESGL+PEVYSYL+AMTAVVKELNE AKALRK
Subjt:  MIEKIISLYWEMDEKEKAISFVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRK

Query:  LKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQT
        LK YA++G VAELDK++V+LV +YQTEL+ADGVQLS WVLEEG+SSI GVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQK+T
Subjt:  LKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQT

Query:  RAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQ
        +AM RLLTRIEITSP++KKKS+TWLLRGYIKG HF DAA TLVKM++LGFLPEYLDRVAVLQGLRK+I EPE+V TY  LCKCLSDANLIGPSLVYLH+Q
Subjt:  RAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQ

Query:  KSKLWVIKML
        K KLW+IKML
Subjt:  KSKLWVIKML

A0A1S3CNE0 pentatricopeptide repeat-containing protein At2g30100, chloroplastic4.0e-25085.69Show/hide
Query:  MVCAQGSTTLTQFGFSFSLSSGLKTERHGFFTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
        M+CAQG T LTQFGFSFSLSS L+T+R+GF TPRLY  SP         I+CN+Q+STFS S+A KFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
Subjt:  MVCAQGSTTLTQFGFSFSLSSGLKTERHGFFTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE

Query:  LERMTREPSDVLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFS
        LERMTREPSDVL EMNDRLSARE QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWI KLVEG HNVGDVVDLLVDMDCVGLKPHFS
Subjt:  LERMTREPSDVLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFS

Query:  MIEKIISLYWEMDEKEKAISFVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRK
        MIEK+ISLYWEM EKEKAI FVKEVLGR LAFMKDDWEGHKGGPSGYLAWKMMV GDY+GAVKMVLHLRESGL+PEVYSYL+AMTAVVKELNE AKALRK
Subjt:  MIEKIISLYWEMDEKEKAISFVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRK

Query:  LKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQT
        LKSYA++G VAELDK++V+LV +YQTEL+ADGV+LS WVLEEG+SSIHGVVHERLLAMYICAGQGVEAERQLWEMKL+GKEADADLYDIVLAICASQK+ 
Subjt:  LKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQT

Query:  RAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQ
        +AM RLLTRIEITSP++KKKS+TWLLRGYIKG HF DAA T+VKM++LGFLPEYLDRVAVLQGLRK I EPE V TY  LCKCLSDANLIGPSLVYLH+Q
Subjt:  RAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQ

Query:  KSKLWVIKML
        K KLW+IKML
Subjt:  KSKLWVIKML

A0A6J1D3T2 pentatricopeptide repeat-containing protein At2g30100, chloroplastic8.8e-25887.87Show/hide
Query:  MVCAQGSTTLTQFGFSFSLSSGLKTERHGFF-TPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
        M+CAQG T +TQFGFSFSLSS LKT+R  FF TP+LY  SPV F FM+  ITCNH+NSTFS  KAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE
Subjt:  MVCAQGSTTLTQFGFSFSLSSGLKTERHGFF-TPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIE

Query:  ELERMTREPSDVLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
        ELERMTREPSDVL EMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF
Subjt:  ELERMTREPSDVLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHF

Query:  SMIEKIISLYWEMDEKEKAISFVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALR
        SMIEK+ISLYWEM EKE+AISFVKEVLGRK+AFMKDD EGHKGGPSGYLAWKMMV GDY+GAVK+VLHLRESGL PEVYSYL+AMTAVVKELNE AKALR
Subjt:  SMIEKIISLYWEMDEKEKAISFVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALR

Query:  KLKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQ
        KLKSY ++G+VAELDKD+V LVE YQTEL+ADGV+LS WVLEEG+SSIHGV HERLLAMYICAG+G+EAERQLWEMKLVGKEAD+DLYDIVLAICASQK+
Subjt:  KLKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQ

Query:  TRAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHI
        TRAM+RLLTRIEI SPLLKKKS++WLLRGYIKG HF DAAETLVKMV LGFLPEYLDRVAVLQGLRK+I EP +VETYFKLCKCLSDANLIGP LVYLH+
Subjt:  TRAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHI

Query:  QKSKLWVIKML
        QK KLWVIKML
Subjt:  QKSKLWVIKML

A0A6J1FYE9 pentatricopeptide repeat-containing protein At2g30100, chloroplastic9.4e-26088.63Show/hide
Query:  MVCAQGSTTLTQFGFSFSLSSGLKTERHGFFTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
        M+CAQG T LTQFGFSFSLSSGLK+ER GF  P+L  RSPV F FMV  ITCNHQNSTFS S+AGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
Subjt:  MVCAQGSTTLTQFGFSFSLSSGLKTERHGFFTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE

Query:  LERMTREPSDVLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFS
        LERMTR+PSDVL EMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNV DVVDLLVDMDCVGLKPHFS
Subjt:  LERMTREPSDVLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFS

Query:  MIEKIISLYWEMDEKEKAISFVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRK
        MIEK+ISLYW+M EKEKAISFVKEVLGRKL FMKD+WEGHKGGPSGYLAWKMMV GDY+GAVKMVL+LRESGLKPEVY YL+AMTAVVKELNE AKALRK
Subjt:  MIEKIISLYWEMDEKEKAISFVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRK

Query:  LKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQT
        LKSYA++G+VAELDKD+V+LV+RYQ+EL+ADGV+LS WVL+EG SS HGVVHERLLAMYICAGQG+EAERQLWEMKLVGKEADADLYDIVLAICASQK+T
Subjt:  LKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQT

Query:  RAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQ
        RAM+RLLTRIEITSP LKKKS+TWLLRGYIKG HF DAAETLVKMV+LGFLPEYLDRVAVLQGLRK+I EPENVETY  LCKCLSDANLIGPSLVYLH+Q
Subjt:  RAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQ

Query:  KSKLWVIKML
        K KLWVIKML
Subjt:  KSKLWVIKML

A0A6J1JH85 pentatricopeptide repeat-containing protein At2g30100, chloroplastic8.8e-25887.65Show/hide
Query:  MVCAQGSTTLTQFGFSFSLSSGLKTERHGFFTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
        M+CA G T LT+FGFSFSLSSGLK++R GF  P+L  RSPV F F+V  ITCNHQNSTFS S+AGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE
Subjt:  MVCAQGSTTLTQFGFSFSLSSGLKTERHGFFTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEE

Query:  LERMTREPSDVLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFS
        LERMTR+PSDVL EMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG HNVGDVVDLLVDMDCVGLKPHFS
Subjt:  LERMTREPSDVLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFS

Query:  MIEKIISLYWEMDEKEKAISFVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRK
        MIEK+ISLYW+M EKEKAISFVKEVLGRKL FMKD+WEGHKGGPSGYLAWKMMV GDY+GAVKMVL+LRESGLKPEVY +L+AMTAVVKELNE AKALRK
Subjt:  MIEKIISLYWEMDEKEKAISFVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRK

Query:  LKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQT
        LKSYA++G+VAELDKD+V+LV+RYQ+EL+ADGV+LS WVL+EG+SS HGVVHERLLAMYICAGQG+EAERQLWEMKLVGKEADADLYDIVLAICASQK+T
Subjt:  LKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQT

Query:  RAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQ
        RAM+RLL+RIEITSP LKKKS+TWLLRGYIKG HF DAAETLVKMV+LGFLPEYLDRVAVLQGLRK+I EPENVETY  LCKCLSDANLIGPSLVYLH+Q
Subjt:  RAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQ

Query:  KSKLWVIKML
        K KLWVIKML
Subjt:  KSKLWVIKML

SwissProt top hitse value%identityAlignment
Q0WNN7 Pentatricopeptide repeat-containing protein At2g30100, chloroplastic5.7e-16961.71Show/hide
Query:  FTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLGEMNDRLSAREFQL
        F PRL+R   VK +     I CN +        AGKFR++ L +SVELDQFITS++E    +E+G+GFFEAIEELERMTREPSD+L EMN RLS+RE QL
Subjt:  FTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLGEMNDRLSAREFQL

Query:  VLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFSMIEKIISLYWEMDEKEKAISFVKEVL
        +LVYF+QEGRDSWC LEVFEWL+KENRVD+E MELMVSIMC W+KKL+E + N   V DLL++MDCVGLKP FSM++K+I+LY EM +KE A+ FVKEVL
Subjt:  VLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFSMIEKIISLYWEMDEKEKAISFVKEVL

Query:  GRKLAFMKD-----DWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRKLKSYAKNGVVAELDKDSVKLV
         R+  F          EG KGGP GYLAWK MV GDY+ AV MV+ LR SGLKPE YSYL+AMTA+VKELN L K LR+LK +A+ G VAE+D     L+
Subjt:  GRKLAFMKD-----DWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRKLKSYAKNGVVAELDKDSVKLV

Query:  ERYQTELVADGVQLSRWVLEEG--NSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQTRAMSRLLTRIEITSPLLKK
        E+YQ+E ++ G+QL+ W +EEG  N SI GVVHERLLAMYICAG+G EAE+QLW+MKL G+E +ADL+DIV+AICASQK+  A+SRLLTR+E      KK
Subjt:  ERYQTELVADGVQLSRWVLEEG--NSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQTRAMSRLLTRIEITSPLLKK

Query:  KSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQKSKLWVIKML
        K+++WLLRGY+KG HF +AAETLV M+  G  PEY+DRVAV+QG+ ++I  P +VE Y  LCK L DA L+GP LVY++I K KLW++KM+
Subjt:  KSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQKSKLWVIKML

Q0WVV0 Pentatricopeptide repeat-containing protein At1g10910, chloroplastic1.8e-0520.93Show/hide
Query:  MMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRKLKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVV
        ++  G     +K+   ++  GLKP+V +Y   +   +K  N   KA+  +     NG+                                     +  V+
Subjt:  MMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRKLKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVV

Query:  HERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQTRAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFL
        +  +LA+    G+  EAE  + +MK+ G   +   Y  +L   + +   +    L+T ++    +  K  MT LL+ YIKG  F  + E L ++ S G+ 
Subjt:  HERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQTRAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFL

Query:  PEYLDRVAVLQGLRK
           +    ++ GL K
Subjt:  PEYLDRVAVLQGLRK

Arabidopsis top hitse value%identityAlignment
AT1G10910.1 Pentatricopeptide repeat (PPR) superfamily protein1.3e-0620.93Show/hide
Query:  MMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRKLKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVV
        ++  G     +K+   ++  GLKP+V +Y   +   +K  N   KA+  +     NG+                                     +  V+
Subjt:  MMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRKLKSYAKNGVVAELDKDSVKLVERYQTELVADGVQLSRWVLEEGNSSIHGVV

Query:  HERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQTRAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFL
        +  +LA+    G+  EAE  + +MK+ G   +   Y  +L   + +   +    L+T ++    +  K  MT LL+ YIKG  F  + E L ++ S G+ 
Subjt:  HERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQTRAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAETLVKMVSLGFL

Query:  PEYLDRVAVLQGLRK
           +    ++ GL K
Subjt:  PEYLDRVAVLQGLRK

AT2G30100.1 pentatricopeptide (PPR) repeat-containing protein4.0e-17061.71Show/hide
Query:  FTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLGEMNDRLSAREFQL
        F PRL+R   VK +     I CN +        AGKFR++ L +SVELDQFITS++E    +E+G+GFFEAIEELERMTREPSD+L EMN RLS+RE QL
Subjt:  FTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDE----DEMGDGFFEAIEELERMTREPSDVLGEMNDRLSAREFQL

Query:  VLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFSMIEKIISLYWEMDEKEKAISFVKEVL
        +LVYF+QEGRDSWC LEVFEWL+KENRVD+E MELMVSIMC W+KKL+E + N   V DLL++MDCVGLKP FSM++K+I+LY EM +KE A+ FVKEVL
Subjt:  VLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFSMIEKIISLYWEMDEKEKAISFVKEVL

Query:  GRKLAFMKD-----DWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRKLKSYAKNGVVAELDKDSVKLV
         R+  F          EG KGGP GYLAWK MV GDY+ AV MV+ LR SGLKPE YSYL+AMTA+VKELN L K LR+LK +A+ G VAE+D     L+
Subjt:  GRKLAFMKD-----DWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRKLKSYAKNGVVAELDKDSVKLV

Query:  ERYQTELVADGVQLSRWVLEEG--NSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQTRAMSRLLTRIEITSPLLKK
        E+YQ+E ++ G+QL+ W +EEG  N SI GVVHERLLAMYICAG+G EAE+QLW+MKL G+E +ADL+DIV+AICASQK+  A+SRLLTR+E      KK
Subjt:  ERYQTELVADGVQLSRWVLEEG--NSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQTRAMSRLLTRIEITSPLLKK

Query:  KSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQKSKLWVIKML
        K+++WLLRGY+KG HF +AAETLV M+  G  PEY+DRVAV+QG+ ++I  P +VE Y  LCK L DA L+GP LVY++I K KLW++KM+
Subjt:  KSMTWLLRGYIKGSHFHDAAETLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQKSKLWVIKML


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTGTGCTCAGGGATCTACTACGTTAACTCAATTTGGATTTTCGTTTTCTTTATCTTCTGGACTGAAAACTGAGAGGCATGGATTTTTTACTCCCCGATTGTATAG
ACGTTCTCCGGTTAAATTTAGCTTTATGGTTCCTCTTATTACTTGCAACCACCAGAATTCTACTTTTTCTGGTTCGAAAGCAGGTAAGTTTCGGGACCTGAGGTTGTTCA
AATCGGTTGAGTTGGATCAGTTTATTACGAGTGATGATGAAGACGAAATGGGAGATGGGTTTTTTGAGGCAATTGAGGAATTGGAACGAATGACCAGGGAACCATCGGAT
GTTCTTGGGGAAATGAATGACCGCCTTTCGGCGAGGGAGTTTCAGCTCGTGCTCGTGTACTTCTCTCAAGAAGGGAGAGATTCTTGGTGTGCTCTTGAGGTTTTCGAGTG
GCTCCAAAAAGAGAATCGGGTTGACAAGGAGACCATGGAGTTGATGGTGTCTATTATGTGCAGTTGGATTAAGAAGCTAGTTGAGGGAGATCATAACGTCGGAGATGTGG
TTGACCTTCTCGTTGATATGGATTGTGTAGGTTTGAAGCCCCATTTTAGCATGATAGAAAAGATCATCTCTTTGTACTGGGAAATGGACGAGAAGGAGAAAGCAATTTCA
TTTGTAAAAGAGGTCTTGGGACGCAAGCTTGCTTTTATGAAGGACGATTGGGAGGGGCATAAAGGGGGACCTAGCGGTTATCTCGCATGGAAGATGATGGTTGGTGGTGA
CTATAAGGGTGCAGTGAAAATGGTGCTGCATCTTAGAGAATCTGGATTAAAGCCAGAGGTTTATAGCTACCTTGTTGCCATGACTGCTGTCGTAAAAGAGTTGAATGAAT
TAGCAAAAGCTCTTCGCAAACTCAAAAGTTATGCAAAGAATGGAGTAGTGGCTGAACTCGATAAAGACAGTGTCAAACTTGTTGAGCGGTATCAGACGGAGCTTGTAGCT
GATGGTGTACAGTTATCCAGATGGGTGCTTGAAGAGGGAAACTCTTCAATTCATGGGGTGGTGCATGAGAGACTCCTAGCTATGTACATTTGTGCCGGTCAAGGAGTTGA
GGCAGAGAGACAGCTTTGGGAAATGAAGCTTGTAGGTAAGGAGGCCGATGCTGATCTCTACGATATCGTGCTTGCCATTTGTGCTTCACAGAAGCAGACTAGAGCCATGA
GCCGGTTGCTAACTAGGATCGAGATTACGAGTCCCCTGCTTAAGAAGAAGAGTATGACATGGCTACTCAGGGGTTACATAAAAGGAAGCCATTTCCATGATGCAGCAGAA
ACATTAGTAAAAATGGTCAGCTTGGGTTTTCTCCCAGAGTACTTGGACAGAGTAGCTGTGCTGCAAGGTCTAAGAAAACAGATTTGGGAACCCGAAAACGTCGAAACTTA
CTTCAAGCTTTGCAAGTGCCTCTCTGATGCTAATCTAATTGGACCTAGTCTTGTATATTTGCACATACAAAAAAGTAAGCTTTGGGTCATTAAAATGCTTTGA
mRNA sequenceShow/hide mRNA sequence
GTCTTGCAGTGGCGGCAAAAATCGCTGCCATTGCGACCACTCACTCCGATTTCCACCGTCGCCATCGCTCAATCCTCCGTCTCGAAATTAGGGTTTCATTTTTCCGATCA
TCGATTCAATTCTCCGATTCTTCAACAGGTTTTTCTGTTCGATGTTTGCTCAATCTCTTCCGCTTTGTATTTGGTTTTGTTTTCTTTTTGCTAATCGAGTTCTGAATTGA
TGCGGTGGAAACCCTAATGTTCTGATTTTGTTTCGAGAAGTTTCTGGAGTCGGTGTGAATGATTTTGTGTCTTGATTTCTGTATTTCATTCGGTTTTTGGACTTTTCTGA
ACTCGGTGGAGTGGAGATAGATACCTTGACGTTTAAGTTCGTTGATTATTTGAAATTTTAGTTTGGATTTGGTGATTTGAGTTACAAGTACGAAATGGTTTGTGCTCAGG
GATCTACTACGTTAACTCAATTTGGATTTTCGTTTTCTTTATCTTCTGGACTGAAAACTGAGAGGCATGGATTTTTTACTCCCCGATTGTATAGACGTTCTCCGGTTAAA
TTTAGCTTTATGGTTCCTCTTATTACTTGCAACCACCAGAATTCTACTTTTTCTGGTTCGAAAGCAGGTAAGTTTCGGGACCTGAGGTTGTTCAAATCGGTTGAGTTGGA
TCAGTTTATTACGAGTGATGATGAAGACGAAATGGGAGATGGGTTTTTTGAGGCAATTGAGGAATTGGAACGAATGACCAGGGAACCATCGGATGTTCTTGGGGAAATGA
ATGACCGCCTTTCGGCGAGGGAGTTTCAGCTCGTGCTCGTGTACTTCTCTCAAGAAGGGAGAGATTCTTGGTGTGCTCTTGAGGTTTTCGAGTGGCTCCAAAAAGAGAAT
CGGGTTGACAAGGAGACCATGGAGTTGATGGTGTCTATTATGTGCAGTTGGATTAAGAAGCTAGTTGAGGGAGATCATAACGTCGGAGATGTGGTTGACCTTCTCGTTGA
TATGGATTGTGTAGGTTTGAAGCCCCATTTTAGCATGATAGAAAAGATCATCTCTTTGTACTGGGAAATGGACGAGAAGGAGAAAGCAATTTCATTTGTAAAAGAGGTCT
TGGGACGCAAGCTTGCTTTTATGAAGGACGATTGGGAGGGGCATAAAGGGGGACCTAGCGGTTATCTCGCATGGAAGATGATGGTTGGTGGTGACTATAAGGGTGCAGTG
AAAATGGTGCTGCATCTTAGAGAATCTGGATTAAAGCCAGAGGTTTATAGCTACCTTGTTGCCATGACTGCTGTCGTAAAAGAGTTGAATGAATTAGCAAAAGCTCTTCG
CAAACTCAAAAGTTATGCAAAGAATGGAGTAGTGGCTGAACTCGATAAAGACAGTGTCAAACTTGTTGAGCGGTATCAGACGGAGCTTGTAGCTGATGGTGTACAGTTAT
CCAGATGGGTGCTTGAAGAGGGAAACTCTTCAATTCATGGGGTGGTGCATGAGAGACTCCTAGCTATGTACATTTGTGCCGGTCAAGGAGTTGAGGCAGAGAGACAGCTT
TGGGAAATGAAGCTTGTAGGTAAGGAGGCCGATGCTGATCTCTACGATATCGTGCTTGCCATTTGTGCTTCACAGAAGCAGACTAGAGCCATGAGCCGGTTGCTAACTAG
GATCGAGATTACGAGTCCCCTGCTTAAGAAGAAGAGTATGACATGGCTACTCAGGGGTTACATAAAAGGAAGCCATTTCCATGATGCAGCAGAAACATTAGTAAAAATGG
TCAGCTTGGGTTTTCTCCCAGAGTACTTGGACAGAGTAGCTGTGCTGCAAGGTCTAAGAAAACAGATTTGGGAACCCGAAAACGTCGAAACTTACTTCAAGCTTTGCAAG
TGCCTCTCTGATGCTAATCTAATTGGACCTAGTCTTGTATATTTGCACATACAAAAAAGTAAGCTTTGGGTCATTAAAATGCTTTGAGCTCATCAATATCTCTCTGCACA
GGCAGCTAATAAAGTGGAACAAAAGATCATCTCTCACAGCACCAGCACTTTTTTGGGTGCTTTTACATGATGATTTTGTATAGTTTGAAGGACCTGCTTCTTCGAGGCGG
TTGAGGTTACTCTGGTAGCTCT
Protein sequenceShow/hide protein sequence
MVCAQGSTTLTQFGFSFSLSSGLKTERHGFFTPRLYRRSPVKFSFMVPLITCNHQNSTFSGSKAGKFRDLRLFKSVELDQFITSDDEDEMGDGFFEAIEELERMTREPSD
VLGEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGDHNVGDVVDLLVDMDCVGLKPHFSMIEKIISLYWEMDEKEKAIS
FVKEVLGRKLAFMKDDWEGHKGGPSGYLAWKMMVGGDYKGAVKMVLHLRESGLKPEVYSYLVAMTAVVKELNELAKALRKLKSYAKNGVVAELDKDSVKLVERYQTELVA
DGVQLSRWVLEEGNSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDIVLAICASQKQTRAMSRLLTRIEITSPLLKKKSMTWLLRGYIKGSHFHDAAE
TLVKMVSLGFLPEYLDRVAVLQGLRKQIWEPENVETYFKLCKCLSDANLIGPSLVYLHIQKSKLWVIKML