; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g2126 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g2126
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionUAA transporter
Genome locationMC06:28731392..28735025
RNA-Seq ExpressionMC06g2126
SyntenyMC06g2126
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579477.1 hypothetical protein SDJN03_23925, partial [Cucurbita argyrosperma subsp. sororia]2.30e-19366.19Show/hide
Query:  MALASSSDWSLVAA----KSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA
        MA   SS WS+++     K  S MLKDYL D+  S SSNGFRSFPRRQCC TTVRFLLEIDLKVKDS+   RFL RTASRK+ALSTISTLQ+AS AVVRA
Subjt:  MALASSSDWSLVAA----KSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA

Query:  FKQFSFPSPTNRHRKPF----LPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEM
        FK+F  PS    + KPF      RK++LR FWKK D VD  TRR KSFQEFLDEKEPP     +RSDSA CTA+ V GRNSISSCSNS SWTESEF SEM
Subjt:  FKQFSFPSPTNRHRKPF----LPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEM

Query:  IPSSS-GNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAA-------TTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSS
        IPSSS GNSES SENDAVK  KDSP N IGKR+GVTFGKDSMEETT   +       TTT +RE+IVK  W +EEEKEQ SPVSVLDFPF+DEDQ   SS
Subjt:  IPSSS-GNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAA-------TTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSS

Query:  FNCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGL-RRDFCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTA
        FNCNLHLI+GKKQK+  + +RFENG E EPLDLKKRFAD+ + R+ F  IS+ EHQRE++A E+L LVKS   S      TENLLLDFFHEKLEE ++ A
Subjt:  FNCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGL-RRDFCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTA

Query:  DESGCRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL
             R G DF+   +A+VLK TEDWI+G     M TGWE  EGR LY+ +ME AGKWRS AGEKEELA+E EAEVW+SL  ELLIDL
Subjt:  DESGCRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL

XP_022156544.1 uncharacterized protein LOC111023420 [Momordica charantia]0.0100Show/hide
Query:  MALASSSDWSLVAAKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFKQF
        MALASSSDWSLVAAKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFKQF
Subjt:  MALASSSDWSLVAAKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFKQF

Query:  SFPSPTNRHRKPFLPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMIPSSSGNS
        SFPSPTNRHRKPFLPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMIPSSSGNS
Subjt:  SFPSPTNRHRKPFLPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMIPSSSGNS

Query:  ESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGKKQKNP
        ESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGKKQKNP
Subjt:  ESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGKKQKNP

Query:  PKTRRFENGAELEPLDLKKRFADLGLRRDFCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTADESGCRRGGDFEFEFKA
        PKTRRFENGAELEPLDLKKRFADLGLRRDFCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTADESGCRRGGDFEFEFKA
Subjt:  PKTRRFENGAELEPLDLKKRFADLGLRRDFCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTADESGCRRGGDFEFEFKA

Query:  EVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDLC
        EVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDLC
Subjt:  EVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDLC

XP_022922340.1 uncharacterized protein LOC111430353 [Cucurbita moschata]1.62e-19366.19Show/hide
Query:  MALASSSDWSLVAA----KSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA
        MA   SS WS+++     K  S MLKDYL D+  S SSNGFRSFPRRQCC TTVRFLLEIDLKVKDS+   RFL RTASRK+ALSTISTLQ+AS AVVRA
Subjt:  MALASSSDWSLVAA----KSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA

Query:  FKQFSFPSPTNRHRKPF----LPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEM
        FK+F  PS    + KPF      RK++LR FWKK D VD  TRR KSFQEFLDEKEPP     +RSDSA CTA+ V GRNSISSCSNS SWTESEF SEM
Subjt:  FKQFSFPSPTNRHRKPF----LPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEM

Query:  IPSSS-GNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAA-------TTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSS
        IPSSS GNSES SENDAVK  KDSP N IGKR+GVTFGKDSMEETT   +       TTT +RE+IVK  W +EEEKEQ SPVSVLDFPF+DEDQ   SS
Subjt:  IPSSS-GNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAA-------TTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSS

Query:  FNCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGL-RRDFCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTA
        FNCNLHL++GKKQK+  K +RFENG E EPLDLKKRFAD+ + R+ F  IS+ E+QRE++A E+L LVKS   S      TENLLLDFFHEKLEE ++ A
Subjt:  FNCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGL-RRDFCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTA

Query:  DESGCRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL
             R G DF+   +A+VLK TEDWI+G     M TGWE  EGR LY+ +MEKAGKWRS AGEKEELA+E EAEVW+SL  ELLIDL
Subjt:  DESGCRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL

XP_023551213.1 uncharacterized protein LOC111809098 [Cucurbita pepo subsp. pepo]3.46e-19566.6Show/hide
Query:  MALASSSDWSLVAA----KSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA
        MA   SS WS+++     K  S MLKDYL D+  S SSNGFRSFPRRQCC TTVRFLLEIDLKVKDSS   RFL RTASRK+ALSTISTLQ+AS AVVRA
Subjt:  MALASSSDWSLVAA----KSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA

Query:  FKQFSFPSPTNRHRKPF----LPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEM
        FK+F  PS    +RKPF      RK++LRAFWKK D VD  TRR KSFQEFLDEKEPP     +RSDSA CTA+ V GRNSISSCSNS SWTESEF SEM
Subjt:  FKQFSFPSPTNRHRKPF----LPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEM

Query:  IPSSS-GNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAA-------TTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSS
        IPSSS GNSES SENDAVK  KDSP N IGKR+GVTFGKDSMEETT   +       TTT +RE+IVK QW +EEEKEQ SPVSVLDFPF+DEDQ   SS
Subjt:  IPSSS-GNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAA-------TTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSS

Query:  FNCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGL-RRDFCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTA
        FNCNLHL++GKKQK+  + +RFENG E EPLDL KRFAD+ + R+ F  IS+ EHQRE++A E+L LVKS   S       ENLLLDFFHEKLEE ++TA
Subjt:  FNCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGL-RRDFCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTA

Query:  DESGCRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL
             R G DF+   +A+VLK TEDWI+G     M TGWE  EGR LY+ +ME AGKWRS AGEKEELA+E EAEVW+SL  ELLIDL
Subjt:  DESGCRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL

XP_038906459.1 uncharacterized protein LOC120092443 [Benincasa hispida]1.60e-19666.12Show/hide
Query:  ALASSSDWSLVAA----KSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAF
        A   SS+W+L++     K +S MLKDYL D++ S SSNGFRSFPRRQCCTTTVRFLLEIDLKVKD+S   RFL RT SRK+ALSTISTLQ+AS AV+RAF
Subjt:  ALASSSDWSLVAA----KSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAF

Query:  KQFSFPSPTNRHRKPFLPR----KLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMI
        KQF  PS     RK FLPR    KL+ +AFWKK++ VD  TRRWKSF+EFLDEKEPPS    NRSDSA CTAIAVAGRNS SSCSNS SWTESEF SEMI
Subjt:  KQFSFPSPTNRHRKPFLPR----KLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMI

Query:  PSSS-GNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETT----ADAATTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCN
        PSSS GNSES SENDAVKD KDSP N IGKR+GV+FGKDSME+TT    A AA TT YR++IVK QW+++EEKEQ SPVSVLDFPF+DEDQ ISSSFNCN
Subjt:  PSSS-GNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETT----ADAATTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCN

Query:  LHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLG-LRRDFCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTADESG
        ++L+EGKKQK+  K++R E G ELEP+DLKKRF D+  + + F LI+K EHQ EE+A E L L+KS  KS      TENLLLDFFH+KLEE E+TA  S 
Subjt:  LHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLG-LRRDFCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTADESG

Query:  CRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL
             DF+   +A +LK T++WIDG    +M  GWE  E R  Y+ +ME AGKW SFAGEKEEL +E EAEVWISL ++LLIDL
Subjt:  CRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL

TrEMBL top hitse value%identityAlignment
A0A0A0KP06 Uncharacterized protein3.75e-18763.77Show/hide
Query:  MALASSSDWSLVA----AKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA
        MA   SS+W+L++     K +SL+LKDYL D+  S SSNGFRSFPRRQCC+TTVRFLLEIDLKVKDSS   RFL RT SRK+ALSTISTLQ+AS AV+RA
Subjt:  MALASSSDWSLVA----AKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA

Query:  FKQFSFPSPTNRHRKPFLPR----KLLLRAFWKKADTVD-SCTRRWKSFQEFLDEKEPPSPR--HQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFA
        FKQF  PS     RK F PR    KL+ +AF KK+D VD +  +RWKSF+EFLDEKEPPS     +N SDSA CTAIAVAGRNSISSCSNS SWTESEF 
Subjt:  FKQFSFPSPTNRHRKPFLPR----KLLLRAFWKKADTVD-SCTRRWKSFQEFLDEKEPPSPR--HQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFA

Query:  SEMIPSS-SGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTAD----AATTTA--YREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHIS
        SE+IPSS SGNSES SENDAVKD KDSP N IGKR+GVTFGKDSMEETT      AA T+A  YRE+ VK QW++EEEKEQFSPVSVLDFPF+DEDQ IS
Subjt:  SEMIPSS-SGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTAD----AATTTA--YREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHIS

Query:  SSFNCNLHLIEGKKQKN-PPKTRRFENGAELEPLDLKKRFADLGLRRD---FCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEE
        SSFNCN+HL+EGKKQK    KT+R E G ELEP+DLKKRF ++ +  D   F LI+K EHQ EE+ALE L L+KS  +S      TENLLLDFFH+KL+E
Subjt:  SSFNCNLHLIEGKKQKN-PPKTRRFENGAELEPLDLKKRFADLGLRRD---FCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEE

Query:  IESTADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTT--GWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL
         E+T+  S      DF+   + ++LK  +DWIDG    +T    WE+ E R+ Y+ +ME   KWRSF G+KEEL +E E EVWISLL++LLIDL
Subjt:  IESTADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTT--GWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL

A0A5A7TN51 Uncharacterized protein3.40e-18563.41Show/hide
Query:  MALASSSDWSLVA----AKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA
        MA   SS+W+L++     K +SL+LKDYL D+  S SSNGFRSFPRRQCC+TTVRFLLEIDLKVKDSS   +FL RT+SRK+ALSTISTLQ+AS AV+RA
Subjt:  MALASSSDWSLVA----AKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA

Query:  FKQFSFPSPTNRHRKPFLPR----KLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPR--HQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFAS
        FKQF  PS     RK F PR    KL+ +AF KK+D VD   RRWKSF+EFLDEKEPPS     QN SDSA CTAIAVAGRNSISSCSNS SWTESEF S
Subjt:  FKQFSFPSPTNRHRKPFLPR----KLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPR--HQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFAS

Query:  EMIPSS-SGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEET---TADAATTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFN
        E+IPSS SGNSES SEN AVKD KDSP N IGKR+GVTFGKDSMEET   +A AA    YRE+ VK+   +EEEKEQFSPVSVLDFPF+DEDQ ISSSFN
Subjt:  EMIPSS-SGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEET---TADAATTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFN

Query:  CNLHLIEGKKQKN-PPKTRRFENGAELEPLDLKKRFADLGLRRD----FCLISKT-EHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIE
        CN+HL+EGKKQK    KT+R E G ELEP+DLKKRF ++ +  D    F LI+K  EHQ EE+ALE L L+KS  KS      TENLLLDFFH+KL+E E
Subjt:  CNLHLIEGKKQKN-PPKTRRFENGAELEPLDLKKRFADLGLRRD----FCLISKT-EHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIE

Query:  STADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTT--GWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL
        +T+  S      DF+   + ++L+  +DW+DG    +T    WE+ E R+ Y+ +ME A KWRSF G+KEEL +E EAEVWISLL +LLIDL
Subjt:  STADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTT--GWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL

A0A6J1DTT2 uncharacterized protein LOC1110234200.0100Show/hide
Query:  MALASSSDWSLVAAKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFKQF
        MALASSSDWSLVAAKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFKQF
Subjt:  MALASSSDWSLVAAKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFKQF

Query:  SFPSPTNRHRKPFLPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMIPSSSGNS
        SFPSPTNRHRKPFLPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMIPSSSGNS
Subjt:  SFPSPTNRHRKPFLPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMIPSSSGNS

Query:  ESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGKKQKNP
        ESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGKKQKNP
Subjt:  ESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGKKQKNP

Query:  PKTRRFENGAELEPLDLKKRFADLGLRRDFCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTADESGCRRGGDFEFEFKA
        PKTRRFENGAELEPLDLKKRFADLGLRRDFCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTADESGCRRGGDFEFEFKA
Subjt:  PKTRRFENGAELEPLDLKKRFADLGLRRDFCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTADESGCRRGGDFEFEFKA

Query:  EVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDLC
        EVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDLC
Subjt:  EVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDLC

A0A6J1E8H2 uncharacterized protein LOC1114303537.85e-19466.19Show/hide
Query:  MALASSSDWSLVAA----KSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA
        MA   SS WS+++     K  S MLKDYL D+  S SSNGFRSFPRRQCC TTVRFLLEIDLKVKDS+   RFL RTASRK+ALSTISTLQ+AS AVVRA
Subjt:  MALASSSDWSLVAA----KSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA

Query:  FKQFSFPSPTNRHRKPF----LPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEM
        FK+F  PS    + KPF      RK++LR FWKK D VD  TRR KSFQEFLDEKEPP     +RSDSA CTA+ V GRNSISSCSNS SWTESEF SEM
Subjt:  FKQFSFPSPTNRHRKPF----LPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEM

Query:  IPSSS-GNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAA-------TTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSS
        IPSSS GNSES SENDAVK  KDSP N IGKR+GVTFGKDSMEETT   +       TTT +RE+IVK  W +EEEKEQ SPVSVLDFPF+DEDQ   SS
Subjt:  IPSSS-GNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAA-------TTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSS

Query:  FNCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGL-RRDFCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTA
        FNCNLHL++GKKQK+  K +RFENG E EPLDLKKRFAD+ + R+ F  IS+ E+QRE++A E+L LVKS   S      TENLLLDFFHEKLEE ++ A
Subjt:  FNCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGL-RRDFCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTA

Query:  DESGCRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL
             R G DF+   +A+VLK TEDWI+G     M TGWE  EGR LY+ +MEKAGKWRS AGEKEELA+E EAEVW+SL  ELLIDL
Subjt:  DESGCRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL

A0A6J1HZ34 uncharacterized protein LOC1114689622.50e-19266.19Show/hide
Query:  MALASSSDWSLVAA----KSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA
        MA   SS WS+++     K  S MLKDYL D+  S SSNGFRSFPRRQCC TTVRFLLEIDLKVKDSS   RFL RTASRK+ALSTISTLQ+AS AVVRA
Subjt:  MALASSSDWSLVAA----KSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA

Query:  FKQFSFPSPTNRHRKPF----LPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEM
        FK+F  PS    +RKPF      RK++LRAFWKK D VD  TRR KSFQEFLDEKEPP     +RSDSA CTA+ V GRNSISSCSNS SWTESEF SE 
Subjt:  FKQFSFPSPTNRHRKPF----LPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEM

Query:  IPSSS-GNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAA-------TTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSS
        IPSSS GNSES SENDAVK  KDSP N IGKR+GVTFGKDSMEETT   +       TTT +RE+IVK QW +EEEKEQ SPVSVLDFPF+DEDQ   SS
Subjt:  IPSSS-GNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAA-------TTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSS

Query:  FNCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGL-RRDFCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTA
        FNCNLHL++GKK  +  K++RFENG E EPLDLKKRFAD+ +  + F LIS+ EHQRE++A E+L LVKS   S      TENLLLDFFHEKLEE ++ A
Subjt:  FNCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGL-RRDFCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTA

Query:  DESGCRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL
             R G D +   +A+VLK TEDWI+G     M TGWE  EGR LY+ +ME AGKWRS  GEKEELA+E EAEVWISL  ELLIDL
Subjt:  DESGCRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G00770.1 unknown protein8.7e-1023.03Show/hide
Query:  RSLMLKDYLRDEIGSSSSNGFRSFPRR------------------QCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFK
        RS MLKD L ++  S SSNGF+S PRR                  Q     ++ L    +K    SAP+  L R+ SR+LA    +  Q AS  V+R   
Subjt:  RSLMLKDYLRDEIGSSSSNGFRSFPRR------------------QCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFK

Query:  QFSFPSPTNRHRKPFLPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMIPSSSG
                                     D V     RW S ++  ++     P      ++   T       +S +S ++ +SW++ +F SE +PSS G
Subjt:  QFSFPSPTNRHRKPFLPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMIPSSSG

Query:  -NSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGKKQ
         N E   E  +V                    K+++     D+ T     +  V  +   + EKE  SPVSV +   ++ D+   SSF+  L  +E  KQ
Subjt:  -NSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGKKQ

Query:  KNPPKTRRFENGAELEPLDLKK------------------RFADLGLRRDFCLISKTEH--QREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLE
        K     +RFE+ A + P +L +                  ++ D          S+ E+  + EE+A ++ N VK R       +  E+L++D+F ++L 
Subjt:  KNPPKTRRFENGAELEPLDLKK------------------RFADLGLRRDFCLISKTEH--QREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLE

Query:  EIESTADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKW--RSFAGEKEELASELEAEVWISLLHELLIDL
        +  ++  E+           F+ +++   + W+ G            + R     E+E+   W  +    E E + +++E E++  L+ E L  L
Subjt:  EIESTADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKW--RSFAGEKEELASELEAEVWISLLHELLIDL

AT4G11780.1 unknown protein9.2e-2829.47Show/hide
Query:  ALASSSDWSLVAAKSR--SLMLKDYLRDEIGSSSSNGFRSFPRRQ--CCTTTVRFLLEIDLKVK------DSSAPARFLSRTASRKLALSTISTLQKASG
        ++ SSSD  L  +K R   L+L+DYL D++ S SSNGF+SFPRRQ    ++TVR LL+ ++K          +   R   R++      +    + KAS 
Subjt:  ALASSSDWSLVAAKSR--SLMLKDYLRDEIGSSSSNGFRSFPRRQ--CCTTTVRFLLEIDLKVK------DSSAPARFLSRTASRKLALSTISTLQKASG

Query:  AVVRAFKQFSFPSPTNRHRKPF---LPRKLLLRAFWKK----------ADTVDSCTRRWKS--FQEFLDEKEPPSPRHQNRSD------SAACTAIAVAG
        A +   K   FPS T + +  F     ++LL  +FW+K              D   + W+S  ++E LD++     +     D      SAA   +    
Subjt:  AVVRAFKQFSFPSPTNRHRKPF---LPRKLLLRAFWKK----------ADTVDSCTRRWKS--FQEFLDEKEPPSPRHQNRSD------SAACTAIAVAG

Query:  RNSISSCSNSNSWTESEFASEMIPSSSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVKEQWRSEEEKEQFSPVSVL
         +  SS   S  +T S        SSS +S SS E++ V    D+ E+  GK  G +      + ++ +   +   R+E V       EEKEQ SPVS+L
Subjt:  RNSISSCSNSNSWTESEFASEMIPSSSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVKEQWRSEEEKEQFSPVSVL

Query:  DFPF--DDEDQHISSSFNCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGLRRD---FCLISKTEHQREERALEILNLVKSRIKSECFIVRT-
        + PF  DDED  I+              +K   K+RR      LEPLDL KR      R++   +  +   E + E +A  +  LVK RI     ++ + 
Subjt:  DFPF--DDEDQHISSSFNCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGLRRD---FCLISKTEHQREERALEILNLVKSRIKSECFIVRT-

Query:  --ENLLLDFFHEKLEEIESTADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAG-EKEELASELEAEVWIS
          +NLLLD+  E         D  G +         +  ++K  EDW+ G    M   WEV   R +YV EM    KW    G E+E +  EL    + S
Subjt:  --ENLLLDFFHEKLEEIESTADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAG-EKEELASELEAEVWIS

Query:  LLHELLIDL
         + E + DL
Subjt:  LLHELLIDL

AT4G23020.1 unknown protein7.5e-2229.07Show/hide
Query:  MALASSSDWSLVAAKSR--SLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFK
        M   SSSD  L  +K R   L+L+D+L D++ S SSNGF+SFPR          LL  +++        R ++        L+    + KAS A++ A K
Subjt:  MALASSSDWSLVAAKSR--SLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFK

Query:  QFSFPSPTNRHRKPFLPRK-LLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESE--FASEMIPS
           FPS      +    +K L  R+FWKK       +RR    +  +D  E          +   C + A   + S    S+   +      F+ E   S
Subjt:  QFSFPSPTNRHRKPFLPRK-LLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESE--FASEMIPS

Query:  SSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGK
             +SSS +    +V  S    I     V    D +    +D ++     EE         EEKEQ SP+S+LD PF D+      + +   H  E  
Subjt:  SSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGK

Query:  KQKNPPKTRRFENGAELEPLDLKKRFADLGLRRDF--CLISKTEHQREERALEILNLVKSRIKSECFIVR----TENLLLDFFHEKLEEIESTADESGCR
        ++K   K RR E+   LEP+DL+KR      R+D+   +I   E Q E RA  +  LVKSRI  E   +      +N+LLDFF E       T DE    
Subjt:  KQKNPPKTRRFENGAELEPLDLKKRFADLGLRRDF--CLISKTEHQREERALEILNLVKSRIKSECFIVR----TENLLLDFFHEKLEEIESTADESGCR

Query:  RGGDFEFEFKAEVLKLTEDWI---DGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAG-EKEELASELEAEVWISLLHELLIDL
                   +++++ E+W+         M   W+V+E R +YV EM    KW    G EKE +  EL      SL+ EL+ D+
Subjt:  RGGDFEFEFKAEVLKLTEDWI---DGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAG-EKEELASELEAEVWISLLHELLIDL

AT4G23020.2 unknown protein4.0e-2328.77Show/hide
Query:  MALASSSDWSLVAAKSR--SLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFK
        M   SSSD  L  +K R   L+L+D+L D++ S SSNGF+SFPR          LL  +++        R ++        L+    + KAS A++ A K
Subjt:  MALASSSDWSLVAAKSR--SLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFK

Query:  QFSFPSPTNRHRKPFLPRK-LLLRAFWKK----------------ADTVDSCTRRWKSFQEFLDEKEPP-SPRHQNRSDSAACTAIAVAGRNSI---SSC
           FPS      +    +K L  R+FWKK                 D  +   +R +SF EFL E +   S +    S +   +  A   ++++   SS 
Subjt:  QFSFPSPTNRHRKPFLPRK-LLLRAFWKK----------------ADTVDSCTRRWKSFQEFLDEKEPP-SPRHQNRSDSAACTAIAVAGRNSI---SSC

Query:  SNSNSWTESEFASEMIPSSSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVKEQWRSE-EEKEQFSPVSVLDFPFDD
        S+ +S      +  ++   SG+   S  +D    + D+ E F+   E   +G+             +     +V  ++  E EEKEQ SP+S+LD PF D
Subjt:  SNSNSWTESEFASEMIPSSSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVKEQWRSE-EEKEQFSPVSVLDFPFDD

Query:  EDQHISSSFNCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGLRRDF--CLISKTEHQREERALEILNLVKSRIKSECFIVR----TENLLLD
        +      + +   H  E  ++K   K RR E+   LEP+DL+KR      R+D+   +I   E Q E RA  +  LVKSRI  E   +      +N+LLD
Subjt:  EDQHISSSFNCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGLRRDF--CLISKTEHQREERALEILNLVKSRIKSECFIVR----TENLLLD

Query:  FFHEKLEEIESTADESGCRRGGDFEFEFKAEVLKLTEDWI---DGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAG-EKEELASELEAEVWISLLHEL
        FF E       T DE               +++++ E+W+         M   W+V+E R +YV EM    KW    G EKE +  EL      SL+ EL
Subjt:  FFHEKLEEIESTADESGCRRGGDFEFEFKAEVLKLTEDWI---DGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAG-EKEELASELEAEVWISLLHEL

Query:  LIDL
        + D+
Subjt:  LIDL

AT5G03670.1 unknown protein4.2e-0424.44Show/hide
Query:  DEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMIPSSSGNSESS-----SENDAVKDVK--DSPENFI--------GKREGVTFG
        +E+   S  H+  S++      + +G  S S  +   SW + +F + +  SS  N         +  D  +D +  +SP +F+        G R      
Subjt:  DEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMIPSSSGNSESS-----SENDAVKDVK--DSPENFI--------GKREGVTFG

Query:  KDSMEETTADAATTTAYREEIVK--EQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFN---CNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADL
          +            +Y  E +K  E    EEEKEQ SPVSVLD PF D+D+ I    N    +   ++  K     K  RFE  A L+P++L+KR +D 
Subjt:  KDSMEETTADAATTTAYREEIVK--EQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFN---CNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADL

Query:  GLRRDFCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIEST----ADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTTG
                  +TE + EE   E+ +L        C I+ T+ +L  +F E +E  E      +D +      D + E +A +       +          
Subjt:  GLRRDFCLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIEST----ADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTTG

Query:  WEVAEGRSL-----YVNEMEKAGKWRS-FAGEKEELASELEAEVWISLLHELLIDL
        W   E  ++     +    E+ G WRS    +  E   ++E E++  L+ EL  D+
Subjt:  WEVAEGRSL-----YVNEMEKAGKWRS-FAGEKEELASELEAEVWISLLHELLIDL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCTGGCGTCTTCTTCTGATTGGAGCCTCGTGGCGGCGAAATCTAGGTCGCTCATGCTCAAGGATTATCTTCGAGATGAAATCGGCTCTTCTTCCTCTAATGGCTT
CCGATCCTTTCCACGCCGCCAATGCTGCACCACCACCGTCCGATTTCTTCTCGAGATCGATCTCAAAGTGAAGGATTCTTCCGCACCTGCAAGATTCCTCTCTCGAACCG
CTTCCAGAAAACTCGCTCTCTCCACCATCTCCACTCTGCAAAAGGCGTCCGGCGCCGTCGTCAGAGCATTCAAGCAATTTTCCTTTCCTTCCCCCACAAATCGGCATCGG
AAGCCGTTTTTGCCGCGGAAACTGCTTCTCAGAGCCTTCTGGAAGAAAGCAGACACGGTTGATTCCTGTACCAGACGGTGGAAATCGTTTCAGGAATTTCTCGACGAGAA
AGAACCGCCGTCGCCTCGTCATCAGAATCGCTCCGACTCCGCCGCCTGCACCGCCATTGCCGTCGCCGGTAGAAACTCGATCAGTTCTTGCAGTAACAGCAACAGTTGGA
CGGAGAGCGAATTCGCTTCGGAGATGATACCGTCTTCCAGCGGTAATTCCGAGAGTTCCAGCGAAAACGACGCCGTCAAAGATGTTAAGGATTCGCCTGAAAATTTCATC
GGCAAAAGAGAAGGCGTAACGTTCGGAAAAGATTCCATGGAGGAAACAACCGCCGACGCCGCAACTACCACTGCCTATCGGGAGGAAATCGTTAAGGAGCAATGGCGGAG
TGAGGAAGAGAAAGAACAGTTCAGTCCAGTTTCGGTGTTGGATTTTCCTTTCGATGACGAAGATCAACACATCTCCTCATCTTTCAACTGCAATCTTCACCTCATCGAAG
GGAAGAAGCAGAAGAATCCGCCGAAGACGAGGCGATTCGAGAACGGAGCCGAATTGGAACCACTGGACCTGAAGAAGCGATTCGCAGATTTAGGGCTTCGCCGCGATTTC
TGCTTGATATCAAAAACAGAGCACCAGAGGGAGGAGAGGGCGTTGGAAATTCTGAATCTCGTCAAATCGAGGATCAAATCGGAATGCTTCATAGTCAGAACGGAGAATCT
GCTGCTTGATTTCTTCCACGAGAAGCTCGAGGAGATCGAATCGACGGCGGATGAGAGCGGATGCCGAAGAGGAGGTGATTTTGAGTTTGAGTTTAAGGCAGAGGTTTTGA
AATTGACGGAAGATTGGATCGACGGAGGAACGGCGTTGATGACGACGGGATGGGAGGTGGCGGAGGGGCGGAGTTTGTACGTTAATGAAATGGAGAAGGCCGGAAAGTGG
AGAAGTTTCGCCGGAGAAAAGGAAGAATTGGCGTCGGAGTTGGAAGCTGAGGTTTGGATTTCCTTGCTCCACGAGCTATTAATTGACCTCTGCTAG
mRNA sequenceShow/hide mRNA sequence
CCCACATTAAAAAAATTCCGATTCAAAAATTTAAAACTCTCACTCCCACTCGCACTCCCACTTACAGTAAGAACCATCCTGCAACCTTCTTCTTCGTCTTCCTCTGTTCT
ATTCTATTTTATATATAAATTTACTCTGTCTTCTGCTTTTTCTTCTACCTTGAGATCTTCACCTCATTTCCCTCAATTTCTCATTTTTCTCTAAAAGACACTGAAATTAA
TGGAAGTTTTGGTTCTTCCTTTTCCATACCCATTCTCCGCTGTGCTTTAAATGCTCCACCCTCCACTCCAGTACTACTCATCCATGTCTCCTTAATTGCTACTTTTTTCC
TCTCTCATCGTTCTTTACAAATTCAAAATCCTCTGTTTCTCTCTCTCTAGTTTTCGATTTTGTTTGATTTCGTTTTTGAATTGATTATGGCGCTGGCGTCTTCTTCTGAT
TGGAGCCTCGTGGCGGCGAAATCTAGGTCGCTCATGCTCAAGGATTATCTTCGAGATGAAATCGGCTCTTCTTCCTCTAATGGCTTCCGATCCTTTCCACGCCGCCAATG
CTGCACCACCACCGTCCGATTTCTTCTCGAGATCGATCTCAAAGTGAAGGATTCTTCCGCACCTGCAAGATTCCTCTCTCGAACCGCTTCCAGAAAACTCGCTCTCTCCA
CCATCTCCACTCTGCAAAAGGCGTCCGGCGCCGTCGTCAGAGCATTCAAGCAATTTTCCTTTCCTTCCCCCACAAATCGGCATCGGAAGCCGTTTTTGCCGCGGAAACTG
CTTCTCAGAGCCTTCTGGAAGAAAGCAGACACGGTTGATTCCTGTACCAGACGGTGGAAATCGTTTCAGGAATTTCTCGACGAGAAAGAACCGCCGTCGCCTCGTCATCA
GAATCGCTCCGACTCCGCCGCCTGCACCGCCATTGCCGTCGCCGGTAGAAACTCGATCAGTTCTTGCAGTAACAGCAACAGTTGGACGGAGAGCGAATTCGCTTCGGAGA
TGATACCGTCTTCCAGCGGTAATTCCGAGAGTTCCAGCGAAAACGACGCCGTCAAAGATGTTAAGGATTCGCCTGAAAATTTCATCGGCAAAAGAGAAGGCGTAACGTTC
GGAAAAGATTCCATGGAGGAAACAACCGCCGACGCCGCAACTACCACTGCCTATCGGGAGGAAATCGTTAAGGAGCAATGGCGGAGTGAGGAAGAGAAAGAACAGTTCAG
TCCAGTTTCGGTGTTGGATTTTCCTTTCGATGACGAAGATCAACACATCTCCTCATCTTTCAACTGCAATCTTCACCTCATCGAAGGGAAGAAGCAGAAGAATCCGCCGA
AGACGAGGCGATTCGAGAACGGAGCCGAATTGGAACCACTGGACCTGAAGAAGCGATTCGCAGATTTAGGGCTTCGCCGCGATTTCTGCTTGATATCAAAAACAGAGCAC
CAGAGGGAGGAGAGGGCGTTGGAAATTCTGAATCTCGTCAAATCGAGGATCAAATCGGAATGCTTCATAGTCAGAACGGAGAATCTGCTGCTTGATTTCTTCCACGAGAA
GCTCGAGGAGATCGAATCGACGGCGGATGAGAGCGGATGCCGAAGAGGAGGTGATTTTGAGTTTGAGTTTAAGGCAGAGGTTTTGAAATTGACGGAAGATTGGATCGACG
GAGGAACGGCGTTGATGACGACGGGATGGGAGGTGGCGGAGGGGCGGAGTTTGTACGTTAATGAAATGGAGAAGGCCGGAAAGTGGAGAAGTTTCGCCGGAGAAAAGGAA
GAATTGGCGTCGGAGTTGGAAGCTGAGGTTTGGATTTCCTTGCTCCACGAGCTATTAATTGACCTCTGCTAGCTTCGATTTATTTATTTTCTCTCATTTAAAAAAAAAAA
AGTTCGTAGCTAATTAGCAGCCAAATCTACTTGTTAATAATGGATTAAAAAAAAAGCCTAAAATTTTAGTTGAATTGAAATTATTATCACTCATATTAGGGCGGCAAGTA
GGTTTGGCTACTGACTAGTTTTTGAGCTTTGAAGTTATATATTTCATAATTCTCTATGAAATTGGTGCCTAAACTGCGTACTGTATTTGCAAATTTTGTTGCATGAAAAG
TGAAAATGGTAATTGTTTTTCCCTCGT
Protein sequenceShow/hide protein sequence
MALASSSDWSLVAAKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFKQFSFPSPTNRHR
KPFLPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMIPSSSGNSESSSENDAVKDVKDSPENFI
GKREGVTFGKDSMEETTADAATTTAYREEIVKEQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGLRRDF
CLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKW
RSFAGEKEELASELEAEVWISLLHELLIDLC