CuGenDBv2

MS021378 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS021378
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: UAA transporter
Genome location: scaffold358:1531711..1534734
RNA-Seq Expression: MS021378
Synteny: MS021378
Gene Ontology terms: NA
InterPro domains: NA
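
The genome location field above follows a `scaffold:start..end` pattern. As a minimal sketch, it could be split into its parts as shown below. The helper names `GenomeLocation` and `parse_location` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'scaffold:start..end' location string."""
    scaffold: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# scaffold name, a colon, then two integers separated by '..'
_LOCATION_RE = re.compile(r"^(?P<scaffold>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'scaffold358:1531711..1534734'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return GenomeLocation(match["scaffold"], int(match["start"]), int(match["end"]))


loc = parse_location("scaffold358:1531711..1534734")
print(loc.scaffold, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the gene spans 3024 bp on the scaffold.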


Homology
GenBank top hits (e value, % identity, alignment)
KAG6579477.1 hypothetical protein SDJN03_23925, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.4e-155 | % identity: 66.53
Query:  MALASSSDWSLVA----AKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA
        MA   SS WS+++     K  S MLKDYL D+  S SSNGFRSFPRRQCC TTVRFLLEIDLKVKDS+   RFL RTASRK+ALSTISTLQ+AS AVVRA
Subjt:  MALASSSDWSLVA----AKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA

Query:  FKQFSFPSPTNRHRKPF----LPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEM
        FK+F  PS    + KPF      RK++LR FWKK D VD  TRR KSFQEFLDEKEPP     +RSDSA CTA+ V GRNSISSCSNS SWTESEF SEM
Subjt:  FKQFSFPSPTNRHRKPF----LPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEM

Query:  IP-SSSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTA-------DAATTTAYREEIVKQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSF
        IP SSSGNSES SENDAVK  KDSP N IGKR+GVTFGKDSMEETT           TTT +RE+IVK W +EEEKEQ SPVSVLDFPF+DEDQ   SSF
Subjt:  IP-SSSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTA-------DAATTTAYREEIVKQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSF

Query:  NCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGL-RRDFGLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTAD
        NCNLHLI+GKKQK+  + +RFENG E EPLDLKKRFAD+ + R+ FG IS+ EHQRE++A E+L LVKS   S      TENLLLDFFHEKLEE ++ A 
Subjt:  NCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGL-RRDFGLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTAD

Query:  ESGCRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL
            R G DF+   +A+VLK TEDWI+G     M TGWE  EGR LY+ +ME AGKWRS AGEKEELA+E EAEVW+SL  ELLIDL
Subjt:  ESGCRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL

XP_022156544.1 uncharacterized protein LOC111023420 [Momordica charantia] | e value: 5.2e-259 | % identity: 99.58
Query:  MALASSSDWSLVAAKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFKQF
        MALASSSDWSLVAAKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFKQF
Subjt:  MALASSSDWSLVAAKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFKQF

Query:  SFPSPTNRHRKPFLPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMIPSSSGNS
        SFPSPTNRHRKPFLPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMIPSSSGNS
Subjt:  SFPSPTNRHRKPFLPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMIPSSSGNS

Query:  ESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVK-QWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGKKQKNP
        ESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVK QWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGKKQKNP
Subjt:  ESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVK-QWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGKKQKNP

Query:  PKTRRFENGAELEPLDLKKRFADLGLRRDFGLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTADESGCRRGGDFEFEFKA
        PKTRRFENGAELEPLDLKKRFADLGLRRDF LISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTADESGCRRGGDFEFEFKA
Subjt:  PKTRRFENGAELEPLDLKKRFADLGLRRDFGLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTADESGCRRGGDFEFEFKA

Query:  EVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDLC
        EVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDLC
Subjt:  EVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDLC

XP_022922340.1 uncharacterized protein LOC111430353 [Cucurbita moschata] | e value: 1.9e-155 | % identity: 66.53
Query:  MALASSSDWSLVA----AKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA
        MA   SS WS+++     K  S MLKDYL D+  S SSNGFRSFPRRQCC TTVRFLLEIDLKVKDS+   RFL RTASRK+ALSTISTLQ+AS AVVRA
Subjt:  MALASSSDWSLVA----AKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA

Query:  FKQFSFPSPTNRHRKPF----LPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEM
        FK+F  PS    + KPF      RK++LR FWKK D VD  TRR KSFQEFLDEKEPP     +RSDSA CTA+ V GRNSISSCSNS SWTESEF SEM
Subjt:  FKQFSFPSPTNRHRKPF----LPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEM

Query:  IP-SSSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTA-------DAATTTAYREEIVKQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSF
        IP SSSGNSES SENDAVK  KDSP N IGKR+GVTFGKDSMEETT           TTT +RE+IVK W +EEEKEQ SPVSVLDFPF+DEDQ   SSF
Subjt:  IP-SSSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTA-------DAATTTAYREEIVKQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSF

Query:  NCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGL-RRDFGLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTAD
        NCNLHL++GKKQK+  K +RFENG E EPLDLKKRFAD+ + R+ FG IS+ E+QRE++A E+L LVKS   S      TENLLLDFFHEKLEE ++ A 
Subjt:  NCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGL-RRDFGLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTAD

Query:  ESGCRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL
            R G DF+   +A+VLK TEDWI+G     M TGWE  EGR LY+ +MEKAGKWRS AGEKEELA+E EAEVW+SL  ELLIDL
Subjt:  ESGCRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL

XP_023551213.1 uncharacterized protein LOC111809098 [Cucurbita pepo subsp. pepo] | e value: 8.3e-156 | % identity: 66.74
Query:  MALASSSDWSLVA----AKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA
        MA   SS WS+++     K  S MLKDYL D+  S SSNGFRSFPRRQCC TTVRFLLEIDLKVKDSS   RFL RTASRK+ALSTISTLQ+AS AVVRA
Subjt:  MALASSSDWSLVA----AKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA

Query:  FKQFSFPSPTNRHRKPF----LPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEM
        FK+F  PS    +RKPF      RK++LRAFWKK D VD  TRR KSFQEFLDEKEPP     +RSDSA CTA+ V GRNSISSCSNS SWTESEF SEM
Subjt:  FKQFSFPSPTNRHRKPF----LPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEM

Query:  IP-SSSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTA-------DAATTTAYREEIVKQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSF
        IP SSSGNSES SENDAVK  KDSP N IGKR+GVTFGKDSMEETT           TTT +RE+IVKQW +EEEKEQ SPVSVLDFPF+DEDQ   SSF
Subjt:  IP-SSSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTA-------DAATTTAYREEIVKQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSF

Query:  NCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGL-RRDFGLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTAD
        NCNLHL++GKKQK+  + +RFENG E EPLDL KRFAD+ + R+ F  IS+ EHQRE++A E+L LVKS   S       ENLLLDFFHEKLEE ++TA 
Subjt:  NCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGL-RRDFGLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTAD

Query:  ESGCRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL
            R G DF+   +A+VLK TEDWI+G     M TGWE  EGR LY+ +ME AGKWRS AGEKEELA+E EAEVW+SL  ELLIDL
Subjt:  ESGCRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL

XP_038906459.1 uncharacterized protein LOC120092443 [Benincasa hispida] | e value: 1.7e-156 | % identity: 66.25
Query:  ALASSSDWSLVA----AKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAF
        A   SS+W+L++     K +S MLKDYL D++ S SSNGFRSFPRRQCCTTTVRFLLEIDLKVKD+S   RFL RT SRK+ALSTISTLQ+AS AV+RAF
Subjt:  ALASSSDWSLVA----AKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAF

Query:  KQFSFPSPTNRHRKPFLP----RKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMI
        KQF  PS     RK FLP    RKL+ +AFWKK++ VD  TRRWKSF+EFLDEKEPPS    NRSDSA CTAIAVAGRNS SSCSNS SWTESEF SEMI
Subjt:  KQFSFPSPTNRHRKPFLP----RKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMI

Query:  P-SSSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETT----ADAATTTAYREEIVKQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNL
        P SSSGNSES SENDAVKD KDSP N IGKR+GV+FGKDSME+TT    A AA TT YR++IVKQW+++EEKEQ SPVSVLDFPF+DEDQ ISSSFNCN+
Subjt:  P-SSSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETT----ADAATTTAYREEIVKQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNL

Query:  HLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLG-LRRDFGLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTADESGC
        +L+EGKKQK+  K++R E G ELEP+DLKKRF D+  + + F LI+K EHQ EE+A E L L+KS  KS      TENLLLDFFH+KLEE E+TA  S  
Subjt:  HLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLG-LRRDFGLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTADESGC

Query:  RRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL
            DF+   +A +LK T++WIDG    +M  GWE  E R  Y+ +ME AGKW SFAGEKEEL +E EAEVWISL ++LLIDL
Subjt:  RRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KP06 Uncharacterized protein | e value: 1.6e-149 | % identity: 63.89
Query:  MALASSSDWSLVA----AKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA
        MA   SS+W+L++     K +SL+LKDYL D+  S SSNGFRSFPRRQCC+TTVRFLLEIDLKVKDSS   RFL RT SRK+ALSTISTLQ+AS AV+RA
Subjt:  MALASSSDWSLVA----AKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA

Query:  FKQFSFPSPTNRHRKPFLP----RKLLLRAFWKKADTVD-SCTRRWKSFQEFLDEKEPPSPR--HQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFA
        FKQF  PS     RK F P    RKL+ +AF KK+D VD +  +RWKSF+EFLDEKEPPS     +N SDSA CTAIAVAGRNSISSCSNS SWTESEF 
Subjt:  FKQFSFPSPTNRHRKPFLP----RKLLLRAFWKKADTVD-SCTRRWKSFQEFLDEKEPPSPR--HQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFA

Query:  SEMIPSS-SGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEET----TADAATTTA--YREEIVKQWRSEEEKEQFSPVSVLDFPFDDEDQHISS
        SE+IPSS SGNSES SENDAVKD KDSP N IGKR+GVTFGKDSMEET    T+ AA T+A  YRE+ VKQW++EEEKEQFSPVSVLDFPF+DEDQ ISS
Subjt:  SEMIPSS-SGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEET----TADAATTTA--YREEIVKQWRSEEEKEQFSPVSVLDFPFDDEDQHISS

Query:  SFNCNLHLIEGKKQK-NPPKTRRFENGAELEPLDLKKRFADLGLRRD---FGLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEI
        SFNCN+HL+EGKKQK    KT+R E G ELEP+DLKKRF ++ +  D   F LI+K EHQ EE+ALE L L+KS  +S      TENLLLDFFH+KL+E 
Subjt:  SFNCNLHLIEGKKQK-NPPKTRRFENGAELEPLDLKKRFADLGLRRD---FGLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEI

Query:  ESTADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTT--GWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL
        E+T+  S      DF+   + ++LK  +DWIDG    +T    WE+ E R+ Y+ +ME   KWRSF G+KEEL +E E EVWISLL++LLIDL
Subjt:  ESTADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTT--GWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL

A0A5A7TN51 Uncharacterized protein | e value: 3.4e-147 | % identity: 63.62
Query:  MALASSSDWSLVA----AKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA
        MA   SS+W+L++     K +SL+LKDYL D+  S SSNGFRSFPRRQCC+TTVRFLLEIDLKVKDSS   +FL RT+SRK+ALSTISTLQ+AS AV+RA
Subjt:  MALASSSDWSLVA----AKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA

Query:  FKQFSFPSPTNRHRKPFLP----RKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPR--HQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFAS
        FKQF  PS     RK F P    RKL+ +AF KK+D VD   RRWKSF+EFLDEKEPPS     QN SDSA CTAIAVAGRNSISSCSNS SWTESEF S
Subjt:  FKQFSFPSPTNRHRKPFLP----RKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPR--HQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFAS

Query:  EMIPSS-SGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEET---TADAATTTAYREEIVKQWR-SEEEKEQFSPVSVLDFPFDDEDQHISSSFN
        E+IPSS SGNSES SEN AVKD KDSP N IGKR+GVTFGKDSMEET   +A AA    YRE+ VK+W+ +EEEKEQFSPVSVLDFPF+DEDQ ISSSFN
Subjt:  EMIPSS-SGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEET---TADAATTTAYREEIVKQWR-SEEEKEQFSPVSVLDFPFDDEDQHISSSFN

Query:  CNLHLIEGKKQK-NPPKTRRFENGAELEPLDLKKRFADLGLRRD----FGLISK-TEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIE
        CN+HL+EGKKQK    KT+R E G ELEP+DLKKRF ++ +  D    F LI+K  EHQ EE+ALE L L+KS  KS      TENLLLDFFH+KL+E E
Subjt:  CNLHLIEGKKQK-NPPKTRRFENGAELEPLDLKKRFADLGLRRD----FGLISK-TEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIE

Query:  STADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTT--GWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL
        +T+  S      DF+   + ++L+  +DW+DG    +T    WE+ E R+ Y+ +ME A KWRSF G+KEEL +E EAEVWISLL +LLIDL
Subjt:  STADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTT--GWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL

A0A6J1DTT2 uncharacterized protein LOC111023420 | e value: 2.5e-259 | % identity: 99.58
Query:  MALASSSDWSLVAAKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFKQF
        MALASSSDWSLVAAKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFKQF
Subjt:  MALASSSDWSLVAAKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFKQF

Query:  SFPSPTNRHRKPFLPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMIPSSSGNS
        SFPSPTNRHRKPFLPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMIPSSSGNS
Subjt:  SFPSPTNRHRKPFLPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMIPSSSGNS

Query:  ESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVK-QWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGKKQKNP
        ESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVK QWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGKKQKNP
Subjt:  ESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVK-QWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGKKQKNP

Query:  PKTRRFENGAELEPLDLKKRFADLGLRRDFGLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTADESGCRRGGDFEFEFKA
        PKTRRFENGAELEPLDLKKRFADLGLRRDF LISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTADESGCRRGGDFEFEFKA
Subjt:  PKTRRFENGAELEPLDLKKRFADLGLRRDFGLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTADESGCRRGGDFEFEFKA

Query:  EVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDLC
        EVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDLC
Subjt:  EVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDLC

A0A6J1E8H2 uncharacterized protein LOC111430353 | e value: 9.0e-156 | % identity: 66.53
Query:  MALASSSDWSLVA----AKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA
        MA   SS WS+++     K  S MLKDYL D+  S SSNGFRSFPRRQCC TTVRFLLEIDLKVKDS+   RFL RTASRK+ALSTISTLQ+AS AVVRA
Subjt:  MALASSSDWSLVA----AKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA

Query:  FKQFSFPSPTNRHRKPF----LPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEM
        FK+F  PS    + KPF      RK++LR FWKK D VD  TRR KSFQEFLDEKEPP     +RSDSA CTA+ V GRNSISSCSNS SWTESEF SEM
Subjt:  FKQFSFPSPTNRHRKPF----LPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEM

Query:  IP-SSSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTA-------DAATTTAYREEIVKQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSF
        IP SSSGNSES SENDAVK  KDSP N IGKR+GVTFGKDSMEETT           TTT +RE+IVK W +EEEKEQ SPVSVLDFPF+DEDQ   SSF
Subjt:  IP-SSSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTA-------DAATTTAYREEIVKQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSF

Query:  NCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGL-RRDFGLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTAD
        NCNLHL++GKKQK+  K +RFENG E EPLDLKKRFAD+ + R+ FG IS+ E+QRE++A E+L LVKS   S      TENLLLDFFHEKLEE ++ A 
Subjt:  NCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGL-RRDFGLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTAD

Query:  ESGCRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL
            R G DF+   +A+VLK TEDWI+G     M TGWE  EGR LY+ +MEKAGKWRS AGEKEELA+E EAEVW+SL  ELLIDL
Subjt:  ESGCRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL

A0A6J1HZ34 uncharacterized protein LOC111468962 | e value: 1.1e-153 | % identity: 66.32
Query:  MALASSSDWSLVA----AKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA
        MA   SS WS+++     K  S MLKDYL D+  S SSNGFRSFPRRQCC TTVRFLLEIDLKVKDSS   RFL RTASRK+ALSTISTLQ+AS AVVRA
Subjt:  MALASSSDWSLVA----AKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRA

Query:  FKQFSFPSPTNRHRKPF----LPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEM
        FK+F  PS    +RKPF      RK++LRAFWKK D VD  TRR KSFQEFLDEKEPP     +RSDSA CTA+ V GRNSISSCSNS SWTESEF SE 
Subjt:  FKQFSFPSPTNRHRKPF----LPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEM

Query:  IP-SSSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTA-------DAATTTAYREEIVKQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSF
        IP SSSGNSES SENDAVK  KDSP N IGKR+GVTFGKDSMEETT           TTT +RE+IVKQW +EEEKEQ SPVSVLDFPF+DEDQ   SSF
Subjt:  IP-SSSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTA-------DAATTTAYREEIVKQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSF

Query:  NCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGL-RRDFGLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTAD
        NCNLHL++GKK  +  K++RFENG E EPLDLKKRFAD+ +  + F LIS+ EHQRE++A E+L LVKS   S      TENLLLDFFHEKLEE ++ A 
Subjt:  NCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGL-RRDFGLISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTAD

Query:  ESGCRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL
            R G D +   +A+VLK TEDWI+G     M TGWE  EGR LY+ +ME AGKWRS  GEKEELA+E EAEVWISL  ELLIDL
Subjt:  ESGCRRGGDFEFEFKAEVLKLTEDWIDGGTA-LMTTGWEVAEGRSLYVNEMEKAGKWRSFAGEKEELASELEAEVWISLLHELLIDL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G00770.1 unknown protein | e value: 3.9e-10 | % identity: 24.09
Query:  RSLMLKDYLRDEIGSSSSNGFRSFPRR------------------QCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFK
        RS MLKD L ++  S SSNGF+S PRR                  Q     ++ L    +K    SAP+  L R+ SR+LA    +  Q AS  V+R   
Subjt:  RSLMLKDYLRDEIGSSSSNGFRSFPRR------------------QCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFK

Query:  QFSFPSPTNRHRKPFLPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMIPSSSG
                                     D V     RW S ++  ++     P      ++   T       +S +S ++ +SW++ +F SE +PSS G
Subjt:  QFSFPSPTNRHRKPFLPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMIPSSSG

Query:  -NSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVKQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGKKQK
         N E   E  +VK+                 G+DS   T    A T    EE       + EKE  SPVSV +   ++ D+   SSF+  L  +E  KQK
Subjt:  -NSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVKQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGKKQK

Query:  NPPKTRRFENGAELEPLDLKK-RFADLGLRRDFGLISKTEH-------------------QREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEE
             +RFE+ A + P +L +    D     + G  + T++                   + EE+A ++ N VK R       +  E+L++D+F ++L +
Subjt:  NPPKTRRFENGAELEPLDLKK-RFADLGLRRDFGLISKTEH-------------------QREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEE

Query:  IESTADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKW--RSFAGEKEELASELEAEVWISLLHELLIDL
          ++  E+           F+ +++   + W+ G            + R     E+E+   W  +    E E + +++E E++  L+ E L  L
Subjt:  IESTADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKW--RSFAGEKEELASELEAEVWISLLHELLIDL

AT4G11780.1 unknown protein | e value: 2.4e-28 | % identity: 29.53
Query:  ALASSSDWSLVAAKSR--SLMLKDYLRDEIGSSSSNGFRSFPRRQ--CCTTTVRFLLEIDLKVK------DSSAPARFLSRTASRKLALSTISTLQKASG
        ++ SSSD  L  +K R   L+L+DYL D++ S SSNGF+SFPRRQ    ++TVR LL+ ++K          +   R   R++      +    + KAS 
Subjt:  ALASSSDWSLVAAKSR--SLMLKDYLRDEIGSSSSNGFRSFPRRQ--CCTTTVRFLLEIDLKVK------DSSAPARFLSRTASRKLALSTISTLQKASG

Query:  AVVRAFKQFSFPSPTNRHRKPF---LPRKLLLRAFWKK----------ADTVDSCTRRWKS--FQEFLDEKEPPSPRHQNRSD------SAACTAIAVAG
        A +   K   FPS T + +  F     ++LL  +FW+K              D   + W+S  ++E LD++     +     D      SAA   +    
Subjt:  AVVRAFKQFSFPSPTNRHRKPF---LPRKLLLRAFWKK----------ADTVDSCTRRWKS--FQEFLDEKEPPSPRHQNRSD------SAACTAIAVAG

Query:  RNSISSCSNSNSWTESEFASEMIPSSSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVKQWRSEEEKEQFSPVSVLD
         +  SS   S  +T S        SSS +S SS E++ V    D+ E+  GK  G +      + ++ +   +   R+E V      EEKEQ SPVS+L+
Subjt:  RNSISSCSNSNSWTESEFASEMIPSSSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVKQWRSEEEKEQFSPVSVLD

Query:  FPF--DDEDQHISSSFNCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGLRRD---FGLISKTEHQREERALEILNLVKSRIKSECFIVRT--
         PF  DDED  I+              +K   K+RR      LEPLDL KR      R++   +  +   E + E +A  +  LVK RI     ++ +  
Subjt:  FPF--DDEDQHISSSFNCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGLRRD---FGLISKTEHQREERALEILNLVKSRIKSECFIVRT--

Query:  -ENLLLDFFHEKLEEIESTADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAG-EKEELASELEAEVWISL
         +NLLLD+  E         D  G +         +  ++K  EDW+ G    M   WEV   R +YV EM    KW    G E+E +  EL    + S 
Subjt:  -ENLLLDFFHEKLEEIESTADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAG-EKEELASELEAEVWISL

Query:  LHELLIDL
        + E + DL
Subjt:  LHELLIDL

AT4G23020.1 unknown protein | e value: 2.6e-22 | % identity: 29.13
Query:  MALASSSDWSLVAAKSR--SLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFK
        M   SSSD  L  +K R   L+L+D+L D++ S SSNGF+SFPR          LL  +++        R ++        L+    + KAS A++ A K
Subjt:  MALASSSDWSLVAAKSR--SLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFK

Query:  QFSFPSPTNRHRKPFLPRK-LLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESE--FASEMIPS
           FPS      +    +K L  R+FWKK       +RR    +  +D  E          +   C + A   + S    S+   +      F+ E   S
Subjt:  QFSFPSPTNRHRKPFLPRK-LLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESE--FASEMIPS

Query:  SSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVKQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGKK
             +SSS +    +V  S    I     V    D +    +D ++     EE        EEKEQ SP+S+LD PF D+      + +   H  E  +
Subjt:  SSGNSESSSENDAVKDVKDSPENFIGKREGVTFGKDSMEETTADAATTTAYREEIVKQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGKK

Query:  QKNPPKTRRFENGAELEPLDLKKRFADLGLRRDF--GLISKTEHQREERALEILNLVKSRIKSECFIVR----TENLLLDFFHEKLEEIESTADESGCRR
        +K   K RR E+   LEP+DL+KR      R+D+   +I   E Q E RA  +  LVKSRI  E   +      +N+LLDFF E       T DE     
Subjt:  QKNPPKTRRFENGAELEPLDLKKRFADLGLRRDF--GLISKTEHQREERALEILNLVKSRIKSECFIVR----TENLLLDFFHEKLEEIESTADESGCRR

Query:  GGDFEFEFKAEVLKLTEDWI---DGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAG-EKEELASELEAEVWISLLHELLIDL
                  +++++ E+W+         M   W+V+E R +YV EM    KW    G EKE +  EL      SL+ EL+ D+
Subjt:  GGDFEFEFKAEVLKLTEDWI---DGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAG-EKEELASELEAEVWISLLHELLIDL

AT4G23020.2 unknown protein | e value: 3.1e-23 | % identity: 28.83
Query:  MALASSSDWSLVAAKSR--SLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFK
        M   SSSD  L  +K R   L+L+D+L D++ S SSNGF+SFPR          LL  +++        R ++        L+    + KAS A++ A K
Subjt:  MALASSSDWSLVAAKSR--SLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFK

Query:  QFSFPSPTNRHRKPFLPRK-LLLRAFWKK----------------ADTVDSCTRRWKSFQEFLDEKEPP-SPRHQNRSDSAACTAIAVAGRNSI---SSC
           FPS      +    +K L  R+FWKK                 D  +   +R +SF EFL E +   S +    S +   +  A   ++++   SS 
Subjt:  QFSFPSPTNRHRKPFLPRK-LLLRAFWKK----------------ADTVDSCTRRWKSFQEFLDEKEPP-SPRHQNRSDSAACTAIAVAGRNSI---SSC

Query:  SNSNSWTESEFASEMIPSSSGNSESSSENDAVKDVKDSPENFIGKREGVTFGK-DSMEETTADAATTTAYREEIVKQWRSEEEKEQFSPVSVLDFPFDDE
        S+ +S      +  ++   SG+   S  +D    + D+ E F+   E   +G+  S+                 +K     EEKEQ SP+S+LD PF D+
Subjt:  SNSNSWTESEFASEMIPSSSGNSESSSENDAVKDVKDSPENFIGKREGVTFGK-DSMEETTADAATTTAYREEIVKQWRSEEEKEQFSPVSVLDFPFDDE

Query:  DQHISSSFNCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGLRRDF--GLISKTEHQREERALEILNLVKSRIKSECFIVR----TENLLLDF
              + +   H  E  ++K   K RR E+   LEP+DL+KR      R+D+   +I   E Q E RA  +  LVKSRI  E   +      +N+LLDF
Subjt:  DQHISSSFNCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGLRRDF--GLISKTEHQREERALEILNLVKSRIKSECFIVR----TENLLLDF

Query:  FHEKLEEIESTADESGCRRGGDFEFEFKAEVLKLTEDWI---DGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAG-EKEELASELEAEVWISLLHELL
        F E       T DE               +++++ E+W+         M   W+V+E R +YV EM    KW    G EKE +  EL      SL+ EL+
Subjt:  FHEKLEEIESTADESGCRRGGDFEFEFKAEVLKLTEDWI---DGGTALMTTGWEVAEGRSLYVNEMEKAGKWRSFAG-EKEELASELEAEVWISLLHELL

Query:  IDL
         D+
Subjt:  IDL

AT5G03670.1 unknown protein | e value: 1.6e-08 | % identity: 27.88
Query:  EEEKEQFSPVSVLDFPFDDEDQHISSSFN---CNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGLRRDFGLISKTEHQREERALEILNLVKSR
        EEEKEQ SPVSVLD PF D+D+ I    N    +   ++  K     K  RFE  A L+P++L+KR +D           +TE + EE   E+ +L    
Subjt:  EEEKEQFSPVSVLDFPFDDEDQHISSSFN---CNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGLRRDFGLISKTEHQREERALEILNLVKSR

Query:  IKSECFIVRTENLLLDFFHEKLEEIEST----ADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTTGWEVAEGRSL-----YVNEMEKAGKWRS-FA
            C I+ T+ +L  +F E +E  E      +D +      D + E +A +       +          W   E  ++     +    E+ G WRS   
Subjt:  IKSECFIVRTENLLLDFFHEKLEEIEST----ADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTTGWEVAEGRSL-----YVNEMEKAGKWRS-FA

Query:  GEKEELASELEAEVWISLLHELLIDL
         +  E   ++E E++  L+ EL  D+
Subjt:  GEKEELASELEAEVWISLLHELLIDL
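
The % identity values listed for the hits above are conventionally the share of alignment columns in which query and subject carry the same residue. In this listing the Subjt rows appear to repeat the Query text, so the sketch below relies on the unlabeled midline row instead, where a letter marks an identical column and a space or `+` marks a mismatch, gap, or conservative substitution. The function name `percent_identity` is hypothetical, and the input is assumed to be (query row, midline row) pairs with the `Query:` label and leading padding already stripped.

```python
def percent_identity(blocks):
    """Estimate percent identity from (query_row, midline_row) pairs.

    Assumption: each query_row includes gap characters ('-'), so its length
    equals the number of alignment columns in that block. A midline row may
    be shorter than the query row if trailing spaces were trimmed.
    """
    columns = 0
    identical = 0
    for query_row, midline_row in blocks:
        columns += len(query_row)
        # Letters in the midline mark identical columns; ' ' and '+' do not count.
        identical += sum(1 for ch in midline_row[:len(query_row)] if ch.isalpha())
    if columns == 0:
        raise ValueError("no alignment columns supplied")
    return 100.0 * identical / columns


# Short excerpt from the third block of the XP_022156544.1 alignment,
# which contains a single gap in the query.
excerpt = [("TAYREEIVK-QWRSEEEKEQ", "TAYREEIVK QWRSEEEKEQ")]
print(round(percent_identity(excerpt), 2))  # 95.0
```

Feeding every block of one hit into the function, rather than a short excerpt, would give an estimate to compare against the reported column.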


Sequences
CDS sequence
ATGGCGCTGGCGTCTTCTTCTGATTGGAGCCTCGTGGCGGCGAAATCTAGGTCGCTCATGCTCAAGGATTATCTTCGAGATGAAATCGGCTCTTCTTCCTCTAATGGCTT
CCGATCCTTTCCACGCCGCCAATGCTGCACCACCACCGTCCGATTTCTTCTCGAGATCGATCTCAAAGTGAAGGATTCTTCCGCACCTGCAAGATTCCTCTCTCGAACCG
CTTCCAGAAAACTCGCTCTCTCCACCATCTCCACTCTGCAAAAGGCGTCCGGCGCCGTCGTCAGAGCATTCAAGCAATTTTCCTTTCCTTCCCCCACAAATCGGCATCGG
AAGCCGTTTTTGCCGCGGAAACTGCTTCTCAGAGCCTTCTGGAAGAAAGCAGACACGGTTGATTCCTGTACCAGACGGTGGAAATCGTTTCAGGAATTTCTCGACGAGAA
AGAACCGCCGTCGCCTCGTCATCAGAATCGCTCCGACTCCGCCGCCTGCACCGCCATTGCCGTCGCCGGTAGAAACTCGATCAGTTCTTGCAGTAACAGCAACAGTTGGA
CGGAGAGCGAATTCGCTTCGGAGATGATACCGTCTTCCAGCGGTAATTCCGAGAGTTCCAGCGAAAACGACGCCGTCAAAGATGTTAAGGATTCGCCTGAAAATTTCATC
GGCAAAAGAGAAGGCGTAACGTTCGGAAAAGATTCCATGGAGGAAACAACCGCCGACGCCGCAACTACCACCGCCTATCGGGAGGAAATCGTTAAGCAATGGCGGAGTGA
GGAAGAGAAAGAACAGTTCAGTCCAGTTTCGGTGTTGGATTTTCCTTTCGATGACGAAGATCAACACATCTCCTCATCTTTCAACTGCAATCTTCACCTCATCGAAGGGA
AGAAGCAGAAGAATCCGCCGAAGACGAGGCGATTCGAGAACGGAGCCGAATTGGAACCACTGGACCTGAAGAAGCGATTCGCAGATTTAGGGCTTCGCCGCGATTTCGGC
TTGATATCAAAAACAGAGCACCAGAGGGAGGAGAGGGCGTTGGAAATTCTGAATCTCGTCAAATCGAGGATCAAATCGGAATGCTTCATAGTCAGAACGGAGAATCTGCT
GCTTGATTTCTTCCACGAGAAGCTCGAGGAGATCGAATCGACGGCGGATGAGAGCGGATGCCGAAGAGGAGGTGATTTTGAGTTTGAGTTTAAGGCAGAGGTTTTGAAAT
TGACGGAAGATTGGATCGACGGAGGAACAGCGTTGATGACGACGGGATGGGAGGTGGCGGAGGGGCGGAGTTTGTACGTTAATGAAATGGAGAAGGCCGGAAAGTGGAGA
AGTTTCGCCGGAGAAAAGGAAGAATTGGCGTCGGAGTTGGAAGCTGAGGTTTGGATTTCCTTGCTCCACGAGCTATTAATTGACCTCTGCTGC
mRNA sequence
ATGGCGCTGGCGTCTTCTTCTGATTGGAGCCTCGTGGCGGCGAAATCTAGGTCGCTCATGCTCAAGGATTATCTTCGAGATGAAATCGGCTCTTCTTCCTCTAATGGCTT
CCGATCCTTTCCACGCCGCCAATGCTGCACCACCACCGTCCGATTTCTTCTCGAGATCGATCTCAAAGTGAAGGATTCTTCCGCACCTGCAAGATTCCTCTCTCGAACCG
CTTCCAGAAAACTCGCTCTCTCCACCATCTCCACTCTGCAAAAGGCGTCCGGCGCCGTCGTCAGAGCATTCAAGCAATTTTCCTTTCCTTCCCCCACAAATCGGCATCGG
AAGCCGTTTTTGCCGCGGAAACTGCTTCTCAGAGCCTTCTGGAAGAAAGCAGACACGGTTGATTCCTGTACCAGACGGTGGAAATCGTTTCAGGAATTTCTCGACGAGAA
AGAACCGCCGTCGCCTCGTCATCAGAATCGCTCCGACTCCGCCGCCTGCACCGCCATTGCCGTCGCCGGTAGAAACTCGATCAGTTCTTGCAGTAACAGCAACAGTTGGA
CGGAGAGCGAATTCGCTTCGGAGATGATACCGTCTTCCAGCGGTAATTCCGAGAGTTCCAGCGAAAACGACGCCGTCAAAGATGTTAAGGATTCGCCTGAAAATTTCATC
GGCAAAAGAGAAGGCGTAACGTTCGGAAAAGATTCCATGGAGGAAACAACCGCCGACGCCGCAACTACCACCGCCTATCGGGAGGAAATCGTTAAGCAATGGCGGAGTGA
GGAAGAGAAAGAACAGTTCAGTCCAGTTTCGGTGTTGGATTTTCCTTTCGATGACGAAGATCAACACATCTCCTCATCTTTCAACTGCAATCTTCACCTCATCGAAGGGA
AGAAGCAGAAGAATCCGCCGAAGACGAGGCGATTCGAGAACGGAGCCGAATTGGAACCACTGGACCTGAAGAAGCGATTCGCAGATTTAGGGCTTCGCCGCGATTTCGGC
TTGATATCAAAAACAGAGCACCAGAGGGAGGAGAGGGCGTTGGAAATTCTGAATCTCGTCAAATCGAGGATCAAATCGGAATGCTTCATAGTCAGAACGGAGAATCTGCT
GCTTGATTTCTTCCACGAGAAGCTCGAGGAGATCGAATCGACGGCGGATGAGAGCGGATGCCGAAGAGGAGGTGATTTTGAGTTTGAGTTTAAGGCAGAGGTTTTGAAAT
TGACGGAAGATTGGATCGACGGAGGAACAGCGTTGATGACGACGGGATGGGAGGTGGCGGAGGGGCGGAGTTTGTACGTTAATGAAATGGAGAAGGCCGGAAAGTGGAGA
AGTTTCGCCGGAGAAAAGGAAGAATTGGCGTCGGAGTTGGAAGCTGAGGTTTGGATTTCCTTGCTCCACGAGCTATTAATTGACCTCTGCTGC
Protein sequence
MALASSSDWSLVAAKSRSLMLKDYLRDEIGSSSSNGFRSFPRRQCCTTTVRFLLEIDLKVKDSSAPARFLSRTASRKLALSTISTLQKASGAVVRAFKQFSFPSPTNRHR
KPFLPRKLLLRAFWKKADTVDSCTRRWKSFQEFLDEKEPPSPRHQNRSDSAACTAIAVAGRNSISSCSNSNSWTESEFASEMIPSSSGNSESSSENDAVKDVKDSPENFI
GKREGVTFGKDSMEETTADAATTTAYREEIVKQWRSEEEKEQFSPVSVLDFPFDDEDQHISSSFNCNLHLIEGKKQKNPPKTRRFENGAELEPLDLKKRFADLGLRRDFG
LISKTEHQREERALEILNLVKSRIKSECFIVRTENLLLDFFHEKLEEIESTADESGCRRGGDFEFEFKAEVLKLTEDWIDGGTALMTTGWEVAEGRSLYVNEMEKAGKWR
SFAGEKEELASELEAEVWISLLHELLIDLCC
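
The protein sequence above should correspond to a codon-by-codon translation of the CDS sequence. A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1) and an uppercase DNA input whose length is a multiple of three. The function name `translate` is hypothetical, and only the first and last codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, laid out in T, C, A, G order for each codon position.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence into one-letter amino acid codes."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Unknown codons (for example those containing N) are rendered as 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First 30 and last 15 nucleotides of the CDS listed in this record.
print(translate("ATGGCGCTGGCGTCTTCTTCTGATTGGAGC"))  # MALASSSDWS
print(translate("ATTGACCTCTGCTGC"))                 # IDLCC
```

Both fragments agree with the start (`MALASSSDWS`) and end (`IDLCC`) of the listed protein sequence. The listed CDS ends on a cysteine codon rather than a stop codon, so no trailing `*` is produced.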