CuGenDBv2

Sgr017743 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr017743
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Arabidopsis protein of unknown function (DUF241)
Genome location: tig00153055:482868..483598
RNA-Seq Expression: Sgr017743
Synteny: Sgr017743
Gene Ontology terms: NA
InterPro domains: IPR004320 - Protein of unknown function DUF241, plant


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6591230.1 hypothetical protein SDJN03_13576, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 2.4e-84; identity: 69.55%)
Query:  MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKL
        MVGVFRRSFSFPNKA  KP+LSHHVRSISLP R HPLIFQLKDEIANL++W  +S+ RTAAW+CDGLNRLKTVHNHLDDILNLPQTQESLRHQP W+DKL
Subjt:  MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKL

Query:  LEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGEAGFNGAKDQDGGTRRR-------------------RAELAAVIEEVVGVTMSVSLALFNGIA
        LEHFLRFVDVYGIFQTLIL++KEEHSAAQ    +++ E      K +    R+                      +L+A IEEV+GVTM+VSLALFNGIA
Subjt:  LEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGEAGFNGAKDQDGGTRRR-------------------RAELAAVIEEVVGVTMSVSLALFNGIA

Query:  ESFGTRKAWAWAGLERVSKKAKRSAEEEKGIREFREIGTENLR
        ESF TRK WAW G ++VSKK K+SAEEEKGIREFREIG+ENLR
Subjt:  ESFGTRKAWAWAGLERVSKKAKRSAEEEKGIREFREIGTENLR

KAG7024116.1 hypothetical protein SDJN02_12929, partial [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 8.2e-85; identity: 69.96%)
Query:  MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKL
        MVGVFRRSFSFPNKA  KP+LSHHVRSISLP R HPLIFQLKDEIANL++W  +S+ RTAAW+CDGLNRLKTVHNHLDDILNLPQTQESLRHQP W+DKL
Subjt:  MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKL

Query:  LEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGEAGFNGAKDQDGGTRRR-------------------RAELAAVIEEVVGVTMSVSLALFNGIA
        LEHFLRFVDVYGIFQTLIL++KEEHSAAQ    +++ E      K +    R+                      +LAA IEEV+GVTM+VSLALFNGIA
Subjt:  LEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGEAGFNGAKDQDGGTRRR-------------------RAELAAVIEEVVGVTMSVSLALFNGIA

Query:  ESFGTRKAWAWAGLERVSKKAKRSAEEEKGIREFREIGTENLR
        ESF TRK WAW G ++VSKK K+SAEEEKGIREFREIG+ENLR
Subjt:  ESFGTRKAWAWAGLERVSKKAKRSAEEEKGIREFREIGTENLR

KAG7037942.1 hypothetical protein SDJN02_01575, partial [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 4.5e-83; identity: 70.2%)
Query:  MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKL
        MVGVFRRSFSFPNK+PAKP+LSHHVRSISLPCRSHPLIFQLKDEIANLK+W  + D RTAAW+CDGLNRLKTVHNHLDD+LNLPQTQESLRHQP W++KL
Subjt:  MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKL

Query:  LEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGE--AGFNGA-------------------KDQDGGTRRRRAELAAVIEEVVGVTMSVSLALFNG
        LEHFL FVDVYGIFQTLIL +KEEHSAAQ    +++ E  A +  A                   K  + GT    A+LA+VIEEVVGVT +VSLAL NG
Subjt:  LEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGE--AGFNGA-------------------KDQDGGTRRRRAELAAVIEEVVGVTMSVSLALFNG

Query:  IAESFGTRKAWAWAGLERVSKKAKRSAEEEKGIREFREIGTENLR
        IAESF TRK W W GL+R+SKK   SAEEEKGIREFREIG+E LR
Subjt:  IAESFGTRKAWAWAGLERVSKKAKRSAEEEKGIREFREIGTENLR

XP_022936876.1 uncharacterized protein LOC111443334 [Cucurbita moschata] (e-value: 2.6e-83; identity: 69.14%)
Query:  MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKL
        MVGVFRRSFSFPNKA  KP+LSHHVRSISLP R HPLIF LKDEIANL++W  +S+ RTAAW+CDGLNRLKTVHNHLDDILNLPQTQESLRHQP W+DKL
Subjt:  MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKL

Query:  LEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGEAGFNGAKDQDGGTRRR-------------------RAELAAVIEEVVGVTMSVSLALFNGIA
        LEHFLRFVDVYGIFQTLIL++KEEHSAAQ    +++ E      K +    R+                      +LAA I EV+GVTM+VSLALFNGIA
Subjt:  LEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGEAGFNGAKDQDGGTRRR-------------------RAELAAVIEEVVGVTMSVSLALFNGIA

Query:  ESFGTRKAWAWAGLERVSKKAKRSAEEEKGIREFREIGTENLR
        ESF TRK WAW G ++VSKK K+SAEEEKGIREFREIG+ENLR
Subjt:  ESFGTRKAWAWAGLERVSKKAKRSAEEEKGIREFREIGTENLR

XP_038898709.1 uncharacterized protein LOC120086235 [Benincasa hispida] (e-value: 2.4e-84; identity: 69.8%)
Query:  MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKL
        MVGVFRRSFSFPNK PAKPSLSHHVRSISLPCRSHPLIFQLKDEIANL +W   SD RTAAW+CDGL+RLKTVHNHLDDILNLPQTQESLR    W+DKL
Subjt:  MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKL

Query:  LEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGEAGFNGAKDQDGGTRRR---------------------RAELAAVIEEVVGVTMSVSLALFNG
        LEHFLRFVDVYGIFQTLIL +KEEHSAAQ    + + E      K +    R+                        +LAAVIEEV+ VTM+VSLA+FNG
Subjt:  LEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGEAGFNGAKDQDGGTRRR---------------------RAELAAVIEEVVGVTMSVSLALFNG

Query:  IAESFGTRKAWAWAGLERVSKKAKRSAEEEKGIREFREIGTENLR
        IAESFGTRK W W GL+RVSKK K+SAEEEKGI+EFREIG+ENLR
Subjt:  IAESFGTRKAWAWAGLERVSKKAKRSAEEEKGIREFREIGTENLR

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3BUG7 uncharacterized protein LOC103493801 (e-value: 5.0e-80; identity: 65.46%)
Query:  MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKL
        MVGVFRRS SFPNK   KPSLSHHVRSISLPCRSHPLIFQLKD+IANL +W   SD RTAAW+C+GL+ LKTVHNHLDDILNLPQT+ESLRH P W+DKL
Subjt:  MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKL

Query:  LEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGE------------------------AGFNGAKDQDGGTRRRRAELAAVIEEVVGVTMSVSLAL
        LEHFLRFVDVYGIFQTLIL++KEEHSAAQ    +++ E                              K  + G     A+LAAVIEEV+GVTM+VSLAL
Subjt:  LEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGE------------------------AGFNGAKDQDGGTRRRRAELAAVIEEVVGVTMSVSLAL

Query:  FNGIAESFGTRKAWAWAGLERVSKKAKRSAE-EEKGIREFREIGTENLR
        FNGI+ESFGT+  W W  L+RV+KK K+SAE +EKGI+EFREIG+ENLR
Subjt:  FNGIAESFGTRKAWAWAGLERVSKKAKRSAE-EEKGIREFREIGTENLR

A0A5A7VG10 Uncharacterized protein (e-value: 5.0e-80; identity: 65.46%)
Query:  MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKL
        MVGVFRRS SFPNK   KPSLSHHVRSISLPCRSHPLIFQLKD+IANL +W   SD RTAAW+C+GL+ LKTVHNHLDDILNLPQT+ESLRH P W+DKL
Subjt:  MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKL

Query:  LEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGE------------------------AGFNGAKDQDGGTRRRRAELAAVIEEVVGVTMSVSLAL
        LEHFLRFVDVYGIFQTLIL++KEEHSAAQ    +++ E                              K  + G     A+LAAVIEEV+GVTM+VSLAL
Subjt:  LEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGE------------------------AGFNGAKDQDGGTRRRRAELAAVIEEVVGVTMSVSLAL

Query:  FNGIAESFGTRKAWAWAGLERVSKKAKRSAE-EEKGIREFREIGTENLR
        FNGI+ESFGT+  W W  L+RV+KK K+SAE +EKGI+EFREIG+ENLR
Subjt:  FNGIAESFGTRKAWAWAGLERVSKKAKRSAE-EEKGIREFREIGTENLR

A0A6J1F8P4 uncharacterized protein LOC111443334 (e-value: 1.3e-83; identity: 69.14%)
Query:  MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKL
        MVGVFRRSFSFPNKA  KP+LSHHVRSISLP R HPLIF LKDEIANL++W  +S+ RTAAW+CDGLNRLKTVHNHLDDILNLPQTQESLRHQP W+DKL
Subjt:  MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKL

Query:  LEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGEAGFNGAKDQDGGTRRR-------------------RAELAAVIEEVVGVTMSVSLALFNGIA
        LEHFLRFVDVYGIFQTLIL++KEEHSAAQ    +++ E      K +    R+                      +LAA I EV+GVTM+VSLALFNGIA
Subjt:  LEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGEAGFNGAKDQDGGTRRR-------------------RAELAAVIEEVVGVTMSVSLALFNGIA

Query:  ESFGTRKAWAWAGLERVSKKAKRSAEEEKGIREFREIGTENLR
        ESF TRK WAW G ++VSKK K+SAEEEKGIREFREIG+ENLR
Subjt:  ESFGTRKAWAWAGLERVSKKAKRSAEEEKGIREFREIGTENLR

A0A6J1FM80 uncharacterized protein LOC111446494 (e-value: 1.4e-82; identity: 69.39%)
Query:  MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKL
        MVGVFRRSFSFPNK+P KP+LSHHVRSISLPCRSHPLIFQLKDEIANLK+W  + D RTAAW+CDGLNRLKTVHNHLDD+LNLPQTQESLRHQP W++KL
Subjt:  MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKL

Query:  LEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGE--AGFNGA-------------------KDQDGGTRRRRAELAAVIEEVVGVTMSVSLALFNG
        LEHFL FVDVYGIFQTLIL +KEEHSAAQ    +++ E  A +  A                   K  + GT    A+LA+VIEEVVGVT +VSLAL NG
Subjt:  LEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGE--AGFNGA-------------------KDQDGGTRRRRAELAAVIEEVVGVTMSVSLALFNG

Query:  IAESFGTRKAWAWAGLERVSKKAKRSAEEEKGIREFREIGTENLR
        IAESF TRK W W GL+R+SKK   SAEEEKGIREFR+IG+E LR
Subjt:  IAESFGTRKAWAWAGLERVSKKAKRSAEEEKGIREFREIGTENLR

A0A6J1IZK2 uncharacterized protein LOC111481317 (e-value: 5.4e-82; identity: 68.98%)
Query:  MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKL
        MVGVFRRSFSFPNK+P KP+LSHHVRSISLPCRSHPLIFQLK+EIANL +W  + D RTAAW+CDGLNRLKTVHNHLDD+LNLPQTQESLRHQP W++KL
Subjt:  MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKL

Query:  LEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGE--AGFNGA-------------------KDQDGGTRRRRAELAAVIEEVVGVTMSVSLALFNG
        LEHFL FVDVYGIFQTLIL +KEEHSAAQ    +++ E  A +  A                   K  + GT    A+LA+VIEEVVGVT +VSLAL NG
Subjt:  LEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGE--AGFNGA-------------------KDQDGGTRRRRAELAAVIEEVVGVTMSVSLALFNG

Query:  IAESFGTRKAWAWAGLERVSKKAKRSAEEEKGIREFREIGTENLR
        I ESF TRK WAW GL+R+SKK   SAEEEKGIREFREIG+E LR
Subjt:  IAESFGTRKAWAWAGLERVSKKAKRSAEEEKGIREFREIGTENLR

SwissProt top hits
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G76240.1 Arabidopsis protein of unknown function (DUF241) (e-value: 1.0e-40; identity: 42.31%)
Query:  MVGVFRRSFSFPNK------APAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLS---ASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLR
        MVGVFRRS SFPNK        +KP +SHH RSISLPCRSHPLI  +  EI+ LK+W S    +  RT +W+ DGL+ LK V   L DIL LPQ+QESLR
Subjt:  MVGVFRRSFSFPNK------APAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLS---ASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLR

Query:  HQPRWVDKLLEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGEAGFNGAKD---------------QDGGTRRRR------------AELAAVIEE
        ++P + + LLE  LRFVD YGIF+T IL ++E  SAAQ    +++ E   +  K                ++  T+ +             AELA+VI +
Subjt:  HQPRWVDKLLEHFLRFVDVYGIFQTLILAMKEEHSAAQKETGQENGEAGFNGAKD---------------QDGGTRRRR------------AELAAVIEE

Query:  VVGVTMSVSLALFNGIAESFGTRKAWAWAG-LERVSKKAKRSAEEEKGIREFREIGTENL
        V+ VT+ VS+ALFNG+  S    K   + G L+R  KK K     ++GI E +++  ++L
Subjt:  VVGVTMSVSLALFNGIAESFGTRKAWAWAG-LERVSKKAKRSAEEEKGIREFREIGTENL

AT2G17070.1 Arabidopsis protein of unknown function (DUF241) (e-value: 6.6e-08; identity: 24.66%)
Query:  SLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQ--PRWVDKLLEHFLRFVDVYGIFQTL
        ++S HVRS S P   HP    + +++A L++    S   +++ +C  L+ L+ +H  LD ++ LP TQ++L  +   + V++LL+  L+ +DV  I +  
Subjt:  SLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQ--PRWVDKLLEHFLRFVDVYGIFQTL

Query:  ILAMKEE----HSAAQKETGQENGEAG--FNGAKDQDGGTRRRRAELAAVIEE--------VVGVTMSVSLALFNGI-AESFGTRKAWAWAGLERVSKKA
        +  MKE      S  +++ G  +GE        K      ++ +  L A   E        V G   +V++A+F+ + +   G++    W+ + ++  K 
Subjt:  ILAMKEE----HSAAQKETGQENGEAG--FNGAKDQDGGTRRRRAELAAVIEE--------VVGVTMSVSLALFNGI-AESFGTRKAWAWAGLERVSKKA

Query:  KRSAEEEKGIREFREIGTE
        K + E ++   EF ++ +E
Subjt:  KRSAEEEKGIREFREIGTE

AT2G17080.1 Arabidopsis protein of unknown function (DUF241) (e-value: 3.2e-10; identity: 24.66%)
Query:  SLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESL--RHQPRWVDKLLEHFLRFVDVYGIFQTL
        ++S HVRS S P RSHP    + +++A L++   AS   +++ +C  L+ L+ +H  LD +++ P TQ++L   H  + V++LL+  LR +D+  I +  
Subjt:  SLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESL--RHQPRWVDKLLEHFLRFVDVYGIFQTL

Query:  ILAMKEEHSAAQKETGQENGEAG------FNGAKDQDGGTRRRRAELAAVIEE--------VVGVTMSVSLALFNGIAESF-GTRKAWAWAGLERVSKKA
        +  MKE     Q    ++ G+            K      ++ +  L     E        V G   +++L+LF+ +     G++    W+ + ++  K 
Subjt:  ILAMKEEHSAAQKETGQENGEAG------FNGAKDQDGGTRRRRAELAAVIEE--------VVGVTMSVSLALFNGIAESF-GTRKAWAWAGLERVSKKA

Query:  KRSAEEEKGIREFREIGTE
        K + E ++   EF ++ +E
Subjt:  KRSAEEEKGIREFREIGTE

AT2G17680.1 Arabidopsis protein of unknown function (DUF241) (e-value: 8.9e-05; identity: 22.81%)
Query:  LSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESL-----------RHQPRWVDKLLEHFLRFVD
        + +HVRSISL  RSHP    +++ +      ++ S   ++  +  GL+ L+ +++  +D+L +  TQ  L           + +  +++++L+  LR +D
Subjt:  LSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESL-----------RHQPRWVDKLLEHFLRFVD

Query:  VYGIFQTLILAMKE
        +  + + L++   E
Subjt:  VYGIFQTLILAMKE


Sequences
CDS sequence
ATGGTCGGCGTTTTCCGGCGATCCTTCTCGTTTCCGAACAAGGCTCCGGCCAAGCCCTCTCTCTCTCATCACGTCCGCTCCATCAGTCTGCCCTGCAGATCGCACCCGTT
GATCTTCCAGCTCAAGGACGAGATTGCAAATCTCAAGACGTGGTTGTCTGCTTCCGATTGCCGGACCGCCGCCTGGCTCTGCGACGGCTTGAACCGCCTGAAAACCGTCC
ACAACCATCTCGACGACATTCTCAACCTCCCTCAGACGCAAGAGTCTCTCCGCCACCAGCCGCGGTGGGTCGATAAGCTCCTCGAGCATTTCTTGCGCTTCGTCGACGTC
TACGGAATCTTCCAGACGTTGATATTGGCGATGAAAGAAGAGCACTCGGCGGCACAGAAAGAGACTGGCCAGGAAAATGGCGAAGCTGGTTTCAACGGTGCAAAAGATCA
GGACGGCGGAACAAGGCGGCGGCGGGCCGAGCTGGCCGCCGTGATCGAAGAAGTCGTCGGAGTGACCATGTCGGTGTCTCTCGCACTGTTCAACGGAATAGCAGAATCAT
TCGGGACGAGAAAGGCATGGGCGTGGGCAGGATTGGAACGAGTATCGAAGAAGGCTAAGAGATCGGCGGAGGAGGAGAAGGGGATTCGAGAGTTCAGAGAAATCGGGACG
GAGAATTTGAGGAACTGA
mRNA sequence
ATGGTCGGCGTTTTCCGGCGATCCTTCTCGTTTCCGAACAAGGCTCCGGCCAAGCCCTCTCTCTCTCATCACGTCCGCTCCATCAGTCTGCCCTGCAGATCGCACCCGTT
GATCTTCCAGCTCAAGGACGAGATTGCAAATCTCAAGACGTGGTTGTCTGCTTCCGATTGCCGGACCGCCGCCTGGCTCTGCGACGGCTTGAACCGCCTGAAAACCGTCC
ACAACCATCTCGACGACATTCTCAACCTCCCTCAGACGCAAGAGTCTCTCCGCCACCAGCCGCGGTGGGTCGATAAGCTCCTCGAGCATTTCTTGCGCTTCGTCGACGTC
TACGGAATCTTCCAGACGTTGATATTGGCGATGAAAGAAGAGCACTCGGCGGCACAGAAAGAGACTGGCCAGGAAAATGGCGAAGCTGGTTTCAACGGTGCAAAAGATCA
GGACGGCGGAACAAGGCGGCGGCGGGCCGAGCTGGCCGCCGTGATCGAAGAAGTCGTCGGAGTGACCATGTCGGTGTCTCTCGCACTGTTCAACGGAATAGCAGAATCAT
TCGGGACGAGAAAGGCATGGGCGTGGGCAGGATTGGAACGAGTATCGAAGAAGGCTAAGAGATCGGCGGAGGAGGAGAAGGGGATTCGAGAGTTCAGAGAAATCGGGACG
GAGAATTTGAGGAACTGA
Protein sequence
MVGVFRRSFSFPNKAPAKPSLSHHVRSISLPCRSHPLIFQLKDEIANLKTWLSASDCRTAAWLCDGLNRLKTVHNHLDDILNLPQTQESLRHQPRWVDKLLEHFLRFVDV
YGIFQTLILAMKEEHSAAQKETGQENGEAGFNGAKDQDGGTRRRRAELAAVIEEVVGVTMSVSLALFNGIAESFGTRKAWAWAGLERVSKKAKRSAEEEKGIREFREIGT
ENLRN
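
As a quick consistency check between the CDS and protein records above, the sketch below translates a coding sequence using the standard genetic code (NCBI translation table 1). The function name `translate` is hypothetical, and applying the standard table to this nuclear plant gene is an assumption; none of this code comes from CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped CDS lines can be
    pasted in directly. Codons with non-ACGT characters become "X".
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First and last 18 bases of the Sgr017743 CDS listed above.
    print(translate("ATGGTCGGCGTTTTCCGG"))
    print(translate("GAGAATTTGAGGAACTGA"))
```

Passing the first 18 bases of the CDS gives MVGVFR, and the last 18 bases give ENLRN followed by the TGA stop codon, which agrees with the start and end of the protein sequence listed above.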