; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10000800 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10000800
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function (DUF506)
Genome locationChr09:9611316..9618947
RNA-Seq ExpressionHG10000800
SyntenyHG10000800
Gene Ontology termsNA
InterPro domainsIPR006502 - Protein of unknown function PDDEXK-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7035272.1 hypothetical protein SDJN02_02067, partial [Cucurbita argyrosperma subsp. argyrosperma]3.8e-6372.54Show/hide
Query:  MGSLEEEELVQMVDDFIESADQTPSSCSFANSSNSLPLNSTKSHYFFSLKEILG-SGIKAEGEVGEIVMKHLRSWKIDAPKTSSLKKWLVMKLKMDGYDS
        MGS EEE LVQM+DDFIES +   SS S ++SSN LPL S  +HYFF+LKEILG +G  AE EV E VMKH+R  KIDAPKT+ +KKWLVMKLKMDGY S
Subjt:  MGSLEEEELVQMVDDFIESADQTPSSCSFANSSNSLPLNSTKSHYFFSLKEILG-SGIKAEGEVGEIVMKHLRSWKIDAPKTSSLKKWLVMKLKMDGYDS

Query:  SDLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEKEL
         +LCHTSWVTS+GCP GDYEYIEMK +      KR+IIDI+FKAQFEVARATE YKQLT+ALPSVFVGSEEKV +IIS+LCSAAKQSLKE  L
Subjt:  SDLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEKEL

XP_004147351.1 uncharacterized protein LOC101214990 [Cucumis sativus]1.0e-6875.88Show/hide
Query:  MGSLEEEELVQMVDDFIESAD---QTPSSCSFANSSNSLPLNS-TKSHYFFSLKEILGSGIKAEGEVGEIVMKHLRSWKID--APKTSSLKKWLVMKLKM
        M SLEEE+LVQMVDDFIES D   Q+P+S SF       PL+S +KSH+FF+LKEILG+G K E EVGE VMKHLR WK    + KT+SL+KWLVMKLKM
Subjt:  MGSLEEEELVQMVDDFIESAD---QTPSSCSFANSSNSLPLNS-TKSHYFFSLKEILGSGIKAEGEVGEIVMKHLRSWKID--APKTSSLKKWLVMKLKM

Query:  DGYDSSDLCHTSWVTSMGCPAGDYEYIEMKMK-DEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEKEL
        DGYDSS LCHTSWVTSMGCPAGDYEYIEM+ K DE GS KRLIIDIEFKAQFEVARATE+YKQLT+ALP+VFVGSEEKVKRIISVLCSAAKQSL++  L
Subjt:  DGYDSSDLCHTSWVTSMGCPAGDYEYIEMKMK-DEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEKEL

XP_022143594.1 uncharacterized protein LOC111013453 [Momordica charantia]1.6e-6666.52Show/hide
Query:  MGSLEEEELVQMVDDFIESADQTPSSCSFANSSNSLPLNSTKSHYFFSLKEILGSGIKAEGEVGEIVMKHLRSWKIDAPKTSSLKKWLVMKLKMDGYDSS
        MGSLEEE+LVQMVDDFIES  +TP+SCS + SSNSL   ++K+H+ FSLKEILGSG +AEGEV E V KHLR  K+++PKT+SLKKWLVMKL+MDGYDS+
Subjt:  MGSLEEEELVQMVDDFIESADQTPSSCSFANSSNSLPLNSTKSHYFFSLKEILGSGIKAEGEVGEIVMKHLRSWKIDAPKTSSLKKWLVMKLKMDGYDSS

Query:  DLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEKEL-----RKV
        DLCHTSWVTS+GCPAG+YEYIE K++DE+G  KR+IIDIEFKAQFEVAR T  YKQLTEALP+VFVG+EE V RII++LCSAAKQSL+E  L     R  
Subjt:  DLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEKEL-----RKV

Query:  RKKHRKSKSTNDEDDSTNDGT
             K K    E++    G+
Subjt:  RKKHRKSKSTNDEDDSTNDGT

XP_023007236.1 uncharacterized protein LOC111499781 [Cucurbita maxima]3.4e-6473.58Show/hide
Query:  MGSLEEEELVQMVDDFIESADQTPSSCSFANSSNSLPLNSTKSHYFFSLKEILG-SGIKAEGEVGEIVMKHLRSWKIDAPKTSSLKKWLVMKLKMDGYDS
        MGSLEEE LVQM+DDF+ES +   S  S ++SSN LPL S  +HYFF+LKEILG SG  AE EV E VMKH+R  K DAPKT+ LKKWLVMKLKMDGY S
Subjt:  MGSLEEEELVQMVDDFIESADQTPSSCSFANSSNSLPLNSTKSHYFFSLKEILG-SGIKAEGEVGEIVMKHLRSWKIDAPKTSSLKKWLVMKLKMDGYDS

Query:  SDLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEKEL
         DLCHTSWVTSMGCP GDYEYIEMK++      KR+IIDI+FKAQFEVARATE YKQLT+ALPSVFVGSEEKV +IIS+LCSAAKQSLKE  L
Subjt:  SDLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEKEL

XP_038900827.1 uncharacterized protein LOC120087891 [Benincasa hispida]1.1e-8678.95Show/hide
Query:  MGSLEEEELVQMVDDFIESADQTPSSCSFANSSNSLPLNSTKSHYFFSLKEILGSGIKAEGEVGEIVMKHLRSWKIDAPK-TSSLKKWLVMKLKMDGYDS
        M SLEEE+LVQMVDDFIESAD TPSSCSFANSSNSLPLNS KSHYF SLKEILGSGIKAEGEVGE VMKHLRSWK D+PK TSSLKKWLVMKLKMDGYDS
Subjt:  MGSLEEEELVQMVDDFIESADQTPSSCSFANSSNSLPLNSTKSHYFFSLKEILGSGIKAEGEVGEIVMKHLRSWKIDAPK-TSSLKKWLVMKLKMDGYDS

Query:  SDLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEKEL-------
        SDLCHTSWVTSMGCPAGDYEYIEMK+KDEYGS KR+IIDIEFKAQFEVARATE+YKQLTEALP+VFVGSEE+VKRIISVLCSAAKQSLKE  L       
Subjt:  SDLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEKEL-------

Query:  ------RKVRKKHRKSKSTNDEDDSTND
              + + + H  S + N+  ++ N+
Subjt:  ------RKVRKKHRKSKSTNDEDDSTND

TrEMBL top hitse value%identityAlignment
A0A0A0LJS3 Uncharacterized protein5.0e-6975.88Show/hide
Query:  MGSLEEEELVQMVDDFIESAD---QTPSSCSFANSSNSLPLNS-TKSHYFFSLKEILGSGIKAEGEVGEIVMKHLRSWKID--APKTSSLKKWLVMKLKM
        M SLEEE+LVQMVDDFIES D   Q+P+S SF       PL+S +KSH+FF+LKEILG+G K E EVGE VMKHLR WK    + KT+SL+KWLVMKLKM
Subjt:  MGSLEEEELVQMVDDFIESAD---QTPSSCSFANSSNSLPLNS-TKSHYFFSLKEILGSGIKAEGEVGEIVMKHLRSWKID--APKTSSLKKWLVMKLKM

Query:  DGYDSSDLCHTSWVTSMGCPAGDYEYIEMKMK-DEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEKEL
        DGYDSS LCHTSWVTSMGCPAGDYEYIEM+ K DE GS KRLIIDIEFKAQFEVARATE+YKQLT+ALP+VFVGSEEKVKRIISVLCSAAKQSL++  L
Subjt:  DGYDSSDLCHTSWVTSMGCPAGDYEYIEMKMK-DEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEKEL

A0A1S3CDH7 uncharacterized protein LOC1034996454.5e-6261.81Show/hide
Query:  MGSLEEEELVQMVDDFIESAD----QTPSSCSFANSSNSLPLNSTKSHYFFSLKEILGSGIKAEGEVGEIVMKHLRSWK----IDAPKTSSLKKWLVMKL
        MGSLEEE+L QMVDDFIES D    Q+P+S SF  SSNS       SHY F+LKEILG+G K E EVGE VMKHLR WK     ++ KT+SL+KWLVMKL
Subjt:  MGSLEEEELVQMVDDFIESAD----QTPSSCSFANSSNSLPLNSTKSHYFFSLKEILGSGIKAEGEVGEIVMKHLRSWK----IDAPKTSSLKKWLVMKL

Query:  KMDGYDSSDLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEK--
        KM+GYDSS L HTSWVTSMGCPAGDYEYIEM+MK E     RLIIDIEFKAQFEVARATE+YKQLT+ALPSVFVGSEEKVKRIISVLCSAAKQSLK+   
Subjt:  KMDGYDSSDLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEK--

Query:  ------------------ELRKVRKKHRKSKSTNDEDDSTNDGTTSSDGGGNRN
                           L      H     TND + + N  T +S+   N N
Subjt:  ------------------ELRKVRKKHRKSKSTNDEDDSTNDGTTSSDGGGNRN

A0A6J1CPS4 uncharacterized protein LOC1110134538.0e-6766.52Show/hide
Query:  MGSLEEEELVQMVDDFIESADQTPSSCSFANSSNSLPLNSTKSHYFFSLKEILGSGIKAEGEVGEIVMKHLRSWKIDAPKTSSLKKWLVMKLKMDGYDSS
        MGSLEEE+LVQMVDDFIES  +TP+SCS + SSNSL   ++K+H+ FSLKEILGSG +AEGEV E V KHLR  K+++PKT+SLKKWLVMKL+MDGYDS+
Subjt:  MGSLEEEELVQMVDDFIESADQTPSSCSFANSSNSLPLNSTKSHYFFSLKEILGSGIKAEGEVGEIVMKHLRSWKIDAPKTSSLKKWLVMKLKMDGYDSS

Query:  DLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEKEL-----RKV
        DLCHTSWVTS+GCPAG+YEYIE K++DE+G  KR+IIDIEFKAQFEVAR T  YKQLTEALP+VFVG+EE V RII++LCSAAKQSL+E  L     R  
Subjt:  DLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEKEL-----RKV

Query:  RKKHRKSKSTNDEDDSTNDGT
             K K    E++    G+
Subjt:  RKKHRKSKSTNDEDDSTNDGT

A0A6J1G7H8 uncharacterized protein LOC111451460 isoform X13.5e-6273.58Show/hide
Query:  MGSLEEEELVQMVDDFIESADQTPSSCSFANSSNSLPLNSTKSHYFFSLKEILG-SGIKAEGEVGEIVMKHLRSWKIDAPKTSSLKKWLVMKLKMDGYDS
        MGS EEE LVQM+DDFIES  +TP S S ++SS+ LPL S  +HYFF+LKEILG +G  AE EV E VMKH+R  KIDAPKT+ LKKWLVMKLKMDGY S
Subjt:  MGSLEEEELVQMVDDFIESADQTPSSCSFANSSNSLPLNSTKSHYFFSLKEILG-SGIKAEGEVGEIVMKHLRSWKIDAPKTSSLKKWLVMKLKMDGYDS

Query:  SDLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEKEL
         DLCH+SWVTSMGCP GDYEYIEMK +      KR+IIDI+FKAQFEVARATE YKQLT+ALPSVFVGSEEKV +IIS+LCSAAKQSLKE  L
Subjt:  SDLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEKEL

A0A6J1KZZ7 uncharacterized protein LOC1114997811.7e-6473.58Show/hide
Query:  MGSLEEEELVQMVDDFIESADQTPSSCSFANSSNSLPLNSTKSHYFFSLKEILG-SGIKAEGEVGEIVMKHLRSWKIDAPKTSSLKKWLVMKLKMDGYDS
        MGSLEEE LVQM+DDF+ES +   S  S ++SSN LPL S  +HYFF+LKEILG SG  AE EV E VMKH+R  K DAPKT+ LKKWLVMKLKMDGY S
Subjt:  MGSLEEEELVQMVDDFIESADQTPSSCSFANSSNSLPLNSTKSHYFFSLKEILG-SGIKAEGEVGEIVMKHLRSWKIDAPKTSSLKKWLVMKLKMDGYDS

Query:  SDLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEKEL
         DLCHTSWVTSMGCP GDYEYIEMK++      KR+IIDI+FKAQFEVARATE YKQLT+ALPSVFVGSEEKV +IIS+LCSAAKQSLKE  L
Subjt:  SDLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEKEL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G77145.1 Protein of unknown function (DUF506)1.6e-2236.97Show/hide
Query:  MGSLEEEELVQMVDDFIESADQTPSSCSFANSSNSLPLNSTKSHYFFSLKEILGSGIKAEGEVGEIVMKHLR----SWKIDAPKTSSLKKWLVMKLKMDG
        MGSL EE+L ++V D+IES    P + S    S++  L         +LKEIL +  + E E+ E +   +     S++ D  K   + K +V KL+ +G
Subjt:  MGSLEEEELVQMVDDFIESADQTPSSCSFANSSNSLPLNSTKSHYFFSLKEILGSGIKAEGEVGEIVMKHLR----SWKIDAPKTSSLKKWLVMKLKMDG

Query:  YDSSDLCHTSWVTSM----GCP----AGDYEYIEMKMK-----DEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAA
        YD+S L  TSW +S     GC     +  YEYI++ +K     D    +KR+IID++FK QFE+AR TE YK +TE LP VFV +E +++R++S++C   
Subjt:  YDSSDLCHTSWVTSM----GCP----AGDYEYIEMKMK-----DEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAA

Query:  KQSLKEKELRK
        K+S+K++ + +
Subjt:  KQSLKEKELRK

AT1G77160.1 Protein of unknown function (DUF506)9.7e-1731.15Show/hide
Query:  MGSLEEEELVQMVDDFIESADQTPSSCSFANSSNSLPLNSTKSHYFFSLKEILGSGIKAEGEVGEIVMKHLR----SWKIDAPKTSSLKKWLVMKLKMDG
        MGSL EE+  ++V  +IES    P + S   +++S  L + +  +   L+EIL +    E E+ E +  ++     S++ D  K   + K +V KL+ +G
Subjt:  MGSLEEEELVQMVDDFIESADQTPSSCSFANSSNSLPLNSTKSHYFFSLKEILGSGIKAEGEVGEIVMKHLR----SWKIDAPKTSSLKKWLVMKLKMDG

Query:  YDSSDLCHTSWVTSM----GCP----AGDYEYIEMKM-----KDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAA
        Y++S L  TSW +S     GC     +  YEYI+  +     +D    +KR+IID++FK QFE+AR TE YK +TE LP+VFV +E +++R++S++C   
Subjt:  YDSSDLCHTSWVTSM----GCP----AGDYEYIEMKM-----KDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAA

Query:  KQSLKEKEL--------RKVRKKHRKSKSTNDEDDSTNDGTTSSDGGGNRNVGTSNSYQV
        K+S+K++ +        R ++ K        D        + S    G   +G + S+ V
Subjt:  KQSLKEKEL--------RKVRKKHRKSKSTNDEDDSTNDGTTSSDGGGNRNVGTSNSYQV

AT2G38820.1 Protein of unknown function (DUF506)4.7e-1941.61Show/hide
Query:  VMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLK
        V K+    YD++ LC + W  S  CPAG+YEY+++ MK E     RL+IDI+FK++FE+ARAT+ YK + + LP +FVG  +++++II ++C AAKQSLK
Subjt:  VMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLK

Query:  EKELRKV---RKKHRKSK--STNDEDDSTNDGTTSSD
        +K L      R ++ KSK  S++   D  ++G    +
Subjt:  EKELRKV---RKKHRKSK--STNDEDDSTNDGTTSSD

AT2G38820.2 Protein of unknown function (DUF506)4.7e-1941.1Show/hide
Query:  KTSSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVL
        K  S  K +   L   GYD++ LC + W  S  CPAG+YEY+++ MK E     RL+IDI+FK++FE+ARAT+ YK + + LP +FVG  +++++II ++
Subjt:  KTSSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVL

Query:  CSAAKQSLKEKELRKV---RKKHRKSK--STNDEDDSTNDGTTSSD
        C AAKQSLK+K L      R ++ KSK  S++   D  ++G    +
Subjt:  CSAAKQSLKEKELRKV---RKKHRKSK--STNDEDDSTNDGTTSSD

AT4G14620.1 Protein of unknown function (DUF506)2.5e-2032.65Show/hide
Query:  GSLEEEELVQMVDDFIESAD--QTPSS-----CSFANSSNSLPLNSTKSHYFFSLKEILGSGIKAEGEVGEIVMKHLRSWKIDAPKTSSLKKWLVMKLKM
        G+  E  L +MV +++E  +  QT +      C+  N +N +  +      + + K ++  G   E  +     K +   K    +   L+K +V +L  
Subjt:  GSLEEEELVQMVDDFIESAD--QTPSS-----CSFANSSNSLPLNSTKSHYFFSLKEILGSGIKAEGEVGEIVMKHLRSWKIDAPKTSSLKKWLVMKLKM

Query:  DGYDSSDLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEK
         GYDSS +C + W  +   PAG+YEYI++ +  E     RLIIDI+F+++FE+AR T  YK+L ++LP +FVG  +++++I+S++  A+KQSLK+K
Subjt:  DGYDSSDLCHTSWVTSMGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGTTTGGAGGAGGAAGAATTAGTTCAAATGGTTGATGATTTTATTGAATCAGCTGATCAAACACCAAGTTCTTGTTCTTTTGCTAATTCTTCAAATTCTCTTCC
TCTCAATTCCACCAAATCCCACTATTTTTTCAGCTTAAAGGAGATTCTTGGGAGTGGAATAAAAGCAGAAGGAGAAGTGGGTGAGATAGTGATGAAACACTTGAGAAGCT
GGAAAATAGATGCTCCAAAAACCAGCAGCCTCAAGAAATGGCTTGTGATGAAGCTCAAAATGGACGGCTATGATTCTTCTGATCTCTGTCACACTTCTTGGGTCACTTCC
ATGGGATGCCCAGCCGGGGATTATGAGTACATAGAGATGAAAATGAAGGATGAGTATGGGAGTATAAAGAGGTTGATAATAGACATAGAATTCAAAGCTCAATTTGAAGT
AGCAAGAGCAACAGAAAAGTACAAGCAGCTTACAGAAGCACTTCCATCAGTGTTTGTAGGAAGTGAAGAGAAAGTTAAGAGAATAATCTCAGTTTTATGTTCAGCAGCCA
AACAGTCCCTTAAGGAAAAAGAGCTTAGAAAAGTGAGAAAAAAACATCGTAAATCTAAGTCAACCAATGACGAAGATGACAGTACAAACGATGGGACTACATCTAGTGAT
GGAGGCGGTAATAGAAATGTTGGAACTTCAAACAGTTACCAAGTTAGAGAGGGACAATCTAACCAACAAGGATTAAGTCCTTTGACATGTGAGGTTGGATTTGACCATGT
CACCCAAGACGAAGACCACGGATCTCGGGTGGGTGGCGAAGGAATTGCTGCCATTGGTAAACCATTTTATCGACAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGTTTGGAGGAGGAAGAATTAGTTCAAATGGTTGATGATTTTATTGAATCAGCTGATCAAACACCAAGTTCTTGTTCTTTTGCTAATTCTTCAAATTCTCTTCC
TCTCAATTCCACCAAATCCCACTATTTTTTCAGCTTAAAGGAGATTCTTGGGAGTGGAATAAAAGCAGAAGGAGAAGTGGGTGAGATAGTGATGAAACACTTGAGAAGCT
GGAAAATAGATGCTCCAAAAACCAGCAGCCTCAAGAAATGGCTTGTGATGAAGCTCAAAATGGACGGCTATGATTCTTCTGATCTCTGTCACACTTCTTGGGTCACTTCC
ATGGGATGCCCAGCCGGGGATTATGAGTACATAGAGATGAAAATGAAGGATGAGTATGGGAGTATAAAGAGGTTGATAATAGACATAGAATTCAAAGCTCAATTTGAAGT
AGCAAGAGCAACAGAAAAGTACAAGCAGCTTACAGAAGCACTTCCATCAGTGTTTGTAGGAAGTGAAGAGAAAGTTAAGAGAATAATCTCAGTTTTATGTTCAGCAGCCA
AACAGTCCCTTAAGGAAAAAGAGCTTAGAAAAGTGAGAAAAAAACATCGTAAATCTAAGTCAACCAATGACGAAGATGACAGTACAAACGATGGGACTACATCTAGTGAT
GGAGGCGGTAATAGAAATGTTGGAACTTCAAACAGTTACCAAGTTAGAGAGGGACAATCTAACCAACAAGGATTAAGTCCTTTGACATGTGAGGTTGGATTTGACCATGT
CACCCAAGACGAAGACCACGGATCTCGGGTGGGTGGCGAAGGAATTGCTGCCATTGGTAAACCATTTTATCGACAATGA
Protein sequenceShow/hide protein sequence
MGSLEEEELVQMVDDFIESADQTPSSCSFANSSNSLPLNSTKSHYFFSLKEILGSGIKAEGEVGEIVMKHLRSWKIDAPKTSSLKKWLVMKLKMDGYDSSDLCHTSWVTS
MGCPAGDYEYIEMKMKDEYGSIKRLIIDIEFKAQFEVARATEKYKQLTEALPSVFVGSEEKVKRIISVLCSAAKQSLKEKELRKVRKKHRKSKSTNDEDDSTNDGTTSSD
GGGNRNVGTSNSYQVREGQSNQQGLSPLTCEVGFDHVTQDEDHGSRVGGEGIAAIGKPFYRQ