CuGenDBv2

Lag0037588 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0037588
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: protein HAPLESS 2 isoform X1
Genome location: chr2:7502936..7505150
RNA-Seq Expression: Lag0037588
Synteny: Lag0037588
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
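
The genome location field can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); the function name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location: str):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1.
    """
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr2:7502936..7505150")
print(chrom, start, end, span)
```

Under the inclusive assumption the gene spans 2215 bp on chr2; with a half-open convention the span would be one base shorter.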


Homology
GenBank top hits (e value, % identity, alignment)
KAG6608041.1 Protein HAPLESS 2, partial [Cucurbita argyrosperma subsp. sororia] (e value: 6.1e-87; % identity: 74.9)
Query:  MLNLHDVMGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWT
        MLNL D MGNVASSLASG F A+ KVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCV+NLLKMGMV+IL+YFVLLLLYL HKIGIFGCIGRGLCRMIWT
Subjt:  MLNLHDVMGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWT

Query:  CLSSYCYAWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGE-KHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSG
        CL+SYC+AWEYCC+FMC KLA++KRTRRR  RRRDLEEE ES+ E KHR+ SSSDSSN  E IESR SK+ S +RRR+HRGSQ  K LR +SHGIRVRSG
Subjt:  CLSSYCYAWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGE-KHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSG

Query:  RVSVYGKHRRKSTEVGSHLNEIHSHHGSSKFVHKERKYRRGSQ
        RV VY K                  HGSSK V KERKYRRG Q
Subjt:  RVSVYGKHRRKSTEVGSHLNEIHSHHGSSKFVHKERKYRRGSQ

XP_004142706.1 protein HAPLESS 2 isoform X1 [Cucumis sativus] (e value: 1.0e-94; % identity: 79.83)
Query:  MGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYCY
        MGNVASSLAS  FSAIGK+FGSPLDFLSG+SCSSVCGSTWDFICYIENFCV+NLLKMGMV ILSYFVLLLLYLLHKIGIF CIGRGLCRMIWTCL+SY Y
Subjt:  MGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYCY

Query:  AWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGEKHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSGRVSVYGKH
        AWEYCC FMC KLA++KRTRRRH RRRD+EEEFE +  K RH S+SDS+NV EH+ES+SS++ S R RRNHR SQ RK+LR K HG+RVRSGRV VYGKH
Subjt:  AWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGEKHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSGRVSVYGKH

Query:  RRKSTEVGSHLNEIHSH--HGSSKFVHKERKYRRGSQR
        RRKS EVG+HLNEI S   +GSSK+VHKERKYRRG  R
Subjt:  RRKSTEVGSHLNEIHSH--HGSSKFVHKERKYRRGSQR

XP_008444189.1 PREDICTED: protein HAPLESS 2 isoform X1 [Cucumis melo] (e value: 1.1e-96; % identity: 81.51)
Query:  MGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYCY
        MGNVASSLAS  FSAIGK+FGSPLDFLSG+SCSSVCGSTWDFICYIENFCV+NL+KMGMV ILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCL+SY Y
Subjt:  MGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYCY

Query:  AWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGEKHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSGRVSVYGKH
        AWEYCCSFMC KLA++KRTRRRH RRRDLEEEFES+  K +H S+SDSSNV EH+ESRSS+ +S R RRNH+ SQ RK+LR K HG+RVRSGRV VYGKH
Subjt:  AWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGEKHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSGRVSVYGKH

Query:  RRKSTEVGSHLNEIHSH--HGSSKFVHKERKYRRGSQR
        RRKS EVG+H NEI S   +GSSKFVHKERKYRRG QR
Subjt:  RRKSTEVGSHLNEIHSH--HGSSKFVHKERKYRRGSQR

XP_022154011.1 uncharacterized protein LOC111021352 [Momordica charantia] (e value: 4.7e-95; % identity: 77.96)
Query:  MLNLHDVMGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWT
        M NL D+MGNVASS+ASGFFSA+GK+F SPLDFLSGKSCSSVCGSTWDFICYIENFCV+NLLK+GMVLILS FV+LLLYLLHKIGIFGCI RGLCRM WT
Subjt:  MLNLHDVMGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWT

Query:  CLSSYCYAWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGEKHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSGR
        C++SY YAW+YCC+FMC KL ++KRTRRR  RRRDLEEEFES+G KHR+ SSSDSS+VPE IE RSS++AS R R NHRGSQMRKALR KS GIRVRSGR
Subjt:  CLSSYCYAWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGEKHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSGR

Query:  VSVYGKHRRKSTEVGSHLNEIHS--HHGSSKFVHKERKYRRGSQR
          VYGKHRRKS+EV + L EIHS   HGSSKFVH+E +Y+RG Q+
Subjt:  VSVYGKHRRKSTEVGSHLNEIHS--HHGSSKFVHKERKYRRGSQR

XP_038897053.1 uncharacterized protein LOC120085227 isoform X1 [Benincasa hispida] (e value: 2.4e-99; % identity: 83.19)
Query:  MGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYCY
        MGNVASSL SG FSAIGKVFGSPLDFLSG+SCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLL L LLHKIGIFGCIGRGLC+MIWTCL+SY Y
Subjt:  MGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYCY

Query:  AWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGEKHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSGRVSVYGKH
        AWEYCC+FMC KLA++KRTRRRH RRRDLEEEFES+  K +H S+SDSSNVPEH+ESRSS++A  R RRNHR S+MRK+LR + HGIRVRSGRV VYGKH
Subjt:  AWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGEKHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSGRVSVYGKH

Query:  RRKSTEVGSHLNEIHSH--HGSSKFVHKERKYRRGSQR
        RRKS EVG+HLNEIHS   +GSSKFVHKERKYRR SQR
Subjt:  RRKSTEVGSHLNEIHSH--HGSSKFVHKERKYRRGSQR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L1V0 Uncharacterized protein (e value: 5.0e-95; % identity: 79.83)
Query:  MGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYCY
        MGNVASSLAS  FSAIGK+FGSPLDFLSG+SCSSVCGSTWDFICYIENFCV+NLLKMGMV ILSYFVLLLLYLLHKIGIF CIGRGLCRMIWTCL+SY Y
Subjt:  MGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYCY

Query:  AWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGEKHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSGRVSVYGKH
        AWEYCC FMC KLA++KRTRRRH RRRD+EEEFE +  K RH S+SDS+NV EH+ES+SS++ S R RRNHR SQ RK+LR K HG+RVRSGRV VYGKH
Subjt:  AWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGEKHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSGRVSVYGKH

Query:  RRKSTEVGSHLNEIHSH--HGSSKFVHKERKYRRGSQR
        RRKS EVG+HLNEI S   +GSSK+VHKERKYRRG  R
Subjt:  RRKSTEVGSHLNEIHSH--HGSSKFVHKERKYRRGSQR

A0A1S4DVV8 protein HAPLESS 2 isoform X1 (e value: 5.4e-97; % identity: 81.51)
Query:  MGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYCY
        MGNVASSLAS  FSAIGK+FGSPLDFLSG+SCSSVCGSTWDFICYIENFCV+NL+KMGMV ILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCL+SY Y
Subjt:  MGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYCY

Query:  AWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGEKHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSGRVSVYGKH
        AWEYCCSFMC KLA++KRTRRRH RRRDLEEEFES+  K +H S+SDSSNV EH+ESRSS+ +S R RRNH+ SQ RK+LR K HG+RVRSGRV VYGKH
Subjt:  AWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGEKHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSGRVSVYGKH

Query:  RRKSTEVGSHLNEIHSH--HGSSKFVHKERKYRRGSQR
        RRKS EVG+H NEI S   +GSSKFVHKERKYRRG QR
Subjt:  RRKSTEVGSHLNEIHSH--HGSSKFVHKERKYRRGSQR

A0A6J1DMF3 uncharacterized protein LOC111021352 (e value: 2.3e-95; % identity: 77.96)
Query:  MLNLHDVMGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWT
        M NL D+MGNVASS+ASGFFSA+GK+F SPLDFLSGKSCSSVCGSTWDFICYIENFCV+NLLK+GMVLILS FV+LLLYLLHKIGIFGCI RGLCRM WT
Subjt:  MLNLHDVMGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWT

Query:  CLSSYCYAWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGEKHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSGR
        C++SY YAW+YCC+FMC KL ++KRTRRR  RRRDLEEEFES+G KHR+ SSSDSS+VPE IE RSS++AS R R NHRGSQMRKALR KS GIRVRSGR
Subjt:  CLSSYCYAWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGEKHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSGR

Query:  VSVYGKHRRKSTEVGSHLNEIHS--HHGSSKFVHKERKYRRGSQR
          VYGKHRRKS+EV + L EIHS   HGSSKFVH+E +Y+RG Q+
Subjt:  VSVYGKHRRKSTEVGSHLNEIHS--HHGSSKFVHKERKYRRGSQR

A0A6J1FIX0 uncharacterized protein LOC111445893 (e value: 1.8e-84; % identity: 73.66)
Query:  MLNLHDVMGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWT
        MLNL D MGNVASSLASG F A+ KVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCV+NLLKMGMV+IL+YFVLLLLYL HKIGIFGCIGRGLCRMIWT
Subjt:  MLNLHDVMGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWT

Query:  CLSSYCYAWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGE-KHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSG
        CL+SYC+AWEYCC+FMC KLA++KRTRRRH RRRDLEEE ES+ + K+R+ SSSDSSN  + IESR SK+ S +RRR+HRGSQ  K LR  SHGIRVRSG
Subjt:  CLSSYCYAWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGE-KHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSG

Query:  RVSVYGKHRRKSTEVGSHLNEIHSHHGSSKFVHKERKYRRGSQ
        RV VY K                  HGSSK V KER YRRG Q
Subjt:  RVSVYGKHRRKSTEVGSHLNEIHSHHGSSKFVHKERKYRRGSQ

A0A6J1IVE5 uncharacterized protein LOC111480312 (e value: 1.4e-84; % identity: 73.77)
Query:  MLNLHDVMGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWT
        MLNL D MGNVASSLASG F A+ KVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCV+NLLKMGMV+IL+YFVLLLLYL HKIGIFGCIGRG CRMIWT
Subjt:  MLNLHDVMGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWT

Query:  CLSSYCYAWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGE-KHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSG
        CL+SYC+AWEYCC+FMC KLA++KRTRR H RRRDLEEE ES+ E KHR+ SS DSSN  E IESR S++ S +RRR+HRGSQ  K LR  SHGIRVRSG
Subjt:  CLSSYCYAWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGE-KHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSG

Query:  RVSVYGKHRRKSTEVGSHLNEIHSHHGSSKFVHKERKYRRGSQR
        RV VY K                  HGSSKFV K+RKYRRG QR
Subjt:  RVSVYGKHRRKSTEVGSHLNEIHSHHGSSKFVHKERKYRRGSQR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G21722.1 unknown protein (e value: 8.9e-36; % identity: 40.68)
Query:  MGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYCY
        MGNV  S  +GF  +IG  FGSPLDFLSGKSCSSVC S WDFICY+ENFCV+NL K  ++LILSYF L  +Y+L+K+G + CI  G  +++W  +S + Y
Subjt:  MGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYCY

Query:  AWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGEKHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSGRVSVYGKH
           YCCSF C+ L + KR RRR R  R +EE+++            D+S+  + ++   S      +R   +  ++RK+LR ++H +RV         K 
Subjt:  AWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGEKHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSGRVSVYGKH

Query:  RRKSTEVGSHLNEIHSHHG-----SSKFVHKERKYR
         R  + +  H +     HG      SKF  K  K R
Subjt:  RRKSTEVGSHLNEIHSHHG-----SSKFVHKERKYR

AT1G78922.1 unknown protein (e value: 1.3e-15; % identity: 31.07)
Query:  MGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYCY
        MGN     ++     IG +F +PL    G+SC  VC   WD  C+IE+FC+ ++ K+ ++  L + +L+ + LL K+GI  C+ + +C+M     ++Y +
Subjt:  MGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYCY

Query:  AWEYCCSFMCFKLANIKRTRRRHRRRRDLEE---EFESDGEKHRHTSSSDSSNV-PEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSGRVSV
              S +C  L NI R  RR +R  D+E    ++ SD E     S S   N+ P+    R   + SH    ++R +  R+ +R  S  + VR G    
Subjt:  AWEYCCSFMCFKLANIKRTRRRHRRRRDLEE---EFESDGEKHRHTSSSDSSNV-PEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIRVRSGRVSV

Query:  YGKHRR
         GK RR
Subjt:  YGKHRR


Sequences
CDS sequence
ATGAGTCACCCTCCATTCTTCTTCCATCTGCTGTTCTTCATTTCGCGGTGCTCTTTAAGCGTTCCTATCGATTGGATGCTGAATCTACACGACGTAATGGGTAATGTGGC
CAGTTCATTGGCTTCTGGGTTTTTTTCGGCCATTGGCAAAGTATTTGGATCCCCACTTGATTTTCTCTCTGGAAAGTCCTGCAGTTCAGTGTGTGGATCAACATGGGATT
TCATATGCTACATAGAAAATTTCTGCGTTTCCAATTTGCTAAAGATGGGCATGGTCTTGATCCTTTCATACTTTGTTCTTTTACTCCTGTATTTATTACATAAAATTGGC
ATCTTTGGATGCATCGGTCGGGGGCTCTGCAGAATGATATGGACATGTTTATCTTCCTATTGCTATGCATGGGAGTACTGCTGCTCTTTCATGTGTTTCAAGCTTGCCAA
TATCAAAAGAACAAGAAGACGACACCGGAGAAGAAGAGACCTGGAAGAAGAGTTTGAAAGTGATGGCGAAAAACATCGGCACACATCATCAAGTGATTCGAGCAATGTCC
CCGAACACATTGAGTCGAGAAGTAGCAAACAAGCATCTCACAGACGGAGGAGGAACCATAGAGGTTCTCAAATGAGAAAGGCATTGAGGACGAAGAGCCACGGAATTCGA
GTAAGGAGCGGTAGAGTATCGGTCTATGGTAAGCATAGAAGAAAATCCACTGAGGTTGGGAGTCATTTGAATGAGATCCATAGCCATCATGGATCATCCAAGTTTGTGCA
TAAAGAAAGAAAGTATAGAAGAGGAAGCCAAAGATGA
mRNA sequence
ATGAGTCACCCTCCATTCTTCTTCCATCTGCTGTTCTTCATTTCGCGGTGCTCTTTAAGCGTTCCTATCGATTGGATGCTGAATCTACACGACGTAATGGGTAATGTGGC
CAGTTCATTGGCTTCTGGGTTTTTTTCGGCCATTGGCAAAGTATTTGGATCCCCACTTGATTTTCTCTCTGGAAAGTCCTGCAGTTCAGTGTGTGGATCAACATGGGATT
TCATATGCTACATAGAAAATTTCTGCGTTTCCAATTTGCTAAAGATGGGCATGGTCTTGATCCTTTCATACTTTGTTCTTTTACTCCTGTATTTATTACATAAAATTGGC
ATCTTTGGATGCATCGGTCGGGGGCTCTGCAGAATGATATGGACATGTTTATCTTCCTATTGCTATGCATGGGAGTACTGCTGCTCTTTCATGTGTTTCAAGCTTGCCAA
TATCAAAAGAACAAGAAGACGACACCGGAGAAGAAGAGACCTGGAAGAAGAGTTTGAAAGTGATGGCGAAAAACATCGGCACACATCATCAAGTGATTCGAGCAATGTCC
CCGAACACATTGAGTCGAGAAGTAGCAAACAAGCATCTCACAGACGGAGGAGGAACCATAGAGGTTCTCAAATGAGAAAGGCATTGAGGACGAAGAGCCACGGAATTCGA
GTAAGGAGCGGTAGAGTATCGGTCTATGGTAAGCATAGAAGAAAATCCACTGAGGTTGGGAGTCATTTGAATGAGATCCATAGCCATCATGGATCATCCAAGTTTGTGCA
TAAAGAAAGAAAGTATAGAAGAGGAAGCCAAAGATGA
Protein sequence
MSHPPFFFHLLFFISRCSLSVPIDWMLNLHDVMGNVASSLASGFFSAIGKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVSNLLKMGMVLILSYFVLLLLYLLHKIG
IFGCIGRGLCRMIWTCLSSYCYAWEYCCSFMCFKLANIKRTRRRHRRRRDLEEEFESDGEKHRHTSSSDSSNVPEHIESRSSKQASHRRRRNHRGSQMRKALRTKSHGIR
VRSGRVSVYGKHRRKSTEVGSHLNEIHSHHGSSKFVHKERKYRRGSQR
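
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are hypothetical helpers for illustration and are not part of CuGenDB. Only the first and last 18 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing
    characters other than T, C, A, G are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 18 bases of the CDS listed above.
print(translate("ATGAGTCACCCTCCATTC"))
# Last 18 bases of the CDS listed above, ending in the stop codon TGA.
print(translate("AGAGGAAGCCAAAGATGA"))
```

Passing the full CDS (all lines joined) to `translate` should reproduce the listed protein sequence if the record is internally consistent.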