CuGenDBv2

Sgr021962 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr021962
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein.
Genome location: tig00153870:178475..179338
RNA-Seq Expression: Sgr021962
Synteny: Sgr021962
Gene Ontology terms: NA
InterPro domains: NA
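
The genome location above follows a `contig:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for such records, but not stated on this page), the length of the gene span could be computed as follows. The helper name `parse_location` is hypothetical and used only for illustration.

```python
import re


def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


contig, start, end = parse_location("tig00153870:178475..179338")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(contig, span)  # tig00153870 864
```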


Homology
GenBank top hits (e value, % identity, alignment)
KAA0036575.1 uncharacterized protein E6C27_scaffold191G00850 [Cucumis melo var. makuwa] (e value: 1.1e-62; % identity: 68.13)
Query:  RANSLTALRIRRSFGTYHSSSGGICNSRSSTGKPKTT-PIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEE
        RAN+LT  RI R+ GT  SS    CNSRS    P+TT  I PPYPWST  RA+V TLN LRS+QIL ITGDVRCR+C  +Y IEYD+V+KF+EIA+F+EE
Subjt:  RANSLTALRIRRSFGTYHSSSGGICNSRSSTGKPKTT-PIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEE

Query:  NKDSLHDRAPSSWMYPKFPTCKFCNLEDGAVPVIPEERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLT
        NK+   DRAP SWM P +PTC+FC  E+GA PVIP+E R+INWLFLLLG+ LG+LNL HLKYFC+YTNNHRTGAKNRLLYLT
Subjt:  NKDSLHDRAPSSWMYPKFPTCKFCNLEDGAVPVIPEERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLT

XP_008447299.1 PREDICTED: uncharacterized protein LOC103489770 [Cucumis melo] (e value: 1.1e-70; % identity: 68.18)
Query:  RANSLTALRIRRSFGTYHSSSGGICNSRSSTGKPKTT-PIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEE
        RAN+LT  RI R+ GT  SS    CNSRS    P+TT  I PPYPWST  RA+V TLN LRS+QIL ITGDVRCR+C  +Y IEYD+V+KF+EIA+F+EE
Subjt:  RANSLTALRIRRSFGTYHSSSGGICNSRSSTGKPKTT-PIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEE

Query:  NKDSLHDRAPSSWMYPKFPTCKFCNLEDGAVPVIPEERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLTYLTLCKQVDPNGPFDR
        NK+   DRAP SWM P +PTC+FC  E+GA PVIP+E R+INWLFLLLG+ LG+LNL HLKYFC+YTNNHRTGAKNRLLYLTY+TLC QVDP+G F+R
Subjt:  NKDSLHDRAPSSWMYPKFPTCKFCNLEDGAVPVIPEERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLTYLTLCKQVDPNGPFDR

XP_011659748.1 uncharacterized protein LOC105436256 [Cucumis sativus] (e value: 4.0e-68; % identity: 61.86)
Query:  HPSTSTGYLPAGSSHIERANSLTALRIRRSFGTYHSSSGGICNSRSSTGKPKTT-PIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQI
        H S+     P G +   R N++T +R+ RS GT  SS    CNSRS    P+TT  I PPYPWST  RA+V TLN L+SNQIL ITGDV+CR+C  +Y I
Subjt:  HPSTSTGYLPAGSSHIERANSLTALRIRRSFGTYHSSSGGICNSRSSTGKPKTT-PIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQI

Query:  EYDLVTKFDEIATFIEENKDSLHDRAPSSWMYPKFPTCKFCNLEDGAVPVIPEERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLTY
        EYD+ +KF+EIA+F+EENK+S  DRAP SWM P +PTC+FC  E+GA PVIP++ R+INWLFLLLG+ LG+LNL HLKYFC+ T NHRTGAKNRLLYLTY
Subjt:  EYDLVTKFDEIATFIEENKDSLHDRAPSSWMYPKFPTCKFCNLEDGAVPVIPEERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLTY

Query:  LTLCKQVDPNGPFDR
        +TLC QVDP+G F+R
Subjt:  LTLCKQVDPNGPFDR

XP_022135937.1 uncharacterized protein LOC111007768 [Momordica charantia] (e value: 9.8e-67; % identity: 53.87)
Query:  SDNLALELSLRPP-----VNNRVVLQQQ----PPVLLPMPPLL-----FRQTSNQMLVPHPSTSTGYLPAGSSHIERANSLTALRIRRSFGTYHS--SSG
        S NL ++LSLRPP         +++QQ+    PP L P P  L        T+NQ++  H STS+    +      RA +   +R R S     +   + 
Subjt:  SDNLALELSLRPP-----VNNRVVLQQQ----PPVLLPMPPLL-----FRQTSNQMLVPHPSTSTGYLPAGSSHIERANSLTALRIRRSFGTYHS--SSG

Query:  GICNSRSSTGKPKTTPIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEENKDSLHDRAPSSWMYPKFPTCKF
         +   R S  + K TPI+PPYPWSTE +AVVH LNYLR NQILTITGDVRC RC +QY IEYDL+TKF+EIA+FIE+NK +LHDRAP SW  P F  CK 
Subjt:  GICNSRSSTGKPKTTPIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEENKDSLHDRAPSSWMYPKFPTCKF

Query:  CNLEDGAVPVIPE-ERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLTYLTLCKQVDPN
        C  E+   P IPE + + INWLFLLLGQ +G L LEHLKYFC YTNNHRTGAKNRL+YLTYLTLCKQ+ P+
Subjt:  CNLEDGAVPVIPE-ERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLTYLTLCKQVDPN

XP_022135938.1 probable serine/threonine-protein kinase samkC [Momordica charantia] (e value: 1.0e-63; % identity: 50.88)
Query:  DHRITPMETPKNADRSRESDNLALELSLRPPVNNRVVLQQQPPVLLPMPPLLFRQTSNQMLVPHPSTSTGYLPAGSSHIERANSLTALRIRRSF----GT
        ++++ P +   ++  S ES +  +  S   P+++    Q QP  L   P  L  Q   +  +PHPSTS+            + SL   R RRS      T
Subjt:  DHRITPMETPKNADRSRESDNLALELSLRPPVNNRVVLQQQPPVLLPMPPLLFRQTSNQMLVPHPSTSTGYLPAGSSHIERANSLTALRIRRSF----GT

Query:  YHSSSGGICN----SRSSTGKPKTTPIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEENKDSLHDRAPSSW
        + SSS         SR    KPK T I PPYPWST  RAVVH L YL+ NQILTITGDV+C +C +QY+IEYDLVTKFDEIA+FIE+NKD+LHDRAPSSW
Subjt:  YHSSSGGICN----SRSSTGKPKTTPIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEENKDSLHDRAPSSW

Query:  MYPKFPTCKFCNLEDGAVPVIP-----EERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLTYLTLCKQVDPN
          P  P CKFC  E    PVIP     ++ + INWLFLLLGQ +G L L+HLKYFCTYTNNHRT AK+RL+YLTYL+LCKQ+ P+
Subjt:  MYPKFPTCKFCNLEDGAVPVIP-----EERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLTYLTLCKQVDPN

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K3Q8 Uncharacterized protein (e value: 1.9e-68; % identity: 61.86)
Query:  HPSTSTGYLPAGSSHIERANSLTALRIRRSFGTYHSSSGGICNSRSSTGKPKTT-PIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQI
        H S+     P G +   R N++T +R+ RS GT  SS    CNSRS    P+TT  I PPYPWST  RA+V TLN L+SNQIL ITGDV+CR+C  +Y I
Subjt:  HPSTSTGYLPAGSSHIERANSLTALRIRRSFGTYHSSSGGICNSRSSTGKPKTT-PIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQI

Query:  EYDLVTKFDEIATFIEENKDSLHDRAPSSWMYPKFPTCKFCNLEDGAVPVIPEERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLTY
        EYD+ +KF+EIA+F+EENK+S  DRAP SWM P +PTC+FC  E+GA PVIP++ R+INWLFLLLG+ LG+LNL HLKYFC+ T NHRTGAKNRLLYLTY
Subjt:  EYDLVTKFDEIATFIEENKDSLHDRAPSSWMYPKFPTCKFCNLEDGAVPVIPEERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLTY

Query:  LTLCKQVDPNGPFDR
        +TLC QVDP+G F+R
Subjt:  LTLCKQVDPNGPFDR

A0A1S3BHR1 uncharacterized protein LOC103489770 (e value: 5.4e-71; % identity: 68.18)
Query:  RANSLTALRIRRSFGTYHSSSGGICNSRSSTGKPKTT-PIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEE
        RAN+LT  RI R+ GT  SS    CNSRS    P+TT  I PPYPWST  RA+V TLN LRS+QIL ITGDVRCR+C  +Y IEYD+V+KF+EIA+F+EE
Subjt:  RANSLTALRIRRSFGTYHSSSGGICNSRSSTGKPKTT-PIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEE

Query:  NKDSLHDRAPSSWMYPKFPTCKFCNLEDGAVPVIPEERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLTYLTLCKQVDPNGPFDR
        NK+   DRAP SWM P +PTC+FC  E+GA PVIP+E R+INWLFLLLG+ LG+LNL HLKYFC+YTNNHRTGAKNRLLYLTY+TLC QVDP+G F+R
Subjt:  NKDSLHDRAPSSWMYPKFPTCKFCNLEDGAVPVIPEERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLTYLTLCKQVDPNGPFDR

A0A5A7T547 Uncharacterized protein (e value: 5.4e-63; % identity: 68.13)
Query:  RANSLTALRIRRSFGTYHSSSGGICNSRSSTGKPKTT-PIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEE
        RAN+LT  RI R+ GT  SS    CNSRS    P+TT  I PPYPWST  RA+V TLN LRS+QIL ITGDVRCR+C  +Y IEYD+V+KF+EIA+F+EE
Subjt:  RANSLTALRIRRSFGTYHSSSGGICNSRSSTGKPKTT-PIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEE

Query:  NKDSLHDRAPSSWMYPKFPTCKFCNLEDGAVPVIPEERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLT
        NK+   DRAP SWM P +PTC+FC  E+GA PVIP+E R+INWLFLLLG+ LG+LNL HLKYFC+YTNNHRTGAKNRLLYLT
Subjt:  NKDSLHDRAPSSWMYPKFPTCKFCNLEDGAVPVIPEERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLT

A0A6J1C462 uncharacterized protein LOC111007768 (e value: 3.6e-67; % identity: 54.24)
Query:  SDNLALELSLRPP-----VNNRVVLQQQ----PPVLLPMPPLL-----FRQTSNQMLVPHPSTSTGYLPAGSSHIERANSLTALRIRRSFGTYHS--SSG
        S NL +ELSLRPP         +++QQ+    PP L P P  L        T+NQ++  H STS+    +      RA +   +R R S     +   + 
Subjt:  SDNLALELSLRPP-----VNNRVVLQQQ----PPVLLPMPPLL-----FRQTSNQMLVPHPSTSTGYLPAGSSHIERANSLTALRIRRSFGTYHS--SSG

Query:  GICNSRSSTGKPKTTPIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEENKDSLHDRAPSSWMYPKFPTCKF
         +   R S  + K TPI+PPYPWSTE +AVVH LNYLR NQILTITGDVRC RC +QY IEYDL+TKF+EIA+FIE+NK +LHDRAP SW  P F  CK 
Subjt:  GICNSRSSTGKPKTTPIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEENKDSLHDRAPSSWMYPKFPTCKF

Query:  CNLEDGAVPVIPE-ERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLTYLTLCKQVDPN
        C  E+   P IPE + + INWLFLLLGQ +G L LEHLKYFC YTNNHRTGAKNRL+YLTYLTLCKQ+ P+
Subjt:  CNLEDGAVPVIPE-ERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLTYLTLCKQVDPN

A0A6J1C690 probable serine/threonine-protein kinase samkC (e value: 4.9e-64; % identity: 50.88)
Query:  DHRITPMETPKNADRSRESDNLALELSLRPPVNNRVVLQQQPPVLLPMPPLLFRQTSNQMLVPHPSTSTGYLPAGSSHIERANSLTALRIRRSF----GT
        ++++ P +   ++  S ES +  +  S   P+++    Q QP  L   P  L  Q   +  +PHPSTS+            + SL   R RRS      T
Subjt:  DHRITPMETPKNADRSRESDNLALELSLRPPVNNRVVLQQQPPVLLPMPPLLFRQTSNQMLVPHPSTSTGYLPAGSSHIERANSLTALRIRRSF----GT

Query:  YHSSSGGICN----SRSSTGKPKTTPIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEENKDSLHDRAPSSW
        + SSS         SR    KPK T I PPYPWST  RAVVH L YL+ NQILTITGDV+C +C +QY+IEYDLVTKFDEIA+FIE+NKD+LHDRAPSSW
Subjt:  YHSSSGGICN----SRSSTGKPKTTPIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEENKDSLHDRAPSSW

Query:  MYPKFPTCKFCNLEDGAVPVIP-----EERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLTYLTLCKQVDPN
          P  P CKFC  E    PVIP     ++ + INWLFLLLGQ +G L L+HLKYFCTYTNNHRT AK+RL+YLTYL+LCKQ+ P+
Subjt:  MYPKFPTCKFCNLEDGAVPVIP-----EERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLTYLTLCKQVDPN

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G49330.1 hydroxyproline-rich glycoprotein family protein (e value: 3.2e-47; % identity: 45)
Query:  FRQTSN-QMLVPHPSTSTGYLPAGSSHIERANSLTALRIRRSFGTYHSSSGGICNSRSSTGKPKTTPIRPPYPWSTELRAVVHTLNYLRSNQILTITGDV
        F+QT N   LV H    +G  P  S       +LT   ++R      + S  I  SRS+  K K+  I PP+PW+T  R  + +L YL SNQI TITG+V
Subjt:  FRQTSN-QMLVPHPSTSTGYLPAGSSHIERANSLTALRIRRSFGTYHSSSGGICNSRSSTGKPKTTPIRPPYPWSTELRAVVHTLNYLRSNQILTITGDV

Query:  RCRRCHRQYQIEYDLVTKFDEIATFIEENKDSLHDRAPSSWMYPKFPTCKFCNLEDGAVPVIPEERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRT
        +CR C + YQ+ Y+L  +F E+  F    K  + DRA   W YP+   C+ C  E    PVI E + +INWLFLLLGQTLG   LE LK FC ++ NHRT
Subjt:  RCRRCHRQYQIEYDLVTKFDEIATFIEENKDSLHDRAPSSWMYPKFPTCKFCNLEDGAVPVIPEERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRT

Query:  GAKNRLLYLTYLTLCKQVDP
        GAK+R+LYLTY+ LCK + P
Subjt:  GAKNRLLYLTYLTLCKQVDP

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1) (e value: 8.9e-42; % identity: 47.47)
Query:  IRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEENKDSLHDRAPSSWMYPKFPTCKFCNLEDGAVPVIPEERR
        I PPYPW+T+    + +   L SN I  I+G V C+ C R   +EY+L  KF E+  +I+ NK+ +  RAP SW  PK   C+ C  E    PV+ E + 
Subjt:  IRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEENKDSLHDRAPSSWMYPKFPTCKFCNLEDGAVPVIPEERR

Query:  RINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLTYLTLCKQVDPNGPFD
         INWLFLLLGQ LG   L+ L+YFC   + HRTG+K+R++Y+TYL+LCKQ+DP GPF+
Subjt:  RINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLTYLTLCKQVDPNGPFD

AT2G16190.2 FUNCTIONS IN: molecular_function unknown (e value: 9.9e-25; % identity: 43.8)
Query:  IRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEENKDSLHDRAPSSWMYPKFPTCKFCNLEDGAVPVIPEERR
        I PPYPW+T+    + +   L SN I  I+G V C+ C R   +EY+L  KF E+  +I+ NK+ +  RAP SW  PK   C+ C  E    PV+ E + 
Subjt:  IRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEENKDSLHDRAPSSWMYPKFPTCKFCNLEDGAVPVIPEERR

Query:  RINWLFLLLGQTLGILNLEHL
         INWLFLLLGQ LG   L+ L
Subjt:  RINWLFLLLGQTLGILNLEHL
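
The unlabeled middle line of each alignment block can be read residue by residue. The sketch below assumes the usual BLAST-style convention, which this page does not spell out: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. It uses the final block of the XP_011659748.1 alignment as input, and `midline_counts` is a hypothetical helper name. The result describes that block only and is not the whole-alignment % identity reported for the hit.

```python
def midline_counts(midline):
    """Count identical residues (letters) and positives ('+') in a midline."""
    identical = sum(1 for ch in midline if ch.isalpha())
    positive = sum(1 for ch in midline if ch == "+")
    return identical, positive


# Final block of the XP_011659748.1 alignment, copied from the record.
query = "LTLCKQVDPNGPFDR"
midline = "+TLC QVDP+G F+R"

identical, positive = midline_counts(midline)
# Assumption: this block has no gaps, so its length equals the query length.
percent_identity = 100.0 * identical / len(query)
print(identical, positive, round(percent_identity, 1))  # 10 3 66.7
```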


Sequences
CDS sequence
ATGGAATCGGACGAACAAGAAGCTGCTCAAGATCATCGGATTACGCCAATGGAGACTCCGAAGAATGCCGACCGAAGCCGTGAAAGCGACAATCTCGCCCTGGAACTCTC
TCTCCGTCCGCCGGTGAACAACCGTGTGGTGCTGCAGCAGCAGCCGCCAGTTCTCCTTCCGATGCCGCCATTATTGTTTCGCCAAACTTCGAACCAGATGCTCGTTCCGC
ACCCTTCCACTTCCACTGGCTATCTTCCTGCAGGCAGTAGCCATATCGAGCGGGCAAATTCACTAACCGCATTGCGAATTAGGCGCAGTTTTGGAACTTATCATTCTTCA
AGTGGTGGAATCTGCAATTCGAGAAGCTCCACAGGCAAACCGAAGACCACGCCTATCAGGCCGCCCTATCCTTGGTCGACTGAACTCCGAGCGGTGGTTCACACTCTAAA
TTACCTCCGATCAAACCAGATCCTCACTATCACTGGCGATGTCCGATGCCGGCGATGCCATAGACAGTACCAGATTGAATACGACCTCGTTACGAAGTTCGATGAGATTG
CAACTTTTATAGAGGAAAACAAGGATTCTTTGCACGACAGAGCCCCGAGCTCCTGGATGTACCCTAAATTTCCGACCTGCAAGTTCTGTAACCTAGAAGACGGAGCAGTA
CCGGTGATACCAGAGGAGCGGAGGCGCATCAATTGGCTTTTCTTGCTTTTAGGACAAACGCTTGGAATTTTGAATCTCGAACATCTGAAATACTTCTGCACTTACACCAA
CAATCATCGAACAGGTGCGAAGAATCGCCTTCTTTATCTCACTTATCTTACTTTGTGCAAGCAAGTTGATCCAAACGGGCCTTTCGATCGCTGA
mRNA sequence
ATGGAATCGGACGAACAAGAAGCTGCTCAAGATCATCGGATTACGCCAATGGAGACTCCGAAGAATGCCGACCGAAGCCGTGAAAGCGACAATCTCGCCCTGGAACTCTC
TCTCCGTCCGCCGGTGAACAACCGTGTGGTGCTGCAGCAGCAGCCGCCAGTTCTCCTTCCGATGCCGCCATTATTGTTTCGCCAAACTTCGAACCAGATGCTCGTTCCGC
ACCCTTCCACTTCCACTGGCTATCTTCCTGCAGGCAGTAGCCATATCGAGCGGGCAAATTCACTAACCGCATTGCGAATTAGGCGCAGTTTTGGAACTTATCATTCTTCA
AGTGGTGGAATCTGCAATTCGAGAAGCTCCACAGGCAAACCGAAGACCACGCCTATCAGGCCGCCCTATCCTTGGTCGACTGAACTCCGAGCGGTGGTTCACACTCTAAA
TTACCTCCGATCAAACCAGATCCTCACTATCACTGGCGATGTCCGATGCCGGCGATGCCATAGACAGTACCAGATTGAATACGACCTCGTTACGAAGTTCGATGAGATTG
CAACTTTTATAGAGGAAAACAAGGATTCTTTGCACGACAGAGCCCCGAGCTCCTGGATGTACCCTAAATTTCCGACCTGCAAGTTCTGTAACCTAGAAGACGGAGCAGTA
CCGGTGATACCAGAGGAGCGGAGGCGCATCAATTGGCTTTTCTTGCTTTTAGGACAAACGCTTGGAATTTTGAATCTCGAACATCTGAAATACTTCTGCACTTACACCAA
CAATCATCGAACAGGTGCGAAGAATCGCCTTCTTTATCTCACTTATCTTACTTTGTGCAAGCAAGTTGATCCAAACGGGCCTTTCGATCGCTGA
Protein sequence
MESDEQEAAQDHRITPMETPKNADRSRESDNLALELSLRPPVNNRVVLQQQPPVLLPMPPLLFRQTSNQMLVPHPSTSTGYLPAGSSHIERANSLTALRIRRSFGTYHSS
SGGICNSRSSTGKPKTTPIRPPYPWSTELRAVVHTLNYLRSNQILTITGDVRCRRCHRQYQIEYDLVTKFDEIATFIEENKDSLHDRAPSSWMYPKFPTCKFCNLEDGAV
PVIPEERRRINWLFLLLGQTLGILNLEHLKYFCTYTNNHRTGAKNRLLYLTYLTLCKQVDPNGPFDR
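
The CDS and protein sequences above can be cross-checked by translation. The following is a minimal sketch, assuming the standard genetic code and reading frame 1; the `translate` function and the constant names are illustrative and are not part of the database. It translates the first 30 nucleotides and the last 12 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at a stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGAATCGGACGAACAAGAAGCTGCTCAA"  # first 30 nt of the CDS
cds_end = "TTCGATCGCTGA"  # last 12 nt of the CDS, including the stop codon
print(translate(cds_start))  # MESDEQEAAQ
print(translate(cds_end))    # FDR
```

Both fragments agree with the start (MESDEQEAAQ) and end (FDR) of the protein sequence shown above.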