; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10010513 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10010513
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationChr06:22861392..22862213
RNA-Seq ExpressionHG10010513
SyntenyHG10010513
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIM98277.1 hypothetical protein CDL12_29246 [Handroanthus impetiginosus]3.8e-5245Show/hide
Query:  PNENGDGNGNDDDNLDLSLHL-PQPLP-----PPLPTPSPPSLTPNPTMNFF-----PL------------PSSPT-------ANPVAPTRFRRQLLRPG
        PN NG     D  NL LSL      LP     P LP PS P  T N +   +     PL            P  PT         P+   R  R++LRPG
Subjt:  PNENGDGNGNDDDNLDLSLHL-PQPLP-----PPLPTPSPPSLTPNPTMNFF-----PL------------PSSPT-------ANPVAPTRFRRQLLRPG

Query:  KTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKRLDCESCGGERCVGAVIG
        ++ETI  PYPWA + RAII  +  +  +GI+ I GK+KC  C  + +I+Y+L  KF EV+NF++ NK  M+ RAPEVW +P +LDC+ C  ++CV  ++G
Subjt:  KTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKRLDCESCGGERCVGAVIG

Query:  KKRDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQY
        KKR INWLFL LGQM+G C L +LKYFCKHT  H TGA++R+LY+ Y  LC QL+P G Y
Subjt:  KKRDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQY

TXG59059.1 hypothetical protein EZV62_016888 [Acer yangbiense]5.5e-5144.96Show/hide
Query:  NLDLSLHIPQPLLPPLPTP----SPPNENGDGNGNDDDNLDLSLHLPQPLPPPLPTPSPPSLTPNPTMNFFPLPSSPTANPVAPTRFRR---QLLRPGKT
        ++ +S   P  L PP P P    SPP ++   +            + + LPPP  +P PP    N T+N      +P   P   TR RR   Q  +PGKT
Subjt:  NLDLSLHIPQPLLPPLPTP----SPPNENGDGNGNDDDNLDLSLHLPQPLPPPLPTPSPPSLTPNPTMNFFPLPSSPTANPVAPTRFRR---QLLRPGKT

Query:  ETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKRLDCESCGGERCVGAVIGKK
        ET+ +PYPWA  +RA + S+  L+ + I  I G+++CKRCEK  +IEY+L EKF E+  FI +NKS M++RAP VW  P    C+ CG   CV  +I KK
Subjt:  ETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKRLDCESCGGERCVGAVIGKK

Query:  RDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQY
        + +NWLFL LG+MLG C LEQLKYFCKHT+NH+TGA++R+LY+ Y  LC QL+P G +
Subjt:  RDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQY

XP_022157692.1 uncharacterized protein LOC111024349 [Momordica charantia]2.3e-5747.67Show/hide
Query:  NLHHLSPPNDND----------DDNLDLSLHIPQPLLPPLPTPSPPNENGDG--NGNDDDNLDLSLHLPQPLPPPLPTPSPPSLTPNPTMNFFPLPSSPT
        NL H S P+             D  L      P P  P   +P+ P + G         + + LSL   +  PPP P P PP           P   SP 
Subjt:  NLHHLSPPNDND----------DDNLDLSLHIPQPLLPPLPTPSPPNENGDG--NGNDDDNLDLSLHLPQPLPPPLPTPSPPSLTPNPTMNFFPLPSSPT

Query:  ANPVAPTRFRRQLLRPGKTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKR
         NP  P    R LL  GK+ETIP PYPWA   RAIIRS+ +L  +GI KI G+MKCK+C  +S++E+NL EKF EVE+FI  NKSEM+ RAP  W  P R
Subjt:  ANPVAPTRFRRQLLRPGKTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKR

Query:  LDCESCGGERCVGAVIGKKRDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQYHI
         DC  C GE C   V GKKR++NWLFL LGQM+G+ SLE LKY CKHTRNH+TGA++RL+YIAY  LC QL+P G Y +
Subjt:  LDCESCGGERCVGAVIGKKRDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQYHI

XP_028120279.1 uncharacterized protein LOC114317729 [Camellia sinensis]9.4e-5145.02Show/hide
Query:  PNLHHLSPPNDNDDDNLDLSLHIPQPLLPP-LPTPSPPNENGDGNGNDDDNLDLSLHLPQPLPPPLPTPSPPSLTPNPTMNFFPLPSSPTANPVAPTRFR
        P++H     + +   +L LS H P P  PP LP P PP            +  L   L  P PP LP   PPS   +P   F P PS   A P  P R R
Subjt:  PNLHHLSPPNDNDDDNLDLSLHIPQPLLPP-LPTPSPPNENGDGNGNDDDNLDLSLHLPQPLPPPLPTPSPPSLTPNPTMNFFPLPSSPTANPVAPTRFR

Query:  R---QLLRPGKTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKRLDCESCG
        R   Q  R GK++ +PAP+PWA NRRA + S+ +L+ + I  IVG ++CKRCE+  E+E++L +KF EV  FI ++K+ M++RAP +W  P    C+ C 
Subjt:  R---QLLRPGKTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKRLDCESCG

Query:  GERCVGAVIG-KKRDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQY
         E  +  +I  KK+ INWLFL LGQMLG C+LEQLKYFCKHT+NH+TGA++R+LY+ Y  LC QL+P+G +
Subjt:  GERCVGAVIG-KKRDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQY

XP_031257817.1 uncharacterized protein LOC116115824 [Pistacia vera]1.4e-5146.84Show/hide
Query:  SPPND----NDDDNLDLSLHIPQPLLPPLPTPSPPNENGDGNGNDDDNLDLSLHLPQPLPPPLPTPSPPSLTPNPTMNFFPLPSSPTANPVAPTRFRR--
        SP ND     DD  L+L+L +P P        +P  +N              L+LP PL  PL  PS     P+PT++      +  A P   +R RR  
Subjt:  SPPND----NDDDNLDLSLHIPQPLLPPLPTPSPPNENGDGNGNDDDNLDLSLHLPQPLPPPLPTPSPPSLTPNPTMNFFPLPSSPTANPVAPTRFRR--

Query:  -QLLRPGKTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKRLDCESCGGER
         Q L+PGKTE +PAPYPWA   RA + S+ +L+   I KI G+++CKRCE   EIEY+L EKF EV +FI ENK  M++RAPE+W  P    C+ CG   
Subjt:  -QLLRPGKTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKRLDCESCGGER

Query:  CVGAVIGKKRDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQYHI
         V  V  KK+ INWLFLFLGQMLG   L QLKYFCKHT+NH+TGA++R+LY+AY  LC QL+P+G + I
Subjt:  CVGAVIGKKRDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQYHI

TrEMBL top hitse value%identityAlignment
A0A068TNY1 Uncharacterized protein4.5e-5145.91Show/hide
Query:  HLSPPNDNDDDNLDLSL---------HIPQPLLPPLPTPSPPNENGDGNGNDDDNLDLSLHLPQPLPPPLPTPSP---PSLTPNPTMNFF---PLPSSPT
        HL   +DND   L LS          H P    PPLP P    ++              L++P P+P P  T SP   P  +  PT+        PSS +
Subjt:  HLSPPNDNDDDNLDLSL---------HIPQPLLPPLPTPSPPNENGDGNGNDDDNLDLSLHLPQPLPPPLPTPSP---PSLTPNPTMNFF---PLPSSPT

Query:  ANPVAPTRFRR---QLLRPGKTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNT
        A    P R RR   Q  R GK+ET+PAP+PWA  RRA + S+  L+   I KI G+++CKRCEK  E+EY+L EKF EV +FI ENKS M++RAP VW  
Subjt:  ANPVAPTRFRR---QLLRPGKTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNT

Query:  PKRLDCESCGGERCVGAVI-GKKRDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQY
        P    C+ C  + CV  +I  KK+ INWLFL LGQMLG C+LEQLKYFCKHT+NH+TGA++R+LY+ Y  LC QL+P+G +
Subjt:  PKRLDCESCGGERCVGAVI-GKKRDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQY

A0A2G9FYY8 Uncharacterized protein1.8e-5245Show/hide
Query:  PNENGDGNGNDDDNLDLSLHL-PQPLP-----PPLPTPSPPSLTPNPTMNFF-----PL------------PSSPT-------ANPVAPTRFRRQLLRPG
        PN NG     D  NL LSL      LP     P LP PS P  T N +   +     PL            P  PT         P+   R  R++LRPG
Subjt:  PNENGDGNGNDDDNLDLSLHL-PQPLP-----PPLPTPSPPSLTPNPTMNFF-----PL------------PSSPT-------ANPVAPTRFRRQLLRPG

Query:  KTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKRLDCESCGGERCVGAVIG
        ++ETI  PYPWA + RAII  +  +  +GI+ I GK+KC  C  + +I+Y+L  KF EV+NF++ NK  M+ RAPEVW +P +LDC+ C  ++CV  ++G
Subjt:  KTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKRLDCESCGGERCVGAVIG

Query:  KKRDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQY
        KKR INWLFL LGQM+G C L +LKYFCKHT  H TGA++R+LY+ Y  LC QL+P G Y
Subjt:  KKRDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQY

A0A2H5NYI9 Uncharacterized protein4.5e-5151.21Show/hide
Query:  PPLPTPSPPSLTPNPTMNFFPLPSSPTANPVAPTRFRR---QLLRPGKTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLV
        PP  T     +T N   N   + S+   +  + +R R+   Q LRPGKTETIPAP+PWA  RRA + S+  L    + KI G+++CKRCE++ EIEY+L 
Subjt:  PPLPTPSPPSLTPNPTMNFFPLPSSPTANPVAPTRFRR---QLLRPGKTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLV

Query:  EKFREVENFIVENKSEMYNRAPEVWNTPKRLDCESCGGERCVGAVIGKKRDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQ
         KF EV +FI ENK  M++RAP +W  P   +C+ CG   CV  ++GKK+ INWLFL LGQMLG C L +LKYFCKHTRNH+TGA++R+LY+ Y SLC Q
Subjt:  EKFREVENFIVENKSEMYNRAPEVWNTPKRLDCESCGGERCVGAVIGKKRDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQ

Query:  LNPHGQY
        L+P+G Y
Subjt:  LNPHGQY

A0A5C7HQG1 Uncharacterized protein2.7e-5144.96Show/hide
Query:  NLDLSLHIPQPLLPPLPTP----SPPNENGDGNGNDDDNLDLSLHLPQPLPPPLPTPSPPSLTPNPTMNFFPLPSSPTANPVAPTRFRR---QLLRPGKT
        ++ +S   P  L PP P P    SPP ++   +            + + LPPP  +P PP    N T+N      +P   P   TR RR   Q  +PGKT
Subjt:  NLDLSLHIPQPLLPPLPTP----SPPNENGDGNGNDDDNLDLSLHLPQPLPPPLPTPSPPSLTPNPTMNFFPLPSSPTANPVAPTRFRR---QLLRPGKT

Query:  ETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKRLDCESCGGERCVGAVIGKK
        ET+ +PYPWA  +RA + S+  L+ + I  I G+++CKRCEK  +IEY+L EKF E+  FI +NKS M++RAP VW  P    C+ CG   CV  +I KK
Subjt:  ETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKRLDCESCGGERCVGAVIGKK

Query:  RDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQY
        + +NWLFL LG+MLG C LEQLKYFCKHT+NH+TGA++R+LY+ Y  LC QL+P G +
Subjt:  RDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQY

A0A6J1DV57 uncharacterized protein LOC1110243491.1e-5747.67Show/hide
Query:  NLHHLSPPNDND----------DDNLDLSLHIPQPLLPPLPTPSPPNENGDG--NGNDDDNLDLSLHLPQPLPPPLPTPSPPSLTPNPTMNFFPLPSSPT
        NL H S P+             D  L      P P  P   +P+ P + G         + + LSL   +  PPP P P PP           P   SP 
Subjt:  NLHHLSPPNDND----------DDNLDLSLHIPQPLLPPLPTPSPPNENGDG--NGNDDDNLDLSLHLPQPLPPPLPTPSPPSLTPNPTMNFFPLPSSPT

Query:  ANPVAPTRFRRQLLRPGKTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKR
         NP  P    R LL  GK+ETIP PYPWA   RAIIRS+ +L  +GI KI G+MKCK+C  +S++E+NL EKF EVE+FI  NKSEM+ RAP  W  P R
Subjt:  ANPVAPTRFRRQLLRPGKTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKR

Query:  LDCESCGGERCVGAVIGKKRDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQYHI
         DC  C GE C   V GKKR++NWLFL LGQM+G+ SLE LKY CKHTRNH+TGA++RL+YIAY  LC QL+P G Y +
Subjt:  LDCESCGGERCVGAVIGKKRDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQYHI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein1.0e-4238.95Show/hide
Query:  NDNDDDNLDLSLHIP--------QPLLPPLPTPSPPNENGDGN--GNDDDNLDLSLHLPQPLPP----PL-------PTPSPPSLTPN---------PTM
        +D+DD+ L LSL +         +P+  P+P   PP   G         D L     +P P PP    PL        TP+PP L  +         P+ 
Subjt:  NDNDDDNLDLSLHIP--------QPLLPPLPTPSPPNENGDGN--GNDDDNLDLSLHLPQPLPP----PL-------PTPSPPSLTPN---------PTM

Query:  NFFPLP-SSPTANPVAPTRFRRQLLRPGKTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYN
        N  P P   P    V   R R  + +  K++TI  P+PWA NRR  I+S+  L  + I  I G+++C+ CEK  ++ YNL E+F EV  F +  K +M +
Subjt:  NFFPLP-SSPTANPVAPTRFRRQLLRPGKTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYN

Query:  RAPEVWNTPKRLDCESCGGERCVGAVIG-KKRDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNP
        RA + W  P++  CE CG E+ V  VI  +K  INWLFL LGQ LG+C+LEQLK FCKH++NH+TGA++R+LY+ Y  LC  L P
Subjt:  RAPEVWNTPKRLDCESCGGERCVGAVIG-KKRDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNP

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)7.4e-3835.74Show/hide
Query:  PNDNDDDNLDLSLHIPQPLLPPLPTPSPPNENGDGNGNDDDNLDLSLHLPQPLPPP-----LPTPSPPS--LTPNPTMNFFPLPSSPTANPVAPT--RFR
        P   ++    + L    P     P+P  PN+        +  +  +  L Q +PPP      P P  PS  + P P +N     +  T     P   + R
Subjt:  PNDNDDDNLDLSLHIPQPLLPPLPTPSPPNENGDGNGNDDDNLDLSLHLPQPLPPP-----LPTPSPPS--LTPNPTMNFFPLPSSPTANPVAPT--RFR

Query:  RQLLRP--------GKTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKRLD
        R   RP        G  E +P PYPWA  +   I+S  +L  + I  I G++ CK C++   +EYNL EKF E+  +I  NK EM +RAP  W+TPK + 
Subjt:  RQLLRP--------GKTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKRLD

Query:  CESCGGERCVGAVIGKKRDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQYHI
        C +C  E     +  +K +INWLFL LGQMLG C+L+QL+YFC+    H+TG+++R++YI Y SLC QL+P G +++
Subjt:  CESCGGERCVGAVIGKKRDINWLFLFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQYHI

AT2G16190.2 FUNCTIONS IN: molecular_function unknown1.2e-2434.31Show/hide
Query:  PNDNDDDNLDLSLHIPQPLLPPLPTPSPPNENGDGNGNDDDNLDLSLHLPQPLPPP-----LPTPSPPS--LTPNPTMNFFPLPSSPTANPVAPT--RFR
        P   ++    + L    P     P+P  PN+        +  +  +  L Q +PPP      P P  PS  + P P +N     +  T     P   + R
Subjt:  PNDNDDDNLDLSLHIPQPLLPPLPTPSPPNENGDGNGNDDDNLDLSLHLPQPLPPP-----LPTPSPPS--LTPNPTMNFFPLPSSPTANPVAPT--RFR

Query:  RQLLRP--------GKTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKRLD
        R   RP        G  E +P PYPWA  +   I+S  +L  + I  I G++ CK C++   +EYNL EKF E+  +I  NK EM +RAP  W+TPK + 
Subjt:  RQLLRP--------GKTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKRLD

Query:  CESCGGERCVGAVIGKKRDINWLFLFLGQMLGYCSLEQL
        C +C  E     +  +K +INWLFL LGQMLG C+L+QL
Subjt:  CESCGGERCVGAVIGKKRDINWLFLFLGQMLGYCSLEQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGAAACATCTCCCAATCTTCATCATCTTTCCCCTCCCAACGACAACGATGATGATAATCTCGACCTCTCCCTTCACATCCCCCAACCGCTCCTGCCACCATTGCC
AACTCCTTCCCCTCCTAATGAGAATGGTGACGGCAACGGCAACGATGATGATAATCTCGACCTCTCCCTTCACCTCCCCCAACCGCTCCCGCCACCATTGCCAACTCCAT
CACCTCCATCGCTGACACCAAACCCTACCATGAATTTCTTCCCTTTGCCATCATCTCCGACTGCAAACCCTGTTGCTCCCACGCGGTTCCGAAGGCAATTACTCCGACCA
GGAAAAACCGAGACAATACCAGCGCCATATCCATGGGCGAAGAACCGACGAGCAATCATACGCAGCATGGGAAACCTAGTCGGAGACGGGATCTTAAAAATCGTCGGGAA
AATGAAGTGCAAAAGGTGTGAGAAAGAGAGTGAAATAGAATATAACCTAGTGGAGAAGTTCAGAGAAGTAGAGAATTTCATAGTGGAAAACAAATCTGAGATGTACAACC
GTGCGCCGGAGGTTTGGAACACCCCGAAGCGGCTCGACTGCGAGAGTTGTGGCGGAGAAAGGTGTGTCGGGGCAGTGATAGGGAAGAAGAGAGACATAAATTGGTTGTTC
TTGTTTTTAGGGCAAATGCTTGGGTATTGCAGCTTAGAACAACTAAAATATTTTTGTAAGCATACAAGAAACCACCAAACAGGGGCAAGAAATAGGCTTCTCTACATTGC
CTATTTCTCTTTGTGCAATCAACTTAATCCCCATGGCCAATATCATATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCATGAAACATCTCCCAATCTTCATCATCTTTCCCCTCCCAACGACAACGATGATGATAATCTCGACCTCTCCCTTCACATCCCCCAACCGCTCCTGCCACCATTGCC
AACTCCTTCCCCTCCTAATGAGAATGGTGACGGCAACGGCAACGATGATGATAATCTCGACCTCTCCCTTCACCTCCCCCAACCGCTCCCGCCACCATTGCCAACTCCAT
CACCTCCATCGCTGACACCAAACCCTACCATGAATTTCTTCCCTTTGCCATCATCTCCGACTGCAAACCCTGTTGCTCCCACGCGGTTCCGAAGGCAATTACTCCGACCA
GGAAAAACCGAGACAATACCAGCGCCATATCCATGGGCGAAGAACCGACGAGCAATCATACGCAGCATGGGAAACCTAGTCGGAGACGGGATCTTAAAAATCGTCGGGAA
AATGAAGTGCAAAAGGTGTGAGAAAGAGAGTGAAATAGAATATAACCTAGTGGAGAAGTTCAGAGAAGTAGAGAATTTCATAGTGGAAAACAAATCTGAGATGTACAACC
GTGCGCCGGAGGTTTGGAACACCCCGAAGCGGCTCGACTGCGAGAGTTGTGGCGGAGAAAGGTGTGTCGGGGCAGTGATAGGGAAGAAGAGAGACATAAATTGGTTGTTC
TTGTTTTTAGGGCAAATGCTTGGGTATTGCAGCTTAGAACAACTAAAATATTTTTGTAAGCATACAAGAAACCACCAAACAGGGGCAAGAAATAGGCTTCTCTACATTGC
CTATTTCTCTTTGTGCAATCAACTTAATCCCCATGGCCAATATCATATTTGA
Protein sequenceShow/hide protein sequence
MHETSPNLHHLSPPNDNDDDNLDLSLHIPQPLLPPLPTPSPPNENGDGNGNDDDNLDLSLHLPQPLPPPLPTPSPPSLTPNPTMNFFPLPSSPTANPVAPTRFRRQLLRP
GKTETIPAPYPWAKNRRAIIRSMGNLVGDGILKIVGKMKCKRCEKESEIEYNLVEKFREVENFIVENKSEMYNRAPEVWNTPKRLDCESCGGERCVGAVIGKKRDINWLF
LFLGQMLGYCSLEQLKYFCKHTRNHQTGARNRLLYIAYFSLCNQLNPHGQYHI