; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0000640 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0000640
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Descriptionhydroxyproline-rich glycoprotein family protein
Genome locationchr11:5521936..5522709
RNA-Seq ExpressionPI0000640
SyntenyPI0000640
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572024.1 hypothetical protein SDJN03_28752, partial [Cucurbita argyrosperma subsp. sororia]1.5e-6157.63Show/hide
Query:  TQPPTSTLPSSQTPNEESEAIPQPNNETSSQ--TPNEEFEAIPQPNN---ETSNDHQQQQRRVRRRRTRADMTRIEPPYPWSTDRRAVVHELKYLQSNNI
        +QPPT T    +TPN +  AIPQ   ET +Q  T ++  E  P  +    + +N H   + R RR RTRAD  RIEPPYPWS ++RA +H L+YLQSNNI
Subjt:  TQPPTSTLPSSQTPNEESEAIPQPNNETSSQ--TPNEEFEAIPQPNN---ETSNDHQQQQRRVRRRRTRADMTRIEPPYPWSTDRRAVVHELKYLQSNNI

Query:  MTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDD----TKINWLFLFLGQFLGCLKLKQL
        +TIKG+V CKKCE  YE+EY+LMNK +EI RF E E DNMHDRAP CW  P LPNC  C EE CV P+I  E+D    ++INWLFL LGQ +G LKLKQL
Subjt:  MTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDD----TKINWLFLFLGQFLGCLKLKQL

Query:  KYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPSTTL
        KYFCA T  HRTGAK+RL++L+Y AL  QLQPS  L
Subjt:  KYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPSTTL

XP_008463189.1 PREDICTED: uncharacterized protein LOC103501397 [Cucumis melo]2.5e-9879.59Show/hide
Query:  DLDLSLRPPSLLPSPPPHEYTQPPTSTLPSSQTPNEESEAIPQPNNETSSQTPNEEFEAIPQPNNETSNDHQQQQRRVRRRRTRADMTRIEPPYPWSTDR
        DLDLSLR PSL P  PP +Y+QPPTSTLP                    SQ  NEE EAI +PNNETSN+ QQQ+   RRRRTRADMTRIEPPYPWSTDR
Subjt:  DLDLSLRPPSLLPSPPPHEYTQPPTSTLPSSQTPNEESEAIPQPNNETSSQTPNEEFEAIPQPNNETSNDHQQQQRRVRRRRTRADMTRIEPPYPWSTDR

Query:  RAVVHELKYLQSNNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDDTKINWLFLFLG
        RAVVHELKYLQ NNIMTIKGEVICKKCEM+YEMEYDLMNKVNEITRFFEEEID+MHDRAPSCWT PNLPNC+LCNEEKCVMPV SQE DTKINWLFLFLG
Subjt:  RAVVHELKYLQSNNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDDTKINWLFLFLG

Query:  QFLGCLKLKQLKYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPST
        QFLGCLKL+QLKYFC QTNIHRTGAKNRLLYLSYR LF QLQP T
Subjt:  QFLGCLKLKQLKYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPST

XP_011656206.1 uncharacterized protein LOC105435666 [Cucumis sativus]2.1e-10579.53Show/hide
Query:  ERSPDLDLSLRPPSLLPSPPPHEYTQPPTSTLPSSQTPNEESEAIPQPNNETSSQTPNEEFEAIPQPNNETSNDHQQQQRRVRRRRTRADMTRIEPPYPW
        ERSP LDLSLRPPS  P P P EY Q  +STLPSSQ PNEES                    A PQPN ETSND QQ +RR+RRRRTRADMTRIEPPYPW
Subjt:  ERSPDLDLSLRPPSLLPSPPPHEYTQPPTSTLPSSQTPNEESEAIPQPNNETSSQTPNEEFEAIPQPNNETSNDHQQQQRRVRRRRTRADMTRIEPPYPW

Query:  STDRRAVVHELKYLQSNNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDDTKINWLF
        +TD+RAVVHELKYLQSNNIM IKGEVICKKCEM+YE+EYDLMNKVNEITRFFEEEID+MHDRAP+CWT+PNLPNCN CNEEKCVMPVIS+EDD+KINWLF
Subjt:  STDRRAVVHELKYLQSNNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDDTKINWLF

Query:  LFLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPSTTLNIN
        LFLGQFLGCL+LKQLK+FCAQ+NIHRTGAKNRLLYLSYRALFHQLQPS TLNIN
Subjt:  LFLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPSTTLNIN

XP_022953023.1 mucin-16-like [Cucurbita moschata]2.1e-6057.2Show/hide
Query:  TQPPTSTLPSSQTPNEESEAIPQPNNETSSQ--TPNEEFEAIPQPNN---ETSNDHQQQQRRVRRRRTRADMTRIEPPYPWSTDRRAVVHELKYLQSNNI
        +QPPT T    +T N +  AIPQ   ET +Q  T ++  E  P  +    + +N H   + R RR RTRAD  RIEPPYPWS ++RA +H L+YLQSNNI
Subjt:  TQPPTSTLPSSQTPNEESEAIPQPNNETSSQ--TPNEEFEAIPQPNN---ETSNDHQQQQRRVRRRRTRADMTRIEPPYPWSTDRRAVVHELKYLQSNNI

Query:  MTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDD----TKINWLFLFLGQFLGCLKLKQL
        +TIKG+V CKKCE  YE+EY+LMNK +EI RF E E DNMHDRAP CW  P LPNC  C EE CV P+I  E+D    ++INWLFL LGQ +G LKLKQL
Subjt:  MTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDD----TKINWLFLFLGQFLGCLKLKQL

Query:  KYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPSTTL
        KYFCA T  HRTGAK+RL++L+Y AL  QLQPS  L
Subjt:  KYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPSTTL

XP_038895979.1 junction-mediating and -regulatory protein-like [Benincasa hispida]1.3e-8369.38Show/hide
Query:  DLDLSLRPPSLLPSPPPHEYTQPPTSTLPSSQTPNEESE-AIPQPNNETS---SQTPNEEFEAIPQPNNETSNDHQQQ-------QRRVRRRRTRADMTR
        +L+LSLR    LPSPPP E   PP    P  ++P   S    P  +  TS    +TPNE  E   Q NNETSN HQQQ       Q R RRRRTRADMTR
Subjt:  DLDLSLRPPSLLPSPPPHEYTQPPTSTLPSSQTPNEESE-AIPQPNNETS---SQTPNEEFEAIPQPNNETSNDHQQQ-------QRRVRRRRTRADMTR

Query:  IEPPYPWSTDRRAVVHELKYLQSNNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDD
        IEPPYPWSTDRRAV+HELKYLQSNNI+TIKGEV CKKCE +YEMEYDLMNK NEI RF E E D+MHDRAP CWT+P LPNCNLCN+E+CV PVIS+ED 
Subjt:  IEPPYPWSTDRRAVVHELKYLQSNNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDD

Query:  TKINWLFLFLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPSTTL
        TKINWLFL LG+FLGCLKLKQLKYFCAQTNIHRTGAKNRLLYL Y  L +QLQPS  L
Subjt:  TKINWLFLFLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPSTTL

TrEMBL top hitse value%identityAlignment
A0A0A0KMQ2 Uncharacterized protein1.0e-10579.53Show/hide
Query:  ERSPDLDLSLRPPSLLPSPPPHEYTQPPTSTLPSSQTPNEESEAIPQPNNETSSQTPNEEFEAIPQPNNETSNDHQQQQRRVRRRRTRADMTRIEPPYPW
        ERSP LDLSLRPPS  P P P EY Q  +STLPSSQ PNEES                    A PQPN ETSND QQ +RR+RRRRTRADMTRIEPPYPW
Subjt:  ERSPDLDLSLRPPSLLPSPPPHEYTQPPTSTLPSSQTPNEESEAIPQPNNETSSQTPNEEFEAIPQPNNETSNDHQQQQRRVRRRRTRADMTRIEPPYPW

Query:  STDRRAVVHELKYLQSNNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDDTKINWLF
        +TD+RAVVHELKYLQSNNIM IKGEVICKKCEM+YE+EYDLMNKVNEITRFFEEEID+MHDRAP+CWT+PNLPNCN CNEEKCVMPVIS+EDD+KINWLF
Subjt:  STDRRAVVHELKYLQSNNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDDTKINWLF

Query:  LFLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPSTTLNIN
        LFLGQFLGCL+LKQLK+FCAQ+NIHRTGAKNRLLYLSYRALFHQLQPS TLNIN
Subjt:  LFLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPSTTLNIN

A0A1S3AZB1 protein PAF1 homolog1.3e-6054.94Show/hide
Query:  PSPPPHEYTQPPTSTLPSSQTPNEESEAIPQPNNETSSQTPNEEFEAIPQPNNETSNDHQQQQRRVRRRRTRADMTRIEPPYPWSTDRRAVVHELKYLQS
        P PPP     P +  LP    P       P P  +T +Q P      IP+P  +T N    +  + +RRRT+AD +RIEPPYPWST++ AV+H+L+YL++
Subjt:  PSPPPHEYTQPPTSTLPSSQTPNEESEAIPQPNNETSSQTPNEEFEAIPQPNNETSNDHQQQQRRVRRRRTRADMTRIEPPYPWSTDRRAVVHELKYLQS

Query:  NNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDDTKINWLFLFLGQFLGCLKLKQLK
        NNI+TIKGEV CK+C+ + E+EY+L++K +EI RF E E DNMHDRAP  W  P L NCN CN+E+CV P+IS E ++ INWLFL LG FLGCLKL QLK
Subjt:  NNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDDTKINWLFLFLGQFLGCLKLKQLK

Query:  YFCAQTNIHRTGAKNRLLYLSYRALFHQLQPST
        YFC QTNIHRTGAK+RL+YL+Y AL  QLQP++
Subjt:  YFCAQTNIHRTGAKNRLLYLSYRALFHQLQPST

A0A1S3CK70 uncharacterized protein LOC1035013971.2e-9879.59Show/hide
Query:  DLDLSLRPPSLLPSPPPHEYTQPPTSTLPSSQTPNEESEAIPQPNNETSSQTPNEEFEAIPQPNNETSNDHQQQQRRVRRRRTRADMTRIEPPYPWSTDR
        DLDLSLR PSL P  PP +Y+QPPTSTLP                    SQ  NEE EAI +PNNETSN+ QQQ+   RRRRTRADMTRIEPPYPWSTDR
Subjt:  DLDLSLRPPSLLPSPPPHEYTQPPTSTLPSSQTPNEESEAIPQPNNETSSQTPNEEFEAIPQPNNETSNDHQQQQRRVRRRRTRADMTRIEPPYPWSTDR

Query:  RAVVHELKYLQSNNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDDTKINWLFLFLG
        RAVVHELKYLQ NNIMTIKGEVICKKCEM+YEMEYDLMNKVNEITRFFEEEID+MHDRAPSCWT PNLPNC+LCNEEKCVMPV SQE DTKINWLFLFLG
Subjt:  RAVVHELKYLQSNNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDDTKINWLFLFLG

Query:  QFLGCLKLKQLKYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPST
        QFLGCLKL+QLKYFC QTNIHRTGAKNRLLYLSYR LF QLQP T
Subjt:  QFLGCLKLKQLKYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPST

A0A6J1GM83 mucin-16-like1.0e-6057.2Show/hide
Query:  TQPPTSTLPSSQTPNEESEAIPQPNNETSSQ--TPNEEFEAIPQPNN---ETSNDHQQQQRRVRRRRTRADMTRIEPPYPWSTDRRAVVHELKYLQSNNI
        +QPPT T    +T N +  AIPQ   ET +Q  T ++  E  P  +    + +N H   + R RR RTRAD  RIEPPYPWS ++RA +H L+YLQSNNI
Subjt:  TQPPTSTLPSSQTPNEESEAIPQPNNETSSQ--TPNEEFEAIPQPNN---ETSNDHQQQQRRVRRRRTRADMTRIEPPYPWSTDRRAVVHELKYLQSNNI

Query:  MTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDD----TKINWLFLFLGQFLGCLKLKQL
        +TIKG+V CKKCE  YE+EY+LMNK +EI RF E E DNMHDRAP CW  P LPNC  C EE CV P+I  E+D    ++INWLFL LGQ +G LKLKQL
Subjt:  MTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDD----TKINWLFLFLGQFLGCLKLKQL

Query:  KYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPSTTL
        KYFCA T  HRTGAK+RL++L+Y AL  QLQPS  L
Subjt:  KYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPSTTL

A0A6J1I8I0 uncharacterized protein KIAA0754-like3.0e-6056.67Show/hide
Query:  LPSPPPHEYTQPPTSTLPSSQTPNEESEAIPQPNNETSSQTPNEEFEAIPQPNNETSNDHQQQQRRVRRRRTRADMTRIEPPYPWSTDRRAVVHELKYLQ
        +P       +QP T T    +TPN+   AIPQ    T  +TPN+    IPQ     +N H   + R RR RTRAD  RIEPPYPWS ++RA +H L+YLQ
Subjt:  LPSPPPHEYTQPPTSTLPSSQTPNEESEAIPQPNNETSSQTPNEEFEAIPQPNNETSNDHQQQQRRVRRRRTRADMTRIEPPYPWSTDRRAVVHELKYLQ

Query:  SNNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDD----TKINWLFLFLGQFLGCLK
        SNNI+ IKG+V CKKCE  YE+EY+LMNK +EI RF E E DNMHDRAP CW  P LPNC  C EE CV P+I  E+D     +INWLFL LGQ +G LK
Subjt:  SNNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDD----TKINWLFLFLGQFLGCLK

Query:  LKQLKYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPSTTL
        LKQLKYFCA T  HRTGAK+RL++L+Y AL  QLQPS  L
Subjt:  LKQLKYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPSTTL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein4.2e-3540.08Show/hide
Query:  SLLPSPPPHEYTQPPTSTLPSSQTPNEESEAIPQPNNETSSQTPNEEFEAIPQPNNETSNDHQQQQRRVRRRRTRADMTR----IEPPYPWSTDRRAVVH
        S++P PPP  +  P   +    QTPN        P   T    P+    A P  N       +     VR  R+R+ +++    I PP+PW+T+RR  + 
Subjt:  SLLPSPPPHEYTQPPTSTLPSSQTPNEESEAIPQPNNETSSQTPNEEFEAIPQPNNETSNDHQQQQRRVRRRRTRADMTR----IEPPYPWSTDRRAVVH

Query:  ELKYLQSNNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDDTKINWLFLFLGQFLGC
         L+YL+SN I TI GEV C+ CE  Y++ Y+L  +  E+ +F+  E   M DRA   W  P    C LC  EK V PVI+ E  ++INWLFL LGQ LG 
Subjt:  ELKYLQSNNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDDTKINWLFLFLGQFLGC

Query:  LKLKQLKYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPSTTL
          L+QLK FC  +  HRTGAK+R+LYL+Y  L   LQP + L
Subjt:  LKLKQLKYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPSTTL

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)1.6e-2933.85Show/hide
Query:  PPSLLPSPP-PHEYTQPPTSTLPSSQTPNEESEAIPQPNNETSSQTPNEEFEAI--PQPNNETSNDHQQQQRRVR------RRRTRADMTRIE-------
        P +  PSPP P++ T     T           +A+P PN    +  P +  E +  P   N+ +       RR R      RR ++  +  +E       
Subjt:  PPSLLPSPP-PHEYTQPPTSTLPSSQTPNEESEAIPQPNNETSSQTPNEEFEAI--PQPNNETSNDHQQQQRRVR------RRRTRADMTRIE-------

Query:  --PPYPWSTDRRAVVHELKYLQSNNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDD
          PPYPW+T +   +   + L SNNI  I G+V CK C+    +EY+L  K +E+  + +   + M  RAP  W+ P L  C  C  E  + PV+S+  +
Subjt:  --PPYPWSTDRRAVVHELKYLQSNNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDD

Query:  TKINWLFLFLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPSTTLNI
         +INWLFL LGQ LGC  L QL+YFC   + HRTG+K+R++Y++Y +L  QL P    N+
Subjt:  TKINWLFLFLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLLYLSYRALFHQLQPSTTLNI

AT2G16190.2 FUNCTIONS IN: molecular_function unknown1.2e-1832.43Show/hide
Query:  PPSLLPSPP-PHEYTQPPTSTLPSSQTPNEESEAIPQPNNETSSQTPNEEFEAI--PQPNNETSNDHQQQQRRVR------RRRTRADMTRIE-------
        P +  PSPP P++ T     T           +A+P PN    +  P +  E +  P   N+ +       RR R      RR ++  +  +E       
Subjt:  PPSLLPSPP-PHEYTQPPTSTLPSSQTPNEESEAIPQPNNETSSQTPNEEFEAI--PQPNNETSNDHQQQQRRVR------RRRTRADMTRIE-------

Query:  --PPYPWSTDRRAVVHELKYLQSNNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDD
          PPYPW+T +   +   + L SNNI  I G+V CK C+    +EY+L  K +E+  + +   + M  RAP  W+ P L  C  C  E  + PV+S+  +
Subjt:  --PPYPWSTDRRAVVHELKYLQSNNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDD

Query:  TKINWLFLFLGQFLGCLKLKQL
         +INWLFL LGQ LGC  L QL
Subjt:  TKINWLFLFLGQFLGCLKLKQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGCCGAAAGGAGCCCCGATCTTGATCTTTCTCTCCGTCCCCCGTCTCTTCTTCCTTCTCCTCCACCACACGAATATACACAGCCACCCACATCGACATTGCCCTC
GTCGCAAACTCCAAACGAAGAATCCGAGGCAATCCCTCAACCCAACAATGAAACTTCGTCGCAAACTCCAAACGAAGAATTTGAGGCAATCCCTCAACCCAACAATGAAA
CTTCAAATGACCACCAGCAGCAACAACGGAGAGTGAGACGACGTAGAACGAGAGCAGACATGACAAGGATTGAGCCACCATATCCATGGTCGACGGACAGACGAGCGGTA
GTCCACGAACTCAAGTACCTTCAATCAAACAACATAATGACAATCAAGGGGGAAGTGATATGCAAAAAATGTGAGATGAGGTATGAAATGGAATATGATCTAATGAACAA
GGTTAATGAAATAACAAGATTCTTTGAAGAAGAAATAGATAATATGCATGATAGAGCTCCAAGTTGTTGGACAAGACCTAATTTACCAAATTGCAATTTATGCAATGAAG
AAAAATGTGTAATGCCAGTGATATCTCAAGAAGATGATACAAAAATCAATTGGTTGTTCTTGTTCTTGGGGCAATTTCTTGGATGTTTGAAGCTCAAACAACTCAAATAT
TTTTGTGCTCAAACAAATATTCATAGAACTGGGGCCAAGAATCGTCTTCTTTATCTCAGTTATCGTGCTTTGTTCCATCAACTCCAACCCTCCACGACACTCAACATTAA
TTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGCCGAAAGGAGCCCCGATCTTGATCTTTCTCTCCGTCCCCCGTCTCTTCTTCCTTCTCCTCCACCACACGAATATACACAGCCACCCACATCGACATTGCCCTC
GTCGCAAACTCCAAACGAAGAATCCGAGGCAATCCCTCAACCCAACAATGAAACTTCGTCGCAAACTCCAAACGAAGAATTTGAGGCAATCCCTCAACCCAACAATGAAA
CTTCAAATGACCACCAGCAGCAACAACGGAGAGTGAGACGACGTAGAACGAGAGCAGACATGACAAGGATTGAGCCACCATATCCATGGTCGACGGACAGACGAGCGGTA
GTCCACGAACTCAAGTACCTTCAATCAAACAACATAATGACAATCAAGGGGGAAGTGATATGCAAAAAATGTGAGATGAGGTATGAAATGGAATATGATCTAATGAACAA
GGTTAATGAAATAACAAGATTCTTTGAAGAAGAAATAGATAATATGCATGATAGAGCTCCAAGTTGTTGGACAAGACCTAATTTACCAAATTGCAATTTATGCAATGAAG
AAAAATGTGTAATGCCAGTGATATCTCAAGAAGATGATACAAAAATCAATTGGTTGTTCTTGTTCTTGGGGCAATTTCTTGGATGTTTGAAGCTCAAACAACTCAAATAT
TTTTGTGCTCAAACAAATATTCATAGAACTGGGGCCAAGAATCGTCTTCTTTATCTCAGTTATCGTGCTTTGTTCCATCAACTCCAACCCTCCACGACACTCAACATTAA
TTGA
Protein sequenceShow/hide protein sequence
MKAERSPDLDLSLRPPSLLPSPPPHEYTQPPTSTLPSSQTPNEESEAIPQPNNETSSQTPNEEFEAIPQPNNETSNDHQQQQRRVRRRRTRADMTRIEPPYPWSTDRRAV
VHELKYLQSNNIMTIKGEVICKKCEMRYEMEYDLMNKVNEITRFFEEEIDNMHDRAPSCWTRPNLPNCNLCNEEKCVMPVISQEDDTKINWLFLFLGQFLGCLKLKQLKY
FCAQTNIHRTGAKNRLLYLSYRALFHQLQPSTTLNIN