; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi11G010860 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi11G010860
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionhydroxyproline-rich glycoprotein family protein
Genome locationchr11:18962766..18963635
RNA-Seq ExpressionLsi11G010860
SyntenyLsi11G010860
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008439384.1 PREDICTED: protein PAF1 homolog [Cucumis melo]1.3e-7159.16Show/hide
Query:  HSPPPPSESLS----SRSPSPPPECPILF---LHLPLQSPPRSLECPQPPIPTSPS----PEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADTTRIE
        H PPPP   LS      SP PPP  P  F   L LPLQSPP   +  QPP PT  +    PEIP +P  +T N   ++      P+ +R+RT+AD +RIE
Subjt:  HSPPPPSESLS----SRSPSPPPECPILF---LHLPLQSPPRSLECPQPPIPTSPS----PEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADTTRIE

Query:  PPYPWSTDRRAVVHELEYLESNNILTIKGELKCKKCEKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEE
        PPYPWST++ AV+H+LEYLE+NNILTIKGE+KCK+C++K EIEY+L +KFDEI  FIE++K++MHDRAP  W NPIL NCN CNKE+CVEP+I       
Subjt:  PPYPWSTDRRAVVHELEYLESNNILTIKGELKCKKCEKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEE

Query:  EEDDKFKKINWLFLLLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLIYLVYLALCRQLQPSN
           +    INWLFLLLG FLGCLKL QLKYFC QTNIHRTGAK+RLIYL YLALC+QLQP++
Subjt:  EEDDKFKKINWLFLLLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLIYLVYLALCRQLQPSN

XP_008463189.1 PREDICTED: uncharacterized protein LOC103501397 [Cucumis melo]7.3e-7063.79Show/hide
Query:  LFLHLPLQSPPRSLECPQPPIPTSPS------PEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADTTRIEPPYPWSTDRRAVVHELEYLESNNILTIK
        L L  P   PP   +  QPP  T PS       E   RPNNETSN+ QQ        + RR+RTRAD TRIEPPYPWSTDRRAVVHEL+YL+ NNI+TIK
Subjt:  LFLHLPLQSPPRSLECPQPPIPTSPS------PEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADTTRIEPPYPWSTDRRAVVHELEYLESNNILTIK

Query:  GELKCKKCEKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEEDDKFKKINWLFLLLGQFLGCLKLKQL
        GE+ CKKCE KYE+EYDL NK +EI  F E++ +SMHDRAP CWTNP LPNC+ CN+EKCV PV  T +E+        KINWLFL LGQFLGCLKL+QL
Subjt:  GELKCKKCEKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEEDDKFKKINWLFLLLGQFLGCLKLKQL

Query:  KYFCAQTNIHRTGAKNRLIYLVYLALCRQLQP
        KYFC QTNIHRTGAKNRL+YL Y  L RQLQP
Subjt:  KYFCAQTNIHRTGAKNRLIYLVYLALCRQLQP

XP_022953023.1 mucin-16-like [Cucurbita moschata]2.8e-6960.43Show/hide
Query:  LHLPLQSPPRSLECPQPPIPTSPSPEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADTTRIEPPYPWSTDRRAVVHELEYLESNNILTIKGELKCKKC
        L+ P+  P    E P     T+    I Q PN  T+        +  +PR RR RTRADT RIEPPYPWS ++RA +H LEYL+SNNI+TIKG+++CKKC
Subjt:  LHLPLQSPPRSLECPQPPIPTSPSPEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADTTRIEPPYPWSTDRRAVVHELEYLESNNILTIKGELKCKKC

Query:  EKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEEDDKFKKINWLFLLLGQFLGCLKLKQLKYFCAQTN
        E+ YEIEY+L NKFDEIA FIE+++++MHDRAPICW NPILPNC  C +E CVEP+I     +EE+D++F +INWLFLLLGQ +G LKLKQLKYFCA T 
Subjt:  EKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEEDDKFKKINWLFLLLGQFLGCLKLKQLKYFCAQTN

Query:  IHRTGAKNRLIYLVYLALCRQLQPSNINFN
         HRTGAK+RLI+L YLALC+QLQPSN  FN
Subjt:  IHRTGAKNRLIYLVYLALCRQLQPSNINFN

XP_022972400.1 uncharacterized protein KIAA0754-like [Cucurbita maxima]3.6e-6961.19Show/hide
Query:  LECPQPPIPTSPSPEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADTTRIEPPYPWSTDRRAVVHELEYLESNNILTIKGELKCKKCEKKYEIEYDLR
        +E P  P+    + E  + PN  T+        +  +PR RR RTRADT RIEPPYPWS ++RA +H LEYL+SNNI+ IKG+++CKKCE+ YEIEY+L 
Subjt:  LECPQPPIPTSPSPEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADTTRIEPPYPWSTDRRAVVHELEYLESNNILTIKGELKCKKCEKKYEIEYDLR

Query:  NKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEEDDKFKKINWLFLLLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLI
        NKFDEIA FIE+++++MHDRAPICW NPILPNC  C +E CVEP+I     +EE+D++F++INWLFLLLGQ +G LKLKQLKYFCA T  HRTGAK+RLI
Subjt:  NKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEEDDKFKKINWLFLLLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLI

Query:  YLVYLALCRQLQPSNINFN
        +L YLALC+QLQPSN  FN
Subjt:  YLVYLALCRQLQPSNINFN

XP_038895979.1 junction-mediating and -regulatory protein-like [Benincasa hispida]5.0e-8766.2Show/hide
Query:  SQSNNQGYLNLNLSLHLSSHSPPPPSESLSSRSPSPPPECPILFLHLPLQS--PPRSLECPQPPIPTSPSP--------EIPQRPNNETSNHHQQLEETI
        +Q++ Q   NL LSL L S   PPP E     SP PPP  P      PL+S  PP  LE P     TS SP        EIP + NNETSNH QQ  E +
Subjt:  SQSNNQGYLNLNLSLHLSSHSPPPPSESLSSRSPSPPPECPILFLHLPLQS--PPRSLECPQPPIPTSPSP--------EIPQRPNNETSNHHQQLEETI

Query:  E-QPRARRQRTRADTTRIEPPYPWSTDRRAVVHELEYLESNNILTIKGELKCKKCEKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCN
        E QPR RR+RTRAD TRIEPPYPWSTDRRAV+HEL+YL+SNNI+TIKGE+KCKKCE+KYE+EYDL NKF+EIA FIE +K+SMHDRAP CWT PILPNCN
Subjt:  E-QPRARRQRTRADTTRIEPPYPWSTDRRAVVHELEYLESNNILTIKGELKCKKCEKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCN

Query:  SCNKEKCVEPVIITQEEEEEEDDKFKKINWLFLLLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLIYLVYLALCRQLQPSNINFNVS
         CNKE+CVEPVI       EED  + KINWLFLLLG+FLGCLKLKQLKYFCAQTNIHRTGAKNRL+YL+YL LC QLQPSN  F +S
Subjt:  SCNKEKCVEPVIITQEEEEEEDDKFKKINWLFLLLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLIYLVYLALCRQLQPSNINFNVS

TrEMBL top hitse value%identityAlignment
A0A0A0KMQ2 Uncharacterized protein2.5e-6861.9Show/hide
Query:  PSESLSSRSPS-PPPECPILFLHLPLQSPPRSLECPQPPIPTSPSPEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADTTRIEPPYPWSTDRRAVVHE
        P   LS R PS  PP  P        + P  S   P   IP   S   PQ PN ETSN  QQ      + R RR+RTRAD TRIEPPYPW+TD+RAVVHE
Subjt:  PSESLSSRSPS-PPPECPILFLHLPLQSPPRSLECPQPPIPTSPSPEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADTTRIEPPYPWSTDRRAVVHE

Query:  LEYLESNNILTIKGELKCKKCEKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEEDDKFKKINWLFLL
        L+YL+SNNI+ IKGE+ CKKCE KYEIEYDL NK +EI  F E++ +SMHDRAP CWT P LPNCN CN+EKCV PVI       +EDD   KINWLFL 
Subjt:  LEYLESNNILTIKGELKCKKCEKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEEDDKFKKINWLFLL

Query:  LGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLIYLVYLALCRQLQPS---NIN
        LGQFLGCL+LKQLK+FCAQ+NIHRTGAKNRL+YL Y AL  QLQPS   NIN
Subjt:  LGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLIYLVYLALCRQLQPS---NIN

A0A1S3AZB1 protein PAF1 homolog6.4e-7259.16Show/hide
Query:  HSPPPPSESLS----SRSPSPPPECPILF---LHLPLQSPPRSLECPQPPIPTSPS----PEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADTTRIE
        H PPPP   LS      SP PPP  P  F   L LPLQSPP   +  QPP PT  +    PEIP +P  +T N   ++      P+ +R+RT+AD +RIE
Subjt:  HSPPPPSESLS----SRSPSPPPECPILF---LHLPLQSPPRSLECPQPPIPTSPS----PEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADTTRIE

Query:  PPYPWSTDRRAVVHELEYLESNNILTIKGELKCKKCEKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEE
        PPYPWST++ AV+H+LEYLE+NNILTIKGE+KCK+C++K EIEY+L +KFDEI  FIE++K++MHDRAP  W NPIL NCN CNKE+CVEP+I       
Subjt:  PPYPWSTDRRAVVHELEYLESNNILTIKGELKCKKCEKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEE

Query:  EEDDKFKKINWLFLLLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLIYLVYLALCRQLQPSN
           +    INWLFLLLG FLGCLKL QLKYFC QTNIHRTGAK+RLIYL YLALC+QLQP++
Subjt:  EEDDKFKKINWLFLLLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLIYLVYLALCRQLQPSN

A0A1S3CK70 uncharacterized protein LOC1035013973.5e-7063.79Show/hide
Query:  LFLHLPLQSPPRSLECPQPPIPTSPS------PEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADTTRIEPPYPWSTDRRAVVHELEYLESNNILTIK
        L L  P   PP   +  QPP  T PS       E   RPNNETSN+ QQ        + RR+RTRAD TRIEPPYPWSTDRRAVVHEL+YL+ NNI+TIK
Subjt:  LFLHLPLQSPPRSLECPQPPIPTSPS------PEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADTTRIEPPYPWSTDRRAVVHELEYLESNNILTIK

Query:  GELKCKKCEKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEEDDKFKKINWLFLLLGQFLGCLKLKQL
        GE+ CKKCE KYE+EYDL NK +EI  F E++ +SMHDRAP CWTNP LPNC+ CN+EKCV PV  T +E+        KINWLFL LGQFLGCLKL+QL
Subjt:  GELKCKKCEKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEEDDKFKKINWLFLLLGQFLGCLKLKQL

Query:  KYFCAQTNIHRTGAKNRLIYLVYLALCRQLQP
        KYFC QTNIHRTGAKNRL+YL Y  L RQLQP
Subjt:  KYFCAQTNIHRTGAKNRLIYLVYLALCRQLQP

A0A6J1GM83 mucin-16-like1.3e-6960.43Show/hide
Query:  LHLPLQSPPRSLECPQPPIPTSPSPEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADTTRIEPPYPWSTDRRAVVHELEYLESNNILTIKGELKCKKC
        L+ P+  P    E P     T+    I Q PN  T+        +  +PR RR RTRADT RIEPPYPWS ++RA +H LEYL+SNNI+TIKG+++CKKC
Subjt:  LHLPLQSPPRSLECPQPPIPTSPSPEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADTTRIEPPYPWSTDRRAVVHELEYLESNNILTIKGELKCKKC

Query:  EKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEEDDKFKKINWLFLLLGQFLGCLKLKQLKYFCAQTN
        E+ YEIEY+L NKFDEIA FIE+++++MHDRAPICW NPILPNC  C +E CVEP+I     +EE+D++F +INWLFLLLGQ +G LKLKQLKYFCA T 
Subjt:  EKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEEDDKFKKINWLFLLLGQFLGCLKLKQLKYFCAQTN

Query:  IHRTGAKNRLIYLVYLALCRQLQPSNINFN
         HRTGAK+RLI+L YLALC+QLQPSN  FN
Subjt:  IHRTGAKNRLIYLVYLALCRQLQPSNINFN

A0A6J1I8I0 uncharacterized protein KIAA0754-like1.8e-6961.19Show/hide
Query:  LECPQPPIPTSPSPEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADTTRIEPPYPWSTDRRAVVHELEYLESNNILTIKGELKCKKCEKKYEIEYDLR
        +E P  P+    + E  + PN  T+        +  +PR RR RTRADT RIEPPYPWS ++RA +H LEYL+SNNI+ IKG+++CKKCE+ YEIEY+L 
Subjt:  LECPQPPIPTSPSPEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADTTRIEPPYPWSTDRRAVVHELEYLESNNILTIKGELKCKKCEKKYEIEYDLR

Query:  NKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEEDDKFKKINWLFLLLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLI
        NKFDEIA FIE+++++MHDRAPICW NPILPNC  C +E CVEP+I     +EE+D++F++INWLFLLLGQ +G LKLKQLKYFCA T  HRTGAK+RLI
Subjt:  NKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEEDDKFKKINWLFLLLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLI

Query:  YLVYLALCRQLQPSNINFN
        +L YLALC+QLQPSN  FN
Subjt:  YLVYLALCRQLQPSNINFN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein4.2e-3939.37Show/hide
Query:  PPPSESLSSRS--PSPPPECPILFLHLPLQSPPRSLECPQPPIPTS----PSPEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADTTRIEPPYPWSTD
        P P++ L++RS  P PPP        +PL       + P PP   +    PS   P   N       + +  ++   R+R   ++   T I PP+PW+T+
Subjt:  PPPSESLSSRS--PSPPPECPILFLHLPLQSPPRSLECPQPPIPTS----PSPEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADTTRIEPPYPWSTD

Query:  RRAVVHELEYLESNNILTIKGELKCKKCEKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEEDDKFKK
        RR  +  LEYLESN I TI GE++C+ CEK Y++ Y+LR +F E+  F   +K  M DRA   W  P    C  C +EK V+PVI  ++ +         
Subjt:  RRAVVHELEYLESNNILTIKGELKCKKCEKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEEDDKFKK

Query:  INWLFLLLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLIYLVYLALCRQLQPSN
        INWLFLLLGQ LG   L+QLK FC  +  HRTGAK+R++YL Y+ LC+ LQP +
Subjt:  INWLFLLLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLIYLVYLALCRQLQPSN

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)1.5e-3335.8Show/hide
Query:  SPPPPSESLSSRSPSPPPECPILFLHLPLQSPPRSLECPQPPIPTS---PSPEIPQRPNNETSNHHQ------QLEETIEQPRARRQRTRADTTRIEPPY
        SPP P++  S  + +     P   L   +  P  S+  P P  P+    P P++ Q      +   +      Q     ++P A  +R   D   I PPY
Subjt:  SPPPPSESLSSRSPSPPPECPILFLHLPLQSPPRSLECPQPPIPTS---PSPEIPQRPNNETSNHHQ------QLEETIEQPRARRQRTRADTTRIEPPY

Query:  PWSTDRRAVVHELEYLESNNILTIKGELKCKKCEKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEED
        PW+T +   +     L SNNI  I G++ CK C++   +EY+L  KF E+  +I+  K  M  RAP  W+ P L  C +C  E  ++PV+  ++EE    
Subjt:  PWSTDRRAVVHELEYLESNNILTIKGELKCKKCEKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEED

Query:  DKFKKINWLFLLLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLIYLVYLALCRQLQP
             INWLFLLLGQ LGC  L QL+YFC   + HRTG+K+R++Y+ YL+LC+QL P
Subjt:  DKFKKINWLFLLLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLIYLVYLALCRQLQP

AT2G16190.2 FUNCTIONS IN: molecular_function unknown1.9e-2033.33Show/hide
Query:  SPPPPSESLSSRSPSPPPECPILFLHLPLQSPPRSLECPQPPIPTS---PSPEIPQRPNNETSNHHQ------QLEETIEQPRARRQRTRADTTRIEPPY
        SPP P++  S  + +     P   L   +  P  S+  P P  P+    P P++ Q      +   +      Q     ++P A  +R   D   I PPY
Subjt:  SPPPPSESLSSRSPSPPPECPILFLHLPLQSPPRSLECPQPPIPTS---PSPEIPQRPNNETSNHHQ------QLEETIEQPRARRQRTRADTTRIEPPY

Query:  PWSTDRRAVVHELEYLESNNILTIKGELKCKKCEKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEED
        PW+T +   +     L SNNI  I G++ CK C++   +EY+L  KF E+  +I+  K  M  RAP  W+ P L  C +C  E  ++PV+  ++EE    
Subjt:  PWSTDRRAVVHELEYLESNNILTIKGELKCKKCEKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEED

Query:  DKFKKINWLFLLLGQFLGCLKLKQL
             INWLFLLLGQ LGC  L QL
Subjt:  DKFKKINWLFLLLGQFLGCLKLKQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGACAGGAGAAACTAGCCAAAGCAATAACCAAGGCTACCTCAATCTCAATCTCTCTCTCCATCTATCGTCGCACTCCCCTCCGCCGCCATCTGAAAGTCTATCATC
CCGGTCTCCTTCTCCTCCGCCTGAATGTCCAATACTCTTTCTCCATTTGCCGTTGCAGTCTCCTCCTCGGTCACTTGAATGTCCACAACCGCCGATACCTACGTCGCCCT
CACCCGAGATCCCTCAACGTCCCAACAACGAAACTTCAAACCACCACCAACAACTAGAGGAGACGATAGAACAACCGAGAGCGAGACGACAAAGAACGAGAGCAGACACT
ACGAGGATCGAGCCACCGTATCCATGGTCGACGGATCGACGAGCGGTAGTACATGAACTGGAGTATCTTGAATCAAACAACATACTCACAATCAAGGGGGAATTGAAATG
CAAAAAATGTGAGAAAAAGTATGAGATTGAGTATGATTTAAGGAATAAGTTCGACGAGATAGCAAGTTTTATTGAAAAACAAAAAAATAGTATGCATGATAGAGCTCCAA
TTTGTTGGACAAACCCTATTTTACCAAATTGCAATTCGTGCAATAAAGAAAAATGTGTAGAGCCAGTTATAATTACTCAAGAAGAAGAGGAGGAGGAGGATGATAAATTC
AAGAAAATCAATTGGTTGTTCTTGCTTTTGGGACAATTTCTTGGATGTTTGAAGCTCAAACAACTCAAATATTTTTGTGCTCAAACTAATATTCATCGAACTGGGGCCAA
GAATCGTCTTATTTATCTAGTATATCTTGCTTTGTGTCGCCAACTTCAACCCTCCAATATAAACTTCAATGTTTCTGAAGTATTCGATCCAGAAACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGACAGGAGAAACTAGCCAAAGCAATAACCAAGGCTACCTCAATCTCAATCTCTCTCTCCATCTATCGTCGCACTCCCCTCCGCCGCCATCTGAAAGTCTATCATC
CCGGTCTCCTTCTCCTCCGCCTGAATGTCCAATACTCTTTCTCCATTTGCCGTTGCAGTCTCCTCCTCGGTCACTTGAATGTCCACAACCGCCGATACCTACGTCGCCCT
CACCCGAGATCCCTCAACGTCCCAACAACGAAACTTCAAACCACCACCAACAACTAGAGGAGACGATAGAACAACCGAGAGCGAGACGACAAAGAACGAGAGCAGACACT
ACGAGGATCGAGCCACCGTATCCATGGTCGACGGATCGACGAGCGGTAGTACATGAACTGGAGTATCTTGAATCAAACAACATACTCACAATCAAGGGGGAATTGAAATG
CAAAAAATGTGAGAAAAAGTATGAGATTGAGTATGATTTAAGGAATAAGTTCGACGAGATAGCAAGTTTTATTGAAAAACAAAAAAATAGTATGCATGATAGAGCTCCAA
TTTGTTGGACAAACCCTATTTTACCAAATTGCAATTCGTGCAATAAAGAAAAATGTGTAGAGCCAGTTATAATTACTCAAGAAGAAGAGGAGGAGGAGGATGATAAATTC
AAGAAAATCAATTGGTTGTTCTTGCTTTTGGGACAATTTCTTGGATGTTTGAAGCTCAAACAACTCAAATATTTTTGTGCTCAAACTAATATTCATCGAACTGGGGCCAA
GAATCGTCTTATTTATCTAGTATATCTTGCTTTGTGTCGCCAACTTCAACCCTCCAATATAAACTTCAATGTTTCTGAAGTATTCGATCCAGAAACTTGA
Protein sequenceShow/hide protein sequence
METGETSQSNNQGYLNLNLSLHLSSHSPPPPSESLSSRSPSPPPECPILFLHLPLQSPPRSLECPQPPIPTSPSPEIPQRPNNETSNHHQQLEETIEQPRARRQRTRADT
TRIEPPYPWSTDRRAVVHELEYLESNNILTIKGELKCKKCEKKYEIEYDLRNKFDEIASFIEKQKNSMHDRAPICWTNPILPNCNSCNKEKCVEPVIITQEEEEEEDDKF
KKINWLFLLLGQFLGCLKLKQLKYFCAQTNIHRTGAKNRLIYLVYLALCRQLQPSNINFNVSEVFDPET