; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10013858 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10013858
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionUncharacterised protein family (UPF0114)
Genome locationChr02:5446622..5451787
RNA-Seq ExpressionHG10013858
SyntenyHG10013858
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005134 - Uncharacterised protein family UPF0114


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008444243.1 PREDICTED: uncharacterized protein LOC103487632 isoform X1 [Cucumis melo]2.9e-12991.08Show/hide
Query:  MQPSPSLITGPIRTLTTTARPSTNIIQAYQYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELE
        MQPSPSLITGPIRT TTT RPST IIQAYQYQQPNPKFNSLFGYR DLVGAC R FPACAS S GPQVPAASAPLIQ+HLGAASRTSTLEK +TIEEELE
Subjt:  MQPSPSLITGPIRTLTTTARPSTNIIQAYQYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGS+LCF+KGCVHVA+SFSEYFVNRGKVIM+LVEAIDVYLLGTVMLVFGTGLYELFIS LG+ARS SK ++EHKSNLFGLF 
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFT

Query:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE
        LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVI SP DLLCLAVSIFLSSGTLFLLTKLTE
Subjt:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE

XP_022937012.1 uncharacterized protein LOC111443436 [Cucurbita moschata]6.1e-12789.59Show/hide
Query:  MQPSPSLITGPIRTLTTTARPSTNIIQAYQYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELE
        MQPSPSLITGPIRTLTTTARPST IIQAYQ+QQPNPKFN +FGYRADLVG CGRRFPACAS SSGPQVPAASAP +QS +GAASRTS LEK DT+EE LE
Subjt:  MQPSPSLITGPIRTLTTTARPSTNIIQAYQYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVA+S SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LGSARS S+ S+ H+SNLFGLFT
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFT

Query:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE
        LKERPKWMN+ TVNELKTKLGHVIVMLLLIGFFDKSKKVVI SPGDLLCLAVSIFLSS TLFLL+KLTE
Subjt:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE

XP_022976828.1 uncharacterized protein LOC111477089 [Cucurbita maxima]1.5e-12588.85Show/hide
Query:  MQPSPSLITGPIRTLTTTARPSTNIIQAYQYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELE
        MQPSPSLITGPIRTLTTTARPST IIQAYQ+QQPN KFN +FGYRADLVG CGRRFPACAS SSGPQVPAASAP +QS +GAASRTS LEK DT+EE LE
Subjt:  MQPSPSLITGPIRTLTTTARPSTNIIQAYQYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVA+S SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LGSARS S+  + H+SNLFGLFT
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFT

Query:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE
        LKERPKWMN+ TVNELKTKLGHVIVMLLLIGFFDKSKKVVI SPGDLLCLAVSIFLSS TLFLL+KLTE
Subjt:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE

XP_023536456.1 uncharacterized protein LOC111797628 [Cucurbita pepo subsp. pepo]2.3e-12689.22Show/hide
Query:  MQPSPSLITGPIRTLTTTARPSTNIIQAYQYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELE
        MQPSPSLITGPIRTL TTARPST IIQAYQ+QQPNPKFN +FGYRADLVG CGRRFPACAS SSGPQVPAASAP +QS +GAASRTS LEK DT+EE LE
Subjt:  MQPSPSLITGPIRTLTTTARPSTNIIQAYQYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVA+S SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LGSARS S+ S+ H+SNLFGLFT
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFT

Query:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE
        LKERPKWMN+ TVNELKTKLGHVIVMLLLIGFFDKSKKVVI SPGDLLCLAVSIFLSS TLFLL+KLTE
Subjt:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE

XP_038898874.1 uncharacterized protein LOC120086339 isoform X3 [Benincasa hispida]1.8e-12690.74Show/hide
Query:  MQPSPSLITGPIRTLTTTARPSTNIIQAY-QYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEEL
        MQPSPSLITGP+RTLTTTARPS  IIQAY QY QPNP F S FGYR DLVGAC RRF ACAS SSGPQVPA SAPLIQSHLGA SR STL K DTIEEEL
Subjt:  MQPSPSLITGPIRTLTTTARPSTNIIQAY-QYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEEL

Query:  EKAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLF
        EKAIYRCRFMAFLGVLGSLIGSVLCF+KGCVHVA+SFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSK S+EHKSNLFGLF
Subjt:  EKAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLF

Query:  TLKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE
        TLKERPKWMN+ TVNELKTKLGHVIVMLLLIGFFDKSKKVVI SPGDLLCLAVSIFLSS TLFLLTKLTE
Subjt:  TLKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE

TrEMBL top hitse value%identityAlignment
A0A0A0KWU8 Uncharacterized protein2.1e-11784.39Show/hide
Query:  MQPSPSLITGPIRTLTTTARPSTNIIQAYQYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELE
        MQPSPSLIT PIR LTTT RPST I QAY Y QP PKFNSLFGYR  L+G+  R FPA AS +   QVPAASAPLIQ+HLGAASRTSTLEK +T+EEELE
Subjt:  MQPSPSLITGPIRTLTTTARPSTNIIQAYQYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFT
        KAIYRCRFMAF GVLGSLIGS+ CF++GCVHVA+SFSEYFVNRGKVI++LVEAIDVYLLGTVMLVFGTGLYELFIS LG+AR  SK ++EHKSNLFGLF 
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFT

Query:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE
        LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVI SPGDLLCLAVSIFLSSGTLFLLTKLTE
Subjt:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE

A0A1S3BA03 uncharacterized protein LOC103487632 isoform X11.4e-12991.08Show/hide
Query:  MQPSPSLITGPIRTLTTTARPSTNIIQAYQYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELE
        MQPSPSLITGPIRT TTT RPST IIQAYQYQQPNPKFNSLFGYR DLVGAC R FPACAS S GPQVPAASAPLIQ+HLGAASRTSTLEK +TIEEELE
Subjt:  MQPSPSLITGPIRTLTTTARPSTNIIQAYQYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGS+LCF+KGCVHVA+SFSEYFVNRGKVIM+LVEAIDVYLLGTVMLVFGTGLYELFIS LG+ARS SK ++EHKSNLFGLF 
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFT

Query:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE
        LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVI SP DLLCLAVSIFLSSGTLFLLTKLTE
Subjt:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE

A0A6J1CHU0 uncharacterized protein LOC1110112762.0e-12084.76Show/hide
Query:  MQPSPSLITGPIRTLTTTARPSTNIIQAYQYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELE
        MQPSP LITGPIRTLTTT RPST I+QAY YQQ NPKF+  FGY  DLVG C RRFPACAS SSGPQVPAASAPLIQS   AA RTS LEK +TIEE LE
Subjt:  MQPSPSLITGPIRTLTTTARPSTNIIQAYQYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVA+SFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLG+A+S S  + EH+SNLFGLFT
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFT

Query:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE
        LKERPKW+ ++TVNELKTKLGHVIVMLLLIGFF+K+KKVVI SPGDLLCLAVS+FLSSG+LFLL+KLTE
Subjt:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE

A0A6J1F9X5 uncharacterized protein LOC1114434362.9e-12789.59Show/hide
Query:  MQPSPSLITGPIRTLTTTARPSTNIIQAYQYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELE
        MQPSPSLITGPIRTLTTTARPST IIQAYQ+QQPNPKFN +FGYRADLVG CGRRFPACAS SSGPQVPAASAP +QS +GAASRTS LEK DT+EE LE
Subjt:  MQPSPSLITGPIRTLTTTARPSTNIIQAYQYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVA+S SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LGSARS S+ S+ H+SNLFGLFT
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFT

Query:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE
        LKERPKWMN+ TVNELKTKLGHVIVMLLLIGFFDKSKKVVI SPGDLLCLAVSIFLSS TLFLL+KLTE
Subjt:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE

A0A6J1INA6 uncharacterized protein LOC1114770897.2e-12688.85Show/hide
Query:  MQPSPSLITGPIRTLTTTARPSTNIIQAYQYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELE
        MQPSPSLITGPIRTLTTTARPST IIQAYQ+QQPN KFN +FGYRADLVG CGRRFPACAS SSGPQVPAASAP +QS +GAASRTS LEK DT+EE LE
Subjt:  MQPSPSLITGPIRTLTTTARPSTNIIQAYQYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVA+S SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LGSARS S+  + H+SNLFGLFT
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFT

Query:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE
        LKERPKWMN+ TVNELKTKLGHVIVMLLLIGFFDKSKKVVI SPGDLLCLAVSIFLSS TLFLL+KLTE
Subjt:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G19390.1 Uncharacterised protein family (UPF0114)1.0e-6360.98Show/hide
Query:  SSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELEKAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTV
        S G    A+++    +   AA  +++  + + +EE +EK IY CRFM FLG LGSL+GSVLCF+KGC++V  SF +Y VNRGKVI LLVEAID+YLLGTV
Subjt:  SSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELEKAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTV

Query:  MLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFTLKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLF
        MLVFG GLYELFIS+L ++ S +   + ++S+LFG+FTLKERP+W+ V++V+ELKTKLGHVIVMLLLIG FDKSK+VVI S  DLLC++VSIF SS  LF
Subjt:  MLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFTLKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLF

Query:  LLTKL
        LL++L
Subjt:  LLTKL

AT5G13720.1 Uncharacterised protein family (UPF0114)3.9e-3943.46Show/hide
Query:  ASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELEKAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVN------RGKVIMLLVEA
        AS+S    +P     L  S+      +   +   + E  +E+ I+  RF+A L V GSL GS+LCF+ GCV++  ++  Y+ N       G++++ LVEA
Subjt:  ASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELEKAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVN------RGKVIMLLVEA

Query:  IDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFTLKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVS
        IDVYL GTVML+F  GLY LFISH               S+LFG+F +KERPKWM + +++ELKTK+GHVIVM+LL+  F++SK V I +  DLL  +V 
Subjt:  IDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFTLKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVS

Query:  IFLSSGTLFLLTKL
        IFLSS +L++L  L
Subjt:  IFLSSGTLFLLTKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCGTCTCCATCGTTGATTACTGGCCCCATCAGAACTCTAACGACCACCGCCCGACCTTCCACGAACATCATCCAAGCCTATCAGTACCAGCAACCTAAT
CCAAAGTTCAATAGTCTTTTTGGGTATAGAGCTGACCTTGTCGGTGCTTGTGGCCGTAGATTTCCTGCTTGTGCAAGTGCCAGCTCAGGGCCTCAAGTTCCGGCT
GCTTCTGCTCCTTTAATCCAATCCCATCTTGGCGCTGCGTCTCGGACGTCGACACTGGAAAAGTCAGATACCATCGAGGAGGAGCTTGAAAAGGCCATTTATCGA
TGCCGATTCATGGCATTTTTGGGGGTCCTGGGGTCTTTAATTGGCTCTGTACTCTGTTTCGTCAAGGGGTGCGTTCATGTAGCATCATCTTTCTCAGAGTATTTT
GTAAATCGTGGAAAAGTGATAATGTTGCTAGTTGAGGCCATAGATGTGTATCTCTTAGGAACTGTGATGCTAGTCTTTGGCACGGGTCTGTATGAGTTGTTTATC
AGCCATCTTGGAAGTGCACGGTCATCATCAAAGTGTAGCATTGAGCATAAATCAAACTTATTTGGCTTGTTTACTTTAAAGGAACGACCTAAATGGATGAACGTA
AGGACGGTTAACGAGCTGAAAACAAAGCTGGGGCATGTCATAGTGATGCTGCTTCTAATTGGGTTCTTTGACAAGAGTAAAAAGGTGGTTATACATTCTCCAGGT
GATTTGCTTTGCTTAGCTGTTTCAATATTCCTTTCCTCTGGTACCCTGTTTTTACTGACTAAACTAACTGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAACCGTCTCCATCGTTGATTACTGGCCCCATCAGAACTCTAACGACCACCGCCCGACCTTCCACGAACATCATCCAAGCCTATCAGTACCAGCAACCTAAT
CCAAAGTTCAATAGTCTTTTTGGGTATAGAGCTGACCTTGTCGGTGCTTGTGGCCGTAGATTTCCTGCTTGTGCAAGTGCCAGCTCAGGGCCTCAAGTTCCGGCT
GCTTCTGCTCCTTTAATCCAATCCCATCTTGGCGCTGCGTCTCGGACGTCGACACTGGAAAAGTCAGATACCATCGAGGAGGAGCTTGAAAAGGCCATTTATCGA
TGCCGATTCATGGCATTTTTGGGGGTCCTGGGGTCTTTAATTGGCTCTGTACTCTGTTTCGTCAAGGGGTGCGTTCATGTAGCATCATCTTTCTCAGAGTATTTT
GTAAATCGTGGAAAAGTGATAATGTTGCTAGTTGAGGCCATAGATGTGTATCTCTTAGGAACTGTGATGCTAGTCTTTGGCACGGGTCTGTATGAGTTGTTTATC
AGCCATCTTGGAAGTGCACGGTCATCATCAAAGTGTAGCATTGAGCATAAATCAAACTTATTTGGCTTGTTTACTTTAAAGGAACGACCTAAATGGATGAACGTA
AGGACGGTTAACGAGCTGAAAACAAAGCTGGGGCATGTCATAGTGATGCTGCTTCTAATTGGGTTCTTTGACAAGAGTAAAAAGGTGGTTATACATTCTCCAGGT
GATTTGCTTTGCTTAGCTGTTTCAATATTCCTTTCCTCTGGTACCCTGTTTTTACTGACTAAACTAACTGAATGA
Protein sequenceShow/hide protein sequence
MQPSPSLITGPIRTLTTTARPSTNIIQAYQYQQPNPKFNSLFGYRADLVGACGRRFPACASASSGPQVPAASAPLIQSHLGAASRTSTLEKSDTIEEELEKAIYR
CRFMAFLGVLGSLIGSVLCFVKGCVHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKCSIEHKSNLFGLFTLKERPKWMNV
RTVNELKTKLGHVIVMLLLIGFFDKSKKVVIHSPGDLLCLAVSIFLSSGTLFLLTKLTE