; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC11G219710 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC11G219710
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionUncharacterised protein family (UPF0114)
Genome locationCmU531Chr11:28127591..28140113
RNA-Seq ExpressionCmUC11G219710
SyntenyCmUC11G219710
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005134 - Uncharacterised protein family UPF0114


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008444243.1 PREDICTED: uncharacterized protein LOC103487632 isoform X1 [Cucumis melo]3.2e-12890.33Show/hide
Query:  MQPSPSLITGPIRTLTTTARPSTIIIQAYQYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELE
        MQPSPSLITGPIRT TTT RPSTIIIQAYQYQQ NP FNSLFGYR DLVGAC R FPACAS S GPQVPAAS PLIQ+ LGAASRTSTLEK++TIEEELE
Subjt:  MQPSPSLITGPIRTLTTTARPSTIIIQAYQYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGS+LCF+KGC+HVA+SFSEYFVNRGKVIM+LVEAIDVYLLGTVMLVFGTGLYELFIS LG+ARS SKS++EHKSNLFGLF 
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFT

Query:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE
        LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSP DLLCLAVSIFLSSGTLFLLTKLTE
Subjt:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE

XP_022937012.1 uncharacterized protein LOC111443436 [Cucurbita moschata]1.8e-12688.85Show/hide
Query:  MQPSPSLITGPIRTLTTTARPSTIIIQAYQYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELE
        MQPSPSLITGPIRTLTTTARPSTIIIQAYQ+QQ NP FN +FGYRADLVG CGRRFPACAS S GPQVPAAS P +QS +GAASRTS LEKLDT+EE LE
Subjt:  MQPSPSLITGPIRTLTTTARPSTIIIQAYQYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGSVLCFVKGC+HVA+S SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LGSARS S+ S+ H+SNLFGLFT
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFT

Query:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE
        LKERPKWMN+ TVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSS TLFLL+KLTE
Subjt:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE

XP_022976828.1 uncharacterized protein LOC111477089 [Cucurbita maxima]4.3e-12588.1Show/hide
Query:  MQPSPSLITGPIRTLTTTARPSTIIIQAYQYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELE
        MQPSPSLITGPIRTLTTTARPSTIIIQAYQ+QQ N  FN +FGYRADLVG CGRRFPACAS S GPQVPAAS P +QS +GAASRTS LEKLDT+EE LE
Subjt:  MQPSPSLITGPIRTLTTTARPSTIIIQAYQYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGSVLCFVKGC+HVA+S SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LGSARS S+  + H+SNLFGLFT
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFT

Query:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE
        LKERPKWMN+ TVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSS TLFLL+KLTE
Subjt:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE

XP_023536456.1 uncharacterized protein LOC111797628 [Cucurbita pepo subsp. pepo]6.7e-12688.48Show/hide
Query:  MQPSPSLITGPIRTLTTTARPSTIIIQAYQYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELE
        MQPSPSLITGPIRTL TTARPSTIIIQAYQ+QQ NP FN +FGYRADLVG CGRRFPACAS S GPQVPAAS P +QS +GAASRTS LEKLDT+EE LE
Subjt:  MQPSPSLITGPIRTLTTTARPSTIIIQAYQYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGSVLCFVKGC+HVA+S SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LGSARS S+ S+ H+SNLFGLFT
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFT

Query:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE
        LKERPKWMN+ TVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSS TLFLL+KLTE
Subjt:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE

XP_038898874.1 uncharacterized protein LOC120086339 isoform X3 [Benincasa hispida]1.3e-12690.74Show/hide
Query:  MQPSPSLITGPIRTLTTTARPSTIIIQAY-QYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEEL
        MQPSPSLITGP+RTLTTTARPS IIIQAY QY Q NPNF S FGYR DLVGAC RRF ACAS S GPQVPA S PLIQS LGA SR STL KLDTIEEEL
Subjt:  MQPSPSLITGPIRTLTTTARPSTIIIQAY-QYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEEL

Query:  EKAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLF
        EKAIYRCRFMAFLGVLGSLIGSVLCF+KGC+HVA+SFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSS+EHKSNLFGLF
Subjt:  EKAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLF

Query:  TLKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE
        TLKERPKWMN+ TVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSS TLFLLTKLTE
Subjt:  TLKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE

TrEMBL top hitse value%identityAlignment
A0A0A0KWU8 Uncharacterized protein2.3e-11683.64Show/hide
Query:  MQPSPSLITGPIRTLTTTARPSTIIIQAYQYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELE
        MQPSPSLIT PIR LTTT RPSTII QAY Y Q  P FNSLFGYR  L+G+  R FPA AS +   QVPAAS PLIQ+ LGAASRTSTLEK++T+EEELE
Subjt:  MQPSPSLITGPIRTLTTTARPSTIIIQAYQYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFT
        KAIYRCRFMAF GVLGSLIGS+ CF++GC+HVA+SFSEYFVNRGKVI++LVEAIDVYLLGTVMLVFGTGLYELFIS LG+AR  SKS++EHKSNLFGLF 
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFT

Query:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE
        LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE
Subjt:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE

A0A1S3BA03 uncharacterized protein LOC103487632 isoform X11.6e-12890.33Show/hide
Query:  MQPSPSLITGPIRTLTTTARPSTIIIQAYQYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELE
        MQPSPSLITGPIRT TTT RPSTIIIQAYQYQQ NP FNSLFGYR DLVGAC R FPACAS S GPQVPAAS PLIQ+ LGAASRTSTLEK++TIEEELE
Subjt:  MQPSPSLITGPIRTLTTTARPSTIIIQAYQYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGS+LCF+KGC+HVA+SFSEYFVNRGKVIM+LVEAIDVYLLGTVMLVFGTGLYELFIS LG+ARS SKS++EHKSNLFGLF 
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFT

Query:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE
        LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSP DLLCLAVSIFLSSGTLFLLTKLTE
Subjt:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE

A0A6J1CHU0 uncharacterized protein LOC1110112764.1e-12184.39Show/hide
Query:  MQPSPSLITGPIRTLTTTARPSTIIIQAYQYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELE
        MQPSP LITGPIRTLTTT RPSTII+QAY YQQ NP F+  FGY  DLVG C RRFPACAS S GPQVPAAS PLIQS   AA RTS LEKL+TIEE LE
Subjt:  MQPSPSLITGPIRTLTTTARPSTIIIQAYQYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGSVLCFVKGC+HVA+SFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLG+A+S S  + EH+SNLFGLFT
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFT

Query:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE
        LKERPKW+ ++TVNELKTKLGHVIVMLLLIGFF+K+KKVVIQSPGDLLCLAVS+FLSSG+LFLL+KLTE
Subjt:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE

A0A6J1F9X5 uncharacterized protein LOC1114434368.5e-12788.85Show/hide
Query:  MQPSPSLITGPIRTLTTTARPSTIIIQAYQYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELE
        MQPSPSLITGPIRTLTTTARPSTIIIQAYQ+QQ NP FN +FGYRADLVG CGRRFPACAS S GPQVPAAS P +QS +GAASRTS LEKLDT+EE LE
Subjt:  MQPSPSLITGPIRTLTTTARPSTIIIQAYQYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGSVLCFVKGC+HVA+S SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LGSARS S+ S+ H+SNLFGLFT
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFT

Query:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE
        LKERPKWMN+ TVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSS TLFLL+KLTE
Subjt:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE

A0A6J1INA6 uncharacterized protein LOC1114770892.1e-12588.1Show/hide
Query:  MQPSPSLITGPIRTLTTTARPSTIIIQAYQYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELE
        MQPSPSLITGPIRTLTTTARPSTIIIQAYQ+QQ N  FN +FGYRADLVG CGRRFPACAS S GPQVPAAS P +QS +GAASRTS LEKLDT+EE LE
Subjt:  MQPSPSLITGPIRTLTTTARPSTIIIQAYQYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGSVLCFVKGC+HVA+S SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LGSARS S+  + H+SNLFGLFT
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFT

Query:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE
        LKERPKWMN+ TVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSS TLFLL+KLTE
Subjt:  LKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G19390.1 Uncharacterised protein family (UPF0114)2.7e-6460.98Show/hide
Query:  SKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELEKAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTV
        S G    A+++    +   AA  +++  + + +EE +EK IY CRFM FLG LGSL+GSVLCF+KGC++V  SF +Y VNRGKVI LLVEAID+YLLGTV
Subjt:  SKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELEKAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTV

Query:  MLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFTLKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLF
        MLVFG GLYELFIS+L ++ S +   + ++S+LFG+FTLKERP+W+ V++V+ELKTKLGHVIVMLLLIG FDKSK+VVI S  DLLC++VSIF SS  LF
Subjt:  MLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFTLKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLF

Query:  LLTKL
        LL++L
Subjt:  LLTKL

AT5G13720.1 Uncharacterised protein family (UPF0114)3.0e-3942.99Show/hide
Query:  ASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELEKAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVN------RGKVIMLLVEA
        AS+S    +P     L  S       +   +   + E  +E+ I+  RF+A L V GSL GS+LCF+ GC+++  ++  Y+ N       G++++ LVEA
Subjt:  ASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELEKAIYRCRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVN------RGKVIMLLVEA

Query:  IDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFTLKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVS
        IDVYL GTVML+F  GLY LFISH               S+LFG+F +KERPKWM + +++ELKTK+GHVIVM+LL+  F++SK V I +  DLL  +V 
Subjt:  IDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFTLKERPKWMNVRTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVS

Query:  IFLSSGTLFLLTKL
        IFLSS +L++L  L
Subjt:  IFLSSGTLFLLTKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCGTCTCCATCGTTGATTACTGGCCCCATCAGAACTCTAACGACCACCGCCCGACCTTCCACGATCATCATCCAAGCCTATCAGTACCAGCAACGTAAT
CCAAATTTCAATAGTCTTTTTGGGTATAGAGCTGACCTTGTTGGTGCTTGTGGCCGTAGATTTCCTGCTTGTGCAAGTGCAAGCAAAGGGCCTCAAGTTCCGGCT
GCTTCTACTCCTCTAATCCAATCCCAGCTTGGCGCTGCGTCTCGGACGTCGACACTGGAAAAGTTGGATACCATCGAGGAGGAGCTTGAAAAGGCCATTTATCGA
TGCCGATTCATGGCATTTCTGGGCGTCTTAGGGTCTTTAATTGGCTCTGTACTATGTTTCGTCAAGGGGTGCATTCATGTAGCATCATCTTTCTCGGAGTATTTT
GTAAATCGTGGAAAAGTGATAATGTTGCTAGTTGAGGCCATAGATGTGTATCTCTTGGGAACTGTGATGCTAGTCTTCGGCACGGGTCTGTATGAGTTGTTTATC
AGCCATCTTGGAAGTGCGCGGTCATCATCAAAGAGTAGCATTGAGCATAAATCAAACTTATTTGGCTTGTTTACTTTAAAGGAACGACCTAAATGGATGAATGTA
AGGACGGTTAACGAGCTGAAAACAAAGCTGGGGCATGTGATAGTGATGCTGCTTCTAATTGGGTTCTTTGACAAGAGTAAAAAGGTGGTTATACAATCTCCAGGT
GATTTGCTATGCTTAGCTGTTTCAATATTCCTTTCATCTGGTACCCTTTTTTTACTGACTAAACTAACTGAATGA
mRNA sequenceShow/hide mRNA sequence
CGACCAACATCCATTTCTATAGATTGACAAAATTATAATTTGCCAATGATAAATTATATTTACTTATTTCTTGAAATATTTTAATTAATTTTTTTTTATATGGGT
AATGAAAGTAAGATGCTGCATAAAAAGTAAGAACTACAATATTATTTGAAAAAACCATGAAAAATAAAATACTAATAATACTCTAAACCTTTATACGTATTTATA
CCAACCCGATCTTCACGAACGGTGGAGAGATTCACCGTTTTAAAATTTTCAAATTCAAAATTTGAGAATTTTATTTTATTTTCCCTTCAAAATTACTTCTGATTT
GAAGAACAAGGCGAGGGAATGTAGCTTCCATTGTGCGGGTAAAACCGGAGATGGCAGTTCAAACGGCACCGAAACCTGAGCGGTCCTAAGACTCCAACTCGCCAT
CGAGTTCTCGAGCTTCCTCTCAGATCGAATTGCATCAAATCCCTCTTCAGCGACCAGCTTAGATGCTGCATCTGCGTTGGATTGAGCGGAATCGCCGACGATTCC
ACTAAGAAATGCAAATCCAAGTAACTCATTTTGTTCACTTTGAACGGCCGGTGAGCCACCAGTGATGCGATTTCTATGCCTTCGAAAAGCAAGACGAAATCGGTC
TGTGAAAAGCTTGCGTGTGCCCTAGCATTTTGATTCTCGGCTCGGACGACGATCTTCATCTGGACTTCAAGCAATCCAGTTCTGCTGCTTCGGATTCTGTCGAGA
TGTCCGTCGGTTACACTAATTGAAGGCACCCTAGGGCGGATCGTCACATAGCCGATGAAGATGACGATGCCTGCGATGATCACTGCAAGAGCTATGGCGGCGCAG
ATTATGCCGGCAATCCAAATTAGAGGATGAAGCTAAATTTTAGGCATGGCAGAGGAGAGACACTGAGGAAGGAATGCAAGATTATAAAAAAGTTGCAGCAATGGG
CGGCCAAGGTTTTGCTGTTTCTCTTGTAATCTTCCCGCAACCACCAATTGGAGCACAGAGAAAGACCACTGGTTTCAAATATGCATCTAAACCAATTGAGTTATG
ATCGAGATTCTTTTGGAAAAAGAAGCTTCTTTCTAAGCATGTTTCACCAGGCAAATATCAACGTGCCGTGATGAAACAGAGAACCAACTGATCATGGAGTATCCA
ACTTTAAAGGAGGCCCATAACTTTGGCCCCTATCTCTTAAGTTTTAACTATCTTCAACTGCCACTTCCTCGGAGCTCAGTGGCTGGCGCCGCTGTACGGCAATCT
TCTTTGATGTTAAAACCCCTTTCTATCCCCTTCCATCTCCGGCTGCCACCATGCAACCGTCTCCATCGTTGATTACTGGCCCCATCAGAACTCTAACGACCACCG
CCCGACCTTCCACGATCATCATCCAAGCCTATCAGTACCAGCAACGTAATCCAAATTTCAATAGTCTTTTTGGGTATAGAGCTGACCTTGTTGGTGCTTGTGGCC
GTAGATTTCCTGCTTGTGCAAGTGCAAGCAAAGGGCCTCAAGTTCCGGCTGCTTCTACTCCTCTAATCCAATCCCAGCTTGGCGCTGCGTCTCGGACGTCGACAC
TGGAAAAGTTGGATACCATCGAGGAGGAGCTTGAAAAGGCCATTTATCGATGCCGATTCATGGCATTTCTGGGCGTCTTAGGGTCTTTAATTGGCTCTGTACTAT
GTTTCGTCAAGGGGTGCATTCATGTAGCATCATCTTTCTCGGAGTATTTTGTAAATCGTGGAAAAGTGATAATGTTGCTAGTTGAGGCCATAGATGTGTATCTCT
TGGGAACTGTGATGCTAGTCTTCGGCACGGGTCTGTATGAGTTGTTTATCAGCCATCTTGGAAGTGCGCGGTCATCATCAAAGAGTAGCATTGAGCATAAATCAA
ACTTATTTGGCTTGTTTACTTTAAAGGAACGACCTAAATGGATGAATGTAAGGACGGTTAACGAGCTGAAAACAAAGCTGGGGCATGTGATAGTGATGCTGCTTC
TAATTGGGTTCTTTGACAAGAGTAAAAAGGTGGTTATACAATCTCCAGGTGATTTGCTATGCTTAGCTGTTTCAATATTCCTTTCATCTGGTACCCTTTTTTTAC
TGACTAAACTAACTGAATGAGAGTAGTAAGATATGTACAAATTATGTAATACCCCACTCCCCCTTGGTCTTTTTTGCTTTCCTTTTTTGCCCACCTCTGAAGACA
GTTGAAACCATGAAATGTTATTGTAAATGGGTGTGTAAGATTACTGTAAGAAGATGCTGCTGGAGTGTAGATTGGACTGCAAAGGAGTTAATAAATAAAGGAATG
AAGCAATATATAAAAATTTCAGTGTC
Protein sequenceShow/hide protein sequence
MQPSPSLITGPIRTLTTTARPSTIIIQAYQYQQRNPNFNSLFGYRADLVGACGRRFPACASASKGPQVPAASTPLIQSQLGAASRTSTLEKLDTIEEELEKAIYR
CRFMAFLGVLGSLIGSVLCFVKGCIHVASSFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGSARSSSKSSIEHKSNLFGLFTLKERPKWMNV
RTVNELKTKLGHVIVMLLLIGFFDKSKKVVIQSPGDLLCLAVSIFLSSGTLFLLTKLTE