; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS004593 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS004593
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUncharacterised protein family (UPF0114)
Genome locationscaffold995:536448..540085
RNA-Seq ExpressionMS004593
SyntenyMS004593
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005134 - Uncharacterised protein family UPF0114


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591856.1 hypothetical protein SDJN03_14202, partial [Cucurbita argyrosperma subsp. sororia]2.6e-12085.87Show/hide
Query:  MQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLE
        MQPSP LITGPIRTLTTT RPSTII+QAY +QQ NPKFS  FGY  DLVGGC R FPACAS SSGPQVPAASAP +QSD  AA RTSALEKL+T+EEGLE
Subjt:  MQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LG+A+S S ++  HRSNLFGLFT
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFT

Query:  LKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE
        LKERPKW+ I TVNELKTKLGHVIVMLLLIGFF+K+KKVVIQSPGDLLCLAVS+FLSS +LFLLSKLTE
Subjt:  LKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE

XP_022140712.1 uncharacterized protein LOC111011276 [Momordica charantia]2.4e-14299.63Show/hide
Query:  MQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLE
        MQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFS+FFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLE
Subjt:  MQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFT
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFT

Query:  LKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE
        LKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE
Subjt:  LKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE

XP_022937012.1 uncharacterized protein LOC111443436 [Cucurbita moschata]8.8e-12185.87Show/hide
Query:  MQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLE
        MQPSP LITGPIRTLTTT RPSTII+QAY +QQ NPKF+  FGY  DLVGGC RRFPACAS SSGPQVPAASAP +QSD  AA RTSALEKL+T+EEGLE
Subjt:  MQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LG+A+S S ++  HRSNLFGLFT
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFT

Query:  LKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE
        LKERPKW+ I TVNELKTKLGHVIVMLLLIGFF+K+KKVVIQSPGDLLCLAVS+FLSS +LFLLSKLTE
Subjt:  LKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE

XP_022976828.1 uncharacterized protein LOC111477089 [Cucurbita maxima]9.8e-12085.5Show/hide
Query:  MQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLE
        MQPSP LITGPIRTLTTT RPSTII+QAY +QQ N KF+  FGY  DLVGGC RRFPACAS SSGPQVPAASAP +QSD  AA RTSALEKL+T+EEGLE
Subjt:  MQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LG+A+S S +   HRSNLFGLFT
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFT

Query:  LKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE
        LKERPKW+ I TVNELKTKLGHVIVMLLLIGFF+K+KKVVIQSPGDLLCLAVS+FLSS +LFLLSKLTE
Subjt:  LKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE

XP_023536456.1 uncharacterized protein LOC111797628 [Cucurbita pepo subsp. pepo]3.4e-12085.5Show/hide
Query:  MQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLE
        MQPSP LITGPIRTL TT RPSTII+QAY +QQ NPKF+  FGY  DLVGGC RRFPACAS SSGPQVPAASAP +QSD  AA RTSALEKL+T+EEGLE
Subjt:  MQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LG+A+S S ++  HRSNLFGLFT
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFT

Query:  LKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE
        LKERPKW+ I TVNELKTKLGHVIVMLLLIGFF+K+KKVVIQSPGDLLCLAVS+FLSS +LFLLSKLTE
Subjt:  LKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE

TrEMBL top hitse value%identityAlignment
A0A1S3BA03 uncharacterized protein LOC103487632 isoform X12.0e-11883.64Show/hide
Query:  MQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLE
        MQPSP LITGPIRT TTTVRPSTII+QAY YQQ NPKF+  FGY TDLVG CSR FPACAS S GPQVPAASAPLIQ+   AA RTS LEK+ TIEE LE
Subjt:  MQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGS+LCF+KGCVHVAASFSEYFVNRGKVIM+LVEAIDVYLLGTVMLVFGTGLYELFIS LG A+S S  N EH+SNLFGLF 
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFT

Query:  LKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE
        LKERPKW+ ++TVNELKTKLGHVIVMLLLIGFF+K+KKVVIQSP DLLCLAVS+FLSSG+LFLL+KLTE
Subjt:  LKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE

A0A6J1CHU0 uncharacterized protein LOC1110112761.2e-14299.63Show/hide
Query:  MQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLE
        MQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFS+FFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLE
Subjt:  MQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFT
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFT

Query:  LKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE
        LKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE
Subjt:  LKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE

A0A6J1F9X5 uncharacterized protein LOC1114434364.3e-12185.87Show/hide
Query:  MQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLE
        MQPSP LITGPIRTLTTT RPSTII+QAY +QQ NPKF+  FGY  DLVGGC RRFPACAS SSGPQVPAASAP +QSD  AA RTSALEKL+T+EEGLE
Subjt:  MQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LG+A+S S ++  HRSNLFGLFT
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFT

Query:  LKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE
        LKERPKW+ I TVNELKTKLGHVIVMLLLIGFF+K+KKVVIQSPGDLLCLAVS+FLSS +LFLLSKLTE
Subjt:  LKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE

A0A6J1INA6 uncharacterized protein LOC1114770894.7e-12085.5Show/hide
Query:  MQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLE
        MQPSP LITGPIRTLTTT RPSTII+QAY +QQ N KF+  FGY  DLVGGC RRFPACAS SSGPQVPAASAP +QSD  AA RTSALEKL+T+EEGLE
Subjt:  MQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLE

Query:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFT
        KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LG+A+S S +   HRSNLFGLFT
Subjt:  KAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFT

Query:  LKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE
        LKERPKW+ I TVNELKTKLGHVIVMLLLIGFF+K+KKVVIQSPGDLLCLAVS+FLSS +LFLLSKLTE
Subjt:  LKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE

A0A6J1J2R4 uncharacterized protein LOC1114807458.6e-11482.22Show/hide
Query:  MQPSPPLITGPIRTLTTTVRPSTIIVQAYH-YQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGL
        MQPSPPLI+GP R+LTTTVRPST+I+QAYH Y QS PKF+ F GY T L+ GC RRFPA A+ SSGP VPAASAP IQSD   A RTSALEK   IEE L
Subjt:  MQPSPPLITGPIRTLTTTVRPSTIIVQAYH-YQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGL

Query:  EKAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLF
        EKAIYRCRFMAFLGV GSL+GS+LCF+KGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGT ++ S +N EHRSNLFGLF
Subjt:  EKAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLF

Query:  TLKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE
        TLKERPKW+ I TVNELKTKLGHVIVMLLLIGFF+K+KK  IQSPGDLLCLA SVFLSSGSLFLLSKLTE
Subjt:  TLKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G19390.1 Uncharacterised protein family (UPF0114)1.5e-6562.93Show/hide
Query:  SSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLEKAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTV
        S G    A+++    +  SAA  +++  + E +EEG+EK IY CRFM FLG LGSL+GSVLCF+KGC++V  SF +Y VNRGKVI LLVEAID+YLLGTV
Subjt:  SSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLEKAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTV

Query:  MLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFTLKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLF
        MLVFG GLYELFIS+L T++S +     +RS+LFG+FTLKERP+WL +K+V+ELKTKLGHVIVMLLLIG F+K+K+VVI S  DLLC++VS+F SS  LF
Subjt:  MLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFTLKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLF

Query:  LLSKL
        LLS+L
Subjt:  LLSKL

AT5G13720.1 Uncharacterised protein family (UPF0114)7.4e-4144.39Show/hide
Query:  ASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLEKAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVN------RGKVIMLLVEA
        AS+S    +P     L  S  +    +   +   + E  +E+ I+  RF+A L V GSL GS+LCF+ GCV++  ++  Y+ N       G++++ LVEA
Subjt:  ASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEGLEKAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVN------RGKVIMLLVEA

Query:  IDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFTLKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVS
        IDVYL GTVML+F  GLY LFISH      P    A   S+LFG+F +KERPKW+ I +++ELKTK+GHVIVM+LL+  FE++K V I +  DLL  +V 
Subjt:  IDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFTLKERPKWLYIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVS

Query:  VFLSSGSLFLLSKL
        +FLSS SL++L  L
Subjt:  VFLSSGSLFLLSKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AATCCTTTCTCTCCCCTTCCATCTCCGGCTGCCACCATGCAACCATCTCCACCGTTGATTACTGGCCCTATCAGAACTCTAACGACCACCGTCCGACCTTCCACGATCAT
CGTCCAAGCCTACCACTACCAGCAATCCAATCCAAAATTCAGTAAGTTTTTTGGGTATACGACCGATCTTGTTGGTGGTTGTAGCCGTAGATTTCCTGCTTGTGCAAGTA
CCAGCTCAGGGCCTCAAGTTCCGGCTGCTTCTGCTCCTTTAATCCAATCCGATTTCAGCGCTGCGCCCCGGACTTCGGCACTGGAAAAGTTGGAAACCATAGAGGAGGGC
CTGGAAAAGGCCATTTATCGATGCCGATTCATGGCGTTTTTGGGCGTCTTAGGGTCTTTGATTGGGTCTGTACTCTGTTTTGTCAAGGGGTGCGTTCATGTGGCAGCATC
TTTCTCAGAATATTTTGTAAATCGTGGAAAAGTGATAATGTTGCTAGTTGAAGCCATAGATGTGTATCTCTTGGGGACTGTGATGCTGGTCTTCGGCACGGGTCTCTATG
AGCTGTTTATAAGCCATCTCGGAACTGCACAATCGCCATCAATGAAAAACGCTGAGCATAGATCTAACTTATTTGGCCTATTCACTTTGAAGGAACGACCGAAATGGTTG
TACATCAAAACCGTCAACGAACTGAAAACGAAACTCGGGCACGTCATAGTGATGCTGCTTCTAATCGGGTTCTTTGAGAAGACAAAGAAGGTGGTCATACAATCTCCAGG
TGATTTGCTTTGCTTGGCTGTTTCAGTATTCCTTTCCTCTGGTAGCCTCTTTTTGTTGTCTAAACTAACCGAA
mRNA sequenceShow/hide mRNA sequence
AATCCTTTCTCTCCCCTTCCATCTCCGGCTGCCACCATGCAACCATCTCCACCGTTGATTACTGGCCCTATCAGAACTCTAACGACCACCGTCCGACCTTCCACGATCAT
CGTCCAAGCCTACCACTACCAGCAATCCAATCCAAAATTCAGTAAGTTTTTTGGGTATACGACCGATCTTGTTGGTGGTTGTAGCCGTAGATTTCCTGCTTGTGCAAGTA
CCAGCTCAGGGCCTCAAGTTCCGGCTGCTTCTGCTCCTTTAATCCAATCCGATTTCAGCGCTGCGCCCCGGACTTCGGCACTGGAAAAGTTGGAAACCATAGAGGAGGGC
CTGGAAAAGGCCATTTATCGATGCCGATTCATGGCGTTTTTGGGCGTCTTAGGGTCTTTGATTGGGTCTGTACTCTGTTTTGTCAAGGGGTGCGTTCATGTGGCAGCATC
TTTCTCAGAATATTTTGTAAATCGTGGAAAAGTGATAATGTTGCTAGTTGAAGCCATAGATGTGTATCTCTTGGGGACTGTGATGCTGGTCTTCGGCACGGGTCTCTATG
AGCTGTTTATAAGCCATCTCGGAACTGCACAATCGCCATCAATGAAAAACGCTGAGCATAGATCTAACTTATTTGGCCTATTCACTTTGAAGGAACGACCGAAATGGTTG
TACATCAAAACCGTCAACGAACTGAAAACGAAACTCGGGCACGTCATAGTGATGCTGCTTCTAATCGGGTTCTTTGAGAAGACAAAGAAGGTGGTCATACAATCTCCAGG
TGATTTGCTTTGCTTGGCTGTTTCAGTATTCCTTTCCTCTGGTAGCCTCTTTTTGTTGTCTAAACTAACCGAA
Protein sequenceShow/hide protein sequence
NPFSPLPSPAATMQPSPPLITGPIRTLTTTVRPSTIIVQAYHYQQSNPKFSKFFGYTTDLVGGCSRRFPACASTSSGPQVPAASAPLIQSDFSAAPRTSALEKLETIEEG
LEKAIYRCRFMAFLGVLGSLIGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTAQSPSMKNAEHRSNLFGLFTLKERPKWL
YIKTVNELKTKLGHVIVMLLLIGFFEKTKKVVIQSPGDLLCLAVSVFLSSGSLFLLSKLTE