; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg00298 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg00298
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionUncharacterised protein family (UPF0114)
Genome locationCarg_Chr04:6481865..6485540
RNA-Seq ExpressionCarg00298
SyntenyCarg00298
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005134 - Uncharacterised protein family UPF0114


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608018.1 hypothetical protein SDJN03_01360, partial [Cucurbita argyrosperma subsp. sororia]1.2e-130100Show/hide
Query:  MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLIGCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDLE
        MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLIGCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDLE
Subjt:  MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLIGCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDLE

Query:  KAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFT
        KAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFT
Subjt:  KAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFT

Query:  LKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSP
        LKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSP
Subjt:  LKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSP

XP_022140712.1 uncharacterized protein LOC111011276 [Momordica charantia]3.8e-11381.85Show/hide
Query:  MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLI-GCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDL
        MQPSPPLI+GP RTLTTTVRPST+I+ AYH Y QS PKF+ F GY T L+ GC RRFPA A+ SSGP VPAASAP IQSD   A RTS LEK   IEE L
Subjt:  MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLI-GCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDL

Query:  EKAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLF
        EKAIYRCRFMAFLGV GSL+GS+LCF+KGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGT ++ S +N EHRSNLFGLF
Subjt:  EKAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLF

Query:  TLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE
        TLKERPKW+ I TVNELKTKLGHVIVMLLLIGFF+K+KK  IQSPGDLLCLA SVFLSSGSLFLLSKLTE
Subjt:  TLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE

XP_022940220.1 uncharacterized protein LOC111445906 [Cucurbita moschata]9.3e-144100Show/hide
Query:  MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLIGCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDLE
        MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLIGCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDLE
Subjt:  MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLIGCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDLE

Query:  KAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFT
        KAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFT
Subjt:  KAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFT

Query:  LKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE
        LKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE
Subjt:  LKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE

XP_022981673.1 uncharacterized protein LOC111480745 [Cucurbita maxima]8.7e-14298.51Show/hide
Query:  MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLIGCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDLE
        MQPSPPLISGPSR+LTTTVRPSTMII AYHQYLQSYPKFNSFIGYKTHLIGCGRRFPAFATASSGPHVPAASAPSIQSD+GMASRTS LEKSGIIEEDLE
Subjt:  MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLIGCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDLE

Query:  KAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFT
        KAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFT
Subjt:  KAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFT

Query:  LKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE
        LKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE
Subjt:  LKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE

XP_023523344.1 uncharacterized protein LOC111787566 [Cucurbita pepo subsp. pepo]5.1e-14299.26Show/hide
Query:  MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLIGCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDLE
        MQPSPPLISGPSRTLTTTV PSTMII AYHQYLQSYPKFNSFIGYKTHLIGCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDLE
Subjt:  MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLIGCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDLE

Query:  KAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFT
        KAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFT
Subjt:  KAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFT

Query:  LKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE
        LKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE
Subjt:  LKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE

TrEMBL top hitse value%identityAlignment
A0A6J1CHU0 uncharacterized protein LOC1110112761.8e-11381.85Show/hide
Query:  MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLI-GCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDL
        MQPSPPLI+GP RTLTTTVRPST+I+ AYH Y QS PKF+ F GY T L+ GC RRFPA A+ SSGP VPAASAP IQSD   A RTS LEK   IEE L
Subjt:  MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLI-GCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDL

Query:  EKAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLF
        EKAIYRCRFMAFLGV GSL+GS+LCF+KGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGT ++ S +N EHRSNLFGLF
Subjt:  EKAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLF

Query:  TLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE
        TLKERPKW+ I TVNELKTKLGHVIVMLLLIGFF+K+KK  IQSPGDLLCLA SVFLSSGSLFLLSKLTE
Subjt:  TLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE

A0A6J1F9X5 uncharacterized protein LOC1114434369.2e-11380.74Show/hide
Query:  MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLI-GCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDL
        MQPSP LI+GP RTLTTT RPST+II AY Q+ Q  PKFN   GY+  L+ GCGRRFPA A+ SSGP VPAASAP +QSDVG ASRTS LEK   +EE L
Subjt:  MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLI-GCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDL

Query:  EKAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLF
        EKAIYRCRFMAFLGV GSL+GS+LCF+KGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LG+ R+ S+R++ HRSNLFGLF
Subjt:  EKAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLF

Query:  TLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE
        TLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKK  IQSPGDLLCLA S+FLSS +LFLLSKLTE
Subjt:  TLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE

A0A6J1FPY6 uncharacterized protein LOC1114459064.5e-144100Show/hide
Query:  MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLIGCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDLE
        MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLIGCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDLE
Subjt:  MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLIGCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDLE

Query:  KAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFT
        KAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFT
Subjt:  KAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFT

Query:  LKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE
        LKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE
Subjt:  LKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE

A0A6J1INA6 uncharacterized protein LOC1114770891.0e-11180.37Show/hide
Query:  MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLI-GCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDL
        MQPSP LI+GP RTLTTT RPST+II AY Q+ Q   KFN   GY+  L+ GCGRRFPA A+ SSGP VPAASAP +QSDVG ASRTS LEK   +EE L
Subjt:  MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLI-GCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDL

Query:  EKAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLF
        EKAIYRCRFMAFLGV GSL+GS+LCF+KGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS+LG+ R+ S+R + HRSNLFGLF
Subjt:  EKAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLF

Query:  TLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE
        TLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKK  IQSPGDLLCLA S+FLSS +LFLLSKLTE
Subjt:  TLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE

A0A6J1J2R4 uncharacterized protein LOC1114807454.2e-14298.51Show/hide
Query:  MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLIGCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDLE
        MQPSPPLISGPSR+LTTTVRPSTMII AYHQYLQSYPKFNSFIGYKTHLIGCGRRFPAFATASSGPHVPAASAPSIQSD+GMASRTS LEKSGIIEEDLE
Subjt:  MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLIGCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDLE

Query:  KAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFT
        KAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFT
Subjt:  KAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFT

Query:  LKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE
        LKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE
Subjt:  LKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G19390.1 Uncharacterised protein family (UPF0114)2.8e-6159.02Show/hide
Query:  SSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDLEKAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTV
        S G    A+++ S  +    A  ++   +   +EE +EK IY CRFM FLG  GSL+GS+LCFIKGC++V  SF +Y VNRGKVI LLVEAID+YLLGTV
Subjt:  SSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDLEKAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTV

Query:  MLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFTLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLF
        MLVFG GLYELFIS+L T  + +   + +RS+LFG+FTLKERP+W+ + +V+ELKTKLGHVIVMLLLIG FDKSK+  I S  DLLC++ S+F SS  LF
Subjt:  MLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFTLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLF

Query:  LLSKL
        LLS+L
Subjt:  LLSKL

AT5G13720.1 Uncharacterised protein family (UPF0114)1.2e-4048.31Show/hide
Query:  EEDLEKAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVN------RGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNI
        E ++E+ I+  RF+A L V GSL GS+LCF+ GCV++  ++  Y+ N       G++++ LVEAIDVYL GTVML+F  GLY LFISH   +        
Subjt:  EEDLEKAIYRCRFMAFLGVFGSLVGSILCFIKGCVHVAASFSEYFVN------RGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNI

Query:  EHRSNLFGLFTLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKL
           S+LFG+F +KERPKWM I++++ELKTK+GHVIVM+LL+  F++SK   I +  DLL  +  +FLSS SL++L  L
Subjt:  EHRSNLFGLFTLKERPKWMNITTVNELKTKLGHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCATCTCCACCGTTGATTAGTGGCCCTTCCAGAACTCTAACCACCACTGTCCGACCTTCCACGATGATCATCCATGCTTACCACCAGTACCTGCAATCTTATCC
AAAATTCAATAGCTTTATTGGGTATAAAACCCACCTTATCGGTTGTGGCCGTAGATTTCCTGCCTTTGCAACCGCCAGCTCAGGACCTCATGTTCCTGCTGCTTCTGCTC
CTTCAATCCAATCCGATGTTGGCATGGCGTCCCGCACGTCGGTACTGGAAAAGTCGGGTATAATAGAGGAGGACCTGGAAAAGGCCATTTATCGATGCCGATTTATGGCA
TTTTTGGGCGTCTTTGGATCTTTGGTTGGGTCTATACTCTGTTTCATCAAGGGGTGTGTTCATGTAGCAGCATCGTTCTCAGAATATTTTGTAAATCGTGGAAAAGTAAT
AATGTTGCTGGTTGAGGCTATCGATGTGTATCTCTTAGGAACTGTGATGCTCGTCTTTGGGACGGGTCTCTATGAGCTGTTTATCAGCCATCTTGGAACTGAACGGACTT
TGTCAAAGAGAAACATTGAGCATAGGTCCAATTTATTTGGCTTGTTCACTTTAAAGGAACGACCAAAATGGATGAACATAACGACGGTTAACGAGCTGAAAACGAAGCTG
GGGCATGTCATAGTGATGCTGCTTCTAATCGGGTTCTTCGACAAGAGTAAAAAGGCGGCTATACAATCTCCAGGTGATTTGCTTTGCTTAGCTGCTTCAGTATTCCTGTC
CTCTGGTAGCCTGTTTTTGCTGTCTAAACTAACCGAATAA
mRNA sequenceShow/hide mRNA sequence
TATGATCCGTACCTTTCTCAACTCCCATTTCCTCGGACCTCAGTGGCTGGCGCCGCTGTATTGCAATCTTCTTTGATGTTTGAACCCTTACTCTCCCCTTCCATCTCCGG
CTGCCACCATGCAACCATCTCCACCGTTGATTAGTGGCCCTTCCAGAACTCTAACCACCACTGTCCGACCTTCCACGATGATCATCCATGCTTACCACCAGTACCTGCAA
TCTTATCCAAAATTCAATAGCTTTATTGGGTATAAAACCCACCTTATCGGTTGTGGCCGTAGATTTCCTGCCTTTGCAACCGCCAGCTCAGGACCTCATGTTCCTGCTGC
TTCTGCTCCTTCAATCCAATCCGATGTTGGCATGGCGTCCCGCACGTCGGTACTGGAAAAGTCGGGTATAATAGAGGAGGACCTGGAAAAGGCCATTTATCGATGCCGAT
TTATGGCATTTTTGGGCGTCTTTGGATCTTTGGTTGGGTCTATACTCTGTTTCATCAAGGGGTGTGTTCATGTAGCAGCATCGTTCTCAGAATATTTTGTAAATCGTGGA
AAAGTAATAATGTTGCTGGTTGAGGCTATCGATGTGTATCTCTTAGGAACTGTGATGCTCGTCTTTGGGACGGGTCTCTATGAGCTGTTTATCAGCCATCTTGGAACTGA
ACGGACTTTGTCAAAGAGAAACATTGAGCATAGGTCCAATTTATTTGGCTTGTTCACTTTAAAGGAACGACCAAAATGGATGAACATAACGACGGTTAACGAGCTGAAAA
CGAAGCTGGGGCATGTCATAGTGATGCTGCTTCTAATCGGGTTCTTCGACAAGAGTAAAAAGGCGGCTATACAATCTCCAGGTGATTTGCTTTGCTTAGCTGCTTCAGTA
TTCCTGTCCTCTGGTAGCCTGTTTTTGCTGTCTAAACTAACCGAATAAAACTAATAAGTTATCGAATAAAACTAATAAGTTATGTACAAATATAAAATATATAATCCCTT
TTTTTTTTTGCCCTTTTGGCCCCCTCCGAAAACGCTTGAAACCATGAAATATTAGTGTTATTGTAAATGGCTGTGTAAGTTTGCTGTAAGAAGATGTTGCTGCAATGTAG
AGTGACTTCAAAAGAGTAAATAAACAAGGGGATGAAGCTTTAGAAAGCCATTTCTGGGTCGTGTCGAGATTAAGAACTGTAGAATTGGCCGAACGATGGGAGTCTCGAGA
GTGATGTGTTACTATATTCAATATTGGTCGAACTGAGTATAATTCAATTGATTAATATAAGATCGAGAAGAAAAGAAAAAGAACAGTTGAAAAGCCCCCAAACCTCTGTC
TAAACAAAGGCTGTACACAATCACAAAAACTAGTCTTAAAAAAAAGGAACAATAATCATGTCAAACCACTAGAATATCATTGCCTTTACCCCAAACCTAG
Protein sequenceShow/hide protein sequence
MQPSPPLISGPSRTLTTTVRPSTMIIHAYHQYLQSYPKFNSFIGYKTHLIGCGRRFPAFATASSGPHVPAASAPSIQSDVGMASRTSVLEKSGIIEEDLEKAIYRCRFMA
FLGVFGSLVGSILCFIKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISHLGTERTLSKRNIEHRSNLFGLFTLKERPKWMNITTVNELKTKL
GHVIVMLLLIGFFDKSKKAAIQSPGDLLCLAASVFLSSGSLFLLSKLTE