; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh08G004730 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh08G004730
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionUnknown protein
Genome locationCmo_Chr08:2913036..2916064
RNA-Seq ExpressionCmoCh08G004730
SyntenyCmoCh08G004730
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593251.1 WRKY transcription factor 23, partial [Cucurbita argyrosperma subsp. sororia]2.5e-10594.67Show/hide
Query:  MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREV-VANVEKKNDRGK
        MALREGDLTFDLESGAKIVKE++GNVE SSIKR VKNI WGRLT+DTLPRDERAAASSSYVANII DANIKLLIDKNLKGEEGRREV VANVEKKNDRGK
Subjt:  MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREV-VANVEKKNDRGK

Query:  HNNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVGGSS
        HNNKKKAPKPPRPPKGPSLD ADQMLVKEIA+LAMKKRSRIER KAMKKAKAEKT SCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVGGSS
Subjt:  HNNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVGGSS

Query:  GFISVQYIMNFPRNESYIPNSTTSV
        GFISVQYIMNFPRNESYIPNSTTSV
Subjt:  GFISVQYIMNFPRNESYIPNSTTSV

KAG7025603.1 hypothetical protein SDJN02_12100 [Cucurbita argyrosperma subsp. argyrosperma]1.2e-12384.62Show/hide
Query:  MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKNDRGKH
        MALREGDLTFDLESGAKIVKE++GNVEPSSIKR VKNI WGRLT+DTLPRDERAAASSSYVANII DANIKLLIDKNLKGEEGRREVVANVEKKNDRGKH
Subjt:  MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKNDRGKH

Query:  NNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVGGSSG
        NNKKKAPKPPRPPKGPSLDAADQMLVKEIA+LAMKKRSRIERMKAMKKAKAEKT SCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVGGSS 
Subjt:  NNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVGGSSG

Query:  FISVQYIMNFPRNESYIPNSTTSVNNLIRDTCMKTSHGVLCILSLVVHSYRVTDMTRKRIATCYAMYVTGVGRGFVKGHLGRGIGIGIRISIMNLETLR
                                 NLIRDTC+KTSHGVLCILSLVV SYRVTDMTRKRIATCYAMYV G GRGFVK HLGR       ISIMNLETLR
Subjt:  FISVQYIMNFPRNESYIPNSTTSVNNLIRDTCMKTSHGVLCILSLVVHSYRVTDMTRKRIATCYAMYVTGVGRGFVKGHLGRGIGIGIRISIMNLETLR

XP_022959630.1 uncharacterized protein LOC111460650 isoform X1 [Cucurbita moschata]2.3e-114100Show/hide
Query:  MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKNDRGKH
        MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKNDRGKH
Subjt:  MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKNDRGKH

Query:  NNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVGGSSG
        NNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVGGSSG
Subjt:  NNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVGGSSG

Query:  FISVQYIMNFPRNESYIPNSTTSV
        FISVQYIMNFPRNESYIPNSTTSV
Subjt:  FISVQYIMNFPRNESYIPNSTTSV

XP_023513697.1 uncharacterized protein LOC111778230 isoform X1 [Cucurbita pepo subsp. pepo]3.3e-10584.58Show/hide
Query:  MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKNDRGKH
        MALREGDLT DLESGAKIVKE++GNVEPSSIKR VKNIFWGRLT+DTLPRDERAAASSSYVANIIRDA+IKLLIDKNLKGEEGRREVVANVEKKNDRGKH
Subjt:  MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKNDRGKH

Query:  NNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQS----------------------
        NNKKKAPKPPRPPKGPSLDAADQMLVKEIA+LAMKKRSRIERMK +KKAKAEKTSSCNTYIPALIITCLFFLVVI+QS                      
Subjt:  NNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQS----------------------

Query:  -------ISSRSSSLFQGSPEPAVGGSSGFISVQYIMNFPRNESYIPNSTTSV
               ISSRSSSLFQGSPEPAVGGSSGFISVQYIMNFPRNESYIPNSTTSV
Subjt:  -------ISSRSSSLFQGSPEPAVGGSSGFISVQYIMNFPRNESYIPNSTTSV

XP_023513698.1 uncharacterized protein LOC111778230 isoform X2 [Cucurbita pepo subsp. pepo]3.7e-10995.09Show/hide
Query:  MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKNDRGKH
        MALREGDLT DLESGAKIVKE++GNVEPSSIKR VKNIFWGRLT+DTLPRDERAAASSSYVANIIRDA+IKLLIDKNLKGEEGRREVVANVEKKNDRGKH
Subjt:  MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKNDRGKH

Query:  NNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVGGSSG
        NNKKKAPKPPRPPKGPSLDAADQMLVKEIA+LAMKKRSRIERMK +KKAKAEKTSSCNTYIPALIITCLFFLVVI+Q ISSRSSSLFQGSPEPAVGGSSG
Subjt:  NNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVGGSSG

Query:  FISVQYIMNFPRNESYIPNSTTSV
        FISVQYIMNFPRNESYIPNSTTSV
Subjt:  FISVQYIMNFPRNESYIPNSTTSV

TrEMBL top hitse value%identityAlignment
A0A5A7TZQ1 Putative transmembrane protein6.3e-7071.75Show/hide
Query:  MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSS----YVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKND
        MALRE DLTFDLE+G KIV+ED G+ EPSS KR VKNI W RLT+D+L +DERA AS+S     VA+II D ++ LLIDKNL+GE+   EV A++EK N 
Subjt:  MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSS----YVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKND

Query:  RGKHNNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVG
        RGKH NKKKA KPPRPPKGPSLDAAD+M+VKE+A LAMKKR+R ERMKA+KKAKAEKTSS N+ IPALIIT LFFLV+IIQ IS RSSS+ QGSPEPAVG
Subjt:  RGKHNNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVG

Query:  GSSGFISVQYIMNFPRNESYIPN
        GSSGFISVQYI +FP +ES + N
Subjt:  GSSGFISVQYIMNFPRNESYIPN

A0A5D3D3X4 Putative transmembrane protein6.3e-7071.75Show/hide
Query:  MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSS----YVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKND
        MALRE DLTFDLE+G KIV+ED G+ EPSS KR VKNI W RLT+D+L +DERA AS+S     VA+II D ++ LLIDKNL+GE+   EV A++EK N 
Subjt:  MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSS----YVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKND

Query:  RGKHNNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVG
        RGKH NKKKA KPPRPPKGPSLDAAD+M+VKE+A LAMKKR+R ERMKA+KKAKAEKTSS N+ IPALIIT LFFLV+IIQ IS RSSS+ QGSPEPAVG
Subjt:  RGKHNNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVG

Query:  GSSGFISVQYIMNFPRNESYIPN
        GSSGFISVQYI +FP +ES + N
Subjt:  GSSGFISVQYIMNFPRNESYIPN

A0A6J1H5E1 uncharacterized protein LOC111460650 isoform X29.0e-10191.52Show/hide
Query:  MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKNDRGKH
        MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGR                
Subjt:  MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKNDRGKH

Query:  NNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVGGSSG
          +KKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVGGSSG
Subjt:  NNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVGGSSG

Query:  FISVQYIMNFPRNESYIPNSTTSV
        FISVQYIMNFPRNESYIPNSTTSV
Subjt:  FISVQYIMNFPRNESYIPNSTTSV

A0A6J1H6U5 uncharacterized protein LOC111460650 isoform X11.1e-114100Show/hide
Query:  MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKNDRGKH
        MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKNDRGKH
Subjt:  MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKNDRGKH

Query:  NNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVGGSSG
        NNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVGGSSG
Subjt:  NNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVGGSSG

Query:  FISVQYIMNFPRNESYIPNSTTSV
        FISVQYIMNFPRNESYIPNSTTSV
Subjt:  FISVQYIMNFPRNESYIPNSTTSV

A0A6J1KSQ4 uncharacterized protein LOC1114978851.8e-9384.82Show/hide
Query:  MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKNDRGKH
        M LREGDLTFDLESGAKIVKE++GNVEPSSIKR VKNIFWGRLT+DTLPRDERAAASSSYVANIIRDANIKLLI+KNLKGEEGR                
Subjt:  MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKNDRGKH

Query:  NNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVGGSSG
          +KKAPKPPRPPKGPSLDAADQMLVKE A+LAMKKRSRIERMKA+KKAKAEKTSSCNTYIPAL+IT LFFLVVIIQ ISSRSSSLFQGSP+PAVGGSSG
Subjt:  NNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVGGSSG

Query:  FISVQYIMNFPRNESYIPNSTTSV
        F+ VQYIMNFPRNESYIPNSTTSV
Subjt:  FISVQYIMNFPRNESYIPNSTTSV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02380.1 unknown protein2.2e-1937.05Show/hide
Query:  EGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKNDRGKHNNKK
        E DL  D+E+G   V ++S +   S       N  W           ERA    S    I  D +  L+ D+N    E   + +   EKK   GK    +
Subjt:  EGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKNDRGKHNNKK

Query:  KAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERM-KAMKKAKAEKTSSCNTYIP--ALIITCLFFLVVIIQSISSRSSSL-FQGSPEPAVGGSSG
        KA KPPRPPKGPSL   D+ ++++I +LAM+KR+RIERM K++K+ KA KTS  +  I   ++IIT +FF  ++ Q  S+ SSS+    SP P V  ++ 
Subjt:  KAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERM-KAMKKAKAEKTSSCNTYIP--ALIITCLFFLVVIIQSISSRSSSL-FQGSPEPAVGGSSG

Query:  FISVQYIMNFPRNESYIPNSTTSV
         ISVQ+  +F   E   P+ TTS+
Subjt:  FISVQYIMNFPRNESYIPNSTTSV

AT3G17120.1 unknown protein8.5e-1948.76Show/hide
Query:  KNDRGKHNNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNT---YIPALIITCLFFLVVIIQSISSRSSSLFQGS
        ++DR K   KK A KPPRPP+GPSLDAADQ L++EIA+LAM KR+RIERM+A+KK++A K +S  +    + A + T +FF V++ Q +S R++    G 
Subjt:  KNDRGKHNNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNT---YIPALIITCLFFLVVIIQSISSRSSSLFQGS

Query:  PEPAVGG--SSGFISVQYIMN
            V G  + GF+SVQY  N
Subjt:  PEPAVGG--SSGFISVQYIMN

AT3G17120.2 unknown protein8.5e-1948.76Show/hide
Query:  KNDRGKHNNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNT---YIPALIITCLFFLVVIIQSISSRSSSLFQGS
        ++DR K   KK A KPPRPP+GPSLDAADQ L++EIA+LAM KR+RIERM+A+KK++A K +S  +    + A + T +FF V++ Q +S R++    G 
Subjt:  KNDRGKHNNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNT---YIPALIITCLFFLVVIIQSISSRSSSLFQGS

Query:  PEPAVGG--SSGFISVQYIMN
            V G  + GF+SVQY  N
Subjt:  PEPAVGG--SSGFISVQYIMN

AT4G01960.1 unknown protein6.5e-1942.38Show/hide
Query:  LKGEEGRREVVAN----VEKKNDRGKHNNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLV
        L GE  RRE  +      + K D  K    +K  KPPRPPKGP L A DQ L++EI +LAM+KR+RIERMK +++ KA K+SS  + I A+I+T +FF+ 
Subjt:  LKGEEGRREVVAN----VEKKNDRGKHNNKKKAPKPPRPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLV

Query:  VIIQSISSRSSSL-FQGSPEPAVGGSSGFISVQYIMNFPRNESYIPNSTTS
        +I Q   + ++SL    SP P    ++  +SVQ+   F   E   P+ TTS
Subjt:  VIIQSISSRSSSL-FQGSPEPAVGGSSGFISVQYIMNFPRNESYIPNSTTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTAAGAGAAGGGGATCTCACATTTGATCTGGAAAGCGGGGCGAAGATTGTTAAAGAGGATTCTGGAAATGTGGAGCCAAGTTCAATCAAACGAGCGGTGAAGAA
CATTTTTTGGGGTAGGTTGACCAAGGATACATTGCCACGAGACGAACGAGCTGCAGCCTCGAGCAGTTACGTTGCCAATATCATCCGTGATGCGAACATAAAATTGTTGA
TAGATAAGAATTTGAAAGGAGAAGAGGGTCGTCGTGAAGTCGTTGCTAATGTCGAGAAGAAGAATGATAGAGGGAAGCATAATAACAAGAAAAAGGCTCCAAAGCCACCT
CGACCGCCCAAGGGACCTTCACTTGACGCTGCTGACCAAATGCTGGTCAAGGAAATCGCCAAGCTTGCCATGAAAAAACGATCGAGAATCGAGCGAATGAAAGCGATGAA
GAAGGCAAAAGCTGAGAAAACATCTTCTTGCAATACTTACATACCAGCATTGATTATCACATGCCTCTTCTTCCTTGTAGTAATCATTCAAAGTATAAGCTCTAGAAGCA
GTTCATTGTTCCAGGGGTCGCCCGAACCGGCCGTTGGTGGTAGTAGCGGTTTCATTTCGGTTCAGTACATTATGAACTTTCCCAGAAATGAAAGCTATATACCCAATTCC
ACCACCTCTGTTAACAATCTCATTAGAGATACTTGTATGAAAACATCTCATGGAGTTCTCTGCATTTTGAGCCTTGTGGTCCATTCTTATCGTGTGACCGACATGACACG
AAAAAGGATAGCGACTTGCTATGCCATGTATGTGACAGGGGTGGGTCGAGGCTTTGTGAAGGGGCATTTGGGTCGCGGCATTGGCATCGGCATCCGCATCAGCATCATGA
ATCTTGAAACGCTGCGACGTTAA
mRNA sequenceShow/hide mRNA sequence
GTGACCGACGATTCAGAATCAAGTTTCATGGGAGTTACTGTTTCCGTTGGATTGATTCAATCGTCGGTCGCCATCTCCGATTAACACAAATTCGATCATTAATCCCTTGT
TTTTTCTTCGATGATGTTCGTTTTGATTTTTCCCACCAGATGATCGGAAGAACACCATCAAAATCGATTTGGATTTTGTCGGTCTGAGCTTCAACTTCGAGTTCCTCTGT
TCATTTATAGTTCAAAAGGCTATGAAATTGAGTGGATGGCTTTAAGAGAAGGGGATCTCACATTTGATCTGGAAAGCGGGGCGAAGATTGTTAAAGAGGATTCTGGAAAT
GTGGAGCCAAGTTCAATCAAACGAGCGGTGAAGAACATTTTTTGGGGTAGGTTGACCAAGGATACATTGCCACGAGACGAACGAGCTGCAGCCTCGAGCAGTTACGTTGC
CAATATCATCCGTGATGCGAACATAAAATTGTTGATAGATAAGAATTTGAAAGGAGAAGAGGGTCGTCGTGAAGTCGTTGCTAATGTCGAGAAGAAGAATGATAGAGGGA
AGCATAATAACAAGAAAAAGGCTCCAAAGCCACCTCGACCGCCCAAGGGACCTTCACTTGACGCTGCTGACCAAATGCTGGTCAAGGAAATCGCCAAGCTTGCCATGAAA
AAACGATCGAGAATCGAGCGAATGAAAGCGATGAAGAAGGCAAAAGCTGAGAAAACATCTTCTTGCAATACTTACATACCAGCATTGATTATCACATGCCTCTTCTTCCT
TGTAGTAATCATTCAAAGTATAAGCTCTAGAAGCAGTTCATTGTTCCAGGGGTCGCCCGAACCGGCCGTTGGTGGTAGTAGCGGTTTCATTTCGGTTCAGTACATTATGA
ACTTTCCCAGAAATGAAAGCTATATACCCAATTCCACCACCTCTGTTAACAATCTCATTAGAGATACTTGTATGAAAACATCTCATGGAGTTCTCTGCATTTTGAGCCTT
GTGGTCCATTCTTATCGTGTGACCGACATGACACGAAAAAGGATAGCGACTTGCTATGCCATGTATGTGACAGGGGTGGGTCGAGGCTTTGTGAAGGGGCATTTGGGTCG
CGGCATTGGCATCGGCATCCGCATCAGCATCATGAATCTTGAAACGCTGCGACGTTAA
Protein sequenceShow/hide protein sequence
MALREGDLTFDLESGAKIVKEDSGNVEPSSIKRAVKNIFWGRLTKDTLPRDERAAASSSYVANIIRDANIKLLIDKNLKGEEGRREVVANVEKKNDRGKHNNKKKAPKPP
RPPKGPSLDAADQMLVKEIAKLAMKKRSRIERMKAMKKAKAEKTSSCNTYIPALIITCLFFLVVIIQSISSRSSSLFQGSPEPAVGGSSGFISVQYIMNFPRNESYIPNS
TTSVNNLIRDTCMKTSHGVLCILSLVVHSYRVTDMTRKRIATCYAMYVTGVGRGFVKGHLGRGIGIGIRISIMNLETLRR