CuGenDBv2

Moc01g34800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g34800
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr1:24632101..24642165
RNA-Seq Expression: Moc01g34800
Synteny: Moc01g34800
Gene Ontology terms: NA
InterPro domains: NA
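
The genome location string can be split into its parts when working with this record programmatically. The following is a minimal sketch, assuming the `chrom:start..end` layout shown above and assuming the coordinates are 1-based and inclusive (the page does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(text):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr1:24632101..24642165")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the locus spans 10065 bp on chr1.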


Homology
GenBank top hits | e value | % identity | Alignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] | 4.3e-101 | 54.31
Query:  MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFT-------LQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSAWLW----
        ++ + DAQVEALKAKC++KE   +DG++G+SPFT       + P   +     Y+GSKD K+YVEVFE LMDFQAASDAIKCRAF+IALTGSA LW    
Subjt:  MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFT-------LQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSAWLW----

Query:  --------------------------KTTIHLATIRQKDGETVREYVTRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKP
                                  KT  HLATIRQK+GET+REYVTRFQEEQLKV H SDDS MCYFLTGL DE+ TVKLGEE  +TF EV QK KK 
Subjt:  --------------------------KTTIHLATIRQKDGETVREYVTRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKP

Query:  V----------------------------------------QGRPDYRRSDSGSIRSRPYERYTPKTIPIYEILTNIDETEMEKLLKRPEKLRGDPKKRN
        +                                         GR +YRR+++G  RSRPYER+TP TIPI EILTNI+E+ MEKLLKRPEKLRG P++R+
Subjt:  V----------------------------------------QGRPDYRRSDSGSIRSRPYERYTPKTIPIYEILTNIDETEMEKLLKRPEKLRGDPKKRN

Query:  KDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRPTVINTIFRGPALRVAAGNAQNQGKEPQRS
        KD+YCRF+R+H H+TS +WELK QIE+LI D YFKK+VGK R S AEKK+ERKRSRTP +R DRP VINTIF GP    + G +  + KE  R+
Subjt:  KDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRPTVINTIFRGPALRVAAGNAQNQGKEPQRS

XP_022141796.1 uncharacterized protein LOC111012081 [Momordica charantia] | 6.2e-100 | 55.85
Query:  MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFT-------LQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSAWLW----
        ++ + DAQ EALKAKC++KE   +DG++G+SPFT       + P   +     Y+GSKD K+YVEVFEGLMDFQA SDAIKCRAFQIALTGSA LW    
Subjt:  MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFT-------LQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSAWLW----

Query:  --------------------------KTTIHLATIRQKDGETVREYVTRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKP
                                  KT  HLATIRQK+GET+REYVTRFQEEQLKV H SDDS MCYFLTGL DE+ TVKLGEE  STF EV QKTKK 
Subjt:  --------------------------KTTIHLATIRQKDGETVREYVTRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKP

Query:  V----------------------------------------QGRPDYRRSDSGSIRSRPYERYTPKTIPIYEILTNIDETEMEKLLKRPEKLRGDPKKRN
        +                                         GR +YRR+++G  RSRPYER+TP TIPI EILT I+E+ MEKLLKRPEKLR   ++R+
Subjt:  V----------------------------------------QGRPDYRRSDSGSIRSRPYERYTPKTIPIYEILTNIDETEMEKLLKRPEKLRGDPKKRN

Query:  KDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRPTVINTIFRGPA
        KD+YCRF+R+H H+TS  WELK QIEDLI D YFKK+VGK R S AEKK+ERKRSRTP +R DRP VINTIF GP+
Subjt:  KDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRPTVINTIFRGPA

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] | 6.2e-100 | 57.03
Query:  MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFT-------LQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSAWLW----
        +K+KFDAQVEALKA+C+KKESSFDDG++G+  F+       + P   +     Y+GSKD K+YVEVFE LMDFQAA+DAIKC AFQIALTGSA LW    
Subjt:  MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFT-------LQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSAWLW----

Query:  --------------------------KTTIHLATIRQKDGETVREYVTRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKP
                                  KT  HLATIRQK+GET+REYVTRF EEQLKV H SDDS MCYFLTGL DE+ TVKL EE  +TF EV QKTKK 
Subjt:  --------------------------KTTIHLATIRQKDGETVREYVTRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKP

Query:  VQG-----------------------------------------RPDYRRSDSGSIRSRPYERYTPKTIPIYEILTNIDETEMEKLLKRPEKLRGDPKKR
        + G                                         R DYRRS+S   +SRPYE YTP TIPI+EILTNI+ET MEKLLKRPEKLRGDP+KR
Subjt:  VQG-----------------------------------------RPDYRRSDSGSIRSRPYERYTPKTIPIYEILTNIDETEMEKLLKRPEKLRGDPKKR

Query:  NKDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRPTVIN
        N D+YCRF+RDH H+TS++WELK QIEDLI D YFKK+VGK R +  EKK+ERKR RTP +RDDRP VIN
Subjt:  NKDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRPTVIN

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia] | 9.3e-104 | 53.75
Query:  MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFT-------LQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSAWLW----
        ++ + DAQVEALKAKC++K+ S +DG++G+SPFT       + P   +     Y+G+KD K+YVEVFEGLMDFQAASDAIKCRAFQIALTGSA LW    
Subjt:  MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFT-------LQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSAWLW----

Query:  --------------------------KTTIHLATIRQKDGETVREYVTRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKP
                                  KT  HLATIRQK+GET+REYVTRFQEEQLKV H SDDS MCYFLTGL DE+ TVKLGEE  +TF EV QK KK 
Subjt:  --------------------------KTTIHLATIRQKDGETVREYVTRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKP

Query:  V----------------------------------------QGRPDYRRSDSGSIRSRPYERYTPKTIPIYEILTNIDETEMEKLLKRPEKLRGDPKKRN
        +                                         GR +YRR+++G  RSRPYER+TP TIPI+EILTNI+E+ MEKLLKRPEKLRG P++R+
Subjt:  V----------------------------------------QGRPDYRRSDSGSIRSRPYERYTPKTIPIYEILTNIDETEMEKLLKRPEKLRGDPKKRN

Query:  KDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRPTVINTIFRGPALRVAAGNAQNQGKEPQRSRNFWWP
        KD+YCRF+R+H H+TS FWELK QIEDLI D YFKK+VGK R S AEKK+ERKRSRTP +R DRP VINTIF GP    + G   ++ KE  R+      
Subjt:  KDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRPTVINTIFRGPALRVAAGNAQNQGKEPQRSRNFWWP

Query:  EARTTQVRPMQNR
         AR    R  +NR
Subjt:  EARTTQVRPMQNR

XP_022159160.1 uncharacterized protein LOC111025585 [Momordica charantia] | 7.1e-104 | 65.85
Query:  MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFT-------LQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGS-----AWL
        MK++FD QVE LK KC+K ES F+DGEMG+SPFT       + P         Y+GSKD K+YVEVFE LMDFQAASDAIKCRAFQIALTG+        
Subjt:  MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFT-------LQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGS-----AWL

Query:  WKTTIHLATIRQKDGETVREYVTRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKPVQGRPDYRRSDSGSIRSRPYERYTP
         K   HLATIRQK+GET+REYVTRFQEEQLKVTH SDDS MCYFLT L DE+ TVKLGEE  STF EV QK KK + G     RSDS S R  PYE YTP
Subjt:  WKTTIHLATIRQKDGETVREYVTRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKPVQGRPDYRRSDSGSIRSRPYERYTP

Query:  KTIPIYEILTNIDETEMEKLLKRPEKLRGDPKKRNKDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRP
         TIPI EILTNI+E  +EKLLKRP+KLRGDP+KRNKDRYCRF+RDH +DTS+ WELK QIEDLI D Y KKYVGK   S   KK+ERKRSRTP +RDDRP
Subjt:  KTIPIYEILTNIDETEMEKLLKRPEKLRGDPKKRNKDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRP

Query:  TVINTIFRGPALRVAAGNAQNQGKEPQR
         VINTIF GP    + G + N+ KE  R
Subjt:  TVINTIFRGPALRVAAGNAQNQGKEPQR

TrEMBL top hits | e value | % identity | Alignment
A0A6J1C7X5 uncharacterized protein LOC111008813 | 2.1e-101 | 54.31
Query:  MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFT-------LQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSAWLW----
        ++ + DAQVEALKAKC++KE   +DG++G+SPFT       + P   +     Y+GSKD K+YVEVFE LMDFQAASDAIKCRAF+IALTGSA LW    
Subjt:  MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFT-------LQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSAWLW----

Query:  --------------------------KTTIHLATIRQKDGETVREYVTRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKP
                                  KT  HLATIRQK+GET+REYVTRFQEEQLKV H SDDS MCYFLTGL DE+ TVKLGEE  +TF EV QK KK 
Subjt:  --------------------------KTTIHLATIRQKDGETVREYVTRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKP

Query:  V----------------------------------------QGRPDYRRSDSGSIRSRPYERYTPKTIPIYEILTNIDETEMEKLLKRPEKLRGDPKKRN
        +                                         GR +YRR+++G  RSRPYER+TP TIPI EILTNI+E+ MEKLLKRPEKLRG P++R+
Subjt:  V----------------------------------------QGRPDYRRSDSGSIRSRPYERYTPKTIPIYEILTNIDETEMEKLLKRPEKLRGDPKKRN

Query:  KDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRPTVINTIFRGPALRVAAGNAQNQGKEPQRS
        KD+YCRF+R+H H+TS +WELK QIE+LI D YFKK+VGK R S AEKK+ERKRSRTP +R DRP VINTIF GP    + G +  + KE  R+
Subjt:  KDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRPTVINTIFRGPALRVAAGNAQNQGKEPQRS

A0A6J1CKB3 uncharacterized protein LOC111012081 | 3.0e-100 | 55.85
Query:  MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFT-------LQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSAWLW----
        ++ + DAQ EALKAKC++KE   +DG++G+SPFT       + P   +     Y+GSKD K+YVEVFEGLMDFQA SDAIKCRAFQIALTGSA LW    
Subjt:  MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFT-------LQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSAWLW----

Query:  --------------------------KTTIHLATIRQKDGETVREYVTRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKP
                                  KT  HLATIRQK+GET+REYVTRFQEEQLKV H SDDS MCYFLTGL DE+ TVKLGEE  STF EV QKTKK 
Subjt:  --------------------------KTTIHLATIRQKDGETVREYVTRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKP

Query:  V----------------------------------------QGRPDYRRSDSGSIRSRPYERYTPKTIPIYEILTNIDETEMEKLLKRPEKLRGDPKKRN
        +                                         GR +YRR+++G  RSRPYER+TP TIPI EILT I+E+ MEKLLKRPEKLR   ++R+
Subjt:  V----------------------------------------QGRPDYRRSDSGSIRSRPYERYTPKTIPIYEILTNIDETEMEKLLKRPEKLRGDPKKRN

Query:  KDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRPTVINTIFRGPA
        KD+YCRF+R+H H+TS  WELK QIEDLI D YFKK+VGK R S AEKK+ERKRSRTP +R DRP VINTIF GP+
Subjt:  KDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRPTVINTIFRGPA

A0A6J1DHB3 uncharacterized protein LOC111020479 | 3.0e-100 | 57.03
Query:  MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFT-------LQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSAWLW----
        +K+KFDAQVEALKA+C+KKESSFDDG++G+  F+       + P   +     Y+GSKD K+YVEVFE LMDFQAA+DAIKC AFQIALTGSA LW    
Subjt:  MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFT-------LQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSAWLW----

Query:  --------------------------KTTIHLATIRQKDGETVREYVTRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKP
                                  KT  HLATIRQK+GET+REYVTRF EEQLKV H SDDS MCYFLTGL DE+ TVKL EE  +TF EV QKTKK 
Subjt:  --------------------------KTTIHLATIRQKDGETVREYVTRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKP

Query:  VQG-----------------------------------------RPDYRRSDSGSIRSRPYERYTPKTIPIYEILTNIDETEMEKLLKRPEKLRGDPKKR
        + G                                         R DYRRS+S   +SRPYE YTP TIPI+EILTNI+ET MEKLLKRPEKLRGDP+KR
Subjt:  VQG-----------------------------------------RPDYRRSDSGSIRSRPYERYTPKTIPIYEILTNIDETEMEKLLKRPEKLRGDPKKR

Query:  NKDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRPTVIN
        N D+YCRF+RDH H+TS++WELK QIEDLI D YFKK+VGK R +  EKK+ERKR RTP +RDDRP VIN
Subjt:  NKDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRPTVIN

A0A6J1DS95 uncharacterized protein LOC111023421 | 4.5e-104 | 53.75
Query:  MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFT-------LQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSAWLW----
        ++ + DAQVEALKAKC++K+ S +DG++G+SPFT       + P   +     Y+G+KD K+YVEVFEGLMDFQAASDAIKCRAFQIALTGSA LW    
Subjt:  MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFT-------LQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSAWLW----

Query:  --------------------------KTTIHLATIRQKDGETVREYVTRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKP
                                  KT  HLATIRQK+GET+REYVTRFQEEQLKV H SDDS MCYFLTGL DE+ TVKLGEE  +TF EV QK KK 
Subjt:  --------------------------KTTIHLATIRQKDGETVREYVTRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKP

Query:  V----------------------------------------QGRPDYRRSDSGSIRSRPYERYTPKTIPIYEILTNIDETEMEKLLKRPEKLRGDPKKRN
        +                                         GR +YRR+++G  RSRPYER+TP TIPI+EILTNI+E+ MEKLLKRPEKLRG P++R+
Subjt:  V----------------------------------------QGRPDYRRSDSGSIRSRPYERYTPKTIPIYEILTNIDETEMEKLLKRPEKLRGDPKKRN

Query:  KDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRPTVINTIFRGPALRVAAGNAQNQGKEPQRSRNFWWP
        KD+YCRF+R+H H+TS FWELK QIEDLI D YFKK+VGK R S AEKK+ERKRSRTP +R DRP VINTIF GP    + G   ++ KE  R+      
Subjt:  KDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRPTVINTIFRGPALRVAAGNAQNQGKEPQRSRNFWWP

Query:  EARTTQVRPMQNR
         AR    R  +NR
Subjt:  EARTTQVRPMQNR

A0A6J1DXW4 uncharacterized protein LOC111025585 | 3.4e-104 | 65.85
Query:  MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFT-------LQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGS-----AWL
        MK++FD QVE LK KC+K ES F+DGEMG+SPFT       + P         Y+GSKD K+YVEVFE LMDFQAASDAIKCRAFQIALTG+        
Subjt:  MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFT-------LQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGS-----AWL

Query:  WKTTIHLATIRQKDGETVREYVTRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKPVQGRPDYRRSDSGSIRSRPYERYTP
         K   HLATIRQK+GET+REYVTRFQEEQLKVTH SDDS MCYFLT L DE+ TVKLGEE  STF EV QK KK + G     RSDS S R  PYE YTP
Subjt:  WKTTIHLATIRQKDGETVREYVTRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKPVQGRPDYRRSDSGSIRSRPYERYTP

Query:  KTIPIYEILTNIDETEMEKLLKRPEKLRGDPKKRNKDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRP
         TIPI EILTNI+E  +EKLLKRP+KLRGDP+KRNKDRYCRF+RDH +DTS+ WELK QIEDLI D Y KKYVGK   S   KK+ERKRSRTP +RDDRP
Subjt:  KTIPIYEILTNIDETEMEKLLKRPEKLRGDPKKRNKDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRP

Query:  TVINTIFRGPALRVAAGNAQNQGKEPQR
         VINTIF GP    + G + N+ KE  R
Subjt:  TVINTIFRGPALRVAAGNAQNQGKEPQR

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGAAGAATAAGTTCGATGCTCAGGTTGAGGCTCTTAAAGCAAAGTGCGACAAGAAAGAAAGCTCGTTCGATGATGGCGAGATGGGCAAATCGCCATTCACTTTACAACC
CGATTTGCCATCTATTCACTTTGGATATTACGAAGGATCTAAGGATCTGAAGAACTATGTTGAGGTCTTCGAAGGCCTCATGGATTTTCAGGCGGCATCTGATGCTATTA
AGTGTCGAGCATTCCAGATTGCCTTAACTGGAAGCGCTTGGCTATGGAAGACAACCATCCATCTCGCCACCATTCGACAAAAGGATGGCGAGACAGTTAGAGAGTACGTG
ACCAGATTCCAGGAAGAGCAGCTCAAGGTCACGCACTACTCTGACGATTCTACCATGTGCTACTTCCTCACCGGCCTGACCGACGAGTCACCGACGGTAAAATTGGGAGA
GGAAGTGTCGTCCACCTTTGTTGAGGTCTTCCAGAAGACCAAAAAGCCCGTCCAGGGCCGACCTGACTATAGAAGGTCTGATTCAGGCTCCATCCGAAGCCGACCTTATG
AGCGATACACTCCAAAAACCATCCCAATCTATGAGATCCTTACCAACATCGATGAGACAGAGATGGAGAAGCTTCTCAAGCGCCCAGAAAAGCTTAGGGGAGATCCTAAA
AAGCGCAACAAGGACAGGTATTGTCGTTTCTACCGCGACCACGACCATGATACCTCGAGTTTCTGGGAGTTGAAGTGGCAGATTGAAGACTTAATTTATGATGACTACTT
CAAGAAATACGTTGGCAAGCTGAGATTGAGCTTGGCAGAAAAGAAAGACGAGAGGAAGCGTTCAAGAACGCCACTTCAGAGAGATGATCGACCTACGGTCATCAACACTA
TTTTTAGAGGTCCAGCTTTGAGGGTAGCTGCTGGGAATGCTCAGAACCAAGGCAAAGAACCACAGCGTTCAAGAAACTTTTGGTGGCCAGAAGCTCGGACTACACAGGTG
AGACCTATGCAAAATAGAGGAGTTATATGGTTTTCTGATGAAGAAAATTCTAAAGAAAAAATGGTAGAACTAGCAGTGTGCAAAGCGCGCCAGGGGATGGAAATAGAGGA
GGGAGGAGAAGAAGTTAACGAGCAAGCCCCAACAGTTTCTACAGAATCGCTACACTTAAGCGAGGATGAAGGAGAAGCATTCGTAGAGAGTCATCCGATGCAGGATGCAC
AATCAAATCATGACTCCGACAACCACGATTCTTGGTGTACAAGGACACCTCTCACTTGGACTATTCCATCAAAAGAAGGTCCGGCAACAGTAATTCATATTTATGTGATG
ATTCAAACGAGGAGTATGAGCCCACAATTGATTGATACGATGACCATTGCCGAGAAATGGCGGTTTGGGTGGATCCGGCAATCACAAAAAGCCCAGTTGATTCACCCTCG
GTCTTTAGCCGAGAGGCATGGGAACATGTGGACATTTTTCATTATGCAACTTTCCCATCCTCACATACTCGGAGATTCGAGCACCAGGGAGTACTTCTCAGTTGTTTCCT
TAGTTATCAATGAAGGACAGAAAGAGTACATGTCGACTCCTATTACTGCTTTATTAGGTGGTGATAAACTTAATGGCGAGAACTACAAACAATGGAAATCGAACCTAAAC
ACTATACTAGTGATAGATGATCTTAGGTTCGTCTTTCAAAAGGATTGTCCTCAAGCTCCTACGCCTAACGCCACTGTGGCAGTGCGCAATGCCTATGACCGGTGGATCTT
GGCGAGCATATCTGATGTGCTTGCTAAAAAGCATGAATACATGGTCACCGCCAAGCAGATCATGGACTCACTGCAAAGCATAACACTTCAATGTGGCGGAGTCGAACGGG
GCCGTAATAAAAGAGCAGAGTCAGACCTATCAGTCTCTTATGAAAATAAGGGACAGGAAAATGAGGCAAATGTTGCCACCTCAAAATGGTTCTATCGAGGTTCAACCTCT
AAAACCAAGTCTGTGTGTTCTTCTTCTAGAAGTAAGACTTTTAAGAAGAAGAAGACGGCTGGTAAAGGGCCAAAAACTGACTCCGCTGCTGCTGCTGCTGCTGCCAAGAA
AGACAATGTCAAGGCTGTAGACAAAGGAAAATGTTTCCACTGCAACTTGGATAGGCATTGGAAATTCAATTGCCTGAAATACACGGTCGAGAAAAAGAAAGCCAACGAAG
ACAAATATGATTTACTTGTTTTGGAAACTTGTTTAGTGGAAAATGCTTATTCCGCCTCGATACTTGATTCAGGAACCACTATCCATGGTTTATTTTTTATTTCAAGACAT
TAG
mRNA sequence
ATGAAGAATAAGTTCGATGCTCAGGTTGAGGCTCTTAAAGCAAAGTGCGACAAGAAAGAAAGCTCGTTCGATGATGGCGAGATGGGCAAATCGCCATTCACTTTACAACC
CGATTTGCCATCTATTCACTTTGGATATTACGAAGGATCTAAGGATCTGAAGAACTATGTTGAGGTCTTCGAAGGCCTCATGGATTTTCAGGCGGCATCTGATGCTATTA
AGTGTCGAGCATTCCAGATTGCCTTAACTGGAAGCGCTTGGCTATGGAAGACAACCATCCATCTCGCCACCATTCGACAAAAGGATGGCGAGACAGTTAGAGAGTACGTG
ACCAGATTCCAGGAAGAGCAGCTCAAGGTCACGCACTACTCTGACGATTCTACCATGTGCTACTTCCTCACCGGCCTGACCGACGAGTCACCGACGGTAAAATTGGGAGA
GGAAGTGTCGTCCACCTTTGTTGAGGTCTTCCAGAAGACCAAAAAGCCCGTCCAGGGCCGACCTGACTATAGAAGGTCTGATTCAGGCTCCATCCGAAGCCGACCTTATG
AGCGATACACTCCAAAAACCATCCCAATCTATGAGATCCTTACCAACATCGATGAGACAGAGATGGAGAAGCTTCTCAAGCGCCCAGAAAAGCTTAGGGGAGATCCTAAA
AAGCGCAACAAGGACAGGTATTGTCGTTTCTACCGCGACCACGACCATGATACCTCGAGTTTCTGGGAGTTGAAGTGGCAGATTGAAGACTTAATTTATGATGACTACTT
CAAGAAATACGTTGGCAAGCTGAGATTGAGCTTGGCAGAAAAGAAAGACGAGAGGAAGCGTTCAAGAACGCCACTTCAGAGAGATGATCGACCTACGGTCATCAACACTA
TTTTTAGAGGTCCAGCTTTGAGGGTAGCTGCTGGGAATGCTCAGAACCAAGGCAAAGAACCACAGCGTTCAAGAAACTTTTGGTGGCCAGAAGCTCGGACTACACAGGTG
AGACCTATGCAAAATAGAGGAGTTATATGGTTTTCTGATGAAGAAAATTCTAAAGAAAAAATGGTAGAACTAGCAGTGTGCAAAGCGCGCCAGGGGATGGAAATAGAGGA
GGGAGGAGAAGAAGTTAACGAGCAAGCCCCAACAGTTTCTACAGAATCGCTACACTTAAGCGAGGATGAAGGAGAAGCATTCGTAGAGAGTCATCCGATGCAGGATGCAC
AATCAAATCATGACTCCGACAACCACGATTCTTGGTGTACAAGGACACCTCTCACTTGGACTATTCCATCAAAAGAAGGTCCGGCAACAGTAATTCATATTTATGTGATG
ATTCAAACGAGGAGTATGAGCCCACAATTGATTGATACGATGACCATTGCCGAGAAATGGCGGTTTGGGTGGATCCGGCAATCACAAAAAGCCCAGTTGATTCACCCTCG
GTCTTTAGCCGAGAGGCATGGGAACATGTGGACATTTTTCATTATGCAACTTTCCCATCCTCACATACTCGGAGATTCGAGCACCAGGGAGTACTTCTCAGTTGTTTCCT
TAGTTATCAATGAAGGACAGAAAGAGTACATGTCGACTCCTATTACTGCTTTATTAGGTGGTGATAAACTTAATGGCGAGAACTACAAACAATGGAAATCGAACCTAAAC
ACTATACTAGTGATAGATGATCTTAGGTTCGTCTTTCAAAAGGATTGTCCTCAAGCTCCTACGCCTAACGCCACTGTGGCAGTGCGCAATGCCTATGACCGGTGGATCTT
GGCGAGCATATCTGATGTGCTTGCTAAAAAGCATGAATACATGGTCACCGCCAAGCAGATCATGGACTCACTGCAAAGCATAACACTTCAATGTGGCGGAGTCGAACGGG
GCCGTAATAAAAGAGCAGAGTCAGACCTATCAGTCTCTTATGAAAATAAGGGACAGGAAAATGAGGCAAATGTTGCCACCTCAAAATGGTTCTATCGAGGTTCAACCTCT
AAAACCAAGTCTGTGTGTTCTTCTTCTAGAAGTAAGACTTTTAAGAAGAAGAAGACGGCTGGTAAAGGGCCAAAAACTGACTCCGCTGCTGCTGCTGCTGCTGCCAAGAA
AGACAATGTCAAGGCTGTAGACAAAGGAAAATGTTTCCACTGCAACTTGGATAGGCATTGGAAATTCAATTGCCTGAAATACACGGTCGAGAAAAAGAAAGCCAACGAAG
ACAAATATGATTTACTTGTTTTGGAAACTTGTTTAGTGGAAAATGCTTATTCCGCCTCGATACTTGATTCAGGAACCACTATCCATGGTTTATTTTTTATTTCAAGACAT
TAG
Protein sequence
MKNKFDAQVEALKAKCDKKESSFDDGEMGKSPFTLQPDLPSIHFGYYEGSKDLKNYVEVFEGLMDFQAASDAIKCRAFQIALTGSAWLWKTTIHLATIRQKDGETVREYV
TRFQEEQLKVTHYSDDSTMCYFLTGLTDESPTVKLGEEVSSTFVEVFQKTKKPVQGRPDYRRSDSGSIRSRPYERYTPKTIPIYEILTNIDETEMEKLLKRPEKLRGDPK
KRNKDRYCRFYRDHDHDTSSFWELKWQIEDLIYDDYFKKYVGKLRLSLAEKKDERKRSRTPLQRDDRPTVINTIFRGPALRVAAGNAQNQGKEPQRSRNFWWPEARTTQV
RPMQNRGVIWFSDEENSKEKMVELAVCKARQGMEIEEGGEEVNEQAPTVSTESLHLSEDEGEAFVESHPMQDAQSNHDSDNHDSWCTRTPLTWTIPSKEGPATVIHIYVM
IQTRSMSPQLIDTMTIAEKWRFGWIRQSQKAQLIHPRSLAERHGNMWTFFIMQLSHPHILGDSSTREYFSVVSLVINEGQKEYMSTPITALLGGDKLNGENYKQWKSNLN
TILVIDDLRFVFQKDCPQAPTPNATVAVRNAYDRWILASISDVLAKKHEYMVTAKQIMDSLQSITLQCGGVERGRNKRAESDLSVSYENKGQENEANVATSKWFYRGSTS
KTKSVCSSSRSKTFKKKKTAGKGPKTDSAAAAAAAKKDNVKAVDKGKCFHCNLDRHWKFNCLKYTVEKKKANEDKYDLLVLETCLVENAYSASILDSGTTIHGLFFISRH
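
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; `translate` is a hypothetical helper written for illustration, not a CuGenDB function. It strips line breaks, reads codons in frame from the first base, and stops at the first stop codon.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first. A codon containing
    anything other than A, C, G, T raises KeyError.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGAAGAATAAGTTCGATGCTCAG"))  # MKNKFDAQ
```

Passing the full CDS text and comparing the result with the protein sequence (after joining its lines) would confirm whether the two listings agree.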