CuGenDBv2

Moc01g20730 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g20730
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: MuDRA-like transposase
Genome location: chr1:14471865..14474402
RNA-Seq Expression: Moc01g20730
Synteny: Moc01g20730
Gene Ontology terms: NA
InterPro domains: IPR004332 - Transposase, MuDR, plant


Homology
GenBank top hits (e value, % identity, alignment)
XP_022131652.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia]; e value: 3.5e-113; % identity: 48.39
Query:  EGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDSVHETANNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKK
        EGDDE EYGNE+ASDRLDVQHEHE+VTIHNTMAEYPVD+VHE A+NR+TGQSE DRLQAMVQSA T+DVKE DVFDSKKELVMKMHLLALRKNFQF+VKK
Subjt:  EGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDSVHETANNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKK

Query:  STPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNDAIMKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDIMQDIREEYGVNMSYDKTWCSS
        STP+LYL+RCIDPTCTWRLR TKIRDCNLFKIKKYIAVHSKCN A+MKQDH QAKSWVVGHLV+SKFTDVSRTYR KDIMQDIREEYGVNMSYDK W SS
Subjt:  STPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNDAIMKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDIMQDIREEYGVNMSYDKTWCSS

Query:  EEALRLIRGDPASS--------------------------------------------------------------------------------------
        EEALRLIRGDPASS                                                                                      
Subjt:  EEALRLIRGDPASS--------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------HARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEKMIAKASDNARRHIVMNINQFNF
                                            HARKL +TALLDHIRGVLQRWFYE RTLASSRQSTLSDYAE+MIA+A DNARRHIVMNI+QFNF
Subjt:  ------------------------------------HARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEKMIAKASDNARRHIVMNINQFNF

Query:  ESSPGFVNIDVQPQKKVVRVGRRQTMRIP
        E   G +N DV  Q +          ++P
Subjt:  ESSPGFVNIDVQPQKKVVRVGRRQTMRIP

XP_022142677.1 uncharacterized protein LOC111012733 [Momordica charantia]; e value: 8.3e-99; % identity: 48.51
Query:  MAEYPVDSVHETANNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFK
        MAEYPVD+VHE ANNR+TGQSE DRLQAMVQSAGT+DVKEGDVFDSKKELVMKMH  ALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFK
Subjt:  MAEYPVDSVHETANNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFK

Query:  IKKYIAVHSKCNDAIMKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDIMQDIREEYGVNMSYDKTWCSSEEALRLIRGDPASS-----------------
        IKKYIAVHSKCN A+MKQDH QAKSWVVGHLV+SKFTDVSRTYR KDIMQDIREEYGVNMSYDK W  SEEALRLIRGDPASS                 
Subjt:  IKKYIAVHSKCNDAIMKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDIMQDIREEYGVNMSYDKTWCSSEEALRLIRGDPASS-----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----HARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEKMIAKASDNARRHIVMNINQFNFE
             HARKLPVTALLDHIR VLQRWFYERRTLASSRQSTLSDYAE+MI++ASDNARRHIVMNI+QFNFE
Subjt:  -----HARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEKMIAKASDNARRHIVMNINQFNFE

XP_022151512.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia]; e value: 3.4e-92; % identity: 94.02
Query:  MAEYPVDSVHETANNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFK
        MAEYPVD VHETA NRLTGQS ADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFK
Subjt:  MAEYPVDSVHETANNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFK

Query:  IKKYIAVHSKCNDAIMKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDIMQDIREEYGVNMSYDKTWCSSEEALRLIRGDPASSH
        IKKYIAVHSKCN A MKQDH  AKSWVVGHLVRSKFTDVSRTYRLKDIMQDIREEYGVNMSYDK W SSEEALRL RGDPASS+
Subjt:  IKKYIAVHSKCNDAIMKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDIMQDIREEYGVNMSYDKTWCSSEEALRLIRGDPASSH

XP_022155970.1 uncharacterized protein LOC111022954 [Momordica charantia]; e value: 9.2e-98; % identity: 47.68
Query:  RVFISFGGEWKDIEKDYMGGRTRGLTVDSKITYAEFLGHVCRLSSINPLQEDIIIRRVYNFKAKVCVVEITDDDDLTFFFTSEDISELPLYISTVPKKVH
        RVFI+FGGEW D EKDY+GGR RGLTVDS                                                                       
Subjt:  RVFISFGGEWKDIEKDYMGGRTRGLTVDSKITYAEFLGHVCRLSSINPLQEDIIIRRVYNFKAKVCVVEITDDDDLTFFFTSEDISELPLYISTVPKKVH

Query:  QNEPYMPSFPYYLGQHVSNVPIPSACAPPFAKPLFPRPSFSSPSVPSSSSNPSSSRPPPPYFGHIGHDITSLTPLGSNVVPCNLGDDRAYDWDVSGLWNG
                      Q+VS+  +P AC P F +P  P PSF     PSSSSNPSSS+ P  Y+GH+GHDI  LTPL S+VVPCNLGDDR   W++ GLWN 
Subjt:  QNEPYMPSFPYYLGQHVSNVPIPSACAPPFAKPLFPRPSFSSPSVPSSSSNPSSSRPPPPYFGHIGHDITSLTPLGSNVVPCNLGDDRAYDWDVSGLWNG

Query:  SENVDEDSDESYCLMTDTEEGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDS--VHETANNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKK
        ++   ++SDESY  +  +EEGD E E+ N+   D  D + E  E  +     E   D   V +   + L GQ   ++LQ +VQS+GTNDVKEG VFD+KK
Subjt:  SENVDEDSDESYCLMTDTEEGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDS--VHETANNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKK

Query:  ELVMKMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNDAIMKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDI
        EL ++ HL+A+  NFQF+VKKSTPELY+LRC+D +CTWRLRA K+ DCNLFKIKKY ++H+ CN  ++KQDH QAK+WVV HLV++KFTDVS TYR KDI
Subjt:  ELVMKMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNDAIMKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDI

Query:  MQDIREEYGVNMSYDKTWCSSEEALRLIRGDPASSHARKLPVTALLDHIRGVL
        +QD+R+EYGVN+SYDK W S+EEALRLIRGDP +S+   LP    L  IR VL
Subjt:  MQDIREEYGVNMSYDKTWCSSEEALRLIRGDPASSHARKLPVTALLDHIRGVL

XP_022157017.1 uncharacterized protein LOC111023843 [Momordica charantia]; e value: 1.5e-87; % identity: 62.08
Query:  GHDITSLTPLGSNVVPCNLGDDRAYDWDVSGLWNGSENVDEDSDESYCLMTDTEEGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDSVHETANN
        GHD+  LTPLGS+VVPCNLGDDR  DWDV G+WN +E   ++S ESY  +  +EEG  + EYGNE   D LD + E +   +H T      ++V     N
Subjt:  GHDITSLTPLGSNVVPCNLGDDRAYDWDVSGLWNGSENVDEDSDESYCLMTDTEEGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDSVHETANN

Query:  RLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNDAI
         LTG    ++LQ +VQS+GTNDV EGDVFD+KKEL +KMHL+A+RKNFQF+VKKSTP+LY+LRC+   CTWRLRATK+++C LFKIKKY A H+ C    
Subjt:  RLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNDAI

Query:  MKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDIMQDIREEYGVNMSYDKTWCSSEEALRLIRGDPASSH
        +K DH QAKSWVVGHLV+ KFTDVSRTYR KDI+QD+R+EYGVN+SYD+ W SSEEALRLIRGDPASS+
Subjt:  MKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDIMQDIREEYGVNMSYDKTWCSSEEALRLIRGDPASSH

TrEMBL top hits (e value, % identity, alignment)
A0A6J1BRM2 protein FAR1-RELATED SEQUENCE 4-like; e value: 1.7e-113; % identity: 48.39
Query:  EGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDSVHETANNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKK
        EGDDE EYGNE+ASDRLDVQHEHE+VTIHNTMAEYPVD+VHE A+NR+TGQSE DRLQAMVQSA T+DVKE DVFDSKKELVMKMHLLALRKNFQF+VKK
Subjt:  EGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDSVHETANNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKK

Query:  STPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNDAIMKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDIMQDIREEYGVNMSYDKTWCSS
        STP+LYL+RCIDPTCTWRLR TKIRDCNLFKIKKYIAVHSKCN A+MKQDH QAKSWVVGHLV+SKFTDVSRTYR KDIMQDIREEYGVNMSYDK W SS
Subjt:  STPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNDAIMKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDIMQDIREEYGVNMSYDKTWCSS

Query:  EEALRLIRGDPASS--------------------------------------------------------------------------------------
        EEALRLIRGDPASS                                                                                      
Subjt:  EEALRLIRGDPASS--------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------HARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEKMIAKASDNARRHIVMNINQFNF
                                            HARKL +TALLDHIRGVLQRWFYE RTLASSRQSTLSDYAE+MIA+A DNARRHIVMNI+QFNF
Subjt:  ------------------------------------HARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEKMIAKASDNARRHIVMNINQFNF

Query:  ESSPGFVNIDVQPQKKVVRVGRRQTMRIP
        E   G +N DV  Q +          ++P
Subjt:  ESSPGFVNIDVQPQKKVVRVGRRQTMRIP

A0A6J1CNJ2 uncharacterized protein LOC111012733; e value: 4.0e-99; % identity: 48.51
Query:  MAEYPVDSVHETANNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFK
        MAEYPVD+VHE ANNR+TGQSE DRLQAMVQSAGT+DVKEGDVFDSKKELVMKMH  ALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFK
Subjt:  MAEYPVDSVHETANNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFK

Query:  IKKYIAVHSKCNDAIMKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDIMQDIREEYGVNMSYDKTWCSSEEALRLIRGDPASS-----------------
        IKKYIAVHSKCN A+MKQDH QAKSWVVGHLV+SKFTDVSRTYR KDIMQDIREEYGVNMSYDK W  SEEALRLIRGDPASS                 
Subjt:  IKKYIAVHSKCNDAIMKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDIMQDIREEYGVNMSYDKTWCSSEEALRLIRGDPASS-----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----HARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEKMIAKASDNARRHIVMNINQFNFE
             HARKLPVTALLDHIR VLQRWFYERRTLASSRQSTLSDYAE+MI++ASDNARRHIVMNI+QFNFE
Subjt:  -----HARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEKMIAKASDNARRHIVMNINQFNFE

A0A6J1DDQ3 protein FAR1-RELATED SEQUENCE 4-like; e value: 1.6e-92; % identity: 94.02
Query:  MAEYPVDSVHETANNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFK
        MAEYPVD VHETA NRLTGQS ADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFK
Subjt:  MAEYPVDSVHETANNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFK

Query:  IKKYIAVHSKCNDAIMKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDIMQDIREEYGVNMSYDKTWCSSEEALRLIRGDPASSH
        IKKYIAVHSKCN A MKQDH  AKSWVVGHLVRSKFTDVSRTYRLKDIMQDIREEYGVNMSYDK W SSEEALRL RGDPASS+
Subjt:  IKKYIAVHSKCNDAIMKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDIMQDIREEYGVNMSYDKTWCSSEEALRLIRGDPASSH

A0A6J1DP00 uncharacterized protein LOC111022954; e value: 4.5e-98; % identity: 47.68
Query:  RVFISFGGEWKDIEKDYMGGRTRGLTVDSKITYAEFLGHVCRLSSINPLQEDIIIRRVYNFKAKVCVVEITDDDDLTFFFTSEDISELPLYISTVPKKVH
        RVFI+FGGEW D EKDY+GGR RGLTVDS                                                                       
Subjt:  RVFISFGGEWKDIEKDYMGGRTRGLTVDSKITYAEFLGHVCRLSSINPLQEDIIIRRVYNFKAKVCVVEITDDDDLTFFFTSEDISELPLYISTVPKKVH

Query:  QNEPYMPSFPYYLGQHVSNVPIPSACAPPFAKPLFPRPSFSSPSVPSSSSNPSSSRPPPPYFGHIGHDITSLTPLGSNVVPCNLGDDRAYDWDVSGLWNG
                      Q+VS+  +P AC P F +P  P PSF     PSSSSNPSSS+ P  Y+GH+GHDI  LTPL S+VVPCNLGDDR   W++ GLWN 
Subjt:  QNEPYMPSFPYYLGQHVSNVPIPSACAPPFAKPLFPRPSFSSPSVPSSSSNPSSSRPPPPYFGHIGHDITSLTPLGSNVVPCNLGDDRAYDWDVSGLWNG

Query:  SENVDEDSDESYCLMTDTEEGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDS--VHETANNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKK
        ++   ++SDESY  +  +EEGD E E+ N+   D  D + E  E  +     E   D   V +   + L GQ   ++LQ +VQS+GTNDVKEG VFD+KK
Subjt:  SENVDEDSDESYCLMTDTEEGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDS--VHETANNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKK

Query:  ELVMKMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNDAIMKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDI
        EL ++ HL+A+  NFQF+VKKSTPELY+LRC+D +CTWRLRA K+ DCNLFKIKKY ++H+ CN  ++KQDH QAK+WVV HLV++KFTDVS TYR KDI
Subjt:  ELVMKMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNDAIMKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDI

Query:  MQDIREEYGVNMSYDKTWCSSEEALRLIRGDPASSHARKLPVTALLDHIRGVL
        +QD+R+EYGVN+SYDK W S+EEALRLIRGDP +S+   LP    L  IR VL
Subjt:  MQDIREEYGVNMSYDKTWCSSEEALRLIRGDPASSHARKLPVTALLDHIRGVL

A0A6J1DTG5 uncharacterized protein LOC111023843; e value: 7.1e-88; % identity: 62.08
Query:  GHDITSLTPLGSNVVPCNLGDDRAYDWDVSGLWNGSENVDEDSDESYCLMTDTEEGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDSVHETANN
        GHD+  LTPLGS+VVPCNLGDDR  DWDV G+WN +E   ++S ESY  +  +EEG  + EYGNE   D LD + E +   +H T      ++V     N
Subjt:  GHDITSLTPLGSNVVPCNLGDDRAYDWDVSGLWNGSENVDEDSDESYCLMTDTEEGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDSVHETANN

Query:  RLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNDAI
         LTG    ++LQ +VQS+GTNDV EGDVFD+KKEL +KMHL+A+RKNFQF+VKKSTP+LY+LRC+   CTWRLRATK+++C LFKIKKY A H+ C    
Subjt:  RLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNDAI

Query:  MKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDIMQDIREEYGVNMSYDKTWCSSEEALRLIRGDPASSH
        +K DH QAKSWVVGHLV+ KFTDVSRTYR KDI+QD+R+EYGVN+SYD+ W SSEEALRLIRGDPASS+
Subjt:  MKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDIMQDIREEYGVNMSYDKTWCSSEEALRLIRGDPASSH

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGCCTCGTGTTTTTATATCATTCGGTGGAGAATGGAAAGATATTGAAAAGGATTACATGGGTGGTCGTACAAGAGGATTGACTGTGGATAGTAAAATCACCTATGCTGA
ATTTCTAGGACATGTATGTAGGCTAAGTAGTATAAATCCATTACAGGAAGATATCATAATTAGACGTGTATATAATTTTAAGGCGAAGGTTTGTGTAGTGGAAATAACTG
ACGACGATGACTTGACTTTCTTCTTCACTAGTGAAGATATCTCTGAATTGCCGCTATACATATCTACCGTGCCAAAGAAGGTACATCAGAATGAACCGTACATGCCTTCT
TTCCCATATTATTTAGGCCAACACGTGTCCAATGTTCCTATTCCCTCAGCTTGTGCCCCCCCATTTGCAAAACCCTTATTTCCGAGACCCTCATTTTCAAGTCCGTCAGT
TCCGTCCTCGTCGTCGAACCCCTCTTCTTCTCGCCCACCACCCCCCTACTTTGGTCATATTGGTCATGATATAACATCTCTCACACCGTTAGGGTCAAATGTTGTTCCTT
GTAATTTGGGAGATGATAGGGCATATGATTGGGATGTGTCTGGCTTGTGGAATGGAAGTGAAAATGTGGATGAAGATAGTGATGAATCATATTGTCTAATGACCGACACG
GAAGAAGGAGACGACGAAAGGGAATATGGAAATGAGCACGCTAGTGATCGACTTGATGTGCAACATGAGCATGAAGAGGTAACAATTCATAATACAATGGCTGAATATCC
TGTAGATTCCGTCCATGAAACGGCAAACAATAGACTCACCGGTCAGTCAGAAGCTGATAGATTGCAAGCCATGGTCCAATCGGCTGGGACCAATGATGTTAAGGAGGGTG
ATGTATTCGACTCGAAGAAGGAACTAGTTATGAAAATGCATTTACTTGCATTGCGGAAGAACTTTCAGTTTCGAGTGAAGAAGTCTACGCCGGAACTATACTTGCTGCGA
TGCATCGATCCTACTTGCACGTGGCGACTTAGAGCCACTAAGATTAGAGATTGCAACCTGTTTAAGATAAAGAAATATATCGCAGTCCACTCTAAGTGCAATGATGCCAT
TATGAAACAGGATCATCTTCAGGCGAAAAGTTGGGTGGTCGGTCATCTAGTACGATCAAAGTTTACTGATGTTTCTCGCACGTACAGGCTGAAGGACATCATGCAAGATA
TTCGTGAGGAGTACGGTGTAAATATGAGTTACGACAAGACCTGGTGTTCGAGCGAAGAAGCACTCCGACTTATCAGAGGGGATCCAGCTTCATCGCATGCACGTAAGTTG
CCAGTCACCGCATTACTTGATCATATCAGAGGTGTGTTGCAGAGGTGGTTCTACGAACGTCGGACGCTTGCTTCTTCACGTCAGAGTACGTTGTCTGACTACGCAGAGAA
AATGATTGCCAAAGCTTCGGATAATGCACGGAGACACATTGTTATGAACATCAACCAGTTTAATTTTGAGAGTTCTCCGGGGTTTGTGAATATCGATGTTCAACCACAGA
AGAAGGTCGTTAGGGTTGGACGGCGACAGACGATGAGGATTCCTTCCACAGGCGAGGTCCGTCCACCGCGCAAGTGCAGTCGATGTGGTACGTCGGGGCACAATCGTTAA
mRNA sequence
ATGCCTCGTGTTTTTATATCATTCGGTGGAGAATGGAAAGATATTGAAAAGGATTACATGGGTGGTCGTACAAGAGGATTGACTGTGGATAGTAAAATCACCTATGCTGA
ATTTCTAGGACATGTATGTAGGCTAAGTAGTATAAATCCATTACAGGAAGATATCATAATTAGACGTGTATATAATTTTAAGGCGAAGGTTTGTGTAGTGGAAATAACTG
ACGACGATGACTTGACTTTCTTCTTCACTAGTGAAGATATCTCTGAATTGCCGCTATACATATCTACCGTGCCAAAGAAGGTACATCAGAATGAACCGTACATGCCTTCT
TTCCCATATTATTTAGGCCAACACGTGTCCAATGTTCCTATTCCCTCAGCTTGTGCCCCCCCATTTGCAAAACCCTTATTTCCGAGACCCTCATTTTCAAGTCCGTCAGT
TCCGTCCTCGTCGTCGAACCCCTCTTCTTCTCGCCCACCACCCCCCTACTTTGGTCATATTGGTCATGATATAACATCTCTCACACCGTTAGGGTCAAATGTTGTTCCTT
GTAATTTGGGAGATGATAGGGCATATGATTGGGATGTGTCTGGCTTGTGGAATGGAAGTGAAAATGTGGATGAAGATAGTGATGAATCATATTGTCTAATGACCGACACG
GAAGAAGGAGACGACGAAAGGGAATATGGAAATGAGCACGCTAGTGATCGACTTGATGTGCAACATGAGCATGAAGAGGTAACAATTCATAATACAATGGCTGAATATCC
TGTAGATTCCGTCCATGAAACGGCAAACAATAGACTCACCGGTCAGTCAGAAGCTGATAGATTGCAAGCCATGGTCCAATCGGCTGGGACCAATGATGTTAAGGAGGGTG
ATGTATTCGACTCGAAGAAGGAACTAGTTATGAAAATGCATTTACTTGCATTGCGGAAGAACTTTCAGTTTCGAGTGAAGAAGTCTACGCCGGAACTATACTTGCTGCGA
TGCATCGATCCTACTTGCACGTGGCGACTTAGAGCCACTAAGATTAGAGATTGCAACCTGTTTAAGATAAAGAAATATATCGCAGTCCACTCTAAGTGCAATGATGCCAT
TATGAAACAGGATCATCTTCAGGCGAAAAGTTGGGTGGTCGGTCATCTAGTACGATCAAAGTTTACTGATGTTTCTCGCACGTACAGGCTGAAGGACATCATGCAAGATA
TTCGTGAGGAGTACGGTGTAAATATGAGTTACGACAAGACCTGGTGTTCGAGCGAAGAAGCACTCCGACTTATCAGAGGGGATCCAGCTTCATCGCATGCACGTAAGTTG
CCAGTCACCGCATTACTTGATCATATCAGAGGTGTGTTGCAGAGGTGGTTCTACGAACGTCGGACGCTTGCTTCTTCACGTCAGAGTACGTTGTCTGACTACGCAGAGAA
AATGATTGCCAAAGCTTCGGATAATGCACGGAGACACATTGTTATGAACATCAACCAGTTTAATTTTGAGAGTTCTCCGGGGTTTGTGAATATCGATGTTCAACCACAGA
AGAAGGTCGTTAGGGTTGGACGGCGACAGACGATGAGGATTCCTTCCACAGGCGAGGTCCGTCCACCGCGCAAGTGCAGTCGATGTGGTACGTCGGGGCACAATCGTTAA
Protein sequence
MPRVFISFGGEWKDIEKDYMGGRTRGLTVDSKITYAEFLGHVCRLSSINPLQEDIIIRRVYNFKAKVCVVEITDDDDLTFFFTSEDISELPLYISTVPKKVHQNEPYMPS
FPYYLGQHVSNVPIPSACAPPFAKPLFPRPSFSSPSVPSSSSNPSSSRPPPPYFGHIGHDITSLTPLGSNVVPCNLGDDRAYDWDVSGLWNGSENVDEDSDESYCLMTDT
EEGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDSVHETANNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPELYLLR
CIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNDAIMKQDHLQAKSWVVGHLVRSKFTDVSRTYRLKDIMQDIREEYGVNMSYDKTWCSSEEALRLIRGDPASSHARKL
PVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEKMIAKASDNARRHIVMNINQFNFESSPGFVNIDVQPQKKVVRVGRRQTMRIPSTGEVRPPRKCSRCGTSGHNR
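
The protein sequence above can be checked against the CDS by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code; the helper name `translate` and the short test fragments (the first 30 and last 15 nucleotides of the CDS listed above) are illustrative choices, not part of the database record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence pasted as
    several lines can be passed in directly. Codons containing characters
    other than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the coding region
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS sequence listed above.
cds_start = "ATGCCTCGTGTTTTTATATCATTCGGTGGA"
cds_end = "GGGCACAATCGTTAA"

print(translate(cds_start))  # MPRVFISFGG
print(translate(cds_end))    # GHNR
```

Passing the complete CDS, with its line breaks, to `translate` should give a string that can be compared directly with the protein sequence.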