; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005708 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005708
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPlant transposase
Genome locationscaffold8:20586748..20589608
RNA-Seq ExpressionSpg005708
SyntenySpg005708
Gene Ontology termsNA
InterPro domainsIPR004252 - Probable transposase, Ptta/En/Spm, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060466.1 Plant transposase [Cucumis melo var. makuwa]3.3e-10864.66Show/hide
Query:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERD--------------------ESPNIRENLDCTHTTPRSPLPSDSPAS
        M+ K+ +  AKRC+ FDP APR+RRSKRLKS SV + T E+G +G +   E D                    E PNI +NLD THTTP+SPLPSDS AS
Subjt:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERD--------------------ESPNIRENLDCTHTTPRSPLPSDSPAS

Query:  RTRGALRKLASRCQASPIVDRSQDVGENTNVSEPIMQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKL
            AL KLASR Q SPI+DRSQ+VGE  NVSEP MQQ+ KK RGPTKMK IAI   NKVDITF+++GQPI EASIG++SFLG LVRE+VPVTL+DWRKL
Subjt:  RTRGALRKLASRCQASPIVDRSQDVGENTNVSEPIMQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKL

Query:  STRFKEILWTSIQV----------------------TGKSRIVSQIQNASNDEEVLKMKPANIQSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTC
        STR KEILWTSIQ+                       GKSRIVSQIQ+ S +EE++KMKP+NIQS HDW+DFVKEK SA FKA+SEKFKSMKK QLPHTC
Subjt:  STRFKEILWTSIQV----------------------TGKSRIVSQIQNASNDEEVLKMKPANIQSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTC

Query:  SRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETL
        SRKGYARLAEEM+KSC DSSSVTR+AL AKAHRKKD NP+NSQVAETL
Subjt:  SRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETL

XP_016901232.1 PREDICTED: uncharacterized protein LOC103493280 isoform X1 [Cucumis melo]9.6e-10864.37Show/hide
Query:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERD--------------------ESPNIRENLDCTHTTPRSPLPSDSPAS
        M+ K+ +  AKRC+ FDP APR+RRSKRLKS SV + T E+G +G +   E D                    E PNI +NLD THTTP+SPLPSDS AS
Subjt:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERD--------------------ESPNIRENLDCTHTTPRSPLPSDSPAS

Query:  RTRGALRKLASRCQASPIVDRSQDVGENTNVSEPIMQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKL
            AL KLASR Q SPI+DRSQ+VGE  NVSEP MQQ+ KK RGPTKMK IAI   NKVDITF+++GQPI EASIG++SFLG LVRE+VPVTL+DWRKL
Subjt:  RTRGALRKLASRCQASPIVDRSQDVGENTNVSEPIMQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKL

Query:  STRFKEILWTSIQV----------------------TGKSRIVSQIQNASNDEEVLKMKPANIQSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTC
        STR KEILWTSIQ+                       GKSRIVSQIQ+ S +EE++KMKP+NIQS HDW+DFVKEK SA FKA+SEKFKSMKK QLPHTC
Subjt:  STRFKEILWTSIQV----------------------TGKSRIVSQIQNASNDEEVLKMKPANIQSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTC

Query:  SRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETL
        SRKGYARL EEM+KSC DSSSVTR+AL AKAHRKKD NP+NSQVAETL
Subjt:  SRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETL

XP_016901236.1 PREDICTED: uncharacterized protein LOC103493280 isoform X3 [Cucumis melo]9.6e-10864.37Show/hide
Query:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERD--------------------ESPNIRENLDCTHTTPRSPLPSDSPAS
        M+ K+ +  AKRC+ FDP APR+RRSKRLKS SV + T E+G +G +   E D                    E PNI +NLD THTTP+SPLPSDS AS
Subjt:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERD--------------------ESPNIRENLDCTHTTPRSPLPSDSPAS

Query:  RTRGALRKLASRCQASPIVDRSQDVGENTNVSEPIMQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKL
            AL KLASR Q SPI+DRSQ+VGE  NVSEP MQQ+ KK RGPTKMK IAI   NKVDITF+++GQPI EASIG++SFLG LVRE+VPVTL+DWRKL
Subjt:  RTRGALRKLASRCQASPIVDRSQDVGENTNVSEPIMQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKL

Query:  STRFKEILWTSIQV----------------------TGKSRIVSQIQNASNDEEVLKMKPANIQSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTC
        STR KEILWTSIQ+                       GKSRIVSQIQ+ S +EE++KMKP+NIQS HDW+DFVKEK SA FKA+SEKFKSMKK QLPHTC
Subjt:  STRFKEILWTSIQV----------------------TGKSRIVSQIQNASNDEEVLKMKPANIQSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTC

Query:  SRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETL
        SRKGYARL EEM+KSC DSSSVTR+AL AKAHRKKD NP+NSQVAETL
Subjt:  SRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETL

XP_016901238.1 PREDICTED: uncharacterized protein LOC103493280 isoform X5 [Cucumis melo]9.6e-10864.37Show/hide
Query:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERD--------------------ESPNIRENLDCTHTTPRSPLPSDSPAS
        M+ K+ +  AKRC+ FDP APR+RRSKRLKS SV + T E+G +G +   E D                    E PNI +NLD THTTP+SPLPSDS AS
Subjt:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERD--------------------ESPNIRENLDCTHTTPRSPLPSDSPAS

Query:  RTRGALRKLASRCQASPIVDRSQDVGENTNVSEPIMQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKL
            AL KLASR Q SPI+DRSQ+VGE  NVSEP MQQ+ KK RGPTKMK IAI   NKVDITF+++GQPI EASIG++SFLG LVRE+VPVTL+DWRKL
Subjt:  RTRGALRKLASRCQASPIVDRSQDVGENTNVSEPIMQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKL

Query:  STRFKEILWTSIQV----------------------TGKSRIVSQIQNASNDEEVLKMKPANIQSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTC
        STR KEILWTSIQ+                       GKSRIVSQIQ+ S +EE++KMKP+NIQS HDW+DFVKEK SA FKA+SEKFKSMKK QLPHTC
Subjt:  STRFKEILWTSIQV----------------------TGKSRIVSQIQNASNDEEVLKMKPANIQSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTC

Query:  SRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETL
        SRKGYARL EEM+KSC DSSSVTR+AL AKAHRKKD NP+NSQVAETL
Subjt:  SRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETL

XP_016901239.1 PREDICTED: uncharacterized protein LOC103493280 isoform X6 [Cucumis melo]9.6e-10864.37Show/hide
Query:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERD--------------------ESPNIRENLDCTHTTPRSPLPSDSPAS
        M+ K+ +  AKRC+ FDP APR+RRSKRLKS SV + T E+G +G +   E D                    E PNI +NLD THTTP+SPLPSDS AS
Subjt:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERD--------------------ESPNIRENLDCTHTTPRSPLPSDSPAS

Query:  RTRGALRKLASRCQASPIVDRSQDVGENTNVSEPIMQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKL
            AL KLASR Q SPI+DRSQ+VGE  NVSEP MQQ+ KK RGPTKMK IAI   NKVDITF+++GQPI EASIG++SFLG LVRE+VPVTL+DWRKL
Subjt:  RTRGALRKLASRCQASPIVDRSQDVGENTNVSEPIMQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKL

Query:  STRFKEILWTSIQV----------------------TGKSRIVSQIQNASNDEEVLKMKPANIQSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTC
        STR KEILWTSIQ+                       GKSRIVSQIQ+ S +EE++KMKP+NIQS HDW+DFVKEK SA FKA+SEKFKSMKK QLPHTC
Subjt:  STRFKEILWTSIQV----------------------TGKSRIVSQIQNASNDEEVLKMKPANIQSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTC

Query:  SRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETL
        SRKGYARL EEM+KSC DSSSVTR+AL AKAHRKKD NP+NSQVAETL
Subjt:  SRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETL

TrEMBL top hitse value%identityAlignment
A0A1S4DZ18 uncharacterized protein LOC103493280 isoform X34.6e-10864.37Show/hide
Query:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERD--------------------ESPNIRENLDCTHTTPRSPLPSDSPAS
        M+ K+ +  AKRC+ FDP APR+RRSKRLKS SV + T E+G +G +   E D                    E PNI +NLD THTTP+SPLPSDS AS
Subjt:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERD--------------------ESPNIRENLDCTHTTPRSPLPSDSPAS

Query:  RTRGALRKLASRCQASPIVDRSQDVGENTNVSEPIMQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKL
            AL KLASR Q SPI+DRSQ+VGE  NVSEP MQQ+ KK RGPTKMK IAI   NKVDITF+++GQPI EASIG++SFLG LVRE+VPVTL+DWRKL
Subjt:  RTRGALRKLASRCQASPIVDRSQDVGENTNVSEPIMQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKL

Query:  STRFKEILWTSIQV----------------------TGKSRIVSQIQNASNDEEVLKMKPANIQSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTC
        STR KEILWTSIQ+                       GKSRIVSQIQ+ S +EE++KMKP+NIQS HDW+DFVKEK SA FKA+SEKFKSMKK QLPHTC
Subjt:  STRFKEILWTSIQV----------------------TGKSRIVSQIQNASNDEEVLKMKPANIQSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTC

Query:  SRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETL
        SRKGYARL EEM+KSC DSSSVTR+AL AKAHRKKD NP+NSQVAETL
Subjt:  SRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETL

A0A1S4DZ32 uncharacterized protein LOC103493280 isoform X64.6e-10864.37Show/hide
Query:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERD--------------------ESPNIRENLDCTHTTPRSPLPSDSPAS
        M+ K+ +  AKRC+ FDP APR+RRSKRLKS SV + T E+G +G +   E D                    E PNI +NLD THTTP+SPLPSDS AS
Subjt:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERD--------------------ESPNIRENLDCTHTTPRSPLPSDSPAS

Query:  RTRGALRKLASRCQASPIVDRSQDVGENTNVSEPIMQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKL
            AL KLASR Q SPI+DRSQ+VGE  NVSEP MQQ+ KK RGPTKMK IAI   NKVDITF+++GQPI EASIG++SFLG LVRE+VPVTL+DWRKL
Subjt:  RTRGALRKLASRCQASPIVDRSQDVGENTNVSEPIMQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKL

Query:  STRFKEILWTSIQV----------------------TGKSRIVSQIQNASNDEEVLKMKPANIQSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTC
        STR KEILWTSIQ+                       GKSRIVSQIQ+ S +EE++KMKP+NIQS HDW+DFVKEK SA FKA+SEKFKSMKK QLPHTC
Subjt:  STRFKEILWTSIQV----------------------TGKSRIVSQIQNASNDEEVLKMKPANIQSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTC

Query:  SRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETL
        SRKGYARL EEM+KSC DSSSVTR+AL AKAHRKKD NP+NSQVAETL
Subjt:  SRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETL

A0A1S4DZ36 uncharacterized protein LOC103493280 isoform X14.6e-10864.37Show/hide
Query:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERD--------------------ESPNIRENLDCTHTTPRSPLPSDSPAS
        M+ K+ +  AKRC+ FDP APR+RRSKRLKS SV + T E+G +G +   E D                    E PNI +NLD THTTP+SPLPSDS AS
Subjt:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERD--------------------ESPNIRENLDCTHTTPRSPLPSDSPAS

Query:  RTRGALRKLASRCQASPIVDRSQDVGENTNVSEPIMQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKL
            AL KLASR Q SPI+DRSQ+VGE  NVSEP MQQ+ KK RGPTKMK IAI   NKVDITF+++GQPI EASIG++SFLG LVRE+VPVTL+DWRKL
Subjt:  RTRGALRKLASRCQASPIVDRSQDVGENTNVSEPIMQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKL

Query:  STRFKEILWTSIQV----------------------TGKSRIVSQIQNASNDEEVLKMKPANIQSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTC
        STR KEILWTSIQ+                       GKSRIVSQIQ+ S +EE++KMKP+NIQS HDW+DFVKEK SA FKA+SEKFKSMKK QLPHTC
Subjt:  STRFKEILWTSIQV----------------------TGKSRIVSQIQNASNDEEVLKMKPANIQSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTC

Query:  SRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETL
        SRKGYARL EEM+KSC DSSSVTR+AL AKAHRKKD NP+NSQVAETL
Subjt:  SRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETL

A0A1S4DZ41 uncharacterized protein LOC103493280 isoform X54.6e-10864.37Show/hide
Query:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERD--------------------ESPNIRENLDCTHTTPRSPLPSDSPAS
        M+ K+ +  AKRC+ FDP APR+RRSKRLKS SV + T E+G +G +   E D                    E PNI +NLD THTTP+SPLPSDS AS
Subjt:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERD--------------------ESPNIRENLDCTHTTPRSPLPSDSPAS

Query:  RTRGALRKLASRCQASPIVDRSQDVGENTNVSEPIMQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKL
            AL KLASR Q SPI+DRSQ+VGE  NVSEP MQQ+ KK RGPTKMK IAI   NKVDITF+++GQPI EASIG++SFLG LVRE+VPVTL+DWRKL
Subjt:  RTRGALRKLASRCQASPIVDRSQDVGENTNVSEPIMQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKL

Query:  STRFKEILWTSIQV----------------------TGKSRIVSQIQNASNDEEVLKMKPANIQSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTC
        STR KEILWTSIQ+                       GKSRIVSQIQ+ S +EE++KMKP+NIQS HDW+DFVKEK SA FKA+SEKFKSMKK QLPHTC
Subjt:  STRFKEILWTSIQV----------------------TGKSRIVSQIQNASNDEEVLKMKPANIQSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTC

Query:  SRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETL
        SRKGYARL EEM+KSC DSSSVTR+AL AKAHRKKD NP+NSQVAETL
Subjt:  SRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETL

A0A5D3D4T6 Plant transposase1.6e-10864.66Show/hide
Query:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERD--------------------ESPNIRENLDCTHTTPRSPLPSDSPAS
        M+ K+ +  AKRC+ FDP APR+RRSKRLKS SV + T E+G +G +   E D                    E PNI +NLD THTTP+SPLPSDS AS
Subjt:  MEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERD--------------------ESPNIRENLDCTHTTPRSPLPSDSPAS

Query:  RTRGALRKLASRCQASPIVDRSQDVGENTNVSEPIMQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKL
            AL KLASR Q SPI+DRSQ+VGE  NVSEP MQQ+ KK RGPTKMK IAI   NKVDITF+++GQPI EASIG++SFLG LVRE+VPVTL+DWRKL
Subjt:  RTRGALRKLASRCQASPIVDRSQDVGENTNVSEPIMQQVLKKRRGPTKMKTIAIG--NKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKL

Query:  STRFKEILWTSIQV----------------------TGKSRIVSQIQNASNDEEVLKMKPANIQSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTC
        STR KEILWTSIQ+                       GKSRIVSQIQ+ S +EE++KMKP+NIQS HDW+DFVKEK SA FKA+SEKFKSMKK QLPHTC
Subjt:  STRFKEILWTSIQV----------------------TGKSRIVSQIQNASNDEEVLKMKPANIQSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTC

Query:  SRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETL
        SRKGYARLAEEM+KSC DSSSVTR+AL AKAHRKKD NP+NSQVAETL
Subjt:  SRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GAAATGGAGAAGAAAATAAGAAATACTACTGCCAAAAGGTGTCTTGGGTTCGATCCAGATGCACCACGAAAGCGACGCTCTAAACGCTTGAAATCACCTTCAGTAGGCAT
AACAACAATGGAGGAGGGGGGTAATGGAACAATGGGTGAGAAGGAAAGAGATGAGTCACCTAATATTAGAGAGAATCTTGATTGTACACATACAACTCCAAGATCACCTT
TACCATCAGATTCGCCAGCATCTCGTACAAGAGGAGCTCTTCGAAAGTTAGCTTCTAGATGTCAAGCCTCACCAATTGTAGATAGGTCACAGGATGTAGGAGAAAATACC
AATGTTTCTGAACCAATTATGCAACAAGTCCTTAAGAAACGAAGAGGCCCTACAAAAATGAAAACCATTGCAATTGGTAATAAAGTAGATATAACCTTCAATGAGTATGG
ACAACCGATTGAGGAGGCTTCGATTGGCATGGCATCATTTTTAGGTCCACTTGTGAGAGAGGTGGTGCCTGTGACTTTAAATGATTGGAGAAAATTGTCAACAAGATTCA
AAGAAATTTTATGGACATCAATTCAAGTAACTGGAAAATCTCGAATTGTGTCACAAATTCAAAATGCCTCCAACGATGAGGAGGTTCTTAAAATGAAGCCAGCAAATATA
CAATCTACACACGATTGGATTGACTTTGTGAAAGAAAAGAACAGTGCAAGATTCAAGGCAAGAAGTGAAAAGTTCAAATCCATGAAGAAGAAGCAACTTCCACATACATG
TAGTCGTAAGGGTTATGCTCGATTGGCAGAAGAAATGAAAAAAAGTTGTTCAGATTCATCATCAGTGACAAGAGTCGCATTATGGGCAAAGGCACATAGGAAGAAGGATG
GAAATCCTATTAACTCACAAGTTGCAGAAACACTGGTATGGTTACAAAATCTCTATTTTTAG
mRNA sequenceShow/hide mRNA sequence
GAAATGGAGAAGAAAATAAGAAATACTACTGCCAAAAGGTGTCTTGGGTTCGATCCAGATGCACCACGAAAGCGACGCTCTAAACGCTTGAAATCACCTTCAGTAGGCAT
AACAACAATGGAGGAGGGGGGTAATGGAACAATGGGTGAGAAGGAAAGAGATGAGTCACCTAATATTAGAGAGAATCTTGATTGTACACATACAACTCCAAGATCACCTT
TACCATCAGATTCGCCAGCATCTCGTACAAGAGGAGCTCTTCGAAAGTTAGCTTCTAGATGTCAAGCCTCACCAATTGTAGATAGGTCACAGGATGTAGGAGAAAATACC
AATGTTTCTGAACCAATTATGCAACAAGTCCTTAAGAAACGAAGAGGCCCTACAAAAATGAAAACCATTGCAATTGGTAATAAAGTAGATATAACCTTCAATGAGTATGG
ACAACCGATTGAGGAGGCTTCGATTGGCATGGCATCATTTTTAGGTCCACTTGTGAGAGAGGTGGTGCCTGTGACTTTAAATGATTGGAGAAAATTGTCAACAAGATTCA
AAGAAATTTTATGGACATCAATTCAAGTAACTGGAAAATCTCGAATTGTGTCACAAATTCAAAATGCCTCCAACGATGAGGAGGTTCTTAAAATGAAGCCAGCAAATATA
CAATCTACACACGATTGGATTGACTTTGTGAAAGAAAAGAACAGTGCAAGATTCAAGGCAAGAAGTGAAAAGTTCAAATCCATGAAGAAGAAGCAACTTCCACATACATG
TAGTCGTAAGGGTTATGCTCGATTGGCAGAAGAAATGAAAAAAAGTTGTTCAGATTCATCATCAGTGACAAGAGTCGCATTATGGGCAAAGGCACATAGGAAGAAGGATG
GAAATCCTATTAACTCACAAGTTGCAGAAACACTGGTATGGTTACAAAATCTCTATTTTTAG
Protein sequenceShow/hide protein sequence
EMEKKIRNTTAKRCLGFDPDAPRKRRSKRLKSPSVGITTMEEGGNGTMGEKERDESPNIRENLDCTHTTPRSPLPSDSPASRTRGALRKLASRCQASPIVDRSQDVGENT
NVSEPIMQQVLKKRRGPTKMKTIAIGNKVDITFNEYGQPIEEASIGMASFLGPLVREVVPVTLNDWRKLSTRFKEILWTSIQVTGKSRIVSQIQNASNDEEVLKMKPANI
QSTHDWIDFVKEKNSARFKARSEKFKSMKKKQLPHTCSRKGYARLAEEMKKSCSDSSSVTRVALWAKAHRKKDGNPINSQVAETLVWLQNLYF