; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014490 (gene) of Snake gourd v1 genome

Gene IDTan0014490
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPlant transposase
Genome locationLG11:38851013..38852275
RNA-Seq ExpressionTan0014490
SyntenyTan0014490
Gene Ontology termsNA
InterPro domainsIPR004252 - Probable transposase, Ptta/En/Spm, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038904085.1 uncharacterized protein LOC120090469 isoform X1 [Benincasa hispida]5.6e-9257.43Show/hide
Query:  MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPV-GEESLNDVENCDTTHMT------------
        MKKK  DT + +CL  D    RKRRSKRLK+LS+GLA+ ED  DG M +KEGD++ +KL V QSQD   V G    N  +  D+TH T            
Subjt:  MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPV-GEESLNDVENCDTTHMT------------

Query:  --------QRLDSIGQTPLNVERSSYVEVNIDAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRE----------
                Q+L S GQ    ++RS  V  +++  E  ++   KKCRG T+M   A EE  KVDITFNE+GQPIGE S+G+SSF G LVRE          
Subjt:  --------QRLDSIGQTPLNVERSSYVEVNIDAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRE----------

Query:  ---------------SRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPH
                       SRY +KEDW+RKY+FQKMG LWR GKSRIVSQI++ S +EELVK+KP+NIQSMHDWMDFVKEKKSA FKAKSE+FKSMK KQLPH
Subjt:  ---------------SRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPH

Query:  TCNRKGYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETL
        TC+RKGYARLAEEMKKS S+SSSV RVALWAKAH+KK+GNPVNSQVAE L
Subjt:  TCNRKGYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETL

XP_038904087.1 uncharacterized protein LOC120090469 isoform X2 [Benincasa hispida]5.6e-9257.43Show/hide
Query:  MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPV-GEESLNDVENCDTTHMT------------
        MKKK  DT + +CL  D    RKRRSKRLK+LS+GLA+ ED  DG M +KEGD++ +KL V QSQD   V G    N  +  D+TH T            
Subjt:  MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPV-GEESLNDVENCDTTHMT------------

Query:  --------QRLDSIGQTPLNVERSSYVEVNIDAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRE----------
                Q+L S GQ    ++RS  V  +++  E  ++   KKCRG T+M   A EE  KVDITFNE+GQPIGE S+G+SSF G LVRE          
Subjt:  --------QRLDSIGQTPLNVERSSYVEVNIDAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRE----------

Query:  ---------------SRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPH
                       SRY +KEDW+RKY+FQKMG LWR GKSRIVSQI++ S +EELVK+KP+NIQSMHDWMDFVKEKKSA FKAKSE+FKSMK KQLPH
Subjt:  ---------------SRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPH

Query:  TCNRKGYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETL
        TC+RKGYARLAEEMKKS S+SSSV RVALWAKAH+KK+GNPVNSQVAE L
Subjt:  TCNRKGYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETL

XP_038904088.1 uncharacterized protein LOC120090469 isoform X3 [Benincasa hispida]5.6e-9257.43Show/hide
Query:  MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPV-GEESLNDVENCDTTHMT------------
        MKKK  DT + +CL  D    RKRRSKRLK+LS+GLA+ ED  DG M +KEGD++ +KL V QSQD   V G    N  +  D+TH T            
Subjt:  MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPV-GEESLNDVENCDTTHMT------------

Query:  --------QRLDSIGQTPLNVERSSYVEVNIDAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRE----------
                Q+L S GQ    ++RS  V  +++  E  ++   KKCRG T+M   A EE  KVDITFNE+GQPIGE S+G+SSF G LVRE          
Subjt:  --------QRLDSIGQTPLNVERSSYVEVNIDAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRE----------

Query:  ---------------SRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPH
                       SRY +KEDW+RKY+FQKMG LWR GKSRIVSQI++ S +EELVK+KP+NIQSMHDWMDFVKEKKSA FKAKSE+FKSMK KQLPH
Subjt:  ---------------SRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPH

Query:  TCNRKGYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETL
        TC+RKGYARLAEEMKKS S+SSSV RVALWAKAH+KK+GNPVNSQVAE L
Subjt:  TCNRKGYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETL

XP_038904089.1 uncharacterized protein LOC120090469 isoform X4 [Benincasa hispida]5.6e-9257.43Show/hide
Query:  MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPV-GEESLNDVENCDTTHMT------------
        MKKK  DT + +CL  D    RKRRSKRLK+LS+GLA+ ED  DG M +KEGD++ +KL V QSQD   V G    N  +  D+TH T            
Subjt:  MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPV-GEESLNDVENCDTTHMT------------

Query:  --------QRLDSIGQTPLNVERSSYVEVNIDAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRE----------
                Q+L S GQ    ++RS  V  +++  E  ++   KKCRG T+M   A EE  KVDITFNE+GQPIGE S+G+SSF G LVRE          
Subjt:  --------QRLDSIGQTPLNVERSSYVEVNIDAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRE----------

Query:  ---------------SRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPH
                       SRY +KEDW+RKY+FQKMG LWR GKSRIVSQI++ S +EELVK+KP+NIQSMHDWMDFVKEKKSA FKAKSE+FKSMK KQLPH
Subjt:  ---------------SRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPH

Query:  TCNRKGYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETL
        TC+RKGYARLAEEMKKS S+SSSV RVALWAKAH+KK+GNPVNSQVAE L
Subjt:  TCNRKGYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETL

XP_038904090.1 uncharacterized protein LOC120090469 isoform X5 [Benincasa hispida]5.6e-9257.43Show/hide
Query:  MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPV-GEESLNDVENCDTTHMT------------
        MKKK  DT + +CL  D    RKRRSKRLK+LS+GLA+ ED  DG M +KEGD++ +KL V QSQD   V G    N  +  D+TH T            
Subjt:  MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPV-GEESLNDVENCDTTHMT------------

Query:  --------QRLDSIGQTPLNVERSSYVEVNIDAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRE----------
                Q+L S GQ    ++RS  V  +++  E  ++   KKCRG T+M   A EE  KVDITFNE+GQPIGE S+G+SSF G LVRE          
Subjt:  --------QRLDSIGQTPLNVERSSYVEVNIDAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRE----------

Query:  ---------------SRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPH
                       SRY +KEDW+RKY+FQKMG LWR GKSRIVSQI++ S +EELVK+KP+NIQSMHDWMDFVKEKKSA FKAKSE+FKSMK KQLPH
Subjt:  ---------------SRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPH

Query:  TCNRKGYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETL
        TC+RKGYARLAEEMKKS S+SSSV RVALWAKAH+KK+GNPVNSQVAE L
Subjt:  TCNRKGYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETL

TrEMBL top hitse value%identityAlignment
A0A1S4DZ18 uncharacterized protein LOC103493280 isoform X38.7e-9156.81Show/hide
Query:  MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPV-GEESLNDVENCDTTHMTQ-----------
        M+ KL D    +C+ FD   PR+RRSKRLK+ SV LA+ ED  DG +   EGD++ +KL VDQSQD  PV G E  N V+N D+TH T            
Subjt:  MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPV-GEESLNDVENCDTTHMTQ-----------

Query:  ----RLDSIGQTPLNVERSSYVEVNIDAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRE---------------
            +L S GQ    ++RS  V   I+  E  ++   KK RG TKMK IA+EE  KVDITF+++GQPIGE S+G+SSF G+LVRE               
Subjt:  ----RLDSIGQTPLNVERSSYVEVNIDAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRE---------------

Query:  ----------SRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCNRK
                   RY +KEDW+RK IF+KMG LWR GKSRIVSQI++ S +EELVK+KP+NIQSMHDWMDFVKEKKSA FKAKSEKFKSMK  QLPHTC+RK
Subjt:  ----------SRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCNRK

Query:  GYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETL
        GYARL EEM+KS  +SSSV R+AL AKAHRKKD NPVNSQVAETL
Subjt:  GYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETL

A0A1S4DZ32 uncharacterized protein LOC103493280 isoform X68.7e-9156.81Show/hide
Query:  MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPV-GEESLNDVENCDTTHMTQ-----------
        M+ KL D    +C+ FD   PR+RRSKRLK+ SV LA+ ED  DG +   EGD++ +KL VDQSQD  PV G E  N V+N D+TH T            
Subjt:  MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPV-GEESLNDVENCDTTHMTQ-----------

Query:  ----RLDSIGQTPLNVERSSYVEVNIDAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRE---------------
            +L S GQ    ++RS  V   I+  E  ++   KK RG TKMK IA+EE  KVDITF+++GQPIGE S+G+SSF G+LVRE               
Subjt:  ----RLDSIGQTPLNVERSSYVEVNIDAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRE---------------

Query:  ----------SRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCNRK
                   RY +KEDW+RK IF+KMG LWR GKSRIVSQI++ S +EELVK+KP+NIQSMHDWMDFVKEKKSA FKAKSEKFKSMK  QLPHTC+RK
Subjt:  ----------SRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCNRK

Query:  GYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETL
        GYARL EEM+KS  +SSSV R+AL AKAHRKKD NPVNSQVAETL
Subjt:  GYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETL

A0A1S4DZ36 uncharacterized protein LOC103493280 isoform X18.7e-9156.81Show/hide
Query:  MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPV-GEESLNDVENCDTTHMTQ-----------
        M+ KL D    +C+ FD   PR+RRSKRLK+ SV LA+ ED  DG +   EGD++ +KL VDQSQD  PV G E  N V+N D+TH T            
Subjt:  MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPV-GEESLNDVENCDTTHMTQ-----------

Query:  ----RLDSIGQTPLNVERSSYVEVNIDAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRE---------------
            +L S GQ    ++RS  V   I+  E  ++   KK RG TKMK IA+EE  KVDITF+++GQPIGE S+G+SSF G+LVRE               
Subjt:  ----RLDSIGQTPLNVERSSYVEVNIDAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRE---------------

Query:  ----------SRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCNRK
                   RY +KEDW+RK IF+KMG LWR GKSRIVSQI++ S +EELVK+KP+NIQSMHDWMDFVKEKKSA FKAKSEKFKSMK  QLPHTC+RK
Subjt:  ----------SRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCNRK

Query:  GYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETL
        GYARL EEM+KS  +SSSV R+AL AKAHRKKD NPVNSQVAETL
Subjt:  GYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETL

A0A1S4DZ41 uncharacterized protein LOC103493280 isoform X58.7e-9156.81Show/hide
Query:  MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPV-GEESLNDVENCDTTHMTQ-----------
        M+ KL D    +C+ FD   PR+RRSKRLK+ SV LA+ ED  DG +   EGD++ +KL VDQSQD  PV G E  N V+N D+TH T            
Subjt:  MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPV-GEESLNDVENCDTTHMTQ-----------

Query:  ----RLDSIGQTPLNVERSSYVEVNIDAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRE---------------
            +L S GQ    ++RS  V   I+  E  ++   KK RG TKMK IA+EE  KVDITF+++GQPIGE S+G+SSF G+LVRE               
Subjt:  ----RLDSIGQTPLNVERSSYVEVNIDAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRE---------------

Query:  ----------SRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCNRK
                   RY +KEDW+RK IF+KMG LWR GKSRIVSQI++ S +EELVK+KP+NIQSMHDWMDFVKEKKSA FKAKSEKFKSMK  QLPHTC+RK
Subjt:  ----------SRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCNRK

Query:  GYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETL
        GYARL EEM+KS  +SSSV R+AL AKAHRKKD NPVNSQVAETL
Subjt:  GYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETL

A0A5D3D4T6 Plant transposase3.0e-9157.1Show/hide
Query:  MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPV-GEESLNDVENCDTTHMTQ-----------
        M+ KL D    +C+ FD   PR+RRSKRLK+ SV LA+ ED  DG +   EGD++ +KL VDQSQD  PV G E  N V+N D+TH T            
Subjt:  MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPV-GEESLNDVENCDTTHMTQ-----------

Query:  ----RLDSIGQTPLNVERSSYVEVNIDAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRE---------------
            +L S GQ    ++RS  V   I+  E  ++   KK RG TKMK IA+EE  KVDITF+++GQPIGE S+G+SSF G+LVRE               
Subjt:  ----RLDSIGQTPLNVERSSYVEVNIDAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRE---------------

Query:  ----------SRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCNRK
                   RY +KEDW+RK IF+KMG LWR GKSRIVSQI++ S +EELVK+KP+NIQSMHDWMDFVKEKKSA FKAKSEKFKSMK  QLPHTC+RK
Subjt:  ----------SRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCNRK

Query:  GYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETL
        GYARLAEEM+KS  +SSSV R+AL AKAHRKKD NPVNSQVAETL
Subjt:  GYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAGAAACTGAAAGATACTAATACCAGTAAATGTCTTATCTTCGATTCAAATGTTCCACGAAAACGACGGTCTAAGCGATTGAAAAACTTATCAGTGGGCTTAGC
AAGTAAAGAAGATGTTGGTGATGGAACAATGTGTGATAAGGAAGGAGATCACGTTAATGATAAGTTGTGTGTTGACCAATCTCAAGATTACTCACCAGTTGGAGAAGAGT
CATTGAATGATGTAGAGAATTGTGATACTACACACATGACTCAAAGGTTAGATTCTATCGGTCAAACTCCCCTAAACGTAGAAAGATCTTCATATGTAGAAGTCAATATT
GATGCATATGAACAAATTTTAAAACATCCCTCCAAGAAATGTAGAGGAGCTACAAAAATGAAAGCTATTGCAGTTGAGGAACATAGAAAAGTAGATATAACATTCAATGA
GTATGGACAACCGATTGGAGAGGATTCAGTTGGGATGTCTTCATTTTTTGGTTCACTCGTGAGAGAGTCAAGATATAAGATGAAGGAAGATTGGAAAAGAAAGTATATTT
TTCAAAAGATGGGTAGCTTATGGAGGACAGGTAAATCTCGAATTGTGTCACAAATTAAAAATGCCTCCAATGATGAGGAGCTTGTTAAATTGAAGCCAACCAATATACAA
TCTATGCATGATTGGATGGACTTTGTGAAAGAAAAGAAGAGTGCAAGGTTCAAGGCAAAAAGTGAAAAATTCAAATCCATGAAGAATAAACAACTTCCACATACATGTAA
CCGTAAGGGTTATGCTCGATTGGCCGAAGAAATGAAAAAGAGTTTTTCAAATTCATCTTCGGTGATAAGGGTTGCGTTATGGGCAAAAGCACATAGGAAAAAAGATGGGA
ATCCTGTTAACTCACAAGTTGCAGAAACATTGGTATGGTTACATAACCTATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAGAAACTGAAAGATACTAATACCAGTAAATGTCTTATCTTCGATTCAAATGTTCCACGAAAACGACGGTCTAAGCGATTGAAAAACTTATCAGTGGGCTTAGC
AAGTAAAGAAGATGTTGGTGATGGAACAATGTGTGATAAGGAAGGAGATCACGTTAATGATAAGTTGTGTGTTGACCAATCTCAAGATTACTCACCAGTTGGAGAAGAGT
CATTGAATGATGTAGAGAATTGTGATACTACACACATGACTCAAAGGTTAGATTCTATCGGTCAAACTCCCCTAAACGTAGAAAGATCTTCATATGTAGAAGTCAATATT
GATGCATATGAACAAATTTTAAAACATCCCTCCAAGAAATGTAGAGGAGCTACAAAAATGAAAGCTATTGCAGTTGAGGAACATAGAAAAGTAGATATAACATTCAATGA
GTATGGACAACCGATTGGAGAGGATTCAGTTGGGATGTCTTCATTTTTTGGTTCACTCGTGAGAGAGTCAAGATATAAGATGAAGGAAGATTGGAAAAGAAAGTATATTT
TTCAAAAGATGGGTAGCTTATGGAGGACAGGTAAATCTCGAATTGTGTCACAAATTAAAAATGCCTCCAATGATGAGGAGCTTGTTAAATTGAAGCCAACCAATATACAA
TCTATGCATGATTGGATGGACTTTGTGAAAGAAAAGAAGAGTGCAAGGTTCAAGGCAAAAAGTGAAAAATTCAAATCCATGAAGAATAAACAACTTCCACATACATGTAA
CCGTAAGGGTTATGCTCGATTGGCCGAAGAAATGAAAAAGAGTTTTTCAAATTCATCTTCGGTGATAAGGGTTGCGTTATGGGCAAAAGCACATAGGAAAAAAGATGGGA
ATCCTGTTAACTCACAAGTTGCAGAAACATTGGTATGGTTACATAACCTATAA
Protein sequenceShow/hide protein sequence
MKKKLKDTNTSKCLIFDSNVPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVNDKLCVDQSQDYSPVGEESLNDVENCDTTHMTQRLDSIGQTPLNVERSSYVEVNI
DAYEQILKHPSKKCRGATKMKAIAVEEHRKVDITFNEYGQPIGEDSVGMSSFFGSLVRESRYKMKEDWKRKYIFQKMGSLWRTGKSRIVSQIKNASNDEELVKLKPTNIQ
SMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCNRKGYARLAEEMKKSFSNSSSVIRVALWAKAHRKKDGNPVNSQVAETLVWLHNL