; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007824 (gene) of Snake gourd v1 genome

Gene IDTan0007824
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPlant transposase
Genome locationLG07:21395622..21513305
RNA-Seq ExpressionTan0007824
SyntenyTan0007824
Gene Ontology termsNA
InterPro domainsIPR004252 - Probable transposase, Ptta/En/Spm, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060466.1 Plant transposase [Cucumis melo var. makuwa]1.1e-8453.07Show/hide
Query:  MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPV-GEESLNDVDNCDTTHMT------------
        M+ KL D    +C+ FD  APR+RRSKRLK+ SV LA+ ED  DG +   EGD++T+KL VDQSQD  PV G E  N VDN D+TH T            
Subjt:  MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPV-GEESLNDVDNCDTTHMT------------

Query:  ---------------------------------------QRGATKMKAIAVEEHRKVDITFNEYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTR
                                                RG TKMK IA+EE  KVDITF+++G+PIGE S+G+SSFLG+LVRE+V VTL DWRKLSTR
Subjt:  ---------------------------------------QRGATKMKAIAVEEHRKVDITFNEYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTR

Query:  LKEILWTSIQI--------GKESIFFK--RWVDYGK------LKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCSRK
         KEILWTSIQ+         ++ IF K  R    GK      ++  S +EELVK+KP+NIQSMHDWMDFVKEKKSA FKAKSEKFKSMK  QLPHTCSRK
Subjt:  LKEILWTSIQI--------GKESIFFK--RWVDYGK------LKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCSRK

Query:  GYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSL----LASRRRRSNRV
        GYARLAEEM+KSC DSSSVTR+AL AK HRKKD NPVNSQVAETL   ++  + +    +     A ++RRS R+
Subjt:  GYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSL----LASRRRRSNRV

XP_016901232.1 PREDICTED: uncharacterized protein LOC103493280 isoform X1 [Cucumis melo]3.2e-8452.8Show/hide
Query:  MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPV-GEESLNDVDNCDTTHMT------------
        M+ KL D    +C+ FD  APR+RRSKRLK+ SV LA+ ED  DG +   EGD++T+KL VDQSQD  PV G E  N VDN D+TH T            
Subjt:  MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPV-GEESLNDVDNCDTTHMT------------

Query:  ---------------------------------------QRGATKMKAIAVEEHRKVDITFNEYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTR
                                                RG TKMK IA+EE  KVDITF+++G+PIGE S+G+SSFLG+LVRE+V VTL DWRKLSTR
Subjt:  ---------------------------------------QRGATKMKAIAVEEHRKVDITFNEYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTR

Query:  LKEILWTSIQI--------GKESIFFK--RWVDYGK------LKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCSRK
         KEILWTSIQ+         ++ IF K  R    GK      ++  S +EELVK+KP+NIQSMHDWMDFVKEKKSA FKAKSEKFKSMK  QLPHTCSRK
Subjt:  LKEILWTSIQI--------GKESIFFK--RWVDYGK------LKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCSRK

Query:  GYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSL----LASRRRRSNRV
        GYARL EEM+KSC DSSSVTR+AL AK HRKKD NPVNSQVAETL   ++  + +    +     A ++RRS R+
Subjt:  GYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSL----LASRRRRSNRV

XP_016901236.1 PREDICTED: uncharacterized protein LOC103493280 isoform X3 [Cucumis melo]3.2e-8452.8Show/hide
Query:  MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPV-GEESLNDVDNCDTTHMT------------
        M+ KL D    +C+ FD  APR+RRSKRLK+ SV LA+ ED  DG +   EGD++T+KL VDQSQD  PV G E  N VDN D+TH T            
Subjt:  MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPV-GEESLNDVDNCDTTHMT------------

Query:  ---------------------------------------QRGATKMKAIAVEEHRKVDITFNEYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTR
                                                RG TKMK IA+EE  KVDITF+++G+PIGE S+G+SSFLG+LVRE+V VTL DWRKLSTR
Subjt:  ---------------------------------------QRGATKMKAIAVEEHRKVDITFNEYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTR

Query:  LKEILWTSIQI--------GKESIFFK--RWVDYGK------LKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCSRK
         KEILWTSIQ+         ++ IF K  R    GK      ++  S +EELVK+KP+NIQSMHDWMDFVKEKKSA FKAKSEKFKSMK  QLPHTCSRK
Subjt:  LKEILWTSIQI--------GKESIFFK--RWVDYGK------LKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCSRK

Query:  GYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSL----LASRRRRSNRV
        GYARL EEM+KSC DSSSVTR+AL AK HRKKD NPVNSQVAETL   ++  + +    +     A ++RRS R+
Subjt:  GYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSL----LASRRRRSNRV

XP_016901238.1 PREDICTED: uncharacterized protein LOC103493280 isoform X5 [Cucumis melo]3.2e-8452.8Show/hide
Query:  MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPV-GEESLNDVDNCDTTHMT------------
        M+ KL D    +C+ FD  APR+RRSKRLK+ SV LA+ ED  DG +   EGD++T+KL VDQSQD  PV G E  N VDN D+TH T            
Subjt:  MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPV-GEESLNDVDNCDTTHMT------------

Query:  ---------------------------------------QRGATKMKAIAVEEHRKVDITFNEYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTR
                                                RG TKMK IA+EE  KVDITF+++G+PIGE S+G+SSFLG+LVRE+V VTL DWRKLSTR
Subjt:  ---------------------------------------QRGATKMKAIAVEEHRKVDITFNEYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTR

Query:  LKEILWTSIQI--------GKESIFFK--RWVDYGK------LKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCSRK
         KEILWTSIQ+         ++ IF K  R    GK      ++  S +EELVK+KP+NIQSMHDWMDFVKEKKSA FKAKSEKFKSMK  QLPHTCSRK
Subjt:  LKEILWTSIQI--------GKESIFFK--RWVDYGK------LKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCSRK

Query:  GYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSL----LASRRRRSNRV
        GYARL EEM+KSC DSSSVTR+AL AK HRKKD NPVNSQVAETL   ++  + +    +     A ++RRS R+
Subjt:  GYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSL----LASRRRRSNRV

XP_016901239.1 PREDICTED: uncharacterized protein LOC103493280 isoform X6 [Cucumis melo]3.2e-8452.8Show/hide
Query:  MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPV-GEESLNDVDNCDTTHMT------------
        M+ KL D    +C+ FD  APR+RRSKRLK+ SV LA+ ED  DG +   EGD++T+KL VDQSQD  PV G E  N VDN D+TH T            
Subjt:  MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPV-GEESLNDVDNCDTTHMT------------

Query:  ---------------------------------------QRGATKMKAIAVEEHRKVDITFNEYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTR
                                                RG TKMK IA+EE  KVDITF+++G+PIGE S+G+SSFLG+LVRE+V VTL DWRKLSTR
Subjt:  ---------------------------------------QRGATKMKAIAVEEHRKVDITFNEYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTR

Query:  LKEILWTSIQI--------GKESIFFK--RWVDYGK------LKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCSRK
         KEILWTSIQ+         ++ IF K  R    GK      ++  S +EELVK+KP+NIQSMHDWMDFVKEKKSA FKAKSEKFKSMK  QLPHTCSRK
Subjt:  LKEILWTSIQI--------GKESIFFK--RWVDYGK------LKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCSRK

Query:  GYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSL----LASRRRRSNRV
        GYARL EEM+KSC DSSSVTR+AL AK HRKKD NPVNSQVAETL   ++  + +    +     A ++RRS R+
Subjt:  GYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSL----LASRRRRSNRV

TrEMBL top hitse value%identityAlignment
A0A1S4DZ18 uncharacterized protein LOC103493280 isoform X31.6e-8452.8Show/hide
Query:  MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPV-GEESLNDVDNCDTTHMT------------
        M+ KL D    +C+ FD  APR+RRSKRLK+ SV LA+ ED  DG +   EGD++T+KL VDQSQD  PV G E  N VDN D+TH T            
Subjt:  MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPV-GEESLNDVDNCDTTHMT------------

Query:  ---------------------------------------QRGATKMKAIAVEEHRKVDITFNEYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTR
                                                RG TKMK IA+EE  KVDITF+++G+PIGE S+G+SSFLG+LVRE+V VTL DWRKLSTR
Subjt:  ---------------------------------------QRGATKMKAIAVEEHRKVDITFNEYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTR

Query:  LKEILWTSIQI--------GKESIFFK--RWVDYGK------LKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCSRK
         KEILWTSIQ+         ++ IF K  R    GK      ++  S +EELVK+KP+NIQSMHDWMDFVKEKKSA FKAKSEKFKSMK  QLPHTCSRK
Subjt:  LKEILWTSIQI--------GKESIFFK--RWVDYGK------LKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCSRK

Query:  GYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSL----LASRRRRSNRV
        GYARL EEM+KSC DSSSVTR+AL AK HRKKD NPVNSQVAETL   ++  + +    +     A ++RRS R+
Subjt:  GYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSL----LASRRRRSNRV

A0A1S4DZ32 uncharacterized protein LOC103493280 isoform X61.6e-8452.8Show/hide
Query:  MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPV-GEESLNDVDNCDTTHMT------------
        M+ KL D    +C+ FD  APR+RRSKRLK+ SV LA+ ED  DG +   EGD++T+KL VDQSQD  PV G E  N VDN D+TH T            
Subjt:  MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPV-GEESLNDVDNCDTTHMT------------

Query:  ---------------------------------------QRGATKMKAIAVEEHRKVDITFNEYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTR
                                                RG TKMK IA+EE  KVDITF+++G+PIGE S+G+SSFLG+LVRE+V VTL DWRKLSTR
Subjt:  ---------------------------------------QRGATKMKAIAVEEHRKVDITFNEYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTR

Query:  LKEILWTSIQI--------GKESIFFK--RWVDYGK------LKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCSRK
         KEILWTSIQ+         ++ IF K  R    GK      ++  S +EELVK+KP+NIQSMHDWMDFVKEKKSA FKAKSEKFKSMK  QLPHTCSRK
Subjt:  LKEILWTSIQI--------GKESIFFK--RWVDYGK------LKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCSRK

Query:  GYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSL----LASRRRRSNRV
        GYARL EEM+KSC DSSSVTR+AL AK HRKKD NPVNSQVAETL   ++  + +    +     A ++RRS R+
Subjt:  GYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSL----LASRRRRSNRV

A0A1S4DZ36 uncharacterized protein LOC103493280 isoform X11.6e-8452.8Show/hide
Query:  MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPV-GEESLNDVDNCDTTHMT------------
        M+ KL D    +C+ FD  APR+RRSKRLK+ SV LA+ ED  DG +   EGD++T+KL VDQSQD  PV G E  N VDN D+TH T            
Subjt:  MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPV-GEESLNDVDNCDTTHMT------------

Query:  ---------------------------------------QRGATKMKAIAVEEHRKVDITFNEYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTR
                                                RG TKMK IA+EE  KVDITF+++G+PIGE S+G+SSFLG+LVRE+V VTL DWRKLSTR
Subjt:  ---------------------------------------QRGATKMKAIAVEEHRKVDITFNEYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTR

Query:  LKEILWTSIQI--------GKESIFFK--RWVDYGK------LKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCSRK
         KEILWTSIQ+         ++ IF K  R    GK      ++  S +EELVK+KP+NIQSMHDWMDFVKEKKSA FKAKSEKFKSMK  QLPHTCSRK
Subjt:  LKEILWTSIQI--------GKESIFFK--RWVDYGK------LKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCSRK

Query:  GYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSL----LASRRRRSNRV
        GYARL EEM+KSC DSSSVTR+AL AK HRKKD NPVNSQVAETL   ++  + +    +     A ++RRS R+
Subjt:  GYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSL----LASRRRRSNRV

A0A1S4DZ41 uncharacterized protein LOC103493280 isoform X51.6e-8452.8Show/hide
Query:  MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPV-GEESLNDVDNCDTTHMT------------
        M+ KL D    +C+ FD  APR+RRSKRLK+ SV LA+ ED  DG +   EGD++T+KL VDQSQD  PV G E  N VDN D+TH T            
Subjt:  MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPV-GEESLNDVDNCDTTHMT------------

Query:  ---------------------------------------QRGATKMKAIAVEEHRKVDITFNEYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTR
                                                RG TKMK IA+EE  KVDITF+++G+PIGE S+G+SSFLG+LVRE+V VTL DWRKLSTR
Subjt:  ---------------------------------------QRGATKMKAIAVEEHRKVDITFNEYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTR

Query:  LKEILWTSIQI--------GKESIFFK--RWVDYGK------LKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCSRK
         KEILWTSIQ+         ++ IF K  R    GK      ++  S +EELVK+KP+NIQSMHDWMDFVKEKKSA FKAKSEKFKSMK  QLPHTCSRK
Subjt:  LKEILWTSIQI--------GKESIFFK--RWVDYGK------LKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCSRK

Query:  GYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSL----LASRRRRSNRV
        GYARL EEM+KSC DSSSVTR+AL AK HRKKD NPVNSQVAETL   ++  + +    +     A ++RRS R+
Subjt:  GYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSL----LASRRRRSNRV

A0A5D3D4T6 Plant transposase5.4e-8553.07Show/hide
Query:  MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPV-GEESLNDVDNCDTTHMT------------
        M+ KL D    +C+ FD  APR+RRSKRLK+ SV LA+ ED  DG +   EGD++T+KL VDQSQD  PV G E  N VDN D+TH T            
Subjt:  MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPV-GEESLNDVDNCDTTHMT------------

Query:  ---------------------------------------QRGATKMKAIAVEEHRKVDITFNEYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTR
                                                RG TKMK IA+EE  KVDITF+++G+PIGE S+G+SSFLG+LVRE+V VTL DWRKLSTR
Subjt:  ---------------------------------------QRGATKMKAIAVEEHRKVDITFNEYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTR

Query:  LKEILWTSIQI--------GKESIFFK--RWVDYGK------LKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCSRK
         KEILWTSIQ+         ++ IF K  R    GK      ++  S +EELVK+KP+NIQSMHDWMDFVKEKKSA FKAKSEKFKSMK  QLPHTCSRK
Subjt:  LKEILWTSIQI--------GKESIFFK--RWVDYGK------LKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSMKNKQLPHTCSRK

Query:  GYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSL----LASRRRRSNRV
        GYARLAEEM+KSC DSSSVTR+AL AK HRKKD NPVNSQVAETL   ++  + +    +     A ++RRS R+
Subjt:  GYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSL----LASRRRRSNRV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAGAAACTGAAAGATACTAATACCAATAAATGTCTTAGCTTCGATTCAAATGCTCCACGAAAACGACGGTCTAAGCGATTGAAAAACTTATCAGTGGGCCTAGC
AAGTAAAGAAGATGTTGGTGATGGAACAATGTGTGATAAGGAAGGAGATCACGTTACTGATAAGTTGTGTGTTGACCAATCTCAAGATTACTCACCAGTTGGAGAAGAGT
CATTGAATGATGTAGATAATTGTGATACTACACACATGACTCAAAGAGGAGCTACAAAAATGAAAGCTATTGCAGTTGAGGAACATAGAAAAGTAGATATAACATTCAAT
GAGTATGGAAAACCAATTGGAGAGGATTCAGTTGGGATGTCTTCATTTTTGGGTTCACTCGTGAGAGAGGTAGTGTCTGTGACTTTACAAGATTGGAGGAAATTGTCTAC
ACGATTGAAGGAAATTTTATGGACTTCAATTCAAATTGGAAAAGAAAGTATATTTTTCAAAAGATGGGTAGATTATGGAAAATTAAAATATGCCTCCAATGATGAGGAGC
TTGTTAAATTGAAGCCAACCAATATACAATCTATGCATGATTGGATGGACTTTGTGAAAGAAAAGAAGAGTGCAAGGTTCAAGGCAAAAAGTGAAAAATTCAAATCCATG
AAGAATAAGCAACTTCCACATACATGTAGCCGTAAAGGTTATGCTCGATTGGCCGAAGAAATGAAAAAGAGTTGTTCAGATTCATCTTCGGTGACAAGGGTTGCGTTATG
GGCAAAAGTACATAGGAAAAAGGATGGGAATCCTGTTAACTCACAAGTTGCAGAAACATTGACGCAATTCAGGCAGTCGGAAGAAGCCTTGCAATGGTTGAGTCTTCTGG
CGAGTCGGCGACGTCGTTCTAACAGAGTTGTGACGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAGAAACTGAAAGATACTAATACCAATAAATGTCTTAGCTTCGATTCAAATGCTCCACGAAAACGACGGTCTAAGCGATTGAAAAACTTATCAGTGGGCCTAGC
AAGTAAAGAAGATGTTGGTGATGGAACAATGTGTGATAAGGAAGGAGATCACGTTACTGATAAGTTGTGTGTTGACCAATCTCAAGATTACTCACCAGTTGGAGAAGAGT
CATTGAATGATGTAGATAATTGTGATACTACACACATGACTCAAAGAGGAGCTACAAAAATGAAAGCTATTGCAGTTGAGGAACATAGAAAAGTAGATATAACATTCAAT
GAGTATGGAAAACCAATTGGAGAGGATTCAGTTGGGATGTCTTCATTTTTGGGTTCACTCGTGAGAGAGGTAGTGTCTGTGACTTTACAAGATTGGAGGAAATTGTCTAC
ACGATTGAAGGAAATTTTATGGACTTCAATTCAAATTGGAAAAGAAAGTATATTTTTCAAAAGATGGGTAGATTATGGAAAATTAAAATATGCCTCCAATGATGAGGAGC
TTGTTAAATTGAAGCCAACCAATATACAATCTATGCATGATTGGATGGACTTTGTGAAAGAAAAGAAGAGTGCAAGGTTCAAGGCAAAAAGTGAAAAATTCAAATCCATG
AAGAATAAGCAACTTCCACATACATGTAGCCGTAAAGGTTATGCTCGATTGGCCGAAGAAATGAAAAAGAGTTGTTCAGATTCATCTTCGGTGACAAGGGTTGCGTTATG
GGCAAAAGTACATAGGAAAAAGGATGGGAATCCTGTTAACTCACAAGTTGCAGAAACATTGACGCAATTCAGGCAGTCGGAAGAAGCCTTGCAATGGTTGAGTCTTCTGG
CGAGTCGGCGACGTCGTTCTAACAGAGTTGTGACGTGA
Protein sequenceShow/hide protein sequence
MKKKLKDTNTNKCLSFDSNAPRKRRSKRLKNLSVGLASKEDVGDGTMCDKEGDHVTDKLCVDQSQDYSPVGEESLNDVDNCDTTHMTQRGATKMKAIAVEEHRKVDITFN
EYGKPIGEDSVGMSSFLGSLVREVVSVTLQDWRKLSTRLKEILWTSIQIGKESIFFKRWVDYGKLKYASNDEELVKLKPTNIQSMHDWMDFVKEKKSARFKAKSEKFKSM
KNKQLPHTCSRKGYARLAEEMKKSCSDSSSVTRVALWAKVHRKKDGNPVNSQVAETLTQFRQSEEALQWLSLLASRRRRSNRVVT