CuGenDBv2

Spg031159 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg031159
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: CCHC-type domain-containing protein
Genome location: scaffold8:28313340..28319781
RNA-Seq Expression: Spg031159
Synteny: Spg031159
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0005488 - binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0037581.1 reverse transcriptase [Cucumis melo var. makuwa], e-value 2.7e-26, identity 31.11%
Query:  PLLTLSVLRDNDAVVSAGRASLHRRTSKRWLYWACPT----QRRSVMLVEIELPVPDSLPTSAESSRSSSMGPNGPTDQKLQRYETNRLKLINQVSLHSL
        PL TLSVL DNDAVVSAG+A L +R   RW     PT    +  S  LVEIELPVPD LPTSAESSRS     N  T  +L  +E+  +++  +V L   
Subjt:  PLLTLSVLRDNDAVVSAGRASLHRRTSKRWLYWACPT----QRRSVMLVEIELPVPDSLPTSAESSRSSSMGPNGPTDQKLQRYETNRLKLINQVSLHSL

Query:  TVGHSTTDPQLHSSHCRIFLCPRISTNTTSQSFTCVRLTFQEGQWVSLEEDDVHWLHAIFRTKLAGGPGGGVTTWYQS----------------------
                                    T +   C+  +     W+ +  DDV WLHAIFR K AGGPGGGVTTWYQS                      
Subjt:  TVGHSTTDPQLHSSHCRIFLCPRISTNTTSQSFTCVRLTFQEGQWVSLEEDDVHWLHAIFRTKLAGGPGGGVTTWYQS----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------------EEKSAVEPS
                                                                                                   EEKSA+E S
Subjt:  -------------------------------------------------------------------------------------------EEKSAVEPS

Query:  RGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRT
        RG  T SG RGREQRRFT GVNVSG QDFKRRSGG+  RQMSSGSAYQRQS+RA SQ A+SVAR RTGQESVASES RT
Subjt:  RGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRT

TYK00845.1 zf-CCHC domain-containing protein [Cucumis melo var. makuwa], e-value 3.6e-26, identity 41.22%
Query:  PLLTLSVLRDNDAVVSAGRASLHRRTSKRWLYWACPTQRRSVMLVEIELPVPDSLPTSAESSRSSSMGPN-GPTDQKLQRYETNRLKLINQVSLHSLTVG
        PL TLSVLRDNDAV                               EIELPVPD+L TSAESS S+S        D     +   R K++        T  
Subjt:  PLLTLSVLRDNDAVVSAGRASLHRRTSKRWLYWACPTQRRSVMLVEIELPVPDSLPTSAESSRSSSMGPN-GPTDQKLQRYETNRLKLINQVSLHSLTVG

Query:  HSTTDPQLHSSHC------------RIFLCPRISTNTTSQSF--TCVRLTFQEGQWVSLEEDDV----HWLHAIFRTKLAGGPGGGVTTWYQ--------
        H+TT  Q   +              R    PR      S+ F  +   +  QE  W S  E         L    RT +       +  W          
Subjt:  HSTTDPQLHSSHC------------RIFLCPRISTNTTSQSF--TCVRLTFQEGQWVSLEEDDV----HWLHAIFRTKLAGGPGGGVTTWYQ--------

Query:  -------SEEKSAVEPSRGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRT
                EEKSA+E SRG  T SG RGREQRRFT GV+VSG QDFKRRSGG+  RQMSSGSAYQRQSQRA SQ  +SVAR RTGQESVASES RT
Subjt:  -------SEEKSAVEPSRGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRT

TYK03091.1 reverse transcriptase [Cucumis melo var. makuwa], e-value 2.7e-26, identity 31.11%
Query:  PLLTLSVLRDNDAVVSAGRASLHRRTSKRWLYWACPT----QRRSVMLVEIELPVPDSLPTSAESSRSSSMGPNGPTDQKLQRYETNRLKLINQVSLHSL
        PL TLSVL DNDAVVSAG+A L +R   RW     PT    +  S  LVEIELPVPD LPTSAESSRS     N  T  +L  +E+  +++  +V L   
Subjt:  PLLTLSVLRDNDAVVSAGRASLHRRTSKRWLYWACPT----QRRSVMLVEIELPVPDSLPTSAESSRSSSMGPNGPTDQKLQRYETNRLKLINQVSLHSL

Query:  TVGHSTTDPQLHSSHCRIFLCPRISTNTTSQSFTCVRLTFQEGQWVSLEEDDVHWLHAIFRTKLAGGPGGGVTTWYQS----------------------
                                    T +   C+  +     W+ +  DDV WLHAIFR K AGGPGGGVTTWYQS                      
Subjt:  TVGHSTTDPQLHSSHCRIFLCPRISTNTTSQSFTCVRLTFQEGQWVSLEEDDVHWLHAIFRTKLAGGPGGGVTTWYQS----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------------EEKSAVEPS
                                                                                                   EEKSA+E S
Subjt:  -------------------------------------------------------------------------------------------EEKSAVEPS

Query:  RGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRT
        RG  T SG RGREQRRFT GVNVSG QDFKRRSGG+  RQMSSGSAYQRQS+RA SQ A+SVAR RTGQESVASES RT
Subjt:  RGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRT

TYK14494.1 uncharacterized protein E5676_scaffold15G00050 [Cucumis melo var. makuwa], e-value 5.3e-22, identity 66.67%
Query:  EEKSAVEPSRGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRTDAQIANRKCTGQ
        EEKS +E SRG  T SG RGREQ RFT GVNVSG QDFKRRSGG+  RQMSSGSAYQRQSQRA SQ  +SVAR RTGQESVASES RT  +++  + + +
Subjt:  EEKSAVEPSRGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRTDAQIANRKCTGQ

Query:  GAKLN
         ++L+
Subjt:  GAKLN

TYK14494.1 uncharacterized protein E5676_scaffold15G00050 [Cucumis melo var. makuwa], e-value 1.4e-01, identity 27.16%
Query:  TLSVLRDNDAVVSAGRASLHRRTSKRWLYWACPTQRRSVMLVEIELPVPDSLPTSAESSRSSSMGPNGPTDQKLQRYETNRLKLINQVSLHSLTVGHSTT
        TLSVL DNDAV                              VEIELPV D+LPTSAESS S+                                      
Subjt:  TLSVLRDNDAVVSAGRASLHRRTSKRWLYWACPTQRRSVMLVEIELPVPDSLPTSAESSRSSSMGPNGPTDQKLQRYETNRLKLINQVSLHSLTVGHSTT

Query:  DPQLHSSHCRIFLCPRISTNTTSQSFTCVRLTFQEGQWVSLEEDDVHWLHAIFRTKLAGGPGGGVTTWYQSEEKSAVEPSRGAPTASGFRGREQRRFTSG
                              S+S   VR             DDV WLH++FR K A GPGGGVTTWYQS     V P R          R +R+   G
Subjt:  DPQLHSSHCRIFLCPRISTNTTSQSFTCVRLTFQEGQWVSLEEDDVHWLHAIFRTKLAGGPGGGVTTWYQSEEKSAVEPSRGAPTASGFRGREQRRFTSG

Query:  VNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQ
        +    +   +R S     +       + R +Q
Subjt:  VNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQ

TYK14494.1 uncharacterized protein E5676_scaffold15G00050 [Cucumis melo var. makuwa], e-value 7.0e-22, identity 70%
Query:  EEKSAVEPSRGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRTDAQIANRKCTGQ
        EEKS ++ SRG  T SG RGREQRRFT GVNVSG QDFKRRSGG+  RQMSSGSAYQRQSQRA SQ  +SVA+P TGQESVASES RT      +   GQ
Subjt:  EEKSAVEPSRGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRTDAQIANRKCTGQ
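
The % identity values listed for each hit can be cross-checked against the alignment's middle row. The sketch below assumes the usual BLAST pairwise convention, in which a letter in the middle row marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap, and it divides by the total number of alignment columns (gaps included). `percent_identity` is a hypothetical helper written for illustration; it is not part of CuGenDB or of any BLAST toolkit.

```python
def percent_identity(query_rows, midline_rows):
    """Percent identity over all alignment columns (gap columns included).

    Assumes BLAST-style pairwise output: a letter in the midline marks an
    identical residue, '+' a conservative substitution, and a space a
    mismatch or gap. Trailing spaces lost from the midline do not matter,
    because the column count is taken from the query rows.
    """
    columns = sum(len(row) for row in query_rows)
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    if columns == 0:
        raise ValueError("empty alignment")
    return 100.0 * identical / columns


# Query and midline rows of the last TYK14494.1 hit listed above.
query = [
    "EEKSAVEPSRGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRTDAQIANRKCTGQ"
]
midline = [
    "EEKS ++ SRG  T SG RGREQRRFT GVNVSG QDFKRRSGG+  RQMSSGSAYQRQSQRA SQ  +SVA+P TGQESVASES RT      +   GQ"
]
print(round(percent_identity(query, midline), 2))  # compare with the 70 reported for this hit
```

For hits that span several alignment blocks, pass every Query row and every middle row of that hit in order, so that the long gap-only blocks count toward the column total.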

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7T7M6 Reverse transcriptase, e-value 1.3e-26, identity 31.11%
Query:  PLLTLSVLRDNDAVVSAGRASLHRRTSKRWLYWACPT----QRRSVMLVEIELPVPDSLPTSAESSRSSSMGPNGPTDQKLQRYETNRLKLINQVSLHSL
        PL TLSVL DNDAVVSAG+A L +R   RW     PT    +  S  LVEIELPVPD LPTSAESSRS     N  T  +L  +E+  +++  +V L   
Subjt:  PLLTLSVLRDNDAVVSAGRASLHRRTSKRWLYWACPT----QRRSVMLVEIELPVPDSLPTSAESSRSSSMGPNGPTDQKLQRYETNRLKLINQVSLHSL

Query:  TVGHSTTDPQLHSSHCRIFLCPRISTNTTSQSFTCVRLTFQEGQWVSLEEDDVHWLHAIFRTKLAGGPGGGVTTWYQS----------------------
                                    T +   C+  +     W+ +  DDV WLHAIFR K AGGPGGGVTTWYQS                      
Subjt:  TVGHSTTDPQLHSSHCRIFLCPRISTNTTSQSFTCVRLTFQEGQWVSLEEDDVHWLHAIFRTKLAGGPGGGVTTWYQS----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------------EEKSAVEPS
                                                                                                   EEKSA+E S
Subjt:  -------------------------------------------------------------------------------------------EEKSAVEPS

Query:  RGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRT
        RG  T SG RGREQRRFT GVNVSG QDFKRRSGG+  RQMSSGSAYQRQS+RA SQ A+SVAR RTGQESVASES RT
Subjt:  RGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRT

A0A5D3BNX1 Zf-CCHC domain-containing protein, e-value 1.7e-26, identity 41.22%
Query:  PLLTLSVLRDNDAVVSAGRASLHRRTSKRWLYWACPTQRRSVMLVEIELPVPDSLPTSAESSRSSSMGPN-GPTDQKLQRYETNRLKLINQVSLHSLTVG
        PL TLSVLRDNDAV                               EIELPVPD+L TSAESS S+S        D     +   R K++        T  
Subjt:  PLLTLSVLRDNDAVVSAGRASLHRRTSKRWLYWACPTQRRSVMLVEIELPVPDSLPTSAESSRSSSMGPN-GPTDQKLQRYETNRLKLINQVSLHSLTVG

Query:  HSTTDPQLHSSHC------------RIFLCPRISTNTTSQSF--TCVRLTFQEGQWVSLEEDDV----HWLHAIFRTKLAGGPGGGVTTWYQ--------
        H+TT  Q   +              R    PR      S+ F  +   +  QE  W S  E         L    RT +       +  W          
Subjt:  HSTTDPQLHSSHC------------RIFLCPRISTNTTSQSF--TCVRLTFQEGQWVSLEEDDV----HWLHAIFRTKLAGGPGGGVTTWYQ--------

Query:  -------SEEKSAVEPSRGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRT
                EEKSA+E SRG  T SG RGREQRRFT GV+VSG QDFKRRSGG+  RQMSSGSAYQRQSQRA SQ  +SVAR RTGQESVASES RT
Subjt:  -------SEEKSAVEPSRGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRT

A0A5D3BTP3 Reverse transcriptase, e-value 1.3e-26, identity 31.11%
Query:  PLLTLSVLRDNDAVVSAGRASLHRRTSKRWLYWACPT----QRRSVMLVEIELPVPDSLPTSAESSRSSSMGPNGPTDQKLQRYETNRLKLINQVSLHSL
        PL TLSVL DNDAVVSAG+A L +R   RW     PT    +  S  LVEIELPVPD LPTSAESSRS     N  T  +L  +E+  +++  +V L   
Subjt:  PLLTLSVLRDNDAVVSAGRASLHRRTSKRWLYWACPT----QRRSVMLVEIELPVPDSLPTSAESSRSSSMGPNGPTDQKLQRYETNRLKLINQVSLHSL

Query:  TVGHSTTDPQLHSSHCRIFLCPRISTNTTSQSFTCVRLTFQEGQWVSLEEDDVHWLHAIFRTKLAGGPGGGVTTWYQS----------------------
                                    T +   C+  +     W+ +  DDV WLHAIFR K AGGPGGGVTTWYQS                      
Subjt:  TVGHSTTDPQLHSSHCRIFLCPRISTNTTSQSFTCVRLTFQEGQWVSLEEDDVHWLHAIFRTKLAGGPGGGVTTWYQS----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------------EEKSAVEPS
                                                                                                   EEKSA+E S
Subjt:  -------------------------------------------------------------------------------------------EEKSAVEPS

Query:  RGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRT
        RG  T SG RGREQRRFT GVNVSG QDFKRRSGG+  RQMSSGSAYQRQS+RA SQ A+SVAR RTGQESVASES RT
Subjt:  RGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRT

A0A5D3CU23 Retrotrans_gag domain-containing protein, e-value 2.6e-22, identity 66.67%
Query:  EEKSAVEPSRGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRTDAQIANRKCTGQ
        EEKS +E SRG  T SG RGREQ RFT GVNVSG QDFKRRSGG+  RQMSSGSAYQRQSQRA SQ  +SVAR RTGQESVASES RT  +++  + + +
Subjt:  EEKSAVEPSRGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRTDAQIANRKCTGQ

Query:  GAKLN
         ++L+
Subjt:  GAKLN

A0A5D3CU23 Retrotrans_gag domain-containing protein, e-value 6.6e-02, identity 27.16%
Query:  TLSVLRDNDAVVSAGRASLHRRTSKRWLYWACPTQRRSVMLVEIELPVPDSLPTSAESSRSSSMGPNGPTDQKLQRYETNRLKLINQVSLHSLTVGHSTT
        TLSVL DNDAV                              VEIELPV D+LPTSAESS S+                                      
Subjt:  TLSVLRDNDAVVSAGRASLHRRTSKRWLYWACPTQRRSVMLVEIELPVPDSLPTSAESSRSSSMGPNGPTDQKLQRYETNRLKLINQVSLHSLTVGHSTT

Query:  DPQLHSSHCRIFLCPRISTNTTSQSFTCVRLTFQEGQWVSLEEDDVHWLHAIFRTKLAGGPGGGVTTWYQSEEKSAVEPSRGAPTASGFRGREQRRFTSG
                              S+S   VR             DDV WLH++FR K A GPGGGVTTWYQS     V P R          R +R+   G
Subjt:  DPQLHSSHCRIFLCPRISTNTTSQSFTCVRLTFQEGQWVSLEEDDVHWLHAIFRTKLAGGPGGGVTTWYQSEEKSAVEPSRGAPTASGFRGREQRRFTSG

Query:  VNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQ
        +    +   +R S     +       + R +Q
Subjt:  VNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQ

A0A5D3CU23 Retrotrans_gag domain-containing protein, e-value 3.4e-22, identity 70%
Query:  EEKSAVEPSRGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRTDAQIANRKCTGQ
        EEKS ++ SRG  T SG RGREQRRFT GVNVSG QDFKRRSGG+  RQMSSGSAYQRQSQRA SQ  +SVA+P TGQESVASES RT      +   GQ
Subjt:  EEKSAVEPSRGAPTASGFRGREQRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRTDAQIANRKCTGQ

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAAAAAAAAAAAAGAAAAGAAAAGAAAATGGGAATGGACAGCAAACAACCCACCATCGCCTAAGCTAAACTCCAACCCATTTTTAGTCATTTTCAGCCACATTTCTCA
TCATTTTCCAAACAAAATTTCAGAGCACTTGAGGGAGAGAGTGAAGGAAGAAAAAGAAGAAGAATTTTGGGGTTTTGAGGTTGAAGAAGATAGGAAAAGCTCATGCAAAG
CCAGCCATAGCAGAATCTGGTTCAACATGCCAACCTCTGGTTTTTTATTCGGTTTGAGCTACACAGAAGGAACTAAGAGAGCACAGAATTTTCGAGCAAAGCAGGAGGAT
CTCTGGTTTTCTCTGGTTGTTCGAGCATCCTGGGGTGTACAAGTTGAATCAGAAGCTCTATTTGGTTATATTCCGTTGTTGACGTTGAGTGTTCTCCGTGACAACGATGC
TGTCGTGAGTGCTGGGCGGGCCTCACTACATCGTAGGACTAGTAAACGTTGGTTGTACTGGGCGTGCCCTACACAACGTAGATCGGTCATGTTAGTAGAGATCGAGCTCC
CAGTGCCTGATTCACTGCCAACGTCTGCTGAAAGTTCCAGATCAAGCTCCATGGGACCTAATGGACCTACAGATCAGAAGCTCCAACGATACGAGACTAATCGGCTTAAA
CTCATTAACCAAGTTAGTCTTCATTCGTTAACTGTGGGTCACTCCACTACAGACCCACAGCTGCACTCTTCTCACTGCAGAATATTTCTGTGTCCACGGATATCGACCAA
TACTACAAGTCAATCCTTCACGTGTGTTCGTTTAACGTTTCAGGAGGGTCAATGGGTATCGTTAGAGGAGGACGATGTCCATTGGCTTCACGCCATCTTCCGGACTAAGC
TAGCAGGTGGTCCGGGAGGGGGTGTGACAACTTGGTATCAGAGCGAGGAAAAGTCGGCTGTGGAGCCTAGTCGTGGGGCTCCAACAGCTAGTGGTTTTCGAGGTCGTGAG
CAGCGGAGGTTCACATCTGGAGTGAATGTTTCAGGCCGTCAAGACTTCAAGCGTCGATCTGGTGGCCAGTCATCAAGGCAGATGAGTTCGGGTAGTGCCTATCAGAGGCA
GAGTCAGAGAGCCCCCAGTCAGTGTGCGAGTTCAGTAGCAAGACCGCGAACGGGTCAGGAGTCCGTTGCTAGTGAATCAATGAGAACTGATGCGCAAATTGCTAACCGCA
AGTGTACGGGTCAAGGAGCAAAACTCAATGATAAAGATAAAACACAAAAGGATGAATTTATAGAAAATCTGCCGACAGCGTCGAGACGCTATGGACAGCGTCGCGACGCT
GTCGCGATACTGCGGATTTGGAAAATTCGCCGAGAGCGTCGCGACGCTGGTCCTAGGGTCGCGACGCTATGA
mRNA sequence
ATGAAAAAAAAAAAAGAAAAGAAAAGAAAATGGGAATGGACAGCAAACAACCCACCATCGCCTAAGCTAAACTCCAACCCATTTTTAGTCATTTTCAGCCACATTTCTCA
TCATTTTCCAAACAAAATTTCAGAGCACTTGAGGGAGAGAGTGAAGGAAGAAAAAGAAGAAGAATTTTGGGGTTTTGAGGTTGAAGAAGATAGGAAAAGCTCATGCAAAG
CCAGCCATAGCAGAATCTGGTTCAACATGCCAACCTCTGGTTTTTTATTCGGTTTGAGCTACACAGAAGGAACTAAGAGAGCACAGAATTTTCGAGCAAAGCAGGAGGAT
CTCTGGTTTTCTCTGGTTGTTCGAGCATCCTGGGGTGTACAAGTTGAATCAGAAGCTCTATTTGGTTATATTCCGTTGTTGACGTTGAGTGTTCTCCGTGACAACGATGC
TGTCGTGAGTGCTGGGCGGGCCTCACTACATCGTAGGACTAGTAAACGTTGGTTGTACTGGGCGTGCCCTACACAACGTAGATCGGTCATGTTAGTAGAGATCGAGCTCC
CAGTGCCTGATTCACTGCCAACGTCTGCTGAAAGTTCCAGATCAAGCTCCATGGGACCTAATGGACCTACAGATCAGAAGCTCCAACGATACGAGACTAATCGGCTTAAA
CTCATTAACCAAGTTAGTCTTCATTCGTTAACTGTGGGTCACTCCACTACAGACCCACAGCTGCACTCTTCTCACTGCAGAATATTTCTGTGTCCACGGATATCGACCAA
TACTACAAGTCAATCCTTCACGTGTGTTCGTTTAACGTTTCAGGAGGGTCAATGGGTATCGTTAGAGGAGGACGATGTCCATTGGCTTCACGCCATCTTCCGGACTAAGC
TAGCAGGTGGTCCGGGAGGGGGTGTGACAACTTGGTATCAGAGCGAGGAAAAGTCGGCTGTGGAGCCTAGTCGTGGGGCTCCAACAGCTAGTGGTTTTCGAGGTCGTGAG
CAGCGGAGGTTCACATCTGGAGTGAATGTTTCAGGCCGTCAAGACTTCAAGCGTCGATCTGGTGGCCAGTCATCAAGGCAGATGAGTTCGGGTAGTGCCTATCAGAGGCA
GAGTCAGAGAGCCCCCAGTCAGTGTGCGAGTTCAGTAGCAAGACCGCGAACGGGTCAGGAGTCCGTTGCTAGTGAATCAATGAGAACTGATGCGCAAATTGCTAACCGCA
AGTGTACGGGTCAAGGAGCAAAACTCAATGATAAAGATAAAACACAAAAGGATGAATTTATAGAAAATCTGCCGACAGCGTCGAGACGCTATGGACAGCGTCGCGACGCT
GTCGCGATACTGCGGATTTGGAAAATTCGCCGAGAGCGTCGCGACGCTGGTCCTAGGGTCGCGACGCTATGA
Protein sequence
MKKKKEKKRKWEWTANNPPSPKLNSNPFLVIFSHISHHFPNKISEHLRERVKEEKEEEFWGFEVEEDRKSSCKASHSRIWFNMPTSGFLFGLSYTEGTKRAQNFRAKQED
LWFSLVVRASWGVQVESEALFGYIPLLTLSVLRDNDAVVSAGRASLHRRTSKRWLYWACPTQRRSVMLVEIELPVPDSLPTSAESSRSSSMGPNGPTDQKLQRYETNRLK
LINQVSLHSLTVGHSTTDPQLHSSHCRIFLCPRISTNTTSQSFTCVRLTFQEGQWVSLEEDDVHWLHAIFRTKLAGGPGGGVTTWYQSEEKSAVEPSRGAPTASGFRGRE
QRRFTSGVNVSGRQDFKRRSGGQSSRQMSSGSAYQRQSQRAPSQCASSVARPRTGQESVASESMRTDAQIANRKCTGQGAKLNDKDKTQKDEFIENLPTASRRYGQRRDA
VAILRIWKIRRERRDAGPRVATL
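
The protein sequence above should be the frame-1 translation of the CDS. A minimal sketch for checking that relationship follows, assuming the standard nuclear genetic code (the record does not state which translation table was used). `translate` and `CODON_TABLE` are names introduced here for illustration only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order:
# TTT, TTC, TTA, TTG, TCT, ... GGG. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence lines
    can be pasted in as they are. Codons containing characters other than
    A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Opening codons of the CDS above, spaced for readability.
print(translate("ATG AAA AAA AAA AAA GAA AAG AAA AGA AAA TGG GAA TGG"))
# Closing codons of the CDS above, ending in the TGA stop codon.
print(translate("GTC GCG ACG CTA TGA"))
```

Passing the full CDS text to `translate` and comparing the result with the protein sequence, after joining its wrapped lines, gives a quick consistency check on the record.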