CuGenDBv2

Cmc03g0071071 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc03g0071071
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Transposon Ty3-I Gag-Pol polyprotein isoform X1
Genome location: CMiso1.1chr03:17151491..17152144
RNA-Seq Expression: Cmc03g0071071
Synteny: Cmc03g0071071
Gene Ontology terms:
  GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
  IPR043502 - DNA/RNA polymerase superfamily
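
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, assuming coordinates are 1-based and inclusive (an assumption, not stated on this page), the span can be parsed and measured as follows. The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re

# Pattern for strings such as "CMiso1.1chr03:17151491..17152144".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its three parts."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    seqid, start, end = parse_location("CMiso1.1chr03:17151491..17152144")
    print(seqid, start, end, span_length(start, end))
```

Under that assumption the span is 654 bases long.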


Homology
GenBank top hits
KAA0035681.1 reverse transcriptase [Cucumis melo var. makuwa] | e value: 6.4e-95 | % identity: 79.72
Query:  MGKKIILLPLPKKNTEGIRQKNRGQLFITVSGKKLLREREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASLP
        MGKK+ILLPL KKNTE IRQKN+ QLFITVSGK LL+EREQDLLG +V DKS   + EI+EPRLK+LFAEF HLKKEP+GLPPL DI  QIDL+P ASLP
Subjt:  MGKKIILLPLPKKNTEGIRQKNRGQLFITVSGKKLLREREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASLP

Query:  NLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIR
        NL HYRMSPEEYQVLHDHIEDLLKKGHIK S SPCAVPAL  P KDGSWRMCV SRAIN++T KYRF IPRIGD+LDQLGKA +FSKIDL+ GYHQI+IR
Subjt:  NLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIR

Query:  PEDEWKIAFKTNEGLSE
        P DEWK AFKTNEGL E
Subjt:  PEDEWKIAFKTNEGLSE

KAA0040321.1 RNA-directed DNA polymerase-like protein [Cucumis melo var. makuwa] | e value: 1.8e-84 | % identity: 72.81
Query:  MGKKIILLPLPKKNTEGIRQKNRGQLFITVSGKKLLREREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASLP
        MGKKI+LLPL K N      KN+GQLF TVS KKL+RERE+D+LG V+ DK+  +  EI+EP+L++L AEF HLKKEP GLPPL DI  QIDLI GASLP
Subjt:  MGKKIILLPLPKKNTEGIRQKNRGQLFITVSGKKLLREREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASLP

Query:  NLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIR
        NL +YRMSP EYQ+LH+HI+DLLKKGH K S SPCA PAL  PKKDGSWRMCV SRAIN+ITVKYRF IPRIGD+LDQLGKAT+FSKIDL+ GYHQIRIR
Subjt:  NLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIR

Query:  PEDEWKIAFKTNEGLSE
          DEWK  FKTNEGL E
Subjt:  PEDEWKIAFKTNEGLSE

KAA0056582.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] | e value: 5.0e-79 | % identity: 72.02
Query:  MGKKIILLPLPKKNTEGIR-QKNRGQLFITVSGKKLLREREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASL
        MGKK+ILLPL KKNT+GIR QKN+GQLFITVSGKKLLREREQDLLG V+ADKS EK+HEIIEP+LKKLFAEF HLKKEP+GLPP   I  QIDLIPGASL
Subjt:  MGKKIILLPLPKKNTEGIR-QKNRGQLFITVSGKKLLREREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASL

Query:  PNLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRI
        PNLA+YRMSP+EYQVLHDHIEDLLKK H+K S SPC VPAL  PKK                           G++LDQLGKAT+FSKIDLK  YHQIRI
Subjt:  PNLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRI

Query:  RPEDEWKIAFKTNEGLSE
        RP DEWK AFKTNEGL E
Subjt:  RPEDEWKIAFKTNEGLSE

TYK30863.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa] | e value: 6.4e-95 | % identity: 79.72
Query:  MGKKIILLPLPKKNTEGIRQKNRGQLFITVSGKKLLREREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASLP
        MGKK+ILLPL KKNTE IRQKN+ QLFITVSGK LL+EREQDLLG +V DKS   + EI+EPRLK+LFAEF HLKKEP+GLPPL DI  QIDL+P ASLP
Subjt:  MGKKIILLPLPKKNTEGIRQKNRGQLFITVSGKKLLREREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASLP

Query:  NLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIR
        NL HYRMSPEEYQVLHDHIEDLLKKGHIK S SPCAVPAL  P KDGSWRMCV SRAIN++T KYRF IPRIGD+LDQLGKA +FSKIDL+ GYHQI+IR
Subjt:  NLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIR

Query:  PEDEWKIAFKTNEGLSE
        P DEWK AFKTNEGL E
Subjt:  PEDEWKIAFKTNEGLSE

XP_031744062.1 uncharacterized protein LOC116404773 [Cucumis sativus] | e value: 6.5e-87 | % identity: 73.27
Query:  MGKKIILLPLPKKNTEGIRQKNRGQLFITVSGKKLLREREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASLP
        MG+K++LLP+ KK  EG+R +   QLFITVSGKK+L+EREQ +LG VV +K+ EK  E IEP+L++L  EF H+K+EP+GLPPL DI   IDLIPGASLP
Subjt:  MGKKIILLPLPKKNTEGIRQKNRGQLFITVSGKKLLREREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASLP

Query:  NLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIR
        NLAHYRMSP+EY++LHDHIE+LLKKGHIK S SPCAVPAL  PKKDGSWRMCV SRAIN+ITVKYRF IPRI D+LDQLGKA++FSKIDLK GYHQIR+R
Subjt:  NLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIR

Query:  PEDEWKIAFKTNEGLSE
        P DEWK AFKTNEGL E
Subjt:  PEDEWKIAFKTNEGLSE

TrEMBL top hits
A0A5A7T256 Reverse transcriptase | e value: 3.1e-95 | % identity: 79.72
Query:  MGKKIILLPLPKKNTEGIRQKNRGQLFITVSGKKLLREREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASLP
        MGKK+ILLPL KKNTE IRQKN+ QLFITVSGK LL+EREQDLLG +V DKS   + EI+EPRLK+LFAEF HLKKEP+GLPPL DI  QIDL+P ASLP
Subjt:  MGKKIILLPLPKKNTEGIRQKNRGQLFITVSGKKLLREREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASLP

Query:  NLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIR
        NL HYRMSPEEYQVLHDHIEDLLKKGHIK S SPCAVPAL  P KDGSWRMCV SRAIN++T KYRF IPRIGD+LDQLGKA +FSKIDL+ GYHQI+IR
Subjt:  NLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIR

Query:  PEDEWKIAFKTNEGLSE
        P DEWK AFKTNEGL E
Subjt:  PEDEWKIAFKTNEGLSE

A0A5A7UL80 DNA/RNA polymerases superfamily protein | e value: 2.4e-79 | % identity: 72.02
Query:  MGKKIILLPLPKKNTEGIR-QKNRGQLFITVSGKKLLREREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASL
        MGKK+ILLPL KKNT+GIR QKN+GQLFITVSGKKLLREREQDLLG V+ADKS EK+HEIIEP+LKKLFAEF HLKKEP+GLPP   I  QIDLIPGASL
Subjt:  MGKKIILLPLPKKNTEGIR-QKNRGQLFITVSGKKLLREREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASL

Query:  PNLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRI
        PNLA+YRMSP+EYQVLHDHIEDLLKK H+K S SPC VPAL  PKK                           G++LDQLGKAT+FSKIDLK  YHQIRI
Subjt:  PNLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRI

Query:  RPEDEWKIAFKTNEGLSE
        RP DEWK AFKTNEGL E
Subjt:  RPEDEWKIAFKTNEGLSE

A0A5A7V4G7 Retrovirus-related Pol polyprotein from transposon 17.6 | e value: 2.3e-74 | % identity: 76.8
Query:  REREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASLPNLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCA
        R  EQDLLG VVA+KS   + EI+EPRLK+LFAEF HLKKEP+GLPPL DI  QIDL+PGASLP+L HYRMSPEEYQVLHD+IE+LLKKGHIK S SPC 
Subjt:  REREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASLPNLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCA

Query:  VPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIRPEDEWKIAFKTNEGLSE
        VPAL  PKKD SWRMCV SRAIN+ITVKY F IP++GD+LDQLGKA VFSKIDL+  YHQIRIRPEDEWK  FK NEGL E
Subjt:  VPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIRPEDEWKIAFKTNEGLSE

A0A5D3DIC3 RNA-directed DNA polymerase-like protein | e value: 8.5e-85 | % identity: 72.81
Query:  MGKKIILLPLPKKNTEGIRQKNRGQLFITVSGKKLLREREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASLP
        MGKKI+LLPL K N      KN+GQLF TVS KKL+RERE+D+LG V+ DK+  +  EI+EP+L++L AEF HLKKEP GLPPL DI  QIDLI GASLP
Subjt:  MGKKIILLPLPKKNTEGIRQKNRGQLFITVSGKKLLREREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASLP

Query:  NLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIR
        NL +YRMSP EYQ+LH+HI+DLLKKGH K S SPCA PAL  PKKDGSWRMCV SRAIN+ITVKYRF IPRIGD+LDQLGKAT+FSKIDL+ GYHQIRIR
Subjt:  NLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIR

Query:  PEDEWKIAFKTNEGLSE
          DEWK  FKTNEGL E
Subjt:  PEDEWKIAFKTNEGLSE

A0A5D3E417 Transposon Ty3-I Gag-Pol polyprotein isoform X1 | e value: 3.1e-95 | % identity: 79.72
Query:  MGKKIILLPLPKKNTEGIRQKNRGQLFITVSGKKLLREREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASLP
        MGKK+ILLPL KKNTE IRQKN+ QLFITVSGK LL+EREQDLLG +V DKS   + EI+EPRLK+LFAEF HLKKEP+GLPPL DI  QIDL+P ASLP
Subjt:  MGKKIILLPLPKKNTEGIRQKNRGQLFITVSGKKLLREREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASLP

Query:  NLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIR
        NL HYRMSPEEYQVLHDHIEDLLKKGHIK S SPCAVPAL  P KDGSWRMCV SRAIN++T KYRF IPRIGD+LDQLGKA +FSKIDL+ GYHQI+IR
Subjt:  NLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIR

Query:  PEDEWKIAFKTNEGLSE
        P DEWK AFKTNEGL E
Subjt:  PEDEWKIAFKTNEGLSE

SwissProt top hits
P0CT34 Transposon Tf2-1 polyprotein | e value: 3.2e-20 | % identity: 33.12
Query:  EPRLKKLFAEFSHLKKE--PEGLP-PLCDIHQQIDLIPGASLPNLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRA
        EP L  ++ EF  +  E   E LP P+  +  +++L        + +Y + P + Q ++D I   LK G I+ S +  A P +F+PKK+G+ RM V  + 
Subjt:  EPRLKKLFAEFSHLKKE--PEGLP-PLCDIHQQIDLIPGASLPNLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRA

Query:  INQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIRPEDEWKIAFKTNEGLSE
        +N+      + +P I  +L ++  +T+F+K+DLK  YH IR+R  DE K+AF+   G+ E
Subjt:  INQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIRPEDEWKIAFKTNEGLSE

P0CT41 Transposon Tf2-12 polyprotein | e value: 3.2e-20 | % identity: 33.12
Query:  EPRLKKLFAEFSHLKKE--PEGLP-PLCDIHQQIDLIPGASLPNLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRA
        EP L  ++ EF  +  E   E LP P+  +  +++L        + +Y + P + Q ++D I   LK G I+ S +  A P +F+PKK+G+ RM V  + 
Subjt:  EPRLKKLFAEFSHLKKE--PEGLP-PLCDIHQQIDLIPGASLPNLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRA

Query:  INQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIRPEDEWKIAFKTNEGLSE
        +N+      + +P I  +L ++  +T+F+K+DLK  YH IR+R  DE K+AF+   G+ E
Subjt:  INQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIRPEDEWKIAFKTNEGLSE

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | e value: 2.5e-25 | % identity: 38.93
Query:  IHQQIDLIPGASLPNLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFS
        +   I++ PGA LP L  Y ++ +  Q ++  ++ LL    I  S SPC+ P + +PKKDG++R+CV  R +N+ T+   F +PRI ++L ++G A +F+
Subjt:  IHQQIDLIPGASLPNLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFS

Query:  KIDLKGGYHQIRIRPEDEWKIAFKTNEGLSE
         +DL  GYHQI + P+D +K AF T  G  E
Subjt:  KIDLKGGYHQIRIRPEDEWKIAFKTNEGLSE

Q99315 Transposon Ty3-G Gag-Pol polyprotein | e value: 2.5e-25 | % identity: 38.93
Query:  IHQQIDLIPGASLPNLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFS
        +   I++ PGA LP L  Y ++ +  Q ++  ++ LL    I  S SPC+ P + +PKKDG++R+CV  R +N+ T+   F +PRI ++L ++G A +F+
Subjt:  IHQQIDLIPGASLPNLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFS

Query:  KIDLKGGYHQIRIRPEDEWKIAFKTNEGLSE
         +DL  GYHQI + P+D +K AF T  G  E
Subjt:  KIDLKGGYHQIRIRPEDEWKIAFKTNEGLSE

Q9UR07 Transposon Tf2-11 polyprotein | e value: 3.2e-20 | % identity: 33.12
Query:  EPRLKKLFAEFSHLKKE--PEGLP-PLCDIHQQIDLIPGASLPNLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRA
        EP L  ++ EF  +  E   E LP P+  +  +++L        + +Y + P + Q ++D I   LK G I+ S +  A P +F+PKK+G+ RM V  + 
Subjt:  EPRLKKLFAEFSHLKKE--PEGLP-PLCDIHQQIDLIPGASLPNLAHYRMSPEEYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRA

Query:  INQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIRPEDEWKIAFKTNEGLSE
        +N+      + +P I  +L ++  +T+F+K+DLK  YH IR+R  DE K+AF+   G+ E
Subjt:  INQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIRPEDEWKIAFKTNEGLSE

Arabidopsis top hits
No hits found
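
The % identity values in the hit lists can be related to the middle line of each alignment block. In the usual BLAST pairwise layout, a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch under that convention; `midline_stats` and `percent_identity` are hypothetical helper names, and the figures it produces for a single block are not the whole-alignment values reported above.

```python
def midline_stats(midline: str) -> tuple[int, int]:
    """Count identities and positives on one BLAST middle line.

    Letters are identities; '+' marks conservative substitutions,
    which count as positives but not as identities.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


def percent_identity(identities: int, aligned_length: int) -> float:
    """Identities as a percentage of the aligned length, to two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * identities / aligned_length, 2)


if __name__ == "__main__":
    # Final 17-column block of the first GenBank hit on this page.
    query = "PEDEWKIAFKTNEGLSE"
    midline = "P DEWK AFKTNEGL E"
    identities, positives = midline_stats(midline)
    print(identities, positives, percent_identity(identities, len(query)))
```

To reproduce a reported value, the counts would need to be summed over every block of the alignment before dividing by the total aligned length.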

Sequences
CDS sequence
ATGGGAAAGAAAATAATTCTACTCCCATTGCCAAAGAAAAATACAGAAGGTATAAGGCAGAAGAATAGGGGGCAGCTTTTCATCACAGTAAGTGGAAAGAAATTGTTGAG
AGAAAGGGAACAAGATCTTTTGGGACCAGTAGTTGCTGACAAATCCTATGAGAAAGATCATGAAATTATTGAGCCGAGACTAAAGAAATTGTTTGCAGAATTCTCTCATT
TAAAAAAGGAGCCAGAAGGACTGCCACCACTTTGTGACATTCACCAACAAATTGATCTTATTCCAGGAGCATCATTGCCTAATCTAGCCCACTATAGGATGAGCCCAGAA
GAATACCAAGTCTTGCATGATCATATTGAAGACTTGCTAAAGAAGGGTCATATCAAGTCAAGCCCAAGTCCATGCGCTGTACCTGCATTGTTCATACCAAAGAAGGATGG
AAGTTGGAGGATGTGCGTGTACAGCAGGGCTATTAATCAAATTACTGTGAAATATCGTTTTTCTATCCCTCGGATTGGAGACATATTGGACCAGCTAGGCAAGGCTACTG
TCTTCTCAAAAATTGATTTAAAAGGCGGCTACCATCAAATAAGAATTAGACCAGAGGATGAATGGAAGATAGCATTCAAGACGAATGAAGGGTTATCTGAGTAG
mRNA sequence
ATGGGAAAGAAAATAATTCTACTCCCATTGCCAAAGAAAAATACAGAAGGTATAAGGCAGAAGAATAGGGGGCAGCTTTTCATCACAGTAAGTGGAAAGAAATTGTTGAG
AGAAAGGGAACAAGATCTTTTGGGACCAGTAGTTGCTGACAAATCCTATGAGAAAGATCATGAAATTATTGAGCCGAGACTAAAGAAATTGTTTGCAGAATTCTCTCATT
TAAAAAAGGAGCCAGAAGGACTGCCACCACTTTGTGACATTCACCAACAAATTGATCTTATTCCAGGAGCATCATTGCCTAATCTAGCCCACTATAGGATGAGCCCAGAA
GAATACCAAGTCTTGCATGATCATATTGAAGACTTGCTAAAGAAGGGTCATATCAAGTCAAGCCCAAGTCCATGCGCTGTACCTGCATTGTTCATACCAAAGAAGGATGG
AAGTTGGAGGATGTGCGTGTACAGCAGGGCTATTAATCAAATTACTGTGAAATATCGTTTTTCTATCCCTCGGATTGGAGACATATTGGACCAGCTAGGCAAGGCTACTG
TCTTCTCAAAAATTGATTTAAAAGGCGGCTACCATCAAATAAGAATTAGACCAGAGGATGAATGGAAGATAGCATTCAAGACGAATGAAGGGTTATCTGAGTAG
Protein sequence
MGKKIILLPLPKKNTEGIRQKNRGQLFITVSGKKLLREREQDLLGPVVADKSYEKDHEIIEPRLKKLFAEFSHLKKEPEGLPPLCDIHQQIDLIPGASLPNLAHYRMSPE
EYQVLHDHIEDLLKKGHIKSSPSPCAVPALFIPKKDGSWRMCVYSRAINQITVKYRFSIPRIGDILDQLGKATVFSKIDLKGGYHQIRIRPEDEWKIAFKTNEGLSE
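
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (the page does not state which translation table was used); `translate` is a hypothetical helper, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence
    lines can be pasted in directly. Codons containing characters
    other than T, C, A, G are rendered as 'X'.
    """
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First six codons of the CDS listed above.
    print(translate("ATGGGAAAGAAAATAATT"))
```

Applied to the first six codons of the CDS (ATGGGAAAGAAAATAATT), this yields MGKKII, which matches the start of the protein sequence listed above.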