; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0001016 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0001016
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr10:6952930..6958862
RNA-Seq ExpressionIVF0001016
SyntenyIVF0001016
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033083.1 copia protein [Cucumis melo var. makuwa]2.05e-30351.76Show/hide
Query:  MKALLGSQDVWDIVSNGYEEPESDAALNQVEREVFQNTRKKYQKALTIIHQAINDNNFEKISGATTAYQAWKILENTYKGVDRVKK--------------
        MKALLGSQDVWDIVSNGYEEPESDA  NQ +RE  QNTRK  QKALTIIHQAI+DNNFEKISGATTAYQAW+ILENTYKGVDRVKK              
Subjt:  MKALLGSQDVWDIVSNGYEEPESDAALNQVEREVFQNTRKKYQKALTIIHQAINDNNFEKISGATTAYQAWKILENTYKGVDRVKK--------------

Query:  --------------------------------------------------------DLSKMSINRLMGSLQVHEEKLLKKNKQMTEQLFQSKLKLKDKED
                                                                DLS MSI++LMGSLQ HEEKLLKKNKQMTEQLFQSKLKLKDKE 
Subjt:  --------------------------------------------------------DLSKMSINRLMGSLQVHEEKLLKKNKQMTEQLFQSKLKLKDKED

Query:  NLEKGNRGRGRG------DFK-----------------------------------------VEVEEATVKENLI---KVIQTQIHQE------------
        NLEKGNRGRGRG      DFK                                          E +E +   +L+   K ++T  +              
Subjt:  NLEKGNRGRGRG------DFK-----------------------------------------VEVEEATVKENLI---KVIQTQIHQE------------

Query:  ------VEEDNIIRGQL-----------GKDQIMTGGMTKDRLN----------------------------------VIIVINSAIVLG----------
              VE D  + G +           GK + M  G+   +L                                   V  V   + V G          
Subjt:  ------VEEDNIIRGQL-----------GKDQIMTGGMTKDRLN----------------------------------VIIVINSAIVLG----------

Query:  --------------------------------------------NAEIELKKMQIMLRKMKKAKCKKMPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEA
                                                    N  +E K   I+       KCKKMPKEFWAQAVECAVYLSNR PTRSLWNKTP++A
Subjt:  --------------------------------------------NAEIELKKMQIMLRKMKKAKCKKMPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEA

Query:  WIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKILFFPEDRDEPSDIASP
        W GRKPSIG+LRVFGCMAYA+IPDQKRSKLDDKSEKYVFVGYDASSK YK YNPVTKKTIVSRDVVFDEEASWNWNDEPEDYK L FP++RDEPSDIASP
Subjt:  WIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKILFFPEDRDEPSDIASP

Query:  PTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDKWTIVMDEEIKVIKKNDTWELSTLPNGKKAIG--
        PT PITPQQSTSSSSASSSE P  MRSL+DIYDETEELSQSFNNLTLFCLF D+EPLNFEEASQNDKW I MDEEIK IKKNDTWELSTLPNGKKA+G  
Subjt:  PTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDKWTIVMDEEIKVIKKNDTWELSTLPNGKKAIG--

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------NCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFI
                                                                  NC SMF+DLK AMTQEFEMTDIGLMSYYL IEVKQSEEGIFI
Subjt:  ----------------------------------------------------------NCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFI

Query:  SQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLF
        SQERYTR+ILEKFNMMNSKSVATPIETGTKLSKHEEGDDVD SYFKSLV SLRYLTCT+PDILF VGLVS FMESPTTTHLKVAKRILRYLRGTLDY LF
Subjt:  SQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLF

Query:  YSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLST
        YSSSKEFKLEGYCDSDWAGDT+DRKSTS YVFF+GNTAFTWSSKKQPIVTLST
Subjt:  YSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLST

KAA0037149.1 copia protein [Cucumis melo var. makuwa]1.22e-29850.85Show/hide
Query:  MKALLGSQDVWDIVSNGYEEPESDAALNQVEREVFQNTRKKYQKALTIIHQAINDNNFEKISGATTAYQAWKILENTYKGVDRVKK--------------
        MKALLGSQDVWDIVSNGYEEPESDAALNQ +RE  QNTRKK QKALTIIHQAI+DNNFEKISGATTAYQAW+ILENTYKGVDRVKK              
Subjt:  MKALLGSQDVWDIVSNGYEEPESDAALNQVEREVFQNTRKKYQKALTIIHQAINDNNFEKISGATTAYQAWKILENTYKGVDRVKK--------------

Query:  --------------------------------------------------------DLSKMSINRLMGSLQVHEEKLLKKNKQMTEQLFQSKLKLKDKED
                                                                DLS MSI++LMGSLQ HEEKLLKKNKQMTEQLFQSKLKLKDKE 
Subjt:  --------------------------------------------------------DLSKMSINRLMGSLQVHEEKLLKKNKQMTEQLFQSKLKLKDKED

Query:  NLEKGNRGRGRG------DFK-------------------------------------------------------------------------------
        +LEKGNRGRGRG      DFK                                                                               
Subjt:  NLEKGNRGRGRG------DFK-------------------------------------------------------------------------------

Query:  ------VEVEEAT-----------------VKENLIKVI-------------------QTQIHQE-----------------VEEDNIIRGQLGKDQIMT
              +E++E+                   ++N++K +                   +    QE                 V+E + + G   + + + 
Subjt:  ------VEVEEAT-----------------VKENLIKVI-------------------QTQIHQE-----------------VEEDNIIRGQLGKDQIMT

Query:  ----------------GGMTKDRLNVIIVIN--------SAIVLGNAEIELKKMQIMLRKMKKAKCKKMPKEFWAQAVECAVYLSNRCPTRSLWNKTPEE
                        G  T +      V N        S     N  +E K   I+       KCKKMPKEFWAQAVECAVYLSNR PTRSLWNKTP++
Subjt:  ----------------GGMTKDRLNVIIVIN--------SAIVLGNAEIELKKMQIMLRKMKKAKCKKMPKEFWAQAVECAVYLSNRCPTRSLWNKTPEE

Query:  AWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKILFFPEDRDEPSDIAS
        AW GRKPSIG+LRVFGCMAYA+IPDQK SKLDDKSEKYVFVGYDASSK YK YN V+KKTIVSRDVVFDEEASWNWNDEPEDYK L FP++ DEPSDIAS
Subjt:  AWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKILFFPEDRDEPSDIAS

Query:  PPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDKWTIVMDEEIKVIKKNDTWELSTLPNGKKAIG-
        PPT PITPQQST SSSASSSE P  MRSL+DIYDETEEL Q FNNLTLFCLFGD+EPLNFEEASQNDKW I MDEEIK IKKNDTWELSTLPNGKKA+G 
Subjt:  PPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDKWTIVMDEEIKVIKKNDTWELSTLPNGKKAIG-

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------NCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIF
                                                                   NC SMF+DLK AMTQEFEMTDIGLMSYYL IEVKQSEEGIF
Subjt:  -----------------------------------------------------------NCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIF

Query:  ISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRL
        ISQERYTR+ILEKFNMMNSKSVATPIETGTKLSKHEEGDDVD SYFKSLV SLRYLTCT+PDILF VGLVS FMESPTTTHLKV KRILRYLRGTLDY L
Subjt:  ISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRL

Query:  FYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLST
        FYSSSKEFKLEGYCDSDWAGDT+DRKSTS YVFF+GNTAFTWSSKKQPIVTLST
Subjt:  FYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLST

KAA0043082.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.0100Show/hide
Query:  MPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEAWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVF
        MPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEAWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVF
Subjt:  MPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEAWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVF

Query:  DEEASWNWNDEPEDYKILFFPEDRDEPSDIASPPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDK
        DEEASWNWNDEPEDYKILFFPEDRDEPSDIASPPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDK
Subjt:  DEEASWNWNDEPEDYKILFFPEDRDEPSDIASPPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDK

Query:  WTIVMDEEIKVIKKNDTWELSTLPNGKKAIGNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVATPIET
        WTIVMDEEIKVIKKNDTWELSTLPNGKKAIGNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVATPIET
Subjt:  WTIVMDEEIKVIKKNDTWELSTLPNGKKAIGNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVATPIET

Query:  GTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKST
        GTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKST
Subjt:  GTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKST

Query:  SQYVFFVGNTAFTWSSKKQPIVTLSTSNRCVAPPSSSSCRRALLHRPCIVDRSPALRSSAGDRADHPRLISRPPFVELRQSSASLPEALLRTSEPSHPSK
        SQYVFFVGNTAFTWSSKKQPIVTLSTSNRCVAPPSSSSCRRALLHRPCIVDRSPALRSSAGDRADHPRLISRPPFVELRQSSASLPEALLRTSEPSHPSK
Subjt:  SQYVFFVGNTAFTWSSKKQPIVTLSTSNRCVAPPSSSSCRRALLHRPCIVDRSPALRSSAGDRADHPRLISRPPFVELRQSSASLPEALLRTSEPSHPSK

Query:  PVASALRPHLTSRFLGRAAEPRVSKPYLREPYPSASHTRVL
        PVASALRPHLTSRFLGRAAEPRVSKPYLREPYPSASHTRVL
Subjt:  PVASALRPHLTSRFLGRAAEPRVSKPYLREPYPSASHTRVL

KAA0062149.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.08e-29147.14Show/hide
Query:  MKALLGSQDVWDIVSNGYEEPESDAALNQVEREVFQNTRKKYQKALTIIHQAINDNNFEKISGATTAYQAWKILENTYKGVDRVKK--------------
        MKALLGSQDVWDIVSNGYEEPESDAALNQ +RE  QNTRKKYQKALTIIHQAI+DNNFEKISGATTAYQAW+ILENTYKGVDRVKK              
Subjt:  MKALLGSQDVWDIVSNGYEEPESDAALNQVEREVFQNTRKKYQKALTIIHQAINDNNFEKISGATTAYQAWKILENTYKGVDRVKK--------------

Query:  --------------------------------------------------------DLSKMSINRLMGSLQVHEEKLLKKNKQMTEQLFQSKLKLKDKED
                                                                DLS MSI++LMGSLQ HEEKLLKKNKQMTE+LFQSKLKLKDKE 
Subjt:  --------------------------------------------------------DLSKMSINRLMGSLQVHEEKLLKKNKQMTEQLFQSKLKLKDKED

Query:  NLEKGNRGRGRG------DFK-----------------------------------------------------------------------------VE
        +LEKGNRGRGRG      DFK                                                                             VE
Subjt:  NLEKGNRGRGRG------DFK-----------------------------------------------------------------------------VE

Query:  VEEAT----------------------------------------VKENLIKV-----------------------------------------IQTQIH
        ++E+                                         +K N++ +                                         IQT + 
Subjt:  VEEAT----------------------------------------VKENLIKV-----------------------------------------IQTQIH

Query:  QEVEE--------------------------DNIIRG----QLGKDQIMTG---------------------------------------GMTKDRLNVI
        + ++                            N+++G    +L  DQ+  G                                       GM K R   +
Subjt:  QEVEE--------------------------DNIIRG----QLGKDQIMTG---------------------------------------GMTKDRLNVI

Query:  IVINSAIVLG------------------------------------NAEIELKKMQIMLRKMKKAKCKKMPKEFWAQAVECAVYLSNRCPTRSLWNKTPE
        +   S   +                                     N  +E K   I+       KCKKMPKEFWAQAVECAVYLSNR P RSLWNKTP+
Subjt:  IVINSAIVLG------------------------------------NAEIELKKMQIMLRKMKKAKCKKMPKEFWAQAVECAVYLSNRCPTRSLWNKTPE

Query:  EAWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKILFFPEDRDEPSDIA
        +AWIGRKPSIG+LRVFGCMAYA+IPDQKRSKLDDKSEKYVFVGYDASSK YK YNPVTKKTIVSRDVVFDEEASWNWNDEPEDYK LFFP++RDEPSDIA
Subjt:  EAWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKILFFPEDRDEPSDIA

Query:  SPPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDKWTIVMDEEIKVIKKNDTWELSTLPNGKKAIG
        SPPT PITPQQSTSSSSASSSE P  MRSL+DIYDETEELSQSFNNLTLFCLFGD+EPLNFEEASQNDKW I MDE+IK IKKNDTWELSTLPNGKKA+G
Subjt:  SPPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDKWTIVMDEEIKVIKKNDTWELSTLPNGKKAIG

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------NCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGI
                                                                    NC SMF+DLKNAMTQEFEMTDIGLMSYYL IEVKQSEEGI
Subjt:  ------------------------------------------------------------NCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGI

Query:  FISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYR
        FISQERYTREILEKFNMMNSK VATPIETGTKLSKHEEGDDVD SYFKSLV SLRYLTCT+PDILF VGLVS FMESPTTTHLKVAKRILRYLRGTLDY 
Subjt:  FISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYR

Query:  LFYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTSNRCVAPPSSSSCRRALL
        LFYSSSKEFKLEGYCDSDWAGDT+DRKSTS YVFF+GNTAFTWSSKKQPIVTLST        +S  C    L
Subjt:  LFYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTSNRCVAPPSSSSCRRALL

TYK12007.1 copia protein [Cucumis melo var. makuwa]8.10e-29649.09Show/hide
Query:  MKALLGSQDVWDIVSNGYEEPESDAALNQVEREVFQNTRKKYQKALTIIHQAINDNNFEKISGATTAYQAWKILENTYKGVDRVKK--------------
        MKALLGSQDVWDIVSNGYEEPESDAALNQ +RE  QNTRKK QKALTIIHQAI+DNNFEKIS ATTAYQAW+ILENTYKGVDRVKK              
Subjt:  MKALLGSQDVWDIVSNGYEEPESDAALNQVEREVFQNTRKKYQKALTIIHQAINDNNFEKISGATTAYQAWKILENTYKGVDRVKK--------------

Query:  --------------------------------------------------------DLSKMSINRLMGSLQVHEEKLLKKNKQMTEQLFQSKLKLKDKED
                                                                DLS MSI++LMGSLQ H+E L KKNKQMTEQLFQSKLKLKDKED
Subjt:  --------------------------------------------------------DLSKMSINRLMGSLQVHEEKLLKKNKQMTEQLFQSKLKLKDKED

Query:  NLEKGNRGRGRG------DFK-------------------------------------------------------------------------------
         LEKGNRGRGRG      DFK                                                                               
Subjt:  NLEKGNRGRGRG------DFK-------------------------------------------------------------------------------

Query:  -----------------VEVEEAT--------------------------------------VKENLIKVI-------------------QTQIHQE---
                         VE++E+                                        ++N++K +                   +    QE   
Subjt:  -----------------VEVEEAT--------------------------------------VKENLIKVI-------------------QTQIHQE---

Query:  --------------VEEDNIIRGQLGKDQIMT----------------GGMTKDRLNVIIVINSA--------IVLGNAEIELKKMQIMLRKMKKAKCKK
                      V+E + + G   + + +                 G  T +      V N              N  +E K   I+       KCKK
Subjt:  --------------VEEDNIIRGQLGKDQIMT----------------GGMTKDRLNVIIVINSA--------IVLGNAEIELKKMQIMLRKMKKAKCKK

Query:  MPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEAWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVF
        MPKEFWAQAVECAVYLSNR PTRSLWNKTP++AW GRKPSIG+LRVFGCMAYA+IPDQKRSKLDDKSEKYVFVGYDASSK YK YNPVTKKTIVSRDVVF
Subjt:  MPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEAWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVF

Query:  DEEASWNWNDEPEDYKILFFPEDRDEPSDIASPPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDK
        DEEASWNWNDEPEDYK L FP++ DEPSDIASPPT PITPQQSTSSSSASSSE P  MRSL+DIYDETEELSQSFNNLTLFCLF D+EPLNFEEASQNDK
Subjt:  DEEASWNWNDEPEDYKILFFPEDRDEPSDIASPPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDK

Query:  WTIVMDEEIKVIKKNDTWELSTLPNGKKAIG---------------------------------------------------------------------
        W I MDEEIK IKKNDTWELSTLPNGKKA+G                                                                     
Subjt:  WTIVMDEEIKVIKKNDTWELSTLPNGKKAIG---------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------------NCTSMFKDL
                                                                                                   NC SMF+DL
Subjt:  -------------------------------------------------------------------------------------------NCTSMFKDL

Query:  KNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVG
        K AMTQEFEMTDIGLMSYYL IEVKQSEEGIFISQERYTREILEKFNMMNSK VATPIETGTKLSKHEEGDDVD SYFKSLV SLRYLTCT+PDILF VG
Subjt:  KNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVG

Query:  LVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTSNRCVAPPSSSSCR
        LVS FMESPTTTHLKVAKRILRYLRGTLDY LFYSSSKEFKLEGYCDSDWAGDT+DRKSTS YVFF+GNTAFTWSSKKQPIVTLST        +S  C 
Subjt:  LVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTSNRCVAPPSSSSCR

Query:  RALL
           L
Subjt:  RALL

TrEMBL top hitse value%identityAlignment
A0A5A7SRM0 Copia protein1.6e-25050.78Show/hide
Query:  MKALLGSQDVWDIVSNGYEEPESDAALNQVEREVFQNTRKKYQKALTIIHQAINDNNFEKISGATTAYQAWKILENTYKGVDRVK---------------
        MKALLGSQDVWDIVSNGYEEPESDA  NQ +RE  QNTRK  QKALTIIHQAI+DNNFEKISGATTAYQAW+ILENTYKGVDRVK               
Subjt:  MKALLGSQDVWDIVSNGYEEPESDAALNQVEREVFQNTRKKYQKALTIIHQAINDNNFEKISGATTAYQAWKILENTYKGVDRVK---------------

Query:  -------------------------------------------------------KDLSKMSINRLMGSLQVHEEKLLKKNKQMTEQLFQSKLKLKDKED
                                                               KDLS MSI++LMGSLQ HEEKLLKKNKQMTEQLFQSKLKLKDKE 
Subjt:  -------------------------------------------------------KDLSKMSINRLMGSLQVHEEKLLKKNKQMTEQLFQSKLKLKDKED

Query:  NLEKGNRG------RGRGDFK-----------------------------------------VEVEEATVKENLI---KVIQTQIHQE------------
        NLEKGNRG      RGRGDFK                                          E +E +   +L+   K ++T  +              
Subjt:  NLEKGNRG------RGRGDFK-----------------------------------------VEVEEATVKENLI---KVIQTQIHQE------------

Query:  ------VEEDNIIRGQL-----------GKDQIMTGGMTKDRLN----------------------------------VIIVINSAIVLG----------
              VE D  + G +           GK + M  G+   +L                                   V  V   + V G          
Subjt:  ------VEEDNIIRGQL-----------GKDQIMTGGMTKDRLN----------------------------------VIIVINSAIVLG----------

Query:  --------------------------------------------NAEIELKKMQIMLRKMKKAKCKKMPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEA
                                                    N  +E K   I+       KCKKMPKEFWAQAVECAVYLSNR PTRSLWNKTP++A
Subjt:  --------------------------------------------NAEIELKKMQIMLRKMKKAKCKKMPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEA

Query:  WIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKILFFPEDRDEPSDIASP
        W GRKPSIG+LRVFGCMAYA+IPDQKRSKLDDKSEKYVFVGYDASSK YK YNPVTKKTIVSRDVVFDEEASWNWNDEPEDYK L FP++RDEPSDIASP
Subjt:  WIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKILFFPEDRDEPSDIASP

Query:  PTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDKWTIVMDEEIKVIKKNDTWELSTLPNGKKAI---
        PT PITPQQSTSSSSASSSE P  MRSL+DIYDETEELSQSFNNLTLFCLF D+EPLNFEEASQNDKW I MDEEIK IKKNDTWELSTLPNGKKA+   
Subjt:  PTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDKWTIVMDEEIKVIKKNDTWELSTLPNGKKAI---

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------GNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFI
                                                                 GNC SMF+DLK AMTQEFEMTDIGLMSYYL IEVKQSEEGIFI
Subjt:  ---------------------------------------------------------GNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFI

Query:  SQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLF
        SQERYTR+ILEKFNMMNSKSVATPIETGTKLSKHEEGDDVD SYFKSLV SLRYLTCT+PDILF VGLVS FMESPTTTHLKVAKRILRYLRGTLDY LF
Subjt:  SQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLF

Query:  YSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTSNRCVAPPSSSSCR----RALLHRPCIVDRSPAL
        YSSSKEFKLEGYCDSDWAGDT+DRKSTS YVFF+GNTAFTWSSKKQPIVTLST        +S  C     R LL    I+   P +
Subjt:  YSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTSNRCVAPPSSSSCR----RALLHRPCIVDRSPAL

A0A5A7T701 Copia protein1.1e-24650.42Show/hide
Query:  MKALLGSQDVWDIVSNGYEEPESDAALNQVEREVFQNTRKKYQKALTIIHQAINDNNFEKISGATTAYQAWKILENTYKGVDRVK---------------
        MKALLGSQDVWDIVSNGYEEPESDAALNQ +RE  QNTRKK QKALTIIHQAI+DNNFEKISGATTAYQAW+ILENTYKGVDRVK               
Subjt:  MKALLGSQDVWDIVSNGYEEPESDAALNQVEREVFQNTRKKYQKALTIIHQAINDNNFEKISGATTAYQAWKILENTYKGVDRVK---------------

Query:  -------------------------------------------------------KDLSKMSINRLMGSLQVHEEKLLKKNKQMTEQLFQSKLKLKDKED
                                                               KDLS MSI++LMGSLQ HEEKLLKKNKQMTEQLFQSKLKLKDKE 
Subjt:  -------------------------------------------------------KDLSKMSINRLMGSLQVHEEKLLKKNKQMTEQLFQSKLKLKDKED

Query:  NLEKGNRG------RGRGDFK-------------------------------------------------------------------------------
        +LEKGNRG      RGRGDFK                                                                               
Subjt:  NLEKGNRG------RGRGDFK-------------------------------------------------------------------------------

Query:  ------VEVEEAT-----------------VKENLIKVI-------------------QTQIHQE-----------------VEEDNIIRGQLGKDQIMT
              +E++E+                   ++N++K +                   +    QE                 V+E + + G   + + + 
Subjt:  ------VEVEEAT-----------------VKENLIKVI-------------------QTQIHQE-----------------VEEDNIIRGQLGKDQIMT

Query:  ----------------GGMTKDRLNVIIVIN--------SAIVLGNAEIELKKMQIMLRKMKKAKCKKMPKEFWAQAVECAVYLSNRCPTRSLWNKTPEE
                        G  T +      V N        S     N  +E K   I+       KCKKMPKEFWAQAVECAVYLSNR PTRSLWNKTP++
Subjt:  ----------------GGMTKDRLNVIIVIN--------SAIVLGNAEIELKKMQIMLRKMKKAKCKKMPKEFWAQAVECAVYLSNRCPTRSLWNKTPEE

Query:  AWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKILFFPEDRDEPSDIAS
        AW GRKPSIG+LRVFGCMAYA+IPDQK SKLDDKSEKYVFVGYDASSK YK YN V+KKTIVSRDVVFDEEASWNWNDEPEDYK L FP++ DEPSDIAS
Subjt:  AWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKILFFPEDRDEPSDIAS

Query:  PPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDKWTIVMDEEIKVIKKNDTWELSTLPNGKKAI--
        PPT PITPQQST SSSASSSE P  MRSL+DIYDETEEL Q FNNLTLFCLFGD+EPLNFEEASQNDKW I MDEEIK IKKNDTWELSTLPNGKKA+  
Subjt:  PPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDKWTIVMDEEIKVIKKNDTWELSTLPNGKKAI--

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------GNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIF
                                                                  GNC SMF+DLK AMTQEFEMTDIGLMSYYL IEVKQSEEGIF
Subjt:  ----------------------------------------------------------GNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIF

Query:  ISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRL
        ISQERYTR+ILEKFNMMNSKSVATPIETGTKLSKHEEGDDVD SYFKSLV SLRYLTCT+PDILF VGLVS FMESPTTTHLKV KRILRYLRGTLDY L
Subjt:  ISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRL

Query:  FYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTSNRCVAPPSSSSC
        FYSSSKEFKLEGYCDSDWAGDT+DRKSTS YVFF+GNTAFTWSSKKQPIVTLST        +S  C
Subjt:  FYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTSNRCVAPPSSSSC

A0A5A7V8Y7 Retrovirus-related Pol polyprotein from transposon TNT 1-949.3e-24346.63Show/hide
Query:  MKALLGSQDVWDIVSNGYEEPESDAALNQVEREVFQNTRKKYQKALTIIHQAINDNNFEKISGATTAYQAWKILENTYKGVDRVK---------------
        MKALLGSQDVWDIVSNGYEEPESDAALNQ +RE  QNTRKKYQKALTIIHQAI+DNNFEKISGATTAYQAW+ILENTYKGVDRVK               
Subjt:  MKALLGSQDVWDIVSNGYEEPESDAALNQVEREVFQNTRKKYQKALTIIHQAINDNNFEKISGATTAYQAWKILENTYKGVDRVK---------------

Query:  -------------------------------------------------------KDLSKMSINRLMGSLQVHEEKLLKKNKQMTEQLFQSKLKLKDKED
                                                               KDLS MSI++LMGSLQ HEEKLLKKNKQMTE+LFQSKLKLKDKE 
Subjt:  -------------------------------------------------------KDLSKMSINRLMGSLQVHEEKLLKKNKQMTEQLFQSKLKLKDKED

Query:  NLEKGNRG------RGRGDFK-----------------------------------------------------------------------------VE
        +LEKGNRG      RGRGDFK                                                                             VE
Subjt:  NLEKGNRG------RGRGDFK-----------------------------------------------------------------------------VE

Query:  VEEA----------------------------------------TVKENLIKV-----------------------------------------IQTQIH
        ++E+                                         +K N++ +                                         IQT + 
Subjt:  VEEA----------------------------------------TVKENLIKV-----------------------------------------IQTQIH

Query:  QEVE--------------------------EDNIIRG------------------------------------------QLGKDQIMTGGMTKDRLNVII
        + ++                            N+++G                                             K++    GM K R   ++
Subjt:  QEVE--------------------------EDNIIRG------------------------------------------QLGKDQIMTGGMTKDRLNVII

Query:  VINSAIVL------------------------------------GNAEIELKKMQIMLRKMKKAKCKKMPKEFWAQAVECAVYLSNRCPTRSLWNKTPEE
           S   +                                     N  +E K   I+       KCKKMPKEFWAQAVECAVYLSNR P RSLWNKTP++
Subjt:  VINSAIVL------------------------------------GNAEIELKKMQIMLRKMKKAKCKKMPKEFWAQAVECAVYLSNRCPTRSLWNKTPEE

Query:  AWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKILFFPEDRDEPSDIAS
        AWIGRKPSIG+LRVFGCMAYA+IPDQKRSKLDDKSEKYVFVGYDASSK YK YNPVTKKTIVSRDVVFDEEASWNWNDEPEDYK LFFP++RDEPSDIAS
Subjt:  AWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKILFFPEDRDEPSDIAS

Query:  PPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDKWTIVMDEEIKVIKKNDTWELSTLPNGKKAI--
        PPT PITPQQSTSSSSASSSE P  MRSL+DIYDETEELSQSFNNLTLFCLFGD+EPLNFEEASQNDKW I MDE+IK IKKNDTWELSTLPNGKKA+  
Subjt:  PPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDKWTIVMDEEIKVIKKNDTWELSTLPNGKKAI--

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------GNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIF
                                                                  GNC SMF+DLKNAMTQEFEMTDIGLMSYYL IEVKQSEEGIF
Subjt:  ----------------------------------------------------------GNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIF

Query:  ISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRL
        ISQERYTREILEKFNMMNSK VATPIETGTKLSKHEEGDDVD SYFKSLV SLRYLTCT+PDILF VGLVS FMESPTTTHLKVAKRILRYLRGTLDY L
Subjt:  ISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRL

Query:  FYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTSNRCVAPPSSSSCR----RALLHRPCIVDRSPAL
        FYSSSKEFKLEGYCDSDWAGDT+DRKSTS YVFF+GNTAFTWSSKKQPIVTLST        +S  C     R LL    I+   P +
Subjt:  FYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTSNRCVAPPSSSSCR----RALLHRPCIVDRSPAL

A0A5D3CJ69 Copia protein4.5e-24548.75Show/hide
Query:  MKALLGSQDVWDIVSNGYEEPESDAALNQVEREVFQNTRKKYQKALTIIHQAINDNNFEKISGATTAYQAWKILENTYKGVDRVK---------------
        MKALLGSQDVWDIVSNGYEEPESDAALNQ +RE  QNTRKK QKALTIIHQAI+DNNFEKIS ATTAYQAW+ILENTYKGVDRVK               
Subjt:  MKALLGSQDVWDIVSNGYEEPESDAALNQVEREVFQNTRKKYQKALTIIHQAINDNNFEKISGATTAYQAWKILENTYKGVDRVK---------------

Query:  -------------------------------------------------------KDLSKMSINRLMGSLQVHEEKLLKKNKQMTEQLFQSKLKLKDKED
                                                               KDLS MSI++LMGSLQ H+E L KKNKQMTEQLFQSKLKLKDKED
Subjt:  -------------------------------------------------------KDLSKMSINRLMGSLQVHEEKLLKKNKQMTEQLFQSKLKLKDKED

Query:  NLEKGNRG------RGRGDFK-------------------------------------------------------------------------------
         LEKGNRG      RGRGDFK                                                                               
Subjt:  NLEKGNRG------RGRGDFK-------------------------------------------------------------------------------

Query:  -----------------VEVEEAT--------------------------------------VKENLIKVI-------------------QTQIHQE---
                         VE++E+                                        ++N++K +                   +    QE   
Subjt:  -----------------VEVEEAT--------------------------------------VKENLIKVI-------------------QTQIHQE---

Query:  --------------VEEDNIIRGQLGKDQIMT----------------GGMTKDRLNVIIVINSA--------IVLGNAEIELKKMQIMLRKMKKAKCKK
                      V+E + + G   + + +                 G  T +      V N              N  +E K   I+       KCKK
Subjt:  --------------VEEDNIIRGQLGKDQIMT----------------GGMTKDRLNVIIVINSA--------IVLGNAEIELKKMQIMLRKMKKAKCKK

Query:  MPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEAWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVF
        MPKEFWAQAVECAVYLSNR PTRSLWNKTP++AW GRKPSIG+LRVFGCMAYA+IPDQKRSKLDDKSEKYVFVGYDASSK YK YNPVTKKTIVSRDVVF
Subjt:  MPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEAWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVF

Query:  DEEASWNWNDEPEDYKILFFPEDRDEPSDIASPPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDK
        DEEASWNWNDEPEDYK L FP++ DEPSDIASPPT PITPQQSTSSSSASSSE P  MRSL+DIYDETEELSQSFNNLTLFCLF D+EPLNFEEASQNDK
Subjt:  DEEASWNWNDEPEDYKILFFPEDRDEPSDIASPPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDK

Query:  WTIVMDEEIKVIKKNDTWELSTLPNGKKAI----------------------------------------------------------------------
        W I MDEEIK IKKNDTWELSTLPNGKKA+                                                                      
Subjt:  WTIVMDEEIKVIKKNDTWELSTLPNGKKAI----------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------------GNCTSMFKDL
                                                                                                  GNC SMF+DL
Subjt:  ------------------------------------------------------------------------------------------GNCTSMFKDL

Query:  KNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVG
        K AMTQEFEMTDIGLMSYYL IEVKQSEEGIFISQERYTREILEKFNMMNSK VATPIETGTKLSKHEEGDDVD SYFKSLV SLRYLTCT+PDILF VG
Subjt:  KNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVG

Query:  LVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTSNRCVAPPSSSSCR
        LVS FMESPTTTHLKVAKRILRYLRGTLDY LFYSSSKEFKLEGYCDSDWAGDT+DRKSTS YVFF+GNTAFTWSSKKQPIVTLST        +S  C 
Subjt:  LVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTSNRCVAPPSSSSCR

Query:  ----RALLHRPCIVDRSPAL
            R LL    I+   P +
Subjt:  ----RALLHRPCIVDRSPAL

A0A5D3E4P4 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+00100Show/hide
Query:  MPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEAWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVF
        MPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEAWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVF
Subjt:  MPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEAWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVF

Query:  DEEASWNWNDEPEDYKILFFPEDRDEPSDIASPPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDK
        DEEASWNWNDEPEDYKILFFPEDRDEPSDIASPPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDK
Subjt:  DEEASWNWNDEPEDYKILFFPEDRDEPSDIASPPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDK

Query:  WTIVMDEEIKVIKKNDTWELSTLPNGKKAIGNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVATPIET
        WTIVMDEEIKVIKKNDTWELSTLPNGKKAIGNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVATPIET
Subjt:  WTIVMDEEIKVIKKNDTWELSTLPNGKKAIGNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVATPIET

Query:  GTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKST
        GTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKST
Subjt:  GTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKST

Query:  SQYVFFVGNTAFTWSSKKQPIVTLSTSNRCVAPPSSSSCRRALLHRPCIVDRSPALRSSAGDRADHPRLISRPPFVELRQSSASLPEALLRTSEPSHPSK
        SQYVFFVGNTAFTWSSKKQPIVTLSTSNRCVAPPSSSSCRRALLHRPCIVDRSPALRSSAGDRADHPRLISRPPFVELRQSSASLPEALLRTSEPSHPSK
Subjt:  SQYVFFVGNTAFTWSSKKQPIVTLSTSNRCVAPPSSSSCRRALLHRPCIVDRSPALRSSAGDRADHPRLISRPPFVELRQSSASLPEALLRTSEPSHPSK

Query:  PVASALRPHLTSRFLGRAAEPRVSKPYLREPYPSASHTRVL
        PVASALRPHLTSRFLGRAAEPRVSKPYLREPYPSASHTRVL
Subjt:  PVASALRPHLTSRFLGRAAEPRVSKPYLREPYPSASHTRVL

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.3e-2533.5Show/hide
Query:  AIGNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRY-
        A G+ T M  + K  + ++F MTD+  + +++ I ++  E+ I++SQ  Y ++IL KFNM N  +V+TP+ +          +D + +  +SL+  L Y 
Subjt:  AIGNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRY-

Query:  LTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEF--KLEGYCDSDWAGDTDDRKSTSQYVFFVGN-TAFTWSSKKQPIVTL
        + CT+PD+   V ++S +     +   +  KR+LRYL+GT+D +L +  +  F  K+ GY DSDWAG   DRKST+ Y+F + +     W++K+Q  V  
Subjt:  LTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEF--KLEGYCDSDWAGDTDDRKSTSQYVFFVGN-TAFTWSSKKQPIVTL

Query:  STS
        S++
Subjt:  STS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.3e-4325.7Show/hide
Query:  NAEIELKKMQIMLRKMKKAKCKKMPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEAWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDA
        N   E     I+ +     +  K+PK FW +AV+ A YL NR P+  L  + PE  W  ++ S  +L+VFGC A+A++P ++R+KLDDKS   +F+GY  
Subjt:  NAEIELKKMQIMLRKMKKAKCKKMPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEAWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDA

Query:  SSKVYKFYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKILFFPEDRDEPSDIASPPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDE-TEEL-----
            Y+ ++PV KK I SRDVVF E       D  E  K    P     PS   +P     T  +ST+   +   E P  +    +  DE  EE+     
Subjt:  SSKVYKFYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKILFFPEDRDEPSDIASPPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDE-TEEL-----

Query:  -----------------SQSFNNLTLFCLFGDNEPLNFEEA---SQNDKWTIVMDEEIKVIKKNDTWELSTLPNGKKAI-----------GNC-------
                         S+ + +     +  D EP + +E     + ++    M EE++ ++KN T++L  LP GK+ +           G+C       
Subjt:  -----------------SQSFNNLTLFCLFGDNEPLNFEEA---SQNDKWTIVMDEEIKVIKKNDTWELSTLPNGKKAI-----------GNC-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------TSMFKDLKNAMTQEFEMTDIGLMSYYLSIEV--KQSEEGIFISQERYTREILEKFNMM
                                                    +   LK  +++ F+M D+G     L +++  +++   +++SQE+Y   +LE+FNM 
Subjt:  ------------------------------------------TSMFKDLKNAMTQEFEMTDIGLMSYYLSIEV--KQSEEGIFISQERYTREILEKFNMM

Query:  NSKSVATPIETGTKLSK-------HEEGDDVDLSYFKSLVRSLRY-LTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFK
        N+K V+TP+    KLSK        E+G+   + Y  S V SL Y + CT+PDI   VG+VS F+E+P   H +  K ILRYLRGT    L +  S    
Subjt:  NSKSVATPIETGTKLSK-------HEEGDDVDLSYFKSLVRSLRY-LTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFK

Query:  LEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTS
        L+GY D+D AGD D+RKS++ Y+F     A +W SK Q  V LST+
Subjt:  LEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTS

P92519 Uncharacterized mitochondrial protein AtMg008103.2e-3034.17Show/hide
Query:  GNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVAT--PIETGTKLSKHEEGDDVDLSYFKSLVRSLRYL
        G+  ++   L   ++  F M D+G + Y+L I++K    G+F+SQ +Y  +IL    M++ K ++T  P++  + +S  +  D  D   F+S+V +L+YL
Subjt:  GNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVAT--PIETGTKLSKHEEGDDVDLSYFKSLVRSLRYL

Query:  TCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTS
        T T+PDI + V +V   M  PT     + KR+LRY++GT+ + L+   + +  ++ +CDSDWAG T  R+ST+ +  F+G    +WS+K+QP V+ S++
Subjt:  TCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.8e-3338.28Show/hide
Query:  GNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTC
        GN  ++  +  + ++Q F + D   + Y+L IE K+   G+ +SQ RY  ++L + NM+ +K V TP+    KLS +      D + ++ +V SL+YL  
Subjt:  GNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTC

Query:  TQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTSN--
        T+PDI + V  +S FM  PT  HL+  KRILRYL GT ++ +F        L  Y D+DWAGD DD  ST+ Y+ ++G+   +WSSKKQ  V  S++   
Subjt:  TQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTSN--

Query:  -RCVAPPSS
         R VA  SS
Subjt:  -RCVAPPSS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.1e-1332.43Show/hide
Query:  MPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEAWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVF
        +PK +W  A   AVYL NR PT  L  ++P +   G  P+   LRVFGC  Y ++    + KLDDKS + VF+GY  +   Y   +  T +  +SR V F
Subjt:  MPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEAWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVF

Query:  DEEASWNWNDEP-EDYKILFFP--EDRDEPSDIASPPT-----LPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNN
        DE      N  P  +Y     P  E R E S + SP T      P+ P  S S    +++         R+    +  L  SF++
Subjt:  DEEASWNWNDEP-EDYKILFFP--EDRDEPSDIASPPT-----LPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.0e-3639.71Show/hide
Query:  GNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTC
        GN T + K   +A++Q F + +   + Y+L IE K+  +G+ +SQ RYT ++L + NM+ +K VATP+ T  KL+ H      D + ++ +V SL+YL  
Subjt:  GNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTC

Query:  TQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTSN--
        T+PD+ + V  +S +M  PT  H    KR+LRYL GT D+ +F        L  Y D+DWAGDTDD  ST+ Y+ ++G+   +WSSKKQ  V  S++   
Subjt:  TQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTSN--

Query:  -RCVAPPSS
         R VA  SS
Subjt:  -RCVAPPSS

Arabidopsis top hitse value%identityAlignment
AT1G48720.1 unknown protein8.9e-1243.28Show/hide
Query:  MKALLGSQDVWDIVSNGYEEPESDAALNQVEREVFQNTRKKYQKALTIIHQAINDNNFEKISGATTA
        MKA+LG+ DVW+IV  G+ EPE++ +L+Q +++  +++RK+ +KAL +I+Q ++++ FEK+  AT+A
Subjt:  MKALLGSQDVWDIVSNGYEEPESDAALNQVEREVFQNTRKKYQKALTIIHQAINDNNFEKISGATTA

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 87.2e-3033.86Show/hide
Query:  DLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFR
        +LK+ +   F++ D+G + Y+L +E+ +S  GI I Q +Y  ++L++  ++  K  + P++     S H  GD VD   ++ L+  L YL  T+ DI F 
Subjt:  DLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKSLVRSLRYLTCTQPDILFR

Query:  VGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTS
        V  +S F E+P   H +   +IL Y++GT+   LFYSS  E +L+ + D+ +    D R+ST+ Y  F+G +  +W SKKQ +V+ S++
Subjt:  VGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTS

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.2e-0536.49Show/hide
Query:  YDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDKWTIVMDEEIKVIKKNDTWELSTLPNGKKAIGNCTSMFK
        Y++   L  SF    L C+    EP  + EA +   W   MD+EI  ++   TWE+ TLP  KK IG C  ++K
Subjt:  YDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDKWTIVMDEEIKVIKKNDTWELSTLPNGKKAIGNCTSMFK

ATMG00240.1 Gag-Pol-related retrotransposon family protein1.7e-1035.8Show/hide
Query:  YLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFV
        YLT T+PD+ F V  +S F  +  T  ++   ++L Y++GT+   LFYS++ + +L+ + DSDWA   D R+S + +   V
Subjt:  YLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFV

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.1e-0631.71Show/hide
Query:  MLRKMKKAKCK-KMPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEAWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEK
        ++ K++   C+  +PK F A A   AV++ N+ P+ ++    P+E W    P+  YLR FGC+AY +  + K      K E+
Subjt:  MLRKMKKAKCK-KMPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEAWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEK

ATMG00810.1 DNA/RNA polymerases superfamily protein2.2e-3134.17Show/hide
Query:  GNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVAT--PIETGTKLSKHEEGDDVDLSYFKSLVRSLRYL
        G+  ++   L   ++  F M D+G + Y+L I++K    G+F+SQ +Y  +IL    M++ K ++T  P++  + +S  +  D  D   F+S+V +L+YL
Subjt:  GNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVAT--PIETGTKLSKHEEGDDVDLSYFKSLVRSLRYL

Query:  TCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTS
        T T+PDI + V +V   M  PT     + KR+LRY++GT+ + L+   + +  ++ +CDSDWAG T  R+ST+ +  F+G    +WS+K+QP V+ S++
Subjt:  TCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGCTCTACTTGGTTCACAAGATGTGTGGGACATTGTTAGTAATGGTTATGAAGAACCAGAAAGTGATGCAGCTTTGAATCAAGTTGAACGAGAAGTTTTTCAAAA
TACAAGAAAAAAATATCAAAAGGCTCTCACCATCATTCATCAAGCCATTAATGATAACAATTTTGAGAAAATTTCTGGAGCAACTACTGCATATCAAGCATGGAAAATTT
TGGAGAATACGTATAAAGGAGTAGATCGAGTCAAGAAGGATTTGAGTAAAATGTCCATTAATCGACTTATGGGTTCTTTACAAGTCCATGAAGAGAAGCTTCTTAAGAAG
AACAAGCAGATGACTGAGCAACTTTTTCAATCAAAGTTGAAATTAAAAGACAAGGAAGACAACCTAGAAAAAGGAAATCGAGGTCGAGGACGTGGTGATTTCAAAGTCGA
GGTTGAGGAAGCTACGGTCAAAGAAAATTTGATAAAAGTAATTCAAACTCAAATTCATCAAGAGGTAGAGGAAGACAACATTATTCGAGGTCAATTGGGGAAAGATCAAA
TAATGACAGGAGGTATGACAAAAGACAGGTTGAATGTTATAATTGTCATAAATTCGGCCATTGTTCTTGGGAATGCAGAAATAGAGTTGAAGAAAATGCAAATTATGCTG
AGAAAGATGAAGAAAGCGAAGTGCAAGAAGATGCCAAAAGAATTTTGGGCACAAGCTGTTGAGTGTGCAGTGTACTTGTCAAATCGTTGCCCTACTAGAAGCTTATGGAA
CAAAACTCCTGAAGAAGCATGGATAGGAAGAAAACCATCCATTGGTTATTTGAGAGTATTCGGATGCATGGCTTATGCGTATATACCTGATCAAAAGCGTAGTAAGCTTG
ATGATAAAAGTGAGAAATATGTTTTTGTTGGCTATGATGCAAGCTCAAAAGTCTACAAGTTTTATAATCCTGTTACAAAGAAGACGATCGTAAGCAGAGATGTTGTGTTT
GATGAAGAAGCATCATGGAATTGGAATGACGAACCAGAAGATTATAAAATTTTGTTTTTTCCTGAAGATCGTGATGAGCCTAGTGATATTGCTTCTCCACCAACATTGCC
AATCACTCCACAACAAAGCACATCTTCATCATCTGCAAGTTCAAGTGAAGCGCCTTGTAGCATGAGAAGCTTACGAGACATATATGATGAAACTGAAGAGTTAAGTCAAA
GTTTTAATAACCTTACTCTTTTTTGTCTATTTGGTGACAATGAACCTTTGAATTTTGAAGAAGCTTCACAAAATGACAAATGGACGATAGTTATGGATGAAGAGATAAAA
GTCATAAAAAAGAATGATACGTGGGAACTTTCTACTCTTCCAAATGGAAAGAAAGCAATAGGAAATTGTACAAGTATGTTTAAAGATCTCAAGAACGCGATGACCCAAGA
ATTCGAAATGACAGATATAGGGCTAATGTCATATTATCTTAGCATTGAAGTGAAGCAGTCAGAGGAAGGAATTTTCATCTCTCAAGAACGATATACTAGAGAAATTCTAG
AGAAGTTTAATATGATGAATTCTAAGTCTGTCGCAACTCCGATTGAAACTGGGACCAAACTGTCCAAACATGAAGAAGGAGATGATGTTGATCTTTCATATTTCAAAAGT
TTGGTTAGGAGTTTGAGATATTTGACTTGCACACAACCAGATATTCTTTTCAGAGTTGGATTGGTGAGTCTATTTATGGAATCTCCTACAACTACTCATTTGAAAGTGGC
AAAGAGAATTCTTCGTTACCTTAGAGGTACGCTTGACTATAGGTTGTTTTATTCTTCATCTAAAGAATTCAAGCTTGAAGGCTATTGTGATAGTGATTGGGCTGGAGATA
CTGATGATCGAAAGAGCACTAGTCAATATGTTTTCTTCGTTGGGAATACTGCATTTACATGGAGTTCTAAGAAGCAACCTATTGTGACATTATCCACTTCCAATCGCTGT
GTCGCACCGCCTTCTTCCAGTAGCTGTCGACGGGCCTTATTGCACCGTCCGTGCATCGTCGATCGAAGCCCAGCCTTAAGATCGTCTGCTGGTGATCGAGCCGATCATCC
CCGTCTCATCAGTCGTCCACCTTTTGTCGAGCTTCGTCAGTCGTCCGCTAGCCTACCCGAGGCACTTCTACGCACGTCGGAGCCAAGCCATCCTTCGAAGCCCGTAGCGT
CAGCCCTTCGTCCACACCTCACGAGTCGCTTCCTCGGTCGGGCCGCGGAGCCACGAGTGAGCAAGCCCTACCTGCGCGAGCCGTACCCATCCGCAAGCCACACACGTGTC
CTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAGCTCTACTTGGTTCACAAGATGTGTGGGACATTGTTAGTAATGGTTATGAAGAACCAGAAAGTGATGCAGCTTTGAATCAAGTTGAACGAGAAGTTTTTCAAAA
TACAAGAAAAAAATATCAAAAGGCTCTCACCATCATTCATCAAGCCATTAATGATAACAATTTTGAGAAAATTTCTGGAGCAACTACTGCATATCAAGCATGGAAAATTT
TGGAGAATACGTATAAAGGAGTAGATCGAGTCAAGAAGGATTTGAGTAAAATGTCCATTAATCGACTTATGGGTTCTTTACAAGTCCATGAAGAGAAGCTTCTTAAGAAG
AACAAGCAGATGACTGAGCAACTTTTTCAATCAAAGTTGAAATTAAAAGACAAGGAAGACAACCTAGAAAAAGGAAATCGAGGTCGAGGACGTGGTGATTTCAAAGTCGA
GGTTGAGGAAGCTACGGTCAAAGAAAATTTGATAAAAGTAATTCAAACTCAAATTCATCAAGAGGTAGAGGAAGACAACATTATTCGAGGTCAATTGGGGAAAGATCAAA
TAATGACAGGAGGTATGACAAAAGACAGGTTGAATGTTATAATTGTCATAAATTCGGCCATTGTTCTTGGGAATGCAGAAATAGAGTTGAAGAAAATGCAAATTATGCTG
AGAAAGATGAAGAAAGCGAAGTGCAAGAAGATGCCAAAAGAATTTTGGGCACAAGCTGTTGAGTGTGCAGTGTACTTGTCAAATCGTTGCCCTACTAGAAGCTTATGGAA
CAAAACTCCTGAAGAAGCATGGATAGGAAGAAAACCATCCATTGGTTATTTGAGAGTATTCGGATGCATGGCTTATGCGTATATACCTGATCAAAAGCGTAGTAAGCTTG
ATGATAAAAGTGAGAAATATGTTTTTGTTGGCTATGATGCAAGCTCAAAAGTCTACAAGTTTTATAATCCTGTTACAAAGAAGACGATCGTAAGCAGAGATGTTGTGTTT
GATGAAGAAGCATCATGGAATTGGAATGACGAACCAGAAGATTATAAAATTTTGTTTTTTCCTGAAGATCGTGATGAGCCTAGTGATATTGCTTCTCCACCAACATTGCC
AATCACTCCACAACAAAGCACATCTTCATCATCTGCAAGTTCAAGTGAAGCGCCTTGTAGCATGAGAAGCTTACGAGACATATATGATGAAACTGAAGAGTTAAGTCAAA
GTTTTAATAACCTTACTCTTTTTTGTCTATTTGGTGACAATGAACCTTTGAATTTTGAAGAAGCTTCACAAAATGACAAATGGACGATAGTTATGGATGAAGAGATAAAA
GTCATAAAAAAGAATGATACGTGGGAACTTTCTACTCTTCCAAATGGAAAGAAAGCAATAGGAAATTGTACAAGTATGTTTAAAGATCTCAAGAACGCGATGACCCAAGA
ATTCGAAATGACAGATATAGGGCTAATGTCATATTATCTTAGCATTGAAGTGAAGCAGTCAGAGGAAGGAATTTTCATCTCTCAAGAACGATATACTAGAGAAATTCTAG
AGAAGTTTAATATGATGAATTCTAAGTCTGTCGCAACTCCGATTGAAACTGGGACCAAACTGTCCAAACATGAAGAAGGAGATGATGTTGATCTTTCATATTTCAAAAGT
TTGGTTAGGAGTTTGAGATATTTGACTTGCACACAACCAGATATTCTTTTCAGAGTTGGATTGGTGAGTCTATTTATGGAATCTCCTACAACTACTCATTTGAAAGTGGC
AAAGAGAATTCTTCGTTACCTTAGAGGTACGCTTGACTATAGGTTGTTTTATTCTTCATCTAAAGAATTCAAGCTTGAAGGCTATTGTGATAGTGATTGGGCTGGAGATA
CTGATGATCGAAAGAGCACTAGTCAATATGTTTTCTTCGTTGGGAATACTGCATTTACATGGAGTTCTAAGAAGCAACCTATTGTGACATTATCCACTTCCAATCGCTGT
GTCGCACCGCCTTCTTCCAGTAGCTGTCGACGGGCCTTATTGCACCGTCCGTGCATCGTCGATCGAAGCCCAGCCTTAAGATCGTCTGCTGGTGATCGAGCCGATCATCC
CCGTCTCATCAGTCGTCCACCTTTTGTCGAGCTTCGTCAGTCGTCCGCTAGCCTACCCGAGGCACTTCTACGCACGTCGGAGCCAAGCCATCCTTCGAAGCCCGTAGCGT
CAGCCCTTCGTCCACACCTCACGAGTCGCTTCCTCGGTCGGGCCGCGGAGCCACGAGTGAGCAAGCCCTACCTGCGCGAGCCGTACCCATCCGCAAGCCACACACGTGTC
CTTTAG
Protein sequenceShow/hide protein sequence
MKALLGSQDVWDIVSNGYEEPESDAALNQVEREVFQNTRKKYQKALTIIHQAINDNNFEKISGATTAYQAWKILENTYKGVDRVKKDLSKMSINRLMGSLQVHEEKLLKK
NKQMTEQLFQSKLKLKDKEDNLEKGNRGRGRGDFKVEVEEATVKENLIKVIQTQIHQEVEEDNIIRGQLGKDQIMTGGMTKDRLNVIIVINSAIVLGNAEIELKKMQIML
RKMKKAKCKKMPKEFWAQAVECAVYLSNRCPTRSLWNKTPEEAWIGRKPSIGYLRVFGCMAYAYIPDQKRSKLDDKSEKYVFVGYDASSKVYKFYNPVTKKTIVSRDVVF
DEEASWNWNDEPEDYKILFFPEDRDEPSDIASPPTLPITPQQSTSSSSASSSEAPCSMRSLRDIYDETEELSQSFNNLTLFCLFGDNEPLNFEEASQNDKWTIVMDEEIK
VIKKNDTWELSTLPNGKKAIGNCTSMFKDLKNAMTQEFEMTDIGLMSYYLSIEVKQSEEGIFISQERYTREILEKFNMMNSKSVATPIETGTKLSKHEEGDDVDLSYFKS
LVRSLRYLTCTQPDILFRVGLVSLFMESPTTTHLKVAKRILRYLRGTLDYRLFYSSSKEFKLEGYCDSDWAGDTDDRKSTSQYVFFVGNTAFTWSSKKQPIVTLSTSNRC
VAPPSSSSCRRALLHRPCIVDRSPALRSSAGDRADHPRLISRPPFVELRQSSASLPEALLRTSEPSHPSKPVASALRPHLTSRFLGRAAEPRVSKPYLREPYPSASHTRV
L