; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG02G008780 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG02G008780
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionTy3/gypsy retrotransposon protein
Genome locationCG_Chr02:11987303..11992824
RNA-Seq ExpressionClCG02G008780
SyntenyClCG02G008780
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040609.1 DNA polymerase epsilon catalytic subunit A-like [Cucumis melo var. makuwa]9.6e-21138.61Show/hide
Query:  EELKKRMFDRYQTSKKGNLTRRFLAIKQEVNYANYQKH-----------SKLVLEGTIINRLNSKLKAKVESCNPMLME---------------------
        E+LK+RMF+ +++  +G+L  R + IKQ+  Y++Y K            ++ VL    +  L + L+A+V+S +P+ +E                     
Subjt:  EELKKRMFDRYQTSKKGNLTRRFLAIKQEVNYANYQKH-----------SKLVLEGTIINRLNSKLKAKVESCNPMLME---------------------

Query:  ------IEGLTAQSHKKGEGKTVWRKRPKEMVKFKKKGKKVYVSN--VTKNIHPTTS----------------------------ANPKRRENLWLFITG
               + + AQ+HK  +  T    +  +   F  K   + +    V K   P                                  K    L  FIT 
Subjt:  ------IEGLTAQSHKKGEGKTVWRKRPKEMVKFKKKGKKVYVSN--VTKNIHPTTS----------------------------ANPKRRENLWLFITG

Query:  KGQVIEEVDEFEGEEEIEETAE--------KVELALRSILGFSSPSTMKLKGKIGELELI----------------------------------------
        + +  E   E E E+E+ E AE          E++LR+ILGF+S  TMK++G I   E+I                                        
Subjt:  KGQVIEEVDEFEGEEEIEETAE--------KVELALRSILGFSSPSTMKLKGKIGELELI----------------------------------------

Query:  --------VTIQLPVVIVLENFFPLDLGHSDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLM
                V I+LP + ++ +F  ++LG  D++L M WL T G+M + W ++T+ F  G+S+V+LK D SLTKAE S+K +   WEEED+GFLIEFQ + 
Subjt:  --------VTIQLPVVIVLENFFPLDLGHSDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLM

Query:  LELKVEGKVDPLKLQEE-------------------------QRGIDHQIVMKEGQKPINVRPYKYGQTQKDEIERPIKEMM--------------PNFA
        +E   E + +  +  EE                         +R +DH+I++ EGQ PIN+RPYKYG  QK EIER + EM+              P   
Subjt:  LELKVEGKVDPLKLQEE-------------------------QRGIDHQIVMKEGQKPINVRPYKYGQTQKDEIERPIKEMM--------------PNFA

Query:  SDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTTFQSLMNQDIR
          K    WRFCVDYRK+NQ T++ KF I VIEELLDELHG+  FSKLDLRSGYHQI+MKE+DI K   RTHE HYEF+VMPF LTNA  TFQSLMNQ  R
Subjt:  SDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTTFQSLMNQDIR

Query:  FQQPECKRMV----------EWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIER
           P  +R V          +  I  +     G LGL GYYRRF K+Y  +A PL +LLQKN F W E AT AF+ LK+AM+++PVLALP FS  FIIE 
Subjt:  FQQPECKRMV----------EWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIER

Query:  DASGFGLGVVLMQEDRPIAYFSHCPQILVGAKGSSTSITKMASEAHGISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFTWENR
        DASG  +  +  ++     +     +IL+GA+G++ ++++       + L  E   L A  LLD ++V  EV +DEELQ II  LKEDP     + W+  
Subjt:  DASGFGLGVVLMQEDRPIAYFSHCPQILVGAKGSSTSITKMASEAHGISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFTWENR

Query:  WLLYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQ--------------SGLLQLLPISERIWDDISMDFIEGL
         L YKG LV+S    LIPSLL   HD V+GGHS  +RTYK ++ E++W  MK  V+ YV Q              +GLLQ LP+   I +D +MDFIEGL
Subjt:  WLLYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQ--------------SGLLQLLPISERIWDDISMDFIEGL

Query:  PKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETY
        PK+ G DSIMVVVD+L K  HF+ L H FSAKQVA  F++E++  HG P  IV+DRD+  LS+FW +LF   GT LK S A+    D QTERVNRCLETY
Subjt:  PKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETY

Query:  LRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD-----------------
        LRCFCNE+P +W + + WAE WYN TFH   KTTPF +VYGR  PPL+ YGD K +N+++++ L E+D V+ ALK+ L+  Q+                 
Subjt:  LRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD-----------------

Query:  -----------------------------------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPKMIIGTRWNSE
                                           + IG VAYRL LP  A IHNVF +SQLK K+G   +VQ + P LTE+ E+  + + ++G RWN++
Subjt:  -----------------------------------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPKMIIGTRWNSE

Query:  RGKEEVLVEW-------------ENQPKIFPEFHLEHK
         G  E L++W             E   + +P FHLE K
Subjt:  RGKEEVLVEW-------------ENQPKIFPEFHLEHK

KAA0066550.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]8.4e-20739.78Show/hide
Query:  EELKKRMFDRYQTSKKGNLTRRFLAIKQEVNYANYQKH-----------SKLVLEGTIINRLNSKLKAKVESCNPMLMEIEGLTAQSHKKGEGKTVWRKR
        E+LK+RMF+ +++  +G+L  R + IKQ+  Y++Y K            ++ VL    +  L + L+ +V+S +P+ +E          +  GK  W   
Subjt:  EELKKRMFDRYQTSKKGNLTRRFLAIKQEVNYANYQKH-----------SKLVLEGTIINRLNSKLKAKVESCNPMLMEIEGLTAQSHKKGEGKTVWRKR

Query:  PKEMVKFKKKGKKVYVSNVTKNIHPTTS--ANPKRRENLWLFITGKGQVIEEV-DEFEGEEEIEETAE--------------KVELAL----RSILGFSS
          + +    KG     + V K   P     ++ + R  L     G     EE+  E E E+E+ E AE                ELAL    R+  G + 
Subjt:  PKEMVKFKKKGKKVYVSNVTKNIHPTTS--ANPKRRENLWLFITGKGQVIEEV-DEFEGEEEIEETAE--------------KVELAL----RSILGFSS

Query:  PSTMKLKGKIGELELIVTIQLPVVIVLENFFPLDLGHSDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMMISNWEEEDQGF
             L+G    +   V I+LP + ++ +F  ++LG  D++L M W  T G+M + W ++T+ F  G+S+V+LK D SL KAE S+K +   WEEEDQGF
Subjt:  PSTMKLKGKIGELELIVTIQLPVVIVLENFFPLDLGHSDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMMISNWEEEDQGF

Query:  LIEFQSLMLELKVEGKVDPLKLQEE-------------------------QRGIDHQIVMKEGQKPINVRPYKYGQTQKDEIERPIKEMM----------
        LI FQ + +E   E + +  +  EE                         +R +DH+I++ EGQ PINVRPYKYG  QK EIER + EM+          
Subjt:  LIEFQSLMLELKVEGKVDPLKLQEE-------------------------QRGIDHQIVMKEGQKPINVRPYKYGQTQKDEIERPIKEMM----------

Query:  ----PNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTTFQ
            P     K    WRFCVDY K+NQ T++ KFPI VIEELLDELHG+  FSKLDLRSGYHQIRMKE+DI K A RTHE HYEF+VMPF LTN   TFQ
Subjt:  ----PNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTTFQ

Query:  SLMNQDIRFQQPECKRMV----------EWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKF
        SLMNQ  R   P  +R V          +  I  +     G LGL GYYRRF K+Y  +A PL +LLQKN F W E AT AF+ LK+AM+++PVLALP F
Subjt:  SLMNQDIRFQQPECKRMV----------EWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKF

Query:  SKTFIIERDASGFGLGVVLMQEDRPIAYFSH-----------------------------------------------------CPQI---LVGAKGSST
        S  FIIE DASG GLG VL Q +RPIAYFS                                                       PQ    L    G   
Subjt:  SKTFIIERDASGFGLGVVLMQEDRPIAYFSH-----------------------------------------------------CPQI---LVGAKGSST

Query:  SI-------TKMASEAHGISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFTWENRWLLYKGMLVISSSPNLIPSLLQMHHDLVM
         I        K A     + L  E   L A  LLD ++V  EV +DEELQ II  LKEDP     + W+   L YKG LV+S    LIPSLL   HD V+
Subjt:  SI-------TKMASEAHGISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFTWENRWLLYKGMLVISSSPNLIPSLLQMHHDLVM

Query:  GGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQ--------------SGLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLF
        GGH   +RTYK ++ E++W  MK  V+ YV Q              +GLLQ LP+   I +D +MDFIEGLPK+ G+DSIMVVVD+L K  HF+ L H F
Subjt:  GGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQ--------------SGLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLF

Query:  SAKQVAGVFIKEVVRHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHG
        SAKQVA  F++E++  HG    IV+DRD+  LS+FW +LF   GT LK S A+    D QTERVNRCLETYLRCFCNE+P +W + + WAE WYN TFH 
Subjt:  SAKQVAGVFIKEVVRHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHG

Query:  AIKTTPFCIVYGR-EPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD----------------------------------------------
        + KTTPF +VYGR  PPL+ YGD K +N+++++ L E+D V+ ALK+ L+ AQ+                                              
Subjt:  AIKTTPFCIVYGR-EPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD----------------------------------------------

Query:  ------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPKMIIGTRWNSERGKEEVLVEWEN-------------QPKI
              + IG VAYRL LP  A IHNVF +SQLK K+G    VQ + P LTE+ E+  +P+ ++G RWN++ G  E L++W+                + 
Subjt:  ------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPKMIIGTRWNSERGKEEVLVEWEN-------------QPKI

Query:  FPEFHLEHK
        +P FHLE K
Subjt:  FPEFHLEHK

KAE8637561.1 hypothetical protein CSA_017659 [Cucumis sativus]3.5e-20540.99Show/hide
Query:  VTIQLPVVIVLENFFPLDLGHSDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLMLELKVEGK
        V ++L  V+V+ +F  ++LG+ D+IL M WL T G M + W +LT++F  G  K++LK D SL +AE  +K +   WE EDQGFL+EFQ   ++  VE K
Subjt:  VTIQLPVVIVLENFFPLDLGHSDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLMLELKVEGK

Query:  ------------------------VDPLKLQEEQRGIDHQIVMKEGQKPINVRPYKYGQTQKDEIERPIKEMM--------------PNFASDKERREWR
                                 DP KL   +R IDH+I++   Q+PINVRPYKYG  QK+EIE+ + EM+              P     K+   WR
Subjt:  ------------------------VDPLKLQEEQRGIDHQIVMKEGQKPINVRPYKYGQTQKDEIERPIKEMM--------------PNFASDKERREWR

Query:  FCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTTFQSLMNQ------------
        FCVDYRK+NQ TIS KFPI VIEELLDELHG+T FSKLD++S YHQIRM+E+D+ K A RTHE HYEF+VMPF LTNA  TFQSLMNQ            
Subjt:  FCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTTFQSLMNQ------------

Query:  -------------------------------------------------------DIRFQQPECKRMVEWSIPRNISELRGVLGLKGYYRRFFKDYEVLA
                                                                ++  + + K MV+W  P++++ LRG LGL GYYRRF K Y  +A
Subjt:  -------------------------------------------------------DIRFQQPECKRMVEWSIPRNISELRGVLGLKGYYRRFFKDYEVLA

Query:  VPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIERDASGFGLGVVLMQEDRPIAYFSHCPQILVGAK--------------------
         PL +LLQKN+FVW+E AT AF+KLK AM TIPVLALP +   F+IE DASG GLG VL Q   PIA+FS    I   AK                    
Subjt:  VPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIERDASGFGLGVVLMQEDRPIAYFSHCPQILVGAK--------------------

Query:  --GSSTSI-----------------------------------------TKMASEAHGISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDP
          G   +I                                          K A     +   +E+  L+   ++D E++ +EV QDEELQ  I +LK++P
Subjt:  --GSSTSI-----------------------------------------TKMASEAHGISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDP

Query:  SSVLKFTWENRWLLYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQ--------------SGLLQLLPISERIW
          + KF+WEN  LLYK  +V+S + ++IP+LL   HD ++GGHS  +RTYK +S ELYW+ MK  ++ YV Q              +G+L  +P  + I 
Subjt:  SSVLKFTWENRWLLYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQ--------------SGLLQLLPISERIW

Query:  DDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQ
        ++ SMDFIEGLPK+ G + IMVVVD+L KY +F+T+ H F+AKQVA  FI+++V  HG PK I+SDRD+  +S+FW +LF   GT LK S A+    D Q
Subjt:  DDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQ

Query:  TERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGRE-PPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD------
        TERVN+CLETYLRCFCNE+P++W E++SWAE WYN TFH +I++ PF IVYGR+ PP+++YG +K  ND ++  L E+D  L ALK+ L  AQ+      
Subjt:  TERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGRE-PPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD------

Query:  ----------------------------------------------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQP
                                                      + IG VAYRLKLP  A IH+VF ISQLK K+GK  VVQ   P LT D E+   P
Subjt:  ----------------------------------------------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQP

Query:  KMIIGTRWNSERGKEEVLVEWENQPKI-------------FPEFHLEHK
        + ++G RWN + G  E LV+W+N P+              FP FHLE K
Subjt:  KMIIGTRWNSERGKEEVLVEWENQPKI-------------FPEFHLEHK

TYJ99303.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.8e-20439Show/hide
Query:  PTTSANPKRRENLWLFITGKGQVIEEVDEFEGEEEIEETAE--------KVELALRSILGFSSPSTMKLKGKIGELELIVTI------------------
        P      K +  L  FI  + +  EE D    EE IEET E        +  + L+++   SS  TMKLKG I + E++V I                  
Subjt:  PTTSANPKRRENLWLFITGKGQVIEEVDEFEGEEEIEETAE--------KVELALRSILGFSSPSTMKLKGKIGELELIVTI------------------

Query:  ------------------------------QLPVVIVLENFFPLDLGHSDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMM
                                      +L  + ++ +F  ++LG  D +L M WL T G M + W +LT++F  G  ++VLK D SL +AE S++ +
Subjt:  ------------------------------QLPVVIVLENFFPLDLGHSDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMM

Query:  ISNWEEEDQGFLIEFQSLMLELKVEGKVDPLKLQEE-------------------------QRGIDHQIVMKEGQKPINVRPYKYGQTQKDEIERPIKEM
           W+EEDQGFL+E+ ++ +E     K D  +  +E                         +R IDH+I+    QKPINVRPYKYG  QK EIE+ + EM
Subjt:  ISNWEEEDQGFLIEFQSLMLELKVEGKVDPLKLQEE-------------------------QRGIDHQIVMKEGQKPINVRPYKYGQTQKDEIERPIKEM

Query:  M--------------PNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMP
        +              P     K+   WRFCVDYRK+NQATIS KFPI VIEELLDEL+G+  FSKLDL+SGYHQIRMKE+DI K A RTHE HYEF+VMP
Subjt:  M--------------PNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMP

Query:  FDLTNALTTFQSLMNQ-------------------------------------------------------------------DIRFQQPECKRMVEWSI
        F LTNA  TFQSLMNQ                                                                    +   + + + MV W  
Subjt:  FDLTNALTTFQSLMNQ-------------------------------------------------------------------DIRFQQPECKRMVEWSI

Query:  PRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIERDASGFGLGVVLMQEDRPIAYFS--
        P +++ELRG L L GYYRRF K Y  +A PL +LLQKNAF WNE A  AF +LK AM TIPVLALP +S  F IE DASG GLG VL Q   PIA++S  
Subjt:  PRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIERDASGFGLGVVLMQEDRPIAYFS--

Query:  -----------------------HCPQILVGAKGSSTS--------------------------------------ITKMASEAHGISLEVELQILSAPA
                                    L+G K S  S                                      + K A     +  ++EL  ++   
Subjt:  -----------------------HCPQILVGAKGSSTS--------------------------------------ITKMASEAHGISLEVELQILSAPA

Query:  LLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFTWENRWLLYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQ
        ++D  +V  EV +DE LQ I+AKLK++     KF W+N  LLYKG LV+  + +LIP LL   HD V+GGHS  +RTYK +S ELYW+ MK  ++ YV Q
Subjt:  LLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFTWENRWLLYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQ

Query:  --------------SGLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHHGFPKLIVSDRDQSLL
                      +G+LQ LPI ++I +D +MDFIEGLPK+ G + IMVVVD+L KY +F+TL H FSAKQVA  FI ++VR HG P  I+SDRD+  L
Subjt:  --------------SGLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHHGFPKLIVSDRDQSLL

Query:  SNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGRE-PPLIAYGDKKISNDTLD
        SNFW +LF   GT+LK S A+    D QTERVN+CLETYLRCFCNE+P +W +++ WAE WYN TFH + KTTPF  VYGR  PPL++YGDKK +N+ ++
Subjt:  SNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGRE-PPLIAYGDKKISNDTLD

Query:  EQLVEKDRVLVALKDQLVRAQD----------------------------------------------------QRIGAVAYRLKLPSSARIHNVFQISQ
          L E+D  L ALK+ L  AQ+                                                    + IGAVAYRL LP  A IHNVF ISQ
Subjt:  EQLVEKDRVLVALKDQLVRAQD----------------------------------------------------QRIGAVAYRLKLPSSARIHNVFQISQ

Query:  LKKKVGKDKVVQPMLPELTEDAEMNGQPKMIIGTRWNSERGKEEVLVEWE-------------NQPKIFPEFHLEHK
        LK K+G  +VVQ   P LTE+ E+  QP+ ++G RWN E G  E L++W+                + FP FHLE K
Subjt:  LKKKVGKDKVVQPMLPELTEDAEMNGQPKMIIGTRWNSERGKEEVLVEWE-------------NQPKIFPEFHLEHK

TYK05651.1 Retrotransposable element Tf2 [Cucumis melo var. makuwa]1.3e-21038.53Show/hide
Query:  EELKKRMFDRYQTSKKGNLTRRFLAIKQEVNYANYQKH-----------SKLVLEGTIINRLNSKLKAKVESCNPMLME---------------------
        E+LK+RMF+ +++  +G+L  R + IKQ+  Y++Y K            ++ VL    +  L + L+A+V+S +P+ +E                     
Subjt:  EELKKRMFDRYQTSKKGNLTRRFLAIKQEVNYANYQKH-----------SKLVLEGTIINRLNSKLKAKVESCNPMLME---------------------

Query:  ------IEGLTAQSHKKGEGKTVWRKRPKEMVKFKKKGKKVYVSN--VTKNIHPTTS----------------------------ANPKRRENLWLFITG
               + + AQ+HK  +  T    +  +   F  K   + +    V K   P                                  K    L  FIT 
Subjt:  ------IEGLTAQSHKKGEGKTVWRKRPKEMVKFKKKGKKVYVSN--VTKNIHPTTS----------------------------ANPKRRENLWLFITG

Query:  KGQVIEEVDEFEGEEEIEETAE--------KVELALRSILGFSSPSTMKLKGKIGELELI----------------------------------------
        + +  E   E E E+E+ E AE          E++LR+ILGF+S  TMK++G I   E+I                                        
Subjt:  KGQVIEEVDEFEGEEEIEETAE--------KVELALRSILGFSSPSTMKLKGKIGELELI----------------------------------------

Query:  --------VTIQLPVVIVLENFFPLDLGHSDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLM
                V I+LP + ++ +F  ++LG  D++L M WL T G+M + W ++T+ F  G+S+V+LK D SLTKAE S+K +   WEEED+GFLIEFQ + 
Subjt:  --------VTIQLPVVIVLENFFPLDLGHSDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLM

Query:  LELKVEGKVDPLKLQEE-------------------------QRGIDHQIVMKEGQKPINVRPYKYGQTQKDEIERPIKEMM--------------PNFA
        +E   E + +  +  EE                         +R +DH+I++ EGQ PIN+RPYKYG  QK EIER + EM+              P   
Subjt:  LELKVEGKVDPLKLQEE-------------------------QRGIDHQIVMKEGQKPINVRPYKYGQTQKDEIERPIKEMM--------------PNFA

Query:  SDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTTFQSLMNQDIR
          K    WRFCVDYRK+NQ T++ KF I VIEELLDELHG+  FSKLDLRSGYHQI+MKE+DI K   RTHE HYEF+VMPF LTNA  TFQSLMNQ  R
Subjt:  SDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTTFQSLMNQDIR

Query:  FQQPECKRMV----------EWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIER
           P  +R V          +  I  +     G LGL GYYRRF K+Y  +A PL +LLQKN F W E AT AF+ LK+AM+++PVLALP FS  FIIE 
Subjt:  FQQPECKRMV----------EWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIER

Query:  DASGFGLGVVLMQEDRPIAYFSHCPQILVGAKGSSTSITKMASEAHGISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFTWENR
        DASG  +  +  ++     +     +IL+GA+G++ ++++       + L  E   L A  LLD ++V  EV +DEELQ II  LKEDP     + W+  
Subjt:  DASGFGLGVVLMQEDRPIAYFSHCPQILVGAKGSSTSITKMASEAHGISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFTWENR

Query:  WLLYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQ--------------SGLLQLLPISERIWDDISMDFIEGL
         L YKG LV+S    LIPSLL   HD V+GGH   +RTYK ++ E++W  MK  V+ YV Q              +GLLQ LP+   I +D +MDFIEGL
Subjt:  WLLYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQ--------------SGLLQLLPISERIWDDISMDFIEGL

Query:  PKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETY
        PK+ G+DSIMVVVD+L K  HF+ L H FSAKQVA  F++E++  HG P  IV+DRD+  LS+FW +LF   GT LK S A+    D QTERVNRCLETY
Subjt:  PKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETY

Query:  LRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD-----------------
        LRCFCNE+P +W + + WAE WYN TFH   KTTPF +VYGR  PPL+ YGD K +N+++++ L E+D V+ ALK+ L+  Q+                 
Subjt:  LRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD-----------------

Query:  -----------------------------------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPKMIIGTRWNSE
                                           + IG VAYRL LP  A IHNVF +SQLK K+G   +VQ + P LTE+ E+  + + ++G RWN++
Subjt:  -----------------------------------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPKMIIGTRWNSE

Query:  RGKEEVLVEW-------------ENQPKIFPEFHLEHK
         G  E L++W             E   + +P FHLE K
Subjt:  RGKEEVLVEW-------------ENQPKIFPEFHLEHK

TrEMBL top hitse value%identityAlignment
A0A5A7TH70 DNA polymerase epsilon catalytic subunit4.7e-21138.61Show/hide
Query:  EELKKRMFDRYQTSKKGNLTRRFLAIKQEVNYANYQKH-----------SKLVLEGTIINRLNSKLKAKVESCNPMLME---------------------
        E+LK+RMF+ +++  +G+L  R + IKQ+  Y++Y K            ++ VL    +  L + L+A+V+S +P+ +E                     
Subjt:  EELKKRMFDRYQTSKKGNLTRRFLAIKQEVNYANYQKH-----------SKLVLEGTIINRLNSKLKAKVESCNPMLME---------------------

Query:  ------IEGLTAQSHKKGEGKTVWRKRPKEMVKFKKKGKKVYVSN--VTKNIHPTTS----------------------------ANPKRRENLWLFITG
               + + AQ+HK  +  T    +  +   F  K   + +    V K   P                                  K    L  FIT 
Subjt:  ------IEGLTAQSHKKGEGKTVWRKRPKEMVKFKKKGKKVYVSN--VTKNIHPTTS----------------------------ANPKRRENLWLFITG

Query:  KGQVIEEVDEFEGEEEIEETAE--------KVELALRSILGFSSPSTMKLKGKIGELELI----------------------------------------
        + +  E   E E E+E+ E AE          E++LR+ILGF+S  TMK++G I   E+I                                        
Subjt:  KGQVIEEVDEFEGEEEIEETAE--------KVELALRSILGFSSPSTMKLKGKIGELELI----------------------------------------

Query:  --------VTIQLPVVIVLENFFPLDLGHSDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLM
                V I+LP + ++ +F  ++LG  D++L M WL T G+M + W ++T+ F  G+S+V+LK D SLTKAE S+K +   WEEED+GFLIEFQ + 
Subjt:  --------VTIQLPVVIVLENFFPLDLGHSDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLM

Query:  LELKVEGKVDPLKLQEE-------------------------QRGIDHQIVMKEGQKPINVRPYKYGQTQKDEIERPIKEMM--------------PNFA
        +E   E + +  +  EE                         +R +DH+I++ EGQ PIN+RPYKYG  QK EIER + EM+              P   
Subjt:  LELKVEGKVDPLKLQEE-------------------------QRGIDHQIVMKEGQKPINVRPYKYGQTQKDEIERPIKEMM--------------PNFA

Query:  SDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTTFQSLMNQDIR
          K    WRFCVDYRK+NQ T++ KF I VIEELLDELHG+  FSKLDLRSGYHQI+MKE+DI K   RTHE HYEF+VMPF LTNA  TFQSLMNQ  R
Subjt:  SDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTTFQSLMNQDIR

Query:  FQQPECKRMV----------EWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIER
           P  +R V          +  I  +     G LGL GYYRRF K+Y  +A PL +LLQKN F W E AT AF+ LK+AM+++PVLALP FS  FIIE 
Subjt:  FQQPECKRMV----------EWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIER

Query:  DASGFGLGVVLMQEDRPIAYFSHCPQILVGAKGSSTSITKMASEAHGISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFTWENR
        DASG  +  +  ++     +     +IL+GA+G++ ++++       + L  E   L A  LLD ++V  EV +DEELQ II  LKEDP     + W+  
Subjt:  DASGFGLGVVLMQEDRPIAYFSHCPQILVGAKGSSTSITKMASEAHGISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFTWENR

Query:  WLLYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQ--------------SGLLQLLPISERIWDDISMDFIEGL
         L YKG LV+S    LIPSLL   HD V+GGHS  +RTYK ++ E++W  MK  V+ YV Q              +GLLQ LP+   I +D +MDFIEGL
Subjt:  WLLYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQ--------------SGLLQLLPISERIWDDISMDFIEGL

Query:  PKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETY
        PK+ G DSIMVVVD+L K  HF+ L H FSAKQVA  F++E++  HG P  IV+DRD+  LS+FW +LF   GT LK S A+    D QTERVNRCLETY
Subjt:  PKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETY

Query:  LRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD-----------------
        LRCFCNE+P +W + + WAE WYN TFH   KTTPF +VYGR  PPL+ YGD K +N+++++ L E+D V+ ALK+ L+  Q+                 
Subjt:  LRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD-----------------

Query:  -----------------------------------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPKMIIGTRWNSE
                                           + IG VAYRL LP  A IHNVF +SQLK K+G   +VQ + P LTE+ E+  + + ++G RWN++
Subjt:  -----------------------------------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPKMIIGTRWNSE

Query:  RGKEEVLVEW-------------ENQPKIFPEFHLEHK
         G  E L++W             E   + +P FHLE K
Subjt:  RGKEEVLVEW-------------ENQPKIFPEFHLEHK

A0A5A7UYM1 Ty3/gypsy retrotransposon protein1.1e-20438.91Show/hide
Query:  PTTSANPKRRENLWLFITGKGQVIEEVDEFEGEEEIEETAE--------KVELALRSILGFSSPSTMKLKGKIGELELIVTI------------------
        P      K +  L  FI  + +  EE D    EE IEET E        +  + L+++   SS  TMKLKG I + E++V I                  
Subjt:  PTTSANPKRRENLWLFITGKGQVIEEVDEFEGEEEIEETAE--------KVELALRSILGFSSPSTMKLKGKIGELELIVTI------------------

Query:  ------------------------------QLPVVIVLENFFPLDLGHSDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMM
                                      +L  + ++ +F  ++LG  D +L M WL T G M ++W +LT++F  G  ++VLK D SL +AE S++ +
Subjt:  ------------------------------QLPVVIVLENFFPLDLGHSDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMM

Query:  ISNWEEEDQGFLIEFQSLMLELKVEGKVDPLKLQEE-------------------------QRGIDHQIVMKEGQKPINVRPYKYGQTQKDEIERPIKEM
           W+EEDQGFL+E+ ++ +E     K D  +  +E                         +R IDH+I+    QKPINVRPYKYG  QK EIE+ + EM
Subjt:  ISNWEEEDQGFLIEFQSLMLELKVEGKVDPLKLQEE-------------------------QRGIDHQIVMKEGQKPINVRPYKYGQTQKDEIERPIKEM

Query:  M--------------PNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMP
        +              P     K+   WRFCVDYRK+NQATIS KFPI VIEELLDEL+G+  FSKLDL+SGYHQIRMKE+D+ K A RTHE HYEF+VMP
Subjt:  M--------------PNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMP

Query:  FDLTNALTTFQSLMNQ-------------------------------------------------------------------DIRFQQPECKRMVEWSI
        F LTNA  TFQSLMNQ                                                                    +   + + + MV W  
Subjt:  FDLTNALTTFQSLMNQ-------------------------------------------------------------------DIRFQQPECKRMVEWSI

Query:  PRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIERDASGFGLGVVLMQEDRPIAYFS--
        P +++ELRG LGL GYYRRF K Y  +A PL +LLQKNAF WNE A  AF +LK AM TIPVLALP +S  F IE DASG GLG VL Q   PIA++S  
Subjt:  PRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIERDASGFGLGVVLMQEDRPIAYFS--

Query:  -----------------------HCPQILVGAKGSSTS--------------------------------------ITKMASEAHGISLEVELQILSAPA
                                    L+G K S  S                                      + K A     +  ++EL  ++   
Subjt:  -----------------------HCPQILVGAKGSSTS--------------------------------------ITKMASEAHGISLEVELQILSAPA

Query:  LLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFTWENRWLLYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQ
        ++D  +V  EV +DE LQ I+AKLK++     KF W+N  LLYKG LV+  + +LIP LL   HD V+GGHS  +RTYK +S ELYW+ MK  ++ YV Q
Subjt:  LLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFTWENRWLLYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQ

Query:  --------------SGLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHHGFPKLIVSDRDQSLL
                      +G+LQ LPI ++I +D +MDFIEGLPK+ G + IMVVVD+L KY +F+TL H FSAKQVA  FI ++VR HG P  I+SDRD+  L
Subjt:  --------------SGLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHHGFPKLIVSDRDQSLL

Query:  SNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGRE-PPLIAYGDKKISNDTLD
        SNFW +LF   GT+LK S A+    D QTERVN+CLETYLRCFCNE+P +W +++ WAE WYN  FH + KTTPF  VYGR  PPL++YGDKK  N+ ++
Subjt:  SNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGRE-PPLIAYGDKKISNDTLD

Query:  EQLVEKDRVLVALKDQLVRAQD----------------------------------------------------QRIGAVAYRLKLPSSARIHNVFQISQ
          L E+D  L ALK+ L  AQ+                                                    + IGAVAYRL LP  A IHNVF ISQ
Subjt:  EQLVEKDRVLVALKDQLVRAQD----------------------------------------------------QRIGAVAYRLKLPSSARIHNVFQISQ

Query:  LKKKVGKDKVVQPMLPELTEDAEMNGQPKMIIGTRWNSERGKEEVLVEWE-------------NQPKIFPEFHLEHK
        LK K+G  +VVQ   P LTE+ E+  QP+ ++G RWN E G  E L++W+                + FP FHLE K
Subjt:  LKKKVGKDKVVQPMLPELTEDAEMNGQPKMIIGTRWNSERGKEEVLVEWE-------------NQPKIFPEFHLEHK

A0A5A7VHA7 Ty3/gypsy retrotransposon protein4.1e-20739.78Show/hide
Query:  EELKKRMFDRYQTSKKGNLTRRFLAIKQEVNYANYQKH-----------SKLVLEGTIINRLNSKLKAKVESCNPMLMEIEGLTAQSHKKGEGKTVWRKR
        E+LK+RMF+ +++  +G+L  R + IKQ+  Y++Y K            ++ VL    +  L + L+ +V+S +P+ +E          +  GK  W   
Subjt:  EELKKRMFDRYQTSKKGNLTRRFLAIKQEVNYANYQKH-----------SKLVLEGTIINRLNSKLKAKVESCNPMLMEIEGLTAQSHKKGEGKTVWRKR

Query:  PKEMVKFKKKGKKVYVSNVTKNIHPTTS--ANPKRRENLWLFITGKGQVIEEV-DEFEGEEEIEETAE--------------KVELAL----RSILGFSS
          + +    KG     + V K   P     ++ + R  L     G     EE+  E E E+E+ E AE                ELAL    R+  G + 
Subjt:  PKEMVKFKKKGKKVYVSNVTKNIHPTTS--ANPKRRENLWLFITGKGQVIEEV-DEFEGEEEIEETAE--------------KVELAL----RSILGFSS

Query:  PSTMKLKGKIGELELIVTIQLPVVIVLENFFPLDLGHSDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMMISNWEEEDQGF
             L+G    +   V I+LP + ++ +F  ++LG  D++L M W  T G+M + W ++T+ F  G+S+V+LK D SL KAE S+K +   WEEEDQGF
Subjt:  PSTMKLKGKIGELELIVTIQLPVVIVLENFFPLDLGHSDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMMISNWEEEDQGF

Query:  LIEFQSLMLELKVEGKVDPLKLQEE-------------------------QRGIDHQIVMKEGQKPINVRPYKYGQTQKDEIERPIKEMM----------
        LI FQ + +E   E + +  +  EE                         +R +DH+I++ EGQ PINVRPYKYG  QK EIER + EM+          
Subjt:  LIEFQSLMLELKVEGKVDPLKLQEE-------------------------QRGIDHQIVMKEGQKPINVRPYKYGQTQKDEIERPIKEMM----------

Query:  ----PNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTTFQ
            P     K    WRFCVDY K+NQ T++ KFPI VIEELLDELHG+  FSKLDLRSGYHQIRMKE+DI K A RTHE HYEF+VMPF LTN   TFQ
Subjt:  ----PNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTTFQ

Query:  SLMNQDIRFQQPECKRMV----------EWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKF
        SLMNQ  R   P  +R V          +  I  +     G LGL GYYRRF K+Y  +A PL +LLQKN F W E AT AF+ LK+AM+++PVLALP F
Subjt:  SLMNQDIRFQQPECKRMV----------EWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKF

Query:  SKTFIIERDASGFGLGVVLMQEDRPIAYFSH-----------------------------------------------------CPQI---LVGAKGSST
        S  FIIE DASG GLG VL Q +RPIAYFS                                                       PQ    L    G   
Subjt:  SKTFIIERDASGFGLGVVLMQEDRPIAYFSH-----------------------------------------------------CPQI---LVGAKGSST

Query:  SI-------TKMASEAHGISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFTWENRWLLYKGMLVISSSPNLIPSLLQMHHDLVM
         I        K A     + L  E   L A  LLD ++V  EV +DEELQ II  LKEDP     + W+   L YKG LV+S    LIPSLL   HD V+
Subjt:  SI-------TKMASEAHGISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFTWENRWLLYKGMLVISSSPNLIPSLLQMHHDLVM

Query:  GGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQ--------------SGLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLF
        GGH   +RTYK ++ E++W  MK  V+ YV Q              +GLLQ LP+   I +D +MDFIEGLPK+ G+DSIMVVVD+L K  HF+ L H F
Subjt:  GGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQ--------------SGLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLF

Query:  SAKQVAGVFIKEVVRHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHG
        SAKQVA  F++E++  HG    IV+DRD+  LS+FW +LF   GT LK S A+    D QTERVNRCLETYLRCFCNE+P +W + + WAE WYN TFH 
Subjt:  SAKQVAGVFIKEVVRHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHG

Query:  AIKTTPFCIVYGR-EPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD----------------------------------------------
        + KTTPF +VYGR  PPL+ YGD K +N+++++ L E+D V+ ALK+ L+ AQ+                                              
Subjt:  AIKTTPFCIVYGR-EPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD----------------------------------------------

Query:  ------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPKMIIGTRWNSERGKEEVLVEWEN-------------QPKI
              + IG VAYRL LP  A IHNVF +SQLK K+G    VQ + P LTE+ E+  +P+ ++G RWN++ G  E L++W+                + 
Subjt:  ------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPKMIIGTRWNSERGKEEVLVEWEN-------------QPKI

Query:  FPEFHLEHK
        +P FHLE K
Subjt:  FPEFHLEHK

A0A5D3BJ50 Ty3/gypsy retrotransposon protein8.5e-20539Show/hide
Query:  PTTSANPKRRENLWLFITGKGQVIEEVDEFEGEEEIEETAE--------KVELALRSILGFSSPSTMKLKGKIGELELIVTI------------------
        P      K +  L  FI  + +  EE D    EE IEET E        +  + L+++   SS  TMKLKG I + E++V I                  
Subjt:  PTTSANPKRRENLWLFITGKGQVIEEVDEFEGEEEIEETAE--------KVELALRSILGFSSPSTMKLKGKIGELELIVTI------------------

Query:  ------------------------------QLPVVIVLENFFPLDLGHSDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMM
                                      +L  + ++ +F  ++LG  D +L M WL T G M + W +LT++F  G  ++VLK D SL +AE S++ +
Subjt:  ------------------------------QLPVVIVLENFFPLDLGHSDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMM

Query:  ISNWEEEDQGFLIEFQSLMLELKVEGKVDPLKLQEE-------------------------QRGIDHQIVMKEGQKPINVRPYKYGQTQKDEIERPIKEM
           W+EEDQGFL+E+ ++ +E     K D  +  +E                         +R IDH+I+    QKPINVRPYKYG  QK EIE+ + EM
Subjt:  ISNWEEEDQGFLIEFQSLMLELKVEGKVDPLKLQEE-------------------------QRGIDHQIVMKEGQKPINVRPYKYGQTQKDEIERPIKEM

Query:  M--------------PNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMP
        +              P     K+   WRFCVDYRK+NQATIS KFPI VIEELLDEL+G+  FSKLDL+SGYHQIRMKE+DI K A RTHE HYEF+VMP
Subjt:  M--------------PNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMP

Query:  FDLTNALTTFQSLMNQ-------------------------------------------------------------------DIRFQQPECKRMVEWSI
        F LTNA  TFQSLMNQ                                                                    +   + + + MV W  
Subjt:  FDLTNALTTFQSLMNQ-------------------------------------------------------------------DIRFQQPECKRMVEWSI

Query:  PRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIERDASGFGLGVVLMQEDRPIAYFS--
        P +++ELRG L L GYYRRF K Y  +A PL +LLQKNAF WNE A  AF +LK AM TIPVLALP +S  F IE DASG GLG VL Q   PIA++S  
Subjt:  PRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIERDASGFGLGVVLMQEDRPIAYFS--

Query:  -----------------------HCPQILVGAKGSSTS--------------------------------------ITKMASEAHGISLEVELQILSAPA
                                    L+G K S  S                                      + K A     +  ++EL  ++   
Subjt:  -----------------------HCPQILVGAKGSSTS--------------------------------------ITKMASEAHGISLEVELQILSAPA

Query:  LLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFTWENRWLLYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQ
        ++D  +V  EV +DE LQ I+AKLK++     KF W+N  LLYKG LV+  + +LIP LL   HD V+GGHS  +RTYK +S ELYW+ MK  ++ YV Q
Subjt:  LLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFTWENRWLLYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQ

Query:  --------------SGLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHHGFPKLIVSDRDQSLL
                      +G+LQ LPI ++I +D +MDFIEGLPK+ G + IMVVVD+L KY +F+TL H FSAKQVA  FI ++VR HG P  I+SDRD+  L
Subjt:  --------------SGLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHHGFPKLIVSDRDQSLL

Query:  SNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGRE-PPLIAYGDKKISNDTLD
        SNFW +LF   GT+LK S A+    D QTERVN+CLETYLRCFCNE+P +W +++ WAE WYN TFH + KTTPF  VYGR  PPL++YGDKK +N+ ++
Subjt:  SNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGRE-PPLIAYGDKKISNDTLD

Query:  EQLVEKDRVLVALKDQLVRAQD----------------------------------------------------QRIGAVAYRLKLPSSARIHNVFQISQ
          L E+D  L ALK+ L  AQ+                                                    + IGAVAYRL LP  A IHNVF ISQ
Subjt:  EQLVEKDRVLVALKDQLVRAQD----------------------------------------------------QRIGAVAYRLKLPSSARIHNVFQISQ

Query:  LKKKVGKDKVVQPMLPELTEDAEMNGQPKMIIGTRWNSERGKEEVLVEWE-------------NQPKIFPEFHLEHK
        LK K+G  +VVQ   P LTE+ E+  QP+ ++G RWN E G  E L++W+                + FP FHLE K
Subjt:  LKKKVGKDKVVQPMLPELTEDAEMNGQPKMIIGTRWNSERGKEEVLVEWE-------------NQPKIFPEFHLEHK

A0A5D3C142 Retrotransposable element Tf26.1e-21138.53Show/hide
Query:  EELKKRMFDRYQTSKKGNLTRRFLAIKQEVNYANYQKH-----------SKLVLEGTIINRLNSKLKAKVESCNPMLME---------------------
        E+LK+RMF+ +++  +G+L  R + IKQ+  Y++Y K            ++ VL    +  L + L+A+V+S +P+ +E                     
Subjt:  EELKKRMFDRYQTSKKGNLTRRFLAIKQEVNYANYQKH-----------SKLVLEGTIINRLNSKLKAKVESCNPMLME---------------------

Query:  ------IEGLTAQSHKKGEGKTVWRKRPKEMVKFKKKGKKVYVSN--VTKNIHPTTS----------------------------ANPKRRENLWLFITG
               + + AQ+HK  +  T    +  +   F  K   + +    V K   P                                  K    L  FIT 
Subjt:  ------IEGLTAQSHKKGEGKTVWRKRPKEMVKFKKKGKKVYVSN--VTKNIHPTTS----------------------------ANPKRRENLWLFITG

Query:  KGQVIEEVDEFEGEEEIEETAE--------KVELALRSILGFSSPSTMKLKGKIGELELI----------------------------------------
        + +  E   E E E+E+ E AE          E++LR+ILGF+S  TMK++G I   E+I                                        
Subjt:  KGQVIEEVDEFEGEEEIEETAE--------KVELALRSILGFSSPSTMKLKGKIGELELI----------------------------------------

Query:  --------VTIQLPVVIVLENFFPLDLGHSDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLM
                V I+LP + ++ +F  ++LG  D++L M WL T G+M + W ++T+ F  G+S+V+LK D SLTKAE S+K +   WEEED+GFLIEFQ + 
Subjt:  --------VTIQLPVVIVLENFFPLDLGHSDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLM

Query:  LELKVEGKVDPLKLQEE-------------------------QRGIDHQIVMKEGQKPINVRPYKYGQTQKDEIERPIKEMM--------------PNFA
        +E   E + +  +  EE                         +R +DH+I++ EGQ PIN+RPYKYG  QK EIER + EM+              P   
Subjt:  LELKVEGKVDPLKLQEE-------------------------QRGIDHQIVMKEGQKPINVRPYKYGQTQKDEIERPIKEMM--------------PNFA

Query:  SDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTTFQSLMNQDIR
          K    WRFCVDYRK+NQ T++ KF I VIEELLDELHG+  FSKLDLRSGYHQI+MKE+DI K   RTHE HYEF+VMPF LTNA  TFQSLMNQ  R
Subjt:  SDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTTFQSLMNQDIR

Query:  FQQPECKRMV----------EWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIER
           P  +R V          +  I  +     G LGL GYYRRF K+Y  +A PL +LLQKN F W E AT AF+ LK+AM+++PVLALP FS  FIIE 
Subjt:  FQQPECKRMV----------EWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIER

Query:  DASGFGLGVVLMQEDRPIAYFSHCPQILVGAKGSSTSITKMASEAHGISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFTWENR
        DASG  +  +  ++     +     +IL+GA+G++ ++++       + L  E   L A  LLD ++V  EV +DEELQ II  LKEDP     + W+  
Subjt:  DASGFGLGVVLMQEDRPIAYFSHCPQILVGAKGSSTSITKMASEAHGISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFTWENR

Query:  WLLYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQ--------------SGLLQLLPISERIWDDISMDFIEGL
         L YKG LV+S    LIPSLL   HD V+GGH   +RTYK ++ E++W  MK  V+ YV Q              +GLLQ LP+   I +D +MDFIEGL
Subjt:  WLLYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQ--------------SGLLQLLPISERIWDDISMDFIEGL

Query:  PKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETY
        PK+ G+DSIMVVVD+L K  HF+ L H FSAKQVA  F++E++  HG P  IV+DRD+  LS+FW +LF   GT LK S A+    D QTERVNRCLETY
Subjt:  PKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETY

Query:  LRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD-----------------
        LRCFCNE+P +W + + WAE WYN TFH   KTTPF +VYGR  PPL+ YGD K +N+++++ L E+D V+ ALK+ L+  Q+                 
Subjt:  LRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD-----------------

Query:  -----------------------------------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPKMIIGTRWNSE
                                           + IG VAYRL LP  A IHNVF +SQLK K+G   +VQ + P LTE+ E+  + + ++G RWN++
Subjt:  -----------------------------------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPKMIIGTRWNSE

Query:  RGKEEVLVEW-------------ENQPKIFPEFHLEHK
         G  E L++W             E   + +P FHLE K
Subjt:  RGKEEVLVEW-------------ENQPKIFPEFHLEHK

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.2e-5424.55Show/hide
Query:  DASLTKAEVSMKMMISNWEEEDQ--GFLIEFQSLMLELKVEGKVDPLKLQEEQRGIDHQIVMKEGQKPINVRPYKYG----QTQKDEIERPIK-------
        + S +K  +S    +SN  +E +      EF+ +  E   E      KL +  +G++ ++ + +    + +R Y       Q   DEI + +K       
Subjt:  DASLTKAEVSMKMMISNWEEEDQ--GFLIEFQSLMLELKVEGKVDPLKLQEEQRGIDHQIVMKEGQKPINVRPYKYG----QTQKDEIERPIK-------

Query:  ---EMMPNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTT
              P     K+    R  VDY+ +N+      +P+ +IE+LL ++ GST F+KLDL+S YH IR+++ D  K+A R     +E++VMP+ ++ A   
Subjt:  ---EMMPNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTT

Query:  F-----------------------------------------QSLMNQDIRFQQPECK--------------------------RMVEWSIPRNISELRG
        F                                         Q L N ++   Q +C+                          ++++W  P+N  ELR 
Subjt:  F-----------------------------------------QSLMNQDIRFQQPECK--------------------------RMVEWSIPRNISELRG

Query:  VLGLKGYYRRFFKDYEVLAVPLAQLLQKNA-FVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIERDASGFGLGVVLMQEDRPIAYFSHCPQILVGAK
         LG   Y R+F      L  PL  LL+K+  + W  T T+A + +K+ +V+ PVL    FSK  ++E DAS   +G VL Q+     Y+   P     AK
Subjt:  VLGLKGYYRRFFKDYEVLAVPLAQLLQKNA-FVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIERDASGFGLGVVLMQEDRPIAYFSHCPQILVGAK

Query:  GSSTSITKMASEAHGISL---------EVELQILSAPALLDSEMVLREVGQDEE--------LQLI-------------------------------IAK
         S   +    S+   +++          +E  I     L D   ++  +  + E         QL                                I K
Subjt:  GSSTSITKMASEAHGISL---------EVELQILSAPALLDSEMVLREVGQDEE--------LQLI-------------------------------IAK

Query:  LKEDPS----------------SVLKFTWENRWL---------------LYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLS-------NEL
          ED S                 V ++T + + L               L  G+L+ S    L+P+  Q+   ++   H      +  +           
Subjt:  LKEDPS----------------SVLKFTWENRWL---------------LYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLS-------NEL

Query:  YWKVMKKRVQDYVAQS--------------GLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHH
         WK ++K++Q+YV                 G LQ +P SER W+ +SMDFI  LP+S G++++ VVVD+  K    V  +   +A+Q A +F + V+ + 
Subjt:  YWKVMKKRVQDYVAQS--------------GLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHH

Query:  GFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPL
        G PK I++D D    S  W     +   V+K S+ Y    D QTER N+ +E  LRC C+  P  W + +S  +  YN   H A + TPF IV+ R  P 
Subjt:  GFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPL

Query:  IAYGDKKISNDTLDEQLVEKDRVLVALKDQL
        ++  +    +D  DE   E  +V   +K+ L
Subjt:  IAYGDKKISNDTLDEQLVEKDRVLVALKDQL

P0CT35 Transposon Tf2-2 polyprotein1.2e-5424.55Show/hide
Query:  DASLTKAEVSMKMMISNWEEEDQ--GFLIEFQSLMLELKVEGKVDPLKLQEEQRGIDHQIVMKEGQKPINVRPYKYG----QTQKDEIERPIK-------
        + S +K  +S    +SN  +E +      EF+ +  E   E      KL +  +G++ ++ + +    + +R Y       Q   DEI + +K       
Subjt:  DASLTKAEVSMKMMISNWEEEDQ--GFLIEFQSLMLELKVEGKVDPLKLQEEQRGIDHQIVMKEGQKPINVRPYKYG----QTQKDEIERPIK-------

Query:  ---EMMPNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTT
              P     K+    R  VDY+ +N+      +P+ +IE+LL ++ GST F+KLDL+S YH IR+++ D  K+A R     +E++VMP+ ++ A   
Subjt:  ---EMMPNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTT

Query:  F-----------------------------------------QSLMNQDIRFQQPECK--------------------------RMVEWSIPRNISELRG
        F                                         Q L N ++   Q +C+                          ++++W  P+N  ELR 
Subjt:  F-----------------------------------------QSLMNQDIRFQQPECK--------------------------RMVEWSIPRNISELRG

Query:  VLGLKGYYRRFFKDYEVLAVPLAQLLQKNA-FVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIERDASGFGLGVVLMQEDRPIAYFSHCPQILVGAK
         LG   Y R+F      L  PL  LL+K+  + W  T T+A + +K+ +V+ PVL    FSK  ++E DAS   +G VL Q+     Y+   P     AK
Subjt:  VLGLKGYYRRFFKDYEVLAVPLAQLLQKNA-FVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIERDASGFGLGVVLMQEDRPIAYFSHCPQILVGAK

Query:  GSSTSITKMASEAHGISL---------EVELQILSAPALLDSEMVLREVGQDEE--------LQLI-------------------------------IAK
         S   +    S+   +++          +E  I     L D   ++  +  + E         QL                                I K
Subjt:  GSSTSITKMASEAHGISL---------EVELQILSAPALLDSEMVLREVGQDEE--------LQLI-------------------------------IAK

Query:  LKEDPS----------------SVLKFTWENRWL---------------LYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLS-------NEL
          ED S                 V ++T + + L               L  G+L+ S    L+P+  Q+   ++   H      +  +           
Subjt:  LKEDPS----------------SVLKFTWENRWL---------------LYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLS-------NEL

Query:  YWKVMKKRVQDYVAQS--------------GLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHH
         WK ++K++Q+YV                 G LQ +P SER W+ +SMDFI  LP+S G++++ VVVD+  K    V  +   +A+Q A +F + V+ + 
Subjt:  YWKVMKKRVQDYVAQS--------------GLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHH

Query:  GFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPL
        G PK I++D D    S  W     +   V+K S+ Y    D QTER N+ +E  LRC C+  P  W + +S  +  YN   H A + TPF IV+ R  P 
Subjt:  GFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPL

Query:  IAYGDKKISNDTLDEQLVEKDRVLVALKDQL
        ++  +    +D  DE   E  +V   +K+ L
Subjt:  IAYGDKKISNDTLDEQLVEKDRVLVALKDQL

P0CT36 Transposon Tf2-3 polyprotein1.2e-5424.55Show/hide
Query:  DASLTKAEVSMKMMISNWEEEDQ--GFLIEFQSLMLELKVEGKVDPLKLQEEQRGIDHQIVMKEGQKPINVRPYKYG----QTQKDEIERPIK-------
        + S +K  +S    +SN  +E +      EF+ +  E   E      KL +  +G++ ++ + +    + +R Y       Q   DEI + +K       
Subjt:  DASLTKAEVSMKMMISNWEEEDQ--GFLIEFQSLMLELKVEGKVDPLKLQEEQRGIDHQIVMKEGQKPINVRPYKYG----QTQKDEIERPIK-------

Query:  ---EMMPNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTT
              P     K+    R  VDY+ +N+      +P+ +IE+LL ++ GST F+KLDL+S YH IR+++ D  K+A R     +E++VMP+ ++ A   
Subjt:  ---EMMPNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTT

Query:  F-----------------------------------------QSLMNQDIRFQQPECK--------------------------RMVEWSIPRNISELRG
        F                                         Q L N ++   Q +C+                          ++++W  P+N  ELR 
Subjt:  F-----------------------------------------QSLMNQDIRFQQPECK--------------------------RMVEWSIPRNISELRG

Query:  VLGLKGYYRRFFKDYEVLAVPLAQLLQKNA-FVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIERDASGFGLGVVLMQEDRPIAYFSHCPQILVGAK
         LG   Y R+F      L  PL  LL+K+  + W  T T+A + +K+ +V+ PVL    FSK  ++E DAS   +G VL Q+     Y+   P     AK
Subjt:  VLGLKGYYRRFFKDYEVLAVPLAQLLQKNA-FVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIERDASGFGLGVVLMQEDRPIAYFSHCPQILVGAK

Query:  GSSTSITKMASEAHGISL---------EVELQILSAPALLDSEMVLREVGQDEE--------LQLI-------------------------------IAK
         S   +    S+   +++          +E  I     L D   ++  +  + E         QL                                I K
Subjt:  GSSTSITKMASEAHGISL---------EVELQILSAPALLDSEMVLREVGQDEE--------LQLI-------------------------------IAK

Query:  LKEDPS----------------SVLKFTWENRWL---------------LYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLS-------NEL
          ED S                 V ++T + + L               L  G+L+ S    L+P+  Q+   ++   H      +  +           
Subjt:  LKEDPS----------------SVLKFTWENRWL---------------LYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLS-------NEL

Query:  YWKVMKKRVQDYVAQS--------------GLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHH
         WK ++K++Q+YV                 G LQ +P SER W+ +SMDFI  LP+S G++++ VVVD+  K    V  +   +A+Q A +F + V+ + 
Subjt:  YWKVMKKRVQDYVAQS--------------GLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHH

Query:  GFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPL
        G PK I++D D    S  W     +   V+K S+ Y    D QTER N+ +E  LRC C+  P  W + +S  +  YN   H A + TPF IV+ R  P 
Subjt:  GFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPL

Query:  IAYGDKKISNDTLDEQLVEKDRVLVALKDQL
        ++  +    +D  DE   E  +V   +K+ L
Subjt:  IAYGDKKISNDTLDEQLVEKDRVLVALKDQL

P0CT37 Transposon Tf2-4 polyprotein1.2e-5424.55Show/hide
Query:  DASLTKAEVSMKMMISNWEEEDQ--GFLIEFQSLMLELKVEGKVDPLKLQEEQRGIDHQIVMKEGQKPINVRPYKYG----QTQKDEIERPIK-------
        + S +K  +S    +SN  +E +      EF+ +  E   E      KL +  +G++ ++ + +    + +R Y       Q   DEI + +K       
Subjt:  DASLTKAEVSMKMMISNWEEEDQ--GFLIEFQSLMLELKVEGKVDPLKLQEEQRGIDHQIVMKEGQKPINVRPYKYG----QTQKDEIERPIK-------

Query:  ---EMMPNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTT
              P     K+    R  VDY+ +N+      +P+ +IE+LL ++ GST F+KLDL+S YH IR+++ D  K+A R     +E++VMP+ ++ A   
Subjt:  ---EMMPNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTT

Query:  F-----------------------------------------QSLMNQDIRFQQPECK--------------------------RMVEWSIPRNISELRG
        F                                         Q L N ++   Q +C+                          ++++W  P+N  ELR 
Subjt:  F-----------------------------------------QSLMNQDIRFQQPECK--------------------------RMVEWSIPRNISELRG

Query:  VLGLKGYYRRFFKDYEVLAVPLAQLLQKNA-FVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIERDASGFGLGVVLMQEDRPIAYFSHCPQILVGAK
         LG   Y R+F      L  PL  LL+K+  + W  T T+A + +K+ +V+ PVL    FSK  ++E DAS   +G VL Q+     Y+   P     AK
Subjt:  VLGLKGYYRRFFKDYEVLAVPLAQLLQKNA-FVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIERDASGFGLGVVLMQEDRPIAYFSHCPQILVGAK

Query:  GSSTSITKMASEAHGISL---------EVELQILSAPALLDSEMVLREVGQDEE--------LQLI-------------------------------IAK
         S   +    S+   +++          +E  I     L D   ++  +  + E         QL                                I K
Subjt:  GSSTSITKMASEAHGISL---------EVELQILSAPALLDSEMVLREVGQDEE--------LQLI-------------------------------IAK

Query:  LKEDPS----------------SVLKFTWENRWL---------------LYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLS-------NEL
          ED S                 V ++T + + L               L  G+L+ S    L+P+  Q+   ++   H      +  +           
Subjt:  LKEDPS----------------SVLKFTWENRWL---------------LYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLS-------NEL

Query:  YWKVMKKRVQDYVAQS--------------GLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHH
         WK ++K++Q+YV                 G LQ +P SER W+ +SMDFI  LP+S G++++ VVVD+  K    V  +   +A+Q A +F + V+ + 
Subjt:  YWKVMKKRVQDYVAQS--------------GLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHH

Query:  GFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPL
        G PK I++D D    S  W     +   V+K S+ Y    D QTER N+ +E  LRC C+  P  W + +S  +  YN   H A + TPF IV+ R  P 
Subjt:  GFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPL

Query:  IAYGDKKISNDTLDEQLVEKDRVLVALKDQL
        ++  +    +D  DE   E  +V   +K+ L
Subjt:  IAYGDKKISNDTLDEQLVEKDRVLVALKDQL

P0CT41 Transposon Tf2-12 polyprotein1.2e-5424.55Show/hide
Query:  DASLTKAEVSMKMMISNWEEEDQ--GFLIEFQSLMLELKVEGKVDPLKLQEEQRGIDHQIVMKEGQKPINVRPYKYG----QTQKDEIERPIK-------
        + S +K  +S    +SN  +E +      EF+ +  E   E      KL +  +G++ ++ + +    + +R Y       Q   DEI + +K       
Subjt:  DASLTKAEVSMKMMISNWEEEDQ--GFLIEFQSLMLELKVEGKVDPLKLQEEQRGIDHQIVMKEGQKPINVRPYKYG----QTQKDEIERPIK-------

Query:  ---EMMPNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTT
              P     K+    R  VDY+ +N+      +P+ +IE+LL ++ GST F+KLDL+S YH IR+++ D  K+A R     +E++VMP+ ++ A   
Subjt:  ---EMMPNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTT

Query:  F-----------------------------------------QSLMNQDIRFQQPECK--------------------------RMVEWSIPRNISELRG
        F                                         Q L N ++   Q +C+                          ++++W  P+N  ELR 
Subjt:  F-----------------------------------------QSLMNQDIRFQQPECK--------------------------RMVEWSIPRNISELRG

Query:  VLGLKGYYRRFFKDYEVLAVPLAQLLQKNA-FVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIERDASGFGLGVVLMQEDRPIAYFSHCPQILVGAK
         LG   Y R+F      L  PL  LL+K+  + W  T T+A + +K+ +V+ PVL    FSK  ++E DAS   +G VL Q+     Y+   P     AK
Subjt:  VLGLKGYYRRFFKDYEVLAVPLAQLLQKNA-FVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIERDASGFGLGVVLMQEDRPIAYFSHCPQILVGAK

Query:  GSSTSITKMASEAHGISL---------EVELQILSAPALLDSEMVLREVGQDEE--------LQLI-------------------------------IAK
         S   +    S+   +++          +E  I     L D   ++  +  + E         QL                                I K
Subjt:  GSSTSITKMASEAHGISL---------EVELQILSAPALLDSEMVLREVGQDEE--------LQLI-------------------------------IAK

Query:  LKEDPS----------------SVLKFTWENRWL---------------LYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLS-------NEL
          ED S                 V ++T + + L               L  G+L+ S    L+P+  Q+   ++   H      +  +           
Subjt:  LKEDPS----------------SVLKFTWENRWL---------------LYKGMLVISSSPNLIPSLLQMHHDLVMGGHSRCIRTYKWLS-------NEL

Query:  YWKVMKKRVQDYVAQS--------------GLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHH
         WK ++K++Q+YV                 G LQ +P SER W+ +SMDFI  LP+S G++++ VVVD+  K    V  +   +A+Q A +F + V+ + 
Subjt:  YWKVMKKRVQDYVAQS--------------GLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHH

Query:  GFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPL
        G PK I++D D    S  W     +   V+K S+ Y    D QTER N+ +E  LRC C+  P  W + +S  +  YN   H A + TPF IV+ R  P 
Subjt:  GFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPL

Query:  IAYGDKKISNDTLDEQLVEKDRVLVALKDQL
        ++  +    +D  DE   E  +V   +K+ L
Subjt:  IAYGDKKISNDTLDEQLVEKDRVLVALKDQL

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein3.5e-1753.16Show/hide
Query:  MVEWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKFSKTFI
        MV W  P+N +ELRG LGL GYYRRF K+Y  +  PL +LL+KN+  W E A  AF  LK A+ T+PVLALP     F+
Subjt:  MVEWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKFSKTFI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCACAATCAGAGTCCTATCAAGTTCTAGAGGAATTGAAGAAAAGAATGTTTGATCGATATCAAACGTCCAAGAAAGGAAACTTGACACGACGTTTTCTGGCTAT
CAAACAGGAGGTAAACTATGCTAATTACCAGAAGCATTCCAAATTAGTGTTGGAAGGAACGATTATCAACAGATTGAATTCAAAATTGAAGGCGAAGGTGGAAAGTTGTA
ATCCGATGTTAATGGAGATCGAGGGGCTCACAGCCCAGAGTCACAAGAAGGGGGAAGGAAAGACAGTTTGGAGAAAACGACCCAAGGAAATGGTGAAATTCAAGAAAAAA
GGAAAAAAGGTTTATGTTTCAAATGTGACGAAAAATATTCACCCGACCACCAGTGCAAATCCAAAGAGAAGAGAGAACTTGTGGCTTTTTATCACCGGAAAAGGCCAGGT
GATTGAAGAAGTGGACGAGTTCGAAGGTGAAGAAGAAATAGAAGAAACTGCAGAGAAGGTGGAGCTAGCGCTTCGATCGATCCTAGGATTTTCCTCTCCCAGCACCATGA
AATTGAAGGGGAAAATCGGGGAACTAGAGTTAATTGTGACTATTCAATTACCAGTGGTCATTGTGCTCGAAAATTTTTTTCCATTGGACTTGGGTCATTCAGACATCATT
TTGGTAATGGCTTGGCTATATACCATGGGGTACATGGGAGTTGATTGGGCGAACCTCACCGTTACATTCACCCAAGGGAATTCGAAAGTAGTACTGAAAGAAGATGCTTC
CCTGACGAAGGCCGAAGTGTCTATGAAAATGATGATCAGTAACTGGGAGGAGGAAGATCAAGGCTTCTTAATAGAATTTCAAAGCCTGATGCTGGAATTGAAAGTAGAAG
GAAAAGTTGACCCACTGAAGTTGCAAGAAGAACAAAGAGGGATTGATCACCAGATTGTTATGAAAGAAGGACAGAAACCAATCAATGTACGCCCGTATAAATATGGACAA
ACCCAAAAAGATGAAATTGAAAGGCCAATCAAAGAAATGATGCCCAATTTCGCTAGTGACAAAGAAAGACGGGAGTGGCGGTTTTGTGTCGACTATAGAAAGGTGAATCA
AGCTACTATCTCGAAAAAATTCCCAATTCTCGTTATTGAAGAATTACTTGATGAATTACATGGCTCGACATATTTTTCCAAACTGGATTTACGCTCGGGGTATCATCAGA
TAAGAATGAAGGAAGATGATATACCAAAAATCGCTTTGCGAACTCATGAGAGGCATTATGAATTTGTTGTGATGCCATTCGACCTCACTAACGCGCTGACCACATTTCAA
TCGTTAATGAATCAAGACATTAGGTTTCAGCAACCAGAGTGCAAGCGGATGGTAGAATGGTCGATTCCAAGAAATATATCCGAATTGCGAGGAGTTTTAGGTCTGAAGGG
ATATTATCGAAGGTTTTTCAAGGATTATGAGGTTCTTGCTGTCCCGTTGGCACAATTGTTGCAGAAGAATGCTTTTGTATGGAACGAGACAGCTACTGAAGCATTCGATA
AACTGAAAAGGGCCATGGTAACCATTCCCGTGCTAGCACTTCCAAAATTTTCCAAAACATTTATAATTGAAAGGGATGCGTCTGGTTTTGGGTTGGGGGTAGTTTTGATG
CAAGAAGATCGACCGATTGCATATTTTAGTCATTGCCCTCAAATTCTTGTTGGAGCAAAGGGAAGTTCAACCTCAATTACAAAAATGGCTAGTGAAGCTCATGGGATCTC
ACTCGAAGTTGAACTGCAGATACTTTCGGCACCTGCATTGCTGGATAGTGAAATGGTCTTGAGGGAGGTTGGGCAAGACGAAGAACTACAGTTGATCATTGCAAAACTGA
AGGAGGACCCGAGTAGTGTGCTGAAATTCACTTGGGAGAATCGGTGGCTATTGTACAAAGGCATGTTAGTTATTTCATCTTCTCCAAATTTGATCCCTTCTCTTCTCCAA
ATGCATCATGATTTGGTGATGGGAGGTCATTCAAGGTGCATCCGAACCTATAAGTGGTTGAGCAATGAGTTATACTGGAAAGTGATGAAGAAGAGAGTTCAAGACTATGT
GGCACAATCAGGGCTCCTTCAATTGTTACCCATATCAGAAAGGATTTGGGATGATATATCCATGGATTTCATAGAGGGTCTGCCGAAGTCAAGGGGTTTCGATTCAATTA
TGGTGGTGGTAGATCAGTTATGTAAATACAACCACTTTGTAACCTTGTCTCACCTGTTTTCAGCGAAGCAGGTGGCAGGGGTGTTTATTAAAGAGGTGGTTCGCCATCAT
GGATTCCCAAAATTAATAGTATCGGATAGAGATCAAAGCTTATTGAGCAATTTTTGGGTAAAGTTGTTCAGACAACAAGGCACTGTGCTGAAATGTAGTATCGCATATGA
TAGGCAAACTGAGAGGGTAAATCGCTGTCTCGAGACATACTTACGTTGTTTTTGTAATGAGAAACCCCGAAGATGGAGTGAATGGATGTCATGGGCAGAGTATTGGTATA
ACATGACATTCCATGGAGCAATTAAAACTACACCCTTCTGCATAGTTTATGGCCGTGAGCCTCCTCTTATTGCGTATGGGGACAAGAAAATCTCAAACGACACTTTAGAT
GAGCAGTTGGTTGAAAAGGATAGGGTGTTGGTGGCTTTAAAGGATCAGTTAGTTAGGGCGCAAGACCAAAGAATTGGGGCGGTCGCTTATCGGTTGAAGTTACCCTCTTC
AGCAAGGATACATAATGTGTTCCAAATTTCACAGTTAAAGAAGAAAGTAGGGAAAGATAAGGTTGTTCAACCCATGTTACCGGAGTTGACGGAAGACGCTGAAATGAATG
GACAACCTAAGATGATTATAGGAACACGTTGGAATTCAGAGAGAGGAAAAGAAGAGGTGTTGGTCGAATGGGAAAACCAGCCAAAAATATTTCCCGAATTCCACCTTGAG
CACAAGACCGCTGAATTCATACATATCATAGAAGGGGTAAAAAAAGAAAATAACCAGGTTGTTGATGTTGCAAAATCAATTAGAGAGAAAGATCCCAATAAGACATTCGG
AGTTTTGGTTGAGGAAGACAATAGTCATCTTGGAAAGTATGTTTATGAAATGTTATTGAATTATGAGTTGCCTAGGAGGCTCAGTGCTGAAGGACATGAGAAGGCATGTG
AGTTTCCCCGGGACCTAATACTGAAGGACATGAGAAGGTACTTGGGCAAACCGCCATGTTCCTATGTGCACATAGAGGTTGATTGGCTGGTGCGTCGAGTTTTGCTCCGC
ATCGAGCCAGGGATAATTACGATGTTGGGTGTGTTGAGTCTTGCTCCACCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCACAATCAGAGTCCTATCAAGTTCTAGAGGAATTGAAGAAAAGAATGTTTGATCGATATCAAACGTCCAAGAAAGGAAACTTGACACGACGTTTTCTGGCTAT
CAAACAGGAGGTAAACTATGCTAATTACCAGAAGCATTCCAAATTAGTGTTGGAAGGAACGATTATCAACAGATTGAATTCAAAATTGAAGGCGAAGGTGGAAAGTTGTA
ATCCGATGTTAATGGAGATCGAGGGGCTCACAGCCCAGAGTCACAAGAAGGGGGAAGGAAAGACAGTTTGGAGAAAACGACCCAAGGAAATGGTGAAATTCAAGAAAAAA
GGAAAAAAGGTTTATGTTTCAAATGTGACGAAAAATATTCACCCGACCACCAGTGCAAATCCAAAGAGAAGAGAGAACTTGTGGCTTTTTATCACCGGAAAAGGCCAGGT
GATTGAAGAAGTGGACGAGTTCGAAGGTGAAGAAGAAATAGAAGAAACTGCAGAGAAGGTGGAGCTAGCGCTTCGATCGATCCTAGGATTTTCCTCTCCCAGCACCATGA
AATTGAAGGGGAAAATCGGGGAACTAGAGTTAATTGTGACTATTCAATTACCAGTGGTCATTGTGCTCGAAAATTTTTTTCCATTGGACTTGGGTCATTCAGACATCATT
TTGGTAATGGCTTGGCTATATACCATGGGGTACATGGGAGTTGATTGGGCGAACCTCACCGTTACATTCACCCAAGGGAATTCGAAAGTAGTACTGAAAGAAGATGCTTC
CCTGACGAAGGCCGAAGTGTCTATGAAAATGATGATCAGTAACTGGGAGGAGGAAGATCAAGGCTTCTTAATAGAATTTCAAAGCCTGATGCTGGAATTGAAAGTAGAAG
GAAAAGTTGACCCACTGAAGTTGCAAGAAGAACAAAGAGGGATTGATCACCAGATTGTTATGAAAGAAGGACAGAAACCAATCAATGTACGCCCGTATAAATATGGACAA
ACCCAAAAAGATGAAATTGAAAGGCCAATCAAAGAAATGATGCCCAATTTCGCTAGTGACAAAGAAAGACGGGAGTGGCGGTTTTGTGTCGACTATAGAAAGGTGAATCA
AGCTACTATCTCGAAAAAATTCCCAATTCTCGTTATTGAAGAATTACTTGATGAATTACATGGCTCGACATATTTTTCCAAACTGGATTTACGCTCGGGGTATCATCAGA
TAAGAATGAAGGAAGATGATATACCAAAAATCGCTTTGCGAACTCATGAGAGGCATTATGAATTTGTTGTGATGCCATTCGACCTCACTAACGCGCTGACCACATTTCAA
TCGTTAATGAATCAAGACATTAGGTTTCAGCAACCAGAGTGCAAGCGGATGGTAGAATGGTCGATTCCAAGAAATATATCCGAATTGCGAGGAGTTTTAGGTCTGAAGGG
ATATTATCGAAGGTTTTTCAAGGATTATGAGGTTCTTGCTGTCCCGTTGGCACAATTGTTGCAGAAGAATGCTTTTGTATGGAACGAGACAGCTACTGAAGCATTCGATA
AACTGAAAAGGGCCATGGTAACCATTCCCGTGCTAGCACTTCCAAAATTTTCCAAAACATTTATAATTGAAAGGGATGCGTCTGGTTTTGGGTTGGGGGTAGTTTTGATG
CAAGAAGATCGACCGATTGCATATTTTAGTCATTGCCCTCAAATTCTTGTTGGAGCAAAGGGAAGTTCAACCTCAATTACAAAAATGGCTAGTGAAGCTCATGGGATCTC
ACTCGAAGTTGAACTGCAGATACTTTCGGCACCTGCATTGCTGGATAGTGAAATGGTCTTGAGGGAGGTTGGGCAAGACGAAGAACTACAGTTGATCATTGCAAAACTGA
AGGAGGACCCGAGTAGTGTGCTGAAATTCACTTGGGAGAATCGGTGGCTATTGTACAAAGGCATGTTAGTTATTTCATCTTCTCCAAATTTGATCCCTTCTCTTCTCCAA
ATGCATCATGATTTGGTGATGGGAGGTCATTCAAGGTGCATCCGAACCTATAAGTGGTTGAGCAATGAGTTATACTGGAAAGTGATGAAGAAGAGAGTTCAAGACTATGT
GGCACAATCAGGGCTCCTTCAATTGTTACCCATATCAGAAAGGATTTGGGATGATATATCCATGGATTTCATAGAGGGTCTGCCGAAGTCAAGGGGTTTCGATTCAATTA
TGGTGGTGGTAGATCAGTTATGTAAATACAACCACTTTGTAACCTTGTCTCACCTGTTTTCAGCGAAGCAGGTGGCAGGGGTGTTTATTAAAGAGGTGGTTCGCCATCAT
GGATTCCCAAAATTAATAGTATCGGATAGAGATCAAAGCTTATTGAGCAATTTTTGGGTAAAGTTGTTCAGACAACAAGGCACTGTGCTGAAATGTAGTATCGCATATGA
TAGGCAAACTGAGAGGGTAAATCGCTGTCTCGAGACATACTTACGTTGTTTTTGTAATGAGAAACCCCGAAGATGGAGTGAATGGATGTCATGGGCAGAGTATTGGTATA
ACATGACATTCCATGGAGCAATTAAAACTACACCCTTCTGCATAGTTTATGGCCGTGAGCCTCCTCTTATTGCGTATGGGGACAAGAAAATCTCAAACGACACTTTAGAT
GAGCAGTTGGTTGAAAAGGATAGGGTGTTGGTGGCTTTAAAGGATCAGTTAGTTAGGGCGCAAGACCAAAGAATTGGGGCGGTCGCTTATCGGTTGAAGTTACCCTCTTC
AGCAAGGATACATAATGTGTTCCAAATTTCACAGTTAAAGAAGAAAGTAGGGAAAGATAAGGTTGTTCAACCCATGTTACCGGAGTTGACGGAAGACGCTGAAATGAATG
GACAACCTAAGATGATTATAGGAACACGTTGGAATTCAGAGAGAGGAAAAGAAGAGGTGTTGGTCGAATGGGAAAACCAGCCAAAAATATTTCCCGAATTCCACCTTGAG
CACAAGACCGCTGAATTCATACATATCATAGAAGGGGTAAAAAAAGAAAATAACCAGGTTGTTGATGTTGCAAAATCAATTAGAGAGAAAGATCCCAATAAGACATTCGG
AGTTTTGGTTGAGGAAGACAATAGTCATCTTGGAAAGTATGTTTATGAAATGTTATTGAATTATGAGTTGCCTAGGAGGCTCAGTGCTGAAGGACATGAGAAGGCATGTG
AGTTTCCCCGGGACCTAATACTGAAGGACATGAGAAGGTACTTGGGCAAACCGCCATGTTCCTATGTGCACATAGAGGTTGATTGGCTGGTGCGTCGAGTTTTGCTCCGC
ATCGAGCCAGGGATAATTACGATGTTGGGTGTGTTGAGTCTTGCTCCACCCTAA
Protein sequenceShow/hide protein sequence
MESQSESYQVLEELKKRMFDRYQTSKKGNLTRRFLAIKQEVNYANYQKHSKLVLEGTIINRLNSKLKAKVESCNPMLMEIEGLTAQSHKKGEGKTVWRKRPKEMVKFKKK
GKKVYVSNVTKNIHPTTSANPKRRENLWLFITGKGQVIEEVDEFEGEEEIEETAEKVELALRSILGFSSPSTMKLKGKIGELELIVTIQLPVVIVLENFFPLDLGHSDII
LVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKEDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLMLELKVEGKVDPLKLQEEQRGIDHQIVMKEGQKPINVRPYKYGQ
TQKDEIERPIKEMMPNFASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNALTTFQ
SLMNQDIRFQQPECKRMVEWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMVTIPVLALPKFSKTFIIERDASGFGLGVVLM
QEDRPIAYFSHCPQILVGAKGSSTSITKMASEAHGISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFTWENRWLLYKGMLVISSSPNLIPSLLQ
MHHDLVMGGHSRCIRTYKWLSNELYWKVMKKRVQDYVAQSGLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVRHH
GFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSIAYDRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPLIAYGDKKISNDTLD
EQLVEKDRVLVALKDQLVRAQDQRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPKMIIGTRWNSERGKEEVLVEWENQPKIFPEFHLE
HKTAEFIHIIEGVKKENNQVVDVAKSIREKDPNKTFGVLVEEDNSHLGKYVYEMLLNYELPRRLSAEGHEKACEFPRDLILKDMRRYLGKPPCSYVHIEVDWLVRRVLLR
IEPGIITMLGVLSLAPP