; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G08960 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G08960
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionTy3/gypsy retrotransposon protein
Genome locationClcChr02:11397093..11402141
RNA-Seq ExpressionClc02G08960
SyntenyClc02G08960
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040609.1 DNA polymerase epsilon catalytic subunit A-like [Cucumis melo var. makuwa]2.6e-17638.45Show/hide
Query:  KRRENLWLFITGKGQVIEEVDEFEGEEEIEETAE--------KVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTT
        K    L  FIT + +  E   E E E+E+ E AE          E++LR ILGF+S  TMK++G I   E+I++ID  ATHNFIHR +V++  + +    
Subjt:  KRRENLWLFITGKGQVIEEVDEFEGEEEIEETAE--------KVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTT

Query:  NYGIVIENGTALQGNRVCKEVTIQLPVVIVLEKFFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEE
         +G+ I +GTAL+GN +CK V I+LP + ++  F  ++LG +D++L M WL T G+M + W ++T+ F  G+S+V+LKGD SLTKAE S+K +   WEEE
Subjt:  NYGIVIENGTALQGNRVCKEVTIQLPVVIVLEKFFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEE

Query:  DQGFLIEFQSLMLELKVEGKVDPLKLQEEQ-----------------------PNSASD-----------------------------------------
        D+GFLIEFQ + +E   E + +  +  EE                        P  A D                                         
Subjt:  DQGFLIEFQSLMLELKVEGKVDPLKLQEEQ-----------------------PNSASD-----------------------------------------

Query:  --------------KERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAP
                      K    WRFCVDYRK+NQ T++ KF I VIEELLDELHG+  FSKLDLRSGYHQI+MKE+DI K   RTHE HYEF+VMPF LTNAP
Subjt:  --------------KERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAP

Query:  TTFQSLMNQDTRFQQPECKRMV----------EWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMPGL----
         TFQSLMNQ  R   P  +R V          +  I  +     G LGL GYYRRF K+Y  +A PL +LLQKN F W E AT AF+ LK+AM  +    
Subjt:  TTFQSLMNQDTRFQQPECKRMV----------EWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMPGL----

Query:  ------------------------NNKV------------------ADALPQISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKF
                                  K+                  ADAL Q+ L  E   L A  LLD ++V  EV +DEELQ II  LKEDP     +
Subjt:  ------------------------NNKV------------------ADALPQISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKF

Query:  TWENRWLLYK-----------------------------------------------------------------------GLLQLLPISERIWDDISMD
         W+   L YK                                                                       GLLQ LP+   I +D +MD
Subjt:  TWENRWLLYK-----------------------------------------------------------------------GLLQLLPISERIWDDISMD

Query:  FIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITY----DRQTERVNR
        FIEGLPK+ G DSIMVVVD+L K  HF+ L H FSAKQVA  F++E++  HG P  IV+DRD+  LS+FW +LF   GT LK S  +    D QTERVNR
Subjt:  FIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITY----DRQTERVNR

Query:  CLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD------------
        CLETYLRCFCNE+P +W + + WAE WYN TFH   KTTPF +VYGR  PPL+ YGD K +N+++++ L E+D V+ ALK+ L+  Q+            
Subjt:  CLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD------------

Query:  ----------------------------------------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMIIGT
                                                + IG VAYRL LP  A IHNVF +SQLK K+G   +VQ + P LTE+ E+  + E ++G 
Subjt:  ----------------------------------------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMIIGT

Query:  RWNSERGKEEVLVEW-------------ENQPKIFPEFHLEHK
        RWN++ G  E L++W             E   + +P FHLE K
Subjt:  RWNSERGKEEVLVEW-------------ENQPKIFPEFHLEHK

KAA0058899.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]6.3e-17035.3Show/hide
Query:  EEEIEETAEKVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTTNYGIVIENGTALQGNRVCKEVTIQLPVVIVLEK
        E E+ E +  VEL+L  ++G ++P T KLKG +   E+++M+DCGATHNFI  +LV++ K+ +  TTNYG+++ +G A+QG  +CK +T+ LPV+ ++E 
Subjt:  EEEIEETAEKVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTTNYGIVIENGTALQGNRVCKEVTIQLPVVIVLEK

Query:  FFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLML-----ELKVEGKVDP-----
        F PL+LG++D++L M WL   G M VDW  LT+TF  G++KV+LKGD SLT+ E+S+K+++  W+ +DQGFL+ F+++ +     EL V   V+      
Subjt:  FFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLML-----ELKVEGKVDP-----

Query:  LKLQEE------------------------------------------------------------------QPNSASDKERREWRFCVDYRKVNQATIS
         +LQ+E                                                                   P     K+   WRFCVDYR +N+AT+ 
Subjt:  LKLQEE------------------------------------------------------------------QPNSASDKERREWRFCVDYRKVNQATIS

Query:  KKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMNQDTR-----------------------
         KFPI +I+ELLDEL G++ FSK+DL+SGYHQIR++++DI K A RTHE HYEF+VMPF LTNAP TFQ+LMNQ  R                       
Subjt:  KKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMNQDTR-----------------------

Query:  --------FQ------------------------------------QPECKRMVEWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVW
                FQ                                    Q + K M+EW IP+NI ELRG LGL GYYRRF  +Y  +A PL +L +KN F W
Subjt:  --------FQ------------------------------------QPECKRMVEWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVW

Query:  NETATEAFDKLKRAM-------------------------------------------------------------------------------------
        +E AT+AF++LKRAM                                                                                     
Subjt:  NETATEAFDKLKRAM-------------------------------------------------------------------------------------

Query:  -----------PGL--------------------NNKVADALPQISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFT-------
                   PG+                     NK ADAL ++  E EL  ++ P+LLD  ++ +EV  DE+L  I  ++  DP  V ++T       
Subjt:  -----------PGL--------------------NNKVADALPQISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFT-------

Query:  WEN--------------------RWLLYKGLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHG
        W+                     + L   GLLQ LPI  RIW+DISMDF+EGLP+S+GFD+I+VVVD L KY HF+TL H FSAK VA VFIKEVV  HG
Subjt:  WEN--------------------RWLLYKGLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHG

Query:  FPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPL
        +P+ IVSDRD+  LS+FW +LFR QGT LK S  Y    D QTE VN+CLE YLRC C EK + WS+ ++WAEYWYN  +  +IK TP+ +VYG+  PP+
Subjt:  FPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPL

Query:  IAYGDKKIS-NDTLDEQLVEKDRVLVALKDQLVRAQDQ----------------------------------------------------RIGAVAYRLK
        I+YG    + ND++++QL  +D +L  LK  L  AQ++                                                    R+G VAY L 
Subjt:  IAYGDKKIS-NDTLDEQLVEKDRVLVALKDQLVRAQDQ----------------------------------------------------RIGAVAYRLK

Query:  LPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMIIGTRWNSERGKEEVLVEWENQPK-------------IFPEFHLEHKRRLSAEG
        LP +A+IH VF +SQLKK +G    V P +  + +  E+  +PE ++   WN      E LV+W+NQP               FP  HLE K  L   G
Subjt:  LPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMIIGTRWNSERGKEEVLVEWENQPK-------------IFPEFHLEHKRRLSAEG

TYK05651.1 Retrotransposable element Tf2 [Cucumis melo var. makuwa]7.0e-17738.45Show/hide
Query:  KRRENLWLFITGKGQVIEEVDEFEGEEEIEETAE--------KVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTT
        K    L  FIT + +  E   E E E+E+ E AE          E++LR ILGF+S  TMK++G I   E+I++ID  ATHNFIHR +V++  + +    
Subjt:  KRRENLWLFITGKGQVIEEVDEFEGEEEIEETAE--------KVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTT

Query:  NYGIVIENGTALQGNRVCKEVTIQLPVVIVLEKFFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEE
         +G+ I +GTAL+GN +CK V I+LP + ++  F  ++LG +D++L M WL T G+M + W ++T+ F  G+S+V+LKGD SLTKAE S+K +   WEEE
Subjt:  NYGIVIENGTALQGNRVCKEVTIQLPVVIVLEKFFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEE

Query:  DQGFLIEFQSLMLELKVEGKVDPLKLQEEQ-----------------------PNSASD-----------------------------------------
        D+GFLIEFQ + +E   E + +  +  EE                        P  A D                                         
Subjt:  DQGFLIEFQSLMLELKVEGKVDPLKLQEEQ-----------------------PNSASD-----------------------------------------

Query:  --------------KERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAP
                      K    WRFCVDYRK+NQ T++ KF I VIEELLDELHG+  FSKLDLRSGYHQI+MKE+DI K   RTHE HYEF+VMPF LTNAP
Subjt:  --------------KERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAP

Query:  TTFQSLMNQDTRFQQPECKRMV----------EWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMPGL----
         TFQSLMNQ  R   P  +R V          +  I  +     G LGL GYYRRF K+Y  +A PL +LLQKN F W E AT AF+ LK+AM  +    
Subjt:  TTFQSLMNQDTRFQQPECKRMV----------EWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMPGL----

Query:  ------------------------NNKV------------------ADALPQISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKF
                                  K+                  ADAL Q+ L  E   L A  LLD ++V  EV +DEELQ II  LKEDP     +
Subjt:  ------------------------NNKV------------------ADALPQISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKF

Query:  TWENRWLLYK-----------------------------------------------------------------------GLLQLLPISERIWDDISMD
         W+   L YK                                                                       GLLQ LP+   I +D +MD
Subjt:  TWENRWLLYK-----------------------------------------------------------------------GLLQLLPISERIWDDISMD

Query:  FIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITY----DRQTERVNR
        FIEGLPK+ G+DSIMVVVD+L K  HF+ L H FSAKQVA  F++E++  HG P  IV+DRD+  LS+FW +LF   GT LK S  +    D QTERVNR
Subjt:  FIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITY----DRQTERVNR

Query:  CLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD------------
        CLETYLRCFCNE+P +W + + WAE WYN TFH   KTTPF +VYGR  PPL+ YGD K +N+++++ L E+D V+ ALK+ L+  Q+            
Subjt:  CLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD------------

Query:  ----------------------------------------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMIIGT
                                                + IG VAYRL LP  A IHNVF +SQLK K+G   +VQ + P LTE+ E+  + E ++G 
Subjt:  ----------------------------------------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMIIGT

Query:  RWNSERGKEEVLVEW-------------ENQPKIFPEFHLEHK
        RWN++ G  E L++W             E   + +P FHLE K
Subjt:  RWNSERGKEEVLVEW-------------ENQPKIFPEFHLEHK

TYK10423.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.2e-16834.08Show/hide
Query:  IEEVDE-FEGEEEIEETAEKVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTTNYGIVIENGTALQGNRVCKEVTI
        +E VD  +EG  E+ E +  VEL+L  ++G ++P T KLKG +   E+++M+DCGATHNFI  +LV++ K+ +  TTNYG+++ +G A+QG  +CK +T+
Subjt:  IEEVDE-FEGEEEIEETAEKVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTTNYGIVIENGTALQGNRVCKEVTI

Query:  QLPVVIVLEKFFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLML-----ELKVE
         LPV+ ++E F PL+LG++D++L M WL   G M VDW  LT+TF  G++KV+LKGD SLT+ E+S+K+++  W+ +DQGFL+ F+++ +     EL V 
Subjt:  QLPVVIVLEKFFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLML-----ELKVE

Query:  GKVDP-----LKLQEE------------------------------------------------------------------QPNSASDKERREWRFCVD
          V+       +LQ+E                                                                   P     K+   WRFCVD
Subjt:  GKVDP-----LKLQEE------------------------------------------------------------------QPNSASDKERREWRFCVD

Query:  YRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMNQDTR-------------
        YR +N+AT+  KFPI +I+ELLDEL G++ FSK+DL+SGYHQIR++++DI K A RTHE HYEF+VMPF LTNAP TFQ+LMNQ  R             
Subjt:  YRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMNQDTR-------------

Query:  ------------------FQ------------------------------------QPECKRMVEWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLA
                          FQ                                    Q + K M+EW IP+NI ELRG LGL GYYRRF  +Y  +A PL 
Subjt:  ------------------FQ------------------------------------QPECKRMVEWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLA

Query:  QLLQKNAFVWNETATEAFDKLKRAM---------------------------------------------------------------------------
        +L +KN F W+E AT+AF++LKRAM                                                                           
Subjt:  QLLQKNAFVWNETATEAFDKLKRAM---------------------------------------------------------------------------

Query:  ---------------------PGL--------------------NNKVADALPQISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVL
                             PG+                     NK ADAL ++  E EL  ++ P+LLD  ++ +EV  DE+L+ I  ++  DP  V 
Subjt:  ---------------------PGL--------------------NNKVADALPQISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVL

Query:  KFTWENRWLLYK-----------------------------------------------------------------------GLLQLLPISERIWDDIS
        ++T     L YK                                                                       GLLQ LPI  RIW+DIS
Subjt:  KFTWENRWLLYK-----------------------------------------------------------------------GLLQLLPISERIWDDIS

Query:  MDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITY----DRQTERV
        MDF+EGLP+S+GFD+I+VVVD+L KY HF+TL H FSAK VA VFIKEVV  HG+P+ IVSDRD+  LS+FW +LFR QGT LK S  Y    D QTE V
Subjt:  MDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITY----DRQTERV

Query:  NRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKIS-NDTLDEQLVEKDRVLVALKDQLVRAQDQ--------
        N+CLE YLRC C EK + WS+ ++WAEYWYN  +  +IK+TP+ +VYG+  PP+I+YG    + ND++++QL  +D +L  LK  L  AQ++        
Subjt:  NRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKIS-NDTLDEQLVEKDRVLVALKDQLVRAQDQ--------

Query:  --------------------------------------------RIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMI
                                                    R+G VAY L LP +A+IH VF +SQLKK +G    V P +  + +  E+  +PE +
Subjt:  --------------------------------------------RIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMI

Query:  IGTRWNSERGKEEVLVEWENQPK-------------IFPEFHLEHKRRLSAEG
        +  RWN      E LV+W++QP               FP  HLE K  L   G
Subjt:  IGTRWNSERGKEEVLVEWENQPK-------------IFPEFHLEHKRRLSAEG

TYK21209.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.2e-16834.08Show/hide
Query:  IEEVDE-FEGEEEIEETAEKVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTTNYGIVIENGTALQGNRVCKEVTI
        +E VD  +EG  E+ E +  VEL+L  ++G ++P T KLKG +   E+++M+DCGATHNFI  +LV++ K+ +  TTNYG+++ +G A+QG  +CK +T+
Subjt:  IEEVDE-FEGEEEIEETAEKVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTTNYGIVIENGTALQGNRVCKEVTI

Query:  QLPVVIVLEKFFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLML-----ELKVE
         LPV+ ++E F PL+LG++D++L M WL   G M VDW  LT+TF  G++KV+LKGD SLT+ E+S+K+++  W+ +DQGFL+ F+++ +     EL V 
Subjt:  QLPVVIVLEKFFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLML-----ELKVE

Query:  GKVDP-----LKLQEE------------------------------------------------------------------QPNSASDKERREWRFCVD
          V+       +LQ+E                                                                   P     K+   WRFCVD
Subjt:  GKVDP-----LKLQEE------------------------------------------------------------------QPNSASDKERREWRFCVD

Query:  YRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMNQDTR-------------
        YR +N+AT+  KFPI +I+ELLDEL G++ FSK+DL+SGYHQIR++++DI K A RTHE HYEF+VMPF LTNAP TFQ+LMNQ  R             
Subjt:  YRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMNQDTR-------------

Query:  ------------------FQ------------------------------------QPECKRMVEWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLA
                          FQ                                    Q + K M+EW IP+NI ELRG LGL GYYRRF  +Y  +A PL 
Subjt:  ------------------FQ------------------------------------QPECKRMVEWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLA

Query:  QLLQKNAFVWNETATEAFDKLKRAM---------------------------------------------------------------------------
        +L +KN F W+E AT+AF++LKRAM                                                                           
Subjt:  QLLQKNAFVWNETATEAFDKLKRAM---------------------------------------------------------------------------

Query:  ---------------------PGL--------------------NNKVADALPQISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVL
                             PG+                     NK ADAL ++  E EL  ++ P+LLD  ++ +EV  DE+L+ I  ++  DP  V 
Subjt:  ---------------------PGL--------------------NNKVADALPQISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVL

Query:  KFTWENRWLLYK-----------------------------------------------------------------------GLLQLLPISERIWDDIS
        ++T     L YK                                                                       GLLQ LPI  RIW+DIS
Subjt:  KFTWENRWLLYK-----------------------------------------------------------------------GLLQLLPISERIWDDIS

Query:  MDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITY----DRQTERV
        MDF+EGLP+S+GFD+I+VVVD+L KY HF+TL H FSAK VA VFIKEVV  HG+P+ IVSDRD+  LS+FW +LFR QGT LK S  Y    D QTE V
Subjt:  MDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITY----DRQTERV

Query:  NRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKIS-NDTLDEQLVEKDRVLVALKDQLVRAQDQ--------
        N+CLE YLRC C EK + WS+ ++WAEYWYN  +  +IK TP+ +VYG+  PP+I+YG    + ND++++QL  +D +L  LK  L  AQ++        
Subjt:  NRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKIS-NDTLDEQLVEKDRVLVALKDQLVRAQDQ--------

Query:  --------------------------------------------RIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMI
                                                    R+G VAY L LP +A+IH VF +SQLKK +G    V P +  + +  E+  +PE +
Subjt:  --------------------------------------------RIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMI

Query:  IGTRWNSERGKEEVLVEWENQPK-------------IFPEFHLEHKRRLSAEG
        +  RWN      E LV+W++QP               FP  HLE K  L   G
Subjt:  IGTRWNSERGKEEVLVEWENQPK-------------IFPEFHLEHKRRLSAEG

TrEMBL top hitse value%identityAlignment
A0A5A7TH70 DNA polymerase epsilon catalytic subunit1.3e-17638.45Show/hide
Query:  KRRENLWLFITGKGQVIEEVDEFEGEEEIEETAE--------KVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTT
        K    L  FIT + +  E   E E E+E+ E AE          E++LR ILGF+S  TMK++G I   E+I++ID  ATHNFIHR +V++  + +    
Subjt:  KRRENLWLFITGKGQVIEEVDEFEGEEEIEETAE--------KVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTT

Query:  NYGIVIENGTALQGNRVCKEVTIQLPVVIVLEKFFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEE
         +G+ I +GTAL+GN +CK V I+LP + ++  F  ++LG +D++L M WL T G+M + W ++T+ F  G+S+V+LKGD SLTKAE S+K +   WEEE
Subjt:  NYGIVIENGTALQGNRVCKEVTIQLPVVIVLEKFFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEE

Query:  DQGFLIEFQSLMLELKVEGKVDPLKLQEEQ-----------------------PNSASD-----------------------------------------
        D+GFLIEFQ + +E   E + +  +  EE                        P  A D                                         
Subjt:  DQGFLIEFQSLMLELKVEGKVDPLKLQEEQ-----------------------PNSASD-----------------------------------------

Query:  --------------KERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAP
                      K    WRFCVDYRK+NQ T++ KF I VIEELLDELHG+  FSKLDLRSGYHQI+MKE+DI K   RTHE HYEF+VMPF LTNAP
Subjt:  --------------KERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAP

Query:  TTFQSLMNQDTRFQQPECKRMV----------EWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMPGL----
         TFQSLMNQ  R   P  +R V          +  I  +     G LGL GYYRRF K+Y  +A PL +LLQKN F W E AT AF+ LK+AM  +    
Subjt:  TTFQSLMNQDTRFQQPECKRMV----------EWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMPGL----

Query:  ------------------------NNKV------------------ADALPQISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKF
                                  K+                  ADAL Q+ L  E   L A  LLD ++V  EV +DEELQ II  LKEDP     +
Subjt:  ------------------------NNKV------------------ADALPQISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKF

Query:  TWENRWLLYK-----------------------------------------------------------------------GLLQLLPISERIWDDISMD
         W+   L YK                                                                       GLLQ LP+   I +D +MD
Subjt:  TWENRWLLYK-----------------------------------------------------------------------GLLQLLPISERIWDDISMD

Query:  FIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITY----DRQTERVNR
        FIEGLPK+ G DSIMVVVD+L K  HF+ L H FSAKQVA  F++E++  HG P  IV+DRD+  LS+FW +LF   GT LK S  +    D QTERVNR
Subjt:  FIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITY----DRQTERVNR

Query:  CLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD------------
        CLETYLRCFCNE+P +W + + WAE WYN TFH   KTTPF +VYGR  PPL+ YGD K +N+++++ L E+D V+ ALK+ L+  Q+            
Subjt:  CLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD------------

Query:  ----------------------------------------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMIIGT
                                                + IG VAYRL LP  A IHNVF +SQLK K+G   +VQ + P LTE+ E+  + E ++G 
Subjt:  ----------------------------------------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMIIGT

Query:  RWNSERGKEEVLVEW-------------ENQPKIFPEFHLEHK
        RWN++ G  E L++W             E   + +P FHLE K
Subjt:  RWNSERGKEEVLVEW-------------ENQPKIFPEFHLEHK

A0A5A7UZE8 Ty3/gypsy retrotransposon protein3.1e-17035.3Show/hide
Query:  EEEIEETAEKVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTTNYGIVIENGTALQGNRVCKEVTIQLPVVIVLEK
        E E+ E +  VEL+L  ++G ++P T KLKG +   E+++M+DCGATHNFI  +LV++ K+ +  TTNYG+++ +G A+QG  +CK +T+ LPV+ ++E 
Subjt:  EEEIEETAEKVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTTNYGIVIENGTALQGNRVCKEVTIQLPVVIVLEK

Query:  FFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLML-----ELKVEGKVDP-----
        F PL+LG++D++L M WL   G M VDW  LT+TF  G++KV+LKGD SLT+ E+S+K+++  W+ +DQGFL+ F+++ +     EL V   V+      
Subjt:  FFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLML-----ELKVEGKVDP-----

Query:  LKLQEE------------------------------------------------------------------QPNSASDKERREWRFCVDYRKVNQATIS
         +LQ+E                                                                   P     K+   WRFCVDYR +N+AT+ 
Subjt:  LKLQEE------------------------------------------------------------------QPNSASDKERREWRFCVDYRKVNQATIS

Query:  KKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMNQDTR-----------------------
         KFPI +I+ELLDEL G++ FSK+DL+SGYHQIR++++DI K A RTHE HYEF+VMPF LTNAP TFQ+LMNQ  R                       
Subjt:  KKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMNQDTR-----------------------

Query:  --------FQ------------------------------------QPECKRMVEWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVW
                FQ                                    Q + K M+EW IP+NI ELRG LGL GYYRRF  +Y  +A PL +L +KN F W
Subjt:  --------FQ------------------------------------QPECKRMVEWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVW

Query:  NETATEAFDKLKRAM-------------------------------------------------------------------------------------
        +E AT+AF++LKRAM                                                                                     
Subjt:  NETATEAFDKLKRAM-------------------------------------------------------------------------------------

Query:  -----------PGL--------------------NNKVADALPQISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFT-------
                   PG+                     NK ADAL ++  E EL  ++ P+LLD  ++ +EV  DE+L  I  ++  DP  V ++T       
Subjt:  -----------PGL--------------------NNKVADALPQISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFT-------

Query:  WEN--------------------RWLLYKGLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHG
        W+                     + L   GLLQ LPI  RIW+DISMDF+EGLP+S+GFD+I+VVVD L KY HF+TL H FSAK VA VFIKEVV  HG
Subjt:  WEN--------------------RWLLYKGLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHG

Query:  FPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPL
        +P+ IVSDRD+  LS+FW +LFR QGT LK S  Y    D QTE VN+CLE YLRC C EK + WS+ ++WAEYWYN  +  +IK TP+ +VYG+  PP+
Subjt:  FPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPL

Query:  IAYGDKKIS-NDTLDEQLVEKDRVLVALKDQLVRAQDQ----------------------------------------------------RIGAVAYRLK
        I+YG    + ND++++QL  +D +L  LK  L  AQ++                                                    R+G VAY L 
Subjt:  IAYGDKKIS-NDTLDEQLVEKDRVLVALKDQLVRAQDQ----------------------------------------------------RIGAVAYRLK

Query:  LPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMIIGTRWNSERGKEEVLVEWENQPK-------------IFPEFHLEHKRRLSAEG
        LP +A+IH VF +SQLKK +G    V P +  + +  E+  +PE ++   WN      E LV+W+NQP               FP  HLE K  L   G
Subjt:  LPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMIIGTRWNSERGKEEVLVEWENQPK-------------IFPEFHLEHKRRLSAEG

A0A5D3C142 Retrotransposable element Tf23.4e-17738.45Show/hide
Query:  KRRENLWLFITGKGQVIEEVDEFEGEEEIEETAE--------KVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTT
        K    L  FIT + +  E   E E E+E+ E AE          E++LR ILGF+S  TMK++G I   E+I++ID  ATHNFIHR +V++  + +    
Subjt:  KRRENLWLFITGKGQVIEEVDEFEGEEEIEETAE--------KVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTT

Query:  NYGIVIENGTALQGNRVCKEVTIQLPVVIVLEKFFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEE
         +G+ I +GTAL+GN +CK V I+LP + ++  F  ++LG +D++L M WL T G+M + W ++T+ F  G+S+V+LKGD SLTKAE S+K +   WEEE
Subjt:  NYGIVIENGTALQGNRVCKEVTIQLPVVIVLEKFFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEE

Query:  DQGFLIEFQSLMLELKVEGKVDPLKLQEEQ-----------------------PNSASD-----------------------------------------
        D+GFLIEFQ + +E   E + +  +  EE                        P  A D                                         
Subjt:  DQGFLIEFQSLMLELKVEGKVDPLKLQEEQ-----------------------PNSASD-----------------------------------------

Query:  --------------KERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAP
                      K    WRFCVDYRK+NQ T++ KF I VIEELLDELHG+  FSKLDLRSGYHQI+MKE+DI K   RTHE HYEF+VMPF LTNAP
Subjt:  --------------KERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAP

Query:  TTFQSLMNQDTRFQQPECKRMV----------EWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMPGL----
         TFQSLMNQ  R   P  +R V          +  I  +     G LGL GYYRRF K+Y  +A PL +LLQKN F W E AT AF+ LK+AM  +    
Subjt:  TTFQSLMNQDTRFQQPECKRMV----------EWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMPGL----

Query:  ------------------------NNKV------------------ADALPQISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKF
                                  K+                  ADAL Q+ L  E   L A  LLD ++V  EV +DEELQ II  LKEDP     +
Subjt:  ------------------------NNKV------------------ADALPQISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKF

Query:  TWENRWLLYK-----------------------------------------------------------------------GLLQLLPISERIWDDISMD
         W+   L YK                                                                       GLLQ LP+   I +D +MD
Subjt:  TWENRWLLYK-----------------------------------------------------------------------GLLQLLPISERIWDDISMD

Query:  FIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITY----DRQTERVNR
        FIEGLPK+ G+DSIMVVVD+L K  HF+ L H FSAKQVA  F++E++  HG P  IV+DRD+  LS+FW +LF   GT LK S  +    D QTERVNR
Subjt:  FIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITY----DRQTERVNR

Query:  CLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD------------
        CLETYLRCFCNE+P +W + + WAE WYN TFH   KTTPF +VYGR  PPL+ YGD K +N+++++ L E+D V+ ALK+ L+  Q+            
Subjt:  CLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQD------------

Query:  ----------------------------------------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMIIGT
                                                + IG VAYRL LP  A IHNVF +SQLK K+G   +VQ + P LTE+ E+  + E ++G 
Subjt:  ----------------------------------------QRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMIIGT

Query:  RWNSERGKEEVLVEW-------------ENQPKIFPEFHLEHK
        RWN++ G  E L++W             E   + +P FHLE K
Subjt:  RWNSERGKEEVLVEW-------------ENQPKIFPEFHLEHK

A0A5D3CEX8 Ty3/gypsy retrotransposon protein5.8e-16934.08Show/hide
Query:  IEEVDE-FEGEEEIEETAEKVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTTNYGIVIENGTALQGNRVCKEVTI
        +E VD  +EG  E+ E +  VEL+L  ++G ++P T KLKG +   E+++M+DCGATHNFI  +LV++ K+ +  TTNYG+++ +G A+QG  +CK +T+
Subjt:  IEEVDE-FEGEEEIEETAEKVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTTNYGIVIENGTALQGNRVCKEVTI

Query:  QLPVVIVLEKFFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLML-----ELKVE
         LPV+ ++E F PL+LG++D++L M WL   G M VDW  LT+TF  G++KV+LKGD SLT+ E+S+K+++  W+ +DQGFL+ F+++ +     EL V 
Subjt:  QLPVVIVLEKFFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLML-----ELKVE

Query:  GKVDP-----LKLQEE------------------------------------------------------------------QPNSASDKERREWRFCVD
          V+       +LQ+E                                                                   P     K+   WRFCVD
Subjt:  GKVDP-----LKLQEE------------------------------------------------------------------QPNSASDKERREWRFCVD

Query:  YRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMNQDTR-------------
        YR +N+AT+  KFPI +I+ELLDEL G++ FSK+DL+SGYHQIR++++DI K A RTHE HYEF+VMPF LTNAP TFQ+LMNQ  R             
Subjt:  YRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMNQDTR-------------

Query:  ------------------FQ------------------------------------QPECKRMVEWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLA
                          FQ                                    Q + K M+EW IP+NI ELRG LGL GYYRRF  +Y  +A PL 
Subjt:  ------------------FQ------------------------------------QPECKRMVEWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLA

Query:  QLLQKNAFVWNETATEAFDKLKRAM---------------------------------------------------------------------------
        +L +KN F W+E AT+AF++LKRAM                                                                           
Subjt:  QLLQKNAFVWNETATEAFDKLKRAM---------------------------------------------------------------------------

Query:  ---------------------PGL--------------------NNKVADALPQISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVL
                             PG+                     NK ADAL ++  E EL  ++ P+LLD  ++ +EV  DE+L+ I  ++  DP  V 
Subjt:  ---------------------PGL--------------------NNKVADALPQISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVL

Query:  KFTWENRWLLYK-----------------------------------------------------------------------GLLQLLPISERIWDDIS
        ++T     L YK                                                                       GLLQ LPI  RIW+DIS
Subjt:  KFTWENRWLLYK-----------------------------------------------------------------------GLLQLLPISERIWDDIS

Query:  MDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITY----DRQTERV
        MDF+EGLP+S+GFD+I+VVVD+L KY HF+TL H FSAK VA VFIKEVV  HG+P+ IVSDRD+  LS+FW +LFR QGT LK S  Y    D QTE V
Subjt:  MDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITY----DRQTERV

Query:  NRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKIS-NDTLDEQLVEKDRVLVALKDQLVRAQDQ--------
        N+CLE YLRC C EK + WS+ ++WAEYWYN  +  +IK+TP+ +VYG+  PP+I+YG    + ND++++QL  +D +L  LK  L  AQ++        
Subjt:  NRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKIS-NDTLDEQLVEKDRVLVALKDQLVRAQDQ--------

Query:  --------------------------------------------RIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMI
                                                    R+G VAY L LP +A+IH VF +SQLKK +G    V P +  + +  E+  +PE +
Subjt:  --------------------------------------------RIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMI

Query:  IGTRWNSERGKEEVLVEWENQPK-------------IFPEFHLEHKRRLSAEG
        +  RWN      E LV+W++QP               FP  HLE K  L   G
Subjt:  IGTRWNSERGKEEVLVEWENQPK-------------IFPEFHLEHKRRLSAEG

A0A5D3DD68 Ty3/gypsy retrotransposon protein5.8e-16934.08Show/hide
Query:  IEEVDE-FEGEEEIEETAEKVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTTNYGIVIENGTALQGNRVCKEVTI
        +E VD  +EG  E+ E +  VEL+L  ++G ++P T KLKG +   E+++M+DCGATHNFI  +LV++ K+ +  TTNYG+++ +G A+QG  +CK +T+
Subjt:  IEEVDE-FEGEEEIEETAEKVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTTNYGIVIENGTALQGNRVCKEVTI

Query:  QLPVVIVLEKFFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLML-----ELKVE
         LPV+ ++E F PL+LG++D++L M WL   G M VDW  LT+TF  G++KV+LKGD SLT+ E+S+K+++  W+ +DQGFL+ F+++ +     EL V 
Subjt:  QLPVVIVLEKFFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEEDQGFLIEFQSLML-----ELKVE

Query:  GKVDP-----LKLQEE------------------------------------------------------------------QPNSASDKERREWRFCVD
          V+       +LQ+E                                                                   P     K+   WRFCVD
Subjt:  GKVDP-----LKLQEE------------------------------------------------------------------QPNSASDKERREWRFCVD

Query:  YRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMNQDTR-------------
        YR +N+AT+  KFPI +I+ELLDEL G++ FSK+DL+SGYHQIR++++DI K A RTHE HYEF+VMPF LTNAP TFQ+LMNQ  R             
Subjt:  YRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMNQDTR-------------

Query:  ------------------FQ------------------------------------QPECKRMVEWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLA
                          FQ                                    Q + K M+EW IP+NI ELRG LGL GYYRRF  +Y  +A PL 
Subjt:  ------------------FQ------------------------------------QPECKRMVEWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLA

Query:  QLLQKNAFVWNETATEAFDKLKRAM---------------------------------------------------------------------------
        +L +KN F W+E AT+AF++LKRAM                                                                           
Subjt:  QLLQKNAFVWNETATEAFDKLKRAM---------------------------------------------------------------------------

Query:  ---------------------PGL--------------------NNKVADALPQISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVL
                             PG+                     NK ADAL ++  E EL  ++ P+LLD  ++ +EV  DE+L+ I  ++  DP  V 
Subjt:  ---------------------PGL--------------------NNKVADALPQISLEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVL

Query:  KFTWENRWLLYK-----------------------------------------------------------------------GLLQLLPISERIWDDIS
        ++T     L YK                                                                       GLLQ LPI  RIW+DIS
Subjt:  KFTWENRWLLYK-----------------------------------------------------------------------GLLQLLPISERIWDDIS

Query:  MDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITY----DRQTERV
        MDF+EGLP+S+GFD+I+VVVD+L KY HF+TL H FSAK VA VFIKEVV  HG+P+ IVSDRD+  LS+FW +LFR QGT LK S  Y    D QTE V
Subjt:  MDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITY----DRQTERV

Query:  NRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKIS-NDTLDEQLVEKDRVLVALKDQLVRAQDQ--------
        N+CLE YLRC C EK + WS+ ++WAEYWYN  +  +IK TP+ +VYG+  PP+I+YG    + ND++++QL  +D +L  LK  L  AQ++        
Subjt:  NRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGR-EPPLIAYGDKKIS-NDTLDEQLVEKDRVLVALKDQLVRAQDQ--------

Query:  --------------------------------------------RIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMI
                                                    R+G VAY L LP +A+IH VF +SQLKK +G    V P +  + +  E+  +PE +
Subjt:  --------------------------------------------RIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMI

Query:  IGTRWNSERGKEEVLVEWENQPK-------------IFPEFHLEHKRRLSAEG
        +  RWN      E LV+W++QP               FP  HLE K  L   G
Subjt:  IGTRWNSERGKEEVLVEWENQPK-------------IFPEFHLEHKRRLSAEG

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein2.5e-2835.82Show/hide
Query:  GLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVL
        G LQ +P SER W+ +SMDFI  LP+S G++++ VVVD+  K    V  +   +A+Q A +F + V+ + G PK I++D D    S  W     +   V+
Subjt:  GLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVL

Query:  KCSITY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQ
        K S+ Y    D QTER N+ +E  LRC C+  P  W + +S  +  YN   H A + TPF IV+ R  P ++  +    +D  DE   E  +V   +K+ 
Subjt:  KCSITY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQ

Query:  L
        L
Subjt:  L

P0CT34 Transposon Tf2-1 polyprotein5.1e-1339.36Show/hide
Query:  KERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMN
        K+    R  VDY+ +N+      +P+ +IE+LL ++ GST F+KLDL+S YH IR+++ D  K+A R     +E++VMP+ ++ AP  FQ  +N
Subjt:  KERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMN

P0CT35 Transposon Tf2-2 polyprotein2.5e-2835.82Show/hide
Query:  GLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVL
        G LQ +P SER W+ +SMDFI  LP+S G++++ VVVD+  K    V  +   +A+Q A +F + V+ + G PK I++D D    S  W     +   V+
Subjt:  GLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVL

Query:  KCSITY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQ
        K S+ Y    D QTER N+ +E  LRC C+  P  W + +S  +  YN   H A + TPF IV+ R  P ++  +    +D  DE   E  +V   +K+ 
Subjt:  KCSITY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQ

Query:  L
        L
Subjt:  L

P0CT35 Transposon Tf2-2 polyprotein5.1e-1339.36Show/hide
Query:  KERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMN
        K+    R  VDY+ +N+      +P+ +IE+LL ++ GST F+KLDL+S YH IR+++ D  K+A R     +E++VMP+ ++ AP  FQ  +N
Subjt:  KERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMN

P0CT36 Transposon Tf2-3 polyprotein2.5e-2835.82Show/hide
Query:  GLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVL
        G LQ +P SER W+ +SMDFI  LP+S G++++ VVVD+  K    V  +   +A+Q A +F + V+ + G PK I++D D    S  W     +   V+
Subjt:  GLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVL

Query:  KCSITY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQ
        K S+ Y    D QTER N+ +E  LRC C+  P  W + +S  +  YN   H A + TPF IV+ R  P ++  +    +D  DE   E  +V   +K+ 
Subjt:  KCSITY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQ

Query:  L
        L
Subjt:  L

P0CT36 Transposon Tf2-3 polyprotein5.1e-1339.36Show/hide
Query:  KERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMN
        K+    R  VDY+ +N+      +P+ +IE+LL ++ GST F+KLDL+S YH IR+++ D  K+A R     +E++VMP+ ++ AP  FQ  +N
Subjt:  KERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMN

P0CT41 Transposon Tf2-12 polyprotein2.5e-2835.82Show/hide
Query:  GLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVL
        G LQ +P SER W+ +SMDFI  LP+S G++++ VVVD+  K    V  +   +A+Q A +F + V+ + G PK I++D D    S  W     +   V+
Subjt:  GLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVL

Query:  KCSITY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQ
        K S+ Y    D QTER N+ +E  LRC C+  P  W + +S  +  YN   H A + TPF IV+ R  P ++  +    +D  DE   E  +V   +K+ 
Subjt:  KCSITY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQ

Query:  L
        L
Subjt:  L

P0CT41 Transposon Tf2-12 polyprotein5.1e-1339.36Show/hide
Query:  KERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMN
        K+    R  VDY+ +N+      +P+ +IE+LL ++ GST F+KLDL+S YH IR+++ D  K+A R     +E++VMP+ ++ AP  FQ  +N
Subjt:  KERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMN

Q9UR07 Transposon Tf2-11 polyprotein2.5e-2835.82Show/hide
Query:  GLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVL
        G LQ +P SER W+ +SMDFI  LP+S G++++ VVVD+  K    V  +   +A+Q A +F + V+ + G PK I++D D    S  W     +   V+
Subjt:  GLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFSAKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVL

Query:  KCSITY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQ
        K S+ Y    D QTER N+ +E  LRC C+  P  W + +S  +  YN   H A + TPF IV+ R  P ++  +    +D  DE   E  +V   +K+ 
Subjt:  KCSITY----DRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREPPLIAYGDKKISNDTLDEQLVEKDRVLVALKDQ

Query:  L
        L
Subjt:  L

Q9UR07 Transposon Tf2-11 polyprotein1.1e-1239.36Show/hide
Query:  KERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMN
        K+    R  VDY+ +N+      +P+ +IE+LL ++ GST F+KLDL+S YH IR+++ D  K+A R     +E++VMP+ ++ AP  FQ  +N
Subjt:  KERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYEFVVMPFDLTNAPTTFQSLMN

Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein1.9e-0725Show/hide
Query:  KGQVIEEVDEFEGEE-EIEETAEKVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTTNYGIVIENGTALQGNRVCK
        K  VI E++E E +   + +  E++      ++  +    M+  G I + +++V ID GAT NFI  +L    K+  + T    +++     +Q    C 
Subjt:  KGQVIEEVDEFEGEE-EIEETAEKVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVDDWKITVTNTTNYGIVIENGTALQGNRVCK

Query:  EVTIQLPVVIVLEKFFPLDLG--HLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGD-ASLTKAEVSMKMMISNWEEE
         + + +  V + E F  LDL    +D+IL   WL  +G   V+W N   +F+     + L  +   L +    +KM   N +E+
Subjt:  EVTIQLPVVIVLEKFFPLDLG--HLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGD-ASLTKAEVSMKMMISNWEEE

AT3G30770.1 Eukaryotic aspartyl protease family protein8.1e-0623.83Show/hide
Query:  YVSNVTKNIHPTTS--ANPKRRENLWLFITGKGQVIEEVDEFEGEEEIEETAEKVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLV
        Y   +  N+ P T    +   +  L  FI  +G      +E+EG  E  +T  +V+   +    F+    M+  G I   +++V+ID GAT+NFI  +L 
Subjt:  YVSNVTKNIHPTTS--ANPKRRENLWLFITGKGQVIEEVDEFEGEEEIEETAEKVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLV

Query:  DDWKITVTNTTNYGIVIENGTALQGNRVCKEVTIQLPVVIVLEKFFPLDLGHLDIILVMAW--LYTMGYMGVDWANLTVTFTQGNSKVVL-KGDASLTKA
           K+  + T    +++     +Q    C  + + +  V + E F  LDL   D+ +++ +     +    + W N   +F      V L   D  L + 
Subjt:  DDWKITVTNTTNYGIVIENGTALQGNRVCKEVTIQLPVVIVLEKFFPLDLGHLDIILVMAW--LYTMGYMGVDWANLTVTFTQGNSKVVL-KGDASLTKA

Query:  EVSMKMMISNWEEE
           +KM  S +E+E
Subjt:  EVSMKMMISNWEEE

ATMG00860.1 DNA/RNA polymerases superfamily protein1.5e-1249.37Show/hide
Query:  MVEWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMPGLNNKVADALPQISL
        MV W  P+N +ELRG LGL GYYRRF K+Y  +  PL +LL+KN+  W E A  AF  LK A+  L      ALP + L
Subjt:  MVEWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMPGLNNKVADALPQISL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGAAATTCAAGAAAAAAGGAAAAAAGGTTTATGTTTCAAATGTGACGAAAAATATTCACCCGACCACCAGTGCAAATCCAAAGAGAAGAGAGAACTTGTGGCTTTT
TATCACCGGAAAAGGCCAGGTGATTGAAGAAGTGGACGAGTTCGAAGGTGAAGAAGAAATAGAAGAAACTGCAGAAAAGGTGGAGCTAGCGCTTCGATGGATCCTAGGAT
TTTCCTCTCCCAGCACCATGAAATTGAAGGGGAAAATCGGGGAACTAGAGTTAATTGTGATGATAGATTGCGGAGCCACACATAATTTCATACATCGACAACTTGTGGAT
GATTGGAAAATAACTGTGACAAACACGACTAATTATGGAATTGTTATCGAAAATGGAACTGCTCTGCAAGGGAATAGGGTTTGTAAAGAGGTGACTATTCAATTACCAGT
GGTCATTGTGCTCGAAAAATTTTTTCCATTGGACTTGGGTCATTTAGACATCATTTTGGTAATGGCTTGGCTATATACCATGGGGTACATGGGAGTTGATTGGGCGAACC
TCACCGTTACATTCACCCAAGGGAATTCGAAAGTAGTACTGAAAGGAGATGCTTCCCTGACGAAGGCCGAAGTGTCTATGAAAATGATGATCAGTAACTGGGAGGAGGAA
GATCAAGGCTTCTTAATAGAATTTCAAAGCCTGATGCTGGAATTGAAAGTAGAAGGAAAAGTTGACCCACTGAAGTTGCAAGAAGAACAACCCAATTCCGCTAGTGACAA
AGAAAGACGGGAGTGGCGGTTTTGTGTCGACTATAGAAAGGTGAATCAAGCTACTATCTCGAAAAAATTCCCAATTCTCGTTATTGAAGAATTACTTGATGAATTACATG
GCTCGACATATTTTTCCAAACTGGATTTACGCTCGGGGTATCATCAGATAAGAATGAAGGAAGATGATATACCAAAAATCGCTTTGCGAACTCATGAGAGGCATTATGAA
TTTGTTGTGATGCCATTCGACCTCACTAACGCGCCGACCACATTTCAATCGTTAATGAATCAAGACACTAGGTTTCAGCAACCAGAGTGCAAGCGGATGGTAGAATGGTC
GATTCCAAGAAATATATCCGAATTGCGAGGAGTTTTAGGTCTGAAGGGATATTATCGAAGGTTTTTCAAGGATTATGAGGTTCTTGCTGTCCCGTTGGCACAATTGTTGC
AGAAGAATGCTTTTGTATGGAACGAGACAGCTACTGAAGCATTCGATAAACTGAAAAGGGCCATGCCCGGATTGAATAATAAGGTGGCAGATGCACTGCCACAGATCTCA
CTCGAAGTTGAACTGCAGATACTTTCGGCACCTGCATTGCTGGATAGTGAAATGGTCTTGAGGGAGGTTGGGCAAGACGAAGAACTACAGTTGATCATTGCAAAACTGAA
GGAGGACCCGAGTAGTGTGCTGAAATTCACTTGGGAGAATCGGTGGCTATTGTACAAAGGGCTCCTTCAATTGTTACCCATATCAGAAAGGATTTGGGATGATATATCCA
TGGATTTCATAGAGGGTCTGCCGAAGTCAAGGGGTTTCGATTCAATTATGGTGGTGGTAGATCAGTTATGTAAATACAACCACTTTGTAACCTTGTCTCACCTGTTTTCA
GCGAAGCAGGTGGCAGGGGTGTTTATTAAAGAGGTGGTTTGCCATCATGGATTCCCAAAATTAATAGTATCGGATAGAGATCAAAGCTTATTGAGCAATTTTTGGGTAAA
GTTGTTCAGACAACAAGGCACTGTGCTGAAATGTAGTATCACATATGATAGGCAAACTGAGAGGGTAAATCGCTGTCTCGAGACATACTTACGTTGTTTTTGTAATGAGA
AACCCCGAAGATGGAGTGAATGGATGTCATGGGCAGAGTATTGGTATAACATGACATTCCATGGAGCAATTAAAACTACACCCTTCTGCATAGTTTATGGCCGTGAGCCT
CCTCTTATTGCGTATGGGGACAAGAAAATCTCAAACGACACTTTAGATGAGCAGTTGGTTGAAAAGGATAGGGTGTTGGTGGCTTTAAAGGATCAGTTAGTTAGGGCGCA
AGACCAAAGAATTGGGGCGGTCGCTTATCGGTTGAAGTTACCCTCTTCAGCAAGGATACATAATGTGTTCCAAATTTCACAGTTAAAGAAGAAAGTAGGGAAAGATAAGG
TTGTTCAACCCATGTTACCGGAGTTGACGGAAGACGCTGAAATGAATGGACAACCTGAGATGATTATAGGAACACGTTGGAATTCAGAGAGAGGAAAAGAAGAGGTGTTG
GTCGAATGGGAAAACCAGCCAAAAATATTTCCCGAATTCCACCTTGAGCACAAGAGGAGGCTCAGTGCTGAAGGACATGAGAAGGCATGTGAGTTTCCCCGGGACCTAGT
ACTGAAGGACATGAGAAGGTACTTGGGCAAACCGCCATGTTCCTATGTGCACATAGAGGTTGATTGGCTGGTGCGTCGAGTTTTGCTCCGCATCGAGCCAGGGATAATTA
CGATGTTGGGTGTGTTGAGTCTTGCTCCACCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTGAAATTCAAGAAAAAAGGAAAAAAGGTTTATGTTTCAAATGTGACGAAAAATATTCACCCGACCACCAGTGCAAATCCAAAGAGAAGAGAGAACTTGTGGCTTTT
TATCACCGGAAAAGGCCAGGTGATTGAAGAAGTGGACGAGTTCGAAGGTGAAGAAGAAATAGAAGAAACTGCAGAAAAGGTGGAGCTAGCGCTTCGATGGATCCTAGGAT
TTTCCTCTCCCAGCACCATGAAATTGAAGGGGAAAATCGGGGAACTAGAGTTAATTGTGATGATAGATTGCGGAGCCACACATAATTTCATACATCGACAACTTGTGGAT
GATTGGAAAATAACTGTGACAAACACGACTAATTATGGAATTGTTATCGAAAATGGAACTGCTCTGCAAGGGAATAGGGTTTGTAAAGAGGTGACTATTCAATTACCAGT
GGTCATTGTGCTCGAAAAATTTTTTCCATTGGACTTGGGTCATTTAGACATCATTTTGGTAATGGCTTGGCTATATACCATGGGGTACATGGGAGTTGATTGGGCGAACC
TCACCGTTACATTCACCCAAGGGAATTCGAAAGTAGTACTGAAAGGAGATGCTTCCCTGACGAAGGCCGAAGTGTCTATGAAAATGATGATCAGTAACTGGGAGGAGGAA
GATCAAGGCTTCTTAATAGAATTTCAAAGCCTGATGCTGGAATTGAAAGTAGAAGGAAAAGTTGACCCACTGAAGTTGCAAGAAGAACAACCCAATTCCGCTAGTGACAA
AGAAAGACGGGAGTGGCGGTTTTGTGTCGACTATAGAAAGGTGAATCAAGCTACTATCTCGAAAAAATTCCCAATTCTCGTTATTGAAGAATTACTTGATGAATTACATG
GCTCGACATATTTTTCCAAACTGGATTTACGCTCGGGGTATCATCAGATAAGAATGAAGGAAGATGATATACCAAAAATCGCTTTGCGAACTCATGAGAGGCATTATGAA
TTTGTTGTGATGCCATTCGACCTCACTAACGCGCCGACCACATTTCAATCGTTAATGAATCAAGACACTAGGTTTCAGCAACCAGAGTGCAAGCGGATGGTAGAATGGTC
GATTCCAAGAAATATATCCGAATTGCGAGGAGTTTTAGGTCTGAAGGGATATTATCGAAGGTTTTTCAAGGATTATGAGGTTCTTGCTGTCCCGTTGGCACAATTGTTGC
AGAAGAATGCTTTTGTATGGAACGAGACAGCTACTGAAGCATTCGATAAACTGAAAAGGGCCATGCCCGGATTGAATAATAAGGTGGCAGATGCACTGCCACAGATCTCA
CTCGAAGTTGAACTGCAGATACTTTCGGCACCTGCATTGCTGGATAGTGAAATGGTCTTGAGGGAGGTTGGGCAAGACGAAGAACTACAGTTGATCATTGCAAAACTGAA
GGAGGACCCGAGTAGTGTGCTGAAATTCACTTGGGAGAATCGGTGGCTATTGTACAAAGGGCTCCTTCAATTGTTACCCATATCAGAAAGGATTTGGGATGATATATCCA
TGGATTTCATAGAGGGTCTGCCGAAGTCAAGGGGTTTCGATTCAATTATGGTGGTGGTAGATCAGTTATGTAAATACAACCACTTTGTAACCTTGTCTCACCTGTTTTCA
GCGAAGCAGGTGGCAGGGGTGTTTATTAAAGAGGTGGTTTGCCATCATGGATTCCCAAAATTAATAGTATCGGATAGAGATCAAAGCTTATTGAGCAATTTTTGGGTAAA
GTTGTTCAGACAACAAGGCACTGTGCTGAAATGTAGTATCACATATGATAGGCAAACTGAGAGGGTAAATCGCTGTCTCGAGACATACTTACGTTGTTTTTGTAATGAGA
AACCCCGAAGATGGAGTGAATGGATGTCATGGGCAGAGTATTGGTATAACATGACATTCCATGGAGCAATTAAAACTACACCCTTCTGCATAGTTTATGGCCGTGAGCCT
CCTCTTATTGCGTATGGGGACAAGAAAATCTCAAACGACACTTTAGATGAGCAGTTGGTTGAAAAGGATAGGGTGTTGGTGGCTTTAAAGGATCAGTTAGTTAGGGCGCA
AGACCAAAGAATTGGGGCGGTCGCTTATCGGTTGAAGTTACCCTCTTCAGCAAGGATACATAATGTGTTCCAAATTTCACAGTTAAAGAAGAAAGTAGGGAAAGATAAGG
TTGTTCAACCCATGTTACCGGAGTTGACGGAAGACGCTGAAATGAATGGACAACCTGAGATGATTATAGGAACACGTTGGAATTCAGAGAGAGGAAAAGAAGAGGTGTTG
GTCGAATGGGAAAACCAGCCAAAAATATTTCCCGAATTCCACCTTGAGCACAAGAGGAGGCTCAGTGCTGAAGGACATGAGAAGGCATGTGAGTTTCCCCGGGACCTAGT
ACTGAAGGACATGAGAAGGTACTTGGGCAAACCGCCATGTTCCTATGTGCACATAGAGGTTGATTGGCTGGTGCGTCGAGTTTTGCTCCGCATCGAGCCAGGGATAATTA
CGATGTTGGGTGTGTTGAGTCTTGCTCCACCCTAA
Protein sequenceShow/hide protein sequence
MVKFKKKGKKVYVSNVTKNIHPTTSANPKRRENLWLFITGKGQVIEEVDEFEGEEEIEETAEKVELALRWILGFSSPSTMKLKGKIGELELIVMIDCGATHNFIHRQLVD
DWKITVTNTTNYGIVIENGTALQGNRVCKEVTIQLPVVIVLEKFFPLDLGHLDIILVMAWLYTMGYMGVDWANLTVTFTQGNSKVVLKGDASLTKAEVSMKMMISNWEEE
DQGFLIEFQSLMLELKVEGKVDPLKLQEEQPNSASDKERREWRFCVDYRKVNQATISKKFPILVIEELLDELHGSTYFSKLDLRSGYHQIRMKEDDIPKIALRTHERHYE
FVVMPFDLTNAPTTFQSLMNQDTRFQQPECKRMVEWSIPRNISELRGVLGLKGYYRRFFKDYEVLAVPLAQLLQKNAFVWNETATEAFDKLKRAMPGLNNKVADALPQIS
LEVELQILSAPALLDSEMVLREVGQDEELQLIIAKLKEDPSSVLKFTWENRWLLYKGLLQLLPISERIWDDISMDFIEGLPKSRGFDSIMVVVDQLCKYNHFVTLSHLFS
AKQVAGVFIKEVVCHHGFPKLIVSDRDQSLLSNFWVKLFRQQGTVLKCSITYDRQTERVNRCLETYLRCFCNEKPRRWSEWMSWAEYWYNMTFHGAIKTTPFCIVYGREP
PLIAYGDKKISNDTLDEQLVEKDRVLVALKDQLVRAQDQRIGAVAYRLKLPSSARIHNVFQISQLKKKVGKDKVVQPMLPELTEDAEMNGQPEMIIGTRWNSERGKEEVL
VEWENQPKIFPEFHLEHKRRLSAEGHEKACEFPRDLVLKDMRRYLGKPPCSYVHIEVDWLVRRVLLRIEPGIITMLGVLSLAPP