; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027392 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027392
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr8:97756..100019
RNA-Seq ExpressionLag0027392
SyntenyLag0027392
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_028090832.1 uncharacterized protein LOC114291041 [Camellia sinensis]1.2e-9136.01Show/hide
Query:  MKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVLFC----------------------------
        M +LCWN RGL NPR+ ++L++ ++     +VF+ E+KI       ++  LGF   F V   G +GGL L                              
Subjt:  MKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVLFC----------------------------

Query:  ---------PKVEERPSSWTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKERLDRF
                 P++ +R +SW LL RL + +  PW+  GDFNEILF  EK G   R+P QM+ F+ V+    L  +   G  FTW  +    + ++ERLDR 
Subjt:  ---------PKVEERPSSWTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKERLDRF

Query:  FVADLDKLKVSKVTVKHLNLHHSDHRPILLEMFWEKGYQWN-RNLIKNVKFEEAWAYSEECKSILQGYWKDSGHSNPFNLNTKVQESIKKMAFWNR----
        F  D  +L      V  +    S+H PI +++   K  + +  +  +  +FE  W     C+ ++   W  S ++N  +L + +      +  W+R    
Subjt:  FVADLDKLKVSKVTVKHLNLHHSDHRPILLEMFWEKGYQWN-RNLIKNVKFEEAWAYSEECKSILQGYWKDSGHSNPFNLNTKVQESIKKMAFWNR----

Query:  CRLKGSLKGAINRVEQEILKIRSNSGEDRSEDLIQKERHLESLLEEEEIYWKMRSREEWLNWGDKNTK-----------------------CWIDDEEEI
        C +K  LK   +++ Q +L + SN  +D +E + +    ++ LLE+E I+W  R+R  WL  GD+NT                         W +D+  +
Subjt:  CRLKGSLKGAINRVEQEILKIRSNSGEDRSEDLIQKERHLESLLEEEEIYWKMRSREEWLNWGDKNTK-----------------------CWIDDEEEI

Query:  RVTAVRYFRDLFSSNRQNLSDIEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYMSQ
            V YF++LFS+      ++  +L+ I  ++S+  A  L R F+E E+  AL  M PTKAPGPDG  ALFFQK+W+ +G  V SV L VLN  + +S 
Subjt:  RVTAVRYFRDLFSSNRQNLSDIEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYMSQ

Query:  INKTYIALIPKCKNPKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSGKRG
        IN T+I LIPK KNPK M + RPISLCNV+YKII+K LANRLKEVL ++IS  Q+AFIPGRLITDNA++ FE  H + NKR+GK G
Subjt:  INKTYIALIPKCKNPKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSGKRG

XP_030495126.1 uncharacterized protein LOC115710915 [Cannabis sativa]4.5e-8635.15Show/hide
Query:  MKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVLFC----------------------------
        M  L WNV+GL NP    AL   V+ ++P +VF+SE++      + +++ LG++ CF V + G+SGGL L                              
Subjt:  MKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVLFC----------------------------

Query:  --------PKVEERPSSWTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKERLDRFF
                P   +R  SW LL R+   +N PW+ GGDFNEI  + EK GGG ++   M  F   I+ C L+++D  G  FTW    +  + + ERLDR  
Subjt:  --------PKVEERPSSWTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKERLDRFF

Query:  VADLDKLKVSKVTVKHLNLHHSDHRPILLEMFW-----EKGYQWNRNLIKNVKFEEAWAYSEECKSILQGYWKDSGHSNPFNLNTKVQESIKK----MAF
        V +      S   VKHL+   SDH P+LL         +K  +W         +E+AWA  EEC+ I+Q  WK+    N     T + E I      +  
Subjt:  VADLDKLKVSKVTVKHLNLHHSDHRPILLEMFW-----EKGYQWNRNLIKNVKFEEAWAYSEECKSILQGYWKDSGHSNPFNLNTKVQESIKK----MAF

Query:  WNRCRLKGSLKGAINRVEQEILKIRSNSGEDRSEDLIQKERHLESLLEEEEIYWKMRSREEWLNWGDKNTK-----------------------CWIDDE
        WN+ + K +L   I  +++E+ K  ++   D    L   E+ L   L +EE++WK RSR  WL  GD+NT+                        W   +
Subjt:  WNRCRLKGSLKGAINRVEQEILKIRSNSGEDRSEDLIQKERHLESLLEEEEIYWKMRSREEWLNWGDKNTK-----------------------CWIDDE

Query:  EEIRVTAVRYFRDLFSSNRQNLSDIEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEY
        E+I  T   +F+DLFS+     +  E++   +  + S  +   L  +FT  +I   L  ++  KAP  DG   LF++ +WE IG+D+  VCL++LN  + 
Subjt:  EEIRVTAVRYFRDLFSSNRQNLSDIEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEY

Query:  MSQINKTYIALIPKCKNPKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSG
          Q+NKT + LIPK K+PK + D RPISLCNV YKIIAK LANR+K+ LK +IS NQ+A I GRLI DNAILGFE +H +   R G
Subjt:  MSQINKTYIALIPKCKNPKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSG

XP_042939444.1 uncharacterized protein LOC122274474 [Carya illinoinensis]8.7e-9037.25Show/hide
Query:  MKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVLFCPKVEERPSSWTLLSRLKSLYNCPWIVGG
        M  L WN RGL NPR  Q L   V+  +P +VF+SE+K  + R  K+K+ LGF+ CFSV    R  G     P   +RP SW LL  LK   N PW+  G
Subjt:  MKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVLFCPKVEERPSSWTLLSRLKSLYNCPWIVGG

Query:  DFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKERLDRFFVADLDKLKVSKVTVKHLNLHHSDHRPILLEMFWEKG
        DFNEI    EK G   R  +QM +F+  ++ C L  +   GD FTW  + +     KERLDR           +  +V HL+   SDH+ +L++      
Subjt:  DFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKERLDRFFVADLDKLKVSKVTVKHLNLHHSDHRPILLEMFWEKG

Query:  YQWNRNLIKNVKFEEAWAYSEECKSILQGYWKDSGHSNPFNLNTKVQESIK-KMAFWNRCRLKGSLKGAINRVEQEILKIRSNSGEDRSEDLIQKERHLE
            R +    +FE AW+   EC+ I++  W+ S  +   +   +     K K+  W+R + +   K   N+ E   L    N GE  SE++ + ++ + 
Subjt:  YQWNRNLIKNVKFEEAWAYSEECKSILQGYWKDSGHSNPFNLNTKVQESIK-KMAFWNRCRLKGSLKGAINRVEQEILKIRSNSGEDRSEDLIQKERHLE

Query:  SLLEEEEIYWKMRSREEWLNWGDKNTKCW-----------------------IDDEEEIRVTAVRYFRDLFSSNRQNLSDIEDVLEAIDCKISEEEALWL
        S+++ E + W+ R+++ WL  GD+NTK +                         D + I  T + +F +LF+S+    S I+D L  +   I+++    L
Subjt:  SLLEEEEIYWKMRSREEWLNWGDKNTKCW-----------------------IDDEEEIRVTAVRYFRDLFSSNRQNLSDIEDVLEAIDCKISEEEALWL

Query:  DRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYMSQINKTYIALIPKCKNPKHMKDLRPISLCNVIYKIIAKALANR
           FTE E+ +A   M+P  +PGPDG  ALFFQ+YW+ +G  V    LEVLN G++ + +N+T IALIPK  NP  + D RPISLCNV+YKIIAK LANR
Subjt:  DRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYMSQINKTYIALIPKCKNPKHMKDLRPISLCNVIYKIIAKALANR

Query:  LKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSGKRG
        LK++L  IISP Q AF+PGRLITDN I+ FE +H +  +  G+ G
Subjt:  LKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSGKRG

XP_042962672.1 uncharacterized protein LOC122296942 [Carya illinoinensis]3.1e-8735.28Show/hide
Query:  MKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVLF-----------------------CPKVE-
        MK+L WN+RGL NPR+ ++LR  +    P+I+F+ E+K+   R +  K+ LGF  CF V S GRSGGL L                        C  VE 
Subjt:  MKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVLF-----------------------CPKVE-

Query:  ------------ERPSSWTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKERLDRFF
                    +R   W LL  L      PWIV GDFNEIL   EK GG  R+  QM EF+EV++ C L+ +   G  FTW      +  +KERLDRF 
Subjt:  ------------ERPSSWTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKERLDRFF

Query:  VADLDKLKVSKVTVKHLNLHHSDHRPILLEMFWEKGYQWNRNLIKNVKFEEAWAYSEECKSILQGYW-KDSGHSNPFNLNTKVQESIKKMAFWNRCRLKG
           L       + V H    +SDH P+ L+    +G    R   +  +FE  W    EC SI++  W +  G  +   +  ++     ++  WN+    G
Subjt:  VADLDKLKVSKVTVKHLNLHHSDHRPILLEMFWEKGYQWNRNLIKNVKFEEAWAYSEECKSILQGYW-KDSGHSNPFNLNTKVQESIKKMAFWNRCRLKG

Query:  SLKGAINRVEQEILKIRSN-SGEDRSEDLIQKERHLESLLEEEEIYWKMRSREEWLNWGD---------------KNTKCWIDDE-------EEIRVTAV
         ++  +   ++ +  +  N SG+   E+  Q    ++  LE +E+ WK RSR +WL  GD               KN+   + DE       +++ V   
Subjt:  SLKGAINRVEQEILKIRSN-SGEDRSEDLIQKERHLESLLEEEEIYWKMRSREEWLNWGD---------------KNTKCWIDDE-------EEIRVTAV

Query:  RYFRDLFSSNRQNLSDIEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYMSQINKTY
         YF+ LF++   +  D+EDVL  ++ +++ E    L + +   E+  ALK M P+KAPGPDG   LFFQKYW  +G  + +  L  LN G + S +N T+
Subjt:  RYFRDLFSSNRQNLSDIEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYMSQINKTY

Query:  IALIPKCKNPKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSGKRG
        I LIPK  +P  + D RPISLCNV+YKI++K +ANRLK VL  IIS +Q+AF+PGR I+DN ++ +E +H + NKR G++G
Subjt:  IALIPKCKNPKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSGKRG

XP_042980077.1 uncharacterized protein LOC122310261 [Carya illinoinensis]7.4e-8936.64Show/hide
Query:  MKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVLFC----------------------------
        MK +CWN  GL NP   +ALR  + R  P ++F+ E+K+       LK  LGF  CFSV S GRSGGL L                              
Subjt:  MKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVLFC----------------------------

Query:  --------PKVEERPSSWTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKERLDRFF
                P    R  +W L+  L S+   PW+VGGD NE+L   EKRGG  R   Q++ F+EV+  C L+ +   G  FTW+      + + ERLDRF 
Subjt:  --------PKVEERPSSWTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKERLDRFF

Query:  VADLDKLKVSKVTVKHLNLHHSDHRPILLEMFWEKGYQWNRNLIKNVKFEEAWAYSEECKSILQGYWKDSGHSNPF-NLNTKVQESIKKMAFWNRCRLKG
          DL       + V+H N+ HSDH PIL E      +Q      K  +FE  W   E+C+ I++  W + G +     L   +Q   +++  WNR    G
Subjt:  VADLDKLKVSKVTVKHLNLHHSDHRPILLEMFWEKGYQWNRNLIKNVKFEEAWAYSEECKSILQGYWKDSGHSNPF-NLNTKVQESIKKMAFWNRCRLKG

Query:  SLKGAINRVEQEILKIR-SNSGEDRSEDLIQKERHLESLLEEEEIYWKMRSREEWLNWGDKNTKCWIDDEEEIRVTAVR------YFRDLFSSNRQNLSD
         +K  +N    E  K + S+S +   E L Q    ++  LE EE+ W+ RSR +W     +  K   DD+  +   A R      +F DLF+S  Q   D
Subjt:  SLKGAINRVEQEILKIR-SNSGEDRSEDLIQKERHLESLLEEEEIYWKMRSREEWLNWGDKNTKCWIDDEEEIRVTAVR------YFRDLFSSNRQNLSD

Query:  IEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYMSQINKTYIALIPKCKNPKHMKDL
         E+VL  +D +++ +  + L + F   E+  AL  M PTKAPGPDG  ALFFQ YW  +G  V +  L+ LN G   SQ+N+T I LI K K  + + D 
Subjt:  IEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYMSQINKTYIALIPKCKNPKHMKDL

Query:  RPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSGKRG
        RPISLCNV+YK+ +K + NRLK  L  IIS +Q AF+ GRLI+DN ++ +E ++ + NKR GK+G
Subjt:  RPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSGKRG

TrEMBL top hitse value%identityAlignment
A0A2N9GII4 Uncharacterized protein3.8e-9135.42Show/hide
Query:  QSPPDAMKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVLFC----------------------
        Q  P +MK+L WN +GL NP    +L   V+   P+++F+ E+K+G  + + +++ LGF   F V S GRS GL L                        
Subjt:  QSPPDAMKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVLFC----------------------

Query:  --------------PKVEERPSSWTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKE
                      P+ + +  SW LL +L    + PW+  GDFNEIL  +EKRG   R  ++M EF+EV+N C    +   G  FTW  +   ++ +KE
Subjt:  --------------PKVEERPSSWTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKE

Query:  RLDRFFVADLDKLKV-SKVTVKHLNLHHSDHRPILLEMFWEKGYQWNRNLIKNVKFEEAWAYSEECKSILQGYWKDS-GHSNP-FNLNTKVQESIKKMAF
        RLDR  VA L    + + ++V HL +  SDH PIL+E   ++     RN  +  +FEE WA   +C+++++G W++  G  +P F L  K++     +A 
Subjt:  RLDRFFVADLDKLKV-SKVTVKHLNLHHSDHRPILLEMFWEKGYQWNRNLIKNVKFEEAWAYSEECKSILQGYWKDS-GHSNP-FNLNTKVQESIKKMAF

Query:  WNRCRLKGSLKGAINRVEQEILKIRSNSGEDRSEDLIQKERHLESLLEEEEIYWKMRSREEWLNWGDKNTKC-----------------------WIDDE
        W++    GS      R E        + G++RS     KE  + SLL  +EI+WK RSR  WL  GD NTK                        WI + 
Subjt:  WNRCRLKGSLKGAINRVEQEILKIRSNSGEDRSEDLIQKERHLESLLEEEEIYWKMRSREEWLNWGDKNTKC-----------------------WIDDE

Query:  EEIRVTAVRYFRDLFSSNRQNLSDIEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEY
         +++  + +YF+D+F+S+      IE+ +E +D  ++ E    L R F   E+ KA+  M P+K+PGPDG    FFQK+W  +G +V+   L VL+ G  
Subjt:  EEIRVTAVRYFRDLFSSNRQNLSDIEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEY

Query:  MSQINKTYIALIPKCKNPKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSGKRGA
        + + N T+IALIPK K P+ M + RPISLCNVI+K+I+K L NRLK VL ++IS  Q AF+PGRLITDN ++ +E I+++ +KR G+ G+
Subjt:  MSQINKTYIALIPKCKNPKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSGKRGA

A0A2N9HDH5 Uncharacterized protein6.5e-9135.36Show/hide
Query:  SPPDAMKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVLF---------------------C--
        +PP  M I+ WN RGL N RA  AL   V+   PKI+F+ E+K+   + + ++V L F +CF+V S GRSGGL L                      C  
Subjt:  SPPDAMKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVLF---------------------C--

Query:  -------------PKVEERPSSWTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKER
                     P    R  SW LL +L S+ + PW++ GDFNEIL  DE+ G    + + M EF EVIN CGL  +   G  FTW    + ++ +++R
Subjt:  -------------PKVEERPSSWTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKER

Query:  LDRFFVADLDKLKVSKVTVKHLNLHHSDHRPILLEMFWEKGYQWNRNLIKNVKFEEAWAYSEECKSILQGYWKDSG--HSNPFNLNTKVQESIKKMAFWN
        LDR   ++         T+ HL   +SDH PILL    + G        +  KFEE W+   EC+ I+Q  W       S  F +  K+++  +++  W 
Subjt:  LDRFFVADLDKLKVSKVTVKHLNLHHSDHRPILLEMFWEKGYQWNRNLIKNVKFEEAWAYSEECKSILQGYWKDSG--HSNPFNLNTKVQESIKKMAFWN

Query:  RCRLKGSLKGAINRVEQEILK----IRSNSGEDRSEDLIQKERHLESLLEEEEIYWKMRSREEWLNWGDKNTK-----------------------CWID
            KG      +++  + L     +  N   + +  ++  ++ +  LL  EE++W+ RSR  WL  GD+NTK                        W  
Subjt:  RCRLKGSLKGAINRVEQEILK----IRSNSGEDRSEDLIQKERHLESLLEEEEIYWKMRSREEWLNWGDKNTK-----------------------CWID

Query:  DEEEIRVTAVRYFRDLFSSNRQNLSDIEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKG
         E +I   AV YF+++F++++   + I++ L  ++  +SEE    L + +T  E+  AL  M P+KAPG DG  + FFQKYW  +G  + +  L +LN G
Subjt:  DEEEIRVTAVRYFRDLFSSNRQNLSDIEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKG

Query:  EYMSQINKTYIALIPKCKNPKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSGKRG
        + + +IN TY++LIPK KNP+ + D RPISLCNV+YKII+K LANRLK VL  IIS +Q+AF+PGRLITDN  + FE +H +  KR G++G
Subjt:  EYMSQINKTYIALIPKCKNPKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSGKRG

A0A2N9IPS8 Reverse transcriptase domain-containing protein7.7e-9232.94Show/hide
Query:  QEGANLNTAKTGSVPG-----NKAGKKKETLKNSRANPTGVKAWKRKAHLEKGNREVSDMDLDCKIRKHERQEEEVNQVNKRQSPPDAMKILCWNVRGLR
        QE +N+   +T  +P       + G     + N+     G  AWKR A  EKG         + ++  H+R  E     +   +PP  M++L WN +GL 
Subjt:  QEGANLNTAKTGSVPG-----NKAGKKKETLKNSRANPTGVKAWKRKAHLEKGNREVSDMDLDCKIRKHERQEEEVNQVNKRQSPPDAMKILCWNVRGLR

Query:  NPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVLFC-------------------------------------PKV
        N    + L   ++   P ++F+SE+++     ++L+V + FD  F V   G  GGL +                                       P+ 
Subjt:  NPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVLFC-------------------------------------PKV

Query:  EERPSSWTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKERLDRFFVADLDKLKVSK
         +R  SW LL  L  L + PW+  GDFNEIL  +E+ G G R   Q+ +F+E +  CGL  +   G+ +TW R     + +  RLDR   +         
Subjt:  EERPSSWTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKERLDRFFVADLDKLKVSK

Query:  VTVKHLNLHHSDHRPILLEMFWEKGYQWNRNLIKNVKFEEAWAYSEECKSILQGYWKD--SGHSNPFNLNTKVQESIKKMAFWNRCRLKGSLKGAINRVE
          V HL + +SDH PILL++    G    R   K  +FE  W   E+C+ ++   W D  +  S  F +  K++     +  W+R R  GSL  +I R  
Subjt:  VTVKHLNLHHSDHRPILLEMFWEKGYQWNRNLIKNVKFEEAWAYSEECKSILQGYWKD--SGHSNPFNLNTKVQESIKKMAFWNRCRLKGSLKGAINRVE

Query:  QEILKIRSNSGEDRSEDLIQKERHLESLLEEEEIYWKMRSREEWLNWGDKNTK-----------------------CWIDDEEEIRVTAVRYFRDLFSSN
        +++  + + +    S  +++ +  L  LLE+EEI+W+ RSR  W++ GDKNTK                        W  ++ +I   AV YF+ +F+S+
Subjt:  QEILKIRSNSGEDRSEDLIQKERHLESLLEEEEIYWKMRSREEWLNWGDKNTK-----------------------CWIDDEEEIRVTAVRYFRDLFSSN

Query:  RQNLSDIEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYMSQINKTYIALIPKCKNP
          +   I  VL+ ++  ++      L  +FT+ E+  ALK M PTKAPGPDG  A+F+Q YW+ +G +V    L +L+ G  + +IN T+IALIPK KNP
Subjt:  RQNLSDIEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYMSQINKTYIALIPKCKNP

Query:  KHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSGKRG
        +++ D RPISLCNVIYKI++K LANRLK+VL  +IS  Q+AF+PGRLITDN ++ FE +H+++ KR GK+G
Subjt:  KHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSGKRG

A0A7N2LIH6 Uncharacterized protein4.1e-9336.29Show/hide
Query:  SPPDAMKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVLF-----------C------------
        +PP +M IL WN RGL    A + L  EV++  P +VF+ E+K    + +  +  LGF     V S GRSGGL L            C            
Subjt:  SPPDAMKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVLF-----------C------------

Query:  --------------PKVEERPSSWTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKE
                      P   +R +SW LL  L +    PW+V GDFNEI+  DEK G   R   QMD F+EV++ CGL  +   G  FTW            
Subjt:  --------------PKVEERPSSWTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKE

Query:  RLDRFFVADLDKLKVSKVTVKHLNLHHSDHRPILLEMFWEKGYQWNRNLIKNVKFEEAWAYSEECKSILQGYWKDSGHSNPFNLNTKVQESIKKMAFWNR
        RLDR    +   L   +  V H+++  SDH   LL +F  K     R   K   FEE W   EECK I++  W      +   +  +++   K +  WN+
Subjt:  RLDRFFVADLDKLKVSKVTVKHLNLHHSDHRPILLEMFWEKGYQWNRNLIKNVKFEEAWAYSEECKSILQGYWKDSGHSNPFNLNTKVQESIKKMAFWNR

Query:  CRLKGSLKGAINRVEQEILKIRS-NSGEDRSEDLIQKERHLESLLEEEEIYWKMRSREEWLNWGDKNTK-----------------------CWIDDEEE
               KG I + +  + ++ S N   + +E++   ++ +  L   EE+ WK RSR  WL +GDKN+K                        W +D+E 
Subjt:  CRLKGSLKGAINRVEQEILKIRS-NSGEDRSEDLIQKERHLESLLEEEEIYWKMRSREEWLNWGDKNTK-----------------------CWIDDEEE

Query:  IRVTAVRYFRDLFSSNRQNLSDIEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYMS
             + YF+D++SSN+    D+   LEA+D +++ E    L ++F   E+ +AL+ M PTKAPGPDG   +F+QKYW+ +G  V +  L+ LN G    
Subjt:  IRVTAVRYFRDLFSSNRQNLSDIEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYMS

Query:  QINKTYIALIPKCKNPKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSGKRG
         INKTYI LIPK KNP+ + + RPISLCNVIYKII+K LANRLK+VL  +I   Q+AF+PGR+ITDN I+ FE +H+IN +R GK G
Subjt:  QINKTYIALIPKCKNPKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSGKRG

A0A803PBM9 Uncharacterized protein2.0e-9235.88Show/hide
Query:  PDAMKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVLF------C-------------------
        P  MK+L WNV+GL NP   + L+  V R  P++VFISES++   +A+ L+V LG+D CF V + G+SGGL+L       C                   
Subjt:  PDAMKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVLF------C-------------------

Query:  -----------PKVEERPSSWTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKERLD
                   P   +R  SW LL+R+  +Y+ PW++GGDFNEIL   EK GG  +    ++ F+  +N   L++V+  G  +TW    KN+  + ERLD
Subjt:  -----------PKVEERPSSWTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKERLD

Query:  RFFVADLDKLKVSKVTVKHLNLHHSDHRPILLEMFW-----EKGYQWNRNLIKNVKFEEAWAYSEECKSILQGYWKDSGHSN-PFNLNTKVQESIKKMAF
        R            K  V HL+   SDH P+LL         EKG +W+        FE AWA  E+C  I++  W   G  N    L  K+    K +  
Subjt:  RFFVADLDKLKVSKVTVKHLNLHHSDHRPILLEMFW-----EKGYQWNRNLIKNVKFEEAWAYSEECKSILQGYWKDSGHSN-PFNLNTKVQESIKKMAF

Query:  WNRCRLKGSLKGAINRVEQEILKIRSNSGEDRSEDLIQKERHLESLLEEEEIYWKMRSREEWLNWGDKNTKC-----------------------WIDDE
        WN+ R K  +K  +   E++I  +  ++     + L   E+    LL++EE +W+ RSR  WL  GD+NTK                        W+   
Subjt:  WNRCRLKGSLKGAINRVEQEILKIRSNSGEDRSEDLIQKERHLESLLEEEEIYWKMRSREEWLNWGDKNTKC-----------------------WIDDE

Query:  EEIRVTAVRYFRDLFSSNRQNLSDIEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEY
        + +   A  YF+ +F+SN  +++D+E+    +  KIS E   +L   FT+ ++  A++ + P KAPG DG   LF++ YW +IG++V  VCL +LN+G  
Subjt:  EEIRVTAVRYFRDLFSSNRQNLSDIEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEY

Query:  MSQINKTYIALIPKCKNPKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSGKR
        +++IN T I LIPK + P  M + RPISLCNVIYKI+AK LA R K  L   IS  Q+AF+ GRLI DNAI+GFE +H +  +R G R
Subjt:  MSQINKTYIALIPKCKNPKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSGKR

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein2.6e-2024.84Show/hide
Query:  LLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGL----KKVDPAGDFFTWYRSPKNKSSLKERLDRFF--VADLDKLKVSKVT
        +LS L+   +   ++ GDFN  L + + R    +  K   E    ++   L    + + P    +T++ +P +  S   ++D      A L K K +++ 
Subjt:  LLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGL----KKVDPAGDFFTWYRSPKNKSSLKERLDRFF--VADLDKLKVSKVT

Query:  VKHLNLHHSDHRPILLEM----------------------FW---------EKGYQWNRNLIKNVKFEEAW-AYSEECKS---ILQGYWKDSGHSNPFNL
          +L    SDH  I LE+                      +W         +  ++ N N  K+  ++  W A+   C+     L  Y +    S    L
Subjt:  VKHLNLHHSDHRPILLEM----------------------FW---------EKGYQWNRNLIKNVKFEEAW-AYSEECKS---ILQGYWKDSGHSNPFNL

Query:  NTKVQESIKKMAFWNRCRLKGSLKGAINRVEQEILKIRSNSGEDRSEDLIQKERHLESLLEE-----EEIYWKMRSREEWLNWGD--KNTKCWI-DDEEE
         ++++E  K+     +   K S +       QEI KIR+   E  ++  +QK     S   E     +    ++  ++   N  D  KN K  I  D  E
Subjt:  NTKVQESIKKMAFWNRCRLKGSLKGAINRVEQEILKIRSNSGEDRSEDLIQKERHLESLLEE-----EEIYWKMRSREEWLNWGD--KNTKCWI-DDEEE

Query:  IRVTAVRYFRDLFSSNRQNLSDIEDVLEAIDC-KISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYM
        I+ T   Y++ L+++  +NL +++  L+     ++++EE   L+R  T  EIV  +  +   K+PGPDG  A F+Q+Y EE+   ++ +   +  +G   
Subjt:  IRVTAVRYFRDLFSSNRQNLSDIEDVLEAIDC-KISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYM

Query:  SQINKTYIALIPK-CKNPKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPG
        +   +  I LIPK  ++    ++ RPISL N+  KI+ K LANR+++ +K +I  +Q  FIPG
Subjt:  SQINKTYIALIPK-CKNPKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPG

P08548 LINE-1 reverse transcriptase homolog5.6e-2323.71Show/hide
Query:  MKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGL-VLFCPKVEERPSS------------------
        + I   NV GL  P     L   +Q+ KP I  I ES +      +LKV  G+   F      +  G+ +LF   +  +P+                   
Subjt:  MKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGL-VLFCPKVEERPSS------------------

Query:  -------------------WTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKV----DPAGDFFTWYRSPKNKSSLKE
                              L+ + +L +   IV GDFN  L V + R    +  K++ +    I    L  +     P    +T++ S     S   
Subjt:  -------------------WTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKV----DPAGDFFTWYRSPKNKSSLKE

Query:  RLDRFFVADLDKLKVSKVTVKHLNLHHSDHRPILLEMFWEKGYQWNRNL--------IKNVKFEEAWAYSEECKSIL----QGYWKDSGHSNPFNLNTKV
        ++D       +  K  K+ +  +    SDH  I +E+        NRNL        + N+  ++ W   E  K I     Q   +D+ + N ++    V
Subjt:  RLDRFFVADLDKLKVSKVTVKHLNLHHSDHRPILLEMFWEKGYQWNRNL--------IKNVKFEEAWAYSEECKSIL----QGYWKDSGHSNPFNLNTKV

Query:  QES--IKKMAFWNRCRLK--GSLKGAINRVE------------QEILKIRSNSGEDRSEDLIQK---------------ERHLESLLEEEEIYWKMRSRE
             I   AF  +   +   +L G + ++E            +EI KIR+   E  ++ +IQ+               ++ L +L  ++ +   + S  
Subjt:  QES--IKKMAFWNRCRLK--GSLKGAINRVE------------QEILKIRSNSGEDRSEDLIQK---------------ERHLESLLEEEEIYWKMRSRE

Query:  EWLNWGDKNTKCWIDDEEEIRVTAVRYFRDLFSSNRQNLSDIEDVLEAIDC-KISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEE
           N  D+ T     D  EI+     Y++ L+S   +NL +I+  LEA    ++S++E   L+R  +  EI   ++ +   K+PGPDG  + F+Q + EE
Subjt:  EWLNWGDKNTKCWIDDEEEIRVTAVRYFRDLFSSNRQNLSDIEDVLEAIDC-KISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEE

Query:  IGKDVVSVCLEVLNKGEYMSQINKTYIALIPK-CKNPKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPG
        +   ++++   +  +G   +   +  I LIPK  K+P   ++ RPISL N+  KI+ K L NR+++ +K II  +Q  FIPG
Subjt:  IGKDVVSVCLEVLNKGEYMSQINKTYIALIPK-CKNPKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPG

P11369 LINE-1 retrotransposable element ORF2 protein1.1e-1830.2Show/hide
Query:  LKGAINRVEQEILKIRSNSGEDRSEDLIQK-ERHLESLLE---EEEIYWKMRSREEWLNWGDKNTKCWIDDEEEIRVTAVRYFRDLFSSNRQNLSDIEDV
        L+G IN+VE      R N       + I K ++ L  L +   ++ +  K+R+ +     GD  T     D EEI+ T   +++ L+S+  +NL +++  
Subjt:  LKGAINRVEQEILKIRSNSGEDRSEDLIQK-ERHLESLLE---EEEIYWKMRSREEWLNWGDKNTKCWIDDEEEIRVTAVRYFRDLFSSNRQNLSDIEDV

Query:  LEAIDC-KISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYMSQINKTYIALIPK-CKNPKHMKDLRP
        L+     K+++++   L+   +  EI   +  +   K+PGPDG  A F+Q + E++   +  +  ++  +G   +   +  I LIPK  K+P  +++ RP
Subjt:  LEAIDC-KISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYMSQINKTYIALIPK-CKNPKHMKDLRP

Query:  ISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAIN
        ISL N+  KI+ K LANR++E +K II P+Q  FIPG     N       IH IN
Subjt:  ISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAIN

P14381 Transposon TX1 uncharacterized 149 kDa protein1.6e-2525.81Show/hide
Query:  IVGGDFNEILFVDEKRGGGCRAPKQMDE----FKEVINCCGL----KKVDPAGDFFTWYRSPKNKSSLKERLDRFFVADLDKLKVSKVTVKHLNLHHSDH
        I+GGDFN  L   ++       PK+ D      +E+I    L    ++ +P    FT+ R      S + R+DR +++     +    T++      SDH
Subjt:  IVGGDFNEILFVDEKRGGGCRAPKQMDE----FKEVINCCGL----KKVDPAGDFFTWYRSPKNKSSLKERLDRFFVADLDKLKVSKVTVKHLNLHHSDH

Query:  RPILLEM----FWEKGYQW--NRNLIKNVKF----EEAW----AYSEECKSILQGYWKDSGHSNPFNLNTKVQESIKKMAFWNRCRLKGSLKGAINRVEQ
          + L M       K   W  N +L+++  F     + W    A+ +E  ++ Q  W D G     +L    QE  K ++      ++ +L G +  +EQ
Subjt:  RPILLEM----FWEKGYQW--NRNLIKNVKF----EEAW----AYSEECKSILQGYWKDSGHSNPFNLNTKVQESIKKMAFWNRCRLKGSLKGAINRVEQ

Query:  EILKIRSNSGEDRSE--DLIQKERHLESLLEEEEIYWKMRSREEWLNWGDKNTKCW-----------------------IDDEEEIRVTAVRYFRDLFSS
             R +  ED++   + ++++  L ++ + +     +RSR + L   D+ ++ +                       ++D E IR  A  ++++LFS 
Subjt:  EILKIRSNSGEDRSE--DLIQKERHLESLLEEEEIYWKMRSREEWLNWGDKNTKCW-----------------------IDDEEEIRVTAVRYFRDLFSS

Query:  NRQNLSDIEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYMSQINKTYIALIPKCKN
        +  +    E++ + +   +SE     L+   T  E+ +AL+ M   K+PG DG    FFQ +W+ +G D   V  E   KGE      +  ++L+PK  +
Subjt:  NRQNLSDIEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYMSQINKTYIALIPKCKN

Query:  PKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIH
         + +K+ RP+SL +  YKI+AKA++ RLK VL  +I P+Q+  +PGR I DN  L  + +H
Subjt:  PKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIH

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein7.1e-1325.4Show/hide
Query:  EIYWKMRSREEWLNWGDKNTKCW-----------------------IDDEEEIRVTAVRYFRDLFSSNRQNLS--DIEDVLEAIDCKISEEEALWLDRDF
        E +++ +SR +WL  GD NT+ +                       +++  +++   V Y+  L  S+   L+   ++ + +    + ++  A  L    
Subjt:  EIYWKMRSREEWLNWGDKNTKCW-----------------------IDDEEEIRVTAVRYFRDLFSSNRQNLS--DIEDVLEAIDCKISEEEALWLDRDF

Query:  TEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYMSQINKTYIALIPKCKNPKHMKDLRPISLCNVIYKII
        ++ EI  A+  M   KAPGPD   A FF + W  +    ++   E    G  + + N T I LIPK      +   RP+S C V+YKII
Subjt:  TEYEIVKALKGMSPTKAPGPDGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYMSQINKTYIALIPKCKNPKHMKDLRPISLCNVIYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases2.7e-0440.43Show/hide
Query:  LANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSGK
        +  RLK ++  +I P QA+FIPGR+ TDN +   E +H++  K+  K
Subjt:  LANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIHAINNKRSGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGGAAATCTCAGAGATAATCATAAGGAAACGTGTTACAGCACTAGGGAACAGGGGACTCAGGAAAGGGAATGCGACAAAAGGAAAAAGTCCGGAAAGAGTACTTG
GGCTTATGAGAGTAAAGAAACACCAAAAAAGCCCAATAATGAGACCCTAAATTTAACCAGTACAGGGGAAGGAAAAAAGGCCCACATAGAGCAAGAAGGCGCAAATCTGA
ATACGGCTAAAACAGGAAGTGTCCCAGGGAATAAGGCAGGGAAAAAGAAGGAGACTCTGAAGAATTCAAGGGCCAATCCAACTGGGGTTAAGGCTTGGAAAAGGAAGGCC
CACCTCGAGAAAGGCAACAGAGAAGTCAGTGACATGGACTTAGACTGCAAAATCAGGAAACACGAAAGACAGGAAGAGGAAGTTAACCAAGTCAACAAAAGGCAATCCCC
GCCGGACGCCATGAAAATCCTATGTTGGAACGTCCGGGGATTGAGAAATCCTCGGGCGTTCCAGGCTTTGCGCTACGAGGTGCAAAGAAACAAGCCTAAGATCGTTTTCA
TCTCTGAGTCCAAAATTGGGGATGCTAGAGCGCAGAAGCTTAAAGTGTTACTGGGTTTTGACTATTGTTTCAGCGTGTGCAGTGCGGGTAGAAGCGGTGGCTTAGTGTTG
TTTTGTCCTAAAGTCGAGGAGAGACCATCATCGTGGACTCTCCTCTCTAGACTCAAGTCTCTCTACAACTGTCCGTGGATAGTGGGAGGGGATTTCAACGAGATTTTGTT
TGTTGATGAAAAAAGGGGGGGTGGTTGTAGAGCTCCTAAGCAGATGGATGAGTTCAAAGAAGTGATTAACTGCTGCGGTCTAAAGAAGGTGGATCCGGCTGGTGATTTTT
TCACTTGGTACAGAAGTCCCAAGAACAAAAGTTCCTTAAAGGAGCGCCTGGACAGATTCTTCGTAGCAGACCTCGATAAGCTGAAAGTGAGCAAAGTTACTGTTAAACAC
CTCAACCTCCATCATTCTGACCACAGGCCCATTCTCCTTGAGATGTTTTGGGAGAAAGGCTATCAATGGAATAGAAACCTCATCAAGAATGTCAAATTTGAGGAAGCGTG
GGCTTATTCTGAGGAGTGTAAAAGCATTCTTCAAGGCTACTGGAAGGATTCTGGCCACTCAAATCCTTTCAATCTTAATACAAAAGTCCAAGAAAGCATTAAAAAGATGG
CATTTTGGAACAGATGCAGGTTGAAAGGCTCTTTAAAAGGTGCTATAAACCGGGTTGAACAAGAGATTCTGAAGATCCGTTCTAATTCAGGTGAAGACAGATCTGAGGAT
TTGATCCAGAAGGAGAGGCACCTAGAAAGTTTGCTTGAAGAAGAAGAAATTTACTGGAAAATGCGCTCTAGAGAAGAGTGGCTCAATTGGGGAGACAAGAACACCAAATG
CTGGATTGATGATGAGGAGGAGATCAGAGTGACTGCAGTCAGATATTTTAGAGACCTATTCTCCTCAAATAGGCAGAACCTTAGTGATATTGAGGATGTGCTAGAAGCAA
TTGATTGCAAGATCAGCGAAGAGGAAGCTTTATGGTTGGACAGAGACTTTACTGAATATGAGATTGTCAAAGCCCTTAAAGGAATGAGCCCAACCAAAGCGCCGGGCCCT
GACGGTGCACATGCTTTGTTTTTCCAAAAATACTGGGAAGAAATAGGCAAAGATGTGGTTTCAGTGTGTTTGGAAGTCCTCAATAAAGGGGAATATATGTCTCAGATCAA
TAAAACTTATATCGCGCTCATTCCGAAATGCAAAAATCCAAAGCACATGAAAGATCTGAGGCCTATTAGCCTATGCAATGTTATCTATAAGATCATCGCTAAAGCCTTAG
CTAACAGATTAAAAGAAGTCCTCAAGACAATCATCTCCCCTAACCAAGCAGCCTTCATACCGGGAAGACTCATTACAGACAACGCAATATTGGGTTTTGAATGCATTCAC
GCTATTAACAACAAAAGATCAGGGAAAAGGGGGGCACATTGCAATGAAGCTAGATAG
mRNA sequenceShow/hide mRNA sequence
ATGACAGGAAATCTCAGAGATAATCATAAGGAAACGTGTTACAGCACTAGGGAACAGGGGACTCAGGAAAGGGAATGCGACAAAAGGAAAAAGTCCGGAAAGAGTACTTG
GGCTTATGAGAGTAAAGAAACACCAAAAAAGCCCAATAATGAGACCCTAAATTTAACCAGTACAGGGGAAGGAAAAAAGGCCCACATAGAGCAAGAAGGCGCAAATCTGA
ATACGGCTAAAACAGGAAGTGTCCCAGGGAATAAGGCAGGGAAAAAGAAGGAGACTCTGAAGAATTCAAGGGCCAATCCAACTGGGGTTAAGGCTTGGAAAAGGAAGGCC
CACCTCGAGAAAGGCAACAGAGAAGTCAGTGACATGGACTTAGACTGCAAAATCAGGAAACACGAAAGACAGGAAGAGGAAGTTAACCAAGTCAACAAAAGGCAATCCCC
GCCGGACGCCATGAAAATCCTATGTTGGAACGTCCGGGGATTGAGAAATCCTCGGGCGTTCCAGGCTTTGCGCTACGAGGTGCAAAGAAACAAGCCTAAGATCGTTTTCA
TCTCTGAGTCCAAAATTGGGGATGCTAGAGCGCAGAAGCTTAAAGTGTTACTGGGTTTTGACTATTGTTTCAGCGTGTGCAGTGCGGGTAGAAGCGGTGGCTTAGTGTTG
TTTTGTCCTAAAGTCGAGGAGAGACCATCATCGTGGACTCTCCTCTCTAGACTCAAGTCTCTCTACAACTGTCCGTGGATAGTGGGAGGGGATTTCAACGAGATTTTGTT
TGTTGATGAAAAAAGGGGGGGTGGTTGTAGAGCTCCTAAGCAGATGGATGAGTTCAAAGAAGTGATTAACTGCTGCGGTCTAAAGAAGGTGGATCCGGCTGGTGATTTTT
TCACTTGGTACAGAAGTCCCAAGAACAAAAGTTCCTTAAAGGAGCGCCTGGACAGATTCTTCGTAGCAGACCTCGATAAGCTGAAAGTGAGCAAAGTTACTGTTAAACAC
CTCAACCTCCATCATTCTGACCACAGGCCCATTCTCCTTGAGATGTTTTGGGAGAAAGGCTATCAATGGAATAGAAACCTCATCAAGAATGTCAAATTTGAGGAAGCGTG
GGCTTATTCTGAGGAGTGTAAAAGCATTCTTCAAGGCTACTGGAAGGATTCTGGCCACTCAAATCCTTTCAATCTTAATACAAAAGTCCAAGAAAGCATTAAAAAGATGG
CATTTTGGAACAGATGCAGGTTGAAAGGCTCTTTAAAAGGTGCTATAAACCGGGTTGAACAAGAGATTCTGAAGATCCGTTCTAATTCAGGTGAAGACAGATCTGAGGAT
TTGATCCAGAAGGAGAGGCACCTAGAAAGTTTGCTTGAAGAAGAAGAAATTTACTGGAAAATGCGCTCTAGAGAAGAGTGGCTCAATTGGGGAGACAAGAACACCAAATG
CTGGATTGATGATGAGGAGGAGATCAGAGTGACTGCAGTCAGATATTTTAGAGACCTATTCTCCTCAAATAGGCAGAACCTTAGTGATATTGAGGATGTGCTAGAAGCAA
TTGATTGCAAGATCAGCGAAGAGGAAGCTTTATGGTTGGACAGAGACTTTACTGAATATGAGATTGTCAAAGCCCTTAAAGGAATGAGCCCAACCAAAGCGCCGGGCCCT
GACGGTGCACATGCTTTGTTTTTCCAAAAATACTGGGAAGAAATAGGCAAAGATGTGGTTTCAGTGTGTTTGGAAGTCCTCAATAAAGGGGAATATATGTCTCAGATCAA
TAAAACTTATATCGCGCTCATTCCGAAATGCAAAAATCCAAAGCACATGAAAGATCTGAGGCCTATTAGCCTATGCAATGTTATCTATAAGATCATCGCTAAAGCCTTAG
CTAACAGATTAAAAGAAGTCCTCAAGACAATCATCTCCCCTAACCAAGCAGCCTTCATACCGGGAAGACTCATTACAGACAACGCAATATTGGGTTTTGAATGCATTCAC
GCTATTAACAACAAAAGATCAGGGAAAAGGGGGGCACATTGCAATGAAGCTAGATAG
Protein sequenceShow/hide protein sequence
MTGNLRDNHKETCYSTREQGTQERECDKRKKSGKSTWAYESKETPKKPNNETLNLTSTGEGKKAHIEQEGANLNTAKTGSVPGNKAGKKKETLKNSRANPTGVKAWKRKA
HLEKGNREVSDMDLDCKIRKHERQEEEVNQVNKRQSPPDAMKILCWNVRGLRNPRAFQALRYEVQRNKPKIVFISESKIGDARAQKLKVLLGFDYCFSVCSAGRSGGLVL
FCPKVEERPSSWTLLSRLKSLYNCPWIVGGDFNEILFVDEKRGGGCRAPKQMDEFKEVINCCGLKKVDPAGDFFTWYRSPKNKSSLKERLDRFFVADLDKLKVSKVTVKH
LNLHHSDHRPILLEMFWEKGYQWNRNLIKNVKFEEAWAYSEECKSILQGYWKDSGHSNPFNLNTKVQESIKKMAFWNRCRLKGSLKGAINRVEQEILKIRSNSGEDRSED
LIQKERHLESLLEEEEIYWKMRSREEWLNWGDKNTKCWIDDEEEIRVTAVRYFRDLFSSNRQNLSDIEDVLEAIDCKISEEEALWLDRDFTEYEIVKALKGMSPTKAPGP
DGAHALFFQKYWEEIGKDVVSVCLEVLNKGEYMSQINKTYIALIPKCKNPKHMKDLRPISLCNVIYKIIAKALANRLKEVLKTIISPNQAAFIPGRLITDNAILGFECIH
AINNKRSGKRGAHCNEAR