CuGenDBv2

Lag0011129 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0011129
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr1:15308380..15309593
RNA-Seq Expression: Lag0011129
Synteny: Lag0011129
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: IPR000477 - Reverse transcriptase domain
                  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
CCA66036.1 hypothetical protein [Beta vulgaris subsp. vulgaris] | e-value: 5.4e-70 | % identity: 40.75
Query:  EASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAF
        +AS   ++N V+ + + AG W +  D V      YF N++++ +    EM+  L  V   +T+E+ ++L  P ++EE+  AL Q HPNK+ GPD +   F
Subjt:  EASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAF

Query:  YQHSWDGLP-----------------GPLNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYE
        YQH WD +                  G +N+T IVLI KKK+     ++RPISLCN++YK+V+KVL NRMK +L  +I ++QS F+PGR + DN ++ YE
Subjt:  YQHSWDGLP-----------------GPLNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYE

Query:  CLHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSF------------------------NLFLLCAEGLSCML
        C H L+ +  G  G+  LKLDM K YDRVEW FLE +MLK+ F   + +L+  C++S R+S                          LF++CAEGLS +L
Subjt:  CLHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSF------------------------NLFLLCAEGLSCML

Query:  RGVEEADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSIISSSLNME
        R  EE   I G+++      ISHLFFADDSLLF RA E +   + DIL  YE ASGQ++N +KS +S S N+E
Subjt:  RGVEEADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSIISSSLNME

CCA66054.1 hypothetical protein [Beta vulgaris subsp. vulgaris] | e-value: 1.2e-69 | % identity: 40.64
Query:  EASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAF
        +AS  +++N VKGL DG G W+++ D +  +   YF +I+ +++P    +E  +  +   VTEE N KLL P  ++EIL AL+Q HP K+ GPD +   F
Subjt:  EASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAF

Query:  YQHSW-----------------DGLPGPLNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYE
        YQ  W                    P  +N T I LI K K P + +E+RPI+LCN++YKL+SK +V R+K  L  +IS+NQSAF+PGR + DNA++  E
Subjt:  YQHSW-----------------DGLPGPLNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYE

Query:  CLHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSF------------------------NLFLLCAEGLSCML
          H +K R++   G   +KLDM K YDRVEW FL K++L M F   WV LI   +SSV YSF                         LF++ A+  S M+
Subjt:  CLHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSF------------------------NLFLLCAEGLSCML

Query:  RGVEEADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSIISSSLNMEV
        +   +   + G + +R  P ISHLFFADDSLLF RA   +  +I DIL  YE ASGQ+IN++KS +S S  + V
Subjt:  RGVEEADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSIISSSLNMEV

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber] | e-value: 5.4e-70 | % identity: 40.97
Query:  EASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAF
        +AS  RR+N + G+ D  G WQ   + +  +   YF  IY ++ P    + + L  +PTTVTEEMN  L+    +EEI  AL Q HP K+ GPD +   F
Subjt:  EASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAF

Query:  YQHSWDGLPG-----------------PLNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYE
        +Q  W+ +                    +N+T I L+ K K P ++S++RPISLCN+VYKL+SKVL NR+K IL  +IS+NQSAF+ GR + DN ++ +E
Subjt:  YQHSWDGLPG-----------------PLNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYE

Query:  CLHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSF------------------------NLFLLCAEGLSCML
         +H L+ + +G  G+A +KLDM K YDRVEW F++++M KM F   W++L+  CI+SV YS                          +FLLCA+G S +L
Subjt:  CLHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSF------------------------NLFLLCAEGLSCML

Query:  RGVEEADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSIISSSLN
          V     I G+ + R  P I+HLFFADDSLLF +A   +   + DIL  YE ASGQ+IN DKS +  S N
Subjt:  RGVEEADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSIISSSLN

XP_027148760.1 uncharacterized protein LOC113749274 [Coffea eugenioides] | e-value: 2.7e-69 | % identity: 41.3
Query:  RRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAFYQHSW
        R++N +  L+   G W    + VLG I GYFGN++ +++P  E        +P  +T+ MNS+L+ P  + E+  A+   HPNK+ GPD +   F+Q  W
Subjt:  RRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAFYQHSW

Query:  -----------------DGLPGPLNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYECLHVL
                           L   +NET I LI K   P  +S +RPISLCN++Y+++SK+LVNR+K  LN  +S NQSAFIPGR ++DN ++ +E +H L
Subjt:  -----------------DGLPGPLNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYECLHVL

Query:  KGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSFN------------------------LFLLCAEGLSCMLRGVEE
          R  G   +  LKLDM K YDRVEW FLEKIM KM F   W+  I  CISSV YSFN                        LFLL +EGLS +L     
Subjt:  KGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSFN------------------------LFLLCAEGLSCMLRGVEE

Query:  ADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSIISSSLNME
           I GL++A   PA+SHL FADD+L+F RA + +A  +R +L  Y +ASGQ IN +KS +  S N E
Subjt:  ADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSIISSSLNME

XP_031127667.1 uncharacterized protein LOC116029767 [Ipomoea triloba] | e-value: 1.1e-70 | % identity: 41.89
Query:  ASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAFY
        AS  R+ N ++ L+D  G W +  D +   I  Y+  I++T     E     L+ VPT +++  N  LL P + +E+  AL    PNK+ GPD ++ +FY
Subjt:  ASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAFY

Query:  QHSWD-----------------GLPGPLNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYEC
        Q  W                   LP  LN T +VLISKKK P R+S+ RPI+LCN+VYK+++KV+ NR+K +LNG++S +QSAF+PGR + DN IL  E 
Subjt:  QHSWD-----------------GLPGPLNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYEC

Query:  LHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSF------------------------NLFLLCAEGLSCMLR
         H L+ +  G  GWA LKLDM K YDR+EW +LE ++L + FA  WVELI +C++SV Y+                          LF++CAEGLS + +
Subjt:  LHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSF------------------------NLFLLCAEGLSCMLR

Query:  GVEEADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSIISSSLN
          E    I G+R+AR  P+ISHLFFADDSLLFF+A   +A  +++ L  Y  ASGQ +NFDKS ++ S+N
Subjt:  GVEEADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSIISSSLN

TrEMBL top hits (e-value, % identity, alignment)
A0A2N9EDY7 Reverse transcriptase domain-containing protein | e-value: 5.4e-76 | % identity: 43.99
Query:  ASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAFY
        AS  RR+N + GL+D  G+W+++     GLI  +F +I+RT+ P  E +E+A+ HVP  +++E+N+ L      +E+ LALKQ  P K+ GPD +   F+
Subjt:  ASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAFY

Query:  QHSWDGLPGP------------------LNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYE
        Q  W  L GP                  +N T I LI K K P RI+E+RPISLCN+ YKL+SKV+ NR+KGIL  +IS+ QSAF+PGR + DN ++ +E
Subjt:  QHSWDGLPGP------------------LNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYE

Query:  CLHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSF------------------------NLFLLCAEGLSCML
         LH +     G  G   +KLDM K YDRVEW FLEKIM KM F P WV LI  CIS+V YS                          LFLLCAEGL  ++
Subjt:  CLHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSF------------------------NLFLLCAEGLSCML

Query:  RGVEEADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSII
           +E  +++GL L R  P I+HLFFADDSLLF +A   +  +I++IL  YE+ASGQ++N DK+ +
Subjt:  RGVEEADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSII

A0A2N9ES30 Reverse transcriptase domain-containing protein | e-value: 2.3e-74 | % identity: 43.72
Query:  ASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAFY
        AS  RR+N + GL+D  G+W+++     GLI  +F +I+RT+ P  E +E+A+ HVP  +++E+N+ L+     +E+ LALKQ  P K+ GPD +   F+
Subjt:  ASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAFY

Query:  QHSWDGLPGP------------------LNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYE
        Q  W  L GP                  +N T I LI K K P RI+E+RPISLCN+ YKL+SKV+ NR+KGIL  +IS+ QSAF+PGR + DN ++ +E
Subjt:  QHSWDGLPGP------------------LNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYE

Query:  CLHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRY------------------------SFNLFLLCAEGLSCML
         L  +     G  G   +KLDM K YDRVEW FLEKIM KM F P WV LI  CIS+V Y                        S  LFLLCAEGL  ++
Subjt:  CLHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRY------------------------SFNLFLLCAEGLSCML

Query:  RGVEEADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSII
           +E  +++GL L R  P I+HLFFADDSLLF +A   +  +I++IL  YE+ASGQ++N DK+ +
Subjt:  RGVEEADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSII

A0A2N9HPU0 Uncharacterized protein | e-value: 1.2e-75 | % identity: 43.99
Query:  ASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAFY
        AS  RR+N + GL+D  G+W+++   V GLI  +F +I+RT+ P  E +E+A+ HVP  +++E+N+ L       E+ LALKQ  P K+ GPD +   F+
Subjt:  ASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAFY

Query:  QHSWDGLPGP------------------LNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYE
        Q  W  + GP                  +N T I LI K K P RI+E+RPISLCN+ YKL+SKV+ NR+KGIL  +IS+ QSAF+PGR + DN ++ +E
Subjt:  QHSWDGLPGP------------------LNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYE

Query:  CLHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSF------------------------NLFLLCAEGLSCML
         LH +     G  G   +KLDM K YDRVEW FLEKIM KM F P WV LI  CIS+V YS                          LFLLCAEGL  ++
Subjt:  CLHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSF------------------------NLFLLCAEGLSCML

Query:  RGVEEADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSII
           +E  +++GL L R  P I+HLFFADDSLLF +A   +  +I++IL  YE+ASGQ++N DK+ +
Subjt:  RGVEEADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSII

A0A2N9I509 Uncharacterized protein | e-value: 6.8e-71 | % identity: 42.47
Query:  DEASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGA
        + AS  R+ N + GL D    W ++P  +  +   YF  +++++ P    +E+ + HV + VT  MNS LL P   EEI  AL Q HP+K+ GPD +   
Subjt:  DEASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGA

Query:  FYQHSWD--GLP---------------GPLNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGY
        F+Q  W   GL                  +N T IVLI K K P R+S++RPISLCN+VYK+VSKVLVNRMK +L  +IS +QSAF+PGR + DN I+ +
Subjt:  FYQHSWD--GLP---------------GPLNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGY

Query:  ECLHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSF------------------------NLFLLCAEGLSCM
        E LH LK    G       KLDM K YDRVEW +L  I+LK+ FA  WV L+  C++S  YS                          LFL+CAEGLS +
Subjt:  ECLHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSF------------------------NLFLLCAEGLSCM

Query:  LRGVEEADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSIISSSLN
        +R  E    IKG+ + R  P ISHLFFADDS++F RA  +  ++I++IL  YE+ASGQ +N DK+ I  S N
Subjt:  LRGVEEADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSIISSSLN

A0A2N9IMR2 Reverse transcriptase domain-containing protein | e-value: 6.8e-71 | % identity: 42.47
Query:  ASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAFY
        AS  RR+N +K + D AGIWQ+  D+V  +   YF  ++ T++P    +E+A+   P  VT+ MN  L       E  LA+ Q  P+ + GPD +   FY
Subjt:  ASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAFY

Query:  QHSW-----------------DGLPGPLNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYEC
        +  W                 D L   +N+T I LI K K P R++E+RPISLCN++YK++SKVLVNR+K IL  +IS+ QSAF+PG  + DN ++ +E 
Subjt:  QHSW-----------------DGLPGPLNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYEC

Query:  LHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSF------------------------NLFLLCAEGLSCMLR
        LH++     G  G   LKLDM K YDRVEW +LEK+M KM F P W+ L  +CIS V YS                          LFLLCAEGL  M++
Subjt:  LHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSF------------------------NLFLLCAEGLSCMLR

Query:  GVEEADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSII
          E    ++G+ L RY P I+HLFFADDSLLF RA       I+D+L  YERASGQ++N DK+ I
Subjt:  GVEEADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSII

SwissProt top hits (e-value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein | e-value: 1.0e-15 | % identity: 23.33
Query:  RRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQAL--LHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAFYQH
        R KN++  +++  G     P  +   I  Y+ ++Y       EEM+  L    +P    EE+ S L  P    EI+  +      KS GPD     FYQ 
Subjt:  RRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQAL--LHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAFYQH

Query:  SWDG-----------------LPGPLNETMIVLISKK-KYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYECL
          +                  LP    E  I+LI K  +   +   +RPISL N+  K+++K+L NR++  +  LI  +Q  FIPG     N       +
Subjt:  SWDG-----------------LPGPLNETMIVLISKK-KYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYECL

Query:  -HVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELI---------SLCISSVRY-SFNLFLLCAEG-----------LSCMLRGVE
         H+ + + K      ++ +D  K +D+++  F+ K + K+     ++++I         ++ ++  +  +F L     +G           L  + R + 
Subjt:  -HVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELI---------SLCISSVRY-SFNLFLLCAEG-----------LSCMLRGVE

Query:  EADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKS
        +   IKG++L +    +S   FADD +++       A  +  ++  + + SG +IN  KS
Subjt:  EADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKS

P08548 LINE-1 reverse transcriptase homolog | e-value: 2.7e-16 | % identity: 23.82
Query:  RRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQAL--LHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAFYQH
        R K+ +  + +G       P  +  ++  Y+  +Y   +   +E++Q L   H+P    +E+   L  P    EI   ++     KS GPD     FYQ 
Subjt:  RRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQAL--LHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAFYQH

Query:  SWDG-----------------LPGPLNETMIVLISKK-KYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYECL
          +                  LP    E  I LI K  K P R   YRPISL N+  K+++K+L NR++  +  +I  +Q  FIPG     N       +
Subjt:  SWDG-----------------LPGPLNETMIVLISKK-KYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYECL

Query:  -HVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELI---------SLCISSVRY-SFNLFLLCAEG-----------LSCMLRGVE
         H+ K ++K      +L +D  K +D ++  F+ + + K+     +++LI         ++ ++ V+  SF L     +G           +  +   + 
Subjt:  -HVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELI---------SLCISSVRY-SFNLFLLCAEG-----------LSCMLRGVE

Query:  EADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSI
        E  AIKG+ +      I    FADD +++          + +++  Y   SG +IN  KS+
Subjt:  EADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSI

P11369 LINE-1 retrotransposable element ORF2 protein | e-value: 2.5e-14 | % identity: 22.99
Query:  PDRVLGLIEGYFGNIYRTTHPFGEEMEQAL--LHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAFYQ--------------HSWD--
        P+ +   I  ++  +Y T     +EM++ L    VP  + ++    L  P   +EI   +      KS GPD     FYQ              H  +  
Subjt:  PDRVLGLIEGYFGNIYRTTHPFGEEMEQAL--LHVPTTVTEEMNSKLLHPSQQEEILLALKQTHPNKSLGPDRLLGAFYQ--------------HSWD--

Query:  -GLPGPLNETMIVLISK-KKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYECLHVL-KGRHKGITGWALLKL
          LP    E  I LI K +K P +I  +RPISL N+  K+++K+L NR++  +  +I  +Q  FIPG     N       +H + K + K      ++ L
Subjt:  -GLPGPLNETMIVLISK-KKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYECLHVL-KGRHKGITGWALLKL

Query:  DMIKTYDRVEWVFLEKIMLKMDFAPGWVELI----SLCISSVR-----------------------YSFNLFLLCAEGLSCMLRGVEEADAIKGLRLARY
        D  K +D+++  F+ K++ +      ++ +I    S  +++++                       Y FN+       L  + R + +   IKG+++ + 
Subjt:  DMIKTYDRVEWVFLEKIMLKMDFAPGWVELI----SLCISSVR-----------------------YSFNLFLLCAEGLSCMLRGVEEADAIKGLRLARY

Query:  WPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSI
           IS L  ADD +++    +     + +++  +    G +IN +KS+
Subjt:  WPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKSI

P14381 Transposon TX1 uncharacterized 149 kDa protein | e-value: 3.4e-19 | % identity: 25
Query:  GAGVRYELQ---NAEEGLEALLVEDEASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQ
        GA VR  +Q   + + G       ++    R++      EDG  +  + P+ +      ++ N++ +  P   +  + L      V+E    +L  P   
Subjt:  GAGVRYELQ---NAEEGLEALLVEDEASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQ

Query:  EEILLALKQTHPNKSLGPDRLLGAFYQHSWD-----------------GLPGPLNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILN
        +E+  AL+    NKS G D L   F+Q  WD                  LP      ++ L+ KK   R I  +RP+SL +  YK+V+K +  R+K +L 
Subjt:  EEILLALKQTHPNKSLGPDRLLGAFYQHSWD-----------------GLPGPLNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILN

Query:  GLISQNQSAFIPGRCVVDNAILGYECLHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISL------CISSVRYSF-------
         +I  +QS  +PGR + DN  L  + LH    R  G++  A L LD  K +DRV+  +L   +    F P +V  +        C+  + +S        
Subjt:  GLISQNQSAFIPGRCVVDNAILGYECLHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISL------CISSVRYSF-------

Query:  -----------NLFLLCAEGLSCMLRGVEEADAIK--GLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKS--IISSSLN
                    L+ L  E   C+LR       +K   +R+     A   +  A D +   RA+E + +        Y  AS  RIN+ KS  ++  SL 
Subjt:  -----------NLFLLCAEGLSCMLRGVEEADAIK--GLRLARYWPAISHLFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDKS--IISSSLN

Query:  MEVL
        ++ L
Subjt:  MEVL

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM | e-value: 5.3e-04 | % identity: 22.5
Query:  WDG-LPGPLNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYECLHVLKGRHKGITGWALLKL
        W G LP  +     V I K    +R  ++RPIS+ +++ + ++ +L  R+   +N      Q  F+P     DNA +      VL+  HK      +  L
Subjt:  WDG-LPGPLNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYECLHVLKGRHKGITGWALLKL

Query:  DMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSFNLFLLCAEGLS----CMLRGVEEADAIKGL-------RLARYWPA----------ISH
        D+ K +D +    +   +       G+V+ +         S N      +G S       RGV++ D +  +       RL R  P+           + 
Subjt:  DMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSFNLFLLCAEGLS----CMLRGVEEADAIKGL-------RLARYWPA----------ISH

Query:  LFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDK
          FADD +LF   +    +++ D    +    G ++N DK
Subjt:  LFFADDSLLFFRAKEAKAIMIRDILCCYERASGQRINFDK

Arabidopsis top hits (e-value, % identity, alignment)
AT4G20520.1 RNA binding;RNA-directed DNA polymerases | e-value: 3.4e-14 | % identity: 40.23
Query:  LVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYECLHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELIS
        +V R+K ++  LI   Q++FIPGR   DN +   E +H ++ R KG+ GW LLKLD+ K YDR+ W +LE  ++   F   W+  I+
Subjt:  LVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYECLHVLKGRHKGITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELIS

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | e-value: 7.1e-04 | % identity: 44.19
Query:  LFLLCAEGLSCMLRGVEEADAIKGLRLARYWPAISHLFFADDS
        LF+LC E LS + R  +E   + G+R++   P I+HL FADD+
Subjt:  LFLLCAEGLSCMLRGVEEADAIKGLRLARYWPAISHLFFADDS


Sequences
CDS sequence
ATGGAGGGTGTAGGGGGTGCAGGGGTTCGGTATGAATTACAGAATGCAGAGGAAGGACTTGAAGCTCTCTTGGTTGAGGATGAGGCCTCGTGTACGAGAAGGAAGAATGA
GGTGAAGGGTCTAGAGGATGGTGCGGGGATTTGGCAGCAAAAGCCAGATAGAGTTTTGGGGTTAATTGAAGGGTATTTTGGAAACATCTATAGGACGACGCACCCGTTTG
GGGAGGAGATGGAGCAGGCCTTGTTGCATGTACCGACCACTGTGACAGAGGAAATGAATAGTAAGCTGCTTCACCCGTCTCAACAGGAGGAGATTCTTCTTGCGTTGAAG
CAGACTCACCCGAATAAATCTCTAGGGCCTGATAGGTTGTTAGGGGCTTTCTATCAGCATTCGTGGGATGGTCTCCCAGGGCCTTTGAATGAAACGATGATCGTGTTGAT
CTCGAAGAAGAAGTACCCCAGACGTATTTCGGAGTATAGGCCCATTTCGCTCTGTAACATGGTTTACAAGCTGGTGTCAAAGGTGCTAGTGAATCGTATGAAGGGTATAC
TGAATGGGTTGATCTCCCAGAATCAGAGTGCTTTTATACCTGGGCGTTGTGTGGTGGATAATGCCATCCTGGGGTATGAATGCTTACACGTTTTGAAAGGGAGACACAAG
GGCATAACGGGGTGGGCTTTGCTTAAGTTAGATATGATCAAGACTTATGACAGGGTGGAATGGGTCTTTTTGGAGAAGATTATGCTGAAAATGGATTTTGCTCCGGGATG
GGTGGAGTTGATCTCTCTTTGCATTTCGTCTGTTCGGTATTCCTTTAATCTTTTTCTATTATGTGCGGAGGGCCTGTCATGTATGCTCCGAGGTGTAGAGGAGGCTGATG
CTATTAAAGGGTTGAGGTTAGCAAGGTATTGGCCTGCCATCTCACATTTGTTCTTTGCAGATGACAGTTTACTGTTCTTTCGGGCTAAAGAGGCGAAAGCTATTATGATT
CGGGATATTCTCTGCTGCTATGAGCGGGCTTCGGGGCAGAGGATTAATTTCGATAAGTCCATCATATCTTCCAGTCTGAATATGGAGGTGTTGGCTTAG
mRNA sequence
ATGGAGGGTGTAGGGGGTGCAGGGGTTCGGTATGAATTACAGAATGCAGAGGAAGGACTTGAAGCTCTCTTGGTTGAGGATGAGGCCTCGTGTACGAGAAGGAAGAATGA
GGTGAAGGGTCTAGAGGATGGTGCGGGGATTTGGCAGCAAAAGCCAGATAGAGTTTTGGGGTTAATTGAAGGGTATTTTGGAAACATCTATAGGACGACGCACCCGTTTG
GGGAGGAGATGGAGCAGGCCTTGTTGCATGTACCGACCACTGTGACAGAGGAAATGAATAGTAAGCTGCTTCACCCGTCTCAACAGGAGGAGATTCTTCTTGCGTTGAAG
CAGACTCACCCGAATAAATCTCTAGGGCCTGATAGGTTGTTAGGGGCTTTCTATCAGCATTCGTGGGATGGTCTCCCAGGGCCTTTGAATGAAACGATGATCGTGTTGAT
CTCGAAGAAGAAGTACCCCAGACGTATTTCGGAGTATAGGCCCATTTCGCTCTGTAACATGGTTTACAAGCTGGTGTCAAAGGTGCTAGTGAATCGTATGAAGGGTATAC
TGAATGGGTTGATCTCCCAGAATCAGAGTGCTTTTATACCTGGGCGTTGTGTGGTGGATAATGCCATCCTGGGGTATGAATGCTTACACGTTTTGAAAGGGAGACACAAG
GGCATAACGGGGTGGGCTTTGCTTAAGTTAGATATGATCAAGACTTATGACAGGGTGGAATGGGTCTTTTTGGAGAAGATTATGCTGAAAATGGATTTTGCTCCGGGATG
GGTGGAGTTGATCTCTCTTTGCATTTCGTCTGTTCGGTATTCCTTTAATCTTTTTCTATTATGTGCGGAGGGCCTGTCATGTATGCTCCGAGGTGTAGAGGAGGCTGATG
CTATTAAAGGGTTGAGGTTAGCAAGGTATTGGCCTGCCATCTCACATTTGTTCTTTGCAGATGACAGTTTACTGTTCTTTCGGGCTAAAGAGGCGAAAGCTATTATGATT
CGGGATATTCTCTGCTGCTATGAGCGGGCTTCGGGGCAGAGGATTAATTTCGATAAGTCCATCATATCTTCCAGTCTGAATATGGAGGTGTTGGCTTAG
Protein sequence
MEGVGGAGVRYELQNAEEGLEALLVEDEASCTRRKNEVKGLEDGAGIWQQKPDRVLGLIEGYFGNIYRTTHPFGEEMEQALLHVPTTVTEEMNSKLLHPSQQEEILLALK
QTHPNKSLGPDRLLGAFYQHSWDGLPGPLNETMIVLISKKKYPRRISEYRPISLCNMVYKLVSKVLVNRMKGILNGLISQNQSAFIPGRCVVDNAILGYECLHVLKGRHK
GITGWALLKLDMIKTYDRVEWVFLEKIMLKMDFAPGWVELISLCISSVRYSFNLFLLCAEGLSCMLRGVEEADAIKGLRLARYWPAISHLFFADDSLLFFRAKEAKAIMI
RDILCCYERASGQRINFDKSIISSSLNMEVLA
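
The relationship between the CDS and protein sequences listed above can be checked with a short translation sketch. It assumes the standard nuclear genetic code (NCBI translation table 1) and reading frame 0; the `translate` function and the table names are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 24 nucleotides of the CDS listed above.
print(translate("ATGGAGGGTGTAGGGGGTGCAGGGGTTCGG"))  # MEGVGGAGVR
print(translate("CTGAATATGGAGGTGTTGGCTTAG"))        # LNMEVLA
```

Passing the full CDS (with its line breaks) to `translate` should reproduce the protein sequence, provided the assumptions above hold.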