; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018773 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018773
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr5:34181440..34182860
RNA-Seq ExpressionLag0018773
SyntenyLag0018773
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CCA66036.1 hypothetical protein [Beta vulgaris subsp. vulgaris]2.7e-8137.39Show/hide
Query:  MERLEKAHNLPWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFH
        +  L +A   PW+ GGDFN ++   EKK G     R  +   +++  C+ +DLGF+G   TW  N+      +ERLDR++AN           V HL   
Subjt:  MERLEKAHNLPWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFH

Query:  HSDHRAILLKIDWGDTNQQQSFHKRLAKLEQSWLQ------------------GITRKELEIKRLSNPKD------------QAALDLLSKAE-------
         SDH  I+  +    +   ++   +  + E  WL+                  GI       K LS  K             Q  + +L ++E       
Subjt:  HSDHRAILLKIDWGDTNQQQSFHKRLAKLEQSWLQ------------------GITRKELEIKRLSNPKD------------QAALDLLSKAE-------

Query:  ------WELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLTSSHPNREDTKEVTKAITK
                +++L + EE YWH R+R++W+KSGDKNTK+FH KA+ R++RN ++ I N  G W ED  ++      Y+ +L  S   N  +   +   +  
Subjt:  ------WELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLTSSHPNREDTKEVTKAITK

Query:  KISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMNEFRPISLCNVIY
        +I+++   +LDA F REEV  A+  M P+KAPGPDG +A+ +Q+ W+T+GED     L +LNN + I  +N+T I LIPK K  ++  +FRPISLCNV+Y
Subjt:  KISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMNEFRPISLCNVIY

Query:  KVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD
        K++AK LANRMK VL  +I  SQS F+PGR ITDNVLV +EC H L  ++TG+ GY+ +KLDMSKAYD
Subjt:  KVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD

PRQ56718.1 putative RNA-directed DNA polymerase [Rosa chinensis]7.2e-8237.29Show/hide
Query:  MERLEKAHNLPWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFH
        +++L    NLPW++GGD+NEI    +K  G  + +R +ND+ +++  C L D+ F+G R TW   +      R RLDR+  +    D+    RV HL   
Subjt:  MERLEKAHNLPWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFH

Query:  HSDHRAILLKIDWGDTNQQQSFHKRLAKLEQSWLQGITRKEL-----------------------------------------EIKRLSN----------
         SDH  ILL++      +++   KR  K E+ WL   T KE+                                         EI ++ N          
Subjt:  HSDHRAILLKIDWGDTNQQQSFHKRLAKLEQSWLQGITRKEL-----------------------------------------EIKRLSN----------

Query:  ----PKDQAALDLLSKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLTSSHPNR
             +D+ AL L      +L  LL +E+ +W  RA+  WLK GD NTK+FH +   RKK+N + G+FN+ G+W  +++E+      Y+N L TSS+P  
Subjt:  ----PKDQAALDLLSKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLTSSHPNR

Query:  EDTKEVTKAITKKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMN
        ED + + + +   +SE     L     +EEV  AIK M PSK+PGPDG     FQ  WE +G+D V        +KE +  +N T +ALIPK + P+ M+
Subjt:  EDTKEVTKAITKKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMN

Query:  EFRPISLCNVIYKVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD
        + RPISLCNV+YK+ +K LANR+K +LD++ISP QSAF+PGR I+DN L+ FE  H L  RR+G+ GY A+KLDMSKAYD
Subjt:  EFRPISLCNVIYKVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD

XP_010673168.1 PREDICTED: uncharacterized protein LOC104889608 [Beta vulgaris subsp. vulgaris]4.7e-8138.13Show/hide
Query:  PWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFHHSDHRAILLK
        P + GGDFNE++   E + G     R ++D  + +   +L DLGF G   TW + K+     RERLDR+LA+P+  D    + VEH+  + SDH  I+++
Subjt:  PWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFHHSDHRAILLK

Query:  IDWGDTNQQQSFHKRLAKLEQSWL--------------------------------------------QGITRKELEIKRLSNPKDQAALDLLSKAEWEL
        + +G   +++   K+  +   +WL                                            + I   E EIKRL +    A  + L +   +L
Subjt:  IDWGDTNQQQSFHKRLAKLEQSWL--------------------------------------------QGITRKELEIKRLSNPKDQAALDLLSKAEWEL

Query:  EKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLTSSHPNREDTKEVTKAITKKISEDQKAK
        + LLE++E+YW++R+R   +K GDKNTK+FH KA+QRK+RN I G+F+   +W +D ++I      YY +L TSS P+ E    V  A+   ISE+    
Subjt:  EKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLTSSHPNREDTKEVTKAITKKISEDQKAK

Query:  LDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMNEFRPISLCNVIYKVIAKALAN
        L     +EEV +A++ M PSKAPGPDG HA+ +Q  W  +G+D      GI++     + +N T IALIPK K P  ++EFRPISLCNVI+K++ K LAN
Subjt:  LDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMNEFRPISLCNVIYKVIAKALAN

Query:  RMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD
        R+K +L  ++S +QSAF+PGR ITDN L+  E  H +  R  G  G+VA+KLDMSKAYD
Subjt:  RMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD

XP_023908235.1 uncharacterized protein LOC112019924 [Quercus suber]3.9e-8038.66Show/hide
Query:  MERLEKAHNLPWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFH
        +E L    N  W+  GDFN I+ + EK    P     I+   + +  C+L DLG+ G   TW+  +     T+ RLDR +A  +         V HL  H
Subjt:  MERLEKAHNLPWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFH

Query:  HSDHRAILLKIDWGDTNQQQSFHKRLAKLEQSWLQ--------------------GITRKELEIK-----------RLSNPKD------QAALDLLSKAE
         SDH  IL+ +      QQ+   K+  K E++WL                     G+   + +IK             S P +      Q  L++L+ AE
Subjt:  HSDHRAILLKIDWGDTNQQQSFHKRLAKLEQSWLQ--------------------GITRKELEIK-----------RLSNPKD------QAALDLLSKAE

Query:  --------------WELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLTSSHPNREDTK
                       +L+ LL ++E YW  R+R  WLK GDKNTK+FHSKA+QR++RN IKGI +++G WVE+++EI    T Y+++L  +   N +  +
Subjt:  --------------WELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLTSSHPNREDTK

Query:  EVTKAITKKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMNEFRP
        E   A+ +K++ +    L + F+ EEV+ A+  M P+KAPGPDG +A+ +Q  W  +G+  V   L  LNN      IN T I LIPK K+P+ M++FRP
Subjt:  EVTKAITKKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMNEFRP

Query:  ISLCNVIYKVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD
        ISLCNVIYK+I+K L NR+KQVL  IISP+QSAF+PGR ITDNVL+ +E +H +++R+ G+ GY+A+KLD+SKAYD
Subjt:  ISLCNVIYKVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]2.5e-8239.35Show/hide
Query:  PWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFHHSDHRAILLK
        PW++ GDFN  +   EK          I    +++  C L DLGF G   TW+  +     T+ RLDR +AN +  D  +  RV HL  H SDH  +LL 
Subjt:  PWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFHHSDHRAILLK

Query:  IDWGDTNQQQSFHKRLAKLEQSWL--------------------QGITRKELEIK-----------RLSNP------KDQAALDLL--------SKAEW-
        +     +Q +    R  K E+SWL                     G+   + +IK            +++P      + Q  LD L        SKAE+ 
Subjt:  IDWGDTNQQQSFHKRLAKLEQSWL--------------------QGITRKELEIK-----------RLSNP------KDQAALDLL--------SKAEW-

Query:  ----ELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLTSSHPNREDTKEVTKAITKKIS
            +++ LL+++E YW  R+R  WL+ GD+NTK+FH+KA+QR+++N I+GI NS+G WVE+++E+G    +Y+++L  +     +  +E   A+  K++
Subjt:  ----ELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLTSSHPNREDTKEVTKAITKKIS

Query:  EDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMNEFRPISLCNVIYKVI
        ED +  L   F+ EEV+ A+  M P+KAPGPDG +A+ +Q  W  +G+  V   L  LNN   +  IN T I LIPK ++P+ M+EFRPISLCNVIYK+I
Subjt:  EDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMNEFRPISLCNVIYKVI

Query:  AKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD
        +K LANR+KQVL  IIS +QSAF+PGR ITDNVLV +E +H ++ R+ G+ G VA+KLD+SKAYD
Subjt:  AKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD

TrEMBL top hitse value%identityAlignment
A0A2N9EX83 Reverse transcriptase domain-containing protein4.1e-8340.52Show/hide
Query:  LEKAHNLPWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFHHSD
        L   H LPW  GGDFNE++  +EK     +P   + +    +  C  +DLGF+G   TW   ++      ERLDR LA    L    + RV HLQ   SD
Subjt:  LEKAHNLPWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFHHSD

Query:  HRAILLKIDWGDTNQQQSFHKRLAKLEQSWLQGITRKELEIKRLSNPKDQAALDLLSKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQR
        H+ + +++ W   N +    ++  + E+ W    +  E  IK+    +   +L +  K   EL  L  +EE  W  R+R  WL+SGD+NTK+FH +AT R
Subjt:  HRAILLKIDWGDTNQQQSFHKRLAKLEQSWLQGITRKELEIKRLSNPKDQAALDLLSKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQR

Query:  KKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLTSSHPNREDTKEVTKAITKKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLW
        K+RN I GI +  G+W    +E+     EYY  L T+S P   D  E+   + + I+ D   +LDA F+  EVE A+  M P KAPGPDG   + +Q  W
Subjt:  KKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLTSSHPNREDTKEVTKAITKKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLW

Query:  ETMGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMNEFRPISLCNVIYKVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVL
          +G D   + L  L +   ++ IN T I LIPK ++P+++ +FRPISLCNVIYK+IAK LANR+K++L  IIS SQSAF+PGR I+DN+L+ FE +H +
Subjt:  ETMGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMNEFRPISLCNVIYKVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVL

Query:  NNRRTGRTGYVAIKLDMSKAYD
         + +  + GY+A+KLDMSKAYD
Subjt:  NNRRTGRTGYVAIKLDMSKAYD

A0A2N9HE46 Uncharacterized protein7.7e-8237.53Show/hide
Query:  MERLEKAHNLPWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFH
        + RL+   NLPW   GDFNE++  +EK+    +  R +    D +  C+ +DLGF G   TW  N+     T ERLDR +A P  L     +RV HL+  
Subjt:  MERLEKAHNLPWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFH

Query:  HSDHRAILLKID-----------------------------WGDTNQQQSFHKRLAKLE------QSWLQ---GITRKEL-EIKRLSNPKDQAALDLLSK
         SDH+ + +  D                             W    Q    H  + K+       ++W +   G  RK++ E++ L    +  ++  ++ 
Subjt:  HSDHRAILLKID-----------------------------WGDTNQQQSFHKRLAKLE------QSWLQ---GITRKEL-EIKRLSNPKDQAALDLLSK

Query:  AEW-----ELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLTSSHPNREDTKEVTKAIT
         ++     +L+ LL +EE  W  R+R EWLK+GD+NT++FH +ATQRK+RN++  + +  G+W     ++      YY+SL T+S+P   D   V + I 
Subjt:  AEW-----ELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLTSSHPNREDTKEVTKAIT

Query:  KKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMNEFRPISLCNVI
          ++    + L + F+ +EV +A+K M+P KAPGPDG   + +QN W  +GED +   LG LN+ + +  IN T + LIPK K+P+ + EFRPISLCNVI
Subjt:  KKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMNEFRPISLCNVI

Query:  YKVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD
        YK+I+K LANR+K +L  I+S SQSAF+PGR ITDN+LV FE +H + +++TG+TG +A+KLDMSKAYD
Subjt:  YKVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD

A0A2P6SDG4 Putative RNA-directed DNA polymerase3.5e-8237.29Show/hide
Query:  MERLEKAHNLPWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFH
        +++L    NLPW++GGD+NEI    +K  G  + +R +ND+ +++  C L D+ F+G R TW   +      R RLDR+  +    D+    RV HL   
Subjt:  MERLEKAHNLPWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFH

Query:  HSDHRAILLKIDWGDTNQQQSFHKRLAKLEQSWLQGITRKEL-----------------------------------------EIKRLSN----------
         SDH  ILL++      +++   KR  K E+ WL   T KE+                                         EI ++ N          
Subjt:  HSDHRAILLKIDWGDTNQQQSFHKRLAKLEQSWLQGITRKEL-----------------------------------------EIKRLSN----------

Query:  ----PKDQAALDLLSKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLTSSHPNR
             +D+ AL L      +L  LL +E+ +W  RA+  WLK GD NTK+FH +   RKK+N + G+FN+ G+W  +++E+      Y+N L TSS+P  
Subjt:  ----PKDQAALDLLSKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLTSSHPNR

Query:  EDTKEVTKAITKKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMN
        ED + + + +   +SE     L     +EEV  AIK M PSK+PGPDG     FQ  WE +G+D V        +KE +  +N T +ALIPK + P+ M+
Subjt:  EDTKEVTKAITKKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMN

Query:  EFRPISLCNVIYKVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD
        + RPISLCNV+YK+ +K LANR+K +LD++ISP QSAF+PGR I+DN L+ FE  H L  RR+G+ GY A+KLDMSKAYD
Subjt:  EFRPISLCNVIYKVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD

A0A803NML1 Uncharacterized protein7.0e-8337.2Show/hide
Query:  LPWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFHHSDHRAILL
        LPW++ GDFNEI+ N +K  G  +  + ++   D++  C+L    F GD+ TW K + N  A +ERLD    N     I K +   HL ++ SDHRAI +
Subjt:  LPWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFHHSDHRAILL

Query:  KIDWGDTNQQQSFHKRLAKLEQSWL-----------------------------------------------QGITRKELEIKRLSNPKDQ--AALDLLS
         ID+  +NQQQ   K   + E+ WL                                               + IT  + ++ RL+N  D+  + +D L 
Subjt:  KIDWGDTNQQQSFHKRLAKLEQSWL-----------------------------------------------QGITRKELEIKRLSNPKDQ--AALDLLS

Query:  KAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLTSSHPNREDTKEVTKAITKKIS
         +E  L++LL++EE+YWH R+R + L+ GD+NT +FH+ AT RK +N IK + N++G  V    E+    ++YY SL  S   + +   ++   +   I+
Subjt:  KAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLTSSHPNREDTKEVTKAITKKIS

Query:  EDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMNEFRPISLCNVIYKVI
         +    L   F+  EV +A++ MSP K+PG DG  AM +Q+ W  +G       LG+LN+   +  +NK++I LIPK  +P +M E+RPISLCNVIYK+I
Subjt:  EDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMNEFRPISLCNVIYKVI

Query:  AKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD
        +K + NR KQVL  +IS +QSAF+  R ITDN+L+ FE IH L +R+ GR GY A+KLDMSKA+D
Subjt:  AKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD

F4NCI4 Reverse transcriptase domain-containing protein1.3e-8137.39Show/hide
Query:  MERLEKAHNLPWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFH
        +  L +A   PW+ GGDFN ++   EKK G     R  +   +++  C+ +DLGF+G   TW  N+      +ERLDR++AN           V HL   
Subjt:  MERLEKAHNLPWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFH

Query:  HSDHRAILLKIDWGDTNQQQSFHKRLAKLEQSWLQ------------------GITRKELEIKRLSNPKD------------QAALDLLSKAE-------
         SDH  I+  +    +   ++   +  + E  WL+                  GI       K LS  K             Q  + +L ++E       
Subjt:  HSDHRAILLKIDWGDTNQQQSFHKRLAKLEQSWLQ------------------GITRKELEIKRLSNPKD------------QAALDLLSKAE-------

Query:  ------WELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLTSSHPNREDTKEVTKAITK
                +++L + EE YWH R+R++W+KSGDKNTK+FH KA+ R++RN ++ I N  G W ED  ++      Y+ +L  S   N  +   +   +  
Subjt:  ------WELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLTSSHPNREDTKEVTKAITK

Query:  KISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMNEFRPISLCNVIY
        +I+++   +LDA F REEV  A+  M P+KAPGPDG +A+ +Q+ W+T+GED     L +LNN + I  +N+T I LIPK K  ++  +FRPISLCNV+Y
Subjt:  KISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMNEFRPISLCNVIY

Query:  KVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD
        K++AK LANRMK VL  +I  SQS F+PGR ITDNVLV +EC H L  ++TG+ GY+ +KLDMSKAYD
Subjt:  KVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.4e-1627.96Show/hide
Query:  KLEQSWLQGITR--KELEIKRLSNPKDQAALDLLSKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKE
        K E+S +  +T   KELE +  ++ K     + ++K   EL+++  ++       +R  + +  +K  +       +++++N+I  I N +G    D  E
Subjt:  KLEQSWLQGITR--KELEIKRLSNPKDQAALDLLSKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKE

Query:  IGTATTEYYNSLLTSSHPNREDTKEVTKAIT-KKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEI
        I T   EYY  L  +   N E+        T  ++++++   L+   +  E+   I  +   K+PGPDG  A  +Q   E +    +K    I   KE I
Subjt:  IGTATTEYYNSLLTSSHPNREDTKEVTKAIT-KKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEI

Query:  EP--INKTLIALIPKT-KDPKTMNEFRPISLCNVIYKVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMS
         P    +  I LIPK  +D      FRPISL N+  K++ K LANR++Q +  +I   Q  FIPG Q   N+      I  +N  R     +V I +D  
Subjt:  EP--INKTLIALIPKT-KDPKTMNEFRPISLCNVIYKVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMS

Query:  KAYD
        KA+D
Subjt:  KAYD

P08548 LINE-1 reverse transcriptase homolog1.3e-1727.56Show/hide
Query:  QSFHKRLAKLEQSWLQGITRKELEIKRLSNPKDQAALDLLSKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIW
        Q+F K+  + E + L G   K+LE +  SNPK     + ++K   EL ++  +       +++  + +  +K  K   +   +++ ++ I  I N     
Subjt:  QSFHKRLAKLEQSWLQGITRKELEIKRLSNPKDQAALDLLSKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIW

Query:  VEDIKEIGTATTEYYNSLLTSSHPNREDTKEVTKAI-TKKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGIL
          D  EI     EYY  L +  + N ++  +  +A    ++S+ +   L+   S  E+   I+ +   K+PGPDG  +  +Q    T  E+ V   L + 
Subjt:  VEDIKEIGTATTEYYNSLLTSSHPNREDTKEVTKAI-TKKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGIL

Query:  NN--KEEIEP--INKTLIALIPKT-KDPKTMNEFRPISLCNVIYKVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGY
         N  KE I P    +  I LIPK  KDP     +RPISL N+  K++ K L NR++Q +  II   Q  FIPG Q   N+      I  +N  +     +
Subjt:  NN--KEEIEP--INKTLIALIPKT-KDPKTMNEFRPISLCNVIYKVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGY

Query:  VAIKLDMSKAYD
        + + +D  KA+D
Subjt:  VAIKLDMSKAYD

P11369 LINE-1 retrotransposable element ORF2 protein5.3e-1929.35Show/hide
Query:  KELEIKRLSNPKDQAALDLLSKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLT
        K LE K  ++PK     +++ K   E+ ++          + R  + +  +K  K         + +  I  I N +G    D +EI      +Y  L +
Subjt:  KELEIKRLSNPKDQAALDLLSKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLLT

Query:  SSHPNRED-TKEVTKAITKKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIE-----PINKTLIA
        +   N ++  K + +    K+++DQ   L++  S +E+E  I  +   K+PGPDG  A  +Q    T  ED +   L  L +K E+E        +  I 
Subjt:  SSHPNRED-TKEVTKAITKKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIE-----PINKTLIA

Query:  LIPK-TKDPKTMNEFRPISLCNVIYKVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD
        LIPK  KDP  +  FRPISL N+  K++ K LANR+++ +  II P Q  FIPG Q   N+      IH +N  +     ++ I LD  KA+D
Subjt:  LIPK-TKDPKTMNEFRPISLCNVIYKVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD

P14381 Transposon TX1 uncharacterized 149 kDa protein2.2e-2527.69Show/hide
Query:  QSFHKRLAKLEQSWLQGITRKELEI-KRLSNPKDQAALDLLSKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGI
        Q + K ++    + ++ +  + L++ +RLS  +DQA      + +  L  + + +     +R+R + L   D+ +++F++   ++  R +I  +F   G 
Subjt:  QSFHKRLAKLEQSWLQGITRKELEI-KRLSNPKDQAALDLLSKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGI

Query:  WVEDIKEIGTATTEYYNSLLTSSHPNREDTKEVTKAITKKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGIL
         +ED + I      +Y +L  S  P   D  E        +SE +K +L+   + +E+ +A++ M  +K+PG DG     FQ  W+T+G D  +      
Subjt:  WVEDIKEIGTATTEYYNSLLTSSHPNREDTKEVTKAITKKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGIL

Query:  NNKEEIEPINKTLIALIPKTKDPKTMNEFRPISLCNVIYKVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKL
           E      + +++L+PK  D + +  +RP+SL +  YK++AKA++ R+K VL  +I P QS  +PGR I DNV +  + +H    RRTG      + L
Subjt:  NNKEEIEPINKTLIALIPKTKDPKTMNEFRPISLCNVIYKVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKL

Query:  DMSKAYD
        D  KA+D
Subjt:  DMSKAYD

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)2.5e-0528.66Show/hide
Query:  SREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWET-MGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMNEFRPISLCNVIYKVIAKALANRMKQ
        +REE++ AIKG  PS APG DG   +  Q +  T +  + V+  L +L       P       LIPK  D +  + +RPI++ + + +++ + LA R++ 
Subjt:  SREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWET-MGEDTVKTCLGILNNKEEIEPINKTLIALIPKTKDPKTMNEFRPISLCNVIYKVIAKALANRMKQ

Query:  VLDTIISPSQSAF--IPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD
         ++  + P+Q  +  I G  +   +L  +     +++RR  R  Y  + LD+ KA+D
Subjt:  VLDTIISPSQSAF--IPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.3e-1725.89Show/hide
Query:  KELEIKRLSNPKDQA-ALDLLSKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLL
        + ++ + L+NP D    ++ +++ +W         ES++  ++R +WL+ GD NT++FH      + +N IK +     + VE++ ++      YY  LL
Subjt:  KELEIKRLSNPKDQA-ALDLLSKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWVEDIKEIGTATTEYYNSLL

Query:  TSSHP--NREDTKEVTKAITKKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKTLIALIP
         S       +  + +      + ++   ++L A  S +E+  A+  M  +KAPGPD   A  F   W  + + T+            ++  N T I LIP
Subjt:  TSSHP--NREDTKEVTKAITKKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKTLIALIP

Query:  KTKDPKTMNEFRPISLCNVIYKVI
        K      ++ FRP+S C V+YK+I
Subjt:  KTKDPKTMNEFRPISLCNVIYKVI

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.9e-0840.32Show/hide
Query:  LANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD
        +  R+K ++  +I P+Q++FIPGR  TDN++   E +H +  R+ G  G++ +KLD+ KAYD
Subjt:  LANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAGATTGGAAAAGGCTCACAACCTCCCCTGGATCATTGGAGGGGATTTCAACGAAATAATGTTCAACAAGGAGAAAAAAAGGGGTTACCCTAAGCCTCTTAGATT
TATCAATGATTTATGCGATTCCATTAGAATATGTAATCTTATTGACTTGGGTTTTATTGGTGACAGGTGCACTTGGGCTAAAAATAAAAGCAACCATGGAGCTACCCGAG
AGAGACTTGATAGATATTTAGCTAACCCCAAAATGCTTGATATTGTCAAGGACATGAGAGTGGAGCACCTCCAATTCCACCATTCGGACCATAGGGCTATCCTGCTCAAG
ATTGATTGGGGAGATACTAATCAACAACAATCCTTTCATAAGAGGTTGGCCAAATTGGAACAAAGCTGGTTGCAGGGCATTACCCGAAAGGAGCTCGAAATCAAGAGACT
GTCCAATCCTAAAGACCAAGCCGCGCTGGACCTTCTATCCAAAGCCGAATGGGAGCTGGAAAAGCTCCTGGAAGAAGAAGAAAGTTATTGGCACATGAGAGCTAGAGAGG
AGTGGCTTAAGAGTGGAGACAAGAACACAAAGTGGTTCCACTCAAAAGCCACTCAAAGAAAGAAAAGAAACGAGATAAAAGGCATCTTTAACAGTAGAGGAATCTGGGTC
GAAGATATCAAAGAGATAGGGACAGCGACTACTGAATACTATAACTCCCTTCTAACATCGTCACATCCAAACAGAGAAGACACCAAGGAAGTCACCAAGGCTATTACTAA
AAAGATTTCTGAGGATCAAAAGGCCAAGCTAGATGCCTCTTTTTCCAGAGAAGAAGTCGAGAAAGCTATTAAAGGAATGAGTCCTTCCAAAGCCCCGGGTCCTGACGGGG
CGCACGCTATGCTTTTCCAAAATTTGTGGGAGACCATGGGGGAGGATACAGTGAAAACATGCTTGGGGATCCTAAACAACAAGGAAGAGATTGAGCCTATAAATAAAACT
CTTATAGCGCTTATCCCTAAAACCAAGGACCCAAAGACCATGAATGAGTTCAGACCAATTAGCCTGTGCAATGTGATTTACAAAGTGATAGCTAAAGCCCTAGCGAATCG
CATGAAGCAGGTGCTCGACACGATCATTTCCCCTTCCCAATCGGCATTCATTCCTGGAAGACAAATCACTGATAACGTTCTGGTGGGGTTCGAATGTATCCATGTGTTGA
ATAATAGAAGAACTGGAAGAACCGGATATGTGGCTATCAAACTCGACATGAGTAAAGCCTATGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAGATTGGAAAAGGCTCACAACCTCCCCTGGATCATTGGAGGGGATTTCAACGAAATAATGTTCAACAAGGAGAAAAAAAGGGGTTACCCTAAGCCTCTTAGATT
TATCAATGATTTATGCGATTCCATTAGAATATGTAATCTTATTGACTTGGGTTTTATTGGTGACAGGTGCACTTGGGCTAAAAATAAAAGCAACCATGGAGCTACCCGAG
AGAGACTTGATAGATATTTAGCTAACCCCAAAATGCTTGATATTGTCAAGGACATGAGAGTGGAGCACCTCCAATTCCACCATTCGGACCATAGGGCTATCCTGCTCAAG
ATTGATTGGGGAGATACTAATCAACAACAATCCTTTCATAAGAGGTTGGCCAAATTGGAACAAAGCTGGTTGCAGGGCATTACCCGAAAGGAGCTCGAAATCAAGAGACT
GTCCAATCCTAAAGACCAAGCCGCGCTGGACCTTCTATCCAAAGCCGAATGGGAGCTGGAAAAGCTCCTGGAAGAAGAAGAAAGTTATTGGCACATGAGAGCTAGAGAGG
AGTGGCTTAAGAGTGGAGACAAGAACACAAAGTGGTTCCACTCAAAAGCCACTCAAAGAAAGAAAAGAAACGAGATAAAAGGCATCTTTAACAGTAGAGGAATCTGGGTC
GAAGATATCAAAGAGATAGGGACAGCGACTACTGAATACTATAACTCCCTTCTAACATCGTCACATCCAAACAGAGAAGACACCAAGGAAGTCACCAAGGCTATTACTAA
AAAGATTTCTGAGGATCAAAAGGCCAAGCTAGATGCCTCTTTTTCCAGAGAAGAAGTCGAGAAAGCTATTAAAGGAATGAGTCCTTCCAAAGCCCCGGGTCCTGACGGGG
CGCACGCTATGCTTTTCCAAAATTTGTGGGAGACCATGGGGGAGGATACAGTGAAAACATGCTTGGGGATCCTAAACAACAAGGAAGAGATTGAGCCTATAAATAAAACT
CTTATAGCGCTTATCCCTAAAACCAAGGACCCAAAGACCATGAATGAGTTCAGACCAATTAGCCTGTGCAATGTGATTTACAAAGTGATAGCTAAAGCCCTAGCGAATCG
CATGAAGCAGGTGCTCGACACGATCATTTCCCCTTCCCAATCGGCATTCATTCCTGGAAGACAAATCACTGATAACGTTCTGGTGGGGTTCGAATGTATCCATGTGTTGA
ATAATAGAAGAACTGGAAGAACCGGATATGTGGCTATCAAACTCGACATGAGTAAAGCCTATGACTGA
Protein sequenceShow/hide protein sequence
MERLEKAHNLPWIIGGDFNEIMFNKEKKRGYPKPLRFINDLCDSIRICNLIDLGFIGDRCTWAKNKSNHGATRERLDRYLANPKMLDIVKDMRVEHLQFHHSDHRAILLK
IDWGDTNQQQSFHKRLAKLEQSWLQGITRKELEIKRLSNPKDQAALDLLSKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATQRKKRNEIKGIFNSRGIWV
EDIKEIGTATTEYYNSLLTSSHPNREDTKEVTKAITKKISEDQKAKLDASFSREEVEKAIKGMSPSKAPGPDGAHAMLFQNLWETMGEDTVKTCLGILNNKEEIEPINKT
LIALIPKTKDPKTMNEFRPISLCNVIYKVIAKALANRMKQVLDTIISPSQSAFIPGRQITDNVLVGFECIHVLNNRRTGRTGYVAIKLDMSKAYD