CuGenDBv2

Lag0000497 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0000497
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr4:8954870..8955938
RNA-Seq Expression: Lag0000497
Synteny: Lag0000497
Gene Ontology terms: NA
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits | e value | % identity
XP_010682933.1 PREDICTED: uncharacterized protein LOC104897695 [Beta vulgaris subsp. vulgaris] | 2.3e-96 | 49.58
Query:  MDSFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGD
        M  FRPISLC V+YKI+SK +ANR+K FL  +IS  +SAF+P RLITDN +  F+  H+      GK+G +A KLDMSKAYDRVE +F+ +VM RL F +
Subjt:  MDSFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGD

Query:  SWIKKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSAL--------------------------FNNDSLILLKANRSNCEVVK
         W++++M+C+ SV YS ++NG       P RGLRQGDPLSPYLFL C E  SAL                          F +DS++  +A    C VV 
Subjt:  SWIKKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSAL--------------------------FNNDSLILLKANRSNCEVVK

Query:  KILGEYKMASGQSINFEKSACMISKNIDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAI
         IL  Y+ ASGQ INF+KS    SKN+D ++  D+    GV+ V+    YLG+P+  GR K  VF  LKER+WK LQGWKE+L S  GKE+L+KAV+Q+I
Subjt:  KILGEYKMASGQSINFEKSACMISKNIDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAI

Query:  PTYTMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLGF
        PTY MS F +P+ I  EIN +CARFWWGS G +++ HWL+W+K+C  K  GG+GF
Subjt:  PTYTMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLGF

XP_023913142.1 uncharacterized protein LOC112024740 [Quercus suber] | 1.8e-96 | 50.42
Query:  MDSFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGD
        M  FRPISLCNVIYKIISK +ANR+K  L  +ISP +SAF+P RLITDNVLL ++ +HA + ++ GK   +A+KLD+SKAYDRVE  F++ +M RL F +
Subjt:  MDSFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGD

Query:  SWIKKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSA--------------------------LFNNDSLILLKANRSNCEVVK
         WI +VM CV +  +SV ING       P RGLRQGDPLSPYLFL C EG ++                          LF +DSLI  +AN+   +V+ 
Subjt:  SWIKKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSA--------------------------LFNNDSLILLKANRSNCEVVK

Query:  KILGEYKMASGQSINFEKSACMISKNIDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAI
          L  Y  ASGQ INFEKS+   S N    + + +   LGV+ VD    YLG+P+  GR K++ F  LKER+WK LQGW+ RL S  GKE+LIKAV Q+I
Subjt:  KILGEYKMASGQSINFEKSACMISKNIDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAI

Query:  PTYTMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLGF
        PTYTM  F LP ++C E++ LCARFWWG  GD++K HW +WK L   K+ GG+GF
Subjt:  PTYTMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLGF

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata] | 5.0e-99 | 50.7
Query:  MDSFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGD
        M  FRPISLCNVIYKIISK +ANR+K  L  IIS  +SAF+P RLITDNVL+ ++ +H  ++++ GK+G +A+KLD+SKAYDRVE  F++ +ME++ F  
Subjt:  MDSFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGD

Query:  SWIKKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSALFN--------------------------NDSLILLKANRSNCEVVK
         WI++VM CV +  +S+ +NG P    QP RG+RQGDP+SPYLFL C EGL+AL N                          +DSL+  +A R+  E + 
Subjt:  SWIKKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSALFN--------------------------NDSLILLKANRSNCEVVK

Query:  KILGEYKMASGQSINFEKSACMISKNIDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAI
        +IL  Y+ ASGQSIN EKS+   S N    +   +   LGVK VD    YLG+P+  GR K+  F +LK+R+WK LQGWK  L S  GKEILIKAV QAI
Subjt:  KILGEYKMASGQSINFEKSACMISKNIDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAI

Query:  PTYTMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLGF
        PTYTMS F++P ++C E+  LCARFWWG VG+++K HW +W KL A K+ GG+GF
Subjt:  PTYTMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLGF

XP_030923826.1 uncharacterized protein LOC115950728 [Quercus lobata] | 2.5e-98 | 49.3
Query:  MDSFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGD
        M  +RPISLCNVIYKIISK +ANR+K  L  IISP +SAFIP RLITDN+++ ++C+HA + ++ GK+G +A+KLD+SKAYDRVE AF++ +ME++ F +
Subjt:  MDSFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGD

Query:  SWIKKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSA--------------------------LFNNDSLILLKANRSNCEVVK
         WI +VM CV +  +SV ING P     P RG+RQGDPLSPYLFL C EG ++                          LF +DSL+  KA +   +V+ 
Subjt:  SWIKKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSA--------------------------LFNNDSLILLKANRSNCEVVK

Query:  KILGEYKMASGQSINFEKSACMISKNIDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAI
        ++L  Y  ASGQ INFEKS+     N + +  + +   LGVK V     YLG+P+  GR K++ F  LK+R+WK LQGWK ++ S  GKE+LIKAV Q+I
Subjt:  KILGEYKMASGQSINFEKSACMISKNIDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAI

Query:  PTYTMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLGF
        PTYTM  F+LP ++C E+N +CARFWWG VGD++K HW +W+ L   K++GG+GF
Subjt:  PTYTMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLGF

XP_030931246.1 uncharacterized protein LOC115957168 [Quercus lobata] | 3.0e-96 | 49.58
Query:  MDSFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGD
        M  +RPISLCNVIYKIISK +AN++K  L  IIS  +SAF+PSRLITDN+L+ ++C+HA + ++ GK+G IA+KLD+SKAYDRVE AF++ +ME++ F  
Subjt:  MDSFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGD

Query:  SWIKKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSA--------------------------LFNNDSLILLKANRSNCEVVK
         WI +VM CV +  +SV ING P     P RG+RQGDPLSPYLFL C EG ++                          LF +DSL+  +ANR   + + 
Subjt:  SWIKKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSA--------------------------LFNNDSLILLKANRSNCEVVK

Query:  KILGEYKMASGQSINFEKSACMISKNIDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAI
        +IL  Y  ASGQ IN EKS+   S N +    +++   LGV+ V     YLG+P+  GR K++ F  LK+RIWK LQGWK +L S  GKE+LIKAV Q+I
Subjt:  KILGEYKMASGQSINFEKSACMISKNIDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAI

Query:  PTYTMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLGF
        PTYTM  F LP ++C E+N +CARFWWG VGD++K HW +W  +   K  GG+GF
Subjt:  PTYTMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLGF

TrEMBL top hits | e value | % identity
A0A2N9EK17 Reverse transcriptase domain-containing protein | 1.1e-99 | 54.24
Query:  MDSFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGD
        +  FRPISLCNV+YKIISK +ANRMK  L  +IS  +SAF+P RLITDN+L+ F+ +H  N+KR GK  ++A KLDMSKAYDRVE  +++ VM ++ F  
Subjt:  MDSFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGD

Query:  SWIKKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSALFN-NDSLILLKANRSNCEVVKKILGEYKMASGQSINFEKSACMISK
         W+  +M+C+ +V YSV ING P    +P RGLRQGDPLSPYLFL C EGLSALF   +    L A +S+   +K+IL  Y+MASGQ +NFEKSA   SK
Subjt:  SWIKKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSALFN-NDSLILLKANRSNCEVVKKILGEYKMASGQSINFEKSACMISK

Query:  NIDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAIPTYTMSCFKLPNRICEEINKLCARF
        N  +   +D+S  LG  L    G YLG+P   GR+K + F + K RIWK LQGWK +L S  G+E+LIKAV  AIPTY MSCFK+P  +C EI  L +RF
Subjt:  NIDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAIPTYTMSCFKLPNRICEEINKLCARF

Query:  WWGSVGDKKKAHWLNWKKLCASKENGGLGF
        WWG  GD++K HW+ W+KL   K+ GG+GF
Subjt:  WWGSVGDKKKAHWLNWKKLCASKENGGLGF

A0A2N9ESR2 Uncharacterized protein | 2.0e-98 | 50.28
Query:  FRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGDSWI
        +RPISLCNVIYK+ISK +ANR+K  L  +IS  +SAF+P RLITDN+L+ F+ +H  +++RSGK G +A+KLDMSKAYDRVE  F+++VM R+ F ++W 
Subjt:  FRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGDSWI

Query:  KKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSAL--------------------------FNNDSLILLKANRSNCEVVKKIL
          +M C+ +V YS+ ING PT    P RGLRQGDP+SPYLFL C EGL+ L                          F +DSL+  +A R  C+ ++ IL
Subjt:  KKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSAL--------------------------FNNDSLILLKANRSNCEVVKKIL

Query:  GEYKMASGQSINFEKSACMISKNIDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAIPTY
          Y+ ASGQ +N EK+    S+N  +A  ++L   LGV  +     YLG+PS  G+ K   F ++KER+W  ++GWKE+L S  G+EILIKAVVQAIPTY
Subjt:  GEYKMASGQSINFEKSACMISKNIDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAIPTY

Query:  TMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLGF
        TM+CFKLP  +C+EI  L  RFWWG  GD+ K HWL W+KLC  K  GGLGF
Subjt:  TMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLGF

A0A2N9EVW3 Uncharacterized protein | 4.6e-98 | 48.86
Query:  FRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGDSWI
        +RPISLCNV+YK+ISK +ANR+K  L  IIS  +SAF+P RLITDN+L+ F+ +H  ++KR+GK G +A+KLDMSKAYDRVE  F+ KVM+R+ F + WI
Subjt:  FRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGDSWI

Query:  KKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSAL--------------------------FNNDSLILLKANRSNCEVVKKIL
          +++C+ SV YS+ ING PT    P RGLRQGDP+SPYLFL C EGL+ L                          F +DSL+  +A+   C+ +++IL
Subjt:  KKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSAL--------------------------FNNDSLILLKANRSNCEVVKKIL

Query:  GEYKMASGQSINFEKSACMISKNIDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAIPTY
          Y+ ASGQ +N  K+    S+N  +A  +D+   LGV  +     YLG+PS  G+ K   F ++KER+W  ++GWKE+L S  G+E+LIKAV+QAIPTY
Subjt:  GEYKMASGQSINFEKSACMISKNIDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAIPTY

Query:  TMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLGF
        TM+CFKLP  +C+EI  +  RFWWG   D++K HW+ W+K+C SK  GGLGF
Subjt:  TMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLGF

A0A2N9HYS7 Uncharacterized protein | 8.6e-97 | 53.19
Query:  FRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGDSWI
        +RPISLCNVIYK+ISK +ANR+K  L  +IS  +SAF+P RLITDN+L+ F+ +H  +++R GK G +A+KLDMSKAYDRVE  F+++VM R+ F D WI
Subjt:  FRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGDSWI

Query:  KKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSALFNNDSL---ILLKANRSNCEVVKKILGEYKMASGQSINFEKSACMISKN
          +M+C+ +V YS+ ING PT    P RGLRQGDP+SPYLFL C EGL+ L    S    + L A RS     ++    Y+ ASGQ +N  K+    SKN
Subjt:  KKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSALFNNDSL---ILLKANRSNCEVVKKILGEYKMASGQSINFEKSACMISKN

Query:  IDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAIPTYTMSCFKLPNRICEEINKLCARFW
          +AK +++   LGV  +     YLG+PS  G+ K   F ++KER+W  ++GWKE+L S  G+EILIKAVVQAIPTYTM+CFKLP  +C+EI  +  RFW
Subjt:  IDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAIPTYTMSCFKLPNRICEEINKLCARFW

Query:  WGSVGDKKKAHWLNWKKLCASKENGGLGF
        WG  GDK+K HWL W+KLC SK  GGLGF
Subjt:  WGSVGDKKKAHWLNWKKLCASKENGGLGF

A0A2N9I9F4 Reverse transcriptase domain-containing protein | 9.2e-99 | 50.28
Query:  FRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGDSWI
        +RPISLCNVIYK+ISK +ANR+K  L  +IS  +SAF+P RLITDN+L+ F+ +H  +++RSGK G +A+KLDMSKAYDRVE  F+++VM R+ F ++W 
Subjt:  FRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGDSWI

Query:  KKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSAL--------------------------FNNDSLILLKANRSNCEVVKKIL
          +M C+ +V YS+ ING PT    P RGLRQGDP+SPYLFL C EGL+ L                          F +DSL+  +A R  C+ ++ IL
Subjt:  KKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSAL--------------------------FNNDSLILLKANRSNCEVVKKIL

Query:  GEYKMASGQSINFEKSACMISKNIDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAIPTY
          Y+ ASGQ +N EK+    S+N  +A  ++L   LGV  +     YLG+PS  G+ K   F ++KER+W  ++GWKE+L S  G+EILIKAVVQAIPTY
Subjt:  GEYKMASGQSINFEKSACMISKNIDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAIPTY

Query:  TMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLGF
        TM+CFKLP  +C+EI  L  RFWWG  GD++K HWL W+KLC  K  GGLGF
Subjt:  TMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLGF

SwissProt top hits | e value | % identity
O00370 LINE-1 retrotransposable element ORF2 protein | 1.4e-19 | 23.31
Query:  DSFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGDS
        ++FRPISL N+  KI++K +ANR++  +  +I   +  FIP      N+      I   N  R+  + ++ + +D  KA+D+++  F+ K + +L     
Subjt:  DSFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGDS

Query:  WIKKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLS---------------------ALFNNDSLILLKANRSNCEVVKKILGEY
        ++K +    +    ++ +NG     F    G RQG PLSP LF   +E L+                     +LF +D ++ L+    + + + K++  +
Subjt:  WIKKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLS---------------------ALFNNDSLILLKANRSNCEVVKKILGEY

Query:  KMASGQSINFEKSACMISKNIDRAKAK---DLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVV--QAIP
           SG  IN +KS   +  N  + +++   +L   +  K +  LG  L    +   +++  +  L + I +    WK    S  G+  ++K  +  + I 
Subjt:  KMASGQSINFEKSACMISKNIDRAKAK---DLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVV--QAIP

Query:  TYTMSCFKLPNRICEEINKLCARFWW
         +     KLP     E+ K   +F W
Subjt:  TYTMSCFKLPNRICEEINKLCARFWW

P08548 LINE-1 reverse transcriptase homolog | 1.4e-19 | 23.62
Query:  DSFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGDS
        +++RPISL N+  KI++K + NR++  +  II   +  FIP      N+      I   N  ++  + ++ + +D  KA+D ++  F+ + ++++    +
Subjt:  DSFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGDS

Query:  WIKKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLS---------------------ALFNNDSLILLKANRSNCEVVKKILGEY
        ++K +         ++ +NG     F    G RQG PLSP LF   +E L+                     +LF +D ++ L+  R +   + +++ EY
Subjt:  WIKKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLS---------------------ALFNNDSLILLKANRSNCEVVKKILGEY

Query:  KMASGQSINFEKSACMISKNIDRAK--AKD-LSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVV--QAIP
           SG  IN  KS   I  N ++A+   KD +   +  K +  LG YL    +   +++  +  L++ I + +  WK    S  G+  ++K  +  +AI 
Subjt:  KMASGQSINFEKSACMISKNIDRAK--AKD-LSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVV--QAIP

Query:  TYTMSCFKLPNRICEEINKLCARFWW
         +     K P    +++ K+   F W
Subjt:  TYTMSCFKLPNRICEEINKLCARFWW

P0C2F6 Putative ribonuclease H protein At1g65750 | 5.5e-16 | 37.25
Query:  MPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAIPTYTMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGG
        MP    R     F ++ ER+   + GW+E+  S  G+  L KAV+ ++P ++MS   LP  I   +++L   F WGS  +KKK H + W K+C+ K+ GG
Subjt:  MPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAIPTYTMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGG

Query:  LG
        LG
Subjt:  LG

P11369 LINE-1 retrotransposable element ORF2 protein | 1.5e-21 | 24.62
Query:  MDSFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGD
        +++FRPISL N+  KI++K +ANR++  +  II P +  FIP      N+      IH  N  +   + ++ + LD  KA+D+++  F+ KV+ER     
Subjt:  MDSFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGD

Query:  SWIKKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLS---------------------ALFNNDSLILLKANRSNCEVVKKILGE
         ++  +         ++++NG          G RQG PLSPYLF   +E L+                     +L  +D ++ +   +++   +  ++  
Subjt:  SWIKKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLS---------------------ALFNNDSLILLKANRSNCEVVKKILGE

Query:  YKMASGQSINFEKS-ACMISKNIDRAKAKDLSCCLGVKLVDLLGYYLG--MPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVV--QAI
        +    G  IN  KS A + +KN  +   K++       +V     YLG  +  +      K F  LK+ I + L+ WK+   S  G+  ++K  +  +AI
Subjt:  YKMASGQSINFEKS-ACMISKNIDRAKAKDLSCCLGVKLVDLLGYYLG--MPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVV--QAI

Query:  PTYTMSCFKLPNRICEEINKLCARFWWGS
          +     K+P +   E+     +F W +
Subjt:  PTYTMSCFKLPNRICEEINKLCARFWWGS

P14381 Transposon TX1 uncharacterized 149 kDa protein | 1.8e-22 | 26.29
Query:  SFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGDSW
        ++RP+SL +  YKI++KAI+ R+K  L  +I P +S  +P R I DNV L    +H   ++R+G      + LD  KA+DRV+  ++   ++   FG  +
Subjt:  SFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGDSW

Query:  IKKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSALFNN--------------------DSLILLKANRSNCEVVKKILGEYKM
        +  +     S    V+IN   T+    GRG+RQG PLS  L+   +E    L                       D +IL+  +  + E  ++    Y  
Subjt:  IKKVMKCVESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSALFNN--------------------DSLILLKANRSNCEVVKKILGEYKM

Query:  ASGQSINFEKSACMI--SKNIDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWK--ERLFSMGGKEILIKAVVQAIPTYT
        AS   IN+ KS+ ++  S  +D             K++  LG YL        +    F +L+E +   L  WK   ++ SM G+ ++I  +V +   Y 
Subjt:  ASGQSINFEKSACMI--SKNIDRAKAKDLSCCLGVKLVDLLGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWK--ERLFSMGGKEILIKAVVQAIPTYT

Query:  MSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLG
        + C         +I +    F W  +G     HW++        + GG G
Subjt:  MSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLG

Arabidopsis top hits | e value | % identity
AT4G20520.1 RNA binding;RNA-directed DNA polymerases | 9.3e-11 | 22.22
Query:  IANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGDSWIKKVMKCVESVRYSVQIN
        +  R+K  +  +I P +++FIP R+ TDN++   + +H+   K+ G +G++ +KLD+ KAYDR+   ++   +    F + W+ ++ +     R      
Subjt:  IANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGDSWIKKVMKCVESVRYSVQIN

Query:  GFPTSKFQP-----GRGLRQGDPLSPYL--FLFCVEGLSALFNNDSL----ILLKANRSNCEVVKKILGEYKMASGQSINFEKSACMISKNIDRAKAKDL
        G   +  +P       G R  D  +P+    + C E L  +     +    + LK  +      +K +    +A    + +    C+ S +++   AKD 
Subjt:  GFPTSKFQP-----GRGLRQGDPLSPYL--FLFCVEGLSALFNNDSL----ILLKANRSNCEVVKKILGEYKMASGQSINFEKSACMISKNIDRAKAKDL

Query:  SCCLGVK
           L V+
Subjt:  SCCLGVK

AT4G29090.1 Ribonuclease H-like superfamily protein | 3.2e-11 | 45.61
Query:  AIPTYTMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLGF
        A+PTYTM+CF LP  +C++I  + A FWW +  + K  HW  W  L   K  GG+GF
Subjt:  AIPTYTMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLGF

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 1.3e-12 | 48.28
Query:  AIPTYTMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKE-NGGLGF
        A+P Y MSCF+L   +C+++      FWW S  +K+K  W+ W+KLC SKE +GGLGF
Subjt:  AIPTYTMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKE-NGGLGF

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | 2.8e-07 | 66.67
Query:  INGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSAL
        ING P     P RGLRQGDPLSPYLF+ C E LS L
Subjt:  INGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSAL


Sequences
CDS sequence
ATGGATAGTTTCAGGCCTATTAGCTTGTGTAACGTGATCTATAAGATAATTTCAAAAGCTATAGCTAATAGGATGAAACATTTCCTCGATACTATTATCTCCCCTTGCAA
GTCGGCTTTTATCCCCAGTAGACTTATTACAGACAATGTGCTGCTAGGGTTTAAATGCATTCATGCCAGAAATAGTAAAAGATCCGGAAAAGAGGGCTACATTGCTATGA
AGCTGGATATGAGCAAAGCTTACGATAGGGTTGAATTGGCTTTCATTCGGAAAGTTATGGAAAGGCTTGATTTTGGAGACAGCTGGATCAAGAAAGTTATGAAGTGTGTT
GAATCGGTTAGATATTCTGTTCAAATCAACGGCTTCCCCACCTCGAAATTCCAACCCGGCAGAGGGCTTCGTCAAGGAGATCCATTATCCCCCTACCTGTTTCTTTTTTG
TGTAGAAGGGCTATCAGCACTGTTTAACAATGACAGCCTGATCCTGTTGAAAGCTAATCGAAGTAATTGCGAAGTGGTGAAGAAGATCCTGGGGGAATACAAAATGGCTT
CGGGCCAATCCATAAACTTTGAGAAGTCGGCTTGTATGATCAGTAAAAACATTGACAGAGCCAAAGCTAAAGATCTCAGTTGCTGCCTTGGGGTTAAGCTTGTGGATTTG
TTGGGGTACTATCTGGGTATGCCCTCTCAAACGGGTCGTAGGAAGCATAAAGTGTTTTGGAAGTTGAAGGAAAGAATTTGGAAGACCCTGCAAGGTTGGAAAGAACGGTT
GTTCTCTATGGGAGGCAAGGAGATTCTTATCAAAGCAGTGGTCCAAGCTATCCCAACTTATACTATGAGTTGCTTTAAGCTTCCAAATCGTATTTGTGAGGAGATTAACA
AGCTTTGCGCCAGATTTTGGTGGGGTTCAGTGGGAGATAAGAAAAAGGCTCATTGGTTGAATTGGAAGAAACTTTGTGCGAGCAAGGAGAATGGAGGCCTTGGTTTCTGA
mRNA sequence
ATGGATAGTTTCAGGCCTATTAGCTTGTGTAACGTGATCTATAAGATAATTTCAAAAGCTATAGCTAATAGGATGAAACATTTCCTCGATACTATTATCTCCCCTTGCAA
GTCGGCTTTTATCCCCAGTAGACTTATTACAGACAATGTGCTGCTAGGGTTTAAATGCATTCATGCCAGAAATAGTAAAAGATCCGGAAAAGAGGGCTACATTGCTATGA
AGCTGGATATGAGCAAAGCTTACGATAGGGTTGAATTGGCTTTCATTCGGAAAGTTATGGAAAGGCTTGATTTTGGAGACAGCTGGATCAAGAAAGTTATGAAGTGTGTT
GAATCGGTTAGATATTCTGTTCAAATCAACGGCTTCCCCACCTCGAAATTCCAACCCGGCAGAGGGCTTCGTCAAGGAGATCCATTATCCCCCTACCTGTTTCTTTTTTG
TGTAGAAGGGCTATCAGCACTGTTTAACAATGACAGCCTGATCCTGTTGAAAGCTAATCGAAGTAATTGCGAAGTGGTGAAGAAGATCCTGGGGGAATACAAAATGGCTT
CGGGCCAATCCATAAACTTTGAGAAGTCGGCTTGTATGATCAGTAAAAACATTGACAGAGCCAAAGCTAAAGATCTCAGTTGCTGCCTTGGGGTTAAGCTTGTGGATTTG
TTGGGGTACTATCTGGGTATGCCCTCTCAAACGGGTCGTAGGAAGCATAAAGTGTTTTGGAAGTTGAAGGAAAGAATTTGGAAGACCCTGCAAGGTTGGAAAGAACGGTT
GTTCTCTATGGGAGGCAAGGAGATTCTTATCAAAGCAGTGGTCCAAGCTATCCCAACTTATACTATGAGTTGCTTTAAGCTTCCAAATCGTATTTGTGAGGAGATTAACA
AGCTTTGCGCCAGATTTTGGTGGGGTTCAGTGGGAGATAAGAAAAAGGCTCATTGGTTGAATTGGAAGAAACTTTGTGCGAGCAAGGAGAATGGAGGCCTTGGTTTCTGA
Protein sequence
MDSFRPISLCNVIYKIISKAIANRMKHFLDTIISPCKSAFIPSRLITDNVLLGFKCIHARNSKRSGKEGYIAMKLDMSKAYDRVELAFIRKVMERLDFGDSWIKKVMKCV
ESVRYSVQINGFPTSKFQPGRGLRQGDPLSPYLFLFCVEGLSALFNNDSLILLKANRSNCEVVKKILGEYKMASGQSINFEKSACMISKNIDRAKAKDLSCCLGVKLVDL
LGYYLGMPSQTGRRKHKVFWKLKERIWKTLQGWKERLFSMGGKEILIKAVVQAIPTYTMSCFKLPNRICEEINKLCARFWWGSVGDKKKAHWLNWKKLCASKENGGLGF
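
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the function name `translate` is hypothetical, and it assumes the standard code (NCBI translation table 1) applies to this nuclear gene. The example input is the first 30 nucleotides of the CDS above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence into one-letter amino acid codes.

    Whitespace and line breaks are ignored, so the wrapped sequence
    lines from the record can be pasted in directly.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt of the CDS sequence listed in this record.
print(translate("ATGGATAGTTTCAGGCCTATTAGCTTGTGT"))  # MDSFRPISLC
```

Applied to the full CDS, the output should match the protein sequence followed by a trailing `*` for the terminal TGA stop codon.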