; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016234 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016234
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr12:35108536..35113466
RNA-Seq ExpressionLag0016234
SyntenyLag0016234
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]4.2e-9831.29Show/hide
Query:  MIVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGIL----NEN----------------------------KGKTTWASLKLDMSKAYDRVEWV
        +I LIPK + P  VSE+RPISLC   YK+++K + NR+K +L     EN                            KG+    +LKLDM+KAYDRVEWV
Subjt:  MIVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGIL----NEN----------------------------KGKTTWASLKLDMSKAYDRVEWV

Query:  FLEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLRGR-------GVQ-------------------
        FL ++MLK+GF+  WV  +  CIS+  FS    G   G ++P RGLRQG PLSPYLFL+C EG S +LRG        GVQ                   
Subjt:  FLEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLRGR-------GVQ-------------------

Query:  --------------------ENSQNRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVG
                            E S  +++  K    +S        D +  +L V V   H +YLGLP+   +GR      +KD++W+ I GWK KL S  
Subjt:  --------------------ENSQNRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVG

Query:  GREVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIVQNP----------
        G+E+L+K V+QAIP YSM+CFR+PK L  ++N  MARFWW   K  RGIHWV W+ LCK K  GG+GFRD+E FNQALLAKQCWRI++ P          
Subjt:  GREVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIVQNP----------

Query:  --LPSWACSEGQI-----FSFRRFLEG------------GCGIETVLYME---------EVRSPVTLAPETRVADLMTTSGQWNEGLIRQHFRPYEVSLI
           PS    E ++     F +R    G            G G+   +Y +         ++ SP  L   T V DL T+SGQWN  L++  F   EV   
Subjt:  --LPSWACSEGQI-----FSFRRFLEG------------GCGIETVLYME---------EVRSPVTLAPETRVADLMTTSGQWNEGLIRQHFRPYEVSLI

Query:  LSIPVRA-GAEDRIVWHYEQSG----------------PVFGEKQISVGSSS-------------------LACA-----TSVFLFER--------VRNG
        L IP+ +    D ++WHYE++G                 + GE  + V  +S                     CA         LF R            
Subjt:  LSIPVRA-GAEDRIVWHYEQSG----------------PVFGEKQISVGSSS-------------------LACA-----TSVFLFER--------VRNG

Query:  RAGESSLHVFWHCKFVKTVLMESEFGSL-----------IHNGPRTSGGTSGVGGVLYL----------VVPAGASACGVGL------------------
        R  ES LH  W C+  K V   S +G++           + +  + S      G   YL           +  G S     L                  
Subjt:  RAGESSLHVFWHCKFVKTVLMESEFGSL-----------IHNGPRTSGGTSGVGGVLYL----------VVPAGASACGVGL------------------

Query:  QSVRAREDV------RWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSTSLVQRHVRSPEMAEGWARSRGETGSGD-GLCPLVLETDSSRVA
         ++  R+         W PP AG YK+NVD + +      G+GVVVR+++G  M +     +        E  A   G   + D G    VLE D+    
Subjt:  QSVRAREDV------RWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSTSLVQRHVRSPEMAEGWARSRGETGSGD-GLCPLVLETDSSRVA

Query:  SFFQDGLGMTSRCGWPSG----ELRRDMPGPSFFSCRFTRREGNEVAHQLAFLAGRDEESRVWFESVPSCVETLILSDM
            + +  T  C    G    E+   +       C++T R GN+VAH LA  A    E   W E  P  +  ++ +D+
Subjt:  SFFQDGLGMTSRCGWPSG----ELRRDMPGPSFFSCRFTRREGNEVAHQLAFLAGRDEESRVWFESVPSCVETLILSDM

XP_023909336.1 uncharacterized protein LOC112020997 [Quercus suber]7.9e-9730.55Show/hide
Query:  IVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGIL--------------------------------NENKGKTTWASLKLDMSKAYDRVEWVF
        I LIPK ++P    ++RPISLCNV YKL+SK + NR+K  L                                N+ KGKT + +LKLDMSKAYDRVEW F
Subjt:  IVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGIL--------------------------------NENKGKTTWASLKLDMSKAYDRVEWVF

Query:  LEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLRGRGV----------------------------
        LE +M K+GFA +W+DLI  CIS+V FS  +NG   G + P RGLRQGDPLSPYLFLLCAEGL ++++                                
Subjt:  LEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLRGRGV----------------------------

Query:  ------------------QENSQNRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGG
                          +  S  R++R K     SS      ++ +   L V V+    +YLGLPSF+ RG+  S S+I++R+WQ+IQGWK KL S  G
Subjt:  ------------------QENSQNRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGG

Query:  REVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIVQNP-----------
        +EVL+K ++QA+P YSMNCF+LP+ L  DI   + +FWWG     R  HWV+W  +C PKC GG+GFRD+E FN ALL KQ WR++ N            
Subjt:  REVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIVQNP-----------

Query:  -LPSWACSEGQI-----FSFRRFLEG------------GCGIETVL---------YMEEVRSPVTLAP-ETRVADLMTTSG-QWNEGLIRQHFRPYEVSL
          P+ +  +  +     ++++  L+             G G   ++         +  +V SP    P   +V  L+  +G  W+   IR  F P E   
Subjt:  -LPSWACSEGQI-----FSFRRFLEG------------GCGIETVL---------YMEEVRSPVTLAP-ETRVADLMTTSG-QWNEGLIRQHFRPYEVSL

Query:  ILSIPVRAGAE-DRIVWHYEQSGPVFGEK-------------QISVGSSSL-----------------------ACA----TSVFLFER--VRNGRAG--
        ILSIP+ +    D  +W   ++G V+  K             Q    + S+                       AC+    T + L  R  + N      
Subjt:  ILSIPVRAGAE-DRIVWHYEQSGPVFGEK-------------QISVGSSSL-----------------------ACA----TSVFLFER--VRNGRAG--

Query:  ----ESSLHVFWHCKFVKTVLMESE-----------------FGSLIHNGPRTSGGTSGVGGVLYL---VVPAGASACGVGL------------------
            E ++H  W C  VK +  + E                  G L  + P  +   + +   ++     V AG+ +    +                  
Subjt:  ----ESSLHVFWHCKFVKTVLMESE-----------------FGSLIHNGPRTSGGTSGVGGVLYL---VVPAGASACGVGL------------------

Query:  --QSVRAREDVRWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSTSLVQRHVRSPEMAEGWARSRGETGSGD-GLCPLVLETDSSRVASFFQ
            +   E +RWSPP   W K N D +  +E   AGLGVV+RD  G+V+ + S       S +  E  A  R  + + + GL  ++ E D+  +     
Subjt:  --QSVRAREDVRWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSTSLVQRHVRSPEMAEGWARSRGETGSGD-GLCPLVLETDSSRVASFFQ

Query:  DGLGMTSRCGWPSGELRRD----MPGPSFFSCRFTRREGNEVAHQLAFLAGRDEESRVWFESVPSCVETLILSD
          L     C  P G +  D    +   S F+    +R GN VA +LA LA      R WF  + S    L+L D
Subjt:  DGLGMTSRCGWPSGELRRD----MPGPSFFSCRFTRREGNEVAHQLAFLAGRDEESRVWFESVPSCVETLILSD

XP_030932272.1 uncharacterized protein LOC115958047 [Quercus lobata]2.2e-9931.21Show/hide
Query:  IVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGIL--------------------------------NENKGKTTWASLKLDMSKAYDRVEWVF
        I LIPK + P+RV+++RPISLCNV YKL SK + NR+K +L                                   KG+    +LKLDMSKAYDRVEW  
Subjt:  IVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGIL--------------------------------NENKGKTTWASLKLDMSKAYDRVEWVF

Query:  LEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSML----------------RGRGV------------
        LE++MLKMGFA +WV LI  CI SV ++  +NG+  G ++PSRGLRQGDPLSPYLFLLCAEGLS+ML                RG  +            
Subjt:  LEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSML----------------RGRGV------------

Query:  ------------------QENSQNRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGG
                          + +S   L+  K +   S   R  V+ ++  +   QV   H  YLGLPS + R +++S + +K +V  ++ GWK KL S  G
Subjt:  ------------------QENSQNRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGG

Query:  REVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIVQNPLPSWA-CSEGQ
        +EVL+K V QA+P Y+M+CF+LP  L  D+   + +FWWG  K ++ + WVSW+ LC+PK  GGMGFRD++ FN ALLAKQ WR++ N    ++   + +
Subjt:  REVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIVQNPLPSWA-CSEGQ

Query:  IFSFRRFLEGGCG--------------------------------IETVLYMEEVRSPVTLAP------ETRVADLMTTS-GQWNEGLIRQHFRPYEVSL
         F    F E   G                                I T  ++  +  P  L+P        +V DL+    G+WN G+IR  F P E  L
Subjt:  IFSFRRFLEGGCG--------------------------------IETVLYMEEVRSPVTLAP------ETRVADLMTTS-GQWNEGLIRQHFRPYEVSL

Query:  ILSIPVRAGAE-DRIVWHYEQSG-----PVFGEKQISVGSSSLACATS---VFLFERVRN---------------------------------------G
        +LSIP+    + DR+VW   +SG       +   +    S+   C+ +     L++R+ N                                       G
Subjt:  ILSIPVRAGAE-DRIVWHYEQSG-----PVFGEKQISVGSSSLACATS---VFLFERVRN---------------------------------------G

Query:  RAGESSLHVFWHCKFVKTVLMESEFG----------------SLIHNGPRTS----------------------GGTSGVGGVL-----YLVVPAGASAC
        +A ESS H+FW C   +     S+                   L+ + P +S                      GG S  G +L      +VV    +  
Subjt:  RAGESSLHVFWHCKFVKTVLMESEFG----------------SLIHNGPRTS----------------------GGTSGVGGVL-----YLVVPAGASAC

Query:  GVGLQSVRAREDVRWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSTSLVQRHVRSPEMAEGWARSRGETGSGD-GLCPLVLETDSSRVASF
         V L S      VRW PPE   YK+NVDA+  ++    G G+V+RDS G V+ + S           AE  A       +G+ G   +  ETDSS + + 
Subjt:  GVGLQSVRAREDVRWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSTSLVQRHVRSPEMAEGWARSRGETGSGD-GLCPLVLETDSSRVASF

Query:  FQDGLGMTSRCGWPSGELRRDMPGPSFFSCRFTRREGNEVAHQLAFLAGRDEESRVWFESVPSCVETLILSDMALCDV
                +     +G +   +    F S    +REGN  AH LA  A    +  VW E  P+ +E     D+A C+V
Subjt:  FQDGLGMTSRCGWPSGELRRDMPGPSFFSCRFTRREGNEVAHQLAFLAGRDEESRVWFESVPSCVETLILSDMALCDV

XP_030939568.1 uncharacterized protein LOC115964386 [Quercus lobata]5.1e-9630.42Show/hide
Query:  IVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGIL--------------------------------NENKGKTTWASLKLDMSKAYDRVEWVF
        I LIPK ++P +  ++RPISLCNV YK++SK + NR+K +L                                N+ KGK+ + +LKLDMSKAYDRVEW+F
Subjt:  IVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGIL--------------------------------NENKGKTTWASLKLDMSKAYDRVEWVF

Query:  LEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLRG-------RGV---------------------
        LEK+M K+GF   W+ LIS C+ +V FS  +NG   G   P+RGLRQGDPLSPYLFLLCAEGL S+++        RGV                     
Subjt:  LEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLRG-------RGV---------------------

Query:  ------------------QENSQNRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGG
                          +  S  +++R K     S      +++ +   + V  T     YLGLP+F+ RG+  S S+I++R+W +IQGWK KL S GG
Subjt:  ------------------QENSQNRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGG

Query:  REVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIVQNP-----------
        REVL+K V+QA+P Y+M CF LPK L  DI   + +FWWG +   R IHW+ W  LC PKC GG+GF+D+E FN ALL KQ WR++ N            
Subjt:  REVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIVQNP-----------

Query:  -LPSWACSEGQI-----FSFRRFLEG------------GCGIETVL----YMEEVRSPVTLAPE------TRVADLMTTSGQ-WNEGLIRQHFRPYEVSL
          P+ +  + ++     ++++  ++             G G + ++    ++    S   ++P+      TRV  L+  +   W E  +R  F P+E + 
Subjt:  -LPSWACSEGQI-----FSFRRFLEG------------GCGIETVL----YMEEVRSPVTLAPE------TRVADLMTTSGQ-WNEGLIRQHFRPYEVSL

Query:  ILSIPV-RAGAEDRIVWHYEQSG----------------------------PVFGEKQISVGSSS-------LAC----ATSVFLFERV--------RNG
        ILS+P+   G ED+++W   ++G                              F +K  S+   S        AC     T + L  R+        R G
Subjt:  ILSIPV-RAGAEDRIVWHYEQSG----------------------------PVFGEKQISVGSSS-------LAC----ATSVFLFERV--------RNG

Query:  RAGESSLHVFWHCKFVKTVLMESE------------FGSLIHNGP-RTSGGTSGVGGVLYLVVPAGASACGVG----------------LQSVRARED--
        R  E ++H  W C+ +K +  E E            F  L      R S   +     +   +    +A  VG                LQ  +A  D  
Subjt:  RAGESSLHVFWHCKFVKTVLMESE------------FGSLIHNGP-RTSGGTSGVGGVLYLVVPAGASACGVG----------------LQSVRARED--

Query:  ---------VRWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSTSLVQRHVRSPEMAEGWARSRGETGSGD-GLCPLVLETDSSRVASFFQD
                   W PP    +K N D +  ++  +AGLGVV+R+  G VM + +       S E  E  A  R  T +G+ G+  +V E D   V      
Subjt:  ---------VRWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSTSLVQRHVRSPEMAEGWARSRGETGSGD-GLCPLVLETDSSRVASFFQD

Query:  GLGMTSRCGWPSGELRRDMPGPSFFSCRFTRREGNEVAHQLAFLAGRDEESRVWFESV
             +  G  +GE+R         S    +R GN VA +LA LA    + ++W E +
Subjt:  GLGMTSRCGWPSGELRRDMPGPSFFSCRFTRREGNEVAHQLAFLAGRDEESRVWFESV

XP_030946812.1 uncharacterized protein LOC115971195 [Quercus lobata]1.2e-9731.8Show/hide
Query:  IVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGIL--------------------------------NENKGKTTWASLKLDMSKAYDRVEWVF
        I LIPK ++P + +++RPISLCNV YK+VSK + NR+K +L                                 + KGKT + ++KLDMSKAYDRVEW F
Subjt:  IVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGIL--------------------------------NENKGKTTWASLKLDMSKAYDRVEWVF

Query:  LEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLRG-------RGV---------------------
        LEKVM K+GF   W+ L+S CI SV FS  VNG   G   P+RGLRQGDPLSPYLFLLCAEGL S+++        +GV                     
Subjt:  LEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLRG-------RGV---------------------

Query:  ------------------QENSQNRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGG
                          +E S  +++R K     S      V++++  +L V  T  + +YLGLPSF+ RG+  S ++I++RVWQ++QGWK +L S GG
Subjt:  ------------------QENSQNRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGG

Query:  REVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIVQN------------
        REVL+K V+QA+P ++M CF+LPK L  DI   + +FWWG +   R IHWV WK LCK K  GG+GF+D+E+FN A+L KQ WR++ N            
Subjt:  REVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIVQN------------

Query:  -----PLPSWACSEGQIFSFRRFLEG------------GCGIET---------VLYMEEVRSPVTLAP-ETRVADLMTTSGQ-WNEGLIRQHFRPYEVSL
              +      E   ++++  L+             G G             L+   V SP    P  TRV  L+    + W E  IR+ F P+E   
Subjt:  -----PLPSWACSEGQIFSFRRFLEG------------GCGIET---------VLYMEEVRSPVTLAP-ETRVADLMTTSGQ-WNEGLIRQHFRPYEVSL

Query:  ILSIPVR-AGAEDRIVWHYEQSG---------PVFGEKQISVGSSSLACATSVFLFE--------RVRN------------------------------G
        ILS+P+   G EDR++W    +G          +    + +  SSS + A   F  E        ++R+                              G
Subjt:  ILSIPVR-AGAEDRIVWHYEQSG---------PVFGEKQISVGSSSLACATSVFLFE--------RVRN------------------------------G

Query:  RAGESSLHVFWHCKFVKTVLMESE-----------------FGSLIHNGPRTSGGTSGVGGVLYLVVPA---GASACGV------GLQSVRAREDVR---
           E  +H  W C+ +K V  E E                  G L    P  +     +G  ++    A   G+ +  +       ++ +R    VR   
Subjt:  RAGESSLHVFWHCKFVKTVLMESE-----------------FGSLIHNGPRTSGGTSGVGGVLYLVVPA---GASACGV------GLQSVRAREDVR---

Query:  -----------WSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSTSLVQRHVRSPEMAEGWARSRGETGSGD-GLCPLVLETDSSRVASFFQD
                   W P     YKVN D +   +   AGLGVVVRDS G V+ + S       +    E  A  R    +G+ GL  +V E DS  +      
Subjt:  -----------WSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSTSLVQRHVRSPEMAEGWARSRGETGSGD-GLCPLVLETDSSRVASFFQD

Query:  GLGMTSRCGWPSGELRRDMPGPSFFSCRF--TRREGNEVAHQLAFLAGRDEESRVWFESVPSCVETLILSD
             S  G    E R      SF S  F  T+R+GN VA +LA LA    E +VW E +   V  L+ +D
Subjt:  GLGMTSRCGWPSGELRRDMPGPSFFSCRF--TRREGNEVAHQLAFLAGRDEESRVWFESVPSCVETLILSD

TrEMBL top hitse value%identityAlignment
A0A2N9EVW3 Uncharacterized protein1.4e-9933.09Show/hide
Query:  IVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGIL--------------------------------NENKGKTTWASLKLDMSKAYDRVEWVF
        + LIPK +NP  V+EYRPISLCNV YKL+SKVL NR+K IL                                N+  GK    +LKLDMSKAYDRVEW F
Subjt:  IVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGIL--------------------------------NENKGKTTWASLKLDMSKAYDRVEWVF

Query:  LEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGL-------------------------------------
        +EKVM +MGF  +W+ LI  CISSV +S  +NG   G ++P+RGLRQGDP+SPYLFLLCAEGL                                     
Subjt:  LEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGL-------------------------------------

Query:  --SSMLRGRGVQE-------NSQNRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGG
          +S+L  + +QE        S  +L+R K T   S      +++ + +IL V     + +YLGLPS + + + +  S IK+RVW +++GWK KL S  G
Subjt:  --SSMLRGRGVQE-------NSQNRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGG

Query:  REVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIVQNPLPSWACSEGQI
        REVL+K V+QAIP Y+MNCF+LP  L  +I   + RFWWG     R IHW+ W+ +C  K  GG+GFRD++ FN ALLAKQ WR + N          ++
Subjt:  REVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIVQNPLPSWACSEGQI

Query:  FSFRRFLEGGCGIETVL-----YMEE----VRSPVTLAP-ETRVADLMTTSGQ-WNEGLIRQHFRPYEVSLILSIPVRA-GAEDRIVWHYEQSGP-----
        FS + F   G G +  +      +EE    + SP+T  P + RV +L+  S   WN   I+  F PY+V  IL IP+     EDR+ W   Q+G      
Subjt:  FSFRRFLEGGCGIETVL-----YMEE----VRSPVTLAP-ETRVADLMTTSGQ-WNEGLIRQHFRPYEVSLILSIPVRA-GAEDRIVWHYEQSGP-----

Query:  ----VFGEKQISVGSSSL------------------------------ACATSVFLFERVRNG--------RAGESSLHVFWHCKFVKTV-LMESEFGSL
            +  E +  +  SS                               +  T   LF R               E SLH  W+C  V  V  +  EF  L
Subjt:  ----VFGEKQISVGSSSL------------------------------ACATSVFLFERVRNG--------RAGESSLHVFWHCKFVKTV-LMESEFGSL

Query:  IHNGPRTSGG-----TSGVGGVLY------------------LVVPAG-------------ASACGVGLQSVRAR---EDVRWSPPEAGWYKVNVDASFR
            P +              +L+                  L +P+              +    V  ++   +     VRW PP + ++KVN D +  
Subjt:  IHNGPRTSGG-----TSGVGGVLY------------------LVVPAG-------------ASACGVGLQSVRAR---EDVRWSPPEAGWYKVNVDASFR

Query:  RERWQAGLGVVVRDSSGRVMLSTSLVQRHVRSPEMAEGWARSRGETGSGD-GLCPLVLETDSSRVASFFQDGLGMTSRCGWPSGELRRD----MPGPSFF
        RE    GLGVV+RD++G V+ + S       + EM E  A  R    + + G+  + LE D+  V       LG       P G +  D    +     F
Subjt:  RERWQAGLGVVVRDSSGRVMLSTSLVQRHVRSPEMAEGWARSRGETGSGD-GLCPLVLETDSSRVASFFQDGLGMTSRCGWPSGELRRD----MPGPSFF

Query:  SCRFTRREGNEVAHQLAFLAGRDEESRVWFESVP
        S   TRR GN VAH LA  A +     VW E VP
Subjt:  SCRFTRREGNEVAHQLAFLAGRDEESRVWFESVP

A0A2N9GB96 Uncharacterized protein3.1e-9931.47Show/hide
Query:  IVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGIL--------------------------------NENKGKTTWASLKLDMSKAYDRVEWVF
        + LIPK +NP  V+EYRPISLCNV YKL+SKVL NR+K +L                                N+ +G+    +LKLDMSKAYDRVEW F
Subjt:  IVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGIL--------------------------------NENKGKTTWASLKLDMSKAYDRVEWVF

Query:  LEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLRGRGVQ---------------------------
        L +VMLKMGF  +WV L+  CI++V +S  +NG   G + PSRGLRQGDP+SPYLFLLCAEGL+ +L     Q                           
Subjt:  LEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLRGRGVQ---------------------------

Query:  -------------------ENSQNRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGG
                           + S  +L+R K T   S       +D +  IL V     + +YLGLPS + + + +  S IKDRVW +++GWK KL S  G
Subjt:  -------------------ENSQNRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGG

Query:  REVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIV--QNPL--------
        RE+L+K V+QAIP Y+MNCF+LP KL  DI   M RFWWG +  +R +HW+SW  LC+PK  GG+GFR+++ FN ALLAKQ WR +  +N L        
Subjt:  REVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIV--QNPL--------

Query:  --PSWACSEGQI-----FSFRRFLEG------------GCG----IETVLYM-----EEVRSPVTLAPE-TRVADLMTTSG-QWNEGLIRQHFRPYEVSL
          P+    E  +     F++R  ++             G G    I+   ++       V SP+ + P  ++VA LM  S  +W+   IR  F PY+   
Subjt:  --PSWACSEGQI-----FSFRRFLEG------------GCG----IETVLYM-----EEVRSPVTLAPE-TRVADLMTTSG-QWNEGLIRQHFRPYEVSL

Query:  ILSIPVRAGA-EDRIVWHYEQSGP---------VFGEKQISVGSSS-------------LACATS---VFLFERVRN----------------------G
        IL IP+ + +  D+++WH  + G          +  E Q +   SS               CA +    FL+                           G
Subjt:  ILSIPVRAGA-EDRIVWHYEQSGP---------VFGEKQISVGSSS-------------LACATS---VFLFERVRN----------------------G

Query:  RAGESSLHVFWHCKFVK---------TVLMESEFGSLIHNGPRTSGGTSGVGGVLYLVVPAGASACGV-------------------------GLQSVRA
           E  LH  W C  ++           + + EFGS  H+  R  G  +       L++   A+ C +                          L    A
Subjt:  RAGESSLHVFWHCKFVK---------TVLMESEFGSLIHNGPRTSGGTSGVGGVLYLVVPAGASACGV-------------------------GLQSVRA

Query:  R------------EDVRWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSTSLVQRHVRSPEMAEGWARSRG-ETGSGDGLCPLVLETDSSRV
        R              V W PP +  YKVN D +  RE  + G+GVV+RD +G V+ + S   +   S EM E  A  R  +     G+   + E DS  +
Subjt:  R------------EDVRWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSTSLVQRHVRSPEMAEGWARSRG-ETGSGDGLCPLVLETDSSRV

Query:  ASFFQDGLGMTSRCGWPSGELRRDMPGPSFFSCRFTRREGNEVAHQLAFLAGRDEESRVWFESVPSCVETLILSDMA
                 M +  G    + +  +     +    TRR GN VAH LA  A   +   VW E VP  +  ++ SD +
Subjt:  ASFFQDGLGMTSRCGWPSGELRRDMPGPSFFSCRFTRREGNEVAHQLAFLAGRDEESRVWFESVPSCVETLILSDMA

A0A2N9GLG8 Reverse transcriptase domain-containing protein3.1e-9931.47Show/hide
Query:  IVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGIL--------------------------------NENKGKTTWASLKLDMSKAYDRVEWVF
        + LIPK +NP  V+EYRPISLCNV YKL+SKVL NR+K +L                                N+ +G+    +LKLDMSKAYDRVEW F
Subjt:  IVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGIL--------------------------------NENKGKTTWASLKLDMSKAYDRVEWVF

Query:  LEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLRGRGVQ---------------------------
        L +VMLKMGF  +WV L+  CI++V +S  +NG   G + PSRGLRQGDP+SPYLFLLCAEGL+ +L     Q                           
Subjt:  LEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLRGRGVQ---------------------------

Query:  -------------------ENSQNRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGG
                           + S  +L+R K T   S       +D +  IL V     + +YLGLPS + + + +  S IKDRVW +++GWK KL S  G
Subjt:  -------------------ENSQNRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGG

Query:  REVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIV--QNPL--------
        RE+L+K V+QAIP Y+MNCF+LP KL  DI   M RFWWG +  +R +HW+SW  LC+PK  GG+GFR+++ FN ALLAKQ WR +  +N L        
Subjt:  REVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIV--QNPL--------

Query:  --PSWACSEGQI-----FSFRRFLEG------------GCG----IETVLYM-----EEVRSPVTLAPE-TRVADLMTTSG-QWNEGLIRQHFRPYEVSL
          P+    E  +     F++R  ++             G G    I+   ++       V SP+ + P  ++VA LM  S  +W+   IR  F PY+   
Subjt:  --PSWACSEGQI-----FSFRRFLEG------------GCG----IETVLYM-----EEVRSPVTLAPE-TRVADLMTTSG-QWNEGLIRQHFRPYEVSL

Query:  ILSIPVRAGA-EDRIVWHYEQSGP---------VFGEKQISVGSSS-------------LACATS---VFLFERVRN----------------------G
        IL IP+ + +  D+++WH  + G          +  E Q +   SS               CA +    FL+                           G
Subjt:  ILSIPVRAGA-EDRIVWHYEQSGP---------VFGEKQISVGSSS-------------LACATS---VFLFERVRN----------------------G

Query:  RAGESSLHVFWHCKFVK---------TVLMESEFGSLIHNGPRTSGGTSGVGGVLYLVVPAGASACGV-------------------------GLQSVRA
           E  LH  W C  ++           + + EFGS  H+  R  G  +       L++   A+ C +                          L    A
Subjt:  RAGESSLHVFWHCKFVK---------TVLMESEFGSLIHNGPRTSGGTSGVGGVLYLVVPAGASACGV-------------------------GLQSVRA

Query:  R------------EDVRWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSTSLVQRHVRSPEMAEGWARSRG-ETGSGDGLCPLVLETDSSRV
        R              V W PP +  YKVN D +  RE  + G+GVV+RD +G V+ + S   +   S EM E  A  R  +     G+   + E DS  +
Subjt:  R------------EDVRWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSTSLVQRHVRSPEMAEGWARSRG-ETGSGDGLCPLVLETDSSRV

Query:  ASFFQDGLGMTSRCGWPSGELRRDMPGPSFFSCRFTRREGNEVAHQLAFLAGRDEESRVWFESVPSCVETLILSDMA
                 M +  G    + +  +     +    TRR GN VAH LA  A   +   VW E VP  +  ++ SD +
Subjt:  ASFFQDGLGMTSRCGWPSGELRRDMPGPSFFSCRFTRREGNEVAHQLAFLAGRDEESRVWFESVPSCVETLILSDMA

A0A2N9GYM1 Reverse transcriptase domain-containing protein3.1e-9931.47Show/hide
Query:  IVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGIL--------------------------------NENKGKTTWASLKLDMSKAYDRVEWVF
        + LIPK +NP  V+EYRPISLCNV YKL+SKVL NR+K +L                                N+ +G+    +LKLDMSKAYDRVEW F
Subjt:  IVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGIL--------------------------------NENKGKTTWASLKLDMSKAYDRVEWVF

Query:  LEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLRGRGVQ---------------------------
        L +VMLKMGF  +WV L+  CI++V +S  +NG   G + PSRGLRQGDP+SPYLFLLCAEGL+ +L     Q                           
Subjt:  LEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLRGRGVQ---------------------------

Query:  -------------------ENSQNRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGG
                           + S  +L+R K T   S       +D +  IL V     + +YLGLPS + + + +  S IKDRVW +++GWK KL S  G
Subjt:  -------------------ENSQNRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGG

Query:  REVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIV--QNPL--------
        RE+L+K V+QAIP Y+MNCF+LP KL  DI   M RFWWG +  +R +HW+SW  LC+PK  GG+GFR+++ FN ALLAKQ WR +  +N L        
Subjt:  REVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIV--QNPL--------

Query:  --PSWACSEGQI-----FSFRRFLEG------------GCG----IETVLYM-----EEVRSPVTLAPE-TRVADLMTTSG-QWNEGLIRQHFRPYEVSL
          P+    E  +     F++R  ++             G G    I+   ++       V SP+ + P  ++VA LM  S  +W+   IR  F PY+   
Subjt:  --PSWACSEGQI-----FSFRRFLEG------------GCG----IETVLYM-----EEVRSPVTLAPE-TRVADLMTTSG-QWNEGLIRQHFRPYEVSL

Query:  ILSIPVRAGA-EDRIVWHYEQSGP---------VFGEKQISVGSSS-------------LACATS---VFLFERVRN----------------------G
        IL IP+ + +  D+++WH  + G          +  E Q +   SS               CA +    FL+                           G
Subjt:  ILSIPVRAGA-EDRIVWHYEQSGP---------VFGEKQISVGSSS-------------LACATS---VFLFERVRN----------------------G

Query:  RAGESSLHVFWHCKFVK---------TVLMESEFGSLIHNGPRTSGGTSGVGGVLYLVVPAGASACGV-------------------------GLQSVRA
           E  LH  W C  ++           + + EFGS  H+  R  G  +       L++   A+ C +                          L    A
Subjt:  RAGESSLHVFWHCKFVK---------TVLMESEFGSLIHNGPRTSGGTSGVGGVLYLVVPAGASACGV-------------------------GLQSVRA

Query:  R------------EDVRWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSTSLVQRHVRSPEMAEGWARSRG-ETGSGDGLCPLVLETDSSRV
        R              V W PP +  YKVN D +  RE  + G+GVV+RD +G V+ + S   +   S EM E  A  R  +     G+   + E DS  +
Subjt:  R------------EDVRWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSTSLVQRHVRSPEMAEGWARSRG-ETGSGDGLCPLVLETDSSRV

Query:  ASFFQDGLGMTSRCGWPSGELRRDMPGPSFFSCRFTRREGNEVAHQLAFLAGRDEESRVWFESVPSCVETLILSDMA
                 M +  G    + +  +     +    TRR GN VAH LA  A   +   VW E VP  +  ++ SD +
Subjt:  ASFFQDGLGMTSRCGWPSGELRRDMPGPSFFSCRFTRREGNEVAHQLAFLAGRDEESRVWFESVPSCVETLILSDMA

A0A2N9I9F4 Reverse transcriptase domain-containing protein8.0e-10331.7Show/hide
Query:  IVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGIL--------------------------------NENKGKTTWASLKLDMSKAYDRVEWVF
        + LIPK +NP RV+EYRPISLCNV YKL+SKVL NR+K +L                                N+  GK    +LKLDMSKAYDRVEW F
Subjt:  IVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGIL--------------------------------NENKGKTTWASLKLDMSKAYDRVEWVF

Query:  LEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSML----------------RG---------------
        L++VM +MGF   W  +I  CIS+V +S  +NG   G + P+RGLRQGDP+SPYLFLLCAEGL+ +L                RG               
Subjt:  LEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSML----------------RG---------------

Query:  -RGVQENSQ--------------NRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGG
         R  +E  Q               +L+R K T   S      +++++  IL V     + +YLGLPS + + + +  S IK+RVW +I+GWK KL S  G
Subjt:  -RGVQENSQ--------------NRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGG

Query:  REVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIVQ-NPLPSWACSEGQ
        RE+L+K VVQAIP Y+MNCF+LP  L  +I   + RFWWG +  +R IHW+ W+ LC+PK  GG+GFR+++ FN ALLAKQ WR++Q      +     +
Subjt:  REVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIVQ-NPLPSWACSEGQ

Query:  IFSFRRFLEGGCGIETVLYMEEV----RSPV--------TLAPETRVADLMTTSGQ-WNEGLIRQHFRPYEVSLILSIPVRAGA-EDRIVWHYEQSGP--
         F  R++  G    + V+ ME+     RS +         L P+ +V +L+  S   WN  LI   F PY+   IL IP+ A A  D++VWH  + G   
Subjt:  IFSFRRFLEGGCGIETVLYMEEV----RSPV--------TLAPETRVADLMTTSGQ-WNEGLIRQHFRPYEVSLILSIPVRAGA-EDRIVWHYEQSGP--

Query:  -------VFGEKQIS----------------VGSSSLACATSVFLFERVRNG---RAG-------------------ESSLHVFWHCKFVK---------
               +  + ++S                + ++ +      FL+   +     ++G                   E  LH  W C  +          
Subjt:  -------VFGEKQIS----------------VGSSSLACATSVFLFERVRNG---RAG-------------------ESSLHVFWHCKFVK---------

Query:  TVLMESEFGSLIHNGPRTSGGTSGVGGVLYLVVPAGASACGVGLQSVRARE--DVRWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSTSLV
        +   ++ F S      +       +     ++ P+  +     +++ R ++    +W PP    +K N D +F +     G+GVV+RD  G V+ + S  
Subjt:  TVLMESEFGSLIHNGPRTSGGTSGVGGVLYLVVPAGASACGVGLQSVRARE--DVRWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSTSLV

Query:  QRHVRSPEMAEGWARSRGETGSGD-GLCPLVLETDSSRVASFFQDGLGMTSRCGWPSGELRRDMPGPSFFSCRFTRREGNEVAHQLAFLAGRDEESRVWF
             S EM E  A  R    + + GL  +  E D+  +         + +  G    +++  + G   +S   TRR GN VAH LA  A       VW 
Subjt:  QRHVRSPEMAEGWARSRGETGSGD-GLCPLVLETDSSRVASFFQDGLGMTSRCGWPSGELRRDMPGPSFFSCRFTRREGNEVAHQLAFLAGRDEESRVWF

Query:  ESVPSCVETLILSD
        E VP  +  ++L+D
Subjt:  ESVPSCVETLILSD

SwissProt top hitse value%identityAlignment
P08548 LINE-1 reverse transcriptase homolog3.4e-1029.94Show/hide
Query:  IVLIPKK-RNPRRVSEYRPISLCNVTYKLVSKVLVNR----MKGILNENK-----GKTTWAS---------------------LKLDMSKAYDRVEWVFL
        I LIPK  ++P R   YRPISL N+  K+++K+L NR    +K I++ ++     G   W +                     L +D  KA+D ++  F+
Subjt:  IVLIPKK-RNPRRVSEYRPISLCNVTYKLVSKVLVNR----MKGILNENK-----GKTTWAS---------------------LKLDMSKAYDRVEWVFL

Query:  EKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLR
         + + K+G    ++ LI    S    +  +NGV+        G RQG PLSP LF +  E L+  +R
Subjt:  EKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLR

P0C2F6 Putative ribonuclease H protein At1g657508.4e-1731.01Show/hide
Query:  LPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGGREVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGG
        +P    R    +   I +RV  ++ GW+ K  S  GR  L K V+ ++P +SM+   LP+ ++  +++    F WG     +  H V W  +C PK  GG
Subjt:  LPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGGREVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGG

Query:  MGFRDMEIFNQALLAKQCWRIVQNPLPSW
        +G R  +  N+AL++K  WR++Q     W
Subjt:  MGFRDMEIFNQALLAKQCWRIVQNPLPSW

P11369 LINE-1 retrotransposable element ORF2 protein2.9e-1722.05Show/hide
Query:  IVLIPK-KRNPRRVSEYRPISLCNVTYKLVSKVLVNR----MKGILNENK-----GKTTWASLK---------------------LDMSKAYDRVEWVFL
        I LIPK +++P ++  +RPISL N+  K+++K+L NR    +K I++ ++     G   W +++                     LD  KA+D+++  F+
Subjt:  IVLIPK-KRNPRRVSEYRPISLCNVTYKLVSKVLVNR----MKGILNENK-----GKTTWASLK---------------------LDMSKAYDRVEWVFL

Query:  EKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLRG----RGVQENSQN------------RLDRPKP
         KV+ + G    ++++I    S    +  VNG +   +    G RQG PLSPYLF +  E L+  +R     +G+Q   +              +  PK 
Subjt:  EKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLRG----RGVQENSQN------------RLDRPKP

Query:  T--EPMSSVYRWG----------------------VKDQVGQILQVQVTAWHHQYLG--LPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGGREVLL
        +  E ++ +  +G                       + ++ +     +   + +YLG  L   +      +   +K  + + ++ WK    S  GR  ++
Subjt:  T--EPMSSVYRWG----------------------VKDQVGQILQVQVTAWHHQYLG--LPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFSVGGREVLL

Query:  KFVVQAIPCYSMNC--FRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPK-CYGGMGFRDMEIFNQALLAKQCW
        K  +     Y  N    ++P +   ++  A+ +F W  +K          KSL K K   GG+   D++++ +A++ K  W
Subjt:  KFVVQAIPCYSMNC--FRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPK-CYGGMGFRDMEIFNQALLAKQCW

P14381 Transposon TX1 uncharacterized 149 kDa protein4.9e-0927.98Show/hide
Query:  MIVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGILNE---------NKGKTTW--------------------ASLKLDMSKAYDRVEWVFLE
        ++ L+PKK + R +  +RP+SL +  YK+V+K +  R+K +L E           G+T +                    A L LD  KA+DRV+  +L 
Subjt:  MIVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGILNE---------NKGKTTW--------------------ASLKLDMSKAYDRVEWVFLE

Query:  KVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLRGR
          +    F  ++V  +    +S      +N      +   RG+RQG PLS  L+ L  E    +LR R
Subjt:  KVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLRGR

P93295 Uncharacterized mitochondrial protein AtMg003101.3e-1752.5Show/hide
Query:  AIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPK-CYGGMGFRDMEIFNQALLAKQCWRIVQNP
        A+P Y+M+CFRL K L   +  AM  FWW   +  R I WV+W+ LCK K   GG+GFRD+  FNQALLAKQ +RI+  P
Subjt:  AIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPK-CYGGMGFRDMEIFNQALLAKQCWRIVQNP

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases5.3e-0630.33Show/hide
Query:  VLVNRMKGILNENKGKTTWASLKLDMSKAYDRVEWVFLEKVMLKMGFALEWVDLISLCISSVRFSFNVNGV--RCGGVVPSR---------GLRQGDPLS
        V V      +   KG   W  LKLD+ KAYDR+ W +LE  ++  GF   W+  I+      R +F    V    G    S+         G R  D  +
Subjt:  VLVNRMKGILNENKGKTTWASLKLDMSKAYDRVEWVFLEKVMLKMGFALEWVDLISLCISSVRFSFNVNGV--RCGGVVPSR---------GLRQGDPLS

Query:  PYL--FLLCAEGLSSMLRGRGV
        P+    + CAE L  + RG G+
Subjt:  PYL--FLLCAEGLSSMLRGRGV

AT4G29090.1 Ribonuclease H-like superfamily protein1.3e-1745.57Show/hide
Query:  AIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIVQNP
        A+P Y+M CF LPK +   I   +A FWW  ++  +G+HW +W  L   K  GG+GF+D+E FN ALL KQ WR++  P
Subjt:  AIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIVQNP

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein9.2e-1952.5Show/hide
Query:  AIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPK-CYGGMGFRDMEIFNQALLAKQCWRIVQNP
        A+P Y+M+CFRL K L   +  AM  FWW   +  R I WV+W+ LCK K   GG+GFRD+  FNQALLAKQ +RI+  P
Subjt:  AIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPK-CYGGMGFRDMEIFNQALLAKQCWRIVQNP

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.1e-0867.5Show/hide
Query:  FNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLR
        F +NG   G V PSRGLRQGDPLSPYLF+LC E LS + R
Subjt:  FNVNGVRCGGVVPSRGLRQGDPLSPYLFLLCAEGLSSMLR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGTGTTGATACCGAAGAAGAGGAATCCCAGACGCGTTTCTGAATACAGACCCATCTCACTCTGTAATGTGACGTATAAGTTAGTGTCGAAGGTGTTGGTAAACCG
TATGAAAGGGATTCTAAATGAGAACAAGGGCAAAACGACGTGGGCCTCACTAAAGCTTGATATGAGCAAGGCATATGACAGAGTCGAATGGGTTTTCCTGGAGAAGGTTA
TGCTGAAAATGGGTTTTGCTCTAGAATGGGTGGATTTGATTTCTCTCTGTATCTCTTCTGTCCGTTTCTCCTTTAATGTGAATGGGGTCAGGTGTGGAGGTGTGGTTCCG
AGTAGAGGTCTGCGACAGGGTGATCCATTATCCCCATATCTTTTTCTCTTATGTGCTGAGGGGTTGTCAAGTATGCTTCGGGGTAGGGGTGTTCAAGAAAACTCTCAAAA
CCGACTCGACCGACCGAAACCGACCGAACCGATGTCGTCCGTGTACAGATGGGGGGTTAAGGACCAGGTGGGGCAGATTTTGCAGGTACAGGTTACGGCGTGGCACCACC
AATATCTGGGCCTCCCTTCTTTTATGACACGTGGTAGAACGAGCTCATTGAGTTTCATTAAGGACCGAGTTTGGCAGCAGATTCAGGGGTGGAAGGGAAAGTTATTCTCA
GTTGGGGGTAGGGAGGTTCTCCTAAAGTTTGTGGTGCAGGCGATCCCGTGTTATTCGATGAATTGCTTCCGGTTACCAAAGAAGTTGATTCTCGATATTAATAGAGCCAT
GGCCCGATTTTGGTGGGGTGGGGAGAAGGTGGATCGAGGAATTCATTGGGTGAGTTGGAAATCCCTATGTAAGCCTAAGTGCTATGGTGGGATGGGTTTCAGGGACATGG
AAATTTTTAACCAAGCGTTGTTGGCTAAACAGTGTTGGAGGATTGTCCAGAATCCTCTTCCCTCCTGGGCGTGTTCTGAAGGGCAGATATTTTCCTTTCGGCGATTTCTT
GAGGGCGGATGTGGGATCGAGACCGTCCTTTATATGGAAGAGGTGCGATCACCAGTGACATTGGCGCCGGAGACTCGGGTTGCGGATCTGATGACGACATCGGGGCAGTG
GAACGAAGGGCTCATTCGACAGCACTTTAGGCCTTATGAGGTCAGTCTGATTTTGTCAATTCCGGTACGGGCTGGGGCGGAGGATAGGATTGTTTGGCATTATGAGCAGT
CAGGACCTGTTTTCGGTGAAAAGCAGATATCGGTTGGGTCAAGCAGCCTGGCTTGCGCAACTTCCGTCTTCCTCTTCGAACGAGTCAGGAATGGGCGAGCTGGGGAGTCT
AGTTTGCATGTATTCTGGCATTGCAAGTTCGTCAAGACGGTGCTGATGGAGTCCGAGTTTGGAAGCTTGATACATAATGGGCCTAGAACCAGTGGTGGGACTAGTGGAGT
GGGCGGCGTGTTATATCTCGTCGTTCCAGCGGGCGCCTCGGCTTGTGGAGTTGGGTTGCAGAGTGTCCGTGCACGGGAAGATGTTAGATGGAGCCCCCCGGAGGCTGGGT
GGTATAAGGTGAACGTAGATGCGTCCTTTAGGAGGGAGCGATGGCAGGCGGGTCTGGGTGTGGTGGTTCGAGATTCCTCGGGTCGGGTTATGTTGTCGACATCCTTGGTG
CAGCGGCATGTGCGAAGCCCGGAGATGGCTGAAGGATGGGCGCGGTCAAGGGGCGAGACTGGCAGTGGAGATGGGCTGTGCCCATTGGTGTTGGAGACTGACTCTAGTCG
GGTGGCTAGTTTTTTCCAAGATGGGCTGGGGATGACTTCTCGATGTGGGTGGCCTAGTGGGGAGTTACGAAGGGACATGCCAGGGCCTTCTTTTTTCAGCTGCAGGTTCA
CTCGAAGAGAGGGGAATGAGGTGGCACACCAGCTAGCTTTCCTGGCAGGGAGGGATGAAGAATCTAGGGTGTGGTTTGAGTCCGTACCCTCGTGTGTTGAGACCTTGATC
CTGTCTGATATGGCTCTGTGTGATGTTTTTTCTTACAGGTTTCTCTGTGGTCTATGTGTACTTTATATGTTTAATGGATCCTTGGATGGTGGCTGTGTAGGGGGAAGATG
GGTGGAGGGTCTTATAAATCAGTTCGGGCTAGTAGGGGCTGCTTTTGGTGAGGCAAGTCTGACTACTGAGATCAGGATCCGTGTTTTGAGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGATTGTGTTGATACCGAAGAAGAGGAATCCCAGACGCGTTTCTGAATACAGACCCATCTCACTCTGTAATGTGACGTATAAGTTAGTGTCGAAGGTGTTGGTAAACCG
TATGAAAGGGATTCTAAATGAGAACAAGGGCAAAACGACGTGGGCCTCACTAAAGCTTGATATGAGCAAGGCATATGACAGAGTCGAATGGGTTTTCCTGGAGAAGGTTA
TGCTGAAAATGGGTTTTGCTCTAGAATGGGTGGATTTGATTTCTCTCTGTATCTCTTCTGTCCGTTTCTCCTTTAATGTGAATGGGGTCAGGTGTGGAGGTGTGGTTCCG
AGTAGAGGTCTGCGACAGGGTGATCCATTATCCCCATATCTTTTTCTCTTATGTGCTGAGGGGTTGTCAAGTATGCTTCGGGGTAGGGGTGTTCAAGAAAACTCTCAAAA
CCGACTCGACCGACCGAAACCGACCGAACCGATGTCGTCCGTGTACAGATGGGGGGTTAAGGACCAGGTGGGGCAGATTTTGCAGGTACAGGTTACGGCGTGGCACCACC
AATATCTGGGCCTCCCTTCTTTTATGACACGTGGTAGAACGAGCTCATTGAGTTTCATTAAGGACCGAGTTTGGCAGCAGATTCAGGGGTGGAAGGGAAAGTTATTCTCA
GTTGGGGGTAGGGAGGTTCTCCTAAAGTTTGTGGTGCAGGCGATCCCGTGTTATTCGATGAATTGCTTCCGGTTACCAAAGAAGTTGATTCTCGATATTAATAGAGCCAT
GGCCCGATTTTGGTGGGGTGGGGAGAAGGTGGATCGAGGAATTCATTGGGTGAGTTGGAAATCCCTATGTAAGCCTAAGTGCTATGGTGGGATGGGTTTCAGGGACATGG
AAATTTTTAACCAAGCGTTGTTGGCTAAACAGTGTTGGAGGATTGTCCAGAATCCTCTTCCCTCCTGGGCGTGTTCTGAAGGGCAGATATTTTCCTTTCGGCGATTTCTT
GAGGGCGGATGTGGGATCGAGACCGTCCTTTATATGGAAGAGGTGCGATCACCAGTGACATTGGCGCCGGAGACTCGGGTTGCGGATCTGATGACGACATCGGGGCAGTG
GAACGAAGGGCTCATTCGACAGCACTTTAGGCCTTATGAGGTCAGTCTGATTTTGTCAATTCCGGTACGGGCTGGGGCGGAGGATAGGATTGTTTGGCATTATGAGCAGT
CAGGACCTGTTTTCGGTGAAAAGCAGATATCGGTTGGGTCAAGCAGCCTGGCTTGCGCAACTTCCGTCTTCCTCTTCGAACGAGTCAGGAATGGGCGAGCTGGGGAGTCT
AGTTTGCATGTATTCTGGCATTGCAAGTTCGTCAAGACGGTGCTGATGGAGTCCGAGTTTGGAAGCTTGATACATAATGGGCCTAGAACCAGTGGTGGGACTAGTGGAGT
GGGCGGCGTGTTATATCTCGTCGTTCCAGCGGGCGCCTCGGCTTGTGGAGTTGGGTTGCAGAGTGTCCGTGCACGGGAAGATGTTAGATGGAGCCCCCCGGAGGCTGGGT
GGTATAAGGTGAACGTAGATGCGTCCTTTAGGAGGGAGCGATGGCAGGCGGGTCTGGGTGTGGTGGTTCGAGATTCCTCGGGTCGGGTTATGTTGTCGACATCCTTGGTG
CAGCGGCATGTGCGAAGCCCGGAGATGGCTGAAGGATGGGCGCGGTCAAGGGGCGAGACTGGCAGTGGAGATGGGCTGTGCCCATTGGTGTTGGAGACTGACTCTAGTCG
GGTGGCTAGTTTTTTCCAAGATGGGCTGGGGATGACTTCTCGATGTGGGTGGCCTAGTGGGGAGTTACGAAGGGACATGCCAGGGCCTTCTTTTTTCAGCTGCAGGTTCA
CTCGAAGAGAGGGGAATGAGGTGGCACACCAGCTAGCTTTCCTGGCAGGGAGGGATGAAGAATCTAGGGTGTGGTTTGAGTCCGTACCCTCGTGTGTTGAGACCTTGATC
CTGTCTGATATGGCTCTGTGTGATGTTTTTTCTTACAGGTTTCTCTGTGGTCTATGTGTACTTTATATGTTTAATGGATCCTTGGATGGTGGCTGTGTAGGGGGAAGATG
GGTGGAGGGTCTTATAAATCAGTTCGGGCTAGTAGGGGCTGCTTTTGGTGAGGCAAGTCTGACTACTGAGATCAGGATCCGTGTTTTGAGCTAG
Protein sequenceShow/hide protein sequence
MIVLIPKKRNPRRVSEYRPISLCNVTYKLVSKVLVNRMKGILNENKGKTTWASLKLDMSKAYDRVEWVFLEKVMLKMGFALEWVDLISLCISSVRFSFNVNGVRCGGVVP
SRGLRQGDPLSPYLFLLCAEGLSSMLRGRGVQENSQNRLDRPKPTEPMSSVYRWGVKDQVGQILQVQVTAWHHQYLGLPSFMTRGRTSSLSFIKDRVWQQIQGWKGKLFS
VGGREVLLKFVVQAIPCYSMNCFRLPKKLILDINRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIVQNPLPSWACSEGQIFSFRRFL
EGGCGIETVLYMEEVRSPVTLAPETRVADLMTTSGQWNEGLIRQHFRPYEVSLILSIPVRAGAEDRIVWHYEQSGPVFGEKQISVGSSSLACATSVFLFERVRNGRAGES
SLHVFWHCKFVKTVLMESEFGSLIHNGPRTSGGTSGVGGVLYLVVPAGASACGVGLQSVRAREDVRWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSTSLV
QRHVRSPEMAEGWARSRGETGSGDGLCPLVLETDSSRVASFFQDGLGMTSRCGWPSGELRRDMPGPSFFSCRFTRREGNEVAHQLAFLAGRDEESRVWFESVPSCVETLI
LSDMALCDVFSYRFLCGLCVLYMFNGSLDGGCVGGRWVEGLINQFGLVGAAFGEASLTTEIRIRVLS