CuGenDBv2

Lag0028760 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028760
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr8:30128583..30132532
RNA-Seq Expression: Lag0028760
Synteny: Lag0028760
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
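The genome location above is given as a chromosome name followed by a start..end range. As a minimal sketch, the string can be split into its parts and the span of the locus computed. The helper names (GenomeLocation, parse_location) are hypothetical, and the span assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'chrom:start..end' string into chromosome, start, and end."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return GenomeLocation(chrom, start, end)


loc = parse_location("chr8:30128583..30132532")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the locus spans 3950 bp on chr8. If the coordinates were half-open instead, the span would be one base shorter.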


Homology
GenBank top hits | e value | % identity | Alignment
CAB4262994.1 unnamed protein product [Prunus armeniaca] | 3.1e-103 | 28.93
Query:  MSTLFWNARGMGSDRALYMLHKLVQQHRPLLVFLSETMVHSSRFELIKVKLGFDGCFSVNSNGKSGGLALLWDNRSAWWIWV------------------
        M  L WN +G+G+ R  + L  ++++ +P LVFL ET V +++ E  +++LG  G   V   G  GGLAL W  RS W + +                  
Subjt:  MSTLFWNARGMGSDRALYMLHKLVQQHRPLLVFLSETMVHSSRFELIKVKLGFDGCFSVNSNGKSGGLALLWDNRSAWWIWV------------------

Query:  ------------TKGEKFTW---------------CNRQLGAEVVWEQIDRCFGNLACQDLFPQQEVTHLDFSRSDHRPVFL-SILATPQVRGGRGRRIQ
                    T+    +W                       +V E++DRC  N    D +      HL    SDH  + + + +  P+    R RR  
Subjt:  ------------TKGEKFTW---------------CNRQLGAEVVWEQIDRCFGNLACQDLFPQQEVTHLDFSRSDHRPVFL-SILATPQVRGGRGRRIQ

Query:  HFEDVWLMYPEFRSVVEEVWQL-----GAVDASAAGWLELLTGAL---------YLYPLGARAALSNLGRVGSRRDLQEA--EANLESVLIEDEVY----
        HFE++W   P+F  V+EE W++     G  ++ +    EL T              Y     AAL   GR+ + +   +A  E  +  +L + ++     
Subjt:  HFEDVWLMYPEFRSVVEEVWQL-----GAVDASAAGWLELLTGAL---------YLYPLGARAALSNLGRVGSRRDLQEA--EANLESVLIEDEVY----

Query:  -------------------SSKGLKQDTICGIEDGSGLWHQDSRDILGLISNYFSHIYSSCNPSGDEIDKAIGNVQVTVTDVMNGQLLKPFTRDDIVVAL
                           +S   K++ +CGI   + +W  D + I  L  +YF  ++SS       +++ +  V+  +T   N +LL+ FTR+++   L
Subjt:  -------------------SSKGLKQDTICGIEDGSGLWHQDSRDILGLISNYFSHIYSSCNPSGDEIDKAIGNVQVTVTDVMNGQLLKPFTRDDIVVAL

Query:  GQIHPNKAPDPDEMLGAFYRRSWEIVGDDVVKCCLRILNNKESLEVLNETMIVLIPKVR--------------------------NPRQGVLSKLISQNQ
         Q+ P KA   + M   F+++ W IVG+ VV  CL+ILN +ES    N T+I LIPK++                          N  + +L  +I++NQ
Subjt:  GQIHPNKAPDPDEMLGAFYRRSWEIVGDDVVKCCLRILNNKESLEVLNETMIVLIPKVR--------------------------NPRQGVLSKLISQNQ

Query:  RAFIPRWCVVDNAIVGFECIHALRSKGRGMVGWVSLKQDMSKAYDRVEWVFLEKIMLKKGFDPDWVELITN---------VGEA----------------
          F+P   ++DN +  FE +H ++   +G    ++LK DM+KAYDRVEW F  ++MLK GF   WV  + +          G A                
Subjt:  RAFIPRWCVVDNAIVGFECIHALRSKGRGMVGWVSLKQDMSKAYDRVEWVFLEKIMLKKGFDPDWVELITN---------VGEA----------------

Query:  ------------------------------QAIQDILQCYERASGQTVNFDKSTIAFSPNTDS--------------QVQENVNG-----------VLQS
                                       A++ + Q YE  SGQ +N+ KS  + SPN                 Q  E   G           + Q 
Subjt:  ------------------------------QAIQDILQCYERASGQTVNFDKSTIAFSPNTDS--------------QVQENVNG-----------VLQS

Query:  VEED-----SGLEREAFFCGRRKVLLKSAVQVILCYTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLA
        +++      SG + +      +++L+K+  Q I  Y+M+CFRIPK +  E++ +M++FWW+ V++ R I WV W+ LCK K  GGLG +DLE FNQALL 
Subjt:  VEED-----SGLEREAFFCGRRKVLLKSAVQVILCYTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLA

Query:  KMCWRILHCPNSLLARMLKGRYFPSSELLDAYVGSRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLPCDTSLRITSPP
        K C  I+  P SL+AR+ + RY PS+  L+A VG+ PSF+WRSL WG+ELL KG+RW++G GE  ++Y + WLP  +  +I S P
Subjt:  KMCWRILHCPNSLLARMLKGRYFPSSELLDAYVGSRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLPCDTSLRITSPP

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis] | 1.5e-105 | 31.18
Query:  GEKFTWCNRQLGAEVVWEQIDRCFGNLACQDLFPQQEVTHLDFSRSDHRPVFLSILA-TPQVRGGRGRRIQHFEDVWLMYPEFRSVVEEVWQLGAVDASA
        G  FTW  R     VV E++DRC  N      +     +HL    SDH P+ +   A  P+    R  R  HFE++W   P+F  V+EE W++     S 
Subjt:  GEKFTWCNRQLGAEVVWEQIDRCFGNLACQDLFPQQEVTHLDFSRSDHRPVFLSILA-TPQVRGGRGRRIQHFEDVWLMYPEFRSVVEEVWQLGAVDASA

Query:  AGWLELLTGALYLYPLGARAALSNLGRVGSRRDLQEAEANLESV--LIEDEVYSSKGLKQDTICGI--------EDGSGLWHQDSRDILGLISNYFSHIY
        +  L L    L  +        +++     R  L+ A   L ++   +  + +  K   ++TI  +           S +W  + + I  L  +YF  ++
Subjt:  AGWLELLTGALYLYPLGARAALSNLGRVGSRRDLQEAEANLESV--LIEDEVYSSKGLKQDTICGI--------EDGSGLWHQDSRDILGLISNYFSHIY

Query:  SSCNPSGDEIDKAIGNVQVTVTDVMNGQLLKPFTRDDIVVALGQIHPNKAPDPDEMLGAFYRRSWEIVGDDVVKCCLRILNNKESLEVLNETMIVLIPKV
        SS    G ++++ +  V+  +T  MN +LL+ FTR+++   L Q+ P KAP  D M   F+++ W IVGD V K CL+ILN + S+   N T+I LIPKV
Subjt:  SSCNPSGDEIDKAIGNVQVTVTDVMNGQLLKPFTRDDIVVALGQIHPNKAPDPDEMLGAFYRRSWEIVGDDVVKCCLRILNNKESLEVLNETMIVLIPKV

Query:  RNPR--------------------------QGVLSKLISQNQRAFIPRWCVVDNAIVGFECIHALRSKGRGMVGWVSLKQDMSKAYDRVEWVFLEKIMLK
        + P                           + VL  +I++NQ AF+P   ++DN +  FE +H ++   +G    ++LK DM+KAYDRVEWVFL ++MLK
Subjt:  RNPR--------------------------QGVLSKLISQNQRAFIPRWCVVDNAIVGFECIHALRSKGRGMVGWVSLKQDMSKAYDRVEWVFLEKIMLK

Query:  KGFDPDWVELITN---------------VGE---------------------------------------------------------------------
         GF   WV  + +               VG                                                                      
Subjt:  KGFDPDWVELITN---------------VGE---------------------------------------------------------------------

Query:  AQAIQDILQCYERASGQTVNFDKSTIAFSPNTDSQVQENVNGVL--------------------------QSVEED-----SGLEREAFFCGRRKVLLKS
         +A++ + Q YE  SGQ +N+ KS  + SPN      + + GVL                          Q +++      SG + +      +++L+K+
Subjt:  AQAIQDILQCYERASGQTVNFDKSTIAFSPNTDSQVQENVNGVL--------------------------QSVEED-----SGLEREAFFCGRRKVLLKS

Query:  AVQVILCYTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAKMCWRILHCPNSLLARMLKGRYFPSSEL
         +Q I  Y+M+CFRIPK +  E++ +M+RFWW+  ++ R IHWV W+ LCK K  GGLG +DLE FNQALLAK CWRIL  P SL+AR+ + RY PS   
Subjt:  AVQVILCYTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAKMCWRILHCPNSLLARMLKGRYFPSSEL

Query:  LDAYVGSRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLPCDTSLRITSPP
        L+A VG+ PSF+WRSL WG+ELL KG+RW++G G   ++Y   WLP  +  +I SPP
Subjt:  LDAYVGSRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLPCDTSLRITSPP

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis] | 3.7e-03 | 34.57
Query:  WLVPPNNWYKLNVDAAFDREKQLVGASLCVRNSSREVMASAMRFHEFVRDSDLAEGRAMADGLKFATEMGFFPVMIETDSK
        W  PP   YK+NVD A      + G  + VRN++ E MA+ +R  +    +   E  A  +GL+FA +MGF   ++E D++
Subjt:  WLVPPNNWYKLNVDAAFDREKQLVGASLCVRNSSREVMASAMRFHEFVRDSDLAEGRAMADGLKFATEMGFFPVMIETDSK

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis] | 1.1e-103 | 30.14
Query:  GEKFTWCNRQLGAEVVWEQIDRCFGNLACQDLFPQQEVTHLDFSRSDHRPVFLSILATPQVRGGRGRRIQHFEDVWLMYPEFRSVVEEVWQLGAVDASAA
        G  FTW  R     VV E++DRC  N      +     +HL    SDH P+ +         G +  R  HFE++W   PEF  V+EE W++     S +
Subjt:  GEKFTWCNRQLGAEVVWEQIDRCFGNLACQDLFPQQEVTHLDFSRSDHRPVFLSILATPQVRGGRGRRIQHFEDVWLMYPEFRSVVEEVWQLGAVDASAA

Query:  GWLELLTGAL----YLYPLGARAALSNL--------GRVGSRRDLQEA--EANLESVLIEDEVY-----------------------SSKGLKQDTICGI
          L L    L    +++    R  L++         GR+ + + + +A  E  +  +L + E+                        +S   K++ +CGI
Subjt:  GWLELLTGAL----YLYPLGARAALSNL--------GRVGSRRDLQEA--EANLESVLIEDEVY-----------------------SSKGLKQDTICGI

Query:  EDGSGLWHQDSRDILGLISNYFSHIYSSCNPSGDEIDKAIGNVQVTVTDVMNGQLLKPFTRDDIVVALGQIHPNKAPDPDEMLGAFYRRSWEIVGDDVVK
         D +  W  + + I  L  +YF  ++SS    G ++++ +  V+  +T  MN +LL+ FTR+++   L Q+ P KAP  D M   F+++ W IVGD V K
Subjt:  EDGSGLWHQDSRDILGLISNYFSHIYSSCNPSGDEIDKAIGNVQVTVTDVMNGQLLKPFTRDDIVVALGQIHPNKAPDPDEMLGAFYRRSWEIVGDDVVK

Query:  CCLRILNNKESLEVLNETMIVLIPKVRNPR--------------------------QGVLSKLISQNQRAFIPRWCVVDNAIVGFECIHALRSKGRGMVG
         CL+ILN + S+   N T+I LIPKV+ P                           + VL  +I++ Q AF+P   ++DN +  FE ++ ++   +    
Subjt:  CCLRILNNKESLEVLNETMIVLIPKVRNPR--------------------------QGVLSKLISQNQRAFIPRWCVVDNAIVGFECIHALRSKGRGMVG

Query:  WVSLKQDMSKAYDRVEWVFLEKIMLKKGFDPDWVELI---------------TNVGE-------------------------------------------
         ++LK DM+KAYDRVEWVFL  +MLK GF   WV  +               T VG                                            
Subjt:  WVSLKQDMSKAYDRVEWVFLEKIMLKKGFDPDWVELI---------------TNVGE-------------------------------------------

Query:  --------------------------AQAIQDILQCYERASGQTVNFDKSTIAFSPNTDS---------------QVQENVNG-----------VLQSVE
                                    A++ + Q YE  +GQ +N+ KS ++ SPN                  +  EN  G           + Q ++
Subjt:  --------------------------AQAIQDILQCYERASGQTVNFDKSTIAFSPNTDS---------------QVQENVNG-----------VLQSVE

Query:  ED-----SGLEREAFFCGRRKVLLKSAVQVILCYTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAKM
        +      SG + +      +++L+K+ +Q I  Y+M+CF+IPK +  E++ +M+RFWW+  ++ R IHWV W+ LCK K  GGLG +DLE FNQALLAK 
Subjt:  ED-----SGLEREAFFCGRRKVLLKSAVQVILCYTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAKM

Query:  CWRILHCPNSLLARMLKGRYFPSSELLDAYVGSRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLPCDTSLRITSPP
        CWRIL  P SL+AR+ + RY PS   L+A VG+ PSF+W SL WG+ELL KG+RW++G G   ++Y   WLP  +  +I SPP
Subjt:  CWRILHCPNSLLARMLKGRYFPSSELLDAYVGSRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLPCDTSLRITSPP

XP_024038343.1 uncharacterized protein LOC112097373 [Citrus clementina] | 6.5e-117 | 30.61
Query:  MSTLFWNARGMGSDRALYMLHKLVQQHRPLLVFLSETMVHSSRFELIKVKLGFDGCFSVNSNGKSGGLALLW----------------------------
        M  L WN RG+G+ R    L K++Q H P LVFL ET + + +   +  KL ++ CF+V+S GK GGLALLW                            
Subjt:  MSTLFWNARGMGSDRALYMLHKLVQQHRPLLVFLSETMVHSSRFELIKVKLGFDGCFSVNSNGKSGGLALLW----------------------------

Query:  ---------DNRSAWWIWVT---------KGEKFTWCNRQLGAEVVWEQIDRCFGNLACQDLFPQQEVTHLDFSRSDHRPVFLSILATPQVRGG------
                 D R     W           KG  +TW N + G   V E++DR   N A +D+F     T++D   SDH PV + +    QVRG       
Subjt:  ---------DNRSAWWIWVT---------KGEKFTWCNRQLGAEVVWEQIDRCFGNLACQDLFPQQEVTHLDFSRSDHRPVFLSILATPQVRGG------

Query:  RGRRIQHFEDVWLMYPEFRSVVEEVWQLGA-------------VDASAAGWLELLTGALY------LYPLGARAALSNLGRVG--SRRDLQEAEANLESV
        R   + H+ED+W  Y   + ++E+ W L               V  ++   L L +   +      L  L  +     L RV       ++E E  ++ +
Subjt:  RGRRIQHFEDVWLMYPEFRSVVEEVWQLGA-------------VDASAAGWLELLTGALY------LYPLGARAALSNLGRVG--SRRDLQEAEANLESV

Query:  LIEDEVY-----------------------SSKGLKQDTICGIEDGSGLWHQDSRDILGLISNYFSHIYSSCNPSGDEIDKAIGNVQVTVTDVMNGQLLK
        L +DE+Y                       +S   K++ I GIE+ +G W +++  +    + YF++++++  P+ D+I  A+  +   V+  MN  L  
Subjt:  LIEDEVY-----------------------SSKGLKQDTICGIEDGSGLWHQDSRDILGLISNYFSHIYSSCNPSGDEIDKAIGNVQVTVTDVMNGQLLK

Query:  PFTRDDIVVALGQIHPNKAPDPDEMLGAFYRRSWEIVGDDVVKCCLRILNNKESLEVLNETMIVLIPKVRNPR--------------------------Q
        PFT +++V AL Q+ P KAP PD +   F+++ W+ V   V+  CL ILN +  +   N T IVLI K   PR                          +
Subjt:  PFTRDDIVVALGQIHPNKAPDPDEMLGAFYRRSWEIVGDDVVKCCLRILNNKESLEVLNETMIVLIPKVRNPR--------------------------Q

Query:  GVLSKLISQNQRAFIPRWCVVDNAIVGFECIHALR-SKGRGMVGWVSLKQDMSKAYDRVEWVFLEKIMLKKGFDPDWVELITNV----------------
         VL  LIS  Q AFIP W + DN IVG+EC+H +R  KGR   G V+LK D+SKAYD++EWVFLE+ M   GF  +WV LI ++                
Subjt:  GVLSKLISQNQRAFIPRWCVVDNAIVGFECIHALR-SKGRGMVGWVSLKQDMSKAYDRVEWVFLEKIMLKKGFDPDWVELITNV----------------

Query:  -------------------------GEAQAIQDILQCYERASGQTVNFDKSTIAFSPNTDSQVQENVNGVLQ-----------SVEEDSGLEREAFF---
                                  + Q ++ I  CY   SGQ  NF+KS++  + N  +     +  + Q            +    G +R +FF   
Subjt:  -------------------------GEAQAIQDILQCYERASGQTVNFDKSTIAFSPNTDSQVQENVNGVLQ-----------SVEEDSGLEREAFF---

Query:  -----------------CGRRKVLLKSAVQVILCYTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAK
                          G ++VL+K+A Q I  Y M+ F+IP  I  +I R+++ FWW    + + IHW  W  L + KC GG+G +D  +FNQAL+AK
Subjt:  -----------------CGRRKVLLKSAVQVILCYTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAK

Query:  MCWRILHCPNSLLARMLKGRYFPSSELLDAYVGSRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLPCDTSLRITSPPLWGWMLKSDAL
          WRIL  P+SL+A++L+ RYF   + ++A +GS PSF+WRS+LWGR+++ KG +W+IG G+   ++  NWLP   + +  S P     L +DAL
Subjt:  MCWRILHCPNSLLARMLKGRYFPSSELLDAYVGSRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLPCDTSLRITSPPLWGWMLKSDAL

XP_042939444.1 uncharacterized protein LOC122274474 [Carya illinoinensis] | 2.3e-114 | 31.67
Query:  MSTLFWNARGMGSDRALYMLHKLVQQHRPLLVFLSETMVHSSRFELIKVKLGFDGCFSVN------SNGKSGG----LALLWDNRSAWWIWV--------
        M+ L WN RG+G+ R +  LH LV+  +PLLVFLSET  ++ R + IK+ LGF+ CFSVN      ++ K  G    L  L    +  W+ +        
Subjt:  MSTLFWNARGMGSDRALYMLHKLVQQHRPLLVFLSETMVHSSRFELIKVKLGFDGCFSVN------SNGKSGG----LALLWDNRSAWWIWV--------

Query:  -------------------------------TKGEKFTWCNRQLGAEVVWEQIDRCFGNLACQDLFPQQEVTHLDFSRSDHRPVFLSILATPQVRGGRGR
                                         G+KFTW N + G +   E++DR  GN    +LF    VTHLD ++SDH+ + +    +  +  G+ R
Subjt:  -------------------------------TKGEKFTWCNRQLGAEVVWEQIDRCFGNLACQDLFPQQEVTHLDFSRSDHRPVFLSILATPQVRGGRGR

Query:  RIQHFEDVWLMYPEFRSVVEEVWQLGAVDASAAGWLELLT---GALYLYPLG----ARAALSN---LGRVGSRR-------DLQEAEANLESVLIEDEV-
        R+  FE  W    E   ++++VW+L     +    L+ L    G L ++        R AL N   L R+   R       +++E + ++ S++  + + 
Subjt:  RIQHFEDVWLMYPEFRSVVEEVWQLGAVDASAAGWLELLT---GALYLYPLG----ARAALSN---LGRVGSRR-------DLQEAEANLESVLIEDEV-

Query:  ----------------------YSSKGLKQDTICGIEDGSGLWHQDSRDILGLISNYFSHIYSSCNPSGDEIDKAIGNVQVTVTDVMNGQLLKPFTRDDI
                               SS+  + ++I  I+  SGL  QD + I   +  +F+ +++S +PSG  ID  +  +Q  +TD M   L   FT  ++
Subjt:  ----------------------YSSKGLKQDTICGIEDGSGLWHQDSRDILGLISNYFSHIYSSCNPSGDEIDKAIGNVQVTVTDVMNGQLLKPFTRDDI

Query:  VVALGQIHPNKAPDPDEMLGAFYRRSWEIVGDDVVKCCLRILNNKESLEVLNETMIVLIPKVRNPR--------------------------QGVLSKLI
          A   ++P  +P PD     F++R W+IVG  V K  L +LN  +    LNET+I LIPK  NP                           + +L  +I
Subjt:  VVALGQIHPNKAPDPDEMLGAFYRRSWEIVGDDVVKCCLRILNNKESLEVLNETMIVLIPKVRNPR--------------------------QGVLSKLI

Query:  SQNQRAFIPRWCVVDNAIVGFECIHALRSKGRGMVGWVSLKQDMSKAYDRVEWVFLEKIMLKKGFDPDWVELI----------TNVGEAQAIQDILQCYE
        S  Q AF+P   + DN IV FE +H ++++ +G  G+++LK DMSKAYDR+EW FL  +MLK GF   W+EL+           N  E   +Q +L  YE
Subjt:  SQNQRAFIPRWCVVDNAIVGFECIHALRSKGRGMVGWVSLKQDMSKAYDRVEWVFLEKIMLKKGFDPDWVELI----------TNVGEAQAIQDILQCYE

Query:  RASGQTVNFDKSTIAFSPNTDSQVQENVNGV-----LQSVEEDSGL--------------------------EREAFFCGRRKVLLKSAVQVILCYTMNC
        RASGQ +N DK++I FS NT  Q +E V  +       S E+  GL                          + +      ++ L+K+ +Q I  Y+M  
Subjt:  RASGQTVNFDKSTIAFSPNTDSQVQENVNGV-----LQSVEEDSGL--------------------------EREAFFCGRRKVLLKSAVQVILCYTMNC

Query:  FRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAKMCWRILHCPNSLLARMLKGRYFPSSELLDAYVGSRPSFV
        F++P++++ E+++ +  FWW   E++ RIHWV+WK + K K  GGLG++D E FN+ALLAK  WRI+  PNSL AR+LK +YF  S  LD   GS  SF+
Subjt:  FRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAKMCWRILHCPNSLLARMLKGRYFPSSELLDAYVGSRPSFV

Query:  WRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLPCDTSLRITS
        W+S +  R LL++G+ W++G G+   ++   WLP  TS ++ S
Subjt:  WRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLPCDTSLRITS

TrEMBL top hits | e value | % identity | Alignment
A0A5E4FZN9 PREDICTED: retrotransposon | 7.2e-106 | 31.18
Query:  GEKFTWCNRQLGAEVVWEQIDRCFGNLACQDLFPQQEVTHLDFSRSDHRPVFLSILA-TPQVRGGRGRRIQHFEDVWLMYPEFRSVVEEVWQLGAVDASA
        G  FTW  R     VV E++DRC  N      +     +HL    SDH P+ +   A  P+    R  R  HFE++W   P+F  V+EE W++     S 
Subjt:  GEKFTWCNRQLGAEVVWEQIDRCFGNLACQDLFPQQEVTHLDFSRSDHRPVFLSILA-TPQVRGGRGRRIQHFEDVWLMYPEFRSVVEEVWQLGAVDASA

Query:  AGWLELLTGALYLYPLGARAALSNLGRVGSRRDLQEAEANLESV--LIEDEVYSSKGLKQDTICGI--------EDGSGLWHQDSRDILGLISNYFSHIY
        +  L L    L  +        +++     R  L+ A   L ++   +  + +  K   ++TI  +           S +W  + + I  L  +YF  ++
Subjt:  AGWLELLTGALYLYPLGARAALSNLGRVGSRRDLQEAEANLESV--LIEDEVYSSKGLKQDTICGI--------EDGSGLWHQDSRDILGLISNYFSHIY

Query:  SSCNPSGDEIDKAIGNVQVTVTDVMNGQLLKPFTRDDIVVALGQIHPNKAPDPDEMLGAFYRRSWEIVGDDVVKCCLRILNNKESLEVLNETMIVLIPKV
        SS    G ++++ +  V+  +T  MN +LL+ FTR+++   L Q+ P KAP  D M   F+++ W IVGD V K CL+ILN + S+   N T+I LIPKV
Subjt:  SSCNPSGDEIDKAIGNVQVTVTDVMNGQLLKPFTRDDIVVALGQIHPNKAPDPDEMLGAFYRRSWEIVGDDVVKCCLRILNNKESLEVLNETMIVLIPKV

Query:  RNPR--------------------------QGVLSKLISQNQRAFIPRWCVVDNAIVGFECIHALRSKGRGMVGWVSLKQDMSKAYDRVEWVFLEKIMLK
        + P                           + VL  +I++NQ AF+P   ++DN +  FE +H ++   +G    ++LK DM+KAYDRVEWVFL ++MLK
Subjt:  RNPR--------------------------QGVLSKLISQNQRAFIPRWCVVDNAIVGFECIHALRSKGRGMVGWVSLKQDMSKAYDRVEWVFLEKIMLK

Query:  KGFDPDWVELITN---------------VGE---------------------------------------------------------------------
         GF   WV  + +               VG                                                                      
Subjt:  KGFDPDWVELITN---------------VGE---------------------------------------------------------------------

Query:  AQAIQDILQCYERASGQTVNFDKSTIAFSPNTDSQVQENVNGVL--------------------------QSVEED-----SGLEREAFFCGRRKVLLKS
         +A++ + Q YE  SGQ +N+ KS  + SPN      + + GVL                          Q +++      SG + +      +++L+K+
Subjt:  AQAIQDILQCYERASGQTVNFDKSTIAFSPNTDSQVQENVNGVL--------------------------QSVEED-----SGLEREAFFCGRRKVLLKS

Query:  AVQVILCYTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAKMCWRILHCPNSLLARMLKGRYFPSSEL
         +Q I  Y+M+CFRIPK +  E++ +M+RFWW+  ++ R IHWV W+ LCK K  GGLG +DLE FNQALLAK CWRIL  P SL+AR+ + RY PS   
Subjt:  AVQVILCYTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAKMCWRILHCPNSLLARMLKGRYFPSSEL

Query:  LDAYVGSRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLPCDTSLRITSPP
        L+A VG+ PSF+WRSL WG+ELL KG+RW++G G   ++Y   WLP  +  +I SPP
Subjt:  LDAYVGSRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLPCDTSLRITSPP

A0A5E4FZN9 PREDICTED: retrotransposon | 1.8e-03 | 34.57
Query:  WLVPPNNWYKLNVDAAFDREKQLVGASLCVRNSSREVMASAMRFHEFVRDSDLAEGRAMADGLKFATEMGFFPVMIETDSK
        W  PP   YK+NVD A      + G  + VRN++ E MA+ +R  +    +   E  A  +GL+FA +MGF   ++E D++
Subjt:  WLVPPNNWYKLNVDAAFDREKQLVGASLCVRNSSREVMASAMRFHEFVRDSDLAEGRAMADGLKFATEMGFFPVMIETDSK

A0A5E4FZN9 PREDICTED: retrotransposon | 7.2e-106 | 30.76
Query:  GEKFTWCNRQLGAEVVWEQIDRCFGNLACQDLFPQQEVTHLDFSRSDHRPVFLSILATPQVRGGRGRRIQHFEDVWLMYPEFRSVVEEVW----QLGAVD
        G  +TWCN Q GA  ++ ++DR        + F + +V HL  S SDH  +F+S   +  ++  R RR  HFE  W    + R ++E VW     +   +
Subjt:  GEKFTWCNRQLGAEVVWEQIDRCFGNLACQDLFPQQEVTHLDFSRSDHRPVFLSILATPQVRGGRGRRIQHFEDVWLMYPEFRSVVEEVW----QLGAVD

Query:  ASAAGWLELLTGALYLYPLGA-----------RAALSNL------GRVGSRRDLQEAEANLESVLIEDEVY-----------------------SSKGLK
          A+G L+L    L  +               R  LS L      G +G   ++   E N   +L ++E++                       +S+  K
Subjt:  ASAAGWLELLTGALYLYPLGA-----------RAALSNL------GRVGSRRDLQEAEANLESVLIEDEVY-----------------------SSKGLK

Query:  QDTICGIEDGSGLWHQDSRDILGLISNYFSHIYSSCNPSGDEIDKAIGNVQVTVTDVMNGQLLKPFTRDDIVVALGQIHPNKAPDPDEMLGAFYRRSWEI
        Q+TI G+ D  G W +D   I      YF  IYS+ NPS   +D+    +   +T+ MN +L + FTR++IV AL QIHP K+P PD M   F+++ W+I
Subjt:  QDTICGIEDGSGLWHQDSRDILGLISNYFSHIYSSCNPSGDEIDKAIGNVQVTVTDVMNGQLLKPFTRDDIVVALGQIHPNKAPDPDEMLGAFYRRSWEI

Query:  VGDDVVKCCLRILNNKESLEVLNETMIVLIPKVRNPRQ----------GVLSKLIS----------------QNQRAFIPRWCVVDNAIVGFECIHALRS
        VG +V    L +LN   SL+V+N+T IVLIPK  NP++           V+ KLIS                +NQ AF     + DN ++ +E +H L+ 
Subjt:  VGDDVVKCCLRILNNKESLEVLNETMIVLIPKVRNPRQ----------GVLSKLIS----------------QNQRAFIPRWCVVDNAIVGFECIHALRS

Query:  KGRGMVGWVSLKQDMSKAYDRVEWVFLEKIMLKKGFDPDWVELI--------------------------------------------------------
        K  G   +++ K DMSKA+DRVEW F+E++M K GF+  W+ LI                                                        
Subjt:  KGRGMVGWVSLKQDMSKAYDRVEWVFLEKIMLKKGFDPDWVELI--------------------------------------------------------

Query:  ----------------------------TNVGEAQAIQDILQCYERASGQTVNFDKSTIAFSPNTDSQVQENVNGVLQSVEED-----------------
                                     N  E + +++IL+ YE ASGQ VN DKS+I FSPNT  +++E +  +L  +++                  
Subjt:  ----------------------------TNVGEAQAIQDILQCYERASGQTVNFDKSTIAFSPNTDSQVQENVNGVLQSVEED-----------------

Query:  --------------SGLEREAFFCGRRKVLLKSAVQVILCYTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFN
                      +G + +    G +++L+K+  Q I  YTM+CF +PK +  E+ +MM  FWW    ++ ++ W++W+ +CKPK LGGLG ++L  FN
Subjt:  --------------SGLEREAFFCGRRKVLLKSAVQVILCYTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFN

Query:  QALLAKMCWRILHCPNSLLARMLKGRYFPSSELLDAYVGSRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLPCDTSLRITSPP
         ALLAK  WRIL  P SL AR+LK +YFP  ++L+A +GS PS+ WRS+    E+L+KG RW++G G R  I+   WLP  ++ ++ +PP
Subjt:  QALLAKMCWRILHCPNSLLARMLKGRYFPSSELLDAYVGSRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLPCDTSLRITSPP

A0A803P614 Uncharacterized protein | 5.5e-106 | 28.19
Query:  KGEKFTWCNRQLGAEVVWEQIDRCFGNLACQDLFPQQEVTHLDFSRSDHRPVFLSILATP--QVRGGRGRRIQHFEDVWLMYPEFRSVVEEVWQ------
        +G+ FTW   +  A  + E++D CF N    ++F   +V HLD+  SDHR +  +++ TP   V+  + R    FE +WL  PE R ++   W       
Subjt:  KGEKFTWCNRQLGAEVVWEQIDRCFGNLACQDLFPQQEVTHLDFSRSDHRPVFLSILATP--QVRGGRGRRIQHFEDVWLMYPEFRSVVEEVWQ------

Query:  -----LGAVDASAA---GWLELLTGALYLYPLGARAALSNLGRVGSR-----RDLQEAEANLESVLIEDEVY-----------------------SSKGL
             L  +D  A     W     G +      A+  ++NL    +R      +L+ +E+ L+ +L ++E+Y                       +S   
Subjt:  -----LGAVDASAA---GWLELLTGALYLYPLGARAALSNLGRVGSR-----RDLQEAEANLESVLIEDEVY-----------------------SSKGL

Query:  KQDTICGIEDGSGLWHQDSRDILGLISNYFSHIYSSCNPSGDEIDKAIGNVQVTVTDVMNGQLLKPFTRDDIVVALGQIHPNKAPDPDEMLGAFYRRSWE
          + I  + + +GL      D++ +I +YF +I+ +     D +   +  + V V+  MN  L +PFTR ++  AL  + P+K+P  D M   FY++ W+
Subjt:  KQDTICGIEDGSGLWHQDSRDILGLISNYFSHIYSSCNPSGDEIDKAIGNVQVTVTDVMNGQLLKPFTRDDIVVALGQIHPNKAPDPDEMLGAFYRRSWE

Query:  IVGDDVVKCCLRILNNKESLEVLNETMIVLIPKVRNPRQ----------GVLSKL----------------ISQNQRAFIPRWCVVDNAIVGFECIHALR
        +VGD V +  L +LN       LN T+I LIPK + P+            V+SKL                IS+ Q AF+P   + DN +V FE +HA++
Subjt:  IVGDDVVKCCLRILNNKESLEVLNETMIVLIPKVRNPRQ----------GVLSKL----------------ISQNQRAFIPRWCVVDNAIVGFECIHALR

Query:  SKGRGMVGWVSLKQDMSKAYDRVEWVFLEKIMLKKGFDPDWVELI-------------------------------------------------------
        ++  G  G VSLK DMSKA+DRVEW F++ +M K GF   W++LI                                                       
Subjt:  SKGRGMVGWVSLKQDMSKAYDRVEWVFLEKIMLKKGFDPDWVELI-------------------------------------------------------

Query:  -----------------------------TNVGEAQAIQDILQCYERASGQTVNFDKSTIAFSPNTD--SQV-------------QENVNGVLQSVEEDS
                                      N     AI+ +L  Y RASGQ +N DKS ++FSPNT   SQV              E   G+      D 
Subjt:  -----------------------------TNVGEAQAIQDILQCYERASGQTVNFDKSTIAFSPNTD--SQV-------------QENVNGVLQSVEEDS

Query:  ----------------GLEREAFFCGRRKVLLKSAVQVILCYTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETF
                            + F  G R+VLLK+ VQ I  Y M+CFR+P  +  ++  MM+ FWW   E   +IHW AW  LCK K  GG+G +  E F
Subjt:  ----------------GLEREAFFCGRRKVLLKSAVQVILCYTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETF

Query:  NQALLAKMCWRILHCPNSLLARMLKGRYFPSSELLDAYVGSRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLP-CDTSLRITSPPLWGWMLKS
        NQALLAK  WR+L  P+SLL ++LK RYFP+++ L A  G  PS  W+ ++WGR LL KG+RW+IG+G   RI    W+P CDT     +P    ++  +
Subjt:  NQALLAKMCWRILHCPNSLLARMLKGRYFPSSELLDAYVGSRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLP-CDTSLRITSPPLWGWMLKS

Query:  DALSGAMPFFLVALSAW----LVEWM-PKNGDP-NFCGAIMGHLGL------------------------------QKSSTVQGGSL--SWELSAWSAGL
        DA+   +   +     W    L +W  P + DP N  G    H GL                              +++  V G S   +  L++++   
Subjt:  DALSGAMPFFLVALSAW----LVEWM-PKNGDP-NFCGAIMGHLGL------------------------------QKSSTVQGGSL--SWELSAWSAGL

Query:  LDSYRRV-----NAQSSVGVRRGSGREAVKWLVPPNNWYKLNVDAAFDREKQLVGASLCVRNSSREVMASAMRFHEFVRDSDLAEGRAMADGLKFATEMG
        L ++R           S      S + A KW  P     KLNVDAA D  + +VG    VRNS   V+A+  +       S + E +AM   L +  ++ 
Subjt:  LDSYRRV-----NAQSSVGVRRGSGREAVKWLVPPNNWYKLNVDAAFDREKQLVGASLCVRNSSREVMASAMRFHEFVRDSDLAEGRAMADGLKFATEMG

Query:  FFPVMIETDSKRVCELMR-------GEREELSDLKLLVTITPQ
            ++ETD+  V    +          + L D+  L++  PQ
Subjt:  FFPVMIETDSKRVCELMR-------GEREELSDLKLLVTITPQ

A0A803PUH4 Uncharacterized protein | 8.0e-105 | 28.19
Query:  MSTLFWNARGMGSDRALYMLHKLVQQHRPLLVFLSETMVHSSRFELIKVKLGFDGCFSVNSNGKSGGLALLWDN---------------------RSAWW
        M  L WN +G+G+   +  L  LV +  P LVF+SE+ +   + E ++V LG+DGCF V ++GKSGGL LLW N                        WW
Subjt:  MSTLFWNARGMGSDRALYMLHKLVQQHRPLLVFLSETMVHSSRFELIKVKLGFDGCFSVNSNGKSGGLALLWDN---------------------RSAWW

Query:  IWV-------------------------------------------TKGEKFTWCNRQLGAEVVWEQIDRCFGNLACQDLFPQQEVTHLDFSRSDHRPVF
         +                                             +G ++TWCN +   E+++E++DR  GN    DLFP+ +V HLD   SDH P+ 
Subjt:  IWV-------------------------------------------TKGEKFTWCNRQLGAEVVWEQIDRCFGNLACQDLFPQQEVTHLDFSRSDHRPVF

Query:  LSILAT--PQVRGGRGRRIQHFEDVWLMYPEFRSVVEEVWQLGAVDASAAGWLELLT---GALYLYPLGARAALSN-----------LGRVGSRRD---L
        L  L        G R     HFE  W    +   +V E W  G    +A    + L     AL  +    +A + +           L R  + +D   L
Subjt:  LSILAT--PQVRGGRGRRIQHFEDVWLMYPEFRSVVEEVWQLGAVDASAAGWLELLT---GALYLYPLGARAALSN-----------LGRVGSRRD---L

Query:  QEAEANLESVLIEDEVY-----------------------SSKGLKQDTICGIEDGSGLWHQDSRDILGLISNYFSHIYSSCNPSGDEIDKAIGNVQVTV
        ++ E     +L ++E +                       ++   +++TI G+ D +G W   ++ +  +   YF  +++S + S  ++D+    V   +
Subjt:  QEAEANLESVLIEDEVY-----------------------SSKGLKQDTICGIEDGSGLWHQDSRDILGLISNYFSHIYSSCNPSGDEIDKAIGNVQVTV

Query:  TDVMNGQLLKPFTRDDIVVALGQIHPNKAPDPDEMLGAFYRRSWEIVGDDVVKCCLRILNNKESLEVLNETMIVLIPKVRNPRQGV--------------
        +  MN  L  PFT++D+  A+  IHP+KAP  D M G FYR  W  +G++V K CL ILN    L  +N+T+I LIPK+  P +                
Subjt:  TDVMNGQLLKPFTRDDIVVALGQIHPNKAPDPDEMLGAFYRRSWEIVGDDVVKCCLRILNNKESLEVLNETMIVLIPKVRNPRQGV--------------

Query:  ------------LSKLISQNQRAFIPRWCVVDNAIVGFECIHALRSKGRGMVGWVSLKQDMSKAYDRVEWVFLEKIMLKKGFDPDWVELIT---------
                    L++ IS+ Q AF+    + DNAI+GFE +H ++ +  G    ++LK DMSKAYDRVEW FL  +M   G++ DW+E I          
Subjt:  ------------LSKLISQNQRAFIPRWCVVDNAIVGFECIHALRSKGRGMVGWVSLKQDMSKAYDRVEWVFLEKIMLKKGFDPDWVELIT---------

Query:  ---------------------------------------------------------------------------NVGEAQAIQDILQCYERASGQTVNF
                                                                                   N  E   +  ILQ Y R SGQ +N 
Subjt:  ---------------------------------------------------------------------------NVGEAQAIQDILQCYERASGQTVNF

Query:  DKSTIAFSPNTDSQVQE---NVNGVLQSVEEDSGLEREAFFCGRRK-----------------------------VLLKSAVQVILCYTMNCFRIPKKIV
        +KS ++      SQ+     N  GV    +    L   +F  GRRK                             +L+K+ +Q I  Y+M+CFR+PKK++
Subjt:  DKSTIAFSPNTDSQVQE---NVNGVLQSVEEDSGLEREAFFCGRRK-----------------------------VLLKSAVQVILCYTMNCFRIPKKIV

Query:  GEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAKMCWRILHCPNSLLARMLKGRYFPSSELLDAYVGSRPSFVWRSLLWGR
          +  + + FWW   +E+++IHW  W  LCKPK  GGLG + L  FNQALLAK  WR++H P+SLLAR+LK  Y+P+S  L A      S +W+ + WGR
Subjt:  GEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAKMCWRILHCPNSLLARMLKGRYFPSSELLDAYVGSRPSFVWRSLLWGR

Query:  ELLQKGIRWQIGKGERTRIYGSNWLP
        +++ +G RW++G G   RI+   W+P
Subjt:  ELLQKGIRWQIGKGERTRIYGSNWLP

M5WJW2 Reverse transcriptase domain-containing protein | 5.2e-104 | 30.14
Query:  GEKFTWCNRQLGAEVVWEQIDRCFGNLACQDLFPQQEVTHLDFSRSDHRPVFLSILATPQVRGGRGRRIQHFEDVWLMYPEFRSVVEEVWQLGAVDASAA
        G  FTW  R     VV E++DRC  N      +     +HL    SDH P+ +         G +  R  HFE++W   PEF  V+EE W++     S +
Subjt:  GEKFTWCNRQLGAEVVWEQIDRCFGNLACQDLFPQQEVTHLDFSRSDHRPVFLSILATPQVRGGRGRRIQHFEDVWLMYPEFRSVVEEVWQLGAVDASAA

Query:  GWLELLTGAL----YLYPLGARAALSNL--------GRVGSRRDLQEA--EANLESVLIEDEVY-----------------------SSKGLKQDTICGI
          L L    L    +++    R  L++         GR+ + + + +A  E  +  +L + E+                        +S   K++ +CGI
Subjt:  GWLELLTGAL----YLYPLGARAALSNL--------GRVGSRRDLQEA--EANLESVLIEDEVY-----------------------SSKGLKQDTICGI

Query:  EDGSGLWHQDSRDILGLISNYFSHIYSSCNPSGDEIDKAIGNVQVTVTDVMNGQLLKPFTRDDIVVALGQIHPNKAPDPDEMLGAFYRRSWEIVGDDVVK
         D +  W  + + I  L  +YF  ++SS    G ++++ +  V+  +T  MN +LL+ FTR+++   L Q+ P KAP  D M   F+++ W IVGD V K
Subjt:  EDGSGLWHQDSRDILGLISNYFSHIYSSCNPSGDEIDKAIGNVQVTVTDVMNGQLLKPFTRDDIVVALGQIHPNKAPDPDEMLGAFYRRSWEIVGDDVVK

Query:  CCLRILNNKESLEVLNETMIVLIPKVRNPR--------------------------QGVLSKLISQNQRAFIPRWCVVDNAIVGFECIHALRSKGRGMVG
         CL+ILN + S+   N T+I LIPKV+ P                           + VL  +I++ Q AF+P   ++DN +  FE ++ ++   +    
Subjt:  CCLRILNNKESLEVLNETMIVLIPKVRNPR--------------------------QGVLSKLISQNQRAFIPRWCVVDNAIVGFECIHALRSKGRGMVG

Query:  WVSLKQDMSKAYDRVEWVFLEKIMLKKGFDPDWVELI---------------TNVGE-------------------------------------------
         ++LK DM+KAYDRVEWVFL  +MLK GF   WV  +               T VG                                            
Subjt:  WVSLKQDMSKAYDRVEWVFLEKIMLKKGFDPDWVELI---------------TNVGE-------------------------------------------

Query:  --------------------------AQAIQDILQCYERASGQTVNFDKSTIAFSPNTDS---------------QVQENVNG-----------VLQSVE
                                    A++ + Q YE  +GQ +N+ KS ++ SPN                  +  EN  G           + Q ++
Subjt:  --------------------------AQAIQDILQCYERASGQTVNFDKSTIAFSPNTDS---------------QVQENVNG-----------VLQSVE

Query:  ED-----SGLEREAFFCGRRKVLLKSAVQVILCYTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAKM
        +      SG + +      +++L+K+ +Q I  Y+M+CF+IPK +  E++ +M+RFWW+  ++ R IHWV W+ LCK K  GGLG +DLE FNQALLAK 
Subjt:  ED-----SGLEREAFFCGRRKVLLKSAVQVILCYTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAKM

Query:  CWRILHCPNSLLARMLKGRYFPSSELLDAYVGSRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLPCDTSLRITSPP
        CWRIL  P SL+AR+ + RY PS   L+A VG+ PSF+W SL WG+ELL KG+RW++G G   ++Y   WLP  +  +I SPP
Subjt:  CWRILHCPNSLLARMLKGRYFPSSELLDAYVGSRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLPCDTSLRITSPP

SwissProt top hits | e value | % identity | Alignment
P0C2F6 Putative ribonuclease H protein At1g65750 | 4.6e-17 | 26.95
Query:  SGLEREAFFCGRRKVLLKSAVQVILCYTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAKMCWRILHC
        SG   +      R  L K+ +  +  ++M+   +P+ I+  + ++   F W    E ++ H V W  +C PK  GGLG++  ++ N+AL++K+ WR+L  
Subjt:  SGLEREAFFCGRRKVLLKSAVQVILCYTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAKMCWRILHC

Query:  PNSLLARMLKGRYFPSSELLDAYVGSRPSF--VWRSLLWG-RELLQKGIRWQIGKGERTRIYGSNWL
         NSL   +L+ +Y         ++  + S+   WRS+  G R+++  G+ W  G G++ R +   W+
Subjt:  PNSLLARMLKGRYFPSSELLDAYVGSRPSF--VWRSLLWG-RELLQKGIRWQIGKGERTRIYGSNWL

P93295 Uncharacterized mitochondrial protein AtMg00310 | 6.2e-38 | 48.03
Query:  YTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPK-CLGGLGLKDLETFNQALLAKMCWRILHCPNSLLARMLKGRYFPSSELLDAYVG
        Y M+CFR+ K +  +++  M+ FWWS  E  R+I WVAW+ LCK K   GGLG +DL  FNQALLAK  +RI+H P++LL+R+L+ RYFP S +++  VG
Subjt:  YTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPK-CLGGLGLKDLETFNQALLAKMCWRILHCPNSLLARMLKGRYFPSSELLDAYVG

Query:  SRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLPCDTSLRITSPPL
        +RPS+ WRS++ GRELL +G+   IG G  T+++   W+  +T L    PPL
Subjt:  SRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLPCDTSLRITSPPL

Arabidopsis top hits | e value | % identity | Alignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 1.8e-11 | 28.3
Query:  REAFFCGRRKVLLKSAVQVILCYTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAKMCWRILHCPNSL
        R   F GR + L+ S +  +  + M+ FR+P   + EI  + S F WSG E + +   VAW D+C PK  GGLG++ L+  N+       W I    N+ 
Subjt:  REAFFCGRRKVLLKSAVQVILCYTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAKMCWRILHCPNSL

Query:  LARMLKGRYFPSSELLDAYVGSRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNW
        L                       S++W+ +L  R L    ++  I  G  T  +  NW
Subjt:  LARMLKGRYFPSSELLDAYVGSRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNW

AT4G20520.1 RNA binding;RNA-directed DNA polymerases | 1.5e-10 | 38.16
Query:  VLSKLISQNQRAFIPRWCVVDNAIVGFECIHALRSKGRGMVGWVSLKQDMSKAYDRVEWVFLEKIMLKKGFDPDWV
        +++ LI   Q +FIP     DN +   E +H++R K +G+ GW+ LK D+ KAYDR+ W +LE  ++  GF   W+
Subjt:  VLSKLISQNQRAFIPRWCVVDNAIVGFECIHALRSKGRGMVGWVSLKQDMSKAYDRVEWVFLEKIMLKKGFDPDWV

AT4G29090.1 Ribonuclease H-like superfamily protein | 1.9e-34 | 43.79
Query:  YTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAKMCWRILHCPNSLLARMLKGRYFPSSELLDAYVGS
        YTM CF +PK +  +I  +++ FWW   +E + +HW AW  L   K  GG+G KD+E FN ALL K  WR+L  P SL+A++ K RYF  S+ L+A +GS
Subjt:  YTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAKMCWRILHCPNSLLARMLKGRYFPSSELLDAYVGS

Query:  RPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWL---PCDTSLRITSPP
        RPSFVW+S+   +E+L++G R  +G GE   I+   WL   P   +LR+   P
Subjt:  RPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWL---PCDTSLRITSPP

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 4.4e-39 | 48.03
Query:  YTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPK-CLGGLGLKDLETFNQALLAKMCWRILHCPNSLLARMLKGRYFPSSELLDAYVG
        Y M+CFR+ K +  +++  M+ FWWS  E  R+I WVAW+ LCK K   GGLG +DL  FNQALLAK  +RI+H P++LL+R+L+ RYFP S +++  VG
Subjt:  YTMNCFRIPKKIVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPK-CLGGLGLKDLETFNQALLAKMCWRILHCPNSLLARMLKGRYFPSSELLDAYVG

Query:  SRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLPCDTSLRITSPPL
        +RPS+ WRS++ GRELL +G+   IG G  T+++   W+  +T L    PPL
Subjt:  SRPSFVWRSLLWGRELLQKGIRWQIGKGERTRIYGSNWLPCDTSLRITSPPL


Sequences
CDS sequence
ATGAGTACGCTTTTCTGGAATGCACGAGGCATGGGGTCAGATCGTGCATTATATATGTTGCATAAGCTAGTGCAACAGCATCGACCTCTTTTGGTGTTCCTTTCAGAAAC
GATGGTTCACTCTTCACGTTTTGAGTTGATTAAAGTGAAATTGGGCTTCGATGGTTGCTTCAGTGTAAACAGTAATGGCAAGAGTGGTGGTTTGGCTTTACTATGGGATA
ATCGATCAGCTTGGTGGATCTGGGTTACAAAAGGGGAGAAGTTTACTTGGTGTAACAGGCAGCTTGGGGCTGAGGTAGTATGGGAACAAATTGATAGATGCTTTGGTAAT
CTGGCCTGTCAGGATCTGTTTCCTCAGCAGGAAGTGACTCATCTGGATTTCAGCAGATCGGACCACCGTCCGGTTTTTCTCTCAATTCTCGCTACTCCCCAAGTCAGAGG
AGGTCGTGGGCGTAGGATTCAGCATTTTGAGGATGTCTGGCTAATGTATCCAGAGTTCAGGAGTGTGGTGGAGGAGGTTTGGCAGTTAGGTGCAGTTGATGCCTCGGCAG
CGGGTTGGTTGGAGCTACTAACTGGTGCCTTGTATCTTTATCCACTTGGGGCAAGGGCTGCTTTGAGTAACCTGGGTAGAGTAGGCTCCAGGAGGGATCTTCAGGAGGCT
GAGGCAAATCTCGAGTCTGTTCTGATTGAGGATGAGGTTTACTCCAGCAAAGGTCTCAAGCAGGATACTATTTGTGGCATTGAGGATGGCTCGGGGTTGTGGCATCAAGA
TTCTCGAGACATTCTTGGGCTTATTTCGAATTACTTTTCCCATATTTACTCATCTTGTAATCCTTCTGGGGATGAGATTGACAAAGCTATTGGTAATGTGCAGGTGACTG
TGACAGACGTGATGAATGGGCAGTTGCTAAAGCCTTTTACTCGAGATGACATAGTGGTGGCTTTGGGGCAGATTCATCCCAATAAGGCCCCTGACCCGGACGAGATGTTA
GGTGCCTTCTATAGAAGGTCATGGGAGATAGTTGGGGATGATGTGGTTAAATGCTGTTTGAGGATTCTTAATAACAAGGAGTCGCTTGAGGTGCTAAATGAGACTATGAT
CGTGTTGATCCCGAAGGTGAGGAATCCAAGGCAGGGTGTGTTGTCTAAGCTGATCTCACAGAATCAGAGAGCCTTCATTCCTAGGTGGTGTGTGGTGGATAATGCCATAG
TTGGGTTTGAATGTATTCATGCGCTGAGGTCAAAAGGGAGAGGAATGGTGGGTTGGGTTTCGCTCAAGCAAGATATGAGCAAGGCGTACGACCGAGTGGAGTGGGTGTTC
TTGGAAAAGATCATGTTGAAAAAGGGTTTTGACCCTGATTGGGTGGAGTTGATTACGAATGTGGGTGAAGCTCAGGCCATTCAGGATATCCTACAGTGCTATGAGAGGGC
TTCTGGGCAGACGGTGAATTTTGATAAGTCCACGATTGCCTTTAGTCCGAATACTGATTCCCAGGTTCAGGAAAATGTGAATGGAGTCCTCCAGAGTGTGGAAGAAGATT
CAGGGTTGGAAAGGGAAGCTTTTTTTTGTGGGAGGAGAAAAGTTCTCCTTAAGTCAGCGGTGCAGGTTATTCTGTGTTATACGATGAACTGCTTTAGAATCCCAAAGAAG
ATAGTGGGAGAGATCAGTCGAATGATGTCACGTTTTTGGTGGAGTGGAGTTGAGGAGGACAGAAGGATCCACTGGGTGGCATGGAAGGATTTGTGTAAGCCTAAGTGCCT
TGGTGGTTTGGGCCTCAAAGACTTAGAGACCTTTAATCAGGCTTTACTCGCCAAGATGTGTTGGAGGATTTTGCATTGCCCAAATTCCCTGCTTGCCCGTATGTTGAAAG
GGAGGTATTTTCCCTCATCGGAGTTGTTGGATGCCTATGTAGGGTCCAGACCGTCCTTCGTTTGGAGGAGCCTTCTATGGGGGAGAGAGTTGCTTCAGAAGGGTATTAGA
TGGCAGATTGGCAAGGGTGAACGAACTAGGATCTATGGGTCCAATTGGCTTCCTTGTGATACATCTCTGAGAATCACATCCCCCCCTCTTTGGGGATGGATGCTCAAGTC
GGATGCCTTATCTGGCGCAATGCCCTTCTTCCTTGTCGCTCTCTCGGCTTGGTTGGTGGAATGGATGCCGAAAAATGGCGATCCCAACTTTTGTGGTGCTATTATGGGCC
ATTTGGGATTGCAGAAATCGAGTACTGTTCAAGGGGGGAGCCTGTCGTGGGAGTTGTCAGCTTGGTCGGCTGGGCTGCTTGATTCATATAGGCGTGTTAATGCCCAGTCG
TCGGTGGGAGTGAGGCGCGGGTCCGGCAGAGAAGCTGTGAAGTGGTTAGTGCCGCCAAACAATTGGTACAAATTGAATGTGGATGCTGCCTTCGATAGGGAAAAGCAGCT
TGTTGGGGCGAGTTTGTGCGTGAGAAATAGTAGCAGGGAAGTGATGGCATCAGCTATGAGATTCCATGAGTTCGTAAGGGATTCGGATTTGGCAGAGGGAAGGGCTATGG
CCGATGGTTTGAAGTTTGCTACTGAGATGGGCTTTTTCCCTGTGATGATTGAGACGGATTCTAAACGAGTTTGCGAGCTAATGCGAGGCGAAAGGGAGGAGCTGTCTGAT
TTGAAACTATTAGTTACTATAACGCCCCAGGATTTCCAATAA
mRNA sequence
ATGAGTACGCTTTTCTGGAATGCACGAGGCATGGGGTCAGATCGTGCATTATATATGTTGCATAAGCTAGTGCAACAGCATCGACCTCTTTTGGTGTTCCTTTCAGAAAC
GATGGTTCACTCTTCACGTTTTGAGTTGATTAAAGTGAAATTGGGCTTCGATGGTTGCTTCAGTGTAAACAGTAATGGCAAGAGTGGTGGTTTGGCTTTACTATGGGATA
ATCGATCAGCTTGGTGGATCTGGGTTACAAAAGGGGAGAAGTTTACTTGGTGTAACAGGCAGCTTGGGGCTGAGGTAGTATGGGAACAAATTGATAGATGCTTTGGTAAT
CTGGCCTGTCAGGATCTGTTTCCTCAGCAGGAAGTGACTCATCTGGATTTCAGCAGATCGGACCACCGTCCGGTTTTTCTCTCAATTCTCGCTACTCCCCAAGTCAGAGG
AGGTCGTGGGCGTAGGATTCAGCATTTTGAGGATGTCTGGCTAATGTATCCAGAGTTCAGGAGTGTGGTGGAGGAGGTTTGGCAGTTAGGTGCAGTTGATGCCTCGGCAG
CGGGTTGGTTGGAGCTACTAACTGGTGCCTTGTATCTTTATCCACTTGGGGCAAGGGCTGCTTTGAGTAACCTGGGTAGAGTAGGCTCCAGGAGGGATCTTCAGGAGGCT
GAGGCAAATCTCGAGTCTGTTCTGATTGAGGATGAGGTTTACTCCAGCAAAGGTCTCAAGCAGGATACTATTTGTGGCATTGAGGATGGCTCGGGGTTGTGGCATCAAGA
TTCTCGAGACATTCTTGGGCTTATTTCGAATTACTTTTCCCATATTTACTCATCTTGTAATCCTTCTGGGGATGAGATTGACAAAGCTATTGGTAATGTGCAGGTGACTG
TGACAGACGTGATGAATGGGCAGTTGCTAAAGCCTTTTACTCGAGATGACATAGTGGTGGCTTTGGGGCAGATTCATCCCAATAAGGCCCCTGACCCGGACGAGATGTTA
GGTGCCTTCTATAGAAGGTCATGGGAGATAGTTGGGGATGATGTGGTTAAATGCTGTTTGAGGATTCTTAATAACAAGGAGTCGCTTGAGGTGCTAAATGAGACTATGAT
CGTGTTGATCCCGAAGGTGAGGAATCCAAGGCAGGGTGTGTTGTCTAAGCTGATCTCACAGAATCAGAGAGCCTTCATTCCTAGGTGGTGTGTGGTGGATAATGCCATAG
TTGGGTTTGAATGTATTCATGCGCTGAGGTCAAAAGGGAGAGGAATGGTGGGTTGGGTTTCGCTCAAGCAAGATATGAGCAAGGCGTACGACCGAGTGGAGTGGGTGTTC
TTGGAAAAGATCATGTTGAAAAAGGGTTTTGACCCTGATTGGGTGGAGTTGATTACGAATGTGGGTGAAGCTCAGGCCATTCAGGATATCCTACAGTGCTATGAGAGGGC
TTCTGGGCAGACGGTGAATTTTGATAAGTCCACGATTGCCTTTAGTCCGAATACTGATTCCCAGGTTCAGGAAAATGTGAATGGAGTCCTCCAGAGTGTGGAAGAAGATT
CAGGGTTGGAAAGGGAAGCTTTTTTTTGTGGGAGGAGAAAAGTTCTCCTTAAGTCAGCGGTGCAGGTTATTCTGTGTTATACGATGAACTGCTTTAGAATCCCAAAGAAG
ATAGTGGGAGAGATCAGTCGAATGATGTCACGTTTTTGGTGGAGTGGAGTTGAGGAGGACAGAAGGATCCACTGGGTGGCATGGAAGGATTTGTGTAAGCCTAAGTGCCT
TGGTGGTTTGGGCCTCAAAGACTTAGAGACCTTTAATCAGGCTTTACTCGCCAAGATGTGTTGGAGGATTTTGCATTGCCCAAATTCCCTGCTTGCCCGTATGTTGAAAG
GGAGGTATTTTCCCTCATCGGAGTTGTTGGATGCCTATGTAGGGTCCAGACCGTCCTTCGTTTGGAGGAGCCTTCTATGGGGGAGAGAGTTGCTTCAGAAGGGTATTAGA
TGGCAGATTGGCAAGGGTGAACGAACTAGGATCTATGGGTCCAATTGGCTTCCTTGTGATACATCTCTGAGAATCACATCCCCCCCTCTTTGGGGATGGATGCTCAAGTC
GGATGCCTTATCTGGCGCAATGCCCTTCTTCCTTGTCGCTCTCTCGGCTTGGTTGGTGGAATGGATGCCGAAAAATGGCGATCCCAACTTTTGTGGTGCTATTATGGGCC
ATTTGGGATTGCAGAAATCGAGTACTGTTCAAGGGGGGAGCCTGTCGTGGGAGTTGTCAGCTTGGTCGGCTGGGCTGCTTGATTCATATAGGCGTGTTAATGCCCAGTCG
TCGGTGGGAGTGAGGCGCGGGTCCGGCAGAGAAGCTGTGAAGTGGTTAGTGCCGCCAAACAATTGGTACAAATTGAATGTGGATGCTGCCTTCGATAGGGAAAAGCAGCT
TGTTGGGGCGAGTTTGTGCGTGAGAAATAGTAGCAGGGAAGTGATGGCATCAGCTATGAGATTCCATGAGTTCGTAAGGGATTCGGATTTGGCAGAGGGAAGGGCTATGG
CCGATGGTTTGAAGTTTGCTACTGAGATGGGCTTTTTCCCTGTGATGATTGAGACGGATTCTAAACGAGTTTGCGAGCTAATGCGAGGCGAAAGGGAGGAGCTGTCTGAT
TTGAAACTATTAGTTACTATAACGCCCCAGGATTTCCAATAA
Protein sequence
MSTLFWNARGMGSDRALYMLHKLVQQHRPLLVFLSETMVHSSRFELIKVKLGFDGCFSVNSNGKSGGLALLWDNRSAWWIWVTKGEKFTWCNRQLGAEVVWEQIDRCFGN
LACQDLFPQQEVTHLDFSRSDHRPVFLSILATPQVRGGRGRRIQHFEDVWLMYPEFRSVVEEVWQLGAVDASAAGWLELLTGALYLYPLGARAALSNLGRVGSRRDLQEA
EANLESVLIEDEVYSSKGLKQDTICGIEDGSGLWHQDSRDILGLISNYFSHIYSSCNPSGDEIDKAIGNVQVTVTDVMNGQLLKPFTRDDIVVALGQIHPNKAPDPDEML
GAFYRRSWEIVGDDVVKCCLRILNNKESLEVLNETMIVLIPKVRNPRQGVLSKLISQNQRAFIPRWCVVDNAIVGFECIHALRSKGRGMVGWVSLKQDMSKAYDRVEWVF
LEKIMLKKGFDPDWVELITNVGEAQAIQDILQCYERASGQTVNFDKSTIAFSPNTDSQVQENVNGVLQSVEEDSGLEREAFFCGRRKVLLKSAVQVILCYTMNCFRIPKK
IVGEISRMMSRFWWSGVEEDRRIHWVAWKDLCKPKCLGGLGLKDLETFNQALLAKMCWRILHCPNSLLARMLKGRYFPSSELLDAYVGSRPSFVWRSLLWGRELLQKGIR
WQIGKGERTRIYGSNWLPCDTSLRITSPPLWGWMLKSDALSGAMPFFLVALSAWLVEWMPKNGDPNFCGAIMGHLGLQKSSTVQGGSLSWELSAWSAGLLDSYRRVNAQS
SVGVRRGSGREAVKWLVPPNNWYKLNVDAAFDREKQLVGASLCVRNSSREVMASAMRFHEFVRDSDLAEGRAMADGLKFATEMGFFPVMIETDSKRVCELMRGEREELSD
LKLLVTITPQDFQ
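
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: it assumes the standard nuclear code (NCBI translation table 1), and the names translate, first_30_nt and last_42_nt are hypothetical helpers introduced here for illustration. Only the first 30 nucleotides and the final 42-nucleotide line of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing 1-2 bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the CDS sequence listed above.
first_30_nt = "ATGAGTACGCTTTTCTGGAATGCACGAGGC"
last_42_nt = "TTGAAACTATTAGTTACTATAACGCCCCAGGATTTCCAATAA"

print(translate(first_30_nt))  # MSTLFWNARG
print(translate(last_42_nt))   # LKLLVTITPQDFQ
```

Both outputs agree with the start and end of the listed protein sequence. Passing the full CDS, with line breaks removed, to the same function should reproduce the whole protein under the same assumption about the genetic code.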