; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028913 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028913
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr8:32839236..32844847
RNA-Seq ExpressionLag0028913
SyntenyLag0028913
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017250619.1 PREDICTED: uncharacterized protein LOC108221234 [Daucus carota subsp. sativus]1.7e-23237.1Show/hide
Query:  DWVVRFEAIWDLQVVNK----DWAKSDHKPIELSLLGSGIVANRRRKNWNFKFEEWWTGHVECRDIISRTGKW-SEGPNDNLSLQRTLEQCAIVLKGWGF
        DW +  E   D    N     D+  SDH+ + LS        + +RK   F+FE  W    EC++I+     W +      +S    +EQC+  L  W  
Subjt:  DWVVRFEAIWDLQVVNK----DWAKSDHKPIELSLLGSGIVANRRRKNWNFKFEEWWTGHVECRDIISRTGKW-SEGPNDNLSLQRTLEQCAIVLKGWGF

Query:  NQNLRLRNEIKEIKSKI-KVSYERRGPIDFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDGTWQSDLSKI
         +   L  EIK+   ++ ++  +R   +   E+   E +L+ LLL+EE++WKQRSR  WL+ G++NTK+FHQ+A  R +   + G+ + +  W SD  + 
Subjt:  NQNLRLRNEIKEIKSKI-KVSYERRGPIDFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDGTWQSDLSKI

Query:  RKSFENYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIED
            + ++  LF+++ P+ + I  +L+ +   VT +MN++L   +T  E+  A+ +  P K+PGPDG  A+F+Q++W+IVG       L  LN E  +E 
Subjt:  RKSFENYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIED

Query:  WNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDR
         N T + LIPK ++P+ V D+RPISLCNV YK I KV+ NRLK +L  II+  QSAF+PGR I+DN ++++E LH L+  R GK  + A+KLDMSKAYDR
Subjt:  WNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDR

Query:  VEWIYLRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVDARNNS-MAGVSIARQCPKISHLFFAD
        VEWI++ +++ KLGF   W+  +MKC+T+ ++S  +NG+ +G + P+RG+RQGDPLSPYLFL+C+EG  ALL  A N S + G+ IAR  P ISHLFFAD
Subjt:  VEWIYLRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVDARNNS-MAGVSIARQCPKISHLFFAD

Query:  DSLVFLKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQF
        DSL+FLKA        ++I   Y + SGQ +NF KS +FFS NT  + +      L+M  +E++ +YLGLP    + K   F+ I ++VW  L  W+   
Subjt:  DSLVFLKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQF

Query:  FSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSK
        FSQGGKE+L+K++IQA+PTY M CF+IP+G   +I  L AR+WWGS   KR +HW+ W ++  PK  GGL FR  + +NQA+LAKQAWR+L  P   LS+
Subjt:  FSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSK

Query:  VLCGKYFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQFLIKE
        VL  KYF   S L        S  W+  VWG  LL +G+R+ +GNGQ+   FKDPWL RP +F  L     S   +KV E+      W+   ++Q  +  
Subjt:  VLCGKYFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQFLIKE

Query:  DVEMIKGLPIS-LTAPDRWIWHYNNTGEYSVKSGYKLSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHMTCPV
        D+++I  +P+S     D W WHYN+ G Y+VKSGYKL        S S+  +  +WWK  W  +IP ++ IF W+ +H  +P+   L    + +H  CP+
Subjt:  DVEMIKGLPIS-LTAPDRWIWHYNNTGEYSVKSGYKLSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHMTCPV

Query:  CHEELETTDHALFRCTRAQEIWRIFQPNIWMDQWDMLEIKD--RWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKAN
        C    ++  HA+F C  AQE+W +      + + + +  KD   +++   + D +   + +  W IW +RN +++ +   TP     W+  Y  E   A 
Subjt:  CHEELETTDHALFRCTRAQEIWRIFQPNIWMDQWDMLEIKD--RWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKAN

Query:  PLGGSLVQSEDEV----IQILSEGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLN
              +   D +    ++ +S G  + +  DA     T + G+G  +       +A     + G  S L AEA+A++ GLQ A+        VL+D  +
Subjt:  PLGGSLVQSEDEV----IQILSEGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLN

Query:  LIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMASE
        L++++N   +  + +   + D   LM  F  V    V+R  N  AHN+A +
Subjt:  LIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMASE

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]5.5e-22837.68Show/hide
Query:  DWVVRFEAIWDLQVVNKDWAKSDHKPIELSLLGSGIVANRRRKNWNFKFEEWWTGHVECRDIISRTGKWSEGPNDNLSLQRTLEQCAIVLKGWGFNQNLR
        DW  +F  +    +V+  +   DH     +LL +      R +   F FE  WT   +C+ II  +  +    +    +   L  CA+ L  W      +
Subjt:  DWVVRFEAIWDLQVVNKDWAKSDHKPIELSLLGSGIVANRRRKNWNFKFEEWWTGHVECRDIISRTGKWSEGPNDNLSLQRTLEQCAIVLKGWGFNQNLR

Query:  LRNEIKEIKSKIKVSYERRGPIDFD-EIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDGTWQSDLSKIRKSFE
        +  +I++ +S++     R    D   EI+ L  +++ LL +EE YW QR++ +WLK GDRNTK+FH +A+ RR++N + G+ D  G W  +   I ++  
Subjt:  LRNEIKEIKSKIKVSYERRGPIDFD-EIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDGTWQSDLSKIRKSFE

Query:  NYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIEDWNHTN
        +YF+ ++SS+ P+Q  I+ V   IP KVT+EMN+ L+  +T+EE+  A++  HP KAPGPDG  A+F+QKYW IVG+      L +LN  + I + N TN
Subjt:  NYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIEDWNHTN

Query:  IVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIY
        I LIPK   P+ ++D+RPISLCNV YK+I+K++ANRLK +L  II E QSAF   R I+DN++++ E +H+L  K  GK G+ A+KLDMSKA+DRVEW +
Subjt:  IVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIY

Query:  LRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVD-ARNNSMAGVSIARQCPKISHLFFADDSLVF
        + ++ME++GF + W  LVM+CIT+ S+SIL+NG A G I P+RG+RQGDPLSP LFLLC+EGL AL+   ARN  + G+SI R CPK++HLFFADDS++F
Subjt:  LRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVD-ARNNSMAGVSIARQCPKISHLFFADDSLVF

Query:  LKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGG
         KA  EE    RSIL  YE+ASGQ +N +KS++FFS NT  ET+  + +IL    +     YLGLPS   R K++ F  + ++V   L GWK +  S GG
Subjt:  LKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGG

Query:  KEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSKVLCGK
        KE+LIK++ QAIPTY M CF +P+G+   +  +   FWWG + ++  + W  W+ +C  K  GGL FR+L +FN AMLAKQAWR+L NPN  + +VL  +
Subjt:  KEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSKVLCGK

Query:  YFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWLPRPFTFKVLSQVNPSLTNIK---VAEFFNPSVQWDEAK-LQQFLIKED
        YFP   +L+    +S S+ W+     ++++++G R  +GNG+ I +++D WLP P T+KV+S   P + N +   V+   +P  +W + + L+   +  +
Subjt:  YFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWLPRPFTFKVLSQVNPSLTNIK---VAEFFNPSVQWDEAK-LQQFLIKED

Query:  VEMIKGLPISLTAP-DRWIWHYNNTGEYSVKSGYKL--SMLRPMGPSMSARGLDSR-WWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHMTC
        VE I  +P+S   P D+ IW  N  GE+SVKS Y +  S++ P      + G   R  WKK+W L +P ++KIF W++  + +P+  N+ +  +    TC
Subjt:  VEMIKGLPISLTAP-DRWIWHYNNTGEYSVKSGYKL--SMLRPMGPSMSARGLDSR-WWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHMTC

Query:  PVCHEELETTDHALFRCTRAQEIWRIFQPNIWMDQWDMLEIKDRWLSFA-AQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKA
        P+C    E  +HAL  C  A  +W  +       Q       D  L    ++   +L    V +WAIW +RN I++N    +P        + L ++ KA
Subjt:  PVCHEELETTDHALFRCTRAQEIWRIFQPNIWMDQWDMLEIKDRWLSFA-AQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKA

Query:  N-----PLGGSLVQSEDEVIQILSEGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDC
              P   S ++ E   + I        ++ D    D      IG+++R   G++ AA +  + G  +    EA+A+  G+ LA  L++ R+ +  D 
Subjt:  N-----PLGGSLVQSEDEVIQILSEGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDC

Query:  LNLIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMAS-EGYSAPPCLWLG
        L +I+++N +  G + +   L  I ++  +FE   F  VNR +N  AH +A         C+W G
Subjt:  LNLIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMAS-EGYSAPPCLWLG

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]4.5e-23840.09Show/hide
Query:  DWVVRFEAIWDLQVVNKDWAKSDHKPIELSLLGSGIV---ANRRRKNWNFKFEEWWTGHVECRDIISRTGKWSEGPNDNLSLQRTLEQCAIVLKGWGFNQ
        DWV  ++   D++V +   + SDH    L +  + +V   ANRRR    F+FE  WT   +C+DII      S   N    +   L  CA  L  W    
Subjt:  DWVVRFEAIWDLQVVNKDWAKSDHKPIELSLLGSGIV---ANRRRKNWNFKFEEWWTGHVECRDIISRTGKWSEGPNDNLSLQRTLEQCAIVLKGWGFNQ

Query:  NLRLRNEIKEIKSKIK--VSYERRGPIDFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDGTWQSDLSKIR
           +  +I+E K  +   V  +R G +   EI+ L ++++ LL  EE+ W+QRSR  WL  GDRNTK+FH KA+ RRR+N ++G+ D +G WQ     I 
Subjt:  NLRLRNEIKEIKSKIK--VSYERRGPIDFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDGTWQSDLSKIR

Query:  KSFENYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIEDW
        K   +YF  ++SS+ P +  I  VL  IP  VT+EMN  L+  +TREEI TA+   HPTKAPGPDG  A+F+QKYW+IVG+  V   L +LNS +S+ + 
Subjt:  KSFENYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIEDW

Query:  NHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRV
        N TNI L+PK + P  +SD+RPISLCNV YK+I+KV+ANRLK +L +II E QSAF+ GR I+DN++++ E +H+L+ K++GK G+AA+KLDMSKAYDRV
Subjt:  NHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRV

Query:  EWIYLRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVD-ARNNSMAGVSIARQCPKISHLFFADD
        EW +++Q+MEK+GFH+ WI LVM CIT+ S+SIL+NG A+G I PTRG+RQGDP+SPY+FLLC++G  +LL D AR   ++GVSI R CPKI+HLFFADD
Subjt:  EWIYLRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVD-ARNNSMAGVSIARQCPKISHLFFADD

Query:  SLVFLKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFF
        SL+F KA ++E      IL  YE ASGQ +N +KS+VFFS NTP E +  +  +L          YLGLPS   + K   F  + +RV   L GWK +  
Subjt:  SLVFLKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFF

Query:  SQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSKV
        S GG+E+LIK++ QAIPTY M CF+IPK +  +I ++  RFWWG + ++  + W  W++LCK K+ GG+ FR+L +FN AMLAKQ WR+++NPN  ++++
Subjt:  SQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSKV

Query:  LCGKYFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFN-PSVQWDEAKLQQFLIKE
           +Y+P   V       S S+ W+    G++++++G R  +GNG+ IL+++D WLP P T+KV+S   P     +V+   +    +W +  ++   +  
Subjt:  LCGKYFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFN-PSVQWDEAKLQQFLIKE

Query:  DVEMIKGLPISLTAP-DRWIWHYNNTGEYSVKSGYKLS--MLRPMGPSMSARGLDSR--WWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHM
        +   I  +P+S   P D+ IW  N  GE+SVKS Y ++  ++  +    S+ G DSR   W+K+W L IP +V+IF WK   N++P+  NL R  V +  
Subjt:  DVEMIKGLPISLTAP-DRWIWHYNNTGEYSVKSGYKLS--MLRPMGPSMSARGLDSR--WWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHM

Query:  TCPVCHEELETTDHALFRCTRAQEIWRIFQPN---IWMDQWDMLEIKDRWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTE
         CP C  E E+  H   +C  A+ +WR +  N   +     D+++I  + L F    D  L    V AWAIW +RN I++      P     +   Y+ E
Subjt:  TCPVCHEELETTDHALFRCTRAQEIWRIFQPN---IWMDQWDMLEIKDRWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTE

Query:  YWKANPLGGSLVQSEDEVIQILSEGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCL
        +  A+     ++   D        G   I + D    +      +G+++R   G + AA    + G +S    EA+A+  GL LAK  K+ ++ + SD L
Subjt:  YWKANPLGGSLVQSEDEVIQILSEGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCL

Query:  NLIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMA
         ++  +N   +    +      I +L+++F     + V R YN+ AH +A
Subjt:  NLIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMA

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]1.3e-22939.49Show/hide
Query:  DWVVRFEAIWDLQVVNKDWAKSDHKPIELSLLGSGIVANRRRKNWNFKFEEWWTGHVECRDIISRTGKWSEGPNDNL-SLQRTLEQCAIVLKGWGFNQNL
        +W  RF+     +VV+     SDH P+ L +        R+    +FKFEE W    EC  +I       +G  D L ++Q  ++ C + L  WG +   
Subjt:  DWVVRFEAIWDLQVVNKDWAKSDHKPIELSLLGSGIVANRRRKNWNFKFEEWWTGHVECRDIISRTGKWSEGPNDNL-SLQRTLEQCAIVLKGWGFNQNL

Query:  RLRNEIKEIKSKI-KVSYERRGPIDFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDGTWQSDLSKIRKSF
             IKEI+ ++ +++          E   L +++D+LL ++E+YW QRSR NWL+ GDRNTK+FH KA+ RRRKN++ G+ +  G W  +L ++ +  
Subjt:  RLRNEIKEIKSKI-KVSYERRGPIDFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDGTWQSDLSKIRKSF

Query:  ENYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIEDWNHT
         +YF  LF +   +Q  ++  L  +  KVT++M + L   +T EE+  A+    PTKAPGPDG  ALFYQK+W IVGD  VS  L  LN+   + + NHT
Subjt:  ENYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIEDWNHT

Query:  NIVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWI
        NIVLIPK + P  +S++RPISLCNV YKII+KV+ANRLK VL +II   QSAF+PGR I+DN+++++ETLH +  ++KGK G  ALKLD+SKAYDRVEW 
Subjt:  NIVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWI

Query:  YLRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVDARNNSM-AGVSIARQCPKISHLFFADDSLV
        +L+ IMEK+GF   WI  VM C+TT SFSIL+NG+ +  I+P+RGIRQGDP+SPYLFLLC+EGL ALL  A  N M  GVSI R  PKI++L FADDSL+
Subjt:  YLRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVDARNNSM-AGVSIARQCPKISHLFFADDSLV

Query:  FLKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQG
        F +A   E      IL  YE+ASGQ +N EKS+ +FS NT    K  +  IL +   +    YLGLP+   R K   F  + DRVW  LQGWK    S+ 
Subjt:  FLKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQG

Query:  GKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSKVLCG
        GKE+LIK++ QAIPTY M  F+IP  + +++ +LCARFWWG    +R +HWK W++L  PK+ GG+ FRDL +FN AMLAKQ WR++   +  L +    
Subjt:  GKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSKVLCG

Query:  KYFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQ-WDEAKLQQFLIKEDVE
        +YFPR S L      + SF W+  +    +L+ G    +GNG +I   KD WLP   T KVL+ V    + + VAE  NP    W+  +++    +++ E
Subjt:  KYFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQ-WDEAKLQQFLIKEDVE

Query:  MIKGLPISLT-APDRWIWHYNNTGEYSVKSGYKLS--MLRPMGPSMSARGLDSR-WWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHMTCPV
         I  +P+S    PD   W Y   G +SVKS Y ++  +L       ++ G+ ++  W  +WKLR+PN+VK+F W++ H  +P+  NL    +     C +
Subjt:  MIKGLPISLT-APDRWIWHYNNTGEYSVKSGYKLS--MLRPMGPSMSARGLDSR-WWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHMTCPV

Query:  CHEELETTDHALFRCTRAQEIWRIFQPNIWM---DQWDMLEIKDRWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKA
        C  E E+T HAL+ C   Q+IW      +      Q DM+++ +  L    Q +  L      AW +W  RNS+L+   +  P       ++Y+TE+  A
Subjt:  CHEELETTDHALFRCTRAQEIWRIFQPNIWM---DQWDMLEIKDRWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKA

Query:  -NPLGGSLVQSEDEVIQILSEGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLI
         N L     Q     I       E  ++ DA       + G G ++R + G + AA   S     +   AE +A    L+        RL V  D +N+ 
Subjt:  -NPLGGSLVQSEDEVIQILSEGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLI

Query:  KSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMA
         +I+  L   S     L DI  L+ + + V+     R  N+ AH +A
Subjt:  KSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMA

XP_030941688.1 uncharacterized protein LOC115966628 [Quercus lobata]3.6e-22739.04Show/hide
Query:  SDHKPIELSLLGSGIVANRRRKNWNFKFEEWWTGHVECRDIISRTGKWSEGPNDNLSL---QRTLEQCAIVLKGWGFNQNLRLRNEIKEIKSKIK-----
        SDH PI L         NRR  N+ FKFEE W    +C +++     W +  N   +L   ++ +E C   L+ WG ++      EIK ++ +++     
Subjt:  SDHKPIELSLLGSGIVANRRRKNWNFKFEEWWTGHVECRDIISRTGKWSEGPNDNLSL---QRTLEQCAIVLKGWGFNQNLRLRNEIKEIKSKIK-----

Query:  -VSYERRGPIDFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDGTWQSDLSKIRKSFENYFSALFSSAQPN
          + E R      E   + + LDNLL ++E+YW QRSR  WLK GD+N+K+FH KA+ RRR+N++ G+ D++  W  +   I     NYF  LFS+   +
Subjt:  -VSYERRGPIDFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDGTWQSDLSKIRKSFENYFSALFSSAQPN

Query:  QNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIEDWNHTNIVLIPKGRQPRLV
        Q  I   L+ +P KVTD+M Q L   +T  EI  A+    PTKAPGPDG  ALFYQK+W IVGD  +S  L   NS V   + NHTNIVLIPK   P  +
Subjt:  QNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIEDWNHTNIVLIPKGRQPRLV

Query:  SDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDC
        SD+RPISLCNV YKII+KV+ANRLK VL  II   QSAF+PG  I+DN++++ +TLH ++ +RKGK G  ALKLD+SKAYDRVEW +L+ IM KLGF D 
Subjt:  SDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDC

Query:  WIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVDAR-NNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRS
        WI  VM C++T +FS+L+NG+ FG I P+RG+RQGDPLSPYLFLLC+EG  +LL  A     + GVSI ++ P+ISHL FADDSL+F +A  +E      
Subjt:  WIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVDAR-NNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRS

Query:  ILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIP
        IL  Y  ASGQC+N EKS++ FS NTP   K++    L +   E   SYLGLP+   R K + F FI DRVW  LQGWK +  S+ GKEVLIK++ Q+IP
Subjt:  ILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIP

Query:  TYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSKVLCGKYFPRISVLHVSYL
        TY MG F++P  +  ++ ++CARFWWG    +R +HWK W  L + K+ GG+ FRDL +F  A+LAKQ WR++ + +  L +    +YFPR + L  S  
Subjt:  TYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSKVLCGKYFPRISVLHVSYL

Query:  TSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFN-PSVQWDEAKLQQFLIKEDVEMIKGLPIS-LTAPD
         +SSF WK  +  M +LKQG    +G+G +I + +D W+    T KVL          +V++  +  S  WD   +     ++D E I  +P+S     D
Subjt:  TSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFN-PSVQWDEAKLQQFLIKEDVEMIKGLPIS-LTAPD

Query:  RWIWHYNNTGEYSVKSGYKLS--MLRPMGPSMSARGLDSR-WWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHMTCPVCHEELETTDHALFR
          +W +   G YSV+SGY L+  +LR    +  + G   R  W+K+WKL++PN++K++ W++  + +P+  NL R  +     C VC   LET  HAL+ 
Subjt:  RWIWHYNNTGEYSVKSGYKLS--MLRPMGPSMSARGLDSR-WWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHMTCPVCHEELETTDHALFR

Query:  CTRAQEIWR---IFQPNIWMDQWDMLEIKDRWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKANPLGGSLVQSEDEV
        C  AQ++W    +          D+L++    L+  ++ +  L      +W IWN RN++++   I  P   C+  + YL EY +A     S+   +  V
Subjt:  CTRAQEIWR---IFQPNIWMDQWDMLEIKDRWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKANPLGGSLVQSEDEV

Query:  IQILSEG-EEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGHLQGQSSIS
         Q      +   ++ DA         G G ++R + G + AA         +   AEA+A    ++ A       L +  D +N++K++  +   +S + 
Subjt:  IQILSEG-EEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGHLQGQSSIS

Query:  TTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMAS-EGYSAPPCLWLGAFP
            DI+ L+      +   V R  N  AH++A    Y +   +WL   P
Subjt:  TTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMAS-EGYSAPPCLWLGAFP

TrEMBL top hitse value%identityAlignment
A0A2N9E9A1 Reverse transcriptase domain-containing protein9.5e-22637.47Show/hide
Query:  DWVVRFEAIWDLQVVNKDWAKSDHKPIELSLLGSGIVANRRRKNWNFKFEEWWTGHVECRDIISRTGKWSEGPNDNLSLQRTLEQCAIVLKGWGFNQNLR
        +W++ F      +V + D   SDHKPI   L+ +  +    RK   F+FEE WT    C + ++ + K  +       +   +  C   L+ W ++    
Subjt:  DWVVRFEAIWDLQVVNKDWAKSDHKPIELSLLGSGIVANRRRKNWNFKFEEWWTGHVECRDIISRTGKWSEGPNDNLSLQRTLEQCAIVLKGWGFNQNLR

Query:  LRNEIKEIKSKIK--VSYERRGPIDFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDGTWQSDLSKIRKSF
        L+ +I E ++++K   +   RG  D  +   L+ +L  LL +EE  W+QRSR  WL+ GD+NT++FH +AT RRR+N +  ++D  G+W    +++ + F
Subjt:  LRNEIKEIKSKIK--VSYERRGPIDFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDGTWQSDLSKIRKSF

Query:  ENYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIEDWNHT
          +++ LF+SA PNQ  I++V+  IPR VT EMN  L   +   E++ A++   P K+PGPDGFP +FYQKYW I+G+      L  LNS   ++  NHT
Subjt:  ENYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIEDWNHT

Query:  NIVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWI
        +I LIPK + P  V D+RPISLCNV YKII+KV+ NRLK +L +I+ E QSAF+PGR I+DN++++ ETLH + ++R+GK G  ALKLDMSKAYDRVEW 
Subjt:  NIVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWI

Query:  YLRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVDARN-NSMAGVSIARQCPKISHLFFADDSLV
        YL ++M+++GFH+ W+ ++M+CI+T S+SIL+NGE  G+I+P+RG+RQGDPLSPYLFL C+EGL +LL  A+N  +M GVSI+R  PK++HLFFADDSL+
Subjt:  YLRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVDARN-NSMAGVSIARQCPKISHLFFADDSLV

Query:  FLKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQG
        F KA   E    + IL  YE ASGQ +N +K+T+FFS +TP   + ++  +L + +      YLGLPS   R K   F  I +RVW+ L+GWK +  SQ 
Subjt:  FLKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQG

Query:  GKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSKVLCG
        GKE+LIKS+ QAIPTYAM CFR+P+ ++ +I  L  RFWWG + E+  +HW  W+ LC+ K+ GG+ FR+L  FN+A+LAKQ WR++ N N    KV   
Subjt:  GKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSKVLCG

Query:  KYFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNP-SVQWDEAKLQQFLIKEDVE
        KYFP+ S+L       SS+ WK  +    ++++G    +G+G+ I ++ D WLP      ++S   P  T   V +  +P S  W+ A ++   +  + E
Subjt:  KYFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNP-SVQWDEAKLQQFLIKEDVE

Query:  MIKGLPISLT-APDRWIWHYNNTGEYSVKSGYKLSM--LRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHMTCPVC
         I G+P+  T   D  IW    +G Y+V+SGY L +   +   P +S     ++ WK +W L +P +V+ F+W++ H+S+P+  NL   HV     CP C
Subjt:  MIKGLPISLT-APDRWIWHYNNTGEYSVKSGYKLSM--LRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHMTCPVC

Query:  HEELETTDHALFRCTRAQEIWRIFQPNIW------MDQWDMLEIKDRWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYW
           +E+T HAL +C   Q IW   Q   W          D +E+  R +      +  L ++ +  W+IW  RN    N+PI           + LTE+ 
Subjt:  HEELETTDHALFRCTRAQEIWRIFQPNIW------MDQWDMLEIKDRWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYW

Query:  KANPLGGSLVQSEDEVIQIL----SEGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSD
        +A        +  +    I       G   +    A F + + + GIG ++R   G + AA    I   +S    EA A    +     L ++ + +  D
Subjt:  KANPLGGSLVQSEDEVIQIL----SEGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSD

Query:  CLNLIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMASEGYSAPPC-LWLGAFP
         L +I ++       +S    + D   L        F  V R  N  AH++A       P  +W+ + P
Subjt:  CLNLIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMASEGYSAPPC-LWLGAFP

A0A2N9F5W1 Reverse transcriptase domain-containing protein6.1e-22536.83Show/hide
Query:  DWVVRFEAIWDLQVVNKDWAKSDHKPIELSLLGSGIVANRRRKNWNFKFEEWWTGHVECRDIISRTGKWSEGPNDNLSLQ--RTLEQCAIVLKGWGFNQN
        DW+++F    + +V +     SDH+P+ + L  SG   N R     F+FEE WT H  C + I +   W    + +L  Q    ++     LK W FN+ 
Subjt:  DWVVRFEAIWDLQVVNKDWAKSDHKPIELSLLGSGIVANRRRKNWNFKFEEWWTGHVECRDIISRTGKWSEGPNDNLSLQ--RTLEQCAIVLKGWGFNQN

Query:  ----LRLRNEIKEIKSKIKVSYERRGPIDFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDGTWQSDLSKI
              +  + ++++ ++++S   +      +++G   +L +L  +EE  WKQRSR  WL+ GDRNTK+FH +AT R+R+N +HG+ D  G WQS   ++
Subjt:  ----LRLRNEIKEIKSKIKVSYERRGPIDFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDGTWQSDLSKI

Query:  RKSFENYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIED
          +   Y+  LF+++QP   D   +LS + R +T +MNQ L A +T  E+  A+    P KA G DG   +FYQKYW+IVG    +  L+ L     ++ 
Subjt:  RKSFENYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIED

Query:  WNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDR
         NHT+I LIPK + P  V D+RPISLCNV YKII KV+ANRLK +L  II E QSAF+PGR ISDN++++ ETLH +K  +  K GY ALKLDMSKAYDR
Subjt:  WNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDR

Query:  VEWIYLRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVDAR-NNSMAGVSIARQCPKISHLFFAD
        VEW++L +IM K+GF + W+ ++M+C+ T S+S+L+NGE  GF  PTRG+RQGDP+SPYLFLLC+EGL ALL  A  +  + G+SI+R  PK+SHLFFAD
Subjt:  VEWIYLRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVDAR-NNSMAGVSIARQCPKISHLFFAD

Query:  DSLVFLKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQF
        DS++F +A  +E    + IL  YE+AS Q +N +K+T+FFS +T  ET+  +   L + +     +YLGLPS   R K + F  + DRVW  +QGWK + 
Subjt:  DSLVFLKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQF

Query:  FSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSK
         SQ G+EVLIK+++QAIP Y M  F++PK +   +  +   FWWG   E R VHW  W  LCKPK+VG + FR+L  FN A+LAKQ WR++ N +  L +
Subjt:  FSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSK

Query:  VLCGKYFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSV-QWDEAKLQQFLIK
        VL  KYFPR S++ +   T +SF W+  +    ++++G R  +G G ++ +++D WLP P     LS  N       V+     S   W+   + Q  + 
Subjt:  VLCGKYFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSV-QWDEAKLQQFLIK

Query:  EDVEMIKGLPISL-TAPDRWIWHYNNTGEYSVKSGYKLSMLRPMGP--SMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHMT
         D E+IK +P+S+    D  +W     G YSV+S YK+       P  S S  G   ++WK VW +R+P++V+ F+W++  N++P+M NL R H+     
Subjt:  EDVEMIKGLPISL-TAPDRWIWHYNNTGEYSVKSGYKLSMLRPMGP--SMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHMT

Query:  CPVCHEELETTDHALFRCTRAQEIW-------RIFQPNIWMDQWDMLEIKDRWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDY
        C  C    E   HA++ C    E+W       +IF+ +       +L++      F       +A +    W +W  RN  +Y   +          Q  
Subjt:  CPVCHEELETTDHALFRCTRAQEIW-------RIFQPNIWMDQWDMLEIKDRWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDY

Query:  LTEYWKANPLGGSLVQSEDEVIQILSEGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLS
         ++Y  AN +          V    S   E  ++ DA          +G+V+R   G+         S  +    AEA+A    LQLA    +R + V  
Subjt:  LTEYWKANPLGGSLVQSEDEVIQILSEGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLS

Query:  DCLNLIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMASE--GYSAPPCLWLGAFPEWMERMTLME
        D L +I+++       +S    + DI+ L A+F+ VN+  V R  N  AH +A +     +   +WL   P++++ +   E
Subjt:  DCLNLIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMASE--GYSAPPCLWLGAFPEWMERMTLME

A0A7N2L6Z9 Reverse transcriptase domain-containing protein1.5e-23438.93Show/hide
Query:  DWVVRFEAIWDLQVVNKDWAKSDHKPIELSLLGSGIVANRRRKNWNFKFEEWWTGHVECRDIISRTGKWSEGPNDNLSLQRTLEQCAIVLKGWGFNQNLR
        DW+  F  +    +V+   + SDH  + +S   S I+  +R ++  F FE +WT   +CR II      +   N    +   L+ CA  L  W  +   +
Subjt:  DWVVRFEAIWDLQVVNKDWAKSDHKPIELSLLGSGIVANRRRKNWNFKFEEWWTGHVECRDIISRTGKWSEGPNDNLSLQRTLEQCAIVLKGWGFNQNLR

Query:  LRNEIKEIKSKI-KVSYERRGPIDFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDGTWQSDLSKIRKSFE
        +   I+E +  + K++ + R      EI+ + R+L+ LL +EE++W QRS+ +WLK GDRNTK+FH +A+ RR++N + GV D  G W  D   I  +  
Subjt:  LRNEIKEIKSKI-KVSYERRGPIDFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDGTWQSDLSKIRKSFE

Query:  NYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIEDWNHTN
         YF  ++S++ P+   +  V + IP  +T+EMN  L   +TREEI+TA++  HPTK+PGPDG  A+F+QKYWDIVG    +  L +LN  +S++  N TN
Subjt:  NYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIEDWNHTN

Query:  IVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIY
        IVLIPK   P+ ++D+RPISLCNV YK+I+K +ANRLK  L  II E QSAF   R I+DN+++++E +H+LK K+ GK  + A KLDMSKA+DRVEW +
Subjt:  IVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIY

Query:  LRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVD-ARNNSMAGVSIARQCPKISHLFFADDSLVF
        + ++M K+GF++ WI L+M+CI++ S+S+++NGE FG I PTRG+RQGDPLSPYLFLLC+EGL ALL D ARN  + G+S+ R CP+I+HLFFADDSL+F
Subjt:  LRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVD-ARNNSMAGVSIARQCPKISHLFFADDSLVF

Query:  LKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGG
         KA  EE    + IL  YE ASGQ VN +KS++FFS NT  E K+ + +IL          YLGLPS   R K   F  I +RV   L GWK +  S GG
Subjt:  LKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGG

Query:  KEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSKVLCGK
        KE+LIK++ QAIPTY M CF +PK +  ++  +   FWWG + ++  V W  W ++CKPK +GGL FR+L +FN A+LAKQAWR+LTNP    +++L  K
Subjt:  KEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSKVLCGK

Query:  YFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQWDEA-KLQQFLIKEDVEM
        YFP   VL+ S  ++ S+ W+     +++LK+G R  +GNG+ I ++ D WLP P T+KV++    +     V+   +P  +W +   ++   +  D E 
Subjt:  YFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQWDEA-KLQQFLIKEDVEM

Query:  IKGLPISLTAP-DRWIWHYNNTGEYSVKSGYKLSM-LRPMGPSMSARGLDS--RWWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHMTCPVC
        I  +P+S   P DR IW  N  GE+SVKS Y +++ L      +     D     WK +WKL +P +VKIF W++  N +P+M N+    +  ++ CPVC
Subjt:  IKGLPISLTAP-DRWIWHYNNTGEYSVKSGYKLSM-LRPMGPSMSARGLDS--RWWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHMTCPVC

Query:  HEELETTDHALFRCTRAQEIWRIFQPNIWMDQWDMLEIKDRWLSFAA-QPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKANPL
         EE+E  +H L  C  A  +W ++Q        +  +IK   L F A  P   L      +WAIW++RN  +++    +P+   E  Q  + +Y  A  L
Subjt:  HEELETTDHALFRCTRAQEIWRIFQPNIWMDQWDMLEIKDRWLSFAA-QPDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKANPL

Query:  GGSLVQSEDEVIQILSEGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSIN
            +QS          G   +    A  ID     G+G+V+R + G++ AA    +         E  AI  GL LA+ + + ++ + SD L+ I +IN
Subjt:  GGSLVQSEDEVIQILSEGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSIN

Query:  -GHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMASEGYS-APPCLWLGAFPEWMERMTLMESHL
          + +G++     +  I    A F   +F ++ R YN  AH +A    S     +W G  P ++  + + +  L
Subjt:  -GHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMASEGYS-APPCLWLGAFPEWMERMTLMESHL

A0A803PV25 Uncharacterized protein6.5e-23537.6Show/hide
Query:  RVRVRAKLGL-IGDWVVRFEAIWDLQVVNKDWAKSDHKP--IELSLLGSGIVANRRRKNWNFKFEEWWTGHVECRDIISRTGKWSEGPNDNL-----SLQ
        R+  R   GL   +W+ +FE   D+ ++  DW +SDH+   +++ +   G    + ++   F FEE W    EC +I+ R   WSE   +N+     S +
Subjt:  RVRVRAKLGL-IGDWVVRFEAIWDLQVVNKDWAKSDHKP--IELSLLGSGIVANRRRKNWNFKFEEWWTGHVECRDIISRTGKWSEGPNDNL-----SLQ

Query:  RTLEQCAIVLKGWGFNQNLRLRNEIKEIKSKIKVSYERRGPIDFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGV
          + +C   L+ W   +  +L +E++++K  +      + P  ++ I  +E +L+ LL ++E YW+QRSR  WL+WGDRNTK+FH KA+ RR+KN + G+
Subjt:  RTLEQCAIVLKGWGFNQNLRLRNEIKEIKSKIKVSYERRGPIDFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGV

Query:  EDIDGTWQSDLSKIRKSFENYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVS
        +D  G WQ D   + +  E+Y+  LF+S+  N++ +  VLS +  KV+  MN  LLA +  EE+I A++  +PTKAPG DG PALFYQK+W  +    V+
Subjt:  EDIDGTWQSDLSKIRKSFENYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVS

Query:  ECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVG
          L +LN+   ++  N T + LIPK  +P+ + ++RPISLCNV YKI++K +ANR++  L  ++ + QSAF+ GR I DN I+ +E+LH ++K R     
Subjt:  ECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVG

Query:  YAALKLDMSKAYDRVEWIYLRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVDA-RNNSMAGVSI
          ALKLDM+KAYDRVEW +L  +M KLG+   W+  +M C+T+  FS ++NGE  G + P RG+RQGDPLSP+LFLLC+E    L+  A +   + GV  
Subjt:  YAALKLDMSKAYDRVEWIYLRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVDA-RNNSMAGVSI

Query:  ARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFIL
         RQ   +SHLFFADDSLVFL A  +E   FR +L  Y  ASGQ VNF KS + F  +     + +L+  + + + ++ G YLGLPS   R K + F+FI 
Subjt:  ARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFIL

Query:  DRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQ
        ++VW  L+GWK  FFS  GKEVLIK+I+QAIPTY M CFR+PK  +  I S+ ARFWWGS  +   +HW +W  LCK KE GGL FRDL  FNQA+LAKQ
Subjt:  DRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQ

Query:  AWRVLTNPNLTLSKVLCGKYFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSV
         WR +  PN   SKVL   Y+P + VL       +SF W+  VWG K+++ G R  +GNG ++ +  DPWLPRP TFK+  +  P   N+ V +    + 
Subjt:  AWRVLTNPNLTLSKVLCGKYFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSV

Query:  QWDEAKLQQFLIKEDVEMIKGLPIS-LTAPDRWIWHYNNTGEYSVKSGYKLSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMFN
        +WDE  ++      D E+I  +  S     D+ +WHY+  GEYSV+SGY+++    +    S     +RWW+++WKL+IP +VK FVWK  H+ IP+   
Subjt:  QWDEAKLQQFLIKEDVEMIKGLPIS-LTAPDRWIWHYNNTGEYSVKSGYKLSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMFN

Query:  LKRHHVPVHMTCPVCHE-ELETTDHALFRCTRAQEIWRI--FQPNI-WMDQWDMLEIKDRWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMC
        L    V +   C  C     E   HAL+ C    ++W+I  F   I    + D+L    R  S  A+ D       V  W +W  RNS+ +    P    
Subjt:  LKRHHVPVHMTCPVCHE-ELETTDHALFRCTRAQEIWRI--FQPNI-WMDQWDMLEIKDRWLSFAAQPDSILASICVGAWAIWNDRNSILYNRPIPTPMC

Query:  RCEWIQDYLTEYWKANPLGGSLVQSEDEVIQILSEGEEVIMHTDATFIDATTKCGIGI-----VLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQL
          EW   +L E+  +N      V  +    +  +     +  +    +DA  K G G+     V+R   G +K A    +    SPL AE  AI  G++ 
Subjt:  RCEWIQDYLTEYWKANPLGGSLVQSEDEVIQILSEGEEVIMHTDATFIDATTKCGIGI-----VLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQL

Query:  AKGLKVRRLTVLSDCLNLIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMASEG-YSAPPCLWLGAFPEWMERMTLMES
            K+    V +DCL  +  +     G   +   +  I  L+    +    FV R  N+FAH +A+E   +    +W+G  P    +  L++S
Subjt:  AKGLKVRRLTVLSDCLNLIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMASEG-YSAPPCLWLGAFPEWMERMTLMES

A0A803Q8J4 Uncharacterized protein4.7e-22537.07Show/hide
Query:  DWVVRFEAIW-----DLQVVNKDWAKSDHKPIELSLLGSGIVANRRRKNWNFKFEEWWTGHVECRDIISRTGKWSEGPNDNLSLQRTLEQCAIVLKGWGF
        DWV     IW        V + D+  SDH+ + LS++      +  +    F+FE+ W    EC  IIS+   +S G N    L   L +C   L+ W  
Subjt:  DWVVRFEAIW-----DLQVVNKDWAKSDHKPIELSLLGSGIVANRRRKNWNFKFEEWWTGHVECRDIISRTGKWSEGPNDNLSLQRTLEQCAIVLKGWGF

Query:  NQNLRLRNEIKEIKSKIKVSYERRGPI--DFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDGTWQSDLSK
        ++   L+++I   ++ +        P    + ++   E  LD+LL  EE YW+QR+R +WL+ GDRNTK+FH KA+ R   N +  + D +G   +   +
Subjt:  NQNLRLRNEIKEIKSKIKVSYERRGPI--DFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDGTWQSDLSK

Query:  IRKSFENYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIE
        I      YF  LF +   +   +Q VL+ IP  VT E N  L   +T  E+  A++  +   +PG DG  ALFYQ  WDIVGD      L +LN   S E
Subjt:  IRKSFENYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIE

Query:  DWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYD
          N T I LIPK ++P+ + D+RPISLCNV YK+I+K +  R K +L  +I E QSAF+P R I+DN++++ E +H LK K +G+ G++ALKLDMSKA+D
Subjt:  DWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYD

Query:  RVEWIYLRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALL-VDARNNSMAGVSIARQCPKISHLFFA
        RVEW ++  +M K+GF   W+ L+M C+ T   S ++NG   G ++P RG+RQGDPLSPYLFL+CSEGL  LL  +     + G++I+R  P ISHL FA
Subjt:  RVEWIYLRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALL-VDARNNSMAGVSIARQCPKISHLFFA

Query:  DDSLVFLKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQ
        DDSL+F +A     G  + +L  Y KASGQ +N +KS + FS NT    K    +IL M + +   SYLGLP+   R K   F  I +R+W +L  W  +
Subjt:  DDSLVFLKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQ

Query:  FFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLS
         FS GGKEVL+K++IQ+IPTYAM CF++P     +I S+ + +WWG+  +K+ +HWK+W+ LC  K  GGL FR+ + FNQA+LAKQAWR+  N N  L 
Subjt:  FFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLS

Query:  KVLCGKYFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWLPRPFTF-KVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQFLI
        +VL G+YFPR   L  +   +SS  W+G  WG +LLK+GIRK +GNG +I    DPW+P    F  +    NP   N  VA++  P  +W+ +KL     
Subjt:  KVLCGKYFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWLPRPFTF-KVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQFLI

Query:  KEDVEMIKGLPISLTA-PDRWIWHYNNTGEYSVKSGYKLSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHMTC
          DV  I  LP+S  A PD WIWH    GEY VKSGY  +       + S    ++ WWK  W+L++P +VKIF WK+ HN++P    L +       +C
Subjt:  KEDVEMIKGLPISLTA-PDRWIWHYNNTGEYSVKSGYKLSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHMTC

Query:  PVCHEELETTDHALFRCTRAQEIWRIFQPNIWMDQWDMLEIKDRWLSFAAQ-PDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKA
         +C    E+  HALF C  A+ +W++            + I+D     +     S L +I    W+IW+DRN++++ +    P       Q++L  Y   
Subjt:  PVCHEELETTDHALFRCTRAQEIWRIFQPNIWMDQWDMLEIKDRWLSFAAQ-PDSILASICVGAWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKA

Query:  NPL---GGSLVQSEDEVIQILSEGEEVI--MHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDC
          L    G    +   + +  S        ++ DA F +A  K G G ++R   G +KAA  H I+GC  P   EA  +   L+ A+ L  +   V +D 
Subjt:  NPL---GGSLVQSEDEVIQILSEGEEVI--MHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDC

Query:  LNLIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMASEGYSAP-PCLWLGAFPEWMERMTLMES
        L L  ++      +SS    ++D++  ++    V  + V R  N+ AH +A +       C WL  FP  +  + + +S
Subjt:  LNLIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMASEGYSAP-PCLWLGAFPEWMERMTLMES

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.5e-4223.49Show/hide
Query:  SDHKPIELSLLGSGIVANRRR--KNWNFKFEEWWTGHVECRDIISRTGKWSEGPNDNLSLQRTLEQCAIVLKGWGFNQNLRLRNEIKEIKSKIKVSYERR
        SDH  I+L L    +  +R    K  N    ++W  H E +  I    + +E  N + + Q   +    V +G     N   R   K+ +SKI     + 
Subjt:  SDHKPIELSLLGSGIVANRRR--KNWNFKFEEWWTGHVECRDIISRTGKWSEGPNDNLSLQRTLEQCAIVLKGWGFNQNLRLRNEIKEIKSKIKVSYERR

Query:  GPIDFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKAT-----------MRRRKNWVHGVEDIDGTWQSDLSKIRKSFENYFSALFSS
          ++  E    +      + +     K+   +  L+  + +  WF ++              +R KN +  +++  G   +D ++I+ +   Y+  L+++
Subjt:  GPIDFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKAT-----------MRRRKNWVHGVEDIDGTWQSDLSKIRKSFENYFSALFSS

Query:  AQPNQNDIQRVLS--HIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIEDWNHTNIVLIPK-
           N  ++   L    +PR   +E+ + L  P T  EI+  + +    K+PGPDGF A FYQ+Y + +    +    +I    +    +   +I+LIPK 
Subjt:  AQPNQNDIQRVLS--HIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIEDWNHTNIVLIPK-

Query:  GRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETL-HFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIM
        GR      ++RPISL N+  KI+ K++ANR++  + ++I   Q  FIPG     N+  S   + H  + K K  V    + +D  KA+D+++  ++ + +
Subjt:  GRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETL-HFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIM

Query:  EKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAE
         KLG    ++ ++       + +I+LNG+         G RQG PLSP LF +  E L   +   +   + G+ + ++  K+S   FADD +V+L+    
Subjt:  EKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAE

Query:  EFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGK--TRDFQFILDRVWTVLQGWKSQFFSQGGKEVL
               ++ ++ K SG  +N +KS  F   N   +T+  +   L   ++     YLG+  T         +++ +L  +      WK+   S  G+  +
Subjt:  EFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGK--TRDFQFILDRVWTVLQGWKSQFFSQGGKEVL

Query:  IKSIIQAIPTYAMGC--FRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAW
        +K  I     Y       ++P    T++     +F W    +KR+   K    L +  + GG+   D   + +A + K AW
Subjt:  IKSIIQAIPTYAMGC--FRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAW

P08548 LINE-1 reverse transcriptase homolog2.4e-4025.23Show/hide
Query:  DRNTKWFHQK-----------ATMRRRKNWVHGVEDIDGTWQSDLSKIRKSFENYFSALFSSAQPNQNDIQRVLS--HIPRKVTDEMNQMLLAPYTREEI
        +++  WF +K              +R K+ +  + + +    +D S+I+K    Y+  L+S    N  +I + L   H+PR    E+ +ML  P +  EI
Subjt:  DRNTKWFHQK-----------ATMRRRKNWVHGVEDIDGTWQSDLSKIRKSFENYFSALFSSAQPNQNDIQRVLS--HIPRKVTDEMNQMLLAPYTREEI

Query:  ITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIEDWNHTNIVLIPK-GRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEI
         + ++N    K+PGPDGF + FYQ + + +    ++    I    +    +   NI LIPK G+ P    +YRPISL N+  KI+ K++ NR++  + +I
Subjt:  ITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIEDWNHTNIVLIPK-GRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEI

Query:  IDECQSAFIPGRSISDNMILSHETL-HFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTR
        I   Q  FIPG     N+  S   + H  K K K    +  L +D  KA+D ++  ++ + ++K+G    ++ L+    +  + +I+LNG          
Subjt:  IDECQSAFIPGRSISDNMILSHETL-HFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTR

Query:  GIRQGDPLSPYLFLLCSEGLLALLVDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETK
        G RQG PLSP LF +  E +LA+ +     ++ G+ I  +  K+S   FADD +V+L+   +       ++ +Y   SG  +N  KS  F   N     K
Subjt:  GIRQGDPLSPYLFLLCSEGLLALLVDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETK

Query:  QYLSHILSMNMSESLGSYLG--LPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSII--QAIPTYAMGCFRIPKGILTKIMSLCARFWWG
             I    + + +  YLG  L          +++ +   +   +  WK+   S  G+  ++K  I  +AI  +     + P      +  +   F W 
Subjt:  QYLSHILSMNMSESLGSYLG--LPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSII--QAIPTYAMGCFRIPKGILTKIMSLCARFWWG

Query:  SQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAW
         +  + +        L    + GG+   DL  + ++++ K AW
Subjt:  SQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAW

P0C2F6 Putative ribonuclease H protein At1g657504.8e-4124.83Show/hide
Query:  FQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQA
        F  IL+RV + + GW+ +  S  G+  L K+++ ++P ++M    +P+ IL ++  L   F WGS AEK+  H  +W ++C PK+ GGL  R   S N+A
Subjt:  FQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQA

Query:  MLAKQAWRVLTNPNLTLSKVLCGKYFPRISVLHVSYLTSS---------SFFWKGFVWGMK-LLKQGIRKNLGNGQTILMFKDPWLPRPFTFKVLSQVNP
        +++K  WR+L   N   + VL  KY       HV  +  S         S  W+    G++ ++  G+    G+GQ I  + D W+      ++ +   P
Subjt:  MLAKQAWRVLTNPNLTLSKVLCGKYFPRISVLHVSYLTSS---------SFFWKGFVWGMK-LLKQGIRKNLGNGQTILMFKDPWLPRPFTFKVLSQVNP

Query:  SLTNIKVA-EFFNPSVQWDEAKLQQFLIKEDVEMIKGLPISLT--APDRWIWHYNNTGEYSVKSGYK-LSMLRPMGPSMSARGLDSRWWKKVWKLRIPNR
        +  +  VA + + P   WD AK+  +        ++ + + L   A DR  W ++  G++SV+S Y+ L++     P+M++      ++  +WK+R+P R
Subjt:  SLTNIKVA-EFFNPSVQWDEAKLQQFLIKEDVEMIKGLPISLT--APDRWIWHYNNTGEYSVKSGYK-LSMLRPMGPSMSARGLDSRWWKKVWKLRIPNR

Query:  VKIFVWKSFHNSIPSMFNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQP--------NIWMDQWDMLEIKDRWLSFAAQPDSILASICVG
        VK F+W   + ++ +     R H+     C VC   +E+  H L  C     IW    P        +  + +W    + DR         +I A I   
Subjt:  VKIFVWKSFHNSIPSMFNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQP--------NIWMDQWDMLEIKDRWLSFAAQPDSILASICVG

Query:  AWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKA---NPLGGSLVQSEDEVIQILSEGEE-VIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHS
         W  W   N    N        R ++++++  E ++A   N L G      + +I  +S     V ++TD            G VLR   G        +
Subjt:  AWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKA---NPLGGSLVQSEDEVIQILSEGEE-VIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHS

Query:  ISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMASEGYS
        I  C +P  AE   +  GL  A   KV R+ +  D   ++  +   +     +S  +      +    +V    V R  NR A  +A+  +S
Subjt:  ISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLSDCLNLIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMASEGYS

P11369 LINE-1 retrotransposable element ORF2 protein2.2e-3824.67Show/hide
Query:  RRKNWVHGVEDIDGTWQSDLSKIRKSFENYFSALFSSAQPNQNDIQRVLS--HIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQK
        R K  ++ + +  G   +D  +I+ +  +++  L+S+   N +++ + L    +P+   D+++  L +P + +EI   + +    K+PGPDGF A FYQ 
Subjt:  RRKNWVHGVEDIDGTWQSDLSKIRKSFENYFSALFSSAQPNQNDIQRVLS--HIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQK

Query:  YWDIVGDKTVSECLAILNSEVSIE-----DWNHTNIVLIPK-GRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMIL
        +      + +   L  L  ++ +E      +    I LIPK  + P  + ++RPISL N+  KI+ K++ANR++  +  II   Q  FIPG     N+  
Subjt:  YWDIVGDKTVSECLAILNSEVSIE-----DWNHTNIVLIPK-GRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMIL

Query:  SHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLL
        S   +H++ K +     +  + LD  KA+D+++  ++ +++E+ G    ++ ++    +    +I +NGE    I    G RQG PLSPYLF +  E L 
Subjt:  SHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLL

Query:  ALLVDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGL
          +   +   + G+ I ++  KIS L  ADD +V++           +++  + +  G  +N  KS  F         K+ +      ++  +   YLG+
Subjt:  ALLVDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGL

Query:  PSTFHRGKTRD--FQFILDRVWTVLQGWKSQFFSQGGKEVLIKSII--QAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKE
          T       D  F+ +   +   L+ WK    S  G+  ++K  I  +AI  +     +IP     ++     +F W ++  + +      + L K K 
Subjt:  PSTFHRGKTRD--FQFILDRVWTVLQGWKSQFFSQGGKEVLIKSII--QAIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKE

Query:  V-GGLNFRDLVSFNQAMLAKQAW
          GG+   DL  + +A++ K AW
Subjt:  V-GGLNFRDLVSFNQAMLAKQAW

P14381 Transposon TX1 uncharacterized 149 kDa protein6.0e-3626.36Show/hide
Query:  RSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDGTWQSDLSKIRKSFENYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITA
        RSR   L   DR +++F+     +  +  +  +   DGT   D   IR    +++  LFS    + +  + +   +P  V++   + L  P T +E+  A
Subjt:  RSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDGTWQSDLSKIRKSFENYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITA

Query:  MRNFHPTKAPGPDGFPALFYQKYWDIVG---DKTVSECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEII
        +R     K+PG DG    F+Q +WD +G    + ++E  A    E+ +       + L+PK    RL+ ++RP+SL +  YKI+ K I+ RLK VL E+I
Subjt:  MRNFHPTKAPGPDGFPALFYQKYWDIVG---DKTVSECLAILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEII

Query:  DECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGI
           QS  +PGR+I DN+ L  + LHF    R+  +  A L LD  KA+DRV+  YL   ++   F   ++  +     +A   + +N      +   RG+
Subjt:  DECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQIMEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGI

Query:  RQGDPLSPYLFLLCSEGLLALLVDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQY
        RQG PLS  L+ L  E  L LL       + G+ +     ++    +ADD ++ +     +    +     Y  AS   +N+ KS+    G+   +    
Subjt:  RQGDPLSPYLFLLCSEGLLALLVDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSILFDYEKASGQCVNFEKSTVFFSGNTPYETKQY

Query:  LSHILS--MNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWK--SQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWW
            +S    + + LG YL   S      +++F  + + V T L  WK  ++  S  G+ ++I  ++ +   Y + C    +  + KI      F W
Subjt:  LSHILS--MNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWK--SQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKGILTKIMSLCARFWW

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.1e-2126.59Show/hide
Query:  GDWVVRFE---AIWDLQVVNKDWAKSDHKPIELSLLGSGIVAN-RRRKNWNFKFEEWWTGHVECRDIISRTGKWSEG---PNDNLSLQRTL---EQCAIV
        GDW   F    A+++L  V      SDH P  +      I+ N  +R    F++  + + H     ++S T  W E     +   SL   L   ++C  +
Subjt:  GDWVVRFE---AIWDLQVVNKDWAKSDHKPIELSLLGSGIVAN-RRRKNWNFKFEEWWTGHVECRDIISRTGKWSEG---PNDNLSLQRTL---EQCAIV

Query:  LKGWGFNQNLRLRNEIKEIKSKIK--VSYERRGPID--FDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDG
        L   GF     ++++ KE    ++   S     P D  F   H   ++ +      E +++Q+SR  WL+ GD NT++FH+     + KN +  +   D 
Subjt:  LKGWGFNQNLRLRNEIKEIKSKIK--VSYERRGPID--FDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDIDG

Query:  TWQSDLSKIRKSFENYFSALFSSAQP--NQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECL
            +++++++    Y++ L  S       + +QR+    P +  D +   L A  + +EI  A+      KAPGPD F A F+ + W +V D T++   
Subjt:  TWQSDLSKIRKSFENYFSALFSSAQP--NQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECL

Query:  AILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIIT
            +   ++ +N T I LIPK      +S +RP+S C V YKIIT
Subjt:  AILNSEVSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIIT

AT3G09510.1 Ribonuclease H-like superfamily protein1.0e-3827.15Show/hide
Query:  KYFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILM----FKDPWLPRPF----TFKVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQF
        +YF  +S+L        S+ W   + G+ LLK+G R  +G+GQ I +      D   PRP     T+K ++  N  L   K + +F     WD++K+ QF
Subjt:  KYFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILM----FKDPWLPRPF----TFKVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQF

Query:  LIKEDVEMIKGLPISLT-APDRWIWHYNNTGEYSVKSGYKLSMLRP------MGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRH
        + + D   I  + ++ +  PD+ IW+YN TGEY+V+SGY L    P      + P   +  L +R    +W L I  ++K F+W++   ++ +   L   
Subjt:  LIKEDVEMIKGLPISLT-APDRWIWHYNNTGEYSVKSGYKLSMLRP------MGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRH

Query:  HVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNIWMDQW---DMLEIKDRWLSFAAQPDSILASI-----CVGAWAIWNDRNSILYNR----PIP
         + +  +CP CH E E+ +HALF C  A   WR+   ++  +Q    D  E     L+F    D+ ++           W IW  RN++++N+    P  
Subjt:  HVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNIWMDQW---DMLEIKDRWLSFAAQPDSILASI-----CVGAWAIWNDRNSILYNR----PIP

Query:  TPMCRCEWIQDYL--TEYWKANPLGGSLVQSEDEVIQILSEGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQ
        T +       D+L  T+  K  P     + +E+++         V  + DA F     +   G ++R   G   +     ++   +PL AE  A+L  LQ
Subjt:  TPMCRCEWIQDYL--TEYWKANPLGGSLVQSEDEVIQILSEGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQ

Query:  LAKGLKVRRLTVLSDCLNLIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMASEG------YSAPPCLWLGAFPEWMERMTLMES
                ++ +  DC  LI  ING +   SS++  L DI      F  + F F+ R+ N+ AH +A  G      YS       G+ P W++R    +S
Subjt:  LAKGLKVRRLTVLSDCLNLIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMASEG------YSAPPCLWLGAFPEWMERMTLMES

Query:  H
        +
Subjt:  H

AT4G29090.1 Ribonuclease H-like superfamily protein1.7e-5728.67Show/hide
Query:  AIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSKVLCGKYFPRISVLHV
        A+PTY M CF +PK +  +I+S+ A FWW ++ E + +HWK W+ L   K  GG+ F+D+ +FN A+L KQ WR+L+ P   ++KV   +YF +   L+ 
Subjt:  AIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSKVLCGKYFPRISVLHV

Query:  SYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWL-PRPFTFKVLSQVNP-----SLTNI-KVAEFFNPS-VQWDEAKLQQFLIKEDVEMIKG
           +  SF WK      ++L+QG R  +GNG+ I++++  WL  +P +  +  Q  P     S+++I KV++  + S  +W +  ++    + + ++I  
Subjt:  SYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWL-PRPFTFKVLSQVNP-----SLTNI-KVAEFFNPS-VQWDEAKLQQFLIKEDVEMIKG

Query:  L-PISLTAPDRWIWHYNNTGEYSVKSGY----KLSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHMTCPVCHE
        L P      D + W Y ++G+Y+VKSGY    ++   R     +S   L+   ++K+WK +   +++ F+WK   NS+P    L   H+     C  C  
Subjt:  L-PISLTAPDRWIWHYNNTGEYSVKSGY----KLSMLRPMGPSMSARGLDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHMTCPVCHE

Query:  ELETTDHALFRCTRAQEIWRIFQPNIWM-DQW-DMLEIKDRWLSFA--AQPDSILASICVG--AWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKA
          ET +H LF+CT A+  W I    I +  +W D + +   W+       P    AS  V    W +W +RN +++              +D L E W+ 
Subjt:  ELETTDHALFRCTRAQEIWRIFQPNIWM-DQW-DMLEIKDRWLSFA--AQPDSILASICVG--AWAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKA

Query:  NPLGGSLVQSEDEVIQILSEG-------EEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLS
             S        +   S G       + V  +TDAT+     +CGIG VLR + G +K     ++    S L AE  A+   +      +   +   S
Subjt:  NPLGGSLVQSEDEVIQILSEG-------EEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAILTGLQLAKGLKVRRLTVLS

Query:  DCLNLIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMASEGYS
        D   LI+ +N   +   S+  T+ D++ L++ F  V F F+ R  N  A  +A E  S
Subjt:  DCLNLIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMASEGYS

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.0e-3041.96Show/hide
Query:  AIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKE-VGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSKVLCGKYFPRISVLH
        A+P YAM CFR+ K +  K+ S    FWW S   KR + W  W++LCK KE  GGL FRDL  FNQA+LAKQ++R++  P+  LS++L  +YFP  S++ 
Subjt:  AIPTYAMGCFRIPKGILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKE-VGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSKVLCGKYFPRISVLH

Query:  VSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWL
         S  T  S+ W+  + G +LL +G+ + +G+G    ++ D W+
Subjt:  VSYLTSSSFFWKGFVWGMKLLKQGIRKNLGNGQTILMFKDPWL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)8.7e-1438.26Show/hide
Query:  LLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVDARNNS-MAGVSIARQCPKISHLFFADDS----LVFLKAKAEEFGFFRSILFDYEKASGQC
        ++NG   G + P+RG+RQGDPLSPYLF+LC+E L  L   A+    + G+ ++   P+I+HL FADD+     + L A+     F    LF      G  
Subjt:  LLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVDARNNS-MAGVSIARQCPKISHLFFADDS----LVFLKAKAEEFGFFRSILFDYEKASGQC

Query:  VNFEKSTVFFSGNTP
        VN   S ++F G+ P
Subjt:  VNFEKSTVFFSGNTP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTTGGGAGAAATCAGATCAAAACAGCGACTAGAAACCGCAAAACAGAGAGCAACCCAAGGAAGTTTGGTCCGGGTTCGGGTTCGTGCTAAATTGGGTCTGATTGG
TGACTGGGTAGTCCGGTTTGAGGCGATTTGGGATCTCCAAGTTGTTAATAAGGACTGGGCGAAATCAGATCACAAACCTATAGAGCTCAGTTTGCTGGGGAGTGGTATAG
TTGCAAATAGGAGAAGGAAAAATTGGAATTTTAAATTTGAGGAGTGGTGGACTGGCCACGTGGAATGTCGTGATATTATCAGCCGGACTGGGAAATGGAGCGAGGGACCT
AATGACAATCTGTCGCTGCAGCGGACGTTGGAACAATGTGCCATTGTCCTTAAAGGATGGGGGTTTAATCAGAATTTGAGGTTGAGAAATGAGATTAAGGAGATCAAAAG
CAAAATTAAGGTCTCTTATGAGCGTCGTGGACCTATTGATTTTGATGAGATTCATGGTTTGGAGCGGCAACTTGACAATTTGTTGTTGGAGGAAGAGATGTACTGGAAAC
AGCGTTCAAGGGAGAATTGGCTTAAATGGGGAGACCGCAATACGAAGTGGTTCCATCAAAAGGCGACCATGAGACGACGTAAAAATTGGGTTCATGGAGTGGAGGATATC
GATGGTACCTGGCAATCAGATCTATCTAAAATTCGGAAGTCTTTCGAGAATTATTTTTCAGCGTTATTCTCCTCAGCTCAGCCGAACCAAAATGATATACAGAGAGTACT
CAGTCATATACCCCGAAAGGTAACTGATGAAATGAATCAGATGTTGTTGGCTCCTTATACCAGAGAGGAAATTATCACTGCAATGCGAAATTTTCATCCAACTAAGGCCC
CGGGACCGGATGGGTTTCCAGCATTATTCTACCAGAAATATTGGGACATTGTAGGAGACAAAACGGTATCTGAGTGCTTGGCTATTCTCAATTCGGAGGTTTCGATAGAG
GACTGGAATCATACCAATATAGTGCTTATCCCCAAAGGACGGCAGCCCAGGTTAGTATCAGATTATCGCCCAATTAGTCTATGCAATGTCTCTTATAAAATAATCACTAA
GGTCATCGCTAATAGACTCAAGGGTGTGTTAAATGAGATAATCGATGAATGTCAATCCGCGTTCATTCCTGGTAGATCAATATCTGATAACATGATTTTAAGTCATGAGA
CGCTTCATTTTCTAAAAAAAAAAAGGAAAGGAAAAGTTGGTTATGCTGCACTAAAACTAGATATGAGCAAAGCCTATGATAGGGTGGAGTGGATATATTTGAGACAAATC
ATGGAGAAGTTGGGGTTTCATGATTGCTGGATCTGGTTAGTTATGAAATGTATTACAACCGCCTCTTTTTCTATCTTGTTAAATGGGGAAGCTTTTGGTTTCATTAGACC
AACTCGTGGAATTCGTCAAGGCGATCCTTTATCACCTTATCTGTTTTTACTATGTTCCGAAGGTCTGTTGGCTTTGTTGGTGGATGCAAGGAATAATTCAATGGCGGGGG
TGTCCATAGCGCGCCAGTGTCCAAAAATTTCGCATTTATTCTTTGCGGATGATAGTTTGGTTTTTTTGAAAGCAAAGGCGGAGGAATTTGGTTTTTTCAGATCTATTCTG
TTCGATTATGAAAAGGCTTCAGGACAGTGTGTTAATTTTGAAAAATCAACGGTGTTCTTCTCGGGGAATACTCCATATGAGACCAAACAGTATCTTAGTCATATTTTGTC
TATGAATATGTCTGAATCTTTGGGTTCCTACCTTGGATTGCCATCAACTTTCCATAGAGGGAAAACTCGTGATTTCCAGTTTATTTTGGATAGGGTCTGGACTGTGCTTC
AAGGATGGAAAAGCCAATTTTTTTCACAAGGAGGGAAAGAGGTGCTGATAAAATCTATTATTCAAGCGATTCCAACCTATGCAATGGGGTGCTTTCGGATCCCAAAAGGT
ATTCTGACAAAGATTATGTCTTTATGCGCTAGATTCTGGTGGGGTTCTCAGGCAGAAAAGCGTAGTGTGCATTGGAAGAGATGGGAAGAGCTTTGCAAACCAAAAGAGGT
GGGGGGGTTGAATTTCCGTGATTTGGTTAGTTTCAACCAGGCAATGCTTGCTAAACAAGCCTGGAGGGTGCTTACTAATCCTAATCTTACACTATCCAAAGTGTTATGCG
GAAAGTACTTTCCTCGCATTTCAGTGTTACATGTCTCATATTTAACCTCGTCTTCCTTTTTTTGGAAAGGTTTTGTGTGGGGTATGAAACTATTAAAACAAGGAATTCGA
AAAAATCTAGGAAACGGTCAAACTATTCTTATGTTCAAAGACCCTTGGCTTCCTCGGCCTTTTACTTTTAAGGTGTTGTCACAGGTTAATCCAAGTTTGACGAATATCAA
GGTAGCGGAGTTTTTTAATCCAAGTGTGCAATGGGATGAGGCGAAGCTCCAACAATTTCTAATAAAGGAGGATGTGGAGATGATTAAAGGCTTACCTATTAGTCTAACTG
CTCCAGACAGGTGGATTTGGCACTACAATAATACAGGAGAATATTCCGTTAAGAGTGGCTATAAATTAAGCATGTTGAGACCGATGGGACCATCAATGTCGGCTCGTGGA
TTGGATAGTAGGTGGTGGAAAAAGGTTTGGAAGTTGCGAATTCCCAATAGAGTTAAAATTTTTGTGTGGAAATCGTTCCATAACTCTATCCCGTCTATGTTTAATTTAAA
AAGACATCATGTTCCAGTACATATGACTTGCCCTGTATGCCATGAGGAGTTAGAAACTACGGATCATGCTCTTTTCCGATGTACTAGAGCTCAAGAGATTTGGAGGATTT
TTCAACCGAATATTTGGATGGATCAATGGGACATGCTAGAGATCAAGGATCGGTGGTTGAGTTTTGCTGCCCAACCGGATTCTATTTTAGCAAGTATTTGTGTTGGTGCT
TGGGCGATTTGGAATGATAGAAATAGTATACTCTATAACCGTCCAATTCCAACGCCTATGTGTCGTTGTGAATGGATTCAAGACTATCTGACGGAATACTGGAAGGCAAA
CCCTTTGGGTGGATCTTTGGTTCAGTCAGAGGATGAGGTGATACAGATTCTTTCTGAAGGGGAGGAAGTCATCATGCATACTGATGCTACCTTTATTGATGCTACGACTA
AATGTGGAATTGGCATTGTATTACGTACAAAGGGAGGCATTTTGAAGGCAGCGCAACATCACTCTATCTCTGGGTGTCATTCCCCATTGGGGGCCGAAGCAGTTGCCATT
CTGACGGGACTTCAATTAGCAAAGGGCTTGAAGGTGAGACGTCTAACAGTTTTGTCAGATTGTTTGAATCTCATAAAGTCTATCAACGGTCATCTTCAAGGACAGTCAAG
CATTTCTACGACCCTATGGGATATTGAAGCACTTATGGCTGCTTTTGAGATTGTTAATTTCCATTTTGTTAATCGTCGTTATAATAGGTTTGCTCATAATATGGCCAGTG
AAGGCTACTCGGCACCTCCATGCTTATGGTTAGGTGCTTTTCCTGAGTGGATGGAACGGATGACACTCATGGAAAGCCATTTGTATGTTCCTTTTATCTCTTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATTTGGGAGAAATCAGATCAAAACAGCGACTAGAAACCGCAAAACAGAGAGCAACCCAAGGAAGTTTGGTCCGGGTTCGGGTTCGTGCTAAATTGGGTCTGATTGG
TGACTGGGTAGTCCGGTTTGAGGCGATTTGGGATCTCCAAGTTGTTAATAAGGACTGGGCGAAATCAGATCACAAACCTATAGAGCTCAGTTTGCTGGGGAGTGGTATAG
TTGCAAATAGGAGAAGGAAAAATTGGAATTTTAAATTTGAGGAGTGGTGGACTGGCCACGTGGAATGTCGTGATATTATCAGCCGGACTGGGAAATGGAGCGAGGGACCT
AATGACAATCTGTCGCTGCAGCGGACGTTGGAACAATGTGCCATTGTCCTTAAAGGATGGGGGTTTAATCAGAATTTGAGGTTGAGAAATGAGATTAAGGAGATCAAAAG
CAAAATTAAGGTCTCTTATGAGCGTCGTGGACCTATTGATTTTGATGAGATTCATGGTTTGGAGCGGCAACTTGACAATTTGTTGTTGGAGGAAGAGATGTACTGGAAAC
AGCGTTCAAGGGAGAATTGGCTTAAATGGGGAGACCGCAATACGAAGTGGTTCCATCAAAAGGCGACCATGAGACGACGTAAAAATTGGGTTCATGGAGTGGAGGATATC
GATGGTACCTGGCAATCAGATCTATCTAAAATTCGGAAGTCTTTCGAGAATTATTTTTCAGCGTTATTCTCCTCAGCTCAGCCGAACCAAAATGATATACAGAGAGTACT
CAGTCATATACCCCGAAAGGTAACTGATGAAATGAATCAGATGTTGTTGGCTCCTTATACCAGAGAGGAAATTATCACTGCAATGCGAAATTTTCATCCAACTAAGGCCC
CGGGACCGGATGGGTTTCCAGCATTATTCTACCAGAAATATTGGGACATTGTAGGAGACAAAACGGTATCTGAGTGCTTGGCTATTCTCAATTCGGAGGTTTCGATAGAG
GACTGGAATCATACCAATATAGTGCTTATCCCCAAAGGACGGCAGCCCAGGTTAGTATCAGATTATCGCCCAATTAGTCTATGCAATGTCTCTTATAAAATAATCACTAA
GGTCATCGCTAATAGACTCAAGGGTGTGTTAAATGAGATAATCGATGAATGTCAATCCGCGTTCATTCCTGGTAGATCAATATCTGATAACATGATTTTAAGTCATGAGA
CGCTTCATTTTCTAAAAAAAAAAAGGAAAGGAAAAGTTGGTTATGCTGCACTAAAACTAGATATGAGCAAAGCCTATGATAGGGTGGAGTGGATATATTTGAGACAAATC
ATGGAGAAGTTGGGGTTTCATGATTGCTGGATCTGGTTAGTTATGAAATGTATTACAACCGCCTCTTTTTCTATCTTGTTAAATGGGGAAGCTTTTGGTTTCATTAGACC
AACTCGTGGAATTCGTCAAGGCGATCCTTTATCACCTTATCTGTTTTTACTATGTTCCGAAGGTCTGTTGGCTTTGTTGGTGGATGCAAGGAATAATTCAATGGCGGGGG
TGTCCATAGCGCGCCAGTGTCCAAAAATTTCGCATTTATTCTTTGCGGATGATAGTTTGGTTTTTTTGAAAGCAAAGGCGGAGGAATTTGGTTTTTTCAGATCTATTCTG
TTCGATTATGAAAAGGCTTCAGGACAGTGTGTTAATTTTGAAAAATCAACGGTGTTCTTCTCGGGGAATACTCCATATGAGACCAAACAGTATCTTAGTCATATTTTGTC
TATGAATATGTCTGAATCTTTGGGTTCCTACCTTGGATTGCCATCAACTTTCCATAGAGGGAAAACTCGTGATTTCCAGTTTATTTTGGATAGGGTCTGGACTGTGCTTC
AAGGATGGAAAAGCCAATTTTTTTCACAAGGAGGGAAAGAGGTGCTGATAAAATCTATTATTCAAGCGATTCCAACCTATGCAATGGGGTGCTTTCGGATCCCAAAAGGT
ATTCTGACAAAGATTATGTCTTTATGCGCTAGATTCTGGTGGGGTTCTCAGGCAGAAAAGCGTAGTGTGCATTGGAAGAGATGGGAAGAGCTTTGCAAACCAAAAGAGGT
GGGGGGGTTGAATTTCCGTGATTTGGTTAGTTTCAACCAGGCAATGCTTGCTAAACAAGCCTGGAGGGTGCTTACTAATCCTAATCTTACACTATCCAAAGTGTTATGCG
GAAAGTACTTTCCTCGCATTTCAGTGTTACATGTCTCATATTTAACCTCGTCTTCCTTTTTTTGGAAAGGTTTTGTGTGGGGTATGAAACTATTAAAACAAGGAATTCGA
AAAAATCTAGGAAACGGTCAAACTATTCTTATGTTCAAAGACCCTTGGCTTCCTCGGCCTTTTACTTTTAAGGTGTTGTCACAGGTTAATCCAAGTTTGACGAATATCAA
GGTAGCGGAGTTTTTTAATCCAAGTGTGCAATGGGATGAGGCGAAGCTCCAACAATTTCTAATAAAGGAGGATGTGGAGATGATTAAAGGCTTACCTATTAGTCTAACTG
CTCCAGACAGGTGGATTTGGCACTACAATAATACAGGAGAATATTCCGTTAAGAGTGGCTATAAATTAAGCATGTTGAGACCGATGGGACCATCAATGTCGGCTCGTGGA
TTGGATAGTAGGTGGTGGAAAAAGGTTTGGAAGTTGCGAATTCCCAATAGAGTTAAAATTTTTGTGTGGAAATCGTTCCATAACTCTATCCCGTCTATGTTTAATTTAAA
AAGACATCATGTTCCAGTACATATGACTTGCCCTGTATGCCATGAGGAGTTAGAAACTACGGATCATGCTCTTTTCCGATGTACTAGAGCTCAAGAGATTTGGAGGATTT
TTCAACCGAATATTTGGATGGATCAATGGGACATGCTAGAGATCAAGGATCGGTGGTTGAGTTTTGCTGCCCAACCGGATTCTATTTTAGCAAGTATTTGTGTTGGTGCT
TGGGCGATTTGGAATGATAGAAATAGTATACTCTATAACCGTCCAATTCCAACGCCTATGTGTCGTTGTGAATGGATTCAAGACTATCTGACGGAATACTGGAAGGCAAA
CCCTTTGGGTGGATCTTTGGTTCAGTCAGAGGATGAGGTGATACAGATTCTTTCTGAAGGGGAGGAAGTCATCATGCATACTGATGCTACCTTTATTGATGCTACGACTA
AATGTGGAATTGGCATTGTATTACGTACAAAGGGAGGCATTTTGAAGGCAGCGCAACATCACTCTATCTCTGGGTGTCATTCCCCATTGGGGGCCGAAGCAGTTGCCATT
CTGACGGGACTTCAATTAGCAAAGGGCTTGAAGGTGAGACGTCTAACAGTTTTGTCAGATTGTTTGAATCTCATAAAGTCTATCAACGGTCATCTTCAAGGACAGTCAAG
CATTTCTACGACCCTATGGGATATTGAAGCACTTATGGCTGCTTTTGAGATTGTTAATTTCCATTTTGTTAATCGTCGTTATAATAGGTTTGCTCATAATATGGCCAGTG
AAGGCTACTCGGCACCTCCATGCTTATGGTTAGGTGCTTTTCCTGAGTGGATGGAACGGATGACACTCATGGAAAGCCATTTGTATGTTCCTTTTATCTCTTCTTAG
Protein sequenceShow/hide protein sequence
MNLGEIRSKQRLETAKQRATQGSLVRVRVRAKLGLIGDWVVRFEAIWDLQVVNKDWAKSDHKPIELSLLGSGIVANRRRKNWNFKFEEWWTGHVECRDIISRTGKWSEGP
NDNLSLQRTLEQCAIVLKGWGFNQNLRLRNEIKEIKSKIKVSYERRGPIDFDEIHGLERQLDNLLLEEEMYWKQRSRENWLKWGDRNTKWFHQKATMRRRKNWVHGVEDI
DGTWQSDLSKIRKSFENYFSALFSSAQPNQNDIQRVLSHIPRKVTDEMNQMLLAPYTREEIITAMRNFHPTKAPGPDGFPALFYQKYWDIVGDKTVSECLAILNSEVSIE
DWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKRKGKVGYAALKLDMSKAYDRVEWIYLRQI
MEKLGFHDCWIWLVMKCITTASFSILLNGEAFGFIRPTRGIRQGDPLSPYLFLLCSEGLLALLVDARNNSMAGVSIARQCPKISHLFFADDSLVFLKAKAEEFGFFRSIL
FDYEKASGQCVNFEKSTVFFSGNTPYETKQYLSHILSMNMSESLGSYLGLPSTFHRGKTRDFQFILDRVWTVLQGWKSQFFSQGGKEVLIKSIIQAIPTYAMGCFRIPKG
ILTKIMSLCARFWWGSQAEKRSVHWKRWEELCKPKEVGGLNFRDLVSFNQAMLAKQAWRVLTNPNLTLSKVLCGKYFPRISVLHVSYLTSSSFFWKGFVWGMKLLKQGIR
KNLGNGQTILMFKDPWLPRPFTFKVLSQVNPSLTNIKVAEFFNPSVQWDEAKLQQFLIKEDVEMIKGLPISLTAPDRWIWHYNNTGEYSVKSGYKLSMLRPMGPSMSARG
LDSRWWKKVWKLRIPNRVKIFVWKSFHNSIPSMFNLKRHHVPVHMTCPVCHEELETTDHALFRCTRAQEIWRIFQPNIWMDQWDMLEIKDRWLSFAAQPDSILASICVGA
WAIWNDRNSILYNRPIPTPMCRCEWIQDYLTEYWKANPLGGSLVQSEDEVIQILSEGEEVIMHTDATFIDATTKCGIGIVLRTKGGILKAAQHHSISGCHSPLGAEAVAI
LTGLQLAKGLKVRRLTVLSDCLNLIKSINGHLQGQSSISTTLWDIEALMAAFEIVNFHFVNRRYNRFAHNMASEGYSAPPCLWLGAFPEWMERMTLMESHLYVPFISS