; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035066 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035066
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:14339080..14342169
RNA-Seq ExpressionLag0035066
SyntenyLag0035066
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR005135 - Endonuclease/exonuclease/phosphatase
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EPS63383.1 hypothetical protein M569_11401 [Genlisea aurea]6.6e-12832.38Show/hide
Query:  DLVSRWNSWRLTEEEAEMVRPGEGTPVLSRGSIELCVIGKVFSSKRVNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFDRS
        D+ + + S  LT+ E+  V    G            ++GKV   + VN E     MR A+      +++  G+N F  +F+ +G+  RV    PW F+  
Subjt:  DLVSRWNSWRLTEEEAEMVRPGEGTPVLSRGSIELCVIGKVFSSKRVNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFDRS

Query:  LVVLKEPRSAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRGFQLQDEEGREFRCPVLYER
        L+VL      G    V  + C F VQIHN+P       +A  LG +IGT         G+     LR+RV + + +PL R  Q    EG      + YE+
Subjt:  LVVLKEPRSAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRGFQLQDEEGREFRCPVLYER

Query:  LPDFCYDCGCLGHSRRECGVIREGLGESGRRTQEGGVDNAAIRRPEIP----KPPTEGQVTQDSVPEKAGKEVASASGEGVGVQVSEQGGQGEGGSGEGD
        LP FC +CG L H  R C + RE           G    AA  R   P    K       T  S  E A  E   A+        + +  Q +    E  
Subjt:  LPDFCYDCGCLGHSRRECGVIREGLGESGRRTQEGGVDNAAIRRPEIP----KPPTEGQVTQDSVPEKAGKEVASASGEGVGVQVSEQGGQGEGGSGEGD

Query:  SVVVVARGEGM--ADKGKGKQKVGSVMVERRGMEVMGKGGGGRNWKRRARDVLSDVTNRDVSLGSVNGKRKADSGEAPPGIMSLIFWNVRGSGSPRTFRR
        + V     +GM   D       + + + +    E+ G          R R + SD T  D          + +  +APP  MSL+ WN RG  S  T RR
Subjt:  SVVVVARGEGM--ADKGKGKQKVGSVMVERRGMEVMGKGGGGRNWKRRARDVLSDVTNRDVSLGSVNGKRKADSGEAPPGIMSLIFWNVRGSGSPRTFRR

Query:  LTKLVQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISWGGC--RWRLTGFYGFPSADMHPQSW
        L  ++    P ++FLSETK  AS +   K  L +     V   G SGGLAL W   +  SLLSF  ++ID  +        WR TGFYG P+  + P+SW
Subjt:  LTKLVQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISWGGC--RWRLTGFYGFPSADMHPQSW

Query:  SLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLD
         LL ++R  +  PWL+ GDFN V+ Q+E E    +P S++ AF+  +  C L+D+GF G  FTWCN+R   +T+  RLDR    T W +L+P  +V HL 
Subjt:  SLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLD

Query:  YSRSDHRPVELVLTPPQQCWSQSGQRIVRFEETWLRQPELRQLVGRSWAAGPGESSDITPLAIANKAKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAI
        Y  SDH P+ + L P      +  +R  +FE  W   P    ++ +SWA       +  P     + ++   S++ W +SK G    R+    Q++ + +
Subjt:  YSRSDHRPVELVLTPPQQCWSQSGQRIVRFEETWLRQPELRQLVGRSWAAGPGESSDITPLAIANKAKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAI

Query:  AGLSTSDSRDLLVQAEAQLEE----VLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGP
          L+     D +   E+ L+E    + ++EE+YWKQR +  WLR GDRNT +FH  AS ++  NRI G+++  G+W      V   M  Y+Q LF+SS P
Subjt:  AGLSTSDSRDLLVQAEAQLEE----VLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGP

Query:  SVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARR
           + E AL  +  ++ D+M   L RP+   EV  A+++  P  +PGPDG    FY+ +W  VG   +++ L +LN G     +N + IVLIPK    + 
Subjt:  SVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARR

Query:  MVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIN
           +RPISL NV+YK+ SK++ NR+K I+ +++S+ Q+AF++
Subjt:  MVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIN

KAG2663507.1 hypothetical protein I3760_16G033000 [Carya illinoinensis]1.8e-14133.1Show/hide
Query:  MEDLVSRWNSWRLTEEEAEMVRPGEGTPVLSRGSIELCVIGKVFSSKRVNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFD
        M+ L S W +  L EEE + +   E      +   E  +IGK++S ++V      S     WR+  +  +     N F + F +  +K RV S  PW FD
Subjt:  MEDLVSRWNSWRLTEEEAEMVRPGEGTPVLSRGSIELCVIGKVFSSKRVNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFD

Query:  RSLVVLKEPRSAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRGFQLQDEEGREFRCPVLY
          L V+     +  V +++F+  +FWVQ HN+P+    +E   +LG  IG VEEV   +     G  LR++++L L +PL RG  +    G +   PV Y
Subjt:  RSLVVLKEPRSAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRGFQLQDEEGREFRCPVLY

Query:  ERLPDFCYDCGCLGHSRRECGVIREGLGESGR------------------RTQEGGVDNAAIRRPEIPKPPTEGQVTQDSVPEKAGKEVASAS-------
        E++P FC+ CG + H    C V ++   + G                    T +G    A + +  +     +G       P K    V S+S       
Subjt:  ERLPDFCYDCGCLGHSRRECGVIREGLGESGR------------------RTQEGGVDNAAIRRPEIPKPPTEGQVTQDSVPEKAGKEVASAS-------

Query:  -GEGVGVQVSEQGGQG--EGGSGEGDSVVVVARGEGMADKGKGKQKV----------------GSVMVERRGMEVMGKGGGG------------------
         G+G+GV +     +   +    +G + VV    + ++D   G +++                G     + G+   G    G                  
Subjt:  -GEGVGVQVSEQGGQG--EGGSGEGDSVVVVARGEGMADKGKGKQKV----------------GSVMVERRGMEVMGKGGGG------------------

Query:  --RNWKRRARDVL--------SDVTNRDVSLGSVNGKRKADSGEAPPGIMSLIFWNVRGSGSPRTFRRLTKLVQEKRPQVLFLSETKVSASRMSSAKRVL
           +WK++AR           S V  R     S++      S +A PG MSLI WN RG G+P   + L+ LV+ K P +LFL ETK++A  M   K  L
Subjt:  --RNWKRRARDVL--------SDVTNRDVSLGSVNGKRKADSGEAPPGIMSLIFWNVRGSGSPRTFRRLTKLVQEKRPQVLFLSETKVSASRMSSAKRVL

Query:  GFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISWGGCR---WRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEG
        GF NC  V C G+SGG+AL W +     + +FSK HI   ++        W LTGFYG        +SW+LL  L   ++  WLI GDFN +L   EK G
Subjt:  GFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISWGGCR---WRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEG

Query:  GRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPQQCWSQSGQR--IVR
        GRDK   ++ AF+ V+D C L DLGF G+ FTWCNRR     I ERLDR   N  W   YP   V H   + SDH P+ L LT      S  G R  + R
Subjt:  GRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPQQCWSQSGQR--IVR

Query:  FEETWLRQPELRQLVGRSWAAGPGESSDITPLAIANKAKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVY
        FE  W+   + +Q++  +W    G   D++   +  K + C + +  W ++  G+  R ++ A  R+         S++R  L +A  ++++ L   E+ 
Subjt:  FEETWLRQPELRQLVGRSWAAGPGESSDITPLAIANKAKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVY

Query:  WKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGPSVQDFEVA----LRDLEPSVDDEMNQALLRPFT
        WKQRS+ LWL EGD+N+R+FH +A+ R+K N I  ++D  GVWQ  +    +++ DYF  LF ++       EV     L+ L   +  +M + L  PF+
Subjt:  WKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGPSVQDFEVA----LRDLEPSVDDEMNQALLRPFT

Query:  KEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYIL
        ++EV  AL + HP KAPGPDG+S  FY+ +W +VG D+ +S L  LN GC P A+N T + LIPK K    + D+RPISLCNV YKLISK L NR+K  L
Subjt:  KEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYIL

Query:  PQLISQNQSAFI
          +IS +Q+AF+
Subjt:  PQLISQNQSAFI

MCH80348.1 hypothetical protein [Trifolium medium]3.3e-12731.03Show/hide
Query:  MEDLVSRWNSWRLTEEEAEMVRPGEGTPVLSRGSIELCVIGKVFSSKRVNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFD
        ME  VS+    R  ++E E +   +        S    ++GK+++    N+ AF+  M  AWR      I+   +NL+  +F +K E   V   GPW+FD
Subjt:  MEDLVSRWNSWRLTEEEAEMVRPGEGTPVLSRGSIELCVIGKVFSSKRVNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFD

Query:  RSLVVLKEPRSAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRGFQLQDEEGREFRCPVLY
        R+L++L          ++  +  +FWV+++++P++ ++  +A++LG  +GT EE+   +  +  G  LR+RV + L +PL+RG +L + +G+E      Y
Subjt:  RSLVVLKEPRSAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRGFQLQDEEGREFRCPVLY

Query:  ERLPDFCYDCGCLGHSRRECGVIRE----GLGESGRRTQEGGVDNAAIRRPEIPKPPTEGQVTQDSVPEKAGKEV------ASASGEGVGVQVSEQGGQG
        ERLP+FC+ CG +GH  R+C  I +       E   + Q  G     +R   +PK   E  V ++S      K +      +     G   ++ E+  Q 
Subjt:  ERLPDFCYDCGCLGHSRRECGVIRE----GLGESGRRTQEGGVDNAAIRRPEIPKPPTEGQVTQDSVPEKAGKEV------ASASGEGVGVQVSEQGGQG

Query:  EGGSGEGDSVVVVARG------EGMADKGKGKQKVGSVMVERRGMEVMG------------KGGGGRNWKRR-----------------ARDVLSDVTNR
        +  S +  S   + +G      EG   K +  QK   V+ E  G   +             K G GR W R+                  +  L DV   
Subjt:  EGGSGEGDSVVVVARG------EGMADKGKGKQKVGSVMVERRGMEVMG------------KGGGGRNWKRR-----------------ARDVLSDVTNR

Query:  DVSLGSVNGKRKADSGEAPPGIMSLIFWNVRGSGSPRTFRRLTKLVQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHG----RSGGLALMWVS
        D ++ ++ G  K   G+       ++   V  +GSPR  R L +L + + PQ++FL ET++  + + + +  LGF NC  VDC+G    R+GGLALMW+ 
Subjt:  DVSLGSVNGKRKADSGEAPPGIMSLIFWNVRGSGSPRTFRRLTKLVQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHG----RSGGLALMWVS

Query:  SISFSLLSFSKNHIDGWI--SWGGCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCGLRDL
         +S ++ SFS NHI G       G  W LTG YG+P      ++W L+  L    +  WL  GDFN +L   EK+GG  +  ++ +  +  V    L DL
Subjt:  SISFSLLSFSKNHIDGWI--SWGGCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCGLRDL

Query:  GFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPQQCWSQSGQRIVRFEETWLRQPELRQLVGRSWAAGPGES
        GF G  FTW N R  GE +  RLDR  GN  +++ +    VNHL    SDH  V + L  P    ++  +R+ RFEE+W ++ +  +L+  +W       
Subjt:  GFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPQQCWSQSGQRIVRFEETWLRQPELRQLVGRSWAAGPGES

Query:  SDITPLAIANKAKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASY
           + L+   K +        +     GN  + +     R++       + +      + E +  E+L+ +E  W+QRSR +WL++GDRNT++FH +AS 
Subjt:  SDITPLAIANKAKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASY

Query:  RQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHH
        R K+N I  ++D+ GVW      V +V   YF  LFS+S PS  +       ++  + D         +T+EEV  A+ Q HP KAPGPDGL   F++ +
Subjt:  RQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHH

Query:  WDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFINRQM
        W IVG ++    L +LNL   P  +N T +VLIPK K      DFRPISLCNV  K+++KV+ NR+K  L  +I   QSAF+  ++
Subjt:  WDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFINRQM

XP_018816058.1 uncharacterized protein LOC108987582 [Juglans regia]2.8e-12630.39Show/hide
Query:  VNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFDRSLVVLKEPRSAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEK
        VN +AF+S M   WR  G  + +  G N   ++F+   +K +V++  PW+FDR+L+ L+E      + D++F    FWVQ HN+P+   ++E+   L   
Subjt:  VNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFDRSLVVLKEPRSAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEK

Query:  IGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRGFQLQDEEGREFRCPVLYERLPDFCYDCGCLGHSRRECGVIREGLGES------------------
        +  V  V    GG   G ILR +V + + +PL RG  L  +  +++  P  YERLP FC++CG + HS++  G  R    E+                  
Subjt:  IGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRGFQLQDEEGREFRCPVLYERLPDFCYDCGCLGHSRRECGVIREGLGES------------------

Query:  ------------GRRTQEGGVDNAAI------RRPEIPKPPTEGQVTQDSVPEKAG-----------------------------KEVASASGEGVGVQV
                     + +QE G   A         R       T     +  V E+ G                             +E  S   E +G   
Subjt:  ------------GRRTQEGGVDNAAI------RRPEIPKPPTEGQVTQDSVPEKAG-----------------------------KEVASASGEGVGVQV

Query:  SEQGGQGEGGSGEGDSVVVVARGEGMADKGKGKQKVGSVMVERRGMEVMGKGGGGRNWKRRARDVLSDVT------------------------------
        S Q  +          + + + G  +A          +  ++      + K     +WK RAR     VT                              
Subjt:  SEQGGQGEGGSGEGDSVVVVARGEGMADKGKGKQKVGSVMVERRGMEVMGKGGGGRNWKRRARDVLSDVT------------------------------

Query:  ----NRDVSLGSVNGKRKADSGEAPPGIMSLIFWNVRGSGSPRTFRRLTKLVQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHGRSGGLALMW
            N D  +     K    + + PP  M  I WN RG G+PRT   L  L+++K P +LF  ETK S +R+    ++L FD+C  VD  G SGGLALMW
Subjt:  ----NRDVSLGSVNGKRKADSGEAPPGIMSLIFWNVRGSGSPRTFRRLTKLVQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHGRSGGLALMW

Query:  VSSISFSLLSFSKNHIDGWISWGGCR--WRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCGLR
          ++  S+ ++S+ HI  ++  G  R  W LTGFYG P       SW LL  ++   +  WL  GDFN + C +EK GG  +P  ++  F+ V+  C L+
Subjt:  VSSISFSLLSFSKNHIDGWISWGGCR--WRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCGLR

Query:  DLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPQQCWSQSGQRIVRFEETWLRQPELRQLVGRSWAAGPG
             G  FTW N R G +   E+LDR   N  W++ + +     L    SDH P+ L+    ++  +Q    + RFE  W  + E   ++ R+W     
Subjt:  DLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPQQCWSQSGQRIVRFEETWLRQPELRQLVGRSWAAGPG

Query:  ESSDITPLAIANKAKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRA
         S     + +  + K+C  ++  W  S +   P+ I    +R++  +   +  +  + + Q + +LE  L E+++ WKQR+++ WL+ GD+NT++FH  A
Subjt:  ESSDITPLAIANKAKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRA

Query:  SYRQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYR
        + R+K N+I  + D       EK  +  V T +F  LF+SS P   + E  L D+ P V   MN+ LL+ FT+EEV +A+ Q     +PGPDG    FY+
Subjt:  SYRQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYR

Query:  HHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFI
         HWDI+G ++ +  L +LN G S   VN+T I LIPKVK  +R+ D+RPISLCNV YK+++K++ NRMK +LP +IS NQSAF+
Subjt:  HHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFI

XP_030939698.1 uncharacterized protein LOC115964550 [Quercus lobata]7.1e-13032.12Show/hide
Query:  DLVSRWNSWRLTEEEAEMVRPGEGTPVLSRGSIELCVIGKVFSSKRVNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFDRS
        D + R    +LTEEE E+V+        +     L ++G+  +++  N  A +S++R+ W++    RI   GE LF  +F+ + +   V+  GPW+FD +
Subjt:  DLVSRWNSWRLTEEEAEMVRPGEGTPVLSRGSIELCVIGKVFSSKRVNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFDRS

Query:  LVVLKEPRSAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRGFQLQDEEGREFRCPVLYER
        L+VL+      +   V F     WVQ+  +P      E   ++G+ +G V EV            +RIRV + L++P+RRG  + + EG + +    YER
Subjt:  LVVLKEPRSAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRGFQLQDEEGREFRCPVLYER

Query:  LPDFCYDCGCLGHSRRECGV------IREGLGE---SGRRTQEGGVDNAAIRRPEIPKPPTEGQVTQDSVPEKAGKEVASASGEGVGVQVSEQGGQGEGG
        L   CY CG LGH  ++C V        +  G+   +G R ++ G D A    P  P  P        +V   +  EVA + G     + ++ G +    
Subjt:  LPDFCYDCGCLGHSRRECGV------IREGLGE---SGRRTQEGGVDNAAIRRPEIPKPPTEGQVTQDSVPEKAGKEVASASGEGVGVQVSEQGGQGEGG

Query:  SGEGDSVVVVARGEGMADKGKGKQKVGSVMVER-------------RGMEVMGKGGGGRNWKRRARDVLSDVTNRDVSLGSVNGKRKADSG---EAPPGI
             S V V++   ++++   +   G+   E              RGME+ G        +  A   L+  T  +V +     +  AD     + P   
Subjt:  SGEGDSVVVVARGEGMADKGKGKQKVGSVMVER-------------RGMEVMGKGGGGRNWKRRARDVLSDVTNRDVSLGSVNGKRKADSG---EAPPGI

Query:  MSLIFWNVRGSGSPRTFRRLTKLVQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISWGGC-RW
         S++  +V           L+ LV+ K P++LFL ETK S   M   +  L +   F V    RSGGLAL+W+  I   + +F+ NHID  I       W
Subjt:  MSLIFWNVRGSGSPRTFRRLTKLVQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISWGGC-RW

Query:  RLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVF
        RLTGFYG+P      +SW LL  L      PWL  GDFN +L  +EK+GG  KPL+ +  F+  +  CGL DLG+ G+ FTW N R   + + ERLDR  
Subjt:  RLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVF

Query:  GNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPQQCWSQSGQRIVRFEETWLRQPELRQLVGRSWAAGPGESSDITPLAIANKAKRCMQSMVGWGRSKS
            W D +    V HL+ S SDH P+ LV T  +   +   +   RFEE W   P+   ++  +W +     S +  L    K KRC  ++V W R   
Subjt:  GNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPQQCWSQSGQRIVRFEETWLRQPELRQLVGRSWAAGPGESSDITPLAIANKAKRCMQSMVGWGRSKS

Query:  GNFPRRIRGANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVTQV
        G    +++   Q++   +   + +++   +   +A++  ++ ++E++W+QRSR +WL  GD+NT++FH RAS R++ N I G+ D    W      + +V
Subjt:  GNFPRRIRGANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVTQV

Query:  MTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVND
           YFQ LFS++ P  Q+ E  L+ ++  V   MN++L RP+T +EV LAL Q HP+K+PGPDG+S  F++ +W I+G D+ ++ L+ L        +N 
Subjt:  MTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVND

Query:  TMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFINRQM
        T IVLIPK K  + + D+RPISL NV  ++ISKV+ NR+K ILP +IS +QSAF+  ++
Subjt:  TMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFINRQM

TrEMBL top hitse value%identityAlignment
A0A2N9EWI8 Uncharacterized protein2.9e-14534.1Show/hide
Query:  RLTEEEAEMVRPGEGTPVLSRGSIELCVIGKVFSSKRVNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFDRSLVVLKEPRS
        +L+ +EA  +  G  T   S    +  V+ K+ + K  N +A +S +R  W   G   I    +NLF   FR++ +  R+    PW FD+ L+ + +   
Subjt:  RLTEEEAEMVRPGEGTPVLSRGSIELCVIGKVFSSKRVNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFDRSLVVLKEPRS

Query:  AGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRG--FQLQDEEGREFRCP----VLYERLPD
              V+F   AFWV++ N+P++  TRE+   +G  +G   EV     G   G  LRIRV + + +PL RG   Q +DEE      P      YE LP 
Subjt:  AGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRG--FQLQDEEGREFRCP----VLYERLPD

Query:  FCYDCGCLGHSRREC--GVIREGLGESGRRTQEGGVDNAAIRRPEIPK----PPTEGQV--TQDSVPEKAGKEVASASGEGVGVQVSEQGGQGEGGSG--
        FCY CG LGHS  +C  G    G  E G       +     R P  P+    PP +G     + S       EV + S   V  + S    +    S   
Subjt:  FCYDCGCLGHSRREC--GVIREGLGESGRRTQEGGVDNAAIRRPEIPK----PPTEGQV--TQDSVPEKAGKEVASASGEGVGVQVSEQGGQGEGGSG--

Query:  ------EGDSVVVVARGEG-----MADKGKGKQKVG----SVMVERRGMEVMGKGGGGRNWKRRARDVLSDVTNR--------DVSLGSVNG--------
                 S V +AR E       AD     ++      ++ V+   + V+   G      R    ++ D TN+        +V +  +NG        
Subjt:  ------EGDSVVVVARGEG-----MADKGKGKQKVG----SVMVERRGMEVMGKGGGGRNWKRRARDVLSDVTNR--------DVSLGSVNG--------

Query:  ------KRKADSGEA------------PPGIMSLIFWNVRGSGSPRTFRRLTKLVQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHGRSGGLA
              + + D  E             PPG M  I  N RG G+P T   L   V+++ PQ++FL ET++   ++   +  LG   CF V+  G  GGLA
Subjt:  ------KRKADSGEA------------PPGIMSLIFWNVRGSGSPRTFRRLTKLVQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHGRSGGLA

Query:  LMWVSSISFSLLSFSKNHIDGWI-SWGGCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCG
        L+W  S+   + S+SK+HID W+ S  G  WR TGFYG P       SW LL RL+G ++ PWL+ GDFN ++  DEK+G   +  +++A F+  ++ C 
Subjt:  LMWVSSISFSLLSFSKNHIDGWI-SWGGCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCG

Query:  LRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPV----ELVLTPPQQCWSQSGQRIVRFEETWLRQPELRQLVGRS
        L DLGF G  FTW N R   E + ERLDR      WMDL+P   + H+ ++ SDH  +    E V+  PQ+    S +R   FE  WLR+    + + ++
Subjt:  LRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPV----ELVLTPPQQCWSQSGQRIVRFEETWLRQPELRQLVGRS

Query:  WAAGPGESSDITPLAIANKAKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTR
        W              ++ K K+C   ++ W ++     P+ I    +R+     G  +  +         +L  +LQ+EE+YW+QRSR  WLREGDRNT 
Subjt:  WAAGPGESSDITPLAIANKAKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTR

Query:  WFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGL
        +FH  AS R+K N I G+ D Q VWQ+E+T +  V+  YF  +++++ P   D    +R+++  V  +MNQ LL+PFT+EEV +AL Q  P+KAPGPDG+
Subjt:  WFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGL

Query:  SGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFINRQM
        +  F++  W IVG D+  + L  LN G    ++N T I LIPKVK+   M  FRPISLCNV YK+ISKVLVNRMK ILP ++S +QSAF+  +M
Subjt:  SGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFINRQM

A0A2N9GJ35 Uncharacterized protein1.6e-15134.81Show/hide
Query:  EDLVSRWNSWRLTEEEAEMVRPGEGTPVLSRGSIELCVIGKVFSSKRVNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFDR
        E+L       +L+E+E   +   +   + S+   +  ++ K+ ++K  + EAF+  +RA W   G   I S   NLF   F  + +  R+    PW FD+
Subjt:  EDLVSRWNSWRLTEEEAEMVRPGEGTPVLSRGSIELCVIGKVFSSKRVNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFDR

Query:  SLVVLKEPRSAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRGFQLQDEE---GREFRCPV
         L+ +          +V F+  AFW+++ N+P++   RE+   +G++IG + EV   + G   G  LRIRV + + +PL RG  LQ +E   G  F    
Subjt:  SLVVLKEPRSAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRGFQLQDEE---GREFRCPV

Query:  LYERLPDFCYDCGCLGHSRRECGVIREG-LGESGRRTQEGGVDNAAIRRPEIPKPPTEGQVTQDSVPEKAGKEVASASGEGVGVQVSEQGGQGEGGSGEG
         YE LP FCY CG LGH   EC V R G + E     + G    A   RP  P+   EG    D                                  EG
Subjt:  LYERLPDFCYDCGCLGHSRRECGVIREG-LGESGRRTQEGGVDNAAIRRPEIPKPPTEGQVTQDSVPEKAGKEVASASGEGVGVQVSEQGGQGEGGSGEG

Query:  DSVVVVARGEGMADKGKGKQKVGSVMVERRGMEVMGKGGGGRNWKRR-ARDVLSDVTNRDVSLGSVNGKRKADSGEAPPGIMSLIFWNVRGSGSPRTFRR
        +S +         D+    +   S  V          GGG + W      ++L ++   ++    V     +   +APP  M  +  N RG G+P+T   
Subjt:  DSVVVVARGEGMADKGKGKQKVGSVMVERRGMEVMGKGGGGRNWKRR-ARDVLSDVTNRDVSLGSVNGKRKADSGEAPPGIMSLIFWNVRGSGSPRTFRR

Query:  LTKLVQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISWG-GCRWRLTGFYGFPSADMHPQSWS
        L  LV+++ P ++FL ET+++   +   +  LG   C  V+ HG+ GGLAL+W SS+  ++ S+S++HIDG +    G RWRLTGFYG+P A +  +SWS
Subjt:  LTKLVQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISWG-GCRWRLTGFYGFPSADMHPQSWS

Query:  LLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDY
        LL  LR  ++ PW+I GDFN +   +EK G  D+  +++AAF+  +  C L+D+GF G  FTW N R  G+ +  RLDR   +  W+ L+P+  +NHL  
Subjt:  LLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDY

Query:  SRSDHRPVELVL----TPPQQCWSQSGQRIVRFEETWLRQPELRQLVGRSWAAGPGESSDITPLAIANKAKRCMQSMVGWGRSKSGNFPRRIRGANQRVQ
        + SDH  V L+L      P+    Q  +R+ RFE++WL++    +++  +W   P          +A K K+C   ++ W +S     P+ I    +++Q
Subjt:  SRSDHRPVELVL----TPPQQCWSQSGQRIVRFEETWLRQPELRQLVGRSWAAGPGESSDITPLAIANKAKRCMQSMVGWGRSKSGNFPRRIRGANQRVQ

Query:  S-AIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGP
           +      DSR + +  +  L  + ++ E+ W+QRSR +WL EGDRNT++FH  AS R+K+N I GL D Q  W+ E   V Q+  DYF  LF+SS P
Subjt:  S-AIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGP

Query:  SVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARR
           D    L ++E  V   MN  L+RPFT+EE+  AL Q HP+K+PGPDG+S  F++ +W IV  D+  + L  L  G   G++N T +VLIPKV A   
Subjt:  SVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARR

Query:  MVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFINRQM
        +  FRPISLCNV YK++SKVLVNRMK ILPQ+IS +QSAF+  +M
Subjt:  MVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFINRQM

A0A2N9IPS8 Reverse transcriptase domain-containing protein8.9e-14732.9Show/hide
Query:  EDLVSRWNSWRLTEEEAEMVRPGEGTPVLSRGSIELCVIGKVFSSKRVNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFDR
        E+L+  W  + LTE+E+      +     SR     C++GK+ + +  N EA ++ M   W V     ++  G+NLF  QF++  E+ RV+S  PW FD 
Subjt:  EDLVSRWNSWRLTEEEAEMVRPGEGTPVLSRGSIELCVIGKVFSSKRVNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFDR

Query:  SLVVLKEPRSAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRGFQL---------------
         L+ L E   +     ++F+ C FWV  H VP+ + T+    ++G  +G V +V   + G   GP LR+R+ L   +P+ RG  +               
Subjt:  SLVVLKEPRSAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRGFQL---------------

Query:  -QDEEGREFRCPVLYERLPDFCYDCGC--------LGHSRRECGVIRE-GLGESGRRTQEGGVDNAAIRRPEIPKPPTEGQVTQDSVPEKAGKEVASASG
           + G   R P   +R      D            G +RR   +  E  L      +   G D    +  E+       Q  +  +         +A  
Subjt:  -QDEEGREFRCPVLYERLPDFCYDCGC--------LGHSRRECGVIRE-GLGESGRRTQEGGVDNAAIRRPEIPKPPTEGQVTQDSVPEKAGKEVASASG

Query:  EGVGVQVSEQGGQGEGGSGEGDSVVVVARG------------------EGM---ADKGKGK-----------QKVGSVMVERRGMEVMGKGGGGRN----
              ++ +  Q E   G  D   V+  G                  EGM   +DK   K           Q V    V  +  E      G  N    
Subjt:  EGVGVQVSEQGGQGEGGSGEGDSVVVVARG------------------EGM---ADKGKGK-----------QKVGSVMVERRGMEVMGKGGGGRN----

Query:  -----WKRRARDVLSDVTNRDVSLGSVNGKRKADSGE-APPGIMSLIFWNVRGSGSPRTFRRLTKLVQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCF
             WKR AR+       +   +     KR  +S E APP  M L+ WN +G G+  T R L  L++E+ P VLFLSET++    +   +  + FD  F
Subjt:  -----WKRRARDVLSDVTNRDVSLGSVNGKRKADSGE-APPGIMSLIFWNVRGSGSPRTFRRLTKLVQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCF

Query:  CVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWI--SWGGCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLS
        CV   G  GGLA++W++ +   L ++S+NHID  I     G  +RLTGFYG P      +SW+LL  L   + +PWL  GDFN +L  +E+ G   +P  
Subjt:  CVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWI--SWGGCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLS

Query:  ELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPQQCWSQSGQRIVRFEETWLRQP
        ++  F+  V  CGL DLG+VG+ +TW  +R G   +  RLDR+  +  W+  Y   VV+HL    SDH P+  +L  P     +  +++ RFE  W++  
Subjt:  ELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPQQCWSQSGQRIVRFEETWLRQP

Query:  ELRQLVGRSWAAGPGESSDITPLAIANKAKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAI----AGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRS
        + R+++  +W  G  E S +    +  K K C  S++GW R + G+    I+   +++Q  I    +G ST      +++ +  L  +L++EE++W+QRS
Subjt:  ELRQLVGRSWAAGPGESSDITPLAIANKAKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAI----AGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRS

Query:  RELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALK
        R  W+ EGD+NT++FH + + R++ N I GL D  GVWQ EKT + ++  DYFQ +F+SS PS +     L+ +E  V + MN  L   FTK+EV LALK
Subjt:  RELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALK

Query:  QTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQS
        Q +P KAPGPDG+S  FY+ +WDIVGP++ Q+ L++L+ G     +N T I LIPKVK    + DFRPISLCNV YK++SKVL NR+K +LP +IS+ QS
Subjt:  QTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQS

Query:  AFI
        AF+
Subjt:  AFI

A0A7N2LIH6 Uncharacterized protein3.6e-14832.98Show/hide
Query:  EDLVSRWNSWRLTEEEAEMVRPGEGTPVLSRGSIELCVIGKVFSSKRVNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFDR
        E+L   W    +TE E E ++ G  +   ++   + CV+ K+ + + V +EA +  MR  W+     +I   GE+LF ++F    +K +V+   PW++++
Subjt:  EDLVSRWNSWRLTEEEAEMVRPGEGTPVLSRGSIELCVIGKVFSSKRVNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFDR

Query:  SLVVLKEPRSAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRGFQLQDEEGREFRCPVLYE
         L++++E        +++     FWVQI N+P++  TRE   ++G KIG V EV   + G   G  LR+R+       L RG ++  E G        YE
Subjt:  SLVVLKEPRSAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRGFQLQDEEGREFRCPVLYE

Query:  RLPDFCYDCGCLGHSRRECGVIREG---------------LGESGRRTQEG----GVDNAAIRRPEIPKPPTEGQV-TQDSVPEKAG------KEVASAS
        RLP+FCY CG L H  ++C   ++G                GE GR +       G +    RR +  +  TE Q   +  + E A        E A   
Subjt:  RLPDFCYDCGCLGHSRRECGVIREG---------------LGESGRRTQEG----GVDNAAIRRPEIPKPPTEGQV-TQDSVPEKAG------KEVASAS

Query:  GEGVGVQVSEQGGQGEGG----------------------------SGEGDSVVVVARGEGMADKGKGKQ---KVGSVMVERRGMEVMG-KGGGGRN---
         +G    + E GG G+ G                            SG+ DS+  +   + M DK + ++    V +V  E    +  G KG G  N   
Subjt:  GEGVGVQVSEQGGQGEGG----------------------------SGEGDSVVVVARGEGMADKGKGKQ---KVGSVMVERRGMEVMG-KGGGGRN---

Query:  --------------------------------WKRRARDVLSDVTNRDVSLGSVNGKRK--------ADSGEAPPGIMSLIFWNVRGSGSPRTFRRLTKL
                                        WKR AR       +   + GSV+G +K           G APP  M+++ WN RG G+    R LT  
Subjt:  --------------------------------WKRRARDVLSDVTNRDVSLGSVNGKRK--------ADSGEAPPGIMSLIFWNVRGSGSPRTFRRLTKL

Query:  VQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWI--SWGGCRWRLTGFYGFPSADMHPQSWSLLS
        V++K P ++FL ETK S  +M   +  LGF     V   GRSGGLAL+W         S S +HID  +  +  G  WR TGFYG P       SW LL 
Subjt:  VQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWI--SWGGCRWRLTGFYGFPSADMHPQSWSLLS

Query:  RLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRS
         L    E PWL+ GDFN ++  DEK G +D+  +++ AF+ V+  CGL DLGFVG  FTWCN R G +    RLDR+  N  W  ++P   V+H+  S S
Subjt:  RLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRS

Query:  DHRPVELVLTPPQQCWSQSGQRIVRFEETWLRQPELRQLVGRSWAAGPGESSDITPLAIANKAKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAIAGLS
        DH  + L L        + G++   FEE W R  E +++V  +W     +S+    + +  + +RC + +  W ++  GN  + I+    R+Q   +   
Subjt:  DHRPVELVLTPPQQCWSQSGQRIVRFEETWLRQPELRQLVGRSWAAGPGESSDITPLAIANKAKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAIAGLS

Query:  TSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGPSVQDFEVA
          ++ + +   + ++ E+   EEV WKQRSR  WL+ GD+N+++FH  AS R++ NRIGGL DD GVW +++    +++ DYF+ ++SS+ P+   F+V+
Subjt:  TSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGPSVQDFEVA

Query:  LRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPIS
        L  ++  V  EMN  L + F   EV  AL+Q HP KAPGPDG+S  FY+ +WDIVG  +    L  LN G  P  +N T I LIPK K  +++ +FRPIS
Subjt:  LRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPIS

Query:  LCNVSYKLISKVLVNRMKYILPQLISQNQSAFINRQM
        LCNV YK+ISKVL NR+K +L  +I + QSAF+  +M
Subjt:  LCNVSYKLISKVLVNRMKYILPQLISQNQSAFINRQM

A0A7N2R0C3 Reverse transcriptase domain-containing protein2.1e-15132.72Show/hide
Query:  EDLVSRWNSWRLTEEEAEMVRPGEGTPVLSRGSIELCVIGKVFSSKRVNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFDR
        E+L   W   ++TEEE E +  G+     +    + CV  KV S K + VEA R  +R  W+ + + ++   GE LF ++F  + +K RV+   PW +++
Subjt:  EDLVSRWNSWRLTEEEAEMVRPGEGTPVLSRGSIELCVIGKVFSSKRVNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFDR

Query:  SLVVLKEPRSAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRGFQLQDEEGREFRCPVLYE
         LV+ KE     S  D+      FWVQI+N+P++ +T+E  + +GE IG   EV   + G   G  LR+RV + + R L RG ++  E+G        YE
Subjt:  SLVVLKEPRSAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRGFQLQDEEGREFRCPVLYE

Query:  RLPDFCYDCGCLGHSRREC------------GVIREGLGESGRRTQEGGVDNAAIRRPEIPKPPTEGQVT------QDSVPEKAGKEVASASGEGVGVQV
        RLP+FCY CG L H  ++C              ++ G    G   ++GG D    ++  I +   +          +D V E   +E        +G   
Subjt:  RLPDFCYDCGCLGHSRREC------------GVIREGLGESGRRTQEGGVDNAAIRRPEIPKPPTEGQVT------QDSVPEKAGKEVASASGEGVGVQV

Query:  SEQGGQGEGGSGEGDSVVVVARGEGMADKGKGKQKVGSVMVERRGMEVMGKGGGGRN--WKRRARDVLSDVTNRDVSLGS--------------------
         E+ G   GG      V  V +GE    +  G+   G ++        +G G GG+    K RA ++   + N +  LG+                    
Subjt:  SEQGGQGEGGSGEGDSVVVVARGEGMADKGKGKQKVGSVMVERRGMEVMGKGGGGRN--WKRRARDVLSDVTNRDVSLGS--------------------

Query:  -------------------------------------------VNGKRKAD--------------SGEAPPGIMSLIFWNVRGSGSPRTFRRLTKLVQEK
                                                   V  KR  D                + PP  M+ + WN RG GS    R LT  V+  
Subjt:  -------------------------------------------VNGKRKAD--------------SGEAPPGIMSLIFWNVRGSGSPRTFRRLTKLVQEK

Query:  RPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWI--SWGGCRWRLTGFYGFPSADMHPQSWSLLSRLRG
         P ++FL+ETK S  R+   +R LG      V   GRSGGLA++W   +  SL S S +HID  +  S G   WR TGFYG P A M P SW LL  L  
Subjt:  RPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWI--SWGGCRWRLTGFYGFPSADMHPQSWSLLSRLRG

Query:  CAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRP
            PW++ GDFN +L  DEK G  ++   ++  F+  +  CGL DLGFVG  FTWCN R G +    RLDR+  N  WM+L+P   V H   + SDH  
Subjt:  CAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRP

Query:  VELVLTPPQQCWSQSGQRIVRFEETWLRQPELRQLVGRSWAAGPGESSDITP-LAIANKAKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAIAGLSTSD
        + L +   +    +  +R   FEE W R+   R+++ R+W     +     P L I N+ K C   +  W R   GN  + ++    R+Q         +
Subjt:  VELVLTPPQQCWSQSGQRIVRFEETWLRQPELRQLVGRSWAAGPGESSDITP-LAIANKAKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAIAGLSTSD

Query:  SRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGPSVQDFEVALRD
        S + + + + ++ EV+  EE+ W QRSR LW++ GDRNTR+FH  A+ R++ N+I G+ D +G W++    V +++ +YF+ ++SS+ P+  +F   L  
Subjt:  SRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGPSVQDFEVALRD

Query:  LEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCN
        +   V ++MN+ LLR F +EEV  AL Q HP K+PGPDG+S  F++ +WD+VGP ++QS +  L  G  P  VN+T I LIPKVK  +++ ++RPISLCN
Subjt:  LEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCN

Query:  VSYKLISKVLVNRMKYILPQLISQNQSAFI
        V YKL+SKVL NR+K +LP ++ + QSAF+
Subjt:  VSYKLISKVLVNRMKYILPQLISQNQSAFI

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein4.6e-2320.52Show/hide
Query:  MSLIFWNVRGSGSPRTFRRLTKLVQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISWGGC---
        ++++  NV G  SP    RL   ++ + P V  + ET ++  R +   ++ G+   +  +   +  G+A++      F      ++    +I   G    
Subjt:  MSLIFWNVRGSGSPRTFRRLTKLVQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISWGGC---

Query:  -RWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGR--DKPLSELAAFQSVVDLCGL-RDLGFVGDCFTWCNRRPGGETIYE
            +   Y  P+         +LS L+   ++  LI GDFN  L   ++   +  +K   EL +     DL  + R L      +T+ +        Y 
Subjt:  -RWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGR--DKPLSELAAFQSVVDLCGL-RDLGFVGDCFTWCNRRPGGETIYE

Query:  RLDRVFGNTPWMD--LYPNYVVNHLDYSRSDHRPVELVLTPPQQCWSQSGQ---RIVRFEETWLRQPELRQLVGRSWAAGPGESSDITPLAIANKAKRCM
        ++D + G+   +        + N+L    SDH  ++L L       S+S       +   + W+   E++  +   +     + +    L  A KA  C 
Subjt:  RLDRVFGNTPWMD--LYPNYVVNHLDYSRSDHRPVELVLTPPQQCWSQSGQ---RIVRFEETWLRQPELRQLVGRSWAAGPGESSDITPLAIANKAKRCM

Query:  QSMVGWG-------RSKSGNFPRRIRGANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGG
           +          RSK      +++   ++ Q+     S +  R  + +  A+L+E+  ++ +     SR  +    ++  R        +++ N+I  
Subjt:  QSMVGWG-------RSKSGNFPRRIRGANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGG

Query:  LEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGPSVQDFEVALRDLE-PSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDI
        +++D+G    + T +   + +Y++HL+++   ++++ +  L     P ++ E  ++L RP T  E++  +      K+PGPDG +  FY+ + + + P +
Subjt:  LEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGPSVQDFEVALRDLE-PSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDI

Query:  IQSCLAVLNLGCSPGAVNDTMIVLIPKV-KAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFI
        ++   ++   G  P +  +  I+LIPK  +   +  +FRPISL N+  K+++K+L NR++  + +LI  +Q  FI
Subjt:  IQSCLAVLNLGCSPGAVNDTMIVLIPKV-KAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFI

P08548 LINE-1 reverse transcriptase homolog7.1e-1617.37Show/hide
Query:  MSLIFWNVRGSGSPRTFRRLTKLVQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISW--GGCR
        +S+   NV G   P    RL   +Q+ +P +  + E+ ++       K V G+ + F  +   +  G+A+++  +I F      K+  DG   +  G  +
Subjt:  MSLIFWNVRGSGSPRTFRRLTKLVQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISW--GGCR

Query:  W---RLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCGLRDLGFV----GDCFTWCNRRPGGETI
        +    +   Y  P+ +        L+ +     +  ++ GDFN  L   ++   + K   E+    S +    L D+          +T+ +   G    
Subjt:  W---RLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCGLRDLGFV----GDCFTWCNRRPGGETI

Query:  YERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPQQCWSQSGQ---RIVRFEETWLRQPELRQLVGRSWAAGPGESSDITPLAIANKA--KR
        Y ++D + G+   +  +    +  +    SDH  +++ L   +   + +       +  ++TW+   E+++ + +       + ++   L    KA  + 
Subjt:  YERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPQQCWSQSGQ---RIVRFEETWLRQPELRQLVGRSWAAGPGESSDITPLAIANKA--KR

Query:  CMQSMVGWGRSKSGNFPRRIRGANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQ
           ++  + +         + G  ++++           R  + +  A+L E+  +  +    +S+  +  + ++  +        ++  + I  + +  
Subjt:  CMQSMVGWGRSKSGNFPRRIRGANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQ

Query:  GVWQQEKTAVTQVMTDYFQHLFSSSGPSVQDFEVALRDLE-PSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCL
             + + + +++ +Y++ L+S    ++++ +  L     P +  +  + L RP +  E+   ++     K+PGPDG +  FY+   + + P ++    
Subjt:  GVWQQEKTAVTQVMTDYFQHLFSSSGPSVQDFEVALRDLE-PSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCL

Query:  AVLNLGCSPGAVNDTMIVLIPKV-KAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFI
         +   G  P    +  I LIPK  K   R  ++RPISL N+  K+++K+L NR++  + ++I  +Q  FI
Subjt:  AVLNLGCSPGAVNDTMIVLIPKV-KAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFI

P11369 LINE-1 retrotransposable element ORF2 protein9.0e-1124.72Show/hide
Query:  IGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGPSVQDFEVAL-RDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVG
        I  + +++G    +   +   +  +++ L+S+   ++ + +  L R   P ++ +    L  P + +E+   +      K+PGPDG S  FY+   + + 
Subjt:  IGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGPSVQDFEVAL-RDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVG

Query:  PDIIQSCLAVLNLGCSPGAVNDTMIVLIPK-VKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFI
        P + +    +   G  P +  +  I LIPK  K   ++ +FRPISL N+  K+++K+L NR++  +  +I  +Q  FI
Subjt:  PDIIQSCLAVLNLGCSPGAVNDTMIVLIPK-VKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFI

P14381 Transposon TX1 uncharacterized 149 kDa protein1.9e-2125.69Show/hide
Query:  LIGGDFNAVL-CQD----EKEGGRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPV
        +IGGDFN  L  +D    +K    +  L EL A  S+VD+   R+       FT+   R  G     R+DR++ ++  M    +  +    +  SDH  V
Subjt:  LIGGDFNAVL-CQD----EKEGGRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPV

Query:  ELVLT-----PPQQCWSQSGQRIVRFEETWLRQPELRQLV---GRSWAAGPGESSDITPLAIANKAKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAIA
         L ++     P    W         F  + L      + V    R W A   E + +       K    +     + +S SG     I   N  V     
Subjt:  ELVLT-----PPQQCWSQSGQRIVRFEETWLRQPELRQLV---GRSWAAGPGESSDITPLAIANKAKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAIA

Query:  GLSTSDSRDL---LVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGPSV
         LS S+ + L    ++ +  L  + Q +      RSR   L + DR +R+F+     +    +I  L  + G   ++  A+      ++Q+LFS   P  
Subjt:  GLSTSDSRDL---LVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLFSSSGPSV

Query:  QDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMV
         D    L D  P V +   + L  P T +E+  AL+    NK+PG DGL+  F++  WD +GPD  +        G  P +    ++ L+PK    R + 
Subjt:  QDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMV

Query:  DFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQS
        ++RP+SL +  YK+++K +  R+K +L ++I  +QS
Subjt:  DFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQS

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.4e-2726.67Show/hide
Query:  PLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPQQCWSQSGQRIVRFEETWL
        P+  L  FQ+ +    L D+   G  +TW N +     I  +LDR   N  W   +P+ +        SDH P  ++L    +   +  ++  R+     
Subjt:  PLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPQQCWSQSGQRIVRFEETWL

Query:  RQPELRQLVGRSWAAGPGESSDITPLAIANK-AKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEE--------
          P     +  +W       S +  L    K AK+C + +    R   GN   + + A   ++S  + L T+ S  L      ++E V +++        
Subjt:  RQPELRQLVGRSWAAGPGESSDITPLAIANK-AKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEE--------

Query:  EVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLF-SSSGPSVQDFEVALRDLEP-SVDDEMNQALLRPF
        E +++Q+SR  WL++GD NTR+FH      Q  N I  L  D  V  +  T V +++  Y+ HL  S S     D    ++D+ P   +D +   L    
Subjt:  EVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVTQVMTDYFQHLF-SSSGPSVQDFEVALRDLEP-SVDDEMNQALLRPF

Query:  TKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLIS
        + +E+  A+     NKAPGPD  +  F+   W +V    I +       G      N T I LIPKV    ++  FRP+S C V YK+I+
Subjt:  TKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLIS

AT3G42140.1 zinc ion binding;nucleic acid binding3.3e-0827.34Show/hide
Query:  FRSKGEKGRVVSTGPWAFDRSLVVLKEPRSAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLR
        F+S+     ++  GPW+F+  + V++      S  D EF    FW+QI  +P+R+ T  +   +GE++G                       L L+  L 
Subjt:  FRSKGEKGRVVSTGPWAFDRSLVVLKEPRSAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLR

Query:  RGFQLQDEEGREFRCPVLYERLPDFCYDCGCLGHSRREC
        R     D    +F+    YE+L +FC  CG L H   EC
Subjt:  RGFQLQDEEGREFRCPVLYERLPDFCYDCGCLGHSRREC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGATTTAGTTAGTCGATGGAACTCCTGGAGACTGACAGAGGAAGAGGCAGAGATGGTACGGCCAGGGGAGGGGACACCGGTTTTGTCAAGGGGCTCGATAGAACT
GTGTGTGATAGGTAAAGTCTTCTCCTCGAAGCGGGTGAATGTGGAGGCGTTTCGGAGTGTTATGAGGGCAGCATGGAGAGTCCATGGGGCGACGCGCATAGAATCTGCTG
GTGAAAATTTATTTGCTATTCAGTTTCGTTCCAAAGGGGAGAAGGGGCGGGTGGTGTCGACAGGTCCATGGGCTTTTGACCGGTCTCTGGTGGTGCTGAAGGAGCCGAGG
TCTGCGGGATCGGTGTTGGATGTGGAGTTTAATGATTGTGCTTTCTGGGTGCAGATTCATAATGTCCCTATGCGTTGGCAGACAAGGGAGATGGCTCGACAGTTGGGAGA
GAAGATCGGAACTGTGGAGGAAGTTGGTGGATCGCAGGGAGGTGACTGGACAGGGCCGATTCTTCGAATAAGGGTCCTTTTGAGGTTGGATCGCCCACTCAGGAGGGGAT
TCCAATTGCAGGATGAGGAGGGCAGGGAGTTTCGGTGTCCGGTCTTGTATGAGCGATTACCAGATTTTTGTTACGACTGTGGCTGCCTGGGGCACTCGAGGAGGGAGTGT
GGGGTGATTAGGGAAGGGCTTGGGGAGTCTGGAAGGCGCACTCAGGAGGGAGGGGTGGATAATGCAGCGATTCGGAGACCTGAGATCCCAAAGCCCCCTACTGAGGGACA
GGTGACTCAGGATTCTGTTCCTGAGAAAGCAGGGAAGGAGGTTGCTAGTGCGTCAGGGGAGGGTGTGGGAGTGCAGGTTAGTGAGCAGGGAGGTCAGGGGGAAGGGGGTA
GTGGAGAGGGGGATAGCGTGGTTGTGGTAGCAAGGGGGGAGGGGATGGCTGATAAGGGAAAGGGGAAGCAGAAGGTGGGGAGTGTCATGGTTGAGAGGAGGGGGATGGAA
GTGATGGGGAAAGGGGGTGGTGGCAGGAACTGGAAGAGAAGGGCGAGAGATGTTTTGTCTGATGTCACCAATCGAGATGTCTCTCTAGGGTCCGTTAATGGCAAGAGGAA
GGCAGATAGTGGGGAGGCCCCGCCAGGGATTATGAGTTTGATTTTCTGGAATGTTCGGGGGTCGGGGTCACCTCGAACATTCAGGCGCCTGACCAAGCTGGTTCAGGAGA
AACGACCTCAGGTGCTCTTCTTATCTGAAACAAAGGTGTCGGCTAGCAGGATGTCTTCGGCGAAGCGTGTGTTGGGGTTTGATAACTGCTTTTGTGTTGATTGTCATGGA
CGGAGTGGTGGGTTGGCCCTCATGTGGGTTTCCTCGATATCCTTCAGCCTTCTTTCATTCTCCAAGAATCACATTGATGGATGGATCTCATGGGGTGGTTGTAGGTGGCG
GCTCACTGGTTTCTATGGTTTCCCTTCAGCTGATATGCACCCCCAGTCATGGTCTCTTCTGTCTAGACTGAGGGGCTGTGCTGAGACACCGTGGCTGATTGGTGGGGATT
TTAACGCCGTTTTATGTCAGGATGAGAAGGAGGGGGGCAGGGATAAGCCATTGTCTGAACTGGCTGCCTTTCAGAGTGTGGTTGATTTGTGTGGTCTACGGGACTTGGGT
TTTGTGGGGGACTGCTTTACTTGGTGCAACAGACGACCAGGAGGGGAGACGATTTATGAGCGGTTGGATCGGGTCTTTGGCAACACGCCTTGGATGGACCTCTATCCGAA
CTATGTGGTTAACCACCTGGATTACAGTCGGTCTGACCATAGGCCAGTGGAGCTCGTTCTTACCCCTCCGCAGCAGTGTTGGTCTCAGAGTGGCCAGCGGATTGTTCGGT
TTGAGGAAACTTGGCTTCGGCAACCGGAATTGCGACAGTTGGTTGGTAGGTCGTGGGCCGCCGGGCCTGGGGAGTCTAGTGATATAACGCCTCTGGCTATTGCCAACAAG
GCTAAGAGATGTATGCAGTCGATGGTCGGTTGGGGTCGATCAAAGTCCGGGAACTTCCCGAGGCGCATTCGTGGTGCGAACCAGAGGGTTCAGTCGGCCATCGCTGGTCT
TAGTACATCGGACTCTCGTGACTTGCTTGTTCAGGCCGAGGCTCAGTTGGAGGAGGTGTTGCAGGAGGAAGAGGTATATTGGAAACAGAGGTCAAGGGAGTTATGGCTTC
GAGAAGGGGATCGCAATACTCGGTGGTTCCACTGTCGAGCCTCTTACCGGCAGAAGCTTAATCGCATTGGGGGATTGGAGGATGATCAGGGAGTGTGGCAGCAGGAGAAG
ACTGCAGTTACTCAGGTGATGACTGATTATTTTCAGCACCTGTTTTCCTCATCAGGTCCGAGTGTTCAGGACTTTGAGGTGGCGTTGCGAGATTTGGAGCCTTCTGTGGA
TGATGAGATGAACCAAGCGCTATTGCGACCTTTTACCAAAGAGGAGGTTCTGTTGGCATTGAAGCAGACGCATCCTAACAAAGCCCCAGGTCCAGATGGGCTGTCGGGGA
GCTTTTACAGGCACCACTGGGACATTGTTGGGCCAGATATTATTCAGAGTTGTTTGGCAGTTCTGAATCTTGGGTGCTCCCCAGGTGCTGTTAATGATACTATGATCGTG
CTCATCCCGAAGGTTAAAGCAGCTCGGCGGATGGTTGATTTTCGACCCATCTCCCTGTGTAATGTGAGTTACAAGCTGATCTCGAAGGTTTTGGTCAACCGCATGAAGTA
TATACTGCCTCAGCTGATCTCGCAGAACCAGAGTGCTTTTATTAACAGGCAGATGTGTTGTGGACAATGCTATTCTGGGGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGATTTAGTTAGTCGATGGAACTCCTGGAGACTGACAGAGGAAGAGGCAGAGATGGTACGGCCAGGGGAGGGGACACCGGTTTTGTCAAGGGGCTCGATAGAACT
GTGTGTGATAGGTAAAGTCTTCTCCTCGAAGCGGGTGAATGTGGAGGCGTTTCGGAGTGTTATGAGGGCAGCATGGAGAGTCCATGGGGCGACGCGCATAGAATCTGCTG
GTGAAAATTTATTTGCTATTCAGTTTCGTTCCAAAGGGGAGAAGGGGCGGGTGGTGTCGACAGGTCCATGGGCTTTTGACCGGTCTCTGGTGGTGCTGAAGGAGCCGAGG
TCTGCGGGATCGGTGTTGGATGTGGAGTTTAATGATTGTGCTTTCTGGGTGCAGATTCATAATGTCCCTATGCGTTGGCAGACAAGGGAGATGGCTCGACAGTTGGGAGA
GAAGATCGGAACTGTGGAGGAAGTTGGTGGATCGCAGGGAGGTGACTGGACAGGGCCGATTCTTCGAATAAGGGTCCTTTTGAGGTTGGATCGCCCACTCAGGAGGGGAT
TCCAATTGCAGGATGAGGAGGGCAGGGAGTTTCGGTGTCCGGTCTTGTATGAGCGATTACCAGATTTTTGTTACGACTGTGGCTGCCTGGGGCACTCGAGGAGGGAGTGT
GGGGTGATTAGGGAAGGGCTTGGGGAGTCTGGAAGGCGCACTCAGGAGGGAGGGGTGGATAATGCAGCGATTCGGAGACCTGAGATCCCAAAGCCCCCTACTGAGGGACA
GGTGACTCAGGATTCTGTTCCTGAGAAAGCAGGGAAGGAGGTTGCTAGTGCGTCAGGGGAGGGTGTGGGAGTGCAGGTTAGTGAGCAGGGAGGTCAGGGGGAAGGGGGTA
GTGGAGAGGGGGATAGCGTGGTTGTGGTAGCAAGGGGGGAGGGGATGGCTGATAAGGGAAAGGGGAAGCAGAAGGTGGGGAGTGTCATGGTTGAGAGGAGGGGGATGGAA
GTGATGGGGAAAGGGGGTGGTGGCAGGAACTGGAAGAGAAGGGCGAGAGATGTTTTGTCTGATGTCACCAATCGAGATGTCTCTCTAGGGTCCGTTAATGGCAAGAGGAA
GGCAGATAGTGGGGAGGCCCCGCCAGGGATTATGAGTTTGATTTTCTGGAATGTTCGGGGGTCGGGGTCACCTCGAACATTCAGGCGCCTGACCAAGCTGGTTCAGGAGA
AACGACCTCAGGTGCTCTTCTTATCTGAAACAAAGGTGTCGGCTAGCAGGATGTCTTCGGCGAAGCGTGTGTTGGGGTTTGATAACTGCTTTTGTGTTGATTGTCATGGA
CGGAGTGGTGGGTTGGCCCTCATGTGGGTTTCCTCGATATCCTTCAGCCTTCTTTCATTCTCCAAGAATCACATTGATGGATGGATCTCATGGGGTGGTTGTAGGTGGCG
GCTCACTGGTTTCTATGGTTTCCCTTCAGCTGATATGCACCCCCAGTCATGGTCTCTTCTGTCTAGACTGAGGGGCTGTGCTGAGACACCGTGGCTGATTGGTGGGGATT
TTAACGCCGTTTTATGTCAGGATGAGAAGGAGGGGGGCAGGGATAAGCCATTGTCTGAACTGGCTGCCTTTCAGAGTGTGGTTGATTTGTGTGGTCTACGGGACTTGGGT
TTTGTGGGGGACTGCTTTACTTGGTGCAACAGACGACCAGGAGGGGAGACGATTTATGAGCGGTTGGATCGGGTCTTTGGCAACACGCCTTGGATGGACCTCTATCCGAA
CTATGTGGTTAACCACCTGGATTACAGTCGGTCTGACCATAGGCCAGTGGAGCTCGTTCTTACCCCTCCGCAGCAGTGTTGGTCTCAGAGTGGCCAGCGGATTGTTCGGT
TTGAGGAAACTTGGCTTCGGCAACCGGAATTGCGACAGTTGGTTGGTAGGTCGTGGGCCGCCGGGCCTGGGGAGTCTAGTGATATAACGCCTCTGGCTATTGCCAACAAG
GCTAAGAGATGTATGCAGTCGATGGTCGGTTGGGGTCGATCAAAGTCCGGGAACTTCCCGAGGCGCATTCGTGGTGCGAACCAGAGGGTTCAGTCGGCCATCGCTGGTCT
TAGTACATCGGACTCTCGTGACTTGCTTGTTCAGGCCGAGGCTCAGTTGGAGGAGGTGTTGCAGGAGGAAGAGGTATATTGGAAACAGAGGTCAAGGGAGTTATGGCTTC
GAGAAGGGGATCGCAATACTCGGTGGTTCCACTGTCGAGCCTCTTACCGGCAGAAGCTTAATCGCATTGGGGGATTGGAGGATGATCAGGGAGTGTGGCAGCAGGAGAAG
ACTGCAGTTACTCAGGTGATGACTGATTATTTTCAGCACCTGTTTTCCTCATCAGGTCCGAGTGTTCAGGACTTTGAGGTGGCGTTGCGAGATTTGGAGCCTTCTGTGGA
TGATGAGATGAACCAAGCGCTATTGCGACCTTTTACCAAAGAGGAGGTTCTGTTGGCATTGAAGCAGACGCATCCTAACAAAGCCCCAGGTCCAGATGGGCTGTCGGGGA
GCTTTTACAGGCACCACTGGGACATTGTTGGGCCAGATATTATTCAGAGTTGTTTGGCAGTTCTGAATCTTGGGTGCTCCCCAGGTGCTGTTAATGATACTATGATCGTG
CTCATCCCGAAGGTTAAAGCAGCTCGGCGGATGGTTGATTTTCGACCCATCTCCCTGTGTAATGTGAGTTACAAGCTGATCTCGAAGGTTTTGGTCAACCGCATGAAGTA
TATACTGCCTCAGCTGATCTCGCAGAACCAGAGTGCTTTTATTAACAGGCAGATGTGTTGTGGACAATGCTATTCTGGGGTTTGA
Protein sequenceShow/hide protein sequence
MEDLVSRWNSWRLTEEEAEMVRPGEGTPVLSRGSIELCVIGKVFSSKRVNVEAFRSVMRAAWRVHGATRIESAGENLFAIQFRSKGEKGRVVSTGPWAFDRSLVVLKEPR
SAGSVLDVEFNDCAFWVQIHNVPMRWQTREMARQLGEKIGTVEEVGGSQGGDWTGPILRIRVLLRLDRPLRRGFQLQDEEGREFRCPVLYERLPDFCYDCGCLGHSRREC
GVIREGLGESGRRTQEGGVDNAAIRRPEIPKPPTEGQVTQDSVPEKAGKEVASASGEGVGVQVSEQGGQGEGGSGEGDSVVVVARGEGMADKGKGKQKVGSVMVERRGME
VMGKGGGGRNWKRRARDVLSDVTNRDVSLGSVNGKRKADSGEAPPGIMSLIFWNVRGSGSPRTFRRLTKLVQEKRPQVLFLSETKVSASRMSSAKRVLGFDNCFCVDCHG
RSGGLALMWVSSISFSLLSFSKNHIDGWISWGGCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCGLRDLG
FVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPQQCWSQSGQRIVRFEETWLRQPELRQLVGRSWAAGPGESSDITPLAIANK
AKRCMQSMVGWGRSKSGNFPRRIRGANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEK
TAVTQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSGSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIV
LIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFINRQMCCGQCYSGV