CuGenDBv2

MS005324 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS005324
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Reverse transcriptase
Genome location: scaffold201:93033..95052
RNA-Seq Expression: MS005324
Synteny: MS005324
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain; IPR043502 - DNA/RNA polymerase superfamily
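
The genome location above can be split into its parts and turned into a span length. The following is a minimal sketch, not part of the database itself: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


seqid, start, end = parse_location("scaffold201:93033..95052")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 2020 bases on scaffold201.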


Homology
GenBank top hits (e value, % identity, alignment)
RYR74850.1 hypothetical protein Ahy_A02g009560 [Arachis hypogaea] | e value: 2.0e-136 | identity: 40.15%
Query:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLY--GSSRGVLSTTTSFPSLPVDANEKLVKEVSLDEVRTAMF
        Q+SR  WI FGDRNTKFFH   +I R+R R+ +L++  G+WVT++  L  LAF  F NLY   S          FP+L       L  EV   E++ ++F
Subjt:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLY--GSSRGVLSTTTSFPSLPVDANEKLVKEVSLDEVRTAMF

Query:  SLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQA
         +G+ KAPG +G QAVFYQQ WE V P +   ++ +    K + + N T I LIPKV    ++ Q RPISLCNVSYK+VTK+++ R+R  +  +VSP Q 
Subjt:  SLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQA

Query:  SFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQG---
        SFVP R   DNII+AQE+IH M   K  + +M +K+DLEKAY+++ WSF+ +TL+ +G+PS     I+ C+ SA+MRVLWNG     FK  RG+RQG   
Subjt:  SFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQG---

Query:  ---------------------------------GPMISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVG
                                         GP +SHL FADD++LFAEAN  Q   +K  L  FC +SGQ V+  KT I+FS NV    +++I++  
Subjt:  ---------------------------------GPMISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVG

Query:  GFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLF
        GF  T N+GKYLG+P+ H +  +S   DII+K+N RL +W A+SLSLAGR+TLVKSVL ++P+Y M     P   CN +D+ CRNFLWG  AQ  +IH  
Subjt:  GFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLF

Query:  NWNLVTKPKFEAGLGLHRFKEFNIALIDKI-LGLISRPNDLWAQVLINKYKGRWGDGWSLVVKQGDSWLLNYLAGAWNEINRWMG-WGIGNGCTAKFWLD
        +W  V + K   GLG+   +  N A + K+  GLI + + LWA+VL +KY G       +  KQ +S L   +   W  + +    W IG+G    FW  
Subjt:  NWNLVTKPKFEAGLGLHRFKEFNIALIDKI-LGLISRPNDLWAQVLINKYKGRWGDGWSLVVKQGDSWLLNYLAGAWNEINRWMG-WGIGNGCTAKFWLD

Query:  RWV-NGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPP
         WV N  S+ + +    +   + N  + D++  +G W+ +     +   +   I ++ PP
Subjt:  RWV-NGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPP

XP_016164673.1 uncharacterized protein LOC107607211 [Arachis ipaensis] | e value: 6.8e-137 | identity: 38.54%
Query:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSSRG--VLSTTTSFPSLPVDANEKLVKEVSLDEVRTAMF
        Q++R +WI FGDRNTKFFH   ++ R++ ++ SL++ +G W+T++ TL  +A   F NLY  +    +      FP +       + + VS +E++ AMF
Subjt:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSSRG--VLSTTTSFPSLPVDANEKLVKEVSLDEVRTAMF

Query:  SLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQA
        S+G++KAP  +G QA+FYQ  W  +   V    ++I +    + + N T I LIPK++  E + Q RPISLCNVSYK++TK+++ RLR  + N+V PNQ 
Subjt:  SLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQA

Query:  SFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQG---
        SFVP R   DNII+ QE+IH M   K  K +M +K+DLEKAY+R++W+F+ ETL  +G P    N  + C+ +A+MRV WNG     F   RG+RQG   
Subjt:  SFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQG---

Query:  ---------------------------------GPMISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVG
                                         GP ISHL FADDI+LFAEAN  Q   +   L  FC +SGQKV+  KT ++FS+NV +  + +I+ V 
Subjt:  ---------------------------------GPMISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVG

Query:  GFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLF
         F  T ++ KYLG+PI+H +      + IINK+++RL +W A+SLSLAGR+TLVKSVL ++P Y MH    P A CN +D+ICRNF+WG   Q  ++HL 
Subjt:  GFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLF

Query:  NWNLVTKPKFEAGLGLHRFKEFNIALIDKI-LGLISRPNDLWAQVLINKYKGRWGDGWSLVVK-QGDSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWLD
        NW  + +PK   GLG+    + N A + K   GLI+R +DLWA++L +KY G   D    V K + +S L   +  +W+ + R   W +G+G   +FW  
Subjt:  NWNLVTKPKFEAGLGLHRFKEFNIALIDKI-LGLISRPNDLWAQVLINKYKGRWGDGWSLVVK-QGDSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWLD

Query:  RWVNGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPP
         WV G   +             +  + D++  +G W+       + + +   I +I+PP
Subjt:  RWVNGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPP

XP_025635616.1 uncharacterized protein LOC112729667 [Arachis hypogaea] | e value: 2.1e-138 | identity: 39.34%
Query:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGS--SRGVLSTTTSFPSLPVDANEKLVKEVSLDEVRTAMF
        Q+SRC+WI FGDRNTKFFH   ++ R++ ++ SL D  G  +T++ TL ++AF  + NLY           T  FP+L  +   ++  ++S  E++ A+F
Subjt:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGS--SRGVLSTTTSFPSLPVDANEKLVKEVSLDEVRTAMF

Query:  SLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQA
        ++G+FKAPG +G QAVFYQ  W+ V   +   +  I    + + + N T I LIPKV+   S+ Q RPISLCNVSYK+VTK+++ R+R  +  +V P Q 
Subjt:  SLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQA

Query:  SFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQG---
        SFVP RH  +NII AQE+IH M   K  K +M +K+DLEKAY+R+ WSF+ +TL  VG P+ +   IM C+ +A+MRVLWNG     F   RG+RQG   
Subjt:  SFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQG---

Query:  ---------------------------------GPMISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVG
                                         GP +SHL FADD++LFAEAN QQ++ +K  L  FC++SGQK++N KT +YFSKNV  + + +I+E  
Subjt:  ---------------------------------GPMISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVG

Query:  GFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLF
         F  T N+GKYLG+P++H +       DIINK+N RL +W A+SLS AGR+TLVKSVL ++P Y M      V+ CN +D+ CR+FLWG   Q  + HL 
Subjt:  GFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLF

Query:  NWNLVTKPKFEAGLGLHRFKEFNIALIDKI-LGLISRPNDLWAQVLINKYKGRWGDGWSLVVKQGDSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWLDR
        NW+ V +PK   GLG+    + N   + K   GL+ + + LWA VL +KYK       ++  K   S +   +  +W ++ +   W IG+G   KFW   
Subjt:  NWNLVTKPKFEAGLGLHRFKEFNIALIDKI-LGLISRPNDLWAQVLINKYKGRWGDGWSLVVKQGDSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWLDR

Query:  WV-NGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPPDETINLPDYPVW
        WV N  S+ + +        E +  + D++  +G W+     + + + +   IA+I+PP     +PD+  W
Subjt:  WV-NGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPPDETINLPDYPVW

XP_025664883.1 uncharacterized protein LOC112763420 [Arachis hypogaea] | e value: 6.8e-137 | identity: 38.54%
Query:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSSRG--VLSTTTSFPSLPVDANEKLVKEVSLDEVRTAMF
        Q++R +WI FGDRNTKFFH   ++ R++ ++ SL++ +G W+T++ TL  +A   F NLY  +    +      FP +       + + VS +E++ AMF
Subjt:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSSRG--VLSTTTSFPSLPVDANEKLVKEVSLDEVRTAMF

Query:  SLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQA
        S+G++KAP  +G QA+FYQ  W  +   V    ++I +    + + N T I LIPK++  E + Q RPISLCNVSYK++TK+++ RLR  + N+V PNQ 
Subjt:  SLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQA

Query:  SFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQG---
        SFVP R   DNII+ QE+IH M   K  K +M +K+DLEKAY+R++W+F+ ETL  +G P    N  + C+ +A+MRV WNG     F   RG+RQG   
Subjt:  SFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQG---

Query:  ---------------------------------GPMISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVG
                                         GP ISHL FADDI+LFAEAN  Q   +   L  FC +SGQKV+  KT ++FS+NV +  + +I+ V 
Subjt:  ---------------------------------GPMISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVG

Query:  GFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLF
         F  T ++ KYLG+PI+H +      + IINK+++RL +W A+SLSLAGR+TLVKSVL ++P Y MH    P A CN +D+ICRNF+WG   Q  ++HL 
Subjt:  GFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLF

Query:  NWNLVTKPKFEAGLGLHRFKEFNIALIDKI-LGLISRPNDLWAQVLINKYKGRWGDGWSLVVK-QGDSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWLD
        NW  + +PK   GLG+    + N A + K   GLI+R +DLWA++L +KY G   D    V K + +S L   +  +W+ + R   W +G+G   +FW  
Subjt:  NWNLVTKPKFEAGLGLHRFKEFNIALIDKI-LGLISRPNDLWAQVLINKYKGRWGDGWSLVVK-QGDSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWLD

Query:  RWVNGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPP
         WV G   +             +  + D++  +G W+       + + +   I +I+PP
Subjt:  RWVNGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPP

XP_028785542.1 uncharacterized protein LOC114741444 [Prosopis alba] | e value: 4.4e-136 | identity: 37.99%
Query:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLY--GSSRGVLSTTTSFPSLPVDANEKLVKEVSLDEVRTAMF
        Q+SR  WI  GDRNT+++H + +  R+R +I  LKD +G W+  +E L++LA E ++NL+    + G L    S+P +   + ++L + V+ +E+R AMF
Subjt:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLY--GSSRGVLSTTTSFPSLPVDANEKLVKEVSLDEVRTAMF

Query:  SLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQA
        S+GAFKAPG +GF AVF+Q+ W +V   V+  ++ +      + + N+T IVLIPKV  PE++ Q RPI+LCNV YK ++K++  RL+  LG +V+PNQ 
Subjt:  SLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQA

Query:  SFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQG---
        SFVP RHIQDNIIV  EL+H M  ++  K F  +KVDLEKAY+RV W F+ E LE +G+  K ++ +M C+ S   R+LWNG  T AF   RG+RQG   
Subjt:  SFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQG---

Query:  ---------------------------------GPMISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVG
                                         GP ++HLMFADD+LLF EA  QQI+ VK  L  F  ASGQKVN +K+ I FSKNV    ++ +A   
Subjt:  ---------------------------------GPMISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVG

Query:  GFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLF
        GF VT ++G +LG+P  +GR +      ++ KV  RL  W    LS+AGR TL KSV+ A+P Y+M + K P  +   +++  R F+WGH  ++ ++HL 
Subjt:  GFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLF

Query:  NWNLVTKPKFEAGLGLHRFKEFNIALIDKIL-GLISRPNDLWAQVLINKYKGRWGDGWSLVVKQGDSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWLDR
        NW  V KP+   GLG+   +  N A + K++  L S  + LWA VL  KY        ++ V++  S +   +   W ++N  + WG+ +G    FW D 
Subjt:  NWNLVTKPKFEAGLGLHRFKEFNIALIDKIL-GLISRPNDLWAQVLINKYKGRWGDGWSLVVKQGDSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWLDR

Query:  WVNGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPP
        W     ++      ++   + N  + D V + G W+W I  + ++    + + ++ PP
Subjt:  WVNGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPP
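
The % identity column can be related to the alignment blocks above: in the middle line of each block, a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch of that counting, assuming the usual BLAST midline convention; the function name `alignment_stats` is hypothetical, and the input is only the first nine columns of the first GenBank alignment, so the result differs from the 40.15% reported for the full alignment.

```python
def alignment_stats(query: str, midline: str) -> tuple[float, float]:
    """Return (% identity, % positives) for one aligned block.

    Assumes the BLAST midline convention: a letter for an identical
    residue, '+' for a conservative substitution, a space otherwise.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have equal length")
    if not query:
        raise ValueError("empty alignment")
    # An identity is a midline letter that repeats the query residue.
    identities = sum(
        1 for q, m in zip(query, midline) if m not in (" ", "+") and m == q
    )
    # Positives are identities plus conservative substitutions.
    positives = identities + midline.count("+")
    columns = len(query)
    return 100.0 * identities / columns, 100.0 * positives / columns


# First nine alignment columns of the RYR74850.1 hit.
identity, positive = alignment_stats("QRSRCEWIW", "Q+SR  WI ")
print(f"{identity:.2f}% identity, {positive:.2f}% positives")
```

For a whole hit, the same counts would be summed over every block before dividing by the total number of alignment columns, gaps included.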

TrEMBL top hits (e value, % identity, alignment)
A0A2Z6LVG1 Reverse transcriptase domain-containing protein | e value: 1.8e-135 | identity: 38.96%
Query:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSSRGVLSTTTSFPSLPVD---ANEKLVKEVSLDEVRTAM
        Q+SR +WI  GDRNTK++H++ I+ R++ +I SL+++ G WV E+E L  +    +  LY   + +     S+ + P++    ++ L   VS  E + A+
Subjt:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSSRGVLSTTTSFPSLPVD---ANEKLVKEVSLDEVRTAM

Query:  FSLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQ
        F +G  KAPG +G+ A+F+QQ W+ +  S+ ++V  +      I   NNT +V+IPKV  P  I QFRPI+LCNV+YK++TK+I  R++  L  I+SP Q
Subjt:  FSLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQ

Query:  ASFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQG--
        +SF+P R I  NI+VAQE++H M K+K  K FM +K+DLEKAY+R++W+F+   L     P KL N I +C+ S   ++LWNG  T  F   RG+RQG  
Subjt:  ASFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQG--

Query:  ----------------------------------GPMISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEV
                                          GP ISHL+FADD+LLFAEA+ +Q+  V   L  FC ASGQK+NN KT IYFSKNV    +  I + 
Subjt:  ----------------------------------GPMISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEV

Query:  GGFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHL
         GF   +++GKYLG  I  GR      Q II K+  +L  W    LSLAGR TL+KSVL ++P Y M   K P  +C  +++I R FLWG   QK + HL
Subjt:  GGFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHL

Query:  FNWNLVTKPKFEAGLGLHRFKEFNIALIDKIL-GLISRPNDLWAQVLINKYKGRWGDGWSLVVKQG-DSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWL
         NW +   PK + GLG+    + NIA + K+L  +I+RP+DLW +VL NKY GR  D    V  Q  DS L   LAG W++  +++ W +G+G    FW+
Subjt:  FNWNLVTKPKFEAGLGLHRFKEFNIALIDKIL-GLISRPNDLWAQVLINKYKGRWGDGWSLVVKQG-DSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWL

Query:  DRWVNGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPPDETINLPDYPVWIGS
        DRW    S +     + V D      +KD +   G W+     N +  ++   + ++  P +T + PD   W G+
Subjt:  DRWVNGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPPDETINLPDYPVWIGS

A0A2Z6N471 Uncharacterized protein | e value: 8.2e-136 | identity: 37.88%
Query:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSSR--GVLSTTTSFPSLPVDANEKLVKEVSLDEVRTAMF
        Q+SR +W+  GDRNT++FH    I R+  ++++++++ G WVT++  +  +A + +K L+ ++          +FP++       L + V+  E+  A+F
Subjt:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSSR--GVLSTTTSFPSLPVDANEKLVKEVSLDEVRTAMF

Query:  SLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQA
        S+GA KAPGP+GFQ++FY   W  +  +    +++I E    +++ N T + LIPKV+  +++ QFRPISLCNVSYK+VTK+I+ RLR  L  +V P Q 
Subjt:  SLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQA

Query:  SFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQ----
        SF+P+R+  DNII++QE+ H M   +  K +M +K+DLEKAY+R++W F+ ETLE +G+P ++ N I  C+ ++KMRVLWNG     F   RG+RQ    
Subjt:  SFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQ----

Query:  --------------------------------GGPMISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVG
                                        GGP ISHL +ADD+LLF EA   Q + +K +L  FC +SGQKV+  KT I+FSKNV +  +Q+++E  
Subjt:  --------------------------------GGPMISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVG

Query:  GFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLF
        GF  T N+GKYLG+PI+H +   +  Q I++KV  RL NW A +LS AGR TL KSV+QALP Y M     P ++C+++D+ CR+F+WG   +  +IHL 
Subjt:  GFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLF

Query:  NWNLVTKPKFEAGLGLHRFKEFNIALIDKI-LGLISRPNDLWAQVLINKYKGRWGDGWSLVV--KQGDSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWL
        +W+ +  PK E GLG+      N A + +    L S+PN LW  V+ +KY  R G     ++  ++  S L N +   W +      W +G+G   KFWL
Subjt:  NWNLVTKPKFEAGLGLHRFKEFNIALIDKI-LGLISRPNDLWAQVLINKYKGRWGDGWSLVV--KQGDSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWL

Query:  DRWVNGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPP
        D WV     +  +   DVP    N  +  +V + G W  ++F + +  SV   I  + PP
Subjt:  DRWVNGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPP

A0A445EHD6 Uncharacterized protein | e value: 9.6e-137 | identity: 40.15%
Query:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLY--GSSRGVLSTTTSFPSLPVDANEKLVKEVSLDEVRTAMF
        Q+SR  WI FGDRNTKFFH   +I R+R R+ +L++  G+WVT++  L  LAF  F NLY   S          FP+L       L  EV   E++ ++F
Subjt:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLY--GSSRGVLSTTTSFPSLPVDANEKLVKEVSLDEVRTAMF

Query:  SLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQA
         +G+ KAPG +G QAVFYQQ WE V P +   ++ +    K + + N T I LIPKV    ++ Q RPISLCNVSYK+VTK+++ R+R  +  +VSP Q 
Subjt:  SLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQA

Query:  SFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQG---
        SFVP R   DNII+AQE+IH M   K  + +M +K+DLEKAY+++ WSF+ +TL+ +G+PS     I+ C+ SA+MRVLWNG     FK  RG+RQG   
Subjt:  SFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQG---

Query:  ---------------------------------GPMISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVG
                                         GP +SHL FADD++LFAEAN  Q   +K  L  FC +SGQ V+  KT I+FS NV    +++I++  
Subjt:  ---------------------------------GPMISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVG

Query:  GFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLF
        GF  T N+GKYLG+P+ H +  +S   DII+K+N RL +W A+SLSLAGR+TLVKSVL ++P+Y M     P   CN +D+ CRNFLWG  AQ  +IH  
Subjt:  GFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLF

Query:  NWNLVTKPKFEAGLGLHRFKEFNIALIDKI-LGLISRPNDLWAQVLINKYKGRWGDGWSLVVKQGDSWLLNYLAGAWNEINRWMG-WGIGNGCTAKFWLD
        +W  V + K   GLG+   +  N A + K+  GLI + + LWA+VL +KY G       +  KQ +S L   +   W  + +    W IG+G    FW  
Subjt:  NWNLVTKPKFEAGLGLHRFKEFNIALIDKI-LGLISRPNDLWAQVLINKYKGRWGDGWSLVVKQGDSWLLNYLAGAWNEINRWMG-WGIGNGCTAKFWLD

Query:  RWV-NGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPP
         WV N  S+ + +    +   + N  + D++  +G W+ +     +   +   I ++ PP
Subjt:  RWV-NGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPP

A0A5B6UTQ5 Reverse transcriptase | e value: 8.2e-136 | identity: 39.46%
Query:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSSRGVLSTTTS--FPSLPVDANEKLVKEVSLDEVRTAMF
        Q++RC+W+ FGDRNTKFFH R +  RK  RI +++ S G W +E+ETLR  A + F++LYG     +S   S  FPSL     + L K +  DE++ A+F
Subjt:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSSRGVLSTTTS--FPSLPVDANEKLVKEVSLDEVRTAMF

Query:  SLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQA
         +   KAPG +G+ A+F+Q  W+LV  +V E V+ I    K  ++ NNT IVLIPK  HPE   QFRPISLC+V YKLV K+I+ RL+    N +SP QA
Subjt:  SLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQA

Query:  SFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQG---
         F+  R+I DN+I+AQE+IH M   K  K +M +K+DLEKAY+R+SW F++ +L   GIP  L+  IM+ + S+ M++LWNG P+  FK  RG+RQG   
Subjt:  SFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQG---

Query:  ---------------------------------GPMISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVG
                                         GP +SHL FADD+++F +A   Q + +K +L  FCN SG ++++ K+ I+FSK V    + QI++  
Subjt:  ---------------------------------GPMISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVG

Query:  GFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLF
        GF    N+GKYLG+P++H R   S L  +I+KV  +L NW A  LSLAGR TL +SVL  +P+Y M     P  VC+++++I R F+WG    KS+  L 
Subjt:  GFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLF

Query:  NWNLVTKPKFEAGLGLHRFKEFNIALIDKI-LGLISRPNDLWAQVLINKYKGRWGDGW-----SLVVKQGDSWLLNYLAGAWNEINRWMGWGIGNGCTAK
         W  + +PK   GLG     + N + + KI   L++R +DLW +VL +KY      GW       + +   S L   L+ AW  ++  + W +GNG T +
Subjt:  NWNLVTKPKFEAGLGLHRFKEFNIALIDKI-LGLISRPNDLWAQVLINKYKGRWGDGW-----SLVVKQGDSWLLNYLAGAWNEINRWMGWGIGNGCTAK

Query:  FWLDRWVNGESIIEPMDG---SDVPDFER---NRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPP
         W D W+       P  G   S VP + R   +  +KD+VL+ G WN ++    + + +   I SI PP
Subjt:  FWLDRWVNGESIIEPMDG---SDVPDFER---NRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPP

A0A6N2NH64 Uncharacterized protein | e value: 8.2e-136 | identity: 37.8%
Query:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSS----RGVLSTTTSFPSLPVDANEKLVKEVSLDEVRTA
        Q+SRC+WI  GDRNT +FHT+ II R++ +I +LK+ S  W+ EE  L+ +A + F+ L+ +     +G       FP +     EKL+  V  +E++TA
Subjt:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSS----RGVLSTTTSFPSLPVDANEKLVKEVSLDEVRTA

Query:  MFSLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPN
        +F +  +KAPGP+G+Q VFYQ  W +V  SV  +V+DI E    + + N++ +VLIPK+  PE + QFRPI LCNV +K+VTK+I  R++  + N++S  
Subjt:  MFSLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPN

Query:  QASFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQGG
        Q+SFVP RH+ DNII+AQE+IH M +LK  +  M +KVDLEKAY+R+SW+F+ +TL     PS L   IM C+ +  + VLWNG  +  F   RG+RQG 
Subjt:  QASFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQGG

Query:  PM------------------------------------ISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAE
        P+                                    ISHL FADD+LLFAEA+  Q+ +++  L  FC ASGQ+V+ +KT I+FSKNV     + I+ 
Subjt:  PM------------------------------------ISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAE

Query:  VGGFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIH
          GF VT ++GKY+G+P++H R        ++ K+  +L  W A +LSLAGR TL K+VL  +P Y+M     P   C ++++ICR F+WG    + +IH
Subjt:  VGGFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIH

Query:  LFNWNLVTKPKFEAGLGLHRFKEFNIALIDKILGLISRPNDLWAQVLINKY-KGRWGDGWSLVVKQGDSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWL
        L NW  + +PK E GLG+ + K  N A + K+   I++ N++W + L  KY K    D       + DS L   +   W+ + +   W +GNG    FW 
Subjt:  LFNWNLVTKPKFEAGLGLHRFKEFNIALIDKILGLISRPNDLWAQVLINKY-KGRWGDGWSLVVKQGDSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWL

Query:  DRWVNGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPPDETINLPDYPVW
        D W+     +      ++P    N  + D V E G W W  FA+ +   + + +A   PP + + + D  +W
Subjt:  DRWVNGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPPDETINLPDYPVW

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein | e value: 3.8e-37 | identity: 23.25%
Query:  WFGDRNTKFFHTRVIIARK---RIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSSRGVLSTTTSF------PSLPVDANEKLVKEVSLDEVRTAMFS
        WF +R  K       + +K   + +ID++K+  G   T+   ++    E++K+LY +    L    +F      P L  +  E L + ++  E+   + S
Subjt:  WFGDRNTKFFHTRVIIARK---RIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSSRGVLSTTTSF------PSLPVDANEKLVKEVSLDEVRTAMFS

Query:  LGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQ-FRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQA
        L   K+PGP+GF A FYQ+  E + P +L+  + I + G   +      I+LIPK     +  + FRPISL N+  K++ K+++ R+++++  ++  +Q 
Subjt:  LGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQ-FRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQA

Query:  SFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQGGPM
         F+P      NI  +  +I  +++ K  K  +++ +D EKA++++   F+ +TL  +GI       I          ++ NG    AF +  G RQG P+
Subjt:  SFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQGGPM

Query:  -------------------------------ISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVGGFNVT
                                       +   +FADD++++ E      + +  ++ NF   SG K+N  K+  +   N      Q + E+  F + 
Subjt:  -------------------------------ISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVGGFNVT

Query:  SNVGKYLGIPI---IHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHI--FKFPVAVCNKLDQICRNFLWGHIAQKSRIHLF
        S   KYLGI +   +   F+ +  + ++ ++      W     S  GR  +VK  +     Y  +    K P+    +L++    F+W    +++RI   
Subjt:  SNVGKYLGIPI---IHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHI--FKFPVAVCNKLDQICRNFLWGHIAQKSRIHLF

Query:  NWNLVTKPKFEAGLGLHRFKEFNIALIDK
          +++++     G+ L  FK +  A + K
Subjt:  NWNLVTKPKFEAGLGLHRFKEFNIALIDK

P08548 LINE-1 reverse transcriptase homolog | e value: 2.7e-35 | identity: 24.64%
Query:  WFGDRNTKFFHTRVIIAR-KRIR--IDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSSRGVLSTTTSF------PSLPVDANEKLVKEVSLDEVRTAMFS
        WF ++  K       + R KR++  I S+++ +    T+   ++ +  E++K LY      L     +      P L     E L + +S  E+ + + +
Subjt:  WFGDRNTKFFHTRVIIAR-KRIR--IDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSSRGVLSTTTSF------PSLPVDANEKLVKEVSLDEVRTAMFS

Query:  LGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKV-KHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQA
        L   K+PGP+GF + FYQ   E + P +L   ++I + G   +      I LIPK  K P     +RPISL N+  K++ K+++ R+++++  I+  +Q 
Subjt:  LGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKV-KHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQA

Query:  SFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQGGPM
         F+P      NI  +  +I  ++KLK  K  M++ +D EKA++ +   F+  TL+ +GI       I          ++ NG    +F +  G RQG P+
Subjt:  SFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQGGPM

Query:  -------------------------------ISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVGGFNVT
                                       I   +FADD++++ E       ++  V+  + N SG K+N  K+V +   N   +A++ + +   F V 
Subjt:  -------------------------------ISHLMFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVGGFNVT

Query:  SNVGKYLGIPI------IHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVK-SVL-QALPTYAMHIFKFPVAVCNKLDQICRNFLW
            KYLG+ +      ++     ++ ++I   VN     W     S  GR  +VK S+L +A+  +     K P++    L++I  +F+W
Subjt:  SNVGKYLGIPI------IHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVK-SVL-QALPTYAMHIFKFPVAVCNKLDQICRNFLW

P0C2F6 Putative ribonuclease H protein At1g65750 | e value: 5.6e-33 | identity: 32.49%
Query:  IPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLFNWNLVTKPKFEAG
        +P++  R       +I+ +V+ R+  W   +LS AGR TL K+VL ++P ++M     P ++ N+LDQ+ R FLWG  A+K + HL  W+ V  PK E G
Subjt:  IPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLFNWNLVTKPKFEAG

Query:  LGLHRFKEFNIALIDKI-LGLISRPNDLWAQVLINKY-------------KGRWGDGWSLVVKQGDSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWLDR
        LG+   K  N ALI K+   L+   N LW  VL  KY             KG W   W  +             G  + ++  +GW  G+G   +FW DR
Subjt:  LGLHRFKEFNIALIDKI-LGLISRPNDLWAQVLINKY-------------KGRWGDGWSLVVKQGDSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWLDR

Query:  WVNGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNW
        WV+G+ ++E +D  + P        KD  +   GW++
Subjt:  WVNGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNW

P11369 LINE-1 retrotransposable element ORF2 protein | e value: 3.1e-39 | identity: 26.61%
Query:  RKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSSRGVLSTTTSF------PSLPVDANEKLVKEVSLDEVRTAMFSLGAFKAPGPNGFQAVFYQQN
        R +I I+ +++  G   T+ E ++N     +K LY +    L     F      P L  D  + L   +S  E+   + SL   K+PGP+GF A FYQ  
Subjt:  RKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSSRGVLSTTTSF------PSLPVDANEKLVKEVSLDEVRTAMFSLGAFKAPGPNGFQAVFYQQN

Query:  WELVAPSVLEHVRDILERGKTIDKANNTFIVLIPK-VKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQASFVPSRHIQDNIIVAQELIH
         E + P + +    I   G   +      I LIPK  K P  I  FRPISL N+  K++ K+++ R++E++  I+ P+Q  F+P      NI  +  +IH
Subjt:  WELVAPSVLEHVRDILERGKTIDKANNTFIVLIPK-VKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQASFVPSRHIQDNIIVAQELIH

Query:  KMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQGGPMISHL----------------
         ++KLK  K  M++ +D EKA++++   F+ + LE  GI     N I          +  NG    A  +  G RQG P+  +L                
Subjt:  KMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQGGPMISHL----------------

Query:  ---------------MFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIY-FSKNVPYEAQQQIAEVGGFNVTSNVGKYLGIPI------IH
                       + ADD++++         E+ +++ +F    G K+N++K++ + ++KN   +A+++I E   F++ +N  KYLG+ +      ++
Subjt:  ---------------MFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIY-FSKNVPYEAQQQIAEVGGFNVTSNVGKYLGIPI------IH

Query:  GRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHI--FKFPVAVCNKLDQICRNFLWGHIAQKSRI
         +   S+ ++I  K +LR   W     S  GR  +VK  +     Y  +    K P    N+L+     F+W +  +K RI
Subjt:  GRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHI--FKFPVAVCNKLDQICRNFLWGHIAQKSRI

P14381 Transposon TX1 uncharacterized 149 kDa protein | e value: 1.3e-29 | identity: 23.61%
Query:  RSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYG----SSRGVLSTTTSFPSLPVDANEKLVKEVSLDEVRTAM
        RSR + +   DR ++FF+        R +I  L    G+ + + E +R+ A   ++NL+     S           P +     E+L   ++LDE+  A+
Subjt:  RSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYG----SSRGVLSTTTSFPSLPVDANEKLVKEVSLDEVRTAM

Query:  FSLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQ
          +   K+PG +G    F+Q  W+ + P     + +  ++G+         + L+PK      I  +RP+SL +  YK+V K ISLRL+  L  ++ P+Q
Subjt:  FSLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQ

Query:  ASFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQGGP
        +  VP R I DN+ + ++L+H   +   +  F+   +D EKA++RV   +L  TL+      +    +     SA+  V  N   T      RGVRQG P
Subjt:  ASFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQGGP

Query:  MISHL-------------------------------MFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVGGFNV
        +   L                                +ADD++L A+ +   +E  +     +  AS  ++N SK+      ++  +           + 
Subjt:  MISHL-------------------------------MFADDILLFAEANFQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVGGFNV

Query:  TSNVGKYLGIPIIHGRFRASV-LQDIINKVNLRLCNWS--AASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLW
         S + KYLG+ +    +  S    ++   V  RL  W   A  LS+ GR+ ++  ++ +   Y +           K+ +   +FLW
Subjt:  TSNVGKYLGIPIIHGRFRASV-LQDIINKVNLRLCNWS--AASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLW

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein | e value: 1.7e-21 | identity: 30.65%
Query:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSSRGVLSTTTSFPSL----PVDANEKLVKEVSL----DE
        Q+SR +W+  GD NT+FFH  ++  + +  I  L+      V     ++ +   ++ +L GS   +L T  S   +    P   N+ L   +S      E
Subjt:  QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSSRGVLSTTTSFPSL----PVDANEKLVKEVSL----DE

Query:  VRTAMFSLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVT
        +  A+F++   KAPGP+ F A F+ ++W +V  S +  V++    G  + + N T I LIPKV   + +  FRP+S C V YK++T
Subjt:  VRTAMFSLGAFKAPGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVT

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 2.6e-17 | identity: 31.62%
Query:  KYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLFNWNLVTKPK
        +YLG+P++  +   S    ++ K+ +R+  W+A  LS AGR  L+ SV+ +L  + M  F+ P A   ++D IC +FLW      ++     W+ V  PK
Subjt:  KYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVLQALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLFNWNLVTKPK

Query:  FEAGLGLHRFKEFNIALIDKILGLISRPNDLWAQVL
         E GLG+   KE N      I G  +  + +W ++L
Subjt:  FEAGLGLHRFKEFNIALIDKILGLISRPNDLWAQVL

AT3G25720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    6.0e-06    34.31
Query:  VTKPKFEAGLGLHRFKEFNIALIDKIL-GLISRPNDLWAQVLINKYKGRWGDG-----WSLVVKQGDSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWLD
        V  PK E GLGL  F E+N  L  K++  L S    LW       +    GD      W+      DSW    L        R++   IGNG TA+FW D
Subjt:  VTKPKFEAGLGLHRFKEFNIALIDKIL-GLISRPNDLWAQVLINKYKGRWGDG-----WSLVVKQGDSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWLD

Query:  RW
         W
Subjt:  RW

AT4G20520.1 RNA binding;RNA-directed DNA polymerases    2.7e-14    46.67
Query:  RLREYLGNIVSPNQASFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIP
        RL+  + N++ P QASF+P R   DNI+  QE +H M + K  K +ML+K+DLEKAY+R+ W +L +TL   G P
Subjt:  RLREYLGNIVSPNQASFVPSRHIQDNIIVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIP

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    1.1e-10    28.57
Query:  ALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLFNWNLVTKPK-FEAGLGLHRFKEFNIALIDK-ILGLISRPNDLWAQVLINKYKGRWGDGW
        ALP YAM  F+    +C KL      F W     K +I    W  + K K  + GLG      FN AL+ K    +I +P+ L +++L ++Y        
Subjt:  ALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLFNWNLVTKPK-FEAGLGLHRFKEFNIALIDK-ILGLISRPNDLWAQVLINKYKGRWGDGW

Query:  SLVVKQGDSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWLDRWVNGESIIEPMD
           V    S+    +      ++R +   IG+G   K WLDRW+  E+ + P++
Subjt:  SLVVKQGDSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWLDRWVNGESIIEPMD


Sequences
CDS sequence
CAGAGATCTAGATGTGAGTGGATCTGGTTTGGTGATAGAAATACAAAATTTTTCCATACAAGAGTCATCATTGCTAGGAAAAGGATTCGTATAGATTCCTTGAAG
GATAGTAGTGGAAGTTGGGTAACAGAGGAAGAAACACTGAGAAATTTAGCCTTTGAACATTTTAAAAATCTTTATGGCAGCTCTAGGGGGGTGTTATCCACGACA
ACTTCATTCCCTTCCCTCCCTGTGGATGCAAATGAGAAACTAGTCAAAGAAGTTTCCTTGGATGAGGTCAGAACAGCAATGTTCTCCTTAGGTGCCTTTAAAGCG
CCTGGTCCAAACGGATTCCAAGCCGTCTTTTATCAGCAAAATTGGGAGCTTGTTGCTCCTTCAGTACTGGAACATGTTAGAGATATTTTGGAAAGGGGAAAAACG
ATTGACAAGGCCAATAACACTTTTATAGTTCTAATACCAAAAGTGAAACACCCTGAATCGATTCTTCAGTTCCGACCAATTAGTCTTTGTAACGTTTCTTACAAA
CTTGTTACTAAGCTGATTTCACTACGATTGAGAGAATATTTAGGAAATATTGTCTCCCCTAACCAAGCTAGTTTTGTCCCGAGTAGGCACATCCAAGATAATATT
ATTGTAGCCCAAGAGTTGATTCATAAAATGGACAAATTAAAAAGAACTAAGAAGTTTATGTTAGTTAAGGTGGACTTAGAAAAGGCTTATGAAAGAGTTTCTTGG
AGCTTTCTGAACGAAACTCTTGAGCTAGTTGGCATTCCATCCAAGTTAAAGAATGCTATTATGGACTGTGTATGCTCAGCAAAAATGCGGGTTCTATGGAATGGG
GGGCCTACGGGCGCTTTTAAAATGCATAGAGGGGTTAGGCAAGGGGGACCGATGATCTCTCACCTAATGTTTGCGGATGATATATTATTGTTTGCAGAGGCCAAC
TTTCAACAAATTGAGGAAGTGAAGTCAGTTCTAGGAAATTTTTGTAATGCTTCAGGACAAAAAGTTAATAATTCTAAAACAGTCATTTATTTTTCAAAAAATGTT
CCATATGAGGCTCAACAACAGATAGCAGAAGTAGGAGGGTTTAATGTTACATCTAATGTTGGGAAGTACTTAGGAATTCCAATTATACATGGAAGATTTAGAGCA
TCGGTCCTTCAGGACATAATAAACAAAGTTAATCTTCGTCTTTGTAACTGGTCTGCGGCTTCTCTATCTTTAGCAGGGCGCTCGACATTAGTTAAATCGGTCCTT
CAGGCTTTGCCAACATATGCTATGCATATTTTTAAGTTTCCAGTAGCAGTTTGTAACAAACTTGACCAGATTTGTAGGAACTTCTTATGGGGGCATATCGCTCAA
AAATCTAGAATACACCTTTTTAATTGGAATTTGGTCACAAAGCCAAAGTTTGAGGCGGGACTTGGGTTACACCGATTTAAAGAATTTAACATAGCTCTGATTGAC
AAAATTCTGGGGTTAATCAGCCGACCAAACGATCTTTGGGCTCAAGTGCTCATAAATAAGTATAAAGGAAGGTGGGGGGATGGGTGGAGCTTGGTTGTAAAGCAA
GGAGACTCTTGGCTCTTGAATTATTTAGCTGGAGCATGGAATGAAATTAATCGTTGGATGGGTTGGGGAATTGGGAATGGTTGTACAGCTAAATTTTGGCTAGAT
AGATGGGTCAATGGGGAAAGCATTATAGAACCTATGGATGGAAGTGATGTTCCAGATTTTGAAAGAAATAGACCTATTAAGGATTATGTTTTAGAAACGGGAGGC
TGGAATTGGGAAATCTTTGCAAATAGAGTTAATCAATCAGTATCGTTAGCTATTGCAAGTATAAATCCGCCAGATGAGACAATCAATCTACCTGATTACCCAGTT
TGGATTGGTTCTAGG
mRNA sequence
CAGAGATCTAGATGTGAGTGGATCTGGTTTGGTGATAGAAATACAAAATTTTTCCATACAAGAGTCATCATTGCTAGGAAAAGGATTCGTATAGATTCCTTGAAG
GATAGTAGTGGAAGTTGGGTAACAGAGGAAGAAACACTGAGAAATTTAGCCTTTGAACATTTTAAAAATCTTTATGGCAGCTCTAGGGGGGTGTTATCCACGACA
ACTTCATTCCCTTCCCTCCCTGTGGATGCAAATGAGAAACTAGTCAAAGAAGTTTCCTTGGATGAGGTCAGAACAGCAATGTTCTCCTTAGGTGCCTTTAAAGCG
CCTGGTCCAAACGGATTCCAAGCCGTCTTTTATCAGCAAAATTGGGAGCTTGTTGCTCCTTCAGTACTGGAACATGTTAGAGATATTTTGGAAAGGGGAAAAACG
ATTGACAAGGCCAATAACACTTTTATAGTTCTAATACCAAAAGTGAAACACCCTGAATCGATTCTTCAGTTCCGACCAATTAGTCTTTGTAACGTTTCTTACAAA
CTTGTTACTAAGCTGATTTCACTACGATTGAGAGAATATTTAGGAAATATTGTCTCCCCTAACCAAGCTAGTTTTGTCCCGAGTAGGCACATCCAAGATAATATT
ATTGTAGCCCAAGAGTTGATTCATAAAATGGACAAATTAAAAAGAACTAAGAAGTTTATGTTAGTTAAGGTGGACTTAGAAAAGGCTTATGAAAGAGTTTCTTGG
AGCTTTCTGAACGAAACTCTTGAGCTAGTTGGCATTCCATCCAAGTTAAAGAATGCTATTATGGACTGTGTATGCTCAGCAAAAATGCGGGTTCTATGGAATGGG
GGGCCTACGGGCGCTTTTAAAATGCATAGAGGGGTTAGGCAAGGGGGACCGATGATCTCTCACCTAATGTTTGCGGATGATATATTATTGTTTGCAGAGGCCAAC
TTTCAACAAATTGAGGAAGTGAAGTCAGTTCTAGGAAATTTTTGTAATGCTTCAGGACAAAAAGTTAATAATTCTAAAACAGTCATTTATTTTTCAAAAAATGTT
CCATATGAGGCTCAACAACAGATAGCAGAAGTAGGAGGGTTTAATGTTACATCTAATGTTGGGAAGTACTTAGGAATTCCAATTATACATGGAAGATTTAGAGCA
TCGGTCCTTCAGGACATAATAAACAAAGTTAATCTTCGTCTTTGTAACTGGTCTGCGGCTTCTCTATCTTTAGCAGGGCGCTCGACATTAGTTAAATCGGTCCTT
CAGGCTTTGCCAACATATGCTATGCATATTTTTAAGTTTCCAGTAGCAGTTTGTAACAAACTTGACCAGATTTGTAGGAACTTCTTATGGGGGCATATCGCTCAA
AAATCTAGAATACACCTTTTTAATTGGAATTTGGTCACAAAGCCAAAGTTTGAGGCGGGACTTGGGTTACACCGATTTAAAGAATTTAACATAGCTCTGATTGAC
AAAATTCTGGGGTTAATCAGCCGACCAAACGATCTTTGGGCTCAAGTGCTCATAAATAAGTATAAAGGAAGGTGGGGGGATGGGTGGAGCTTGGTTGTAAAGCAA
GGAGACTCTTGGCTCTTGAATTATTTAGCTGGAGCATGGAATGAAATTAATCGTTGGATGGGTTGGGGAATTGGGAATGGTTGTACAGCTAAATTTTGGCTAGAT
AGATGGGTCAATGGGGAAAGCATTATAGAACCTATGGATGGAAGTGATGTTCCAGATTTTGAAAGAAATAGACCTATTAAGGATTATGTTTTAGAAACGGGAGGC
TGGAATTGGGAAATCTTTGCAAATAGAGTTAATCAATCAGTATCGTTAGCTATTGCAAGTATAAATCCGCCAGATGAGACAATCAATCTACCTGATTACCCAGTT
TGGATTGGTTCTAGG
Protein sequence
QRSRCEWIWFGDRNTKFFHTRVIIARKRIRIDSLKDSSGSWVTEEETLRNLAFEHFKNLYGSSRGVLSTTTSFPSLPVDANEKLVKEVSLDEVRTAMFSLGAFKA
PGPNGFQAVFYQQNWELVAPSVLEHVRDILERGKTIDKANNTFIVLIPKVKHPESILQFRPISLCNVSYKLVTKLISLRLREYLGNIVSPNQASFVPSRHIQDNI
IVAQELIHKMDKLKRTKKFMLVKVDLEKAYERVSWSFLNETLELVGIPSKLKNAIMDCVCSAKMRVLWNGGPTGAFKMHRGVRQGGPMISHLMFADDILLFAEAN
FQQIEEVKSVLGNFCNASGQKVNNSKTVIYFSKNVPYEAQQQIAEVGGFNVTSNVGKYLGIPIIHGRFRASVLQDIINKVNLRLCNWSAASLSLAGRSTLVKSVL
QALPTYAMHIFKFPVAVCNKLDQICRNFLWGHIAQKSRIHLFNWNLVTKPKFEAGLGLHRFKEFNIALIDKILGLISRPNDLWAQVLINKYKGRWGDGWSLVVKQ
GDSWLLNYLAGAWNEINRWMGWGIGNGCTAKFWLDRWVNGESIIEPMDGSDVPDFERNRPIKDYVLETGGWNWEIFANRVNQSVSLAIASINPPDETINLPDYPV
WIGSR