; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022213 (gene) of Snake gourd v1 genome

Gene IDTan0022213
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationLG08:17928909..17934451
RNA-Seq ExpressionTan0022213
SyntenyTan0022213
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU30026.1 hypothetical protein TSUD_161120 [Trifolium subterraneum]8.7e-5426.12Show/hide
Query:  WNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLIFEDLIGRVKLSDLEF------------------RKMAISLGNSIGQFEKVETNEAG
        W L+  VE Q +GKNL+++ F   +D   +   GPW   + +L+ + + G  + S++E                     MA  L + +G+FE+ +T +  
Subjt:  WNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLIFEDLIGRVKLSDLEF------------------RKMAISLGNSIGQFEKVETNEAG

Query:  KCWGHTLRVKVKVDIKKPLKRYTKIKVGLMMEETKILITYEKLPDFCFGCGLVGHVEKDCETPGTNTGE--------TKRYGTWLRALPSERKNLRTDQR
        +  G  LR KV +D+ KPLKR T ++     ++ K+   YE+LP  CF CG +GH  +DC+  G N GE           +G WLRA P  R    T + 
Subjt:  KCWGHTLRVKVKVDIKKPLKRYTKIKVGLMMEETKILITYEKLPDFCFGCGLVGHVEKDCETPGTNTGE--------TKRYGTWLRALPSERKNLRTDQR

Query:  GGRQPQSRGRGRGHQQSRTERETSESKEEEIETDQQC---KANTKESSTPEKLETGSEI-EILTTLQSDHAKQKRNLDSKDDLFPRPTESPEGSKSKQDK
         G    S+    G   S+ E     ++E E+E   +    K +    S P    T  EI E+  TL+S             DL P              K
Subjt:  GGRQPQSRGRGRGHQQSRTERETSESKEEEIETDQQC---KANTKESSTPEKLETGSEI-EILTTLQSDHAKQKRNLDSKDDLFPRPTESPEGSKSKQDK

Query:  SQKETRKGKEALASGLIPESKSHKVKEGRGQHPINEGAD-----GGIHLRPQPLRITKSIQNVMDLTEEIIGKGKEDDNNQITFTLKSEEQRISSRKWKR
        + K   KGK   +S    ++K  +VK GR   P  E        G   L  +    T +  +  +L  E+ G G+   ++        E        W  
Subjt:  SQKETRKGKEALASGLIPESKSHKVKEGRGQHPINEGAD-----GGIHLRPQPLRITKSIQNVMDLTEEIIGKGKEDDNNQITFTLKSEEQRISSRKWKR

Query:  KARGNLVVAEENQKEASSIIQNDLGSWRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWDDEKLGGGKKDPLLMQAFREIVDKCKLSDP
            N      N  +     Q +  +W   G YG     ++ D+WEL+  LS  +   W+  GDFN+I    EK GG  +    +Q  RE +D C L+D 
Subjt:  KARGNLVVAEENQKEASSIIQNDLGSWRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWDDEKLGGGKKDPLLMQAFREIVDKCKLSDP

Query:  WYNGNKFTWRKNRLNRNSTKERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRKNKLLKFEESWAHSKECKNLVKQVWDNSPGD
         +NG  FTW   R+   + + RLDR L     +++   I ++H     SDH   +A  + +       +K  L +FEE W+    C+ LV+Q W  S G+
Subjt:  WYNGNKFTWRKNRLNRNSTKERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRKNKLLKFEESWAHSKECKNLVKQVWDNSPGD

Query:  DLEGLIYRNTISLK--------------KLND-------------------------------------------W---------------------NRI
          E LI   ++ ++              K+ D                                           W                     N I
Subjt:  DLEGLIYRNTISLK--------------KLND-------------------------------------------W---------------------NRI

Query:  -RLKG-----------------SLQGSLFTSSNPDSDSVARVLDGLPRKIEEDQNSFLCNPFTNHEVEEALKNMGPTKAPGPDGLHAIYYQRWWDIVGAD
         +LKG                      LFT+SNP    + +  + + RK+  +Q+ +    F+  EV+EA++ M P KAPGPDGL A++YQ++W IVGAD
Subjt:  -RLKG-----------------SLQGSLFTSSNPDSDSVARVLDGLPRKIEEDQNSFLCNPFTNHEVEEALKNMGPTKAPGPDGLHAIYYQRWWDIVGAD

OMO61345.1 reverse transcriptase [Corchorus capsularis]1.2e-5027.16Show/hide
Query:  MPRIWNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLIFE-----DLIGRVKLSDLEFRKMAISL-------------GNSIGQFEKVET
        M  +W L   ++++ +G+NLFI+ F  + +K R+    PW  +K LL+ +     D +  +KL    F K A  L             G S G  E+++T
Subjt:  MPRIWNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLIFE-----DLIGRVKLSDLEFRKMAISL-------------GNSIGQFEKVET

Query:  NEAGKCWGHTLRVKVKVDIKKPLKRYTKIKVGLMMEET---KILIT--YEKLPDFCFGCGLVGHVEKDCETPGTNTGE----TKRYGTWLRALPSERKNL
              WG  LR + ++++ KPL+R      G+++      KILI+  YEKLPDFC+ CG + HVE +CE       +     K YG WLRA     K++
Subjt:  NEAGKCWGHTLRVKVKVDIKKPLKRYTKIKVGLMMEET---KILIT--YEKLPDFCFGCGLVGHVEKDCETPGTNTGE----TKRYGTWLRALPSERKNL

Query:  RTDQRGGRQPQSRGRGRGHQQSRTERETSESKEEEIETDQQ---CKANTKESSTPEKLETGSEIEILTTLQSD----HAKQKRNLDSK---DDLFPRPTE
        + D            G G  ++RT RE  E K++ +   ++   CK    +S    KL+   + E  +  Q D     A  K  +  K    D   +  E
Subjt:  RTDQRGGRQPQSRGRGRGHQQSRTERETSESKEEEIETDQQ---CKANTKESSTPEKLETGSEIEILTTLQSD----HAKQKRNLDSK---DDLFPRPTE

Query:  SPEGSKSKQDKSQKETRKGKEALASGLIPESKSHKVKEGRGQHPINEGAD--GGIHLRPQPLRITKSIQNV----MDLTEEIIGKGKEDDNNQITFTLK-
           GS S     +K   KG       ++ +  + K         +NE ++  GG  ++ +  R   ++  +     D   ++ G+ +E     +    K 
Subjt:  SPEGSKSKQDKSQKETRKGKEALASGLIPESKSHKVKEGRGQHPINEGAD--GGIHLRPQPLRITKSIQNV----MDLTEEIIGKGKEDDNNQITFTLK-

Query:  -SEEQRISSRK--------WKRKARGNLVVAEENQKEASSIIQNDLGS--WRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWDDEKLG
          E+  + S +        WK +   +++    +  +A  I+ +  GS  WRFTGFYGNP   +R +SW+L+  L   SS PW+IGGDFNEI    EK+G
Subjt:  -SEEQRISSRK--------WKRKARGNLVVAEENQKEASSIIQNDLGS--WRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWDDEKLG

Query:  GGKKDPLLMQAFREIVDKCKLSDPWYNGNKFTWRKNRLNRNSTKERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRK-NKLLK
        G  +    +Q FR ++  C+L      G + TWR+ R   N+  ERLDRFL++              GS+                     G+K N +LK
Subjt:  GGKKDPLLMQAFREIVDKCKLSDPWYNGNKFTWRKNRLNRNSTKERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRK-NKLLK

Query:  FEESWAHSKECKNLVKQVWDNSPGDDLEGLIYRNTISLKKLNDWNRIRLKGSLQGSLFTSSNPDSDSVARVLDGLPRKIEEDQNSFLCNPFTNHEVEEAL
         ++  A  K C             DD+E +                          LFTSS+  S       D +  ++  D    L   +   EV+ A+
Subjt:  FEESWAHSKECKNLVKQVWDNSPGDDLEGLIYRNTISLKKLNDWNRIRLKGSLQGSLFTSSNPDSDSVARVLDGLPRKIEEDQNSFLCNPFTNHEVEEAL

Query:  KNMGPTKAPGPDGLHAIYYQRWWDIVGAD
          M P+KA GPDG+ A++YQ++WD++G +
Subjt:  KNMGPTKAPGPDGLHAIYYQRWWDIVGAD

XP_010673168.1 PREDICTED: uncharacterized protein LOC104889608 [Beta vulgaris subsp. vulgaris]2.4e-5124.32Show/hide
Query:  RIWNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLIFEDLIGRVKLSDLEFRKMAI------------------SLGNSIGQFEKVETNE
        +IW++      +++   LF+  F   +DK ++  G PW  D+ L++F ++ G  + S++                          +G+ +G   +V+ + 
Subjt:  RIWNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLIFEDLIGRVKLSDLEFRKMAI------------------SLGNSIGQFEKVETNE

Query:  AGKCWGHTLRVKVKVDIKKPLKRYTKIKV---GLMMEETKILITYEKLPDFCFGCGLVGHVEKDC-ETPGTNTGETKRYGTWLRALPSERKNLRTDQRGG
         G  W  + RVKV VD+ KPL+R  +I+     + + E K    YE+LP+FC+ CG++GH+E+DC   P  +  E + +G+WLRA P  R+         
Subjt:  AGKCWGHTLRVKVKVDIKKPLKRYTKIKV---GLMMEETKILITYEKLPDFCFGCGLVGHVEKDC-ETPGTNTGETKRYGTWLRALPSERKNLRTDQRGG

Query:  RQPQSRGRGRGHQQSRTERETSESKEEEIETDQQCKANTKESSTPEKLETGSEIEILTTLQSDHAKQKRNLDSKDD--------LFPRPTESPEGSKSKQ
        ++ +S  R      S    E   +    I   Q      +E +   +  T  E +I    Q +HA  +      +         LF + + SP+  +  +
Subjt:  RQPQSRGRGRGHQQSRTERETSESKEEEIETDQQCKANTKESSTPEKLETGSEIEILTTLQSDHAKQKRNLDSKDD--------LFPRPTESPEGSKSKQ

Query:  DKSQKETRKGK-----EALASGLIPESKSHKVKEGRGQHPINEGADGGIHLRPQPLRITKSIQNVMDL-TEEIIGKGKEDDNNQITFTLKSEEQRISSR-
         + +K+    K     E     +   + S   ++      +++  D  + L  +  +I   + N   L T E  G   E+  +++ +        +    
Subjt:  DKSQKETRKGK-----EALASGLIPESKSHKVKEGRGQHPINEGADGGIHLRPQPLRITKSIQNVMDL-TEEIIGKGKEDDNNQITFTLKSEEQRISSR-

Query:  ----KWKRKARGNLVVAEENQKEASSIIQNDLGSWRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWDDEKLGGGKKDPLLMQAFREIV
             WK       +V+  N      ++  +   WRF G YG P    +  +W+LL  L      P L GGDFNE+    E  GG   D   M  FRE+V
Subjt:  ----KWKRKARGNLVVAEENQKEASSIIQNDLGSWRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWDDEKLGGGKKDPLLMQAFREIV

Query:  DKCKLSDPWYNGNKFTWRKNRLNRNSTKERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRKNKLLKFEESWAHSKECKNLVKQ
        D+  L D  ++G  +TW + +      +ERLDRFL +PQ      ++ ++H   + SDH PI+  +   C      RK K  +F  +W     C++LV+ 
Subjt:  DKCKLSDPWYNGNKFTWRKNRLNRNSTKERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRKNKLLKFEESWAHSKECKNLVKQ

Query:  VWDNSPG-------------------------------------------------------DDLEGLI--------------------------YRNTI
         WD+S G                                                         L+GL+                          +    
Subjt:  VWDNSPG-------------------------------------------------------DDLEGLI--------------------------YRNTI

Query:  SLKKLNDWNRI---------------RLKGSLQGSLFTSSNPDSDSVARVLDGLPRKIEEDQNSFLCNPFTNHEVEEALKNMGPTKAPGPDGLHAIYYQR
          K+ N  N +               R+  +   +LFTSS P  ++++ VLD +   I E+ N  LC      EV EAL+ M P+KAPGPDG+HA++YQR
Subjt:  SLKKLNDWNRI---------------RLKGSLQGSLFTSSNPDSDSVARVLDGLPRKIEEDQNSFLCNPFTNHEVEEALKNMGPTKAPGPDGLHAIYYQR

Query:  WWDIVGADTT
        +W IVG D T
Subjt:  WWDIVGADTT

XP_042990668.1 uncharacterized protein LOC122317666 [Carya illinoinensis]9.0e-5124.18Show/hide
Query:  MPRIWNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLIFEDLIGRVKLSDLEF------------------RKMAISLGNSIGQFEKVET
        M +IWN E  +  + +  N+++  F    DK ++  G PW  D+ L+  ++  G + LS++ F                  ++M I +G+ IG+  +VET
Subjt:  MPRIWNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLIFEDLIGRVKLSDLEF------------------RKMAISLGNSIGQFEKVET

Query:  NEAGKCWGHTLRVKVKVDIKKPLKRYTKIKVGLMMEETKILITYEKLPDFCFGCGLVGHVEKDCETPGTNTGETKRYGTWLRALPSERKNLRTDQRGGRQ
        N  G  WG  LR+K +V++ K L R   +K G   +++ +   YE+LP FCF CG   H +  C+  G +     +YG WLRA     K     + GG +
Subjt:  NEAGKCWGHTLRVKVKVDIKKPLKRYTKIKVGLMMEETKILITYEKLPDFCFGCGLVGHVEKDCETPGTNTGETKRYGTWLRALPSERKNLRTDQRGGRQ

Query:  PQSRGRGRGHQQSRTERETSESKEEEIETDQQCKANTKE--SSTPEKLETGSEIEIL--------------TTLQSDHAKQK------------------
                  ++  +    S S+EE  +    C  +T +  S  P  +E+   +E+               T++  + A  K                  
Subjt:  PQSRGRGRGHQQSRTERETSESKEEEIETDQQCKANTKE--SSTPEKLETGSEIEIL--------------TTLQSDHAKQK------------------

Query:  --------------RNLDSKDDLFPRPTESP----------------------EGSKSKQDKSQKETRKGKE-----ALASGLIPESKSHKVKEGRGQHP
                       +L S  D+    TE+P                      E S +   K +   RK +      +  + +I +  S+   +   +  
Subjt:  --------------RNLDSKDDLFPRPTESP----------------------EGSKSKQDKSQKETRKGKE-----ALASGLIPESKSHKVKEGRGQHP

Query:  INEGADG---------------GIHL-----RPQPLRITKSIQNVMDLTEEIIGKGKEDDNNQITFTLKSEEQRISSRKWKRKARGNLVVAEENQKEASS
         +  ADG                +HL     +P  + +T++  N + L    +  G E   N  +   K +   + +  WK   +  ++        A  
Subjt:  INEGADG---------------GIHL-----RPQPLRITKSIQNVMDLTEEIIGKGKEDDNNQITFTLKSEEQRISSRKWKRKARGNLVVAEENQKEASS

Query:  IIQNDLGSWRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWDDEKLGGGKKDPLLMQAFREIVDKCKLSDPWYNGNKFTWRKNRLNRNS
            D   W+ TGFYG+PN AKR +SW LL  L    + PWL  GDFNEI    EK+G   +    M  FR  +  CKL D  ++G+KFTW  NR     
Subjt:  IIQNDLGSWRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWDDEKLGGGKKDPLLMQAFREIVDKCKLSDPWYNGNKFTWRKNRLNRNS

Query:  TKERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRKNKLLKFEESWAHSKECKNLVKQVWDNSPGD------------------
        TKERLDR   N          T+ H     SDH+   A +V   D  ++G+K ++ +FE +W    EC+ ++K+VW  S G                   
Subjt:  TKERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRKNKLLKFEESWAHSKECKNLVKQVWDNSPGD------------------

Query:  -----------------------------------------------DLEGL--------------------IYRNTISLKKLNDWNRIRLKGSLQ----
                                                       D E L                     ++ +   K+ N   RI+ K  L     
Subjt:  -----------------------------------------------DLEGL--------------------IYRNTISLKKLNDWNRIRLKGSLQ----

Query:  -----------GSLFTSSNPDSDSVARVLDGLPRKIEEDQNSFLCNPFTNHEVEEALKNMGPTKAPGPDGLHAIYYQRWWDIVGADTTK
                     LFTSS+P    +   L  L + I +D  +FL   FT  EV+EA+ +M P  +PGPDG  A+++Q++WD VG++ TK
Subjt:  -----------GSLFTSSNPDSDSVARVLDGLPRKIEEDQNSFLCNPFTNHEVEEALKNMGPTKAPGPDGLHAIYYQRWWDIVGADTTK

XP_042990668.1 uncharacterized protein LOC122317666 [Carya illinoinensis]2.2e-0425.86Show/hide
Query:  TGGWDPTKIKQAFSNADAEDILNMQAGRPGSSDTIIWGVDPKGVFTVKSAYHLEINLNSSSLPSSSSNNSSTSSWKALWNQETSLKVKICAWKVFKDIIP
        T  W+   ++    + DAE +  +      + D ++W     GVFTVKSAY L + L       SS      + W ++W     +  K   W+   + IP
Subjt:  TGGWDPTKIKQAFSNADAEDILNMQAGRPGSSDTIIWGVDPKGVFTVKSAYHLEINLNSSSLPSSSSNNSSTSSWKALWNQETSLKVKICAWKVFKDIIP

Query:  SKANIIGKGIDTNDRC
        ++A ++ K +   D C
Subjt:  SKANIIGKGIDTNDRC

XP_042990668.1 uncharacterized protein LOC122317666 [Carya illinoinensis]1.2e-5024.27Show/hide
Query:  RIWNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLIFEDLIGRVKLSDLEF------------------RKMAISLGNSIGQFEKVETNE
        +IW L   V ++ V  N FI  F  + DK R+ +G PW  D  L +     G + +S+++F                  ++    LG++IG+ E+VE +E
Subjt:  RIWNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLIFEDLIGRVKLSDLEF------------------RKMAISLGNSIGQFEKVETNE

Query:  AGKCWGHTLRVKVKVDIKKPLKRYTKIKVGLMMEETKIL--ITYEKLPDFCFGCGLVGHVEKDCETPGTNTGETKRYGTWLRALPSERKNLRTDQRGGRQ
            WG +LRVK+ +D++KPL R   I    ++   K+   + YEK+P FCF CG + H    C+        + ++G+WLRA    +K         ++
Subjt:  AGKCWGHTLRVKVKVDIKKPLKRYTKIKVGLMMEETKIL--ITYEKLPDFCFGCGLVGHVEKDCETPGTNTGETKRYGTWLRALPSERKNLRTDQRGGRQ

Query:  PQSRGRGRGHQQSRTERETSESKEEEIETDQQCKANTKESS------------------------TPEKLETGSEIEILTT--LQSDHAKQKRNLDSKDD
          S  +G   +    +   ++    +     +C  N   SS                        T    E G    +  T  L SD     + + + + 
Subjt:  PQSRGRGRGHQQSRTERETSESKEEEIETDQQCKANTKESS------------------------TPEKLETGSEIEILTT--LQSDHAKQKRNLDSKDD

Query:  LFPRPTESPEGSKSKQDKSQKETRKGKEALASGLIPESKS-----HKV---KEGRGQHPINEGADGGIHLRPQPLR---ITKSIQNVMDLTEEII-----
         F  P   P G     D SQ    +      +GL+PE        H++   ++GR      +    G  L  +PL    + K   + + L + +I     
Subjt:  LFPRPTESPEGSKSKQDKSQKETRKGKEALASGLIPESKS-----HKV---KEGRGQHPINEGADGGIHLRPQPLR---ITKSIQNVMDLTEEII-----

Query:  --------------GKGKEDDNNQITFTLKS--------EEQRISSR-----KWKRKARGNLVVAEENQKEASSIIQN----------------------
                      G G       ++  ++S        +E ++++R     K++   +  L V+ E +    ++  N                      
Subjt:  --------------GKGKEDDNNQITFTLKS--------EEQRISSR-----KWKRKARGNLVVAEENQKEASSIIQN----------------------

Query:  --DLGSWRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWDDEKLGGGKKDPLLMQAFREIVDKCKLSDPWYNGNKFTWRKNRLNRNSTK
          ++  W  TGFYGN + +KR +SW LL  L   S   WLI GDFNEI  + EK GG  K    M+AFR+++D+C L D  +NGN FTW   R   +   
Subjt:  --DLGSWRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWDDEKLGGGKKDPLLMQAFREIVDKCKLSDPWYNGNKFTWRKNRLNRNSTK

Query:  ERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRKNKLLKFEESWAHSKECKNLVKQVWDNSPG-DDLEGLIYRNTISLKKLNDW
        ERLDRFL N + H+     ++ HG    SDH PI   ++ L   +  G + KL +FE  W  + +CK +++  W    G  DL  ++ +     +KL  W
Subjt:  ERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRKNKLLKFEESWAHSKECKNLVKQVWDNSPG-DDLEGLIYRNTISLKKLNDW

Query:  NRIRLKGSLQGSL---------------------------------------------------------------------------------------
        N+    G +Q +L                                                                                       
Subjt:  NRIRLKGSLQGSL---------------------------------------------------------------------------------------

Query:  ---------FTSSNPDSDSVARV--LDGLPRKIEEDQNSFLCNPFTNHEVEEALKNMGPTKAPGPDGLHAIYYQRWWDIVGADTTK
                 F S    +D V  +  L GL  +I  D    L +PF+  EV+ AL  M PTKAPGPDG+  ++YQ++W +VG D T+
Subjt:  ---------FTSSNPDSDSVARV--LDGLPRKIEEDQNSFLCNPFTNHEVEEALKNMGPTKAPGPDGLHAIYYQRWWDIVGADTTK

TrEMBL top hitse value%identityAlignment
A0A1R3GTB5 Reverse transcriptase5.7e-5127.16Show/hide
Query:  MPRIWNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLIFE-----DLIGRVKLSDLEFRKMAISL-------------GNSIGQFEKVET
        M  +W L   ++++ +G+NLFI+ F  + +K R+    PW  +K LL+ +     D +  +KL    F K A  L             G S G  E+++T
Subjt:  MPRIWNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLIFE-----DLIGRVKLSDLEFRKMAISL-------------GNSIGQFEKVET

Query:  NEAGKCWGHTLRVKVKVDIKKPLKRYTKIKVGLMMEET---KILIT--YEKLPDFCFGCGLVGHVEKDCETPGTNTGE----TKRYGTWLRALPSERKNL
              WG  LR + ++++ KPL+R      G+++      KILI+  YEKLPDFC+ CG + HVE +CE       +     K YG WLRA     K++
Subjt:  NEAGKCWGHTLRVKVKVDIKKPLKRYTKIKVGLMMEET---KILIT--YEKLPDFCFGCGLVGHVEKDCETPGTNTGE----TKRYGTWLRALPSERKNL

Query:  RTDQRGGRQPQSRGRGRGHQQSRTERETSESKEEEIETDQQ---CKANTKESSTPEKLETGSEIEILTTLQSD----HAKQKRNLDSK---DDLFPRPTE
        + D            G G  ++RT RE  E K++ +   ++   CK    +S    KL+   + E  +  Q D     A  K  +  K    D   +  E
Subjt:  RTDQRGGRQPQSRGRGRGHQQSRTERETSESKEEEIETDQQ---CKANTKESSTPEKLETGSEIEILTTLQSD----HAKQKRNLDSK---DDLFPRPTE

Query:  SPEGSKSKQDKSQKETRKGKEALASGLIPESKSHKVKEGRGQHPINEGAD--GGIHLRPQPLRITKSIQNV----MDLTEEIIGKGKEDDNNQITFTLK-
           GS S     +K   KG       ++ +  + K         +NE ++  GG  ++ +  R   ++  +     D   ++ G+ +E     +    K 
Subjt:  SPEGSKSKQDKSQKETRKGKEALASGLIPESKSHKVKEGRGQHPINEGAD--GGIHLRPQPLRITKSIQNV----MDLTEEIIGKGKEDDNNQITFTLK-

Query:  -SEEQRISSRK--------WKRKARGNLVVAEENQKEASSIIQNDLGS--WRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWDDEKLG
          E+  + S +        WK +   +++    +  +A  I+ +  GS  WRFTGFYGNP   +R +SW+L+  L   SS PW+IGGDFNEI    EK+G
Subjt:  -SEEQRISSRK--------WKRKARGNLVVAEENQKEASSIIQNDLGS--WRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWDDEKLG

Query:  GGKKDPLLMQAFREIVDKCKLSDPWYNGNKFTWRKNRLNRNSTKERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRK-NKLLK
        G  +    +Q FR ++  C+L      G + TWR+ R   N+  ERLDRFL++              GS+                     G+K N +LK
Subjt:  GGKKDPLLMQAFREIVDKCKLSDPWYNGNKFTWRKNRLNRNSTKERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRK-NKLLK

Query:  FEESWAHSKECKNLVKQVWDNSPGDDLEGLIYRNTISLKKLNDWNRIRLKGSLQGSLFTSSNPDSDSVARVLDGLPRKIEEDQNSFLCNPFTNHEVEEAL
         ++  A  K C             DD+E +                          LFTSS+  S       D +  ++  D    L   +   EV+ A+
Subjt:  FEESWAHSKECKNLVKQVWDNSPGDDLEGLIYRNTISLKKLNDWNRIRLKGSLQGSLFTSSNPDSDSVARVLDGLPRKIEEDQNSFLCNPFTNHEVEEAL

Query:  KNMGPTKAPGPDGLHAIYYQRWWDIVGAD
          M P+KA GPDG+ A++YQ++WD++G +
Subjt:  KNMGPTKAPGPDGLHAIYYQRWWDIVGAD

A0A2N9F7A6 Uncharacterized protein3.3e-5124.51Show/hide
Query:  MPRIWNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLIFEDLIGRVKLSDLEFRKMAI------------------SLGNSIGQFEKVET
        M R+W +   + I+ +G+NLF++ F    ++ R+ NG PW+ +  +L   +  G    + ++F +                      +G ++G   +V+ 
Subjt:  MPRIWNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLIFEDLIGRVKLSDLEFRKMAI------------------SLGNSIGQFEKVET

Query:  NEAGKCWGHTLRVKVKVDIKKPLKRYTKIKVGLMMEETKILITYEKLPDFCFGCGLVGHVEKDCET---PGTNTGE-TKRYGTWLRALPSERKNLRTDQR
         E G  WG  LR ++ +DI KP+ R   I   L + +  +   YE+LP  CF CG++GH E+DC T    G   GE  ++YG WLRA  +E    R DQ 
Subjt:  NEAGKCWGHTLRVKVKVDIKKPLKRYTKIKVGLMMEETKILITYEKLPDFCFGCGLVGHVEKDCET---PGTNTGE-TKRYGTWLRALPSERKNLRTDQR

Query:  GGRQPQSRGRGRGHQQSRTERETSESKEEEIETDQQCKANT-----KESSTPEKLETGSEI----EILTTLQSDHAK----QKRNLD-SKDDLFPRPTES
         GR  +    GR    S+     + +++E+        A++     +ES+ P   +   ++    E +T  + + +K    +++ +D S     P   ++
Subjt:  GGRQPQSRGRGRGHQQSRTERETSESKEEEIETDQQCKANT-----KESSTPEKLETGSEI----EILTTLQSDHAK----QKRNLD-SKDDLFPRPTES

Query:  PEGSKSKQDKSQKETRKGKEALASGLIPESKSHKVKEGRGQHPINEGADGGIHLRPQPLRITK--SIQNV-MDLTEEIIGKGKEDDNNQITFTLKSEEQR
         E      +     T    ++L       S +  +      H I   A G  H   Q   ++K   I+   M+L    +      +N  +   L    + 
Subjt:  PEGSKSKQDKSQKETRKGKEALASGLIPESKSHKVKEGRGQHPINEGADGGIHLRPQPLRITK--SIQNV-MDLTEEIIGKGKEDDNNQITFTLKSEEQR

Query:  IS----SRKWKRKARGNLVVAEENQKEASSIIQNDLG-SWRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWDDEKLGGGKKDPLLMQA
        +     +  W  K    L    +N  +A +++  +LG S+R TGFYGNP   +R +SW LL  LS  +++PWL  GDFNEI  + E++G G +    ++ 
Subjt:  IS----SRKWKRKARGNLVVAEENQKEASSIIQNDLG-SWRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWDDEKLGGGKKDPLLMQA

Query:  FREIVDKCKLSDPWYNGNKFTWRKNRLNRNSTKERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRKNKLLKFEESWAHSKECK
        FRE +   +L D  ++G  FTWR  R        RLDR L +    A+     + H     SDH P+   ++++   + + ++ K+ +FE  W   ++C+
Subjt:  FREIVDKCKLSDPWYNGNKFTWRKNRLNRNSTKERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRKNKLLKFEESWAHSKECK

Query:  NLVKQVWDNSPGDDLEG-LIYRNTISLKK----LNDWNR--------------------------------IRLKGSLQG--------------------
         ++ + W     D  EG  +++ T  LKK    L  W++                                + L+  L G                    
Subjt:  NLVKQVWDNSPGDDLEG-LIYRNTISLKK----LNDWNR--------------------------------IRLKGSLQG--------------------

Query:  -----------------------------------------------SLFTSSNPDSDSVARVLDGLPRKIEEDQNSFLCNPFTNHEVEEALKNMGPTKA
                                                       ++FTSS P  D++   L+G+   +  D N+ L   FT  EV  AL+ M PTKA
Subjt:  -----------------------------------------------SLFTSSNPDSDSVARVLDGLPRKIEEDQNSFLCNPFTNHEVEEALKNMGPTKA

Query:  PGPDGLHAIYYQRWWDIVGADTTK
        PGPDG+ AI+YQ +W++VG + T+
Subjt:  PGPDGLHAIYYQRWWDIVGADTTK

A0A2N9G7B6 Uncharacterized protein5.7e-5126.37Show/hide
Query:  IWNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLI--------------FEDLIGRVKLSDLEFRKM----AISLGNSIGQFEKVETNEA
        +W  +    I+ +  N  ++ F +  D+ R+  G PW+ DK L+I              F +    V+L  +  R+M    AI LG+S+G+   V   E 
Subjt:  IWNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLI--------------FEDLIGRVKLSDLEFRKM----AISLGNSIGQFEKVETNEA

Query:  GKCWGHTLRVKVKVDIKKPLKRYTKIKVGLMME---ETKILITYEKLPDFCFGCGLVGHVEKDCETPGTNTG----ETKRYGTWLRALPSERKNLRTDQR
            G  +R++V +DI KPL R  K     M+E   E  I   YE+LP+FC+ CGLV H +KDC     N      E +++G WLRA         +++R
Subjt:  GKCWGHTLRVKVKVDIKKPLKRYTKIKVGLMME---ETKILITYEKLPDFCFGCGLVGHVEKDCETPGTNTG----ETKRYGTWLRALPSERKNLRTDQR

Query:  GGRQPQSRGRGRGHQQSRTERETSESKEEEIETDQQCKANTKESSTPEKLETGSEIEILTTLQSDHAKQKRNLDSKDDLFPRPTESPEGSKSKQDKSQKE
          R+ + +  G   + S +++ T         T  Q ++N+        + T   I      Q       + + S     P P+ +   +    ++    
Subjt:  GGRQPQSRGRGRGHQQSRTERETSESKEEEIETDQQCKANTKESSTPEKLETGSEIEILTTLQSDHAKQKRNLDSKDDLFPRPTESPEGSKSKQDKSQKE

Query:  TRKGKEALASGLIPESKSHKVKEGRGQHPINEGADGGIHLRPQPLRITKSIQNVMDLTEE--------IIGKGKEDDNNQITFTLKSEEQRISSRK----
        T    E           + ++  G G + +   A   +    + L   +++Q +  L           I     E    ++   L+ E + I+  +    
Subjt:  TRKGKEALASGLIPESKSHKVKEGRGQHPINEGADGGIHLRPQPLRITKSIQNVMDLTEE--------IIGKGKEDDNNQITFTLKSEEQRISSRK----

Query:  -----WKRKARGNLVVAEENQKEASSII-QNDLGSWRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWDDEKLGGGKKDPLLMQAFREI
             WK++   NL V   +     +++ +N   +WRFTGFYG P    R +SW LL RL+     PW   GDFNE+   +EK G   +    MQ FR++
Subjt:  -----WKRKARGNLVVAEENQKEASSII-QNDLGSWRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWDDEKLGGGKKDPLLMQAFREI

Query:  VDKCKLSDPWYNGNKFTWRKNRLNRNSTKERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRKNKLLKFEESWAHSKECKNLVK
        +D+C   D  + G +FTW  NR   + T ERLDR +  P    +     + H     SDH+P+  T              K  +FEE W   + C+ +V+
Subjt:  VDKCKLSDPWYNGNKFTWRKNRLNRNSTKERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRKNKLLKFEESWAHSKECKNLVK

Query:  QVWDNSPGD---DLEGLIYRNTISLKKLNDWNRIRLKGSLQGSLFT--------------SSNPDSDSVARVLDGLPRKIEEDQNSFLCNPFTNHEVEEA
          W         ++E L+ +      +  D +R+ L      SL                ++NP  + +  VLD +   + E+ N+ L   FT  EV+ A
Subjt:  QVWDNSPGD---DLEGLIYRNTISLKKLNDWNRIRLKGSLQGSLFT--------------SSNPDSDSVARVLDGLPRKIEEDQNSFLCNPFTNHEVEEA

Query:  LKNMGPTKAPGPDGLHAIYYQRWWDIVGADTT
        LK M P KAPGPDGL  I+YQ +W ++G D T
Subjt:  LKNMGPTKAPGPDGLHAIYYQRWWDIVGADTT

A0A2N9G7B6 Uncharacterized protein5.9e-0823.42Show/hide
Query:  IYYQRWWDIVGADTTKGFNKPLFVKEPFKT--LYVQHLF-HPTGGWDPTKIKQAFSNADAEDILNMQAGRPGSSDTIIWGVDPKGVFTVKSAYHLEINLN
        I+  +W  ++G++  +  + P     P  T   +V+HL       W    +K  F   +A  IL +       +D++IWG   +G++TV+S YHL ++  
Subjt:  IYYQRWWDIVGADTTKGFNKPLFVKEPFKT--LYVQHLF-HPTGGWDPTKIKQAFSNADAEDILNMQAGRPGSSDTIIWGVDPKGVFTVKSAYHLEINLN

Query:  SSSLPSSSSNNSSTSSWKALWNQETSLKVKICAWKVFKDIIPSKANIIGKGIDTNDRC
        + + P +S     +  WK +W+ +   K++   W+     +P+++N+  + +  + RC
Subjt:  SSSLPSSSSNNSSTSSWKALWNQETSLKVKICAWKVFKDIIPSKANIIGKGIDTNDRC

A0A2N9G7B6 Uncharacterized protein3.7e-5027.08Show/hide
Query:  RIWNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLIFEDLIGRVKLSDLEF------------------RKMAISLGNSIGQFEKVETNE
        +IW     V  + +G+N F++ F +   KRR    GPW+ +K L++  DL     + D+ F                  ++  +++G  +G+F  ++  E
Subjt:  RIWNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLIFEDLIGRVKLSDLEF------------------RKMAISLGNSIGQFEKVETNE

Query:  AGKCWGHTLRVKVKVDIKKPLKRYTKIKVGLMMEETKILITYEKLPDFCFGCGLVGHVEKDCETPGTNTGETKRYGTWLRALPSERKNLRTDQRGGRQPQ
         G   G  LR+K+++DI+KPL R   + VG         + YE LPDFC+ CG+VGH EK CE      GE   +   LR +P      R    GG   +
Subjt:  AGKCWGHTLRVKVKVDIKKPLKRYTKIKVGLMMEETKILITYEKLPDFCFGCGLVGHVEKDCETPGTNTGETKRYGTWLRALPSERKNLRTDQRGGRQPQ

Query:  SRGRGRGHQQSRTERETSESKEEEIETDQQCKANTKESSTPEKLETGSEIEILTTLQSDHAKQKRNLDSKDDLFPRPTESPEGS---KSKQDKSQKETRK
        + G      + + +R++  S  +  +     K  T  S   EK  T  E E+ + L+     ++ +   K  LF    +S +G+    +K+ ++ K+   
Subjt:  SRGRGRGHQQSRTERETSESKEEEIETDQQCKANTKESSTPEKLETGSEIEILTTLQSDHAKQKRNLDSKDDLFPRPTESPEGS---KSKQDKSQKETRK

Query:  GKEALASGLIPESKSHKVKEGRGQHPINEGADGGIHLRPQPLRITKSIQNVMDLTEEIIGKGKEDDNNQITFTLKSEEQRISSRKWKRKARGNLVVAEEN
          +A+    +P+       EG     + +G   G+ ++    RI +         EE   +G ++ + ++T   K   +     K +R A G     E+ 
Subjt:  GKEALASGLIPESKSHKVKEGRGQHPINEGADGGIHLRPQPLRITKSIQNVMDLTEEIIGKGKEDDNNQITFTLKSEEQRISSRKWKRKARGNLVVAEEN

Query:  QKEASSIIQNDLGSWRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWDDEKLGGGKKDPLLMQAFREIVDKCKLSDPWYNGNKFTWRKN
         K+A   IQ  L        YG+ +   +  +W  +  L    + PWL+ GDFNEI +  EK  G  K    M  FR  +  C L D  + G+ FTWR +
Subjt:  QKEASSIIQNDLGSWRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWDDEKLGGGKKDPLLMQAFREIVDKCKLSDPWYNGNKFTWRKN

Query:  RLNRNS-TKERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRKNKLLKFEESWAHSKECKNLVKQVWDNSPGDDLEGLIYRNTI
          ++    +E LDR + NP+  A      + +G    SDHRP+I  +            +   +FE +W   ++ K +VK+ WD S G  L+GL+   ++
Subjt:  RLNRNS-TKERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRKNKLLKFEESWAHSKECKNLVKQVWDNSPGDDLEGLIYRNTI

Query:  S--LKKLNDWN-------RIRLKGSLQGSLFTSSNPDS-DSVAR---------------VLDGLPRKIEEDQNSFLCNPFTNHEVEEALKNMGPTKAPGP
        +     L+ W+         RLK   +        P S D V R               +LD + RK+    N  L   FT  EV+EAL  +G  KAPGP
Subjt:  S--LKKLNDWN-------RIRLKGSLQGSLFTSSNPDS-DSVAR---------------VLDGLPRKIEEDQNSFLCNPFTNHEVEEALKNMGPTKAPGP

Query:  DGLHAIYYQRWWDIVGADTT
        DG+ A +Y+  WD+VG   T
Subjt:  DGLHAIYYQRWWDIVGADTT

A0A2Z6MBL0 Uncharacterized protein4.2e-5426.12Show/hide
Query:  WNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLIFEDLIGRVKLSDLEF------------------RKMAISLGNSIGQFEKVETNEAG
        W L+  VE Q +GKNL+++ F   +D   +   GPW   + +L+ + + G  + S++E                     MA  L + +G+FE+ +T +  
Subjt:  WNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLIFEDLIGRVKLSDLEF------------------RKMAISLGNSIGQFEKVETNEAG

Query:  KCWGHTLRVKVKVDIKKPLKRYTKIKVGLMMEETKILITYEKLPDFCFGCGLVGHVEKDCETPGTNTGE--------TKRYGTWLRALPSERKNLRTDQR
        +  G  LR KV +D+ KPLKR T ++     ++ K+   YE+LP  CF CG +GH  +DC+  G N GE           +G WLRA P  R    T + 
Subjt:  KCWGHTLRVKVKVDIKKPLKRYTKIKVGLMMEETKILITYEKLPDFCFGCGLVGHVEKDCETPGTNTGE--------TKRYGTWLRALPSERKNLRTDQR

Query:  GGRQPQSRGRGRGHQQSRTERETSESKEEEIETDQQC---KANTKESSTPEKLETGSEI-EILTTLQSDHAKQKRNLDSKDDLFPRPTESPEGSKSKQDK
         G    S+    G   S+ E     ++E E+E   +    K +    S P    T  EI E+  TL+S             DL P              K
Subjt:  GGRQPQSRGRGRGHQQSRTERETSESKEEEIETDQQC---KANTKESSTPEKLETGSEI-EILTTLQSDHAKQKRNLDSKDDLFPRPTESPEGSKSKQDK

Query:  SQKETRKGKEALASGLIPESKSHKVKEGRGQHPINEGAD-----GGIHLRPQPLRITKSIQNVMDLTEEIIGKGKEDDNNQITFTLKSEEQRISSRKWKR
        + K   KGK   +S    ++K  +VK GR   P  E        G   L  +    T +  +  +L  E+ G G+   ++        E        W  
Subjt:  SQKETRKGKEALASGLIPESKSHKVKEGRGQHPINEGAD-----GGIHLRPQPLRITKSIQNVMDLTEEIIGKGKEDDNNQITFTLKSEEQRISSRKWKR

Query:  KARGNLVVAEENQKEASSIIQNDLGSWRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWDDEKLGGGKKDPLLMQAFREIVDKCKLSDP
            N      N  +     Q +  +W   G YG     ++ D+WEL+  LS  +   W+  GDFN+I    EK GG  +    +Q  RE +D C L+D 
Subjt:  KARGNLVVAEENQKEASSIIQNDLGSWRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWDDEKLGGGKKDPLLMQAFREIVDKCKLSDP

Query:  WYNGNKFTWRKNRLNRNSTKERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRKNKLLKFEESWAHSKECKNLVKQVWDNSPGD
         +NG  FTW   R+   + + RLDR L     +++   I ++H     SDH   +A  + +       +K  L +FEE W+    C+ LV+Q W  S G+
Subjt:  WYNGNKFTWRKNRLNRNSTKERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRKNKLLKFEESWAHSKECKNLVKQVWDNSPGD

Query:  DLEGLIYRNTISLK--------------KLND-------------------------------------------W---------------------NRI
          E LI   ++ ++              K+ D                                           W                     N I
Subjt:  DLEGLIYRNTISLK--------------KLND-------------------------------------------W---------------------NRI

Query:  -RLKG-----------------SLQGSLFTSSNPDSDSVARVLDGLPRKIEEDQNSFLCNPFTNHEVEEALKNMGPTKAPGPDGLHAIYYQRWWDIVGAD
         +LKG                      LFT+SNP    + +  + + RK+  +Q+ +    F+  EV+EA++ M P KAPGPDGL A++YQ++W IVGAD
Subjt:  -RLKG-----------------SLQGSLFTSSNPDSDSVARVLDGLPRKIEEDQNSFLCNPFTNHEVEEALKNMGPTKAPGPDGLHAIYYQRWWDIVGAD

SwissProt top hitse value%identityAlignment
P14381 Transposon TX1 uncharacterized 149 kDa protein1.3e-0433.71Show/hide
Query:  SLQGSLFTSSNPDSDSVARVLDGLPRKIEEDQNSFLCNPFTNHEVEEALKNMGPTKAPGPDGLHAIYYQRWWDIVGAD----TTKGFNK
        S   +LF+      D+   + DGLP  + E +   L  P T  E+ +AL+ M   K+PG DGL   ++Q +WD +G D     T+ F K
Subjt:  SLQGSLFTSSNPDSDSVARVLDGLPRKIEEDQNSFLCNPFTNHEVEEALKNMGPTKAPGPDGLHAIYYQRWWDIVGAD----TTKGFNK

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein9.1e-0924.46Show/hide
Query:  KPLFVKEPFKTLYVQHLFHPTGG---WDPTKIKQAFSNADAEDILNMQAGRPGSSDTIIWGVDPKGVFTVKSAYHLEINLNSSSLPSSSSNNSSTSSWKA
        +PL  +E +K + + +LF   G    WD +KI Q    +D   I  +   +    D IIW  +  G +TV+S Y L  +  S+++P+ +  + S      
Subjt:  KPLFVKEPFKTLYVQHLFHPTGG---WDPTKIKQAFSNADAEDILNMQAGRPGSSDTIIWGVDPKGVFTVKSAYHLEINLNSSSLPSSSSNNSSTSSWKA

Query:  LWNQETSLKVKICAWKVFKDIIPSKANIIGKGIDTNDRC
        +WN     K+K   W+     + +   +  +G+  +  C
Subjt:  LWNQETSLKVKICAWKVFKDIIPSKANIIGKGIDTNDRC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCAGGATCTGGAACCTAGAGGCAATAGTTGAAATACAATCTGTAGGAAAAAACTTGTTCATCTATAACTTTCTAGAAAACAAAGATAAAAGAAGAATAAGAAATGG
AGGCCCATGGATTATCGACAAGGGGCTTCTCATTTTTGAAGACCTGATTGGAAGGGTGAAGCTTTCGGATCTCGAGTTCAGGAAGATGGCGATATCCCTAGGAAACTCAA
TTGGGCAGTTTGAAAAGGTTGAAACGAATGAAGCAGGAAAATGTTGGGGGCATACTCTGAGGGTCAAAGTGAAAGTCGACATCAAAAAACCCTTAAAAAGATACACAAAA
ATCAAAGTGGGGTTAATGATGGAAGAAACAAAGATCCTAATAACCTATGAAAAGCTCCCAGATTTTTGCTTTGGTTGTGGCTTGGTGGGGCACGTAGAAAAGGACTGTGA
AACTCCAGGAACTAATACGGGTGAGACAAAAAGGTATGGGACTTGGCTAAGGGCCCTACCATCGGAAAGAAAGAATCTACGAACTGACCAAAGAGGAGGTAGACAACCCC
AAAGCAGAGGAAGAGGTCGAGGACATCAACAGAGTCGGACAGAAAGAGAAACAAGCGAATCAAAAGAGGAGGAAATAGAGACGGACCAACAATGTAAGGCTAATACAAAA
GAAAGTTCAACTCCTGAAAAGCTGGAGACTGGGTCTGAGATTGAGATTCTGACAACCCTCCAATCTGATCATGCTAAGCAAAAGAGGAATCTTGATTCAAAGGATGATTT
ATTCCCCAGACCAACTGAGTCTCCAGAAGGGTCAAAGAGCAAGCAAGATAAATCTCAGAAGGAAACAAGAAAGGGAAAAGAGGCATTAGCCTCAGGGCTGATACCGGAAT
CGAAATCTCACAAAGTAAAAGAGGGAAGAGGACAACATCCTATCAATGAGGGGGCAGATGGAGGAATTCACCTCAGGCCCCAACCCCTCAGAATTACTAAAAGCATCCAA
AATGTGATGGATTTGACAGAGGAAATCATAGGAAAAGGAAAGGAGGACGATAATAACCAGATCACATTCACTTTGAAGAGTGAGGAACAAAGAATAAGCTCTAGGAAATG
GAAGAGAAAGGCAAGAGGCAATCTTGTAGTGGCAGAGGAAAACCAAAAGGAAGCAAGCTCAATCATCCAAAACGATTTGGGATCTTGGAGATTTACCGGCTTCTATGGCA
ACCCTAACCCTGCTAAAAGGTCGGATTCGTGGGAGTTACTGGATAGGCTCAGTCAAGTCTCCTCTGCTCCTTGGCTTATTGGAGGAGACTTCAATGAGATTTTCTGGGAT
GATGAAAAACTTGGAGGGGGCAAAAAGGATCCTCTCCTTATGCAAGCTTTTCGGGAGATAGTTGACAAATGCAAGCTTAGTGACCCTTGGTACAATGGAAACAAGTTCAC
GTGGAGGAAAAATAGATTAAACAGAAACTCTACCAAAGAAAGGCTTGACCGTTTTCTGATTAACCCCCAGCTGCATGCCAAGCAGGTGAGAATTACGATTGACCATGGGA
GTTTTCACATGTCGGATCATAGGCCCATCATTGCTACAATTGTTAACCTCTGTGACCATGCCGCCCTGGGCAGAAAGAATAAGCTGCTGAAATTTGAGGAAAGTTGGGCT
CACTCCAAAGAATGCAAAAATTTGGTTAAGCAAGTCTGGGATAATTCCCCCGGAGATGATCTGGAAGGCCTTATTTACCGCAACACGATCAGCCTGAAGAAGCTCAATGA
CTGGAACAGAATCAGGCTTAAGGGGTCTCTCCAAGGGTCACTTTTCACTTCTTCTAATCCCGATTCGGACAGTGTCGCTAGGGTCCTGGATGGTCTTCCGAGGAAAATTG
AGGAGGATCAAAACAGTTTCCTCTGCAATCCCTTTACAAATCATGAGGTTGAGGAAGCGCTGAAAAATATGGGCCCTACTAAGGCCCCTGGCCCGGATGGGTTACATGCT
ATCTATTACCAACGGTGGTGGGACATTGTTGGGGCTGATACCACAAAAGGCTTCAACAAACCTCTTTTCGTGAAAGAGCCTTTCAAAACTCTTTATGTACAACACCTTTT
CCATCCAACCGGGGGTTGGGACCCTACAAAAATTAAACAAGCTTTCTCAAACGCCGATGCCGAGGATATTCTTAATATGCAAGCTGGCCGCCCAGGCTCTTCAGACACTA
TTATTTGGGGCGTCGATCCTAAGGGGGTTTTCACTGTTAAATCGGCCTACCACCTAGAAATCAATTTAAACTCGAGTTCCTTACCTTCTAGCTCGAGCAATAATTCTTCC
ACCTCTAGCTGGAAAGCTCTTTGGAACCAAGAAACTAGCCTTAAAGTGAAGATTTGTGCTTGGAAGGTCTTCAAGGATATTATTCCCTCAAAAGCTAATATTATTGGCAA
AGGAATTGACACTAATGACCGATGTCTTTTGGAATGGTGGAAGCCTATCGACTTTTGGCATTGGATCGTGAAGAATGGAAAGGGAACTGATATGAAGAAGGCAATCCTGC
TGATTTGGTGCATTTGGACGTACAGAAACCGTATTAACAAAGATAAAGCTACTCCAGACATATCTCATTTTATCAACTGGGAAGGCATGCAACATTTAATTTCCTTTTCT
GATCAAATGCCTCCCCCCCTAGTTATTCAATCTGACTCCTCTGAAGCTGTAAGAGTGATTATCTCCAGAAATGAAGACCTCTCGGAGATCCGACTGATTGTGGAAGATAT
TGACAACCTTCGTTCGAAGCTTAACTCGGTGGTTTTTGAAAAATGCACAAGGTCTGGTAACAGGGTGGCTCATGCCCTCGCCAGAATAGCCATGGAAGCTTCTCCCTCGT
CGACTTGCATGGAGGAAGCGAAAGTTTCTGGACCTGAGGAAAGCGTCTTCTTGTCTTCTGTTTTTCCTTCTTGGTTTGTTAATCTCATTAATGAGGACGCTGGCAGTTGT
GCTTTTGGTAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCCAGGATCTGGAACCTAGAGGCAATAGTTGAAATACAATCTGTAGGAAAAAACTTGTTCATCTATAACTTTCTAGAAAACAAAGATAAAAGAAGAATAAGAAATGG
AGGCCCATGGATTATCGACAAGGGGCTTCTCATTTTTGAAGACCTGATTGGAAGGGTGAAGCTTTCGGATCTCGAGTTCAGGAAGATGGCGATATCCCTAGGAAACTCAA
TTGGGCAGTTTGAAAAGGTTGAAACGAATGAAGCAGGAAAATGTTGGGGGCATACTCTGAGGGTCAAAGTGAAAGTCGACATCAAAAAACCCTTAAAAAGATACACAAAA
ATCAAAGTGGGGTTAATGATGGAAGAAACAAAGATCCTAATAACCTATGAAAAGCTCCCAGATTTTTGCTTTGGTTGTGGCTTGGTGGGGCACGTAGAAAAGGACTGTGA
AACTCCAGGAACTAATACGGGTGAGACAAAAAGGTATGGGACTTGGCTAAGGGCCCTACCATCGGAAAGAAAGAATCTACGAACTGACCAAAGAGGAGGTAGACAACCCC
AAAGCAGAGGAAGAGGTCGAGGACATCAACAGAGTCGGACAGAAAGAGAAACAAGCGAATCAAAAGAGGAGGAAATAGAGACGGACCAACAATGTAAGGCTAATACAAAA
GAAAGTTCAACTCCTGAAAAGCTGGAGACTGGGTCTGAGATTGAGATTCTGACAACCCTCCAATCTGATCATGCTAAGCAAAAGAGGAATCTTGATTCAAAGGATGATTT
ATTCCCCAGACCAACTGAGTCTCCAGAAGGGTCAAAGAGCAAGCAAGATAAATCTCAGAAGGAAACAAGAAAGGGAAAAGAGGCATTAGCCTCAGGGCTGATACCGGAAT
CGAAATCTCACAAAGTAAAAGAGGGAAGAGGACAACATCCTATCAATGAGGGGGCAGATGGAGGAATTCACCTCAGGCCCCAACCCCTCAGAATTACTAAAAGCATCCAA
AATGTGATGGATTTGACAGAGGAAATCATAGGAAAAGGAAAGGAGGACGATAATAACCAGATCACATTCACTTTGAAGAGTGAGGAACAAAGAATAAGCTCTAGGAAATG
GAAGAGAAAGGCAAGAGGCAATCTTGTAGTGGCAGAGGAAAACCAAAAGGAAGCAAGCTCAATCATCCAAAACGATTTGGGATCTTGGAGATTTACCGGCTTCTATGGCA
ACCCTAACCCTGCTAAAAGGTCGGATTCGTGGGAGTTACTGGATAGGCTCAGTCAAGTCTCCTCTGCTCCTTGGCTTATTGGAGGAGACTTCAATGAGATTTTCTGGGAT
GATGAAAAACTTGGAGGGGGCAAAAAGGATCCTCTCCTTATGCAAGCTTTTCGGGAGATAGTTGACAAATGCAAGCTTAGTGACCCTTGGTACAATGGAAACAAGTTCAC
GTGGAGGAAAAATAGATTAAACAGAAACTCTACCAAAGAAAGGCTTGACCGTTTTCTGATTAACCCCCAGCTGCATGCCAAGCAGGTGAGAATTACGATTGACCATGGGA
GTTTTCACATGTCGGATCATAGGCCCATCATTGCTACAATTGTTAACCTCTGTGACCATGCCGCCCTGGGCAGAAAGAATAAGCTGCTGAAATTTGAGGAAAGTTGGGCT
CACTCCAAAGAATGCAAAAATTTGGTTAAGCAAGTCTGGGATAATTCCCCCGGAGATGATCTGGAAGGCCTTATTTACCGCAACACGATCAGCCTGAAGAAGCTCAATGA
CTGGAACAGAATCAGGCTTAAGGGGTCTCTCCAAGGGTCACTTTTCACTTCTTCTAATCCCGATTCGGACAGTGTCGCTAGGGTCCTGGATGGTCTTCCGAGGAAAATTG
AGGAGGATCAAAACAGTTTCCTCTGCAATCCCTTTACAAATCATGAGGTTGAGGAAGCGCTGAAAAATATGGGCCCTACTAAGGCCCCTGGCCCGGATGGGTTACATGCT
ATCTATTACCAACGGTGGTGGGACATTGTTGGGGCTGATACCACAAAAGGCTTCAACAAACCTCTTTTCGTGAAAGAGCCTTTCAAAACTCTTTATGTACAACACCTTTT
CCATCCAACCGGGGGTTGGGACCCTACAAAAATTAAACAAGCTTTCTCAAACGCCGATGCCGAGGATATTCTTAATATGCAAGCTGGCCGCCCAGGCTCTTCAGACACTA
TTATTTGGGGCGTCGATCCTAAGGGGGTTTTCACTGTTAAATCGGCCTACCACCTAGAAATCAATTTAAACTCGAGTTCCTTACCTTCTAGCTCGAGCAATAATTCTTCC
ACCTCTAGCTGGAAAGCTCTTTGGAACCAAGAAACTAGCCTTAAAGTGAAGATTTGTGCTTGGAAGGTCTTCAAGGATATTATTCCCTCAAAAGCTAATATTATTGGCAA
AGGAATTGACACTAATGACCGATGTCTTTTGGAATGGTGGAAGCCTATCGACTTTTGGCATTGGATCGTGAAGAATGGAAAGGGAACTGATATGAAGAAGGCAATCCTGC
TGATTTGGTGCATTTGGACGTACAGAAACCGTATTAACAAAGATAAAGCTACTCCAGACATATCTCATTTTATCAACTGGGAAGGCATGCAACATTTAATTTCCTTTTCT
GATCAAATGCCTCCCCCCCTAGTTATTCAATCTGACTCCTCTGAAGCTGTAAGAGTGATTATCTCCAGAAATGAAGACCTCTCGGAGATCCGACTGATTGTGGAAGATAT
TGACAACCTTCGTTCGAAGCTTAACTCGGTGGTTTTTGAAAAATGCACAAGGTCTGGTAACAGGGTGGCTCATGCCCTCGCCAGAATAGCCATGGAAGCTTCTCCCTCGT
CGACTTGCATGGAGGAAGCGAAAGTTTCTGGACCTGAGGAAAGCGTCTTCTTGTCTTCTGTTTTTCCTTCTTGGTTTGTTAATCTCATTAATGAGGACGCTGGCAGTTGT
GCTTTTGGTAATTAA
Protein sequenceShow/hide protein sequence
MPRIWNLEAIVEIQSVGKNLFIYNFLENKDKRRIRNGGPWIIDKGLLIFEDLIGRVKLSDLEFRKMAISLGNSIGQFEKVETNEAGKCWGHTLRVKVKVDIKKPLKRYTK
IKVGLMMEETKILITYEKLPDFCFGCGLVGHVEKDCETPGTNTGETKRYGTWLRALPSERKNLRTDQRGGRQPQSRGRGRGHQQSRTERETSESKEEEIETDQQCKANTK
ESSTPEKLETGSEIEILTTLQSDHAKQKRNLDSKDDLFPRPTESPEGSKSKQDKSQKETRKGKEALASGLIPESKSHKVKEGRGQHPINEGADGGIHLRPQPLRITKSIQ
NVMDLTEEIIGKGKEDDNNQITFTLKSEEQRISSRKWKRKARGNLVVAEENQKEASSIIQNDLGSWRFTGFYGNPNPAKRSDSWELLDRLSQVSSAPWLIGGDFNEIFWD
DEKLGGGKKDPLLMQAFREIVDKCKLSDPWYNGNKFTWRKNRLNRNSTKERLDRFLINPQLHAKQVRITIDHGSFHMSDHRPIIATIVNLCDHAALGRKNKLLKFEESWA
HSKECKNLVKQVWDNSPGDDLEGLIYRNTISLKKLNDWNRIRLKGSLQGSLFTSSNPDSDSVARVLDGLPRKIEEDQNSFLCNPFTNHEVEEALKNMGPTKAPGPDGLHA
IYYQRWWDIVGADTTKGFNKPLFVKEPFKTLYVQHLFHPTGGWDPTKIKQAFSNADAEDILNMQAGRPGSSDTIIWGVDPKGVFTVKSAYHLEINLNSSSLPSSSSNNSS
TSSWKALWNQETSLKVKICAWKVFKDIIPSKANIIGKGIDTNDRCLLEWWKPIDFWHWIVKNGKGTDMKKAILLIWCIWTYRNRINKDKATPDISHFINWEGMQHLISFS
DQMPPPLVIQSDSSEAVRVIISRNEDLSEIRLIVEDIDNLRSKLNSVVFEKCTRSGNRVAHALARIAMEASPSSTCMEEAKVSGPEESVFLSSVFPSWFVNLINEDAGSC
AFGN