; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000286 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000286
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr4:3000418..3001663
RNA-Seq ExpressionLag0000286
SyntenyLag0000286
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_030478973.1 uncharacterized protein LOC115696051 [Cannabis sativa]2.8e-8241.9Show/hide
Query:  MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMY
        M C+ +   S L+NGS + ++KP RG+ QGDPLSPYLFL+ AE LSSLL   ++     G+ I+   PSISHL FADD LIFC A  + C S+ +IL +Y
Subjt:  MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMY

Query:  EATSEQTTNLEKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMS
           S Q  N  KS  + S N   D          ++    +  YLG+P   AR K   F   K+KV   L  W    FS+ GKE L+KA++QAIP Y M+
Subjt:  EATSEQTTNLEKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMS

Query:  CFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSL
        CF+LP   C  I  A ARFWW SS  N KIHW +W +LC+SK++GG+ F  +  FNQAMLAKQ+WKI K P+ LL ++L  RYF +   L A  GHNPS 
Subjt:  CFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSL

Query:  TWRSILWGRDLFLA----------------------HSVRAYEQQN----------------EIIWAENSKGVFFVKSAYHLAVSVAEANEASCSVNEMF
         WRSILWGRDL                         +    Y+  +                E+IW+    G+F VKSAYHLA+S  +    S   N   
Subjt:  TWRSILWGRDLFLA----------------------HSVRAYEQQN----------------EIIWAENSKGVFFVKSAYHLAVSVAEANEASCSVNEMF

Query:  F
        F
Subjt:  F

XP_030483769.1 uncharacterized protein LOC115700339 [Cannabis sativa]1.6e-8045.91Show/hide
Query:  MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMY
        M C+ +  +S  +NG  S ++ P+RGL QGDPLSPYLFLI +E LS LL  E+++    G+ ++   P+ISHL FADD L+FC+A+++ C +I++ L +Y
Subjt:  MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMY

Query:  EATSEQTTNLEKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMS
           S Q  N +KS+   S N  +        +LG+   +    YLG+P+ + R K+ LF+  KEK+ K +  W + +FS GGKE L+KA+VQ+IPTY MS
Subjt:  EATSEQTTNLEKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMS

Query:  CFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSL
        CF+L K FC+++    ARFWW S+  N+KIHW  W  LCKSK  GGM F     FNQA+LAKQ+W+I + PN LLS++L GRY+ H D++ A+     SL
Subjt:  CFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSL

Query:  TWRSILWGRDLFLAHSVR
        TW+ I+WGR+L LA  +R
Subjt:  TWRSILWGRDLFLAHSVR

XP_030498122.1 uncharacterized protein LOC115713779 [Cannabis sativa]7.0e-8148.24Show/hide
Query:  MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMY
        + C++SV YS L+NG     +KP RG+ QGDPLSPYLFLI AE LS LL  +++     G+ ++   PS+SHLFFADD ++FCRA+ +  RSI+++L +Y
Subjt:  MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMY

Query:  EATSEQTTNLEKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMS
           S Q  N +K +   S N           LL +        YLG+PS + R K  LF+   +K+ K L  WKE LFS GGKE L+KA+VQAIPTY MS
Subjt:  EATSEQTTNLEKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMS

Query:  CFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSL
        CF+LP   C  I    A FWW S+     IHW NW+ LCK+K  GG+ F +   FNQA+LAKQ+W++L++PN LLS+IL  RYF HG  L+A +G+ PSL
Subjt:  CFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSL

Query:  TWRSILWGRDLFL
        TWRSI+WG++L +
Subjt:  TWRSILWGRDLFL

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa]8.8e-8440.61Show/hide
Query:  MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMY
        + C++SV YS L+NG+    + P RG+ QGDPLSPYLFLI AE LS LL  E+      G+ I+   PS+SHLFFADD ++FCRA+++  R+I + L  Y
Subjt:  MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMY

Query:  EATSEQTTNLEKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMS
           S Q  N EK +   S+N    +     DLLG+        YLG+PS + + K  LF    +K+ K L  WKE LFS GGKE L+KA+VQAIPTY MS
Subjt:  EATSEQTTNLEKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMS

Query:  CFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSL
        CF+LP   C  I    ARFWW S+   + IHW NW+ LCK+K  GG+ F +   FNQA+LAKQ+W+IL+ PN LLS +L  RYF +G+YL A +G NPSL
Subjt:  CFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSL

Query:  TWRSILWGRDLFL----------------------AHS-----------------------------------------------VRAYEQQNEIIWAEN
        TWRS++WG++L L                       H+                                               +  Y   + +IW ++
Subjt:  TWRSILWGRDLFL----------------------AHS-----------------------------------------------VRAYEQQNEIIWAEN

Query:  SKGVFFVKSAYHLAVSVAEANEASCS
          GV+ VKS YH AVS+AE ++++CS
Subjt:  SKGVFFVKSAYHLAVSVAEANEASCS

XP_030509188.1 uncharacterized protein LOC115723863 [Cannabis sativa]1.7e-8243.9Show/hide
Query:  MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMY
        + C+ +V YS L+NGS    + P RG+ QGDPLSPYLFLI AE LS LL  E+S     G+ I+   PS+SHLFFADD ++FCRA+ +  RSI + LQ Y
Subjt:  MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMY

Query:  EATSEQTTNLEKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMS
           S Q  N +K +   S N    +    + LLG+        YLG+PS   R K  LF    +K+ K L  WKE LFS GGKE L+KA+VQAIPTY MS
Subjt:  EATSEQTTNLEKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMS

Query:  CFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSL
        CF+LP   C  I    A FWW SS   + IHW NW+ LCK+K  GG+ F +   FNQA+LAKQ+W++++ PN LLSK+L  RYF +G +L++ +G NPSL
Subjt:  CFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSL

Query:  TWRSILWGRDLFLAH-----------SVRA-----------------YEQQNEIIWAENSKGVFFVKSAYHLAVSVAEANEASCS
        TWRS+    +L +A            S+RA                 +   + +IW+ +  G++ VKS Y LAVS AE ++ + S
Subjt:  TWRSILWGRDLFLAH-----------SVRA-----------------YEQQNEIIWAENSKGVFFVKSAYHLAVSVAEANEASCS

TrEMBL top hitse value%identityAlignment
A0A2N9GD63 Reverse transcriptase domain-containing protein6.2e-8342.47Show/hide
Query:  MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMY
        M C+ +V+YS+L++G P   I P RGL QGDPLSPYLFL+  E LS+L+ +  ++    G   + + P +SHLFFADD L+F +AS  + +   +ILQ+Y
Subjt:  MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMY

Query:  EATSEQTTNLEKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMS
        EA+S Q  N EK+    S N   +  + + DL G   T++   YLG+P+   R K S+FN  KE++ + LQGWKE   S+ G+E LIKA+ Q+IPTYTM+
Subjt:  EATSEQTTNLEKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMS

Query:  CFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSL
        CF+LPK +CD++N   A++WW  S+  RKIHW+ W +LC SK  GG+ F ++ +FN A+LAKQ W++L N   L  ++   +YF    +++A +G  PS 
Subjt:  CFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSL

Query:  TWRSILWGRDLFLAHSVRAYE--QQNEIIWAENSKGVFFVKSAYHLAVSVAEA-NEASCSVNEMF
         WRS L GRDL L      Y      +++W+E   G+F V+SAY L        N   CS +E +
Subjt:  TWRSILWGRDLFLAHSVRAYE--QQNEIIWAENSKGVFFVKSAYHLAVSVAEA-NEASCSVNEMF

A0A2N9GRT8 Reverse transcriptase domain-containing protein1.2e-8142.57Show/hide
Query:  MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMY
        M C+ +V+YS+L++G P   I P RG+ QGDPLSPY+FL+ AE LS++L +     H  G+ +    P +SHLFFADD L+F +A+ ++C  + +IL +Y
Subjt:  MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMY

Query:  EATSEQTTNLEKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMS
        E +S Q  N++K+    S N   D    +    G +   +   YLG+P+   R K S+FN  KE++ + LQGWKE   S+ G+E LIKA+ QAIPTY M+
Subjt:  EATSEQTTNLEKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMS

Query:  CFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSL
        CF+LPK +CD++N   AR+WW   +  RK+HW+ W +LC +KA GG+ F ++S FN A+LAKQ W+IL  P  L  ++   RYF +  +++A +G NPS 
Subjt:  CFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSL

Query:  TWRSILWGRDLFLAHSVRAYEQQNEI------IWAENSKGVFFVKSAYHL
         WRS LWGRD      ++  E Q++        W   S G+F VKSAY +
Subjt:  TWRSILWGRDLFLAHSVRAYEQQNEI------IWAENSKGVFFVKSAYHL

A0A2N9J4E8 Reverse transcriptase domain-containing protein2.8e-8342.74Show/hide
Query:  MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMY
        M C+ +V+YS+L++G P   I P RGL QGDPLSPYLFL+ AE LS+L+ +  ++    G   + + P +SHLFFADD L+F +AS  + +   +ILQ+Y
Subjt:  MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMY

Query:  EATSEQTTNLEKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMS
        EA+S Q  N EK+    S N   +  + + DL G   T++   YLG+P+   R K S+FN  KE++ + LQGWKE   S+ G+E LIKA+ Q+IPTYTM+
Subjt:  EATSEQTTNLEKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMS

Query:  CFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSL
        CF+LPK +CD++N   A++WW  S+  RKIHW+ W +LC SK  GG+ F ++ +FN A+LAKQ W++L N   L  ++   +YF    +++A +G  PS 
Subjt:  CFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSL

Query:  TWRSILWGRDLFLAHSVRAYE--QQNEIIWAENSKGVFFVKSAYHLAVSVAEA-NEASCSVNEMF
         WRS L GRDL L      Y      +++W+E   G+F V+SAY L        N   CS +E +
Subjt:  TWRSILWGRDLFLAHSVRAYE--QQNEIIWAENSKGVFFVKSAYHLAVSVAEA-NEASCSVNEMF

A0A803NM27 Uncharacterized protein9.9e-8145.66Show/hide
Query:  MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMY
        +NC+++ + S ++NG+ S  +KP+RGL QGDPLSPYLFLI +E LS LL  E+S+    G+ ++   PS+SHL FADD L+FC A ++ C +I+++L  Y
Subjt:  MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMY

Query:  EATSEQTTNLEKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMS
           S Q  N +KS+   S N      +   ++LG+   +    YLG+P+ + R K  LFNQ KE++ K L  W + +FS GGKE L+KA++Q+IPTY MS
Subjt:  EATSEQTTNLEKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMS

Query:  CFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSL
        CFKLP  FC +I    + FWW S+   +KIHW  W  LCKSK  GG+ F +   FNQA+LAKQ+W++ +NP  LL ++L GRYF   D+L+A      SL
Subjt:  CFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSL

Query:  TWRSILWGRDL
        TW+   WGR+L
Subjt:  TWRSILWGRDL

A0A803NTN0 Uncharacterized protein3.4e-8139.29Show/hide
Query:  MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMY
        MNC+ +  +S  +NG     ++P RGL QGDPLSPYLFLI +E  S LL  E+S  +  G+ +    PS+SHL FADD L+FCRA+++   +I++IL  Y
Subjt:  MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMY

Query:  EATSEQTTNLEKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMS
           S Q  N  KS+   S N         ++ L +  T+    YLG+PS + R K  LF+  KEKV K L  W E +FS GGKE L+KA+VQ+IPTY MS
Subjt:  EATSEQTTNLEKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMS

Query:  CFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSL
        CFKL K FC  +    A FWW ++Q   KIHW  W  LCKSK  GGM F     FNQA+LAKQ+W+I   PN LLS++L  RYF +  +L+A IGH+PS 
Subjt:  CFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSL

Query:  TWRSILWGRDL--------------------------------------------------------------------FLAHSVRAYEQQNEIIWAENS
        TW+SI WGRDL                                                                     L   +  +  Q+ +IW  +S
Subjt:  TWRSILWGRDL--------------------------------------------------------------------FLAHSVRAYEQQNEIIWAENS

Query:  KGVFFVKSAYHLAVSVAEANEASCS
         G++ VKS +HLA ++ E N++S S
Subjt:  KGVFFVKSAYHLAVSVAEANEASCS

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657502.5e-1728.57Show/hide
Query:  MPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMSCFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGG
        MP    R     F +  E+V   + GW+E   S  G+ TL KA++ ++P ++MS   LP+   + +++    F W S+   +K H + W ++C  K  GG
Subjt:  MPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMSCFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGG

Query:  MDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNP----SLTWRSILWGRDLFLAHSV
        +        N+A+++K  W++L+  N L + +L  +Y  H   +  +    P    S TWRSI  G    ++H V
Subjt:  MDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNP----SLTWRSILWGRDLFLAHSV

P11369 LINE-1 retrotransposable element ORF2 protein3.2e-1226.82Show/hide
Query:  SILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMYEATSEQTTN
        +I VNG   E I  + G  QG PLSPYLF I  EVL+  + ++K +    GI I      IS L  ADD +++    +   R +  ++  +        N
Subjt:  SILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMYEATSEQTTN

Query:  LEKSI-FMTSKNIGVDK-IKGLSDL-LGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIV--QAIPTYTMSCFKL
          KS+ F+ +KN   +K I+  +   +   + K +G  L    ++   KN  F   K+++ + L+ WK+   S  G+  ++K  +  +AI  +     K+
Subjt:  LEKSI-FMTSKNIGVDK-IKGLSDL-LGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIV--QAIPTYTMSCFKL

Query:  PKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSW
        P  F +++  A  +F W     N K   +    L   +  GG+   D+ L+ +A++ K +W
Subjt:  PKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSW

P92555 Uncharacterized mitochondrial protein AtMg012504.2e-1250.75Show/hide
Query:  LVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADD
        ++NG+P   + P RGL QGDPLSPYLF++  EVLS L  R +      GI ++N  P I+HL FADD
Subjt:  LVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADD

P93295 Uncharacterized mitochondrial protein AtMg003101.5e-3048.33Show/hide
Query:  AIPTYTMSCFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKA-LGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLN
        A+P Y MSCF+L K  C  +  A   FWW S +  RKI W+ W +LCKSK   GG+ F D+  FNQA+LAKQS++I+  P+ LLS++L  RYF H   + 
Subjt:  AIPTYTMSCFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKA-LGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLN

Query:  ATIGHNPSLTWRSILWGRDL
         ++G  PS  WRSI+ GR+L
Subjt:  ATIGHNPSLTWRSILWGRDL

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein3.3e-2843.48Show/hide
Query:  AIPTYTMSCFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNA
        A+PTYTM+CF LPK  C  I    A FWW + Q  + +HW  W  L   KA GG+ F DI  FN A+L KQ W++L  P  L++K+   RYF   D LNA
Subjt:  AIPTYTMSCFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNA

Query:  TIGHNPSLTWRSILWGRDLFLAHSVRAYEQQNE--IIW
         +G  PS  W+SI   +++ L    RA     E  IIW
Subjt:  TIGHNPSLTWRSILWGRDLFLAHSVRAYEQQNE--IIW

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.1e-3148.33Show/hide
Query:  AIPTYTMSCFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKA-LGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLN
        A+P Y MSCF+L K  C  +  A   FWW S +  RKI W+ W +LCKSK   GG+ F D+  FNQA+LAKQS++I+  P+ LLS++L  RYF H   + 
Subjt:  AIPTYTMSCFKLPKFFCDDINRACARFWWVSSQGNRKIHWMNWHRLCKSKA-LGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLN

Query:  ATIGHNPSLTWRSILWGRDL
         ++G  PS  WRSI+ GR+L
Subjt:  ATIGHNPSLTWRSILWGRDL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)3.0e-1350.75Show/hide
Query:  LVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADD
        ++NG+P   + P RGL QGDPLSPYLF++  EVLS L  R +      GI ++N  P I+HL FADD
Subjt:  LVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTGCATGGAATCTGTCGAGTACTCAATCCTAGTGAATGGATCCCCAAGCGAGACTATCAAACCTGAAAGAGGGCTGGGGCAGGGCGACCCATTGTCCCCCTACCT
TTTCCTTATACGCGCAGAAGTCCTCTCAAGCTTATTACTCAGGGAAAAATCTCTCTCTCATTTTAACGGTATTTGTATTAATAATCTCTACCCCTCTATCTCTCACTTAT
TTTTCGCTGATGACAGATTGATTTTTTGTAGGGCATCTGAGAAAGATTGTAGGAGTATTAGAAAGATCCTCCAGATGTACGAAGCGACTTCGGAGCAAACCACCAACCTT
GAGAAGTCAATCTTCATGACAAGCAAAAACATTGGTGTTGATAAGATCAAAGGGCTTTCGGATCTTTTGGGAATAAAGCATACCAAGTCTATAGGTCACTACCTTGGTAT
GCCTTCCCAAAACGCTAGACGCAAGAACTCCCTGTTCAATCAGCGTAAGGAGAAAGTGGGGAAAACGCTTCAGGGTTGGAAGGAGTCTCTTTTCTCTCAAGGAGGCAAAG
AGACCCTGATCAAGGCCATCGTTCAAGCGATTCCTACTTATACTATGTCATGTTTTAAACTTCCAAAATTTTTTTGTGATGATATTAACAGGGCTTGTGCTCGTTTCTGG
TGGGTTTCCTCTCAAGGGAATAGAAAGATTCACTGGATGAACTGGCATCGTCTTTGCAAAAGCAAAGCCTTGGGAGGGATGGACTTCTGGGATATTAGCCTTTTCAATCA
AGCGATGTTGGCCAAGCAAAGCTGGAAGATTTTGAAAAATCCAAATTGCCTCCTTTCCAAGATCTTGATGGGCAGGTATTTCAAGCATGGAGACTATTTGAATGCCACTA
TAGGCCACAATCCTTCGCTAACCTGGAGAAGCATTCTATGGGGTCGAGATCTCTTCTTGGCTCACTCGGTTAGAGCTTATGAACAGCAAAATGAGATTATTTGGGCAGAA
AACTCTAAAGGTGTATTCTTCGTGAAATCCGCCTATCATCTTGCTGTTTCTGTGGCCGAAGCCAATGAGGCATCTTGTTCTGTTAATGAAATGTTCTTCACTTCTTCTCA
GAATATTGAATTTCAGTTCATTCCATAA
mRNA sequenceShow/hide mRNA sequence
ATGAATTGCATGGAATCTGTCGAGTACTCAATCCTAGTGAATGGATCCCCAAGCGAGACTATCAAACCTGAAAGAGGGCTGGGGCAGGGCGACCCATTGTCCCCCTACCT
TTTCCTTATACGCGCAGAAGTCCTCTCAAGCTTATTACTCAGGGAAAAATCTCTCTCTCATTTTAACGGTATTTGTATTAATAATCTCTACCCCTCTATCTCTCACTTAT
TTTTCGCTGATGACAGATTGATTTTTTGTAGGGCATCTGAGAAAGATTGTAGGAGTATTAGAAAGATCCTCCAGATGTACGAAGCGACTTCGGAGCAAACCACCAACCTT
GAGAAGTCAATCTTCATGACAAGCAAAAACATTGGTGTTGATAAGATCAAAGGGCTTTCGGATCTTTTGGGAATAAAGCATACCAAGTCTATAGGTCACTACCTTGGTAT
GCCTTCCCAAAACGCTAGACGCAAGAACTCCCTGTTCAATCAGCGTAAGGAGAAAGTGGGGAAAACGCTTCAGGGTTGGAAGGAGTCTCTTTTCTCTCAAGGAGGCAAAG
AGACCCTGATCAAGGCCATCGTTCAAGCGATTCCTACTTATACTATGTCATGTTTTAAACTTCCAAAATTTTTTTGTGATGATATTAACAGGGCTTGTGCTCGTTTCTGG
TGGGTTTCCTCTCAAGGGAATAGAAAGATTCACTGGATGAACTGGCATCGTCTTTGCAAAAGCAAAGCCTTGGGAGGGATGGACTTCTGGGATATTAGCCTTTTCAATCA
AGCGATGTTGGCCAAGCAAAGCTGGAAGATTTTGAAAAATCCAAATTGCCTCCTTTCCAAGATCTTGATGGGCAGGTATTTCAAGCATGGAGACTATTTGAATGCCACTA
TAGGCCACAATCCTTCGCTAACCTGGAGAAGCATTCTATGGGGTCGAGATCTCTTCTTGGCTCACTCGGTTAGAGCTTATGAACAGCAAAATGAGATTATTTGGGCAGAA
AACTCTAAAGGTGTATTCTTCGTGAAATCCGCCTATCATCTTGCTGTTTCTGTGGCCGAAGCCAATGAGGCATCTTGTTCTGTTAATGAAATGTTCTTCACTTCTTCTCA
GAATATTGAATTTCAGTTCATTCCATAA
Protein sequenceShow/hide protein sequence
MNCMESVEYSILVNGSPSETIKPERGLGQGDPLSPYLFLIRAEVLSSLLLREKSLSHFNGICINNLYPSISHLFFADDRLIFCRASEKDCRSIRKILQMYEATSEQTTNL
EKSIFMTSKNIGVDKIKGLSDLLGIKHTKSIGHYLGMPSQNARRKNSLFNQRKEKVGKTLQGWKESLFSQGGKETLIKAIVQAIPTYTMSCFKLPKFFCDDINRACARFW
WVSSQGNRKIHWMNWHRLCKSKALGGMDFWDISLFNQAMLAKQSWKILKNPNCLLSKILMGRYFKHGDYLNATIGHNPSLTWRSILWGRDLFLAHSVRAYEQQNEIIWAE
NSKGVFFVKSAYHLAVSVAEANEASCSVNEMFFTSSQNIEFQFIP