; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025592 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025592
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr10:15787806..15789865
RNA-Seq ExpressionLag0025592
SyntenyLag0025592
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa]4.8e-5642.78Show/hide
Query:  LDRNNFLLWKNLALPILRSYKLEGHLSGNSPCPPQYIQPSAVESVPNSTRTEEGA-----------------------------------TSVQTVASGA
        LDR N+LLWK LALPIL+ YKLEGHL+G +PCP  ++  +   S  N+T TEEGA                                   +    VA   
Subjt:  LDRNNFLLWKNLALPILRSYKLEGHLSGNSPCPPQYIQPSAVESVPNSTRTEEGA-----------------------------------TSVQTVASGA

Query:  SGFHQLQKL---IHYLFGVQSRAEEDYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGRIGITWSKIQA
         GF  ++ L       FGVQSRAEED+LRQ+ Q +RKGN KM +YL VMK ++DNLGQ GSPV  R+LI QVLLGLDE YNLVI ++QG+  I+W  +Q+
Subjt:  SGFHQLQKL---IHYLFGVQSRAEEDYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGRIGITWSKIQA

Query:  ELLVFEKRLEMQNTHKSSL---SFSQNASVNLANSKEIGNQRNQPNSFNGRQNNFNRGNQRGNGNGSRGRGRGRRYGQYNNNKPICQICGKIGHTALMCY
        +LL+FEK L+ QNT K      + +Q+ ++N+A    +  QRN  N      N  +   QRGN                 NN P CQ+CGK GH+AL+CY
Subjt:  ELLVFEKRLEMQNTHKSSL---SFSQNASVNLANSKEIGNQRNQPNSFNGRQNNFNRGNQRGNGNGSRGRGRGRRYGQYNNNKPICQICGKIGHTALMCY

Query:  QRFNKEYSGP---SQGQNKGDGNVSRPNNQVPSTQTTAFVANQNVNSFVASPDTVIDLNY
         RFNKE+S P    + ++  +G+VS PN  V       FV+ QN   F A+PDTV+D N+
Subjt:  QRFNKEYSGP---SQGQNKGDGNVSRPNNQVPSTQTTAFVANQNVNSFVASPDTVIDLNY

TYK05754.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.4e-5441.69Show/hide
Query:  TSVQTVASGASGFHQLQKLIHYLFGVQSRAEEDYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGRIGI
        T +     G +    L +    LFGVQSRAEED+LRQ+FQ +RK      DYLR+MK + D LGQAGSPV  R+ I Q LLGLDE YN VIA++QG+  I
Subjt:  TSVQTVASGASGFHQLQKLIHYLFGVQSRAEEDYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGRIGI

Query:  TWSKIQAELLVFEKRLEMQNTHKSSLSFSQNASVNLANSKEIGNQRNQPN-SFNGRQNNFNRGNQRGNGNGSRGRGRGRRYGQYNNNKPICQICGKIGHT
        +W  +Q+ELL FEKRLE Q+T K++ +  QN  VN+A ++   + R   N  F+G   N ++G QRG  N  RGRG+GR       NKP CQ+C K GH+
Subjt:  TWSKIQAELLVFEKRLEMQNTHKSSLSFSQNASVNLANSKEIGNQRNQPN-SFNGRQNNFNRGNQRGNGNGSRGRGRGRRYGQYNNNKPICQICGKIGHT

Query:  ALMCYQRFNKEYSGP-SQGQNKGDGNVSRPNNQVPSTQTTAFVANQNVNSFVASPDTVIDLNY-------------------------------------
        AL+CY RFNKE+  P  Q +     N S+ +N       T  V  Q+VN F A+ DTVI+LN+                                     
Subjt:  ALMCYQRFNKEYSGP-SQGQNKGDGNVSRPNNQVPSTQTTAFVANQNVNSFVASPDTVIDLNY-------------------------------------

Query:  ------------------LENVLCVPSIAKNLVSISKLSRD-NVFVEFHDTFCLVKAKDTGKAVWHR
                          L+NVLCVP I KNLVS+SKL++D NV++EFH  +C +K KDTG+ + +R
Subjt:  ------------------LENVLCVPSIAKNLVSISKLSRD-NVFVEFHDTFCLVKAKDTGKAVWHR

XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo]5.3e-4734.14Show/hide
Query:  LDRNNFLLWKNLALPILRSYKLEGHLSGNSPCPPQYIQPSAVESVPNSTRTEEGA-----------------------------------TSVQTVASGA
        LDR N+LLWK LALPIL+ YKLEGHL+  +PCP  ++  +   S  N+T TEEGA                                   +    VA   
Subjt:  LDRNNFLLWKNLALPILRSYKLEGHLSGNSPCPPQYIQPSAVESVPNSTRTEEGA-----------------------------------TSVQTVASGA

Query:  SGFHQLQKL---IHYLFGVQSRAEEDYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGRIGITWSKIQA
         GF  ++ L       FGVQSRAEED+LRQ+ Q +RK                                     GLDE YNLVI ++QG+  I+W  +Q+
Subjt:  SGFHQLQKL---IHYLFGVQSRAEEDYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGRIGITWSKIQA

Query:  ELLVFEKRLEMQNTHKSSL-SFSQNASVNLANSKEIGNQRNQPNSFNGRQNNFNRGNQRGNGNGSRGRGRGRRYGQYNNNKPICQICGKIGHTALMCYQR
        +LL+FEKRL+ QNT K +  + +Q+ ++N+A    +  QRNQ N      N  +   QRGN                 NN P CQ+CGK GH+AL+CY R
Subjt:  ELLVFEKRLEMQNTHKSSL-SFSQNASVNLANSKEIGNQRNQPNSFNGRQNNFNRGNQRGNGNGSRGRGRGRRYGQYNNNKPICQICGKIGHTALMCYQR

Query:  FNKEYSGP---SQGQNKGDGNVSRPNNQVPSTQTTAFVANQNVNSFVASPDTVIDLNY------------------------------------------
        FNKE+S P   ++ ++  +G+VS PN  V       FV+ QN   F A+PDTV+D N+                                          
Subjt:  FNKEYSGP---SQGQNKGDGNVSRPNNQVPSTQTTAFVANQNVNSFVASPDTVIDLNY------------------------------------------

Query:  -------------LENVLCVPSIAKNLVSISKLSRDN-VFVEFHDTFCLVKAKDTGK
                     L+N+LCVP IAKNL+S+SKL++DN +++EFH   C +K K TGK
Subjt:  -------------LENVLCVPSIAKNLVSISKLSRDN-VFVEFHDTFCLVKAKDTGK

XP_038905161.1 uncharacterized protein LOC120091275 isoform X1 [Benincasa hispida]9.6e-4944.14Show/hide
Query:  PQYIQPSAVESVPNSTRTEEGATSVQTVASGASGFHQLQKLIHYLFGVQSRAEEDYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQ
        PQY    AV+ +            V     G      L   I  LFGVQSR EEDYLR VFQ +RKGN+KM +YL+ MK + DNL QAGSP+  R+L+ Q
Subjt:  PQYIQPSAVESVPNSTRTEEGATSVQTVASGASGFHQLQKLIHYLFGVQSRAEEDYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQ

Query:  VLLGLDEEYNLVIAMVQGRIGITWSKIQAELLVFEKRLEMQNTHKSSLSFSQ--NASVNLANSKEIGNQRNQPNSFNGRQNNFNRGNQRGNGNGSRGRGR
        VLLGLDEEYN ++AM+QGR+ ++W  +Q+ELL++E+RLE Q+  K+++ F+Q  NASVN+ N++ + NQ N+ NS     N    G QRG G   RGRGR
Subjt:  VLLGLDEEYNLVIAMVQGRIGITWSKIQAELLVFEKRLEMQNTHKSSLSFSQ--NASVNLANSKEIGNQRNQPNSFNGRQNNFNRGNQRGNGNGSRGRGR

Query:  GRRYGQYNNNKPICQICGKIGHTALMCYQRFNKEYSGPSQGQNKGDGNVSRPNNQVPSTQ--TTAFVANQNVNSFVASPDTVIDLNYLEN
        GR     NN KP+CQ+CGK+GH A  C+ R+++++  P+  QNK +     PNNQ  +TQ   TA       N F+   + + D N+ ++
Subjt:  GRRYGQYNNNKPICQICGKIGHTALMCYQRFNKEYSGPSQGQNKGDGNVSRPNNQVPSTQ--TTAFVANQNVNSFVASPDTVIDLNYLEN

XP_038905164.1 uncharacterized protein LOC120091275 isoform X4 [Benincasa hispida]9.6e-4944.14Show/hide
Query:  PQYIQPSAVESVPNSTRTEEGATSVQTVASGASGFHQLQKLIHYLFGVQSRAEEDYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQ
        PQY    AV+ +            V     G      L   I  LFGVQSR EEDYLR VFQ +RKGN+KM +YL+ MK + DNL QAGSP+  R+L+ Q
Subjt:  PQYIQPSAVESVPNSTRTEEGATSVQTVASGASGFHQLQKLIHYLFGVQSRAEEDYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQ

Query:  VLLGLDEEYNLVIAMVQGRIGITWSKIQAELLVFEKRLEMQNTHKSSLSFSQ--NASVNLANSKEIGNQRNQPNSFNGRQNNFNRGNQRGNGNGSRGRGR
        VLLGLDEEYN ++AM+QGR+ ++W  +Q+ELL++E+RLE Q+  K+++ F+Q  NASVN+ N++ + NQ N+ NS     N    G QRG G   RGRGR
Subjt:  VLLGLDEEYNLVIAMVQGRIGITWSKIQAELLVFEKRLEMQNTHKSSLSFSQ--NASVNLANSKEIGNQRNQPNSFNGRQNNFNRGNQRGNGNGSRGRGR

Query:  GRRYGQYNNNKPICQICGKIGHTALMCYQRFNKEYSGPSQGQNKGDGNVSRPNNQVPSTQ--TTAFVANQNVNSFVASPDTVIDLNYLEN
        GR     NN KP+CQ+CGK+GH A  C+ R+++++  P+  QNK +     PNNQ  +TQ   TA       N F+   + + D N+ ++
Subjt:  GRRYGQYNNNKPICQICGKIGHTALMCYQRFNKEYSGPSQGQNKGDGNVSRPNNQVPSTQ--TTAFVANQNVNSFVASPDTVIDLNYLEN

TrEMBL top hitse value%identityAlignment
A0A1S4E1U6 uncharacterized protein LOC107991581 isoform X12.6e-4734.14Show/hide
Query:  LDRNNFLLWKNLALPILRSYKLEGHLSGNSPCPPQYIQPSAVESVPNSTRTEEGA-----------------------------------TSVQTVASGA
        LDR N+LLWK LALPIL+ YKLEGHL+  +PCP  ++  +   S  N+T TEEGA                                   +    VA   
Subjt:  LDRNNFLLWKNLALPILRSYKLEGHLSGNSPCPPQYIQPSAVESVPNSTRTEEGA-----------------------------------TSVQTVASGA

Query:  SGFHQLQKL---IHYLFGVQSRAEEDYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGRIGITWSKIQA
         GF  ++ L       FGVQSRAEED+LRQ+ Q +RK                                     GLDE YNLVI ++QG+  I+W  +Q+
Subjt:  SGFHQLQKL---IHYLFGVQSRAEEDYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGRIGITWSKIQA

Query:  ELLVFEKRLEMQNTHKSSL-SFSQNASVNLANSKEIGNQRNQPNSFNGRQNNFNRGNQRGNGNGSRGRGRGRRYGQYNNNKPICQICGKIGHTALMCYQR
        +LL+FEKRL+ QNT K +  + +Q+ ++N+A    +  QRNQ N      N  +   QRGN                 NN P CQ+CGK GH+AL+CY R
Subjt:  ELLVFEKRLEMQNTHKSSL-SFSQNASVNLANSKEIGNQRNQPNSFNGRQNNFNRGNQRGNGNGSRGRGRGRRYGQYNNNKPICQICGKIGHTALMCYQR

Query:  FNKEYSGP---SQGQNKGDGNVSRPNNQVPSTQTTAFVANQNVNSFVASPDTVIDLNY------------------------------------------
        FNKE+S P   ++ ++  +G+VS PN  V       FV+ QN   F A+PDTV+D N+                                          
Subjt:  FNKEYSGP---SQGQNKGDGNVSRPNNQVPSTQTTAFVANQNVNSFVASPDTVIDLNY------------------------------------------

Query:  -------------LENVLCVPSIAKNLVSISKLSRDN-VFVEFHDTFCLVKAKDTGK
                     L+N+LCVP IAKNL+S+SKL++DN +++EFH   C +K K TGK
Subjt:  -------------LENVLCVPSIAKNLVSISKLSRDN-VFVEFHDTFCLVKAKDTGK

A0A2K3LRE0 Glutamate receptor2.9e-4334.73Show/hide
Query:  LDRNNFLLWKNLALPILRSYKLEGHLSGNSPCPPQYIQPSAVESVPNSTRTEEGATSVQTVA-----SGASGFHQLQKLIHY------------LFGVQS
        LDR NF LWK+L LP+++  KL+G+L G   CP QYI  +   S  ++   E+     Q +      S   G     +L+H             L G  +
Subjt:  LDRNNFLLWKNLALPILRSYKLEGHLSGNSPCPPQYIQPSAVESVPNSTRTEEGATSVQTVA-----SGASGFHQLQKLIHY------------LFGVQS

Query:  RAEEDYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGRIGITWSKIQAELLVFEKRLEMQNTHKSSLSF
        R+   YL+  F  SRKG +KM +YL  MK   D L  AGS ++   L++Q L GLD +YN V+  +  +I ++W ++QA+LL FE RLE  N      + 
Subjt:  RAEEDYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGRIGITWSKIQAELLVFEKRLEMQNTHKSSLSF

Query:  SQNASVNLANSKEIGNQRNQPNSFNGRQNNFNRGNQRGNGNGSRGRGR--GRRYGQYNNNKPICQICGKIGHTALMCYQRFNKEYSGP----SQGQNKGD
        + NA+ N+AN             F+G     NR N RGN  GS  RG   GR  G+++N+KP+CQ+C    HTA+ C+ RF+K Y+GP    S+ + +G 
Subjt:  SQNASVNLANSKEIGNQRNQPNSFNGRQNNFNRGNQRGNGNGSRGRGR--GRRYGQYNNNKPICQICGKIGHTALMCYQRFNKEYSGP----SQGQNKGD

Query:  GNV----------------SRPNNQVP------------STQTTAFVANQNVNSFVASPDTVIDLNYLENVLCVPSIAKNLVSISKLSRD-NVFVEFHDT
         N                 S  +N V             + + +  V N      VAS  + I+   L NVL VP I KNL+S+SKL+ D N  VEF   
Subjt:  GNV----------------SRPNNQVP------------STQTTAFVANQNVNSFVASPDTVIDLNYLENVLCVPSIAKNLVSISKLSRD-NVFVEFHDT

Query:  FCLVKAKDTGKAVWHRHLGHLSDPQYSTS
        +C VK K TGK +     G L D  Y  S
Subjt:  FCLVKAKDTGKAVWHRHLGHLSDPQYSTS

A0A5A7SIT7 Uncharacterized protein2.3e-5642.78Show/hide
Query:  LDRNNFLLWKNLALPILRSYKLEGHLSGNSPCPPQYIQPSAVESVPNSTRTEEGA-----------------------------------TSVQTVASGA
        LDR N+LLWK LALPIL+ YKLEGHL+G +PCP  ++  +   S  N+T TEEGA                                   +    VA   
Subjt:  LDRNNFLLWKNLALPILRSYKLEGHLSGNSPCPPQYIQPSAVESVPNSTRTEEGA-----------------------------------TSVQTVASGA

Query:  SGFHQLQKL---IHYLFGVQSRAEEDYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGRIGITWSKIQA
         GF  ++ L       FGVQSRAEED+LRQ+ Q +RKGN KM +YL VMK ++DNLGQ GSPV  R+LI QVLLGLDE YNLVI ++QG+  I+W  +Q+
Subjt:  SGFHQLQKL---IHYLFGVQSRAEEDYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGRIGITWSKIQA

Query:  ELLVFEKRLEMQNTHKSSL---SFSQNASVNLANSKEIGNQRNQPNSFNGRQNNFNRGNQRGNGNGSRGRGRGRRYGQYNNNKPICQICGKIGHTALMCY
        +LL+FEK L+ QNT K      + +Q+ ++N+A    +  QRN  N      N  +   QRGN                 NN P CQ+CGK GH+AL+CY
Subjt:  ELLVFEKRLEMQNTHKSSL---SFSQNASVNLANSKEIGNQRNQPNSFNGRQNNFNRGNQRGNGNGSRGRGRGRRYGQYNNNKPICQICGKIGHTALMCY

Query:  QRFNKEYSGP---SQGQNKGDGNVSRPNNQVPSTQTTAFVANQNVNSFVASPDTVIDLNY
         RFNKE+S P    + ++  +G+VS PN  V       FV+ QN   F A+PDTV+D N+
Subjt:  QRFNKEYSGP---SQGQNKGDGNVSRPNNQVPSTQTTAFVANQNVNSFVASPDTVIDLNY

A0A5D3C373 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-5441.69Show/hide
Query:  TSVQTVASGASGFHQLQKLIHYLFGVQSRAEEDYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGRIGI
        T +     G +    L +    LFGVQSRAEED+LRQ+FQ +RK      DYLR+MK + D LGQAGSPV  R+ I Q LLGLDE YN VIA++QG+  I
Subjt:  TSVQTVASGASGFHQLQKLIHYLFGVQSRAEEDYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGRIGI

Query:  TWSKIQAELLVFEKRLEMQNTHKSSLSFSQNASVNLANSKEIGNQRNQPN-SFNGRQNNFNRGNQRGNGNGSRGRGRGRRYGQYNNNKPICQICGKIGHT
        +W  +Q+ELL FEKRLE Q+T K++ +  QN  VN+A ++   + R   N  F+G   N ++G QRG  N  RGRG+GR       NKP CQ+C K GH+
Subjt:  TWSKIQAELLVFEKRLEMQNTHKSSLSFSQNASVNLANSKEIGNQRNQPN-SFNGRQNNFNRGNQRGNGNGSRGRGRGRRYGQYNNNKPICQICGKIGHT

Query:  ALMCYQRFNKEYSGP-SQGQNKGDGNVSRPNNQVPSTQTTAFVANQNVNSFVASPDTVIDLNY-------------------------------------
        AL+CY RFNKE+  P  Q +     N S+ +N       T  V  Q+VN F A+ DTVI+LN+                                     
Subjt:  ALMCYQRFNKEYSGP-SQGQNKGDGNVSRPNNQVPSTQTTAFVANQNVNSFVASPDTVIDLNY-------------------------------------

Query:  ------------------LENVLCVPSIAKNLVSISKLSRD-NVFVEFHDTFCLVKAKDTGKAVWHR
                          L+NVLCVP I KNLVS+SKL++D NV++EFH  +C +K KDTG+ + +R
Subjt:  ------------------LENVLCVPSIAKNLVSISKLSRD-NVFVEFHDTFCLVKAKDTGKAVWHR

A0A6J1DCW4 uncharacterized protein LOC1110195981.6e-4137.84Show/hide
Query:  LDRNNFLLWKNLALPILRSYKLEGHLSGNSPCPPQYIQP----------SAVESVPNSTRTEEG----------------ATSVQTVASGASGFHQLQKL
        +DR NFLLW+NLALPILRSYKL  +L+G+ PCPP ++ P          ++ +S P    T E                 A  V     G S   +L   
Subjt:  LDRNNFLLWKNLALPILRSYKLEGHLSGNSPCPPQYIQP----------SAVESVPNSTRTEEG----------------ATSVQTVASGASGFHQLQKL

Query:  IHYLFGVQSRAEEDYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGRIGITWSKIQAELLVFEKRLEMQ
        +  LFGVQSRAE DYL+QVFQQ+ KG+++M +YL++MK+H DNL  AGS V+ R L+ QVL GLDEEYN ++  VQG++ ++WS++ AELL +EKRLE Q
Subjt:  IHYLFGVQSRAEEDYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGRIGITWSKIQAELLVFEKRLEMQ

Query:  NTHKSSLSF--SQNASVNLANSKEIGNQRNQPNSFNGRQNNFNRGN--QRGNGNGSRGRGRGRRYGQY-----NNNKPICQICGKIGHTALMCYQRFNKE
        N+ KS +    +Q  SVN  + +     +   N  N   +N +RG   QRG+  G R RGRG +  Q+     +N+ P          T        +  
Subjt:  NTHKSSLSF--SQNASVNLANSKEIGNQRNQPNSFNGRQNNFNRGN--QRGNGNGSRGRGRGRRYGQY-----NNNKPICQICGKIGHTALMCYQRFNKE

Query:  YSGPSQGQNKGDGNVSRPNNQVPSTQT-TAFVANQNVNSFVASPDTVIDLN----YLENVLCVPSIAKNL
        +   S   +    N +    +V  + T    VAN N  S      T I  +     L++VL VP IAKNL
Subjt:  YSGPSQGQNKGDGNVSRPNNQVPSTQT-TAFVANQNVNSFVASPDTVIDLN----YLENVLCVPSIAKNL

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.1e-1025.6Show/hide
Query:  LDRNNFLLWKNLALPILRSYKLEGHLSGNSPCPPQYIQPSAVESV-PNSTRTEEG------------ATSVQTVASGASGFHQLQKLIHYLFGVQSRAEE
        L   N+L+W      +   Y+L G L G++  PP  I   A   V P+ TR +              + SVQ   S A+   Q+ + +  ++   S    
Subjt:  LDRNNFLLWKNLALPILRSYKLEGHLSGNSPCPPQYIQPSAVESV-PNSTRTEEG------------ATSVQTVASGASGFHQLQKLIHYLFGVQSRAEE

Query:  DYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGR-IGITWSKIQAELLVFEKRLEMQNTHKSSLSFSQN
          LR   +Q  KG   + DY++ +    D L   G P+     + +VL  L EEY  VI  +  +    T ++I   LL           H+S +    +
Subjt:  DYLRQVFQQSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGR-IGITWSKIQAELLVFEKRLEMQNTHKSSLSFSQN

Query:  ASVNLANSKEIGNQRNQPNSFNGRQNNFNRGNQRGNGNGSRGRGRGRRYGQYNNN--KPI---CQICGKIGHTALMCYQRFNKEYSGPSQ----------
        A+V    +  + ++     + N   N  NR + R N N S+   +       NNN  KP    CQICG  GH+A  C Q  +   S  SQ          
Subjt:  ASVNLANSKEIGNQRNQPNSFNGRQNNFNRGNQRGNGNGSRGRGRGRRYGQYNNN--KPI---CQICGKIGHTALMCYQRFNKEYSGPSQ----------

Query:  -GQNKGDGNVSRPNNQVPSTQTTAFVANQNVNSFVASPDT-----------VIDLNY--------------LENVLCVPSIAKNLVSISKLSRDN-VFVE
           N   G+    NN +  +  T  + +   N  +  P T            I +++              L N+L VP+I KNL+S+ +L   N V VE
Subjt:  -GQNKGDGNVSRPNNQVPSTQTTAFVANQNVNSFVASPDT-----------VIDLNY--------------LENVLCVPSIAKNLVSISKLSRDN-VFVE

Query:  FHDTFCLVKAKDTG
        F      VK  +TG
Subjt:  FHDTFCLVKAKDTG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.5e-0424.32Show/hide
Query:  LDRNNFLLWKNLALPILRSYKLEGHLSGNSPCPPQYIQPSAVESV-PNSTRTEEGATSVQTVASGASGFHQLQKLIHYLFGVQSRAEEDYLRQVFQQSRK
        L   N+L+W      +   Y+L G L G++P PP  I   AV  V P+ TR       + +   GA        +       Q     + LR+++     
Subjt:  LDRNNFLLWKNLALPILRSYKLEGHLSGNSPCPPQYIQPSAVESV-PNSTRTEEGATSVQTVASGASGFHQLQKLIHYLFGVQSRAEEDYLRQVFQQSRK

Query:  GNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGR-IGITWSKIQAELLVFEKRLEMQNTHKSSLSFSQNASVNLANSKEIG
        G++    ++       D L   G P+     + +VL  L ++Y  VI  +  +    + ++I   L+  E +L   N+ +         + N+   +   
Subjt:  GNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGR-IGITWSKIQAELLVFEKRLEMQNTHKSSLSFSQNASVNLANSKEIG

Query:  NQRNQPNSFNGRQNNFNRGNQRGNG---NGSRGRGRGRRYGQYNNNKPICQICGKIGHTALMCYQRFNKEYSGPSQ--------GQNKGDGNVSRPNNQ-
          RNQ N   G   N+N  N R N    + S  R   R+   Y      CQIC   GH+A  C Q    + +   Q         Q + +  V+ P N  
Subjt:  NQRNQPNSFNGRQNNFNRGNQRGNG---NGSRGRGRGRRYGQYNNNKPICQICGKIGHTALMCYQRFNKEYSGPSQ--------GQNKGDGNVSRPNNQ-

Query:  ---VPSTQTTAFVANQNVNSF-----------VASPDTV----------------IDLNYLENVLCVPSIAKNLVSISKLSRDN-VFVEFHDTFCLVKAK
           + S  T    ++ N  SF           +A   T+                +DLN    VL VP+I KNL+S+ +L   N V VEF      VK  
Subjt:  ---VPSTQTTAFVANQNVNSF-----------VASPDTV----------------IDLNYLENVLCVPSIAKNLVSISKLSRDN-VFVEFHDTFCLVKAK

Query:  DTG
        +TG
Subjt:  DTG

Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)4.1e-0522.67Show/hide
Query:  LDRNNFLLWKNLALPILRSYKLEGHLSGNSPCPPQ-----YIQPSAVESVPNSTRTEEGATSVQTVASGASGFHQLQKLIHYLFGVQSRAEEDYLRQVFQ
        L++ N+ +W+ L   +  S+ + GH+ G+S   P        +   V+     T T+   + + T+         L   +  LF     A         +
Subjt:  LDRNNFLLWKNLALPILRSYKLEGHLSGNSPCPPQ-----YIQPSAVESVPNSTRTEEGATSVQTVASGASGFHQLQKLIHYLFGVQSRAEEDYLRQVFQ

Query:  QSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGRIGI-TWSKIQAELLVFEKRLEMQNTHKSSLSFSQNASVNLANS
         +   ++ + +Y + +K+  D L    SP++ R L++ +L GL E+Y+ ++ +++ +    ++++ ++ LL+ E RL   N  KSSLS + + S++    
Subjt:  QSRKGNIKMADYLRVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGRIGI-TWSKIQAELLVFEKRLEMQNTHKSSLSFSQNASVNLANS

Query:  KEIGNQRNQPNSFNGRQNNFNRGNQRGNGNGSRGRGRGRRYGQYNNN
             Q   P  ++   +N  RG  +      + RG G   G+YNNN
Subjt:  KEIGNQRNQPNSFNGRQNNFNRGNQRGNGNGSRGRGRGRRYGQYNNN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGTTGGATCGCAACAATTTTCTTCTGTGGAAGAATTTGGCCCTTCCCATCTTGCGAAGCTACAAATTGGAAGGACATCTATCAGGTAATAGTCCTTGCCCTCCTCA
GTACATTCAACCCTCAGCTGTGGAATCAGTTCCGAATTCCACAAGAACAGAGGAAGGAGCAACCAGTGTTCAGACGGTTGCTAGTGGTGCATCTGGGTTTCACCAGCTGC
AGAAGTTAATCCATTATTTATTCGGAGTGCAATCGCGAGCTGAGGAGGACTATCTCCGTCAAGTTTTCCAACAGTCCAGGAAAGGGAACATTAAGATGGCTGATTATTTG
CGGGTTATGAAGAATCACATGGATAATTTGGGACAAGCAGGAAGTCCGGTCACTACACGGTCCTTAATCTTACAAGTCCTTCTTGGTTTAGACGAGGAGTATAATCTTGT
CATTGCAATGGTACAAGGGAGAATTGGAATAACCTGGTCAAAAATCCAAGCAGAGTTGTTAGTTTTCGAGAAGAGGTTGGAAATGCAGAATACCCATAAGAGCTCGTTAT
CTTTTAGTCAGAATGCGTCAGTGAACTTAGCAAATAGTAAGGAAATTGGAAATCAGAGAAATCAACCAAACTCCTTCAACGGGCGTCAGAACAATTTTAACAGAGGAAAT
CAACGGGGAAATGGAAATGGGAGTCGAGGTAGAGGACGAGGTCGAAGGTATGGACAATACAATAATAACAAGCCGATATGTCAGATATGTGGGAAGATTGGTCACACTGC
TCTAATGTGTTACCAACGGTTCAATAAAGAGTATTCTGGACCTTCCCAAGGTCAAAACAAAGGAGATGGAAATGTAAGTCGCCCAAACAATCAGGTACCTTCGACCCAAA
CTACTGCTTTTGTTGCAAATCAGAATGTCAATTCTTTTGTAGCTTCCCCAGACACTGTTATCGACCTCAACTATTTGGAAAACGTTCTGTGTGTGCCTTCAATTGCAAAG
AATCTAGTCAGTATTTCCAAGTTATCGCGTGATAATGTATTTGTGGAGTTTCATGATACGTTTTGTCTTGTAAAGGCCAAGGATACAGGCAAGGCTGTATGGCATAGGCA
TTTAGGCCATCTGTCAGACCCTCAATATTCGACTAGCCCGGGACAACATGATGACCAAGGGCGAGGCCTCGACCAGGGAGCACACAAGGGGAAACAAACCAGGATGACAC
AAGGAGCTCATAGGTCTGGGCCGTGGGAGGTCTCACGAGATGGCACTAGACGGGCGGACCAAGACGCTAGAAAGAACCATCAGCAGTGGGTAAAGAAAACTCACCCACCG
GGAGACTACAGCCGGTATGTGCGAAAGCGAGGAGCGCTTTGTCCTGACACACACACCCAACCCACAAGGAGGACCAAGGCTTGCAGGCGGCAAGTCACCCACATGGACGA
CATCAAAACAATGGCCACCACTTCAAGGCGCCACACGGAGAGGAGTCATTCCACTATATCCAGGAACCATGAAGTGGGTTTCCCAAAGAACTCGATGATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGATGTTGGATCGCAACAATTTTCTTCTGTGGAAGAATTTGGCCCTTCCCATCTTGCGAAGCTACAAATTGGAAGGACATCTATCAGGTAATAGTCCTTGCCCTCCTCA
GTACATTCAACCCTCAGCTGTGGAATCAGTTCCGAATTCCACAAGAACAGAGGAAGGAGCAACCAGTGTTCAGACGGTTGCTAGTGGTGCATCTGGGTTTCACCAGCTGC
AGAAGTTAATCCATTATTTATTCGGAGTGCAATCGCGAGCTGAGGAGGACTATCTCCGTCAAGTTTTCCAACAGTCCAGGAAAGGGAACATTAAGATGGCTGATTATTTG
CGGGTTATGAAGAATCACATGGATAATTTGGGACAAGCAGGAAGTCCGGTCACTACACGGTCCTTAATCTTACAAGTCCTTCTTGGTTTAGACGAGGAGTATAATCTTGT
CATTGCAATGGTACAAGGGAGAATTGGAATAACCTGGTCAAAAATCCAAGCAGAGTTGTTAGTTTTCGAGAAGAGGTTGGAAATGCAGAATACCCATAAGAGCTCGTTAT
CTTTTAGTCAGAATGCGTCAGTGAACTTAGCAAATAGTAAGGAAATTGGAAATCAGAGAAATCAACCAAACTCCTTCAACGGGCGTCAGAACAATTTTAACAGAGGAAAT
CAACGGGGAAATGGAAATGGGAGTCGAGGTAGAGGACGAGGTCGAAGGTATGGACAATACAATAATAACAAGCCGATATGTCAGATATGTGGGAAGATTGGTCACACTGC
TCTAATGTGTTACCAACGGTTCAATAAAGAGTATTCTGGACCTTCCCAAGGTCAAAACAAAGGAGATGGAAATGTAAGTCGCCCAAACAATCAGGTACCTTCGACCCAAA
CTACTGCTTTTGTTGCAAATCAGAATGTCAATTCTTTTGTAGCTTCCCCAGACACTGTTATCGACCTCAACTATTTGGAAAACGTTCTGTGTGTGCCTTCAATTGCAAAG
AATCTAGTCAGTATTTCCAAGTTATCGCGTGATAATGTATTTGTGGAGTTTCATGATACGTTTTGTCTTGTAAAGGCCAAGGATACAGGCAAGGCTGTATGGCATAGGCA
TTTAGGCCATCTGTCAGACCCTCAATATTCGACTAGCCCGGGACAACATGATGACCAAGGGCGAGGCCTCGACCAGGGAGCACACAAGGGGAAACAAACCAGGATGACAC
AAGGAGCTCATAGGTCTGGGCCGTGGGAGGTCTCACGAGATGGCACTAGACGGGCGGACCAAGACGCTAGAAAGAACCATCAGCAGTGGGTAAAGAAAACTCACCCACCG
GGAGACTACAGCCGGTATGTGCGAAAGCGAGGAGCGCTTTGTCCTGACACACACACCCAACCCACAAGGAGGACCAAGGCTTGCAGGCGGCAAGTCACCCACATGGACGA
CATCAAAACAATGGCCACCACTTCAAGGCGCCACACGGAGAGGAGTCATTCCACTATATCCAGGAACCATGAAGTGGGTTTCCCAAAGAACTCGATGATGTAG
Protein sequenceShow/hide protein sequence
MMLDRNNFLLWKNLALPILRSYKLEGHLSGNSPCPPQYIQPSAVESVPNSTRTEEGATSVQTVASGASGFHQLQKLIHYLFGVQSRAEEDYLRQVFQQSRKGNIKMADYL
RVMKNHMDNLGQAGSPVTTRSLILQVLLGLDEEYNLVIAMVQGRIGITWSKIQAELLVFEKRLEMQNTHKSSLSFSQNASVNLANSKEIGNQRNQPNSFNGRQNNFNRGN
QRGNGNGSRGRGRGRRYGQYNNNKPICQICGKIGHTALMCYQRFNKEYSGPSQGQNKGDGNVSRPNNQVPSTQTTAFVANQNVNSFVASPDTVIDLNYLENVLCVPSIAK
NLVSISKLSRDNVFVEFHDTFCLVKAKDTGKAVWHRHLGHLSDPQYSTSPGQHDDQGRGLDQGAHKGKQTRMTQGAHRSGPWEVSRDGTRRADQDARKNHQQWVKKTHPP
GDYSRYVRKRGALCPDTHTQPTRRTKACRRQVTHMDDIKTMATTSRRHTERSHSTISRNHEVGFPKNSMM