; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036910 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036910
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr2:2047243..2050671
RNA-Seq ExpressionLag0036910
SyntenyLag0036910
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
GO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN74695.1 hypothetical protein VITISV_024648 [Vitis vinifera]1.1e-5940.98Show/hide
Query:  RAQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDG
        RA++ + K++L +T+KG LS+++Y  +I+  VD LA VG  +  +DHI  I  GL  +YE+ + ++ +  D  +V+++ A LL  E+RIE  +K +    
Subjt:  RAQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDG

Query:  TPPSANLMTQS-----HHTVESDSQKPN---PSYQGN------QNYNSRGRGRSNRGGRSWNNRNKPQYQICFKFGHTAIKCYSLGGRFDQG-RGVSQSN
        TP  A+L+T +     H    + ++  N   P++ GN       N+  +GRGR  RG  SW   NKPQ Q+C + GH  ++CY    RFDQ   G SQ  
Subjt:  TPPSANLMTQS-----HHTVESDSQKPN---PSYQGN------QNYNSRGRGRSNRGGRSWNNRNKPQYQICFKFGHTAIKCYSLGGRFDQG-RGVSQSN

Query:  NIAPN----NFHPQFGAGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSS---NHIFH
           P     + H Q     FP   S      T +E+ QDN WYP+ GAT+HLT N +NL   +++  ++++ VGNG GL I H G+TSFSSS   +    
Subjt:  NIAPN----NFHPQFGAGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSS---NHIFH

Query:  LKNLLHVPQITKNLISVSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFD---LSLP
        LK LLHVP+ITKNL+SVS+FA DN V+FEFHP  C VKD  T  +L+ G L  GLY FD   L LP
Subjt:  LKNLLHVPQITKNLISVSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFD---LSLP

KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.7e-7750.29Show/hide
Query:  AQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDGT
        AQ M+ K+KL + +KGS+ L EYF +I +CVDALA++ K V ++DHI+YIL GLGS+Y+SM+S ++  TD  SVQ+VM+ LLT E++ ESKL    S+  
Subjt:  AQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDGT

Query:  PPSANLMTQ-SHHTVESDSQKPNPSYQGNQNYN---SRGRGRSNRGGRSWNNRNKPQYQICFKFGHTAIKCYSLGGRFDQGRGVSQSNNIAPNNFHPQFG
         PS N++TQ +    ES  +    +Y  N +YN    RG GRSNRG R   NRNKPQ QIC K G++A +C      F +    S S+  +PN+ +  + 
Subjt:  PPSANLMTQ-SHHTVESDSQKPNPSYQGNQNYN---SRGRGRSNRGGRSWNNRNKPQYQICFKFGHTAIKCYSLGGRFDQGRGVSQSNNIAPNNFHPQFG

Query:  AGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSSN---HIFHLKNLLHVPQITKNLIS
          +  +N   M+AM+ A +LN D+ WYP+ GATNHLT++ SNL++G+EY G NQ+   NG+GL I+HYG  SF+SS      F L NLL VP ITKNLIS
Subjt:  AGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSSN---HIFHLKNLLHVPQITKNLIS

Query:  VSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFDL
        VSQFAKDN V+FEFHP  C VKD  TGQ+LL+G L+DGLY+F +
Subjt:  VSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFDL

KZV26181.1 hypothetical protein F511_06348 [Dorcoceras hygrometricum]2.9e-6341.56Show/hide
Query:  SFVFYNNSDYLSIDGFLNHGKSNCRRSWKFG-KSDRQSWNR----------AQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMY
        +FV +N  D L +  FL    S   +S   G ++  Q W R          A++M+ K +LQ+ +KG+LS+ +Y  ++K  +D LAA G ++  +D I++
Subjt:  SFVFYNNSDYLSIDGFLNHGKSNCRRSWKFG-KSDRQSWNR----------AQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMY

Query:  ILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDGT-PPSANLMTQSHHTVESDSQKPNPSYQGNQNYNSRGRGRSNRGGRS-WN
        IL G+G EYES+V  +T+  +  S+ +V A LL HE RIE+   N+    T  PS N+ T        ++ +  P Y+G      RGRGR+ RGGR  W+
Subjt:  ILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDGT-PPSANLMTQSHHTVESDSQKPNPSYQGNQNYNSRGRGRSNRGGRS-WN

Query:  NRNKPQYQICFKFGHTAIKCYSLGGRFD-----QGRGVSQSNNIAPNNFHPQFGAGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVG
        N  +P  QIC   GH A  CY    RFD     +  GVS+++    N   P +   +F           T SE   +  WYP+ GA++H+TN+  NL+V 
Subjt:  NRNKPQYQICFKFGHTAIKCYSLGGRFD-----QGRGVSQSNNIAPNNFHPQFGAGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVG

Query:  AEYSGANQMQVGNGTGLSISHYGYTSFS--SSNHIFHLKNLLHVPQITKNLISVSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFDL
        +EY+G +++QVGNG GLSIS+ G ++ +   S+  F LKNLLHVP ITKNLISVS+FA DN VYFEFHP FC VKD  T  +LLRG LH+GLYRF+L
Subjt:  AEYSGANQMQVGNGTGLSISHYGYTSFS--SSNHIFHLKNLLHVPQITKNLISVSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFDL

RVW80632.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.1e-5940.98Show/hide
Query:  RAQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDG
        RA++ + K++L +T+KG LS+++Y  +I+  VD LA VG  +  +DHI  I  GL  +YE+ + ++ +  D  +V+++   LL  E+RIE  +K +    
Subjt:  RAQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDG

Query:  TPPSANLMTQS-----HHTVESDSQKPN---PSYQGN------QNYNSRGRGRSNRGGRSWNNRNKPQYQICFKFGHTAIKCYSLGGRFDQG-RGVSQSN
        TP  A+L+T +     H    + ++  N   P++ GN       N+  +GRGR  RG  SW   NKPQ Q+C + GH  ++CY    RFDQ   G SQ  
Subjt:  TPPSANLMTQS-----HHTVESDSQKPN---PSYQGN------QNYNSRGRGRSNRGGRSWNNRNKPQYQICFKFGHTAIKCYSLGGRFDQG-RGVSQSN

Query:  NIAPN----NFHPQFGAGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSS---NHIFH
           P     + H Q     FP  SS      T +E+ QDN WYP+ GAT+HLT N +NL   +++  ++++ VGNG GL I H G+TSFSSS   +    
Subjt:  NIAPN----NFHPQFGAGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSS---NHIFH

Query:  LKNLLHVPQITKNLISVSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFD---LSLP
        LK LLHVP+ITKNL+SVS+FA DN V+FEFHP  C VKD  T  +L+ G L  GLY FD   L LP
Subjt:  LKNLLHVPQITKNLISVSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFD---LSLP

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.7e-7750.29Show/hide
Query:  AQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDGT
        AQ M+ K+KL + +KGS+ L EYF +I +CVDALA++ K V ++DHI+YIL GLGS+Y+SM+S ++  TD  SVQ+VM+ LLT E++ ESKL    S+  
Subjt:  AQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDGT

Query:  PPSANLMTQ-SHHTVESDSQKPNPSYQGNQNYN---SRGRGRSNRGGRSWNNRNKPQYQICFKFGHTAIKCYSLGGRFDQGRGVSQSNNIAPNNFHPQFG
         PS N++TQ +    ES  +    +Y  N +YN    RG GRSNRG R   NRNKPQ QIC K G++A +C      F +    S S+  +PN+ +  + 
Subjt:  PPSANLMTQ-SHHTVESDSQKPNPSYQGNQNYN---SRGRGRSNRGGRSWNNRNKPQYQICFKFGHTAIKCYSLGGRFDQGRGVSQSNNIAPNNFHPQFG

Query:  AGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSSN---HIFHLKNLLHVPQITKNLIS
          +  +N   M+AM+ A +LN D+ WYP+ GATNHLT++ SNL++G+EY G NQ+   NG+GL I+HYG  SF+SS      F L NLL VP ITKNLIS
Subjt:  AGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSSN---HIFHLKNLLHVPQITKNLIS

Query:  VSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFDL
        VSQFAKDN V+FEFHP  C VKD  TGQ+LL+G L+DGLY+F +
Subjt:  VSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFDL

TrEMBL top hitse value%identityAlignment
A0A2Z7AWA7 Integrase catalytic domain-containing protein1.4e-6341.56Show/hide
Query:  SFVFYNNSDYLSIDGFLNHGKSNCRRSWKFG-KSDRQSWNR----------AQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMY
        +FV +N  D L +  FL    S   +S   G ++  Q W R          A++M+ K +LQ+ +KG+LS+ +Y  ++K  +D LAA G ++  +D I++
Subjt:  SFVFYNNSDYLSIDGFLNHGKSNCRRSWKFG-KSDRQSWNR----------AQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMY

Query:  ILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDGT-PPSANLMTQSHHTVESDSQKPNPSYQGNQNYNSRGRGRSNRGGRS-WN
        IL G+G EYES+V  +T+  +  S+ +V A LL HE RIE+   N+    T  PS N+ T        ++ +  P Y+G      RGRGR+ RGGR  W+
Subjt:  ILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDGT-PPSANLMTQSHHTVESDSQKPNPSYQGNQNYNSRGRGRSNRGGRS-WN

Query:  NRNKPQYQICFKFGHTAIKCYSLGGRFD-----QGRGVSQSNNIAPNNFHPQFGAGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVG
        N  +P  QIC   GH A  CY    RFD     +  GVS+++    N   P +   +F           T SE   +  WYP+ GA++H+TN+  NL+V 
Subjt:  NRNKPQYQICFKFGHTAIKCYSLGGRFD-----QGRGVSQSNNIAPNNFHPQFGAGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVG

Query:  AEYSGANQMQVGNGTGLSISHYGYTSFS--SSNHIFHLKNLLHVPQITKNLISVSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFDL
        +EY+G +++QVGNG GLSIS+ G ++ +   S+  F LKNLLHVP ITKNLISVS+FA DN VYFEFHP FC VKD  T  +LLRG LH+GLYRF+L
Subjt:  AEYSGANQMQVGNGTGLSISHYGYTSFS--SSNHIFHLKNLLHVPQITKNLISVSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFDL

A0A438H844 Retrovirus-related Pol polyprotein from transposon RE15.5e-6040.98Show/hide
Query:  RAQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDG
        RA++ + K++L +T+KG LS+++Y  +I+  VD LA VG  +  +DHI  I  GL  +YE+ + ++ +  D  +V+++   LL  E+RIE  +K +    
Subjt:  RAQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDG

Query:  TPPSANLMTQS-----HHTVESDSQKPN---PSYQGN------QNYNSRGRGRSNRGGRSWNNRNKPQYQICFKFGHTAIKCYSLGGRFDQG-RGVSQSN
        TP  A+L+T +     H    + ++  N   P++ GN       N+  +GRGR  RG  SW   NKPQ Q+C + GH  ++CY    RFDQ   G SQ  
Subjt:  TPPSANLMTQS-----HHTVESDSQKPN---PSYQGN------QNYNSRGRGRSNRGGRSWNNRNKPQYQICFKFGHTAIKCYSLGGRFDQG-RGVSQSN

Query:  NIAPN----NFHPQFGAGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSS---NHIFH
           P     + H Q     FP  SS      T +E+ QDN WYP+ GAT+HLT N +NL   +++  ++++ VGNG GL I H G+TSFSSS   +    
Subjt:  NIAPN----NFHPQFGAGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSS---NHIFH

Query:  LKNLLHVPQITKNLISVSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFD---LSLP
        LK LLHVP+ITKNL+SVS+FA DN V+FEFHP  C VKD  T  +L+ G L  GLY FD   L LP
Subjt:  LKNLLHVPQITKNLISVSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFD---LSLP

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-7750.29Show/hide
Query:  AQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDGT
        AQ M+ K+KL + +KGS+ L EYF +I +CVDALA++ K V ++DHI+YIL GLGS+Y+SM+S ++  TD  SVQ+VM+ LLT E++ ESKL    S+  
Subjt:  AQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDGT

Query:  PPSANLMTQ-SHHTVESDSQKPNPSYQGNQNYN---SRGRGRSNRGGRSWNNRNKPQYQICFKFGHTAIKCYSLGGRFDQGRGVSQSNNIAPNNFHPQFG
         PS N++TQ +    ES  +    +Y  N +YN    RG GRSNRG R   NRNKPQ QIC K G++A +C      F +    S S+  +PN+ +  + 
Subjt:  PPSANLMTQ-SHHTVESDSQKPNPSYQGNQNYN---SRGRGRSNRGGRSWNNRNKPQYQICFKFGHTAIKCYSLGGRFDQGRGVSQSNNIAPNNFHPQFG

Query:  AGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSSN---HIFHLKNLLHVPQITKNLIS
          +  +N   M+AM+ A +LN D+ WYP+ GATNHLT++ SNL++G+EY G NQ+   NG+GL I+HYG  SF+SS      F L NLL VP ITKNLIS
Subjt:  AGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSSN---HIFHLKNLLHVPQITKNLIS

Query:  VSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFDL
        VSQFAKDN V+FEFHP  C VKD  TGQ+LL+G L+DGLY+F +
Subjt:  VSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFDL

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-7750.29Show/hide
Query:  AQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDGT
        AQ M+ K+KL + +KGS+ L EYF +I +CVDALA++ K V ++DHI+YIL GLGS+Y+SM+S ++  TD  SVQ+VM+ LLT E++ ESKL    S+  
Subjt:  AQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDGT

Query:  PPSANLMTQ-SHHTVESDSQKPNPSYQGNQNYN---SRGRGRSNRGGRSWNNRNKPQYQICFKFGHTAIKCYSLGGRFDQGRGVSQSNNIAPNNFHPQFG
         PS N++TQ +    ES  +    +Y  N +YN    RG GRSNRG R   NRNKPQ QIC K G++A +C      F +    S S+  +PN+ +  + 
Subjt:  PPSANLMTQ-SHHTVESDSQKPNPSYQGNQNYN---SRGRGRSNRGGRSWNNRNKPQYQICFKFGHTAIKCYSLGGRFDQGRGVSQSNNIAPNNFHPQFG

Query:  AGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSSN---HIFHLKNLLHVPQITKNLIS
          +  +N   M+AM+ A +LN D+ WYP+ GATNHLT++ SNL++G+EY G NQ+   NG+GL I+HYG  SF+SS      F L NLL VP ITKNLIS
Subjt:  AGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSSN---HIFHLKNLLHVPQITKNLIS

Query:  VSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFDL
        VSQFAKDN V+FEFHP  C VKD  TGQ+LL+G L+DGLY+F +
Subjt:  VSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFDL

A5BK17 Integrase catalytic domain-containing protein5.5e-6040.98Show/hide
Query:  RAQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDG
        RA++ + K++L +T+KG LS+++Y  +I+  VD LA VG  +  +DHI  I  GL  +YE+ + ++ +  D  +V+++ A LL  E+RIE  +K +    
Subjt:  RAQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDG

Query:  TPPSANLMTQS-----HHTVESDSQKPN---PSYQGN------QNYNSRGRGRSNRGGRSWNNRNKPQYQICFKFGHTAIKCYSLGGRFDQG-RGVSQSN
        TP  A+L+T +     H    + ++  N   P++ GN       N+  +GRGR  RG  SW   NKPQ Q+C + GH  ++CY    RFDQ   G SQ  
Subjt:  TPPSANLMTQS-----HHTVESDSQKPN---PSYQGN------QNYNSRGRGRSNRGGRSWNNRNKPQYQICFKFGHTAIKCYSLGGRFDQG-RGVSQSN

Query:  NIAPN----NFHPQFGAGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSS---NHIFH
           P     + H Q     FP   S      T +E+ QDN WYP+ GAT+HLT N +NL   +++  ++++ VGNG GL I H G+TSFSSS   +    
Subjt:  NIAPN----NFHPQFGAGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSS---NHIFH

Query:  LKNLLHVPQITKNLISVSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFD---LSLP
        LK LLHVP+ITKNL+SVS+FA DN V+FEFHP  C VKD  T  +L+ G L  GLY FD   L LP
Subjt:  LKNLLHVPQITKNLISVSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFD---LSLP

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-2828.36Show/hide
Query:  IMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDGTPP
        + +L+++L+   KG+ ++++Y   +    D LA +GK +D ++ +  +L  L  EY+ ++  +       ++ ++   LL HE++I   L   ++   P 
Subjt:  IMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNSDGTPP

Query:  SANLMTQSHHTVESDSQKPNPSYQ---GNQNYNSRGRGRSNRGGRSWNNRNKP---QYQICFKFGHTAIKCYSLGGRFDQGRGVSQSNNIAPNNFHPQFG
        +AN ++  + T  +++   N + +    N N NS+   +S+      NN++KP   + QIC   GH+A +C  L             +   P        
Subjt:  SANLMTQSHHTVESDSQKPNPSYQ---GNQNYNSRGRGRSNRGGRSWNNRNKP---QYQICFKFGHTAIKCYSLGGRFDQGRGVSQSNNIAPNNFHPQFG

Query:  AGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSSNHIFHLKNLLHVPQITKNLISVSQ
         GS P++S               N W  + GAT+H+T++F+NL++   Y+G + + V +G+ + ISH G TS S+ +   +L N+L+VP I KNLISV +
Subjt:  AGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSSNHIFHLKNLLHVPQITKNLISVSQ

Query:  FAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFDLS
            N V  EF P    VKD  TG  LL+G   D LY + ++
Subjt:  FAKDNLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFDLS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.6e-2429.81Show/hide
Query:  DALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNS-DGTPPSANLMT-QSHHTVESDSQKPNPSYQGNQ
        D LA +GK +D ++ +  +L  L  +Y+ ++  +       S+ ++   L+   NR ESKL  +NS +  P +AN++T ++ +T  + + + +     N 
Subjt:  DALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYLLTHENRIESKLKNVNS-DGTPPSANLMT-QSHHTVESDSQKPNPSYQGNQ

Query:  NYNSRGRGRSNRGGRSWNNRNKP---QYQICFKFGHTAIKCYSLGGRFDQGRGVSQSNNIAPNNFHPQFGAGSFPHNSSPMTAMLTASELNQDNGWYPNL
        N  S     S+ G RS N + KP   + QIC   GH+A +C  L  +F       QS +               P       A L  +     N W  + 
Subjt:  NYNSRGRGRSNRGGRSWNNRNKP---QYQICFKFGHTAIKCYSLGGRFDQGRGVSQSNNIAPNNFHPQFGAGSFPHNSSPMTAMLTASELNQDNGWYPNL

Query:  GATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSSNHIFHLKNLLHVPQITKNLISVSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRG
        GAT+H+T++F+NL+    Y+G + + + +G+ + I+H G  S  +S+    L  +L+VP I KNLISV +    N V  EF P    VKD  TG  LL+G
Subjt:  GATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSSNHIFHLKNLLHVPQITKNLISVSQFAKDNLVYFEFHPEFCCVKDSRTGQILLRG

Query:  ALHDGLYRFDLS
           D LY + ++
Subjt:  ALHDGLYRFDLS

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)7.0e-0726.6Show/hide
Query:  SNCRRSWKFGKSDRQSWNRAQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYL
        S  R  W   K+  ++   A+ +RL S+L++   G + + +Y+ ++KK  D+L  V   V   + +MY+LNGL  +++++++ +       S  D    L
Subjt:  SNCRRSWKFGKSDRQSWNRAQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAYL

Query:  LTHENRIESKLKNVNSDGTPPSANLMTQSHHTVESDSQKP---NPSYQGNQNYNSRGRGRSN-----RGGR-------SWNNRNKPQY
           E+R++  +K       P   ++   S  TV + S+ P   N    G      RGRGR N     RGGR       ++N+ N+P +
Subjt:  LTHENRIESKLKNVNSDGTPPSANLMTQSHHTVESDSQKP---NPSYQGNQNYNSRGRGRSN-----RGGR-------SWNNRNKPQY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCGGCCAGCGCCTCGTTTCGGCCCGTGGTTCCCCAGATCACCCCGGTTCCGCCTGGTTCGTCCCGAAACACCTCCGAATTCCTAAAAACTTTAGGAGAAAAACAGG
CGTCGGAGGCGGTGTGGCCTACACCACACCGGTGTCCAACGATTTTTACTGGTTTTGTAGGTCACTGTCTTCCCCAGCTTCTACAAATTCACTGTCGGTGTCACATGAAG
GTCAGAGCTCTCAAGTCCAACGACTGAACTCGATTCCTGAAATTTGTGCGGTGTTCTTTTTTTTTTTTGTTTCTTTTGCATCATTCGTCTTCTACAACAACTCTGATTAT
CTGAGCATCGATGGATTCCTGAACCACGGAAAATCAAACTGCCGAAGATCTTGGAAGTTCGGCAAGTCAGATCGTCAATCCTGGAACAGAGCTCAGATAATGAGATTGAA
ATCGAAGCTACAATCTACTCAAAAAGGATCGTTGTCATTGAATGAGTATTTTGCACAAATCAAAAAGTGCGTAGATGCACTTGCTGCTGTAGGCAAGACTGTTGACACAG
AGGATCACATTATGTACATTCTTAATGGCTTAGGTTCAGAGTATGAATCTATGGTCTCTGCACTAACGACTTCCACAGATGATCAAAGTGTGCAAGATGTCATGGCTTAC
CTTCTCACTCATGAGAATCGAATAGAGAGCAAGCTGAAGAATGTGAACTCAGATGGAACTCCTCCTTCAGCCAATCTAATGACTCAAAGTCATCACACTGTGGAATCTGA
CTCTCAAAAGCCGAATCCTTCGTATCAGGGAAATCAGAATTATAACTCTCGAGGTCGAGGCCGTTCGAATCGGGGAGGAAGATCTTGGAATAACAGAAACAAGCCCCAGT
ATCAAATTTGTTTCAAATTTGGTCATACGGCGATAAAGTGCTACTCTCTCGGTGGTCGTTTTGATCAAGGTCGAGGTGTTTCTCAGTCGAATAATATTGCTCCTAATAAC
TTTCACCCTCAGTTTGGTGCAGGTTCATTCCCACACAACTCCTCACCGATGACTGCTATGCTGACTGCAAGCGAACTCAACCAAGATAATGGTTGGTATCCAAATTTGGG
AGCCACCAATCATCTCACTAACAACTTCAGCAATCTTGCAGTTGGTGCTGAATATTCAGGGGCTAATCAGATGCAAGTAGGAAATGGTACGGGTCTCTCTATCTCCCACT
ATGGTTATACATCCTTTTCCTCATCCAATCACATATTTCATTTAAAGAACTTACTTCATGTGCCTCAAATAACCAAGAACTTGATAAGCGTCAGTCAGTTCGCTAAAGAT
AACTTGGTATACTTTGAATTTCACCCTGAATTTTGTTGTGTGAAGGACTCCCGTACTGGCCAAATCCTCCTTCGAGGAGCACTCCATGATGGGTTGTACCGGTTTGATCT
CTCACTACCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTCGGCCAGCGCCTCGTTTCGGCCCGTGGTTCCCCAGATCACCCCGGTTCCGCCTGGTTCGTCCCGAAACACCTCCGAATTCCTAAAAACTTTAGGAGAAAAACAGG
CGTCGGAGGCGGTGTGGCCTACACCACACCGGTGTCCAACGATTTTTACTGGTTTTGTAGGTCACTGTCTTCCCCAGCTTCTACAAATTCACTGTCGGTGTCACATGAAG
GTCAGAGCTCTCAAGTCCAACGACTGAACTCGATTCCTGAAATTTGTGCGGTGTTCTTTTTTTTTTTTGTTTCTTTTGCATCATTCGTCTTCTACAACAACTCTGATTAT
CTGAGCATCGATGGATTCCTGAACCACGGAAAATCAAACTGCCGAAGATCTTGGAAGTTCGGCAAGTCAGATCGTCAATCCTGGAACAGAGCTCAGATAATGAGATTGAA
ATCGAAGCTACAATCTACTCAAAAAGGATCGTTGTCATTGAATGAGTATTTTGCACAAATCAAAAAGTGCGTAGATGCACTTGCTGCTGTAGGCAAGACTGTTGACACAG
AGGATCACATTATGTACATTCTTAATGGCTTAGGTTCAGAGTATGAATCTATGGTCTCTGCACTAACGACTTCCACAGATGATCAAAGTGTGCAAGATGTCATGGCTTAC
CTTCTCACTCATGAGAATCGAATAGAGAGCAAGCTGAAGAATGTGAACTCAGATGGAACTCCTCCTTCAGCCAATCTAATGACTCAAAGTCATCACACTGTGGAATCTGA
CTCTCAAAAGCCGAATCCTTCGTATCAGGGAAATCAGAATTATAACTCTCGAGGTCGAGGCCGTTCGAATCGGGGAGGAAGATCTTGGAATAACAGAAACAAGCCCCAGT
ATCAAATTTGTTTCAAATTTGGTCATACGGCGATAAAGTGCTACTCTCTCGGTGGTCGTTTTGATCAAGGTCGAGGTGTTTCTCAGTCGAATAATATTGCTCCTAATAAC
TTTCACCCTCAGTTTGGTGCAGGTTCATTCCCACACAACTCCTCACCGATGACTGCTATGCTGACTGCAAGCGAACTCAACCAAGATAATGGTTGGTATCCAAATTTGGG
AGCCACCAATCATCTCACTAACAACTTCAGCAATCTTGCAGTTGGTGCTGAATATTCAGGGGCTAATCAGATGCAAGTAGGAAATGGTACGGGTCTCTCTATCTCCCACT
ATGGTTATACATCCTTTTCCTCATCCAATCACATATTTCATTTAAAGAACTTACTTCATGTGCCTCAAATAACCAAGAACTTGATAAGCGTCAGTCAGTTCGCTAAAGAT
AACTTGGTATACTTTGAATTTCACCCTGAATTTTGTTGTGTGAAGGACTCCCGTACTGGCCAAATCCTCCTTCGAGGAGCACTCCATGATGGGTTGTACCGGTTTGATCT
CTCACTACCTTAG
Protein sequenceShow/hide protein sequence
MVGQRLVSARGSPDHPGSAWFVPKHLRIPKNFRRKTGVGGGVAYTTPVSNDFYWFCRSLSSPASTNSLSVSHEGQSSQVQRLNSIPEICAVFFFFFVSFASFVFYNNSDY
LSIDGFLNHGKSNCRRSWKFGKSDRQSWNRAQIMRLKSKLQSTQKGSLSLNEYFAQIKKCVDALAAVGKTVDTEDHIMYILNGLGSEYESMVSALTTSTDDQSVQDVMAY
LLTHENRIESKLKNVNSDGTPPSANLMTQSHHTVESDSQKPNPSYQGNQNYNSRGRGRSNRGGRSWNNRNKPQYQICFKFGHTAIKCYSLGGRFDQGRGVSQSNNIAPNN
FHPQFGAGSFPHNSSPMTAMLTASELNQDNGWYPNLGATNHLTNNFSNLAVGAEYSGANQMQVGNGTGLSISHYGYTSFSSSNHIFHLKNLLHVPQITKNLISVSQFAKD
NLVYFEFHPEFCCVKDSRTGQILLRGALHDGLYRFDLSLP