CuGenDBv2

Lag0008191 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0008191
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr9:14352657..14355458
RNA-Seq Expression: Lag0008191
Synteny: Lag0008191
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR012337 - Ribonuclease H-like superfamily
                  IPR025724 - GAG-pre-integrase domain
                  IPR036397 - Ribonuclease H superfamily
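
The genome location string above can be split into a sequence name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); `Location` and `parse_location` are hypothetical helper names, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both end points count toward the span.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as 'chr9:14352657..14355458'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match.group(1), start, end)


loc = parse_location("chr9:14352657..14355458")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 2802 bp on chr9; a 0-based, half-open convention would give 2801 instead.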


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e-value: 2.1e-69; identity: 32.79%)
Query:  MFHYTLAKEIWTCLTQIFTTHNLAQIMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALAAVDKPCT----------------LNLLNLI-----------L
        M H   AKEIW  L  IF++  LAQ M+ K KL  I+KG M LKEYF KI Q VDALA+++KP +                 +++++I           +
Subjt:  MFHYTLAKEIWTCLTQIFTTHNLAQIMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALAAVDKPCT----------------LNLLNLI-----------L

Query:  LRLILINFPQVLVMEIGEV----------------------------------DRGGRG-GPSNRGGRGGHSWNNRNRVQCQLCTKFGHTTAKCFFRYAP
        + L+L    Q     I E                                    RGGRG G SNRG RG     NRN+ QCQ+C K G++  +CFFRY P
Subjt:  LRLILINFPQVLVMEIGEV----------------------------------DRGGRG-GPSNRGGRGGHSWNNRNRVQCQLCTKFGHTTAKCFFRYAP

Query:  R--FAPMDPGSFSSTYNQFNRFPTFPLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHVENGT----------------------
        R   +   P S +++Y   N  P    M  MVA  +LN D+NWYPDSGATNHLTHS +NLS+GSEYG GNQ++  NG+                      
Subjt:  R--FAPMDPGSFSSTYNQFNRFPTFPLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHVENGT----------------------

Query:  -------------------------------------------------------------------------------------------DVWHRRLGH
                                                                                                   D+WHRRLGH
Subjt:  -------------------------------------------------------------------------------------------DVWHRRLGH

Query:  PTLSTVKTVL-RLYKPTLSINNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD---------------------
        P L  VK VL  +   + +IN  N  FC AC++GKHH LPFS+S+T+Y+ PLQL+  DLWGP+   S NG+RYYISFVD                     
Subjt:  PTLSTVKTVL-RLYKPTLSINNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD---------------------

Query:  -------------------------------------------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVFNH
                                                         QN I E KHR+I++ GL L+SQ ++PL +WDEAF+T+V+LINRLPT V ++
Subjt:  -------------------------------------------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVFNH

Query:  VSPLEKLFQTKPD
        +SPLEKLF  KP+
Subjt:  VSPLEKLFQTKPD

KZV26181.1 hypothetical protein F511_06348 [Dorcoceras hygrometricum] (e-value: 4.0e-49; identity: 26.67%)
Query:  AKEIWTCLTQIFTTHNLAQIMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALAAVDK--PCTLNLLNLI-----------------LLRLILINFPQVLVM
        + ++WT +TQ+F T + A++M+ K +LQT++KG +S+K+Y  K++ Y+D LAA     P    +L+++                 +  L L     +L+ 
Subjt:  AKEIWTCLTQIFTTHNLAQIMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALAAVDK--PCTLNLLNLI-----------------LLRLILINFPQVLVM

Query:  EIGEVD----RGG----------------------------RGGPSNRGGRGGHS-WNNRNRVQCQLCTKFGHTTAKCFFRYAPRFAPMDPGSFSSTYNQ
          G ++     GG                            RG    R GRGG   W+N  R  CQ+C   GH    C++R+   F P   G   ++  Q
Subjt:  EIGEVD----RGG----------------------------RGGPSNRGGRGGHS-WNNRNRVQCQLCTKFGHTTAKCFFRYAPRFAPMDPGSFSSTYNQ

Query:  FNR-FPTFPLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHVENGT---------------------------------------
        FNR  P++P             +  WYPDSGA++H+T+   NLSV SEY  G++V V NG                                        
Subjt:  FNR-FPTFPLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHVENGT---------------------------------------

Query:  -------------------------------------------------------------------------------DVWHRRLGHPTLSTVKTVLRL
                                                                                       D WH RLGHP+++TVK VL  
Subjt:  -------------------------------------------------------------------------------DVWHRRLGHPTLSTVKTVLRL

Query:  YKPTLSINNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD----------------------------------
            +S  N N  FC++C +GK+H LPF  S T +SAP ++V SDLWGP++I S+NG RYYISFVD                                  
Subjt:  YKPTLSINNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD----------------------------------

Query:  ------------------------------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVFNHVSPLEKLFQTKPD
                                            QNG+ E KHRH+VDTGL+L++  S+P E+W++AF +AV+LINRLP+      SP   L+  +PD
Subjt:  ------------------------------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVFNHVSPLEKLFQTKPD

RVW41150.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e-value: 3.0e-52; identity: 33.2%)
Query:  AKEIWTCLTQIFTTHNLAQIMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALAAVD---------------------------------------------
        A +IW  L   F +    +I + KT LQ  +K  +S+ EY  KI+ +VD LA V                                              
Subjt:  AKEIWTCLTQIFTTHNLAQIMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALAAVD---------------------------------------------

Query:  KPCTLNLLNLILLRLILI---------NFPQVLVMEIGEVDRGGRGGPSNRGGRGGHSWNNR-----NRVQCQLCTKFGHTTAKCFFRYAPRF-APMDPG
        +   +   N  LL+L+L+         NF +      G     GRGG +N G R  ++WNN       +  CQ+C K GH+  +C++RY P F  P  PG
Subjt:  KPCTLNLLNLILLRLILI---------NFPQVLVMEIGEVDRGGRGGPSNRGGRGGHSWNNR-----NRVQCQLCTKFGHTTAKCFFRYAPRF-APMDPG

Query:  ---SFSSTYNQFNRFPTFPLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHV---------------------------------
           SF  +     +    P M V++ATP    D NWYPDSGA+NH+T + NNL   + Y    QV V                                 
Subjt:  ---SFSSTYNQFNRFPTFPLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHV---------------------------------

Query:  --------------------------------ENGTD---VWHRRLGHPTLSTVKTVLRLYKPTLSINNS---NFHFCNACSMGKHHNLPFSYSVTVYSA
                                        +NG+    +WH RLGHP+   V+TV+ L K  LS  N    NF  C AC +GK H LPF  S++ Y  
Subjt:  --------------------------------ENGTD---VWHRRLGHPTLSTVKTVLRLYKPTLSINNS---NFHFCNACSMGKHHNLPFSYSVTVYSA

Query:  PLQLVVSDLWGPSYISSKNGYRYYISFVD------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVFNHVS
        PLQLV SDLWGPS + S NGY+YY+ FVD            QNG+AE KHRHIV+ GL L+++ SMP +YWDE+F T VFL NRLP+ V +H S
Subjt:  PLQLVVSDLWGPSYISSKNGYRYYISFVD------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVFNHVS

RVX14937.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e-value: 2.4e-46; identity: 32.28%)
Query:  GEVDRG---GRGGPSNRGGRGGHS----WNNRNRVQ---CQLCTKFGHTTAKCFFRYAPRFAPMDPGSFSSTYNQFNRFPTF-PLMFVMVATPNLNQDNN
        G  +RG   GRGG   RG RG       WN+ N+ +   CQLC K GH  A+C++R+   F    P + S        + +F P +  ++ T  +  D+N
Subjt:  GEVDRG---GRGGPSNRGGRGGHS----WNNRNRVQ---CQLCTKFGHTTAKCFFRYAPRFAPMDPGSFSSTYNQFNRFPTF-PLMFVMVATPNLNQDNN

Query:  WYPDSGATNHLTHSFNNLSVGSEYGSGNQVHVENGT-------------------------------------------------------DVWHRRLGH
        WYPDSGA+NH+T +  NL    E+   NQVHV NGT                                                       D+WH+RLG 
Subjt:  WYPDSGATNHLTHSFNNLSVGSEYGSGNQVHVENGT-------------------------------------------------------DVWHRRLGH

Query:  PTLSTVKTVLRLYKPTLSINNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD----------------------
        P+ +T+K VL        IN  + +FC++C +GK H  PFS S T Y+ PL+L+ SDLWGP+ + S +GYRYYI FVD                      
Subjt:  PTLSTVKTVLRLYKPTLSINNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD----------------------

Query:  ------------------------------------------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVFNHV
                                                        QNG+AE KHR IV+ GL L+   S+PL++WDE+F T V+L NRLPT V +H 
Subjt:  ------------------------------------------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVFNHV

Query:  SPLEKLFQTKPD
         P+E LF++ PD
Subjt:  SPLEKLFQTKPD

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e-value: 2.1e-69; identity: 32.79%)
Query:  MFHYTLAKEIWTCLTQIFTTHNLAQIMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALAAVDKPCT----------------LNLLNLI-----------L
        M H   AKEIW  L  IF++  LAQ M+ K KL  I+KG M LKEYF KI Q VDALA+++KP +                 +++++I           +
Subjt:  MFHYTLAKEIWTCLTQIFTTHNLAQIMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALAAVDKPCT----------------LNLLNLI-----------L

Query:  LRLILINFPQVLVMEIGEV----------------------------------DRGGRG-GPSNRGGRGGHSWNNRNRVQCQLCTKFGHTTAKCFFRYAP
        + L+L    Q     I E                                    RGGRG G SNRG RG     NRN+ QCQ+C K G++  +CFFRY P
Subjt:  LRLILINFPQVLVMEIGEV----------------------------------DRGGRG-GPSNRGGRGGHSWNNRNRVQCQLCTKFGHTTAKCFFRYAP

Query:  R--FAPMDPGSFSSTYNQFNRFPTFPLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHVENGT----------------------
        R   +   P S +++Y   N  P    M  MVA  +LN D+NWYPDSGATNHLTHS +NLS+GSEYG GNQ++  NG+                      
Subjt:  R--FAPMDPGSFSSTYNQFNRFPTFPLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHVENGT----------------------

Query:  -------------------------------------------------------------------------------------------DVWHRRLGH
                                                                                                   D+WHRRLGH
Subjt:  -------------------------------------------------------------------------------------------DVWHRRLGH

Query:  PTLSTVKTVL-RLYKPTLSINNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD---------------------
        P L  VK VL  +   + +IN  N  FC AC++GKHH LPFS+S+T+Y+ PLQL+  DLWGP+   S NG+RYYISFVD                     
Subjt:  PTLSTVKTVL-RLYKPTLSINNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD---------------------

Query:  -------------------------------------------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVFNH
                                                         QN I E KHR+I++ GL L+SQ ++PL +WDEAF+T+V+LINRLPT V ++
Subjt:  -------------------------------------------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVFNH

Query:  VSPLEKLFQTKPD
        +SPLEKLF  KP+
Subjt:  VSPLEKLFQTKPD

TrEMBL top hits (e-value, % identity, alignment)
A0A2Z7AWA7 Integrase catalytic domain-containing protein (e-value: 2.0e-49; identity: 26.67%)
Query:  AKEIWTCLTQIFTTHNLAQIMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALAAVDK--PCTLNLLNLI-----------------LLRLILINFPQVLVM
        + ++WT +TQ+F T + A++M+ K +LQT++KG +S+K+Y  K++ Y+D LAA     P    +L+++                 +  L L     +L+ 
Subjt:  AKEIWTCLTQIFTTHNLAQIMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALAAVDK--PCTLNLLNLI-----------------LLRLILINFPQVLVM

Query:  EIGEVD----RGG----------------------------RGGPSNRGGRGGHS-WNNRNRVQCQLCTKFGHTTAKCFFRYAPRFAPMDPGSFSSTYNQ
          G ++     GG                            RG    R GRGG   W+N  R  CQ+C   GH    C++R+   F P   G   ++  Q
Subjt:  EIGEVD----RGG----------------------------RGGPSNRGGRGGHS-WNNRNRVQCQLCTKFGHTTAKCFFRYAPRFAPMDPGSFSSTYNQ

Query:  FNR-FPTFPLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHVENGT---------------------------------------
        FNR  P++P             +  WYPDSGA++H+T+   NLSV SEY  G++V V NG                                        
Subjt:  FNR-FPTFPLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHVENGT---------------------------------------

Query:  -------------------------------------------------------------------------------DVWHRRLGHPTLSTVKTVLRL
                                                                                       D WH RLGHP+++TVK VL  
Subjt:  -------------------------------------------------------------------------------DVWHRRLGHPTLSTVKTVLRL

Query:  YKPTLSINNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD----------------------------------
            +S  N N  FC++C +GK+H LPF  S T +SAP ++V SDLWGP++I S+NG RYYISFVD                                  
Subjt:  YKPTLSINNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD----------------------------------

Query:  ------------------------------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVFNHVSPLEKLFQTKPD
                                            QNG+ E KHRH+VDTGL+L++  S+P E+W++AF +AV+LINRLP+      SP   L+  +PD
Subjt:  ------------------------------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVFNHVSPLEKLFQTKPD

A0A438E0F0 Retrovirus-related Pol polyprotein from transposon RE1 (e-value: 1.4e-52; identity: 33.2%)
Query:  AKEIWTCLTQIFTTHNLAQIMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALAAVD---------------------------------------------
        A +IW  L   F +    +I + KT LQ  +K  +S+ EY  KI+ +VD LA V                                              
Subjt:  AKEIWTCLTQIFTTHNLAQIMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALAAVD---------------------------------------------

Query:  KPCTLNLLNLILLRLILI---------NFPQVLVMEIGEVDRGGRGGPSNRGGRGGHSWNNR-----NRVQCQLCTKFGHTTAKCFFRYAPRF-APMDPG
        +   +   N  LL+L+L+         NF +      G     GRGG +N G R  ++WNN       +  CQ+C K GH+  +C++RY P F  P  PG
Subjt:  KPCTLNLLNLILLRLILI---------NFPQVLVMEIGEVDRGGRGGPSNRGGRGGHSWNNR-----NRVQCQLCTKFGHTTAKCFFRYAPRF-APMDPG

Query:  ---SFSSTYNQFNRFPTFPLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHV---------------------------------
           SF  +     +    P M V++ATP    D NWYPDSGA+NH+T + NNL   + Y    QV V                                 
Subjt:  ---SFSSTYNQFNRFPTFPLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHV---------------------------------

Query:  --------------------------------ENGTD---VWHRRLGHPTLSTVKTVLRLYKPTLSINNS---NFHFCNACSMGKHHNLPFSYSVTVYSA
                                        +NG+    +WH RLGHP+   V+TV+ L K  LS  N    NF  C AC +GK H LPF  S++ Y  
Subjt:  --------------------------------ENGTD---VWHRRLGHPTLSTVKTVLRLYKPTLSINNS---NFHFCNACSMGKHHNLPFSYSVTVYSA

Query:  PLQLVVSDLWGPSYISSKNGYRYYISFVD------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVFNHVS
        PLQLV SDLWGPS + S NGY+YY+ FVD            QNG+AE KHRHIV+ GL L+++ SMP +YWDE+F T VFL NRLP+ V +H S
Subjt:  PLQLVVSDLWGPSYISSKNGYRYYISFVD------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVFNHVS

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 1.0e-69; identity: 32.79%)
Query:  MFHYTLAKEIWTCLTQIFTTHNLAQIMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALAAVDKPCT----------------LNLLNLI-----------L
        M H   AKEIW  L  IF++  LAQ M+ K KL  I+KG M LKEYF KI Q VDALA+++KP +                 +++++I           +
Subjt:  MFHYTLAKEIWTCLTQIFTTHNLAQIMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALAAVDKPCT----------------LNLLNLI-----------L

Query:  LRLILINFPQVLVMEIGEV----------------------------------DRGGRG-GPSNRGGRGGHSWNNRNRVQCQLCTKFGHTTAKCFFRYAP
        + L+L    Q     I E                                    RGGRG G SNRG RG     NRN+ QCQ+C K G++  +CFFRY P
Subjt:  LRLILINFPQVLVMEIGEV----------------------------------DRGGRG-GPSNRGGRGGHSWNNRNRVQCQLCTKFGHTTAKCFFRYAP

Query:  R--FAPMDPGSFSSTYNQFNRFPTFPLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHVENGT----------------------
        R   +   P S +++Y   N  P    M  MVA  +LN D+NWYPDSGATNHLTHS +NLS+GSEYG GNQ++  NG+                      
Subjt:  R--FAPMDPGSFSSTYNQFNRFPTFPLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHVENGT----------------------

Query:  -------------------------------------------------------------------------------------------DVWHRRLGH
                                                                                                   D+WHRRLGH
Subjt:  -------------------------------------------------------------------------------------------DVWHRRLGH

Query:  PTLSTVKTVL-RLYKPTLSINNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD---------------------
        P L  VK VL  +   + +IN  N  FC AC++GKHH LPFS+S+T+Y+ PLQL+  DLWGP+   S NG+RYYISFVD                     
Subjt:  PTLSTVKTVL-RLYKPTLSINNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD---------------------

Query:  -------------------------------------------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVFNH
                                                         QN I E KHR+I++ GL L+SQ ++PL +WDEAF+T+V+LINRLPT V ++
Subjt:  -------------------------------------------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVFNH

Query:  VSPLEKLFQTKPD
        +SPLEKLF  KP+
Subjt:  VSPLEKLFQTKPD

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 1.0e-69; identity: 32.79%)
Query:  MFHYTLAKEIWTCLTQIFTTHNLAQIMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALAAVDKPCT----------------LNLLNLI-----------L
        M H   AKEIW  L  IF++  LAQ M+ K KL  I+KG M LKEYF KI Q VDALA+++KP +                 +++++I           +
Subjt:  MFHYTLAKEIWTCLTQIFTTHNLAQIMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALAAVDKPCT----------------LNLLNLI-----------L

Query:  LRLILINFPQVLVMEIGEV----------------------------------DRGGRG-GPSNRGGRGGHSWNNRNRVQCQLCTKFGHTTAKCFFRYAP
        + L+L    Q     I E                                    RGGRG G SNRG RG     NRN+ QCQ+C K G++  +CFFRY P
Subjt:  LRLILINFPQVLVMEIGEV----------------------------------DRGGRG-GPSNRGGRGGHSWNNRNRVQCQLCTKFGHTTAKCFFRYAP

Query:  R--FAPMDPGSFSSTYNQFNRFPTFPLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHVENGT----------------------
        R   +   P S +++Y   N  P    M  MVA  +LN D+NWYPDSGATNHLTHS +NLS+GSEYG GNQ++  NG+                      
Subjt:  R--FAPMDPGSFSSTYNQFNRFPTFPLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHVENGT----------------------

Query:  -------------------------------------------------------------------------------------------DVWHRRLGH
                                                                                                   D+WHRRLGH
Subjt:  -------------------------------------------------------------------------------------------DVWHRRLGH

Query:  PTLSTVKTVL-RLYKPTLSINNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD---------------------
        P L  VK VL  +   + +IN  N  FC AC++GKHH LPFS+S+T+Y+ PLQL+  DLWGP+   S NG+RYYISFVD                     
Subjt:  PTLSTVKTVL-RLYKPTLSINNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD---------------------

Query:  -------------------------------------------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVFNH
                                                         QN I E KHR+I++ GL L+SQ ++PL +WDEAF+T+V+LINRLPT V ++
Subjt:  -------------------------------------------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVFNH

Query:  VSPLEKLFQTKPD
        +SPLEKLF  KP+
Subjt:  VSPLEKLFQTKPD

A0A803Q9W1 Uncharacterized protein (e-value: 1.5e-49; identity: 31.65%)
Query:  AKEIWTCLTQIFTTHNLAQIMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALAAV-------------------------------DKPCTLNLLNLILLR
        A++IW+ L + FT    ++I++ +TKLQ ++KG +SL +Y  K++Q VD LA+V                                +  T+  +  +LL 
Subjt:  AKEIWTCLTQIFTTHNLAQIMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALAAV-------------------------------DKPCTLNLLNLILLR

Query:  L----------------ILINFPQVLVMEIGEVDRG-----GRG---------------------GPSNRGGRGG------------HSWNNRNRVQCQL
                         ++ N P  L ME     R      GRG                     G +N GGRG                N  N+VQCQL
Subjt:  L----------------ILINFPQVLVMEIGEVDRG-----GRG---------------------GPSNRGGRGG------------HSWNNRNRVQCQL

Query:  CTKFGHTTAKCFFRYAPRFAPMDPGSFSSTYNQFNRFPTFPLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHVENGTDVWHRRL
        C + GHT   CF+R+   F+ +  GS S +        +       + T + + D++WYPDSGATNH T    NL+   +Y   +Q+            L
Subjt:  CTKFGHTTAKCFFRYAPRFAPMDPGSFSSTYNQFNRFPTFPLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHVENGTDVWHRRL

Query:  GHPTLSTVKTVLRLYKPTLSINNSNFHFCNACSMGKHHNLPF-SYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD-------------------
        GHP+   V+TVL+         +  F  C AC +GK H  PF   S TV S PLQLVVSDLWGPS+  S NGY+YYI FVD                   
Subjt:  GHPTLSTVKTVLRLYKPTLSINNSNFHFCNACSMGKHHNLPF-SYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD-------------------

Query:  ---------------------------------------------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVF
                                                           QNG+AE KHRHIV+ GLAL++Q S+PL++WDEAF TAV+L NRLPT + 
Subjt:  ---------------------------------------------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVF

Query:  NHVSPLEKLFQTKPD
           SPLE LF TKPD
Subjt:  NHVSPLEKLFQTKPD

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein (e-value: 2.4e-04; identity: 22.01%)
Query:  LTHSFNNLSVGSEYGSGNQVHV------------ENGTDVWHRRLGHPTLSTVKTVLR--LYKPTLSINNSNF--HFCNACSMGKHHNLPFSY--SVTVY
        +T S N L V    G  N V V            +N   +WH R GH +   +  + R  ++     +NN       C  C  GK   LPF      T  
Subjt:  LTHSFNNLSVGSEYGSGNQVHV------------ENGTDVWHRRLGHPTLSTVKTVLR--LYKPTLSINNSNF--HFCNACSMGKHHNLPFSY--SVTVY

Query:  SAPLQLVVSDLWGPSYISSKNGYRYYISFVDQ--------------------------------------------------------------------
          PL +V SD+ GP    + +   Y++ FVDQ                                                                    
Subjt:  SAPLQLVVSDLWGPSYISSKNGYRYYISFVDQ--------------------------------------------------------------------

Query:  ----NGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPT--IVFNHVSPLEKLFQTKP
            NG++E   R I +    ++S   +   +W EA  TA +LINR+P+  +V +  +P E     KP
Subjt:  ----NGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPT--IVFNHVSPLEKLFQTKP

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 5.7e-06; identity: 31%)
Query:  GSGNQVHVENGTDVWHRRLGHPTLSTVKTVLRLYKPTLS-INNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD
        G  N    E   D+WH+R+GH +   ++ + +  K  +S    +    C+ C  GK H + F  S       L LV SD+ GP  I S  G +Y+++F+D
Subjt:  GSGNQVHVENGTDVWHRRLGHPTLSTVKTVLRLYKPTLS-INNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 3.5e-03; identity: 39.13%)
Query:  NGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTI
        NG+AE  +R IV+   +++    +P  +W EA  TA +LINR P++
Subjt:  NGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e-value: 1.0e-23; identity: 31.96%)
Query:  WHRRLGHPTLSTVKTVLRLYKPTLSINNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD---------------
        WH RLGHP  S + +V+  Y  ++   +  F  C+ C + K + +PFS S    + PL+ + SD+W  S I S + YRYY+ FVD               
Subjt:  WHRRLGHPTLSTVKTVLRLYKPTLSINNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD---------------

Query:  -------------------------------------------------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLP
                                                                NG++E KHRHIV+TGL L+S  S+P  YW  AFA AV+LINRLP
Subjt:  -------------------------------------------------------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLP

Query:  TIVFNHVSPLEKLFQTKPD
        T +    SP +KLF T P+
Subjt:  TIVFNHVSPLEKLFQTKPD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e-value: 2.7e-24; identity: 22.34%)
Query:  TLAKEIWTCLTQIFTTHNLAQIMKIK--TKLQTIQKGG--MSLKEYFSKI--------QQYVDALAAVDKPCTLNLLNLILL----RLILINFPQVLVME
        T A +IW  L +I+   +   + +++  T+   +   G  M   E   ++        +  +D +AA D P +L  ++  L+    +L+ +N  +V+ + 
Subjt:  TLAKEIWTCLTQIFTTHNLAQIMKIK--TKLQTIQKGG--MSLKEYFSKI--------QQYVDALAAVDKPCTLNLLNLILL----RLILINFPQVLVME

Query:  IGEVDRGGRGGPSNRGGRG-GHSWNNRNR-----------------------VQCQLCTKFGHTTAKCFFRYAPRFAPMDPGSFSSTYNQFNRFPTF---
           V         N+  RG   ++NN N                         +CQ+C+  GH+  +C     P+        F ST NQ      F   
Subjt:  IGEVDRGGRGGPSNRGGRG-GHSWNNRNR-----------------------VQCQLCTKFGHTTAKCFFRYAPRFAPMDPGSFSSTYNQFNRFPTF---

Query:  -PLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHVENGTDV--------------------------------------------
         P   + V +P     NNW  DSGAT+H+T  FNNLS    Y  G+ V + +G+ +                                            
Subjt:  -PLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHVENGTDV--------------------------------------------

Query:  ------------------------------------------------------WHRRLGHPTLSTVKTVLRLYKPTLSINNSNFHFCNACSMGKHHNLP
                                                              WH RLGHP+L+ + +V+  +   +   +     C+ C + K H +P
Subjt:  ------------------------------------------------------WHRRLGHPTLSTVKTVLRLYKPTLSINNSNFHFCNACSMGKHHNLP

Query:  FSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD-------------------------------------------------------------
        FS S    S PL+ + SD+W    +S  N YRYY+ FVD                                                             
Subjt:  FSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVD-------------------------------------------------------------

Query:  ---------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVFNHVSPLEKLFQTKPD
                  NG++E KHRHIV+ GL L+S  S+P  YW  AF+ AV+LINRLPT +    SP +KLF   P+
Subjt:  ---------QNGIAEHKHRHIVDTGLALMSQTSMPLEYWDEAFATAVFLINRLPTIVFNHVSPLEKLFQTKPD

Arabidopsis top hits (e-value, % identity, alignment)
ATMG00300.1 Gag-Pol-related retrotransposon family protein (e-value: 9.4e-04; identity: 34.12%)
Query:  EYGSGNQVH-VENGTDVWHRRLGHPTLSTVKTVLRLYKPTL-SINNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWG
        E G  N     ++ T +WH RL H +   ++ +++  K  L S   S+  FC  C  GK H + FS        PL  V SDLWG
Subjt:  EYGSGNQVH-VENGTDVWHRRLGHPTLSTVKTVLRLYKPTL-SINNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWG


Sequences
CDS sequence
ATGTTTCACTACACCTTGGCTAAGGAGATTTGGACATGTTTAACTCAAATATTTACCACTCACAACTTGGCGCAAATAATGAAAATCAAGACCAAGTTACAGACGATCCA
AAAAGGAGGTATGTCCCTCAAAGAATACTTTTCTAAAATCCAACAATATGTGGATGCCCTGGCTGCGGTTGATAAACCTTGCACTCTAAACCTGTTGAATCTGATACTTC
TAAGGTTAATACTAATCAATTTTCCTCAAGTCCTGGTAATGGAAATAGGGGAGGTCGATCGTGGCGGTCGTGGTGGTCCATCCAATCGGGGTGGTCGTGGAGGCCATTCA
TGGAATAATCGGAACAGAGTTCAGTGCCAACTCTGTACTAAATTTGGCCATACTACTGCAAAATGTTTTTTCCGATATGCTCCTCGATTTGCTCCAATGGATCCAGGTTC
GTTCTCTTCTACTTACAACCAATTTAACCGCTTTCCAACTTTCCCACTGATGTTTGTCATGGTTGCTACTCCAAACCTTAATCAGGACAACAATTGGTATCCTGATTCTG
GTGCCACAAACCACTTGACCCATAGCTTCAATAACCTTTCTGTCGGGTCTGAATATGGCAGTGGAAATCAAGTTCATGTCGAGAATGGAACAGATGTTTGGCATAGACGT
CTAGGTCATCCCACTCTATCTACTGTGAAAACTGTTCTTCGGTTGTACAAGCCTACTCTGTCTATAAATAATAGTAATTTTCATTTTTGCAATGCATGTTCTATGGGAAA
GCATCACAATCTTCCCTTCTCTTATTCTGTTACTGTGTACTCTGCCCCTTTACAACTCGTTGTTTCTGATTTATGGGGCCCATCTTATATATCTTCAAAGAATGGTTATC
GGTATTATATTAGTTTTGTTGATCAAAACGGCATAGCAGAGCACAAGCATCGACACATAGTTGATACTGGGCTTGCTCTCATGTCTCAAACTTCCATGCCTTTAGAATAC
TGGGATGAGGCGTTTGCAACGGCTGTATTCCTCATTAATAGATTGCCAACGATTGTTTTTAATCATGTTAGTCCCTTGGAGAAATTGTTTCAAACTAAACCTGACTAG
mRNA sequence
ATGTTTCACTACACCTTGGCTAAGGAGATTTGGACATGTTTAACTCAAATATTTACCACTCACAACTTGGCGCAAATAATGAAAATCAAGACCAAGTTACAGACGATCCA
AAAAGGAGGTATGTCCCTCAAAGAATACTTTTCTAAAATCCAACAATATGTGGATGCCCTGGCTGCGGTTGATAAACCTTGCACTCTAAACCTGTTGAATCTGATACTTC
TAAGGTTAATACTAATCAATTTTCCTCAAGTCCTGGTAATGGAAATAGGGGAGGTCGATCGTGGCGGTCGTGGTGGTCCATCCAATCGGGGTGGTCGTGGAGGCCATTCA
TGGAATAATCGGAACAGAGTTCAGTGCCAACTCTGTACTAAATTTGGCCATACTACTGCAAAATGTTTTTTCCGATATGCTCCTCGATTTGCTCCAATGGATCCAGGTTC
GTTCTCTTCTACTTACAACCAATTTAACCGCTTTCCAACTTTCCCACTGATGTTTGTCATGGTTGCTACTCCAAACCTTAATCAGGACAACAATTGGTATCCTGATTCTG
GTGCCACAAACCACTTGACCCATAGCTTCAATAACCTTTCTGTCGGGTCTGAATATGGCAGTGGAAATCAAGTTCATGTCGAGAATGGAACAGATGTTTGGCATAGACGT
CTAGGTCATCCCACTCTATCTACTGTGAAAACTGTTCTTCGGTTGTACAAGCCTACTCTGTCTATAAATAATAGTAATTTTCATTTTTGCAATGCATGTTCTATGGGAAA
GCATCACAATCTTCCCTTCTCTTATTCTGTTACTGTGTACTCTGCCCCTTTACAACTCGTTGTTTCTGATTTATGGGGCCCATCTTATATATCTTCAAAGAATGGTTATC
GGTATTATATTAGTTTTGTTGATCAAAACGGCATAGCAGAGCACAAGCATCGACACATAGTTGATACTGGGCTTGCTCTCATGTCTCAAACTTCCATGCCTTTAGAATAC
TGGGATGAGGCGTTTGCAACGGCTGTATTCCTCATTAATAGATTGCCAACGATTGTTTTTAATCATGTTAGTCCCTTGGAGAAATTGTTTCAAACTAAACCTGACTAG
Protein sequence
MFHYTLAKEIWTCLTQIFTTHNLAQIMKIKTKLQTIQKGGMSLKEYFSKIQQYVDALAAVDKPCTLNLLNLILLRLILINFPQVLVMEIGEVDRGGRGGPSNRGGRGGHS
WNNRNRVQCQLCTKFGHTTAKCFFRYAPRFAPMDPGSFSSTYNQFNRFPTFPLMFVMVATPNLNQDNNWYPDSGATNHLTHSFNNLSVGSEYGSGNQVHVENGTDVWHRR
LGHPTLSTVKTVLRLYKPTLSINNSNFHFCNACSMGKHHNLPFSYSVTVYSAPLQLVVSDLWGPSYISSKNGYRYYISFVDQNGIAEHKHRHIVDTGLALMSQTSMPLEY
WDEAFATAVFLINRLPTIVFNHVSPLEKLFQTKPD
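
The protein sequence above should follow from the CDS sequence by codon-wise translation. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper written for illustration, and only the first and last few codons of the CDS are used here rather than the full record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in
# T, C, A, G order so that they line up with the amino-acid string below.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped records
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 13 codons and last 8 codons of the CDS, copied from the record.
cds_start = "ATGTTTCACTACACCTTGGCTAAGGAGATTTGGACATGT"
cds_end = "TTGTTTCAAACTAAACCTGACTAG"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MFHYTLAKEIWTC and LFQTKPD, which match the beginning and end of the protein sequence listed above. Passing the full wrapped CDS to `translate` works the same way, since whitespace is removed before translation.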