; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001595 (gene) of Snake gourd v1 genome

Gene IDTan0001595
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationLG09:61003036..61004591
RNA-Seq ExpressionTan0001595
SyntenyTan0001595
Gene Ontology termsNA
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN80930.1 hypothetical protein VITISV_005279 [Vitis vinifera]5.3e-10544.44Show/hide
Query:  LWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRD
        LW  V +  +P PLG N T+ Q++ +EEEKLKK KA++ +H+ L+D IF +I++ +T KQ WDKL  EFEGS RVK V+LLTLKREFE++KM+D   V+D
Subjt:  LWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRD

Query:  YTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGK-------KPVAED
        Y+ ++M +VNQ+RL GE F DQ+VV+ I+VSV  KFE+KISAIEES DL TL+I EL SKL AQEQR  MR +E  EGAF A  KGK       K   ++
Subjt:  YTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGK-------KPVAED

Query:  DRRETKDQGSKGK----------GGASKKGW----------------SYREVCYAKKTQTQHAQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWIIDS
         R + +    KGK            A K  W                   + C AKK Q+Q   EQ  +    + ++   LFMAS   + ++  +W+IDS
Subjt:  DRRETKDQGSKGK----------GGASKKGW----------------SYREVCYAKKTQTQHAQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWIIDS

Query:  GCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMV
        GCT+HM K + +F+ ID+S+Q KV LG+GE V A+GKGT  + TK+G K  +NVLY+P L Q LLS+AQ+L N + + FK+  C I D  G EIA + M 
Subjt:  GCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMV

Query:  GNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLPFPKGGSFRAKDKLELVHSDVWVPCKIHL
        GN+F+L  D +  H  + K++++   H R+GH+N K L+ +    MV+D    +   Q CESC+ G   R PFP+  S RA  KLEL+HSD+  P  +  
Subjt:  GNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLPFPKGGSFRAKDKLELVHSDVWVPCKIHL

Query:  LGIINILFFLLMI
          + N ++F L I
Subjt:  LGIINILFFLLMI

RVW14603.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]2.0e-10444.05Show/hide
Query:  LWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRD
        LW  V +  +P PLG N T+ Q++ +EEEKLKK KA++ +H+ L++ IF +I++ +T KQ WDKL  EFEG+ RVK V+LLTLKREFE++KM+D   V+D
Subjt:  LWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRD

Query:  YTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGK-------KPVAED
        Y+ ++M +VNQ+RL GE F DQ+VV+ I+VSV  KFE+KISAIEES DL TL+I EL SKL AQEQR  MR +E +EGAF A  KGK       K   ++
Subjt:  YTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGK-------KPVAED

Query:  DRRETKDQGSKGK----------GGASKKGW----------------SYREVCYAKKTQTQHAQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWIIDS
         R + +    KGK            A K  W                   + C AKK Q+Q   EQ  +      ++   LFMAS   + ++  +W+IDS
Subjt:  DRRETKDQGSKGK----------GGASKKGW----------------SYREVCYAKKTQTQHAQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWIIDS

Query:  GCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMV
        GCT+HM K + +F+ ID+S+Q KV LG+GE V A+GKGT  + TK+G K ++NVLY+P L Q LLS+AQ+L N + V FK+  C I D  G +IA + M 
Subjt:  GCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMV

Query:  GNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLPFPKGGSFRAKDKLELVHSDVWVPCKIHL
        GN+F+L  D +  H  + K++++   H R+GH+N K L+ +    MV+D    +   Q CESC+ G   R PFP+  S RA  KLEL+HSD+  P  +  
Subjt:  GNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLPFPKGGSFRAKDKLELVHSDVWVPCKIHL

Query:  LGIINILFFLLMI
          + N ++F L I
Subjt:  LGIINILFFLLMI

RVW33963.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.5e-10444.44Show/hide
Query:  LWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRD
        LW  V +  +P PLG N T+ Q++ +EEEKLKK KA++ +H+ L+D IF +I++ +T KQ WDKL  EFEGS RVK V+LLTLKREFE++KM+D   V+D
Subjt:  LWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRD

Query:  YTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGK-------KPVAED
        Y+ ++M +VNQ+ L GE F DQ+VV+NI+VSV  KFE+KISAIEES DL TL+I EL SKL AQEQR  MR +E  EGAF A  KGK       K   ++
Subjt:  YTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGK-------KPVAED

Query:  DRRETKDQGSKGK----------GGASKKGW----------------SYREVCYAKKTQTQHAQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWIIDS
         R + +    KGK            A K  W                   + C AKK Q+Q   EQ  +    + ++   LFMAS   + ++  +W+IDS
Subjt:  DRRETKDQGSKGK----------GGASKKGW----------------SYREVCYAKKTQTQHAQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWIIDS

Query:  GCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMV
        GCT+HM K + +F+ ID+S+Q KV LG+GE V A+GKGT  + TK+G K ++NVLY+P L Q LLS+AQ+L N + V FK+  C IYD  G EIA + M 
Subjt:  GCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMV

Query:  GNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLPFPKGGSFRAKDKLELVHSDVWVPCKIHL
        GN+F+L  D +  H  + K++++   H R+G++N K L+ +    MV+D    +   Q CESC+     R PFP+  S RA  KLEL+HSD+  P  +  
Subjt:  GNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLPFPKGGSFRAKDKLELVHSDVWVPCKIHL

Query:  LGIINILFFLLMI
          + N ++F L I
Subjt:  LGIINILFFLLMI

RVW63791.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.4e-10544.64Show/hide
Query:  LWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRD
        LW  V +  +P PLG N T+ Q++ +EEEKLKK KA++ +H+ L+D IF +I++ +T KQ WDKL  EFEGS RVK V+LLTLKREFE++KM+D   V+D
Subjt:  LWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRD

Query:  YTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGK-------KPVAED
        Y+ ++M +VNQ+RL GE F DQ+VV+ I+VSV  KFE+KISAIEES DL TL+I EL SKL AQEQR  MR +E  EGAF A  KGK       K   ++
Subjt:  YTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGK-------KPVAED

Query:  DRRETKDQGSKGK----------GGASKKGW----------------SYREVCYAKKTQTQHAQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWIIDS
         R + +    KGK            A K  W                   + C AKK Q+Q   EQ  +    + ++   LFMAS   + ++  +W+IDS
Subjt:  DRRETKDQGSKGK----------GGASKKGW----------------SYREVCYAKKTQTQHAQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWIIDS

Query:  GCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMV
        GCT+HM K + +F+ ID+S+Q KV LG+GE V A+GKGT  + TK+G K ++NVLY+P L Q LLS+AQ+L N + V FK+  C I D  G EIA + M 
Subjt:  GCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMV

Query:  GNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLPFPKGGSFRAKDKLELVHSDVWVPCKIHL
        GN+F+L  D +  H  + K++++   H R+GH+N K L+ +    MV+D    +   Q CESC+ G   R PFP+  S RA  KLEL+HSD+  P  +  
Subjt:  GNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLPFPKGGSFRAKDKLELVHSDVWVPCKIHL

Query:  LGIINILFFLLMI
          + N ++F L I
Subjt:  LGIINILFFLLMI

RVW70519.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]9.0e-10544.44Show/hide
Query:  LWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRD
        LW  V +  +P PLG N T+ Q++ +EEEKLKK KA++ +H+ L+D IF +I++ +T KQ WDKL  EFEGS RVK V+LLTLKREFE++KM+D   V+ 
Subjt:  LWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRD

Query:  YTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGK-------KPVAED
        Y+ ++M +VNQ+RL GE F DQ+VV+ I+VSV  KFE+KISAIEES DL TL+I EL SKL AQEQR  MR +E  EGAF A  KGK       K   ++
Subjt:  YTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGK-------KPVAED

Query:  DRRETKDQGSKGK----------GGASKKGW----------------SYREVCYAKKTQTQHAQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWIIDS
         R + +    KGK            A K  W                   + C AKK Q+Q   EQ  +    + ++   LFMAS   + ++  +W+IDS
Subjt:  DRRETKDQGSKGK----------GGASKKGW----------------SYREVCYAKKTQTQHAQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWIIDS

Query:  GCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMV
        GCT+HM K + +F+ ID+S+Q KV LG+GE V A+GKGT  + TK+G K ++NVLY+P L Q LLS+AQ+L N + V FK+  C I D  G EIA + M 
Subjt:  GCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMV

Query:  GNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLPFPKGGSFRAKDKLELVHSDVWVPCKIHL
        GN+F+L  D +  H  + K++++   H R+GH+N K L+ +    MV+D    +   Q CESC+ G   R PFP+  S RA  KLEL+HSD+  P  +  
Subjt:  GNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLPFPKGGSFRAKDKLELVHSDVWVPCKIHL

Query:  LGIINILFFLLMI
          + N ++F L I
Subjt:  LGIINILFFLLMI

TrEMBL top hitse value%identityAlignment
A0A438BUF2 Retrovirus-related Pol polyprotein from transposon RE19.7e-10544.05Show/hide
Query:  LWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRD
        LW  V +  +P PLG N T+ Q++ +EEEKLKK KA++ +H+ L++ IF +I++ +T KQ WDKL  EFEG+ RVK V+LLTLKREFE++KM+D   V+D
Subjt:  LWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRD

Query:  YTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGK-------KPVAED
        Y+ ++M +VNQ+RL GE F DQ+VV+ I+VSV  KFE+KISAIEES DL TL+I EL SKL AQEQR  MR +E +EGAF A  KGK       K   ++
Subjt:  YTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGK-------KPVAED

Query:  DRRETKDQGSKGK----------GGASKKGW----------------SYREVCYAKKTQTQHAQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWIIDS
         R + +    KGK            A K  W                   + C AKK Q+Q   EQ  +      ++   LFMAS   + ++  +W+IDS
Subjt:  DRRETKDQGSKGK----------GGASKKGW----------------SYREVCYAKKTQTQHAQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWIIDS

Query:  GCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMV
        GCT+HM K + +F+ ID+S+Q KV LG+GE V A+GKGT  + TK+G K ++NVLY+P L Q LLS+AQ+L N + V FK+  C I D  G +IA + M 
Subjt:  GCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMV

Query:  GNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLPFPKGGSFRAKDKLELVHSDVWVPCKIHL
        GN+F+L  D +  H  + K++++   H R+GH+N K L+ +    MV+D    +   Q CESC+ G   R PFP+  S RA  KLEL+HSD+  P  +  
Subjt:  GNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLPFPKGGSFRAKDKLELVHSDVWVPCKIHL

Query:  LGIINILFFLLMI
          + N ++F L I
Subjt:  LGIINILFFLLMI

A0A438DEP9 Retrovirus-related Pol polyprotein from transposon TNT 1-947.4e-10544.44Show/hide
Query:  LWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRD
        LW  V +  +P PLG N T+ Q++ +EEEKLKK KA++ +H+ L+D IF +I++ +T KQ WDKL  EFEGS RVK V+LLTLKREFE++KM+D   V+D
Subjt:  LWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRD

Query:  YTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGK-------KPVAED
        Y+ ++M +VNQ+ L GE F DQ+VV+NI+VSV  KFE+KISAIEES DL TL+I EL SKL AQEQR  MR +E  EGAF A  KGK       K   ++
Subjt:  YTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGK-------KPVAED

Query:  DRRETKDQGSKGK----------GGASKKGW----------------SYREVCYAKKTQTQHAQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWIIDS
         R + +    KGK            A K  W                   + C AKK Q+Q   EQ  +    + ++   LFMAS   + ++  +W+IDS
Subjt:  DRRETKDQGSKGK----------GGASKKGW----------------SYREVCYAKKTQTQHAQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWIIDS

Query:  GCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMV
        GCT+HM K + +F+ ID+S+Q KV LG+GE V A+GKGT  + TK+G K ++NVLY+P L Q LLS+AQ+L N + V FK+  C IYD  G EIA + M 
Subjt:  GCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMV

Query:  GNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLPFPKGGSFRAKDKLELVHSDVWVPCKIHL
        GN+F+L  D +  H  + K++++   H R+G++N K L+ +    MV+D    +   Q CESC+     R PFP+  S RA  KLEL+HSD+  P  +  
Subjt:  GNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLPFPKGGSFRAKDKLELVHSDVWVPCKIHL

Query:  LGIINILFFLLMI
          + N ++F L I
Subjt:  LGIINILFFLLMI

A0A438FV11 Retrovirus-related Pol polyprotein from transposon RE16.7e-10644.64Show/hide
Query:  LWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRD
        LW  V +  +P PLG N T+ Q++ +EEEKLKK KA++ +H+ L+D IF +I++ +T KQ WDKL  EFEGS RVK V+LLTLKREFE++KM+D   V+D
Subjt:  LWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRD

Query:  YTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGK-------KPVAED
        Y+ ++M +VNQ+RL GE F DQ+VV+ I+VSV  KFE+KISAIEES DL TL+I EL SKL AQEQR  MR +E  EGAF A  KGK       K   ++
Subjt:  YTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGK-------KPVAED

Query:  DRRETKDQGSKGK----------GGASKKGW----------------SYREVCYAKKTQTQHAQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWIIDS
         R + +    KGK            A K  W                   + C AKK Q+Q   EQ  +    + ++   LFMAS   + ++  +W+IDS
Subjt:  DRRETKDQGSKGK----------GGASKKGW----------------SYREVCYAKKTQTQHAQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWIIDS

Query:  GCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMV
        GCT+HM K + +F+ ID+S+Q KV LG+GE V A+GKGT  + TK+G K ++NVLY+P L Q LLS+AQ+L N + V FK+  C I D  G EIA + M 
Subjt:  GCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMV

Query:  GNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLPFPKGGSFRAKDKLELVHSDVWVPCKIHL
        GN+F+L  D +  H  + K++++   H R+GH+N K L+ +    MV+D    +   Q CESC+ G   R PFP+  S RA  KLEL+HSD+  P  +  
Subjt:  GNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLPFPKGGSFRAKDKLELVHSDVWVPCKIHL

Query:  LGIINILFFLLMI
          + N ++F L I
Subjt:  LGIINILFFLLMI

A0A438GE89 Retrovirus-related Pol polyprotein from transposon RE14.3e-10544.44Show/hide
Query:  LWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRD
        LW  V +  +P PLG N T+ Q++ +EEEKLKK KA++ +H+ L+D IF +I++ +T KQ WDKL  EFEGS RVK V+LLTLKREFE++KM+D   V+ 
Subjt:  LWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRD

Query:  YTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGK-------KPVAED
        Y+ ++M +VNQ+RL GE F DQ+VV+ I+VSV  KFE+KISAIEES DL TL+I EL SKL AQEQR  MR +E  EGAF A  KGK       K   ++
Subjt:  YTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGK-------KPVAED

Query:  DRRETKDQGSKGK----------GGASKKGW----------------SYREVCYAKKTQTQHAQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWIIDS
         R + +    KGK            A K  W                   + C AKK Q+Q   EQ  +    + ++   LFMAS   + ++  +W+IDS
Subjt:  DRRETKDQGSKGK----------GGASKKGW----------------SYREVCYAKKTQTQHAQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWIIDS

Query:  GCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMV
        GCT+HM K + +F+ ID+S+Q KV LG+GE V A+GKGT  + TK+G K ++NVLY+P L Q LLS+AQ+L N + V FK+  C I D  G EIA + M 
Subjt:  GCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMV

Query:  GNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLPFPKGGSFRAKDKLELVHSDVWVPCKIHL
        GN+F+L  D +  H  + K++++   H R+GH+N K L+ +    MV+D    +   Q CESC+ G   R PFP+  S RA  KLEL+HSD+  P  +  
Subjt:  GNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLPFPKGGSFRAKDKLELVHSDVWVPCKIHL

Query:  LGIINILFFLLMI
          + N ++F L I
Subjt:  LGIINILFFLLMI

A5B9M8 Integrase catalytic domain-containing protein2.5e-10544.44Show/hide
Query:  LWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRD
        LW  V +  +P PLG N T+ Q++ +EEEKLKK KA++ +H+ L+D IF +I++ +T KQ WDKL  EFEGS RVK V+LLTLKREFE++KM+D   V+D
Subjt:  LWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRD

Query:  YTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGK-------KPVAED
        Y+ ++M +VNQ+RL GE F DQ+VV+ I+VSV  KFE+KISAIEES DL TL+I EL SKL AQEQR  MR +E  EGAF A  KGK       K   ++
Subjt:  YTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGK-------KPVAED

Query:  DRRETKDQGSKGK----------GGASKKGW----------------SYREVCYAKKTQTQHAQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWIIDS
         R + +    KGK            A K  W                   + C AKK Q+Q   EQ  +    + ++   LFMAS   + ++  +W+IDS
Subjt:  DRRETKDQGSKGK----------GGASKKGW----------------SYREVCYAKKTQTQHAQEQ--ANCAHNNHETNFLFMASHT-NDNKSTSWIIDS

Query:  GCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMV
        GCT+HM K + +F+ ID+S+Q KV LG+GE V A+GKGT  + TK+G K  +NVLY+P L Q LLS+AQ+L N + + FK+  C I D  G EIA + M 
Subjt:  GCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMV

Query:  GNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLPFPKGGSFRAKDKLELVHSDVWVPCKIHL
        GN+F+L  D +  H  + K++++   H R+GH+N K L+ +    MV+D    +   Q CESC+ G   R PFP+  S RA  KLEL+HSD+  P  +  
Subjt:  GNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLPFPKGGSFRAKDKLELVHSDVWVPCKIHL

Query:  LGIINILFFLLMI
          + N ++F L I
Subjt:  LGIINILFFLLMI

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.6e-0921.51Show/hide
Query:  ALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRDYTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNK
        A S I   LSD          T +Q  + L   +E   R  +   L L++    LK+     +  +      +++++  +G    +   + ++++++ + 
Subjt:  ALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRDYTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNK

Query:  FESKISAIEESSDLTTLSIAELISKLQAQE------QRDTMR----------NEEHVEGAFHAK-SKGKKPVAEDDRRETKDQGSKGKGGASKKGWSYRE
        ++  I+AIE  S+   L++A + ++L  QE        DT +          N  +    F  + +K KK    + + + K      +G   K  + Y+ 
Subjt:  FESKISAIEESSDLTTLSIAELISKLQAQE------QRDTMR----------NEEHVEGAFHAK-SKGKKPVAEDDRRETKDQGSKGKGGASKKGWSYRE

Query:  VCYAKKTQTQHAQEQANCAHNNHETNFLFMASHTNDNK---STSWIIDSGCTTHMAKDIDLFS-QIDKSIQYKV-VLGHGETVLAEGKGTAIMHTKQGEK
        +   K  + +  Q Q   +H        FM    N+     +  +++DSG + H+  D  L++  ++     K+ V   GE + A  +G   +     E 
Subjt:  VCYAKKTQTQHAQEQANCAHNNHETNFLFMASHTNDNK---STSWIIDSGCTTHMAKDIDLFS-QIDKSIQYKV-VLGHGETVLAEGKGTAIMHTKQGEK

Query:  KISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMVGNAFFLNGDSLCHHALNVKVEDN-KNGHHRFGHYNNKPLKILHSTNMVD
         + +VL+    +  L+S+ +L      + F      I     + +    M+ N   +N  +   +++N K ++N +  H RFGH ++  L  +   NM  
Subjt:  KISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMVGNAFFLNGDSLCHHALNVKVEDN-KNGHHRFGHYNNKPLKILHSTNMVD

Query:  DFSNFTSFD---QICESCQEGNMHRLPFPK-GGSFRAKDKLELVHSDVWVP
        D S   + +   +ICE C  G   RLPF +       K  L +VHSDV  P
Subjt:  DFSNFTSFD---QICESCQEGNMHRLPFPK-GGSFRAKDKLELVHSDVWVP

P25601 Putative transposon Ty5-1 protein YCL075W1.3e-0535.71Show/hide
Query:  MASHTNDNKSTSWIIDSGCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSI
        ++S     KS+ WI D+GCT+HM  D  +FS   +S +   V G G ++   G GT  + T Q    + +V YVP L   L+S+
Subjt:  MASHTNDNKSTSWIIDSGCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQKLLSI

Arabidopsis top hitse value%identityAlignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein3.2e-1521.66Show/hide
Query:  LWKSVSTNINPQP-----LGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSA--RVKVVKLLTLKREFEMLKMR
        LW  V   +   P     L   +   ++    +  +K  KAL ++ ++L+D +F + +   + K  WD L +  E +   R++ V +  L+++ E LKM 
Subjt:  LWKSVSTNINPQP-----LGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSA--RVKVVKLLTLKREFEMLKMR

Query:  DSNFVRDYTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGKKPVAED
        D      Y  K + I+ ++  +     D  + KN+  ++S  F+   S +EE  D+  ++   L+     +    +   EE + G           + +D
Subjt:  DSNFVRDYTTKVMTIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGKKPVAED

Query:  DRRETKDQGSKGKGGASKKGWSYREVCYAKKTQTQHAQEQANCAHNNHETNFLFMASHTNDNKSTSWIIDSGCTTHMAKDIDLFSQIDKSIQYKVVLGHG
         R ++K +  K  G   K   +  +  +   T  +  +++    +       L   ++ +D     WII      +M   +  F+ +D++ +  V    G
Subjt:  DRRETKDQGSKGKGGASKKGWSYREVCYAKKTQTQHAQEQANCAHNNHETNFLFMASHTNDNKSTSWIIDSGCTTHMAKDIDLFSQIDKSIQYKVVLGHG

Query:  ETVLAEGKGTAIMHTKQGEKK-ISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQ-VCVIYDPQGVEIATVNMVGNAFFLNGDSLCHHALNVKVEDNK
          +L EGKG   +  K+G+KK I NV++VP L++ +LS  +++  ++ +    Q  C++ D         N +G+A ++  ++    AL +KV + K
Subjt:  ETVLAEGKGTAIMHTKQGEKK-ISNVLYVPRLSQKLLSIAQLLHNKFFVVFKDQ-VCVIYDPQGVEIATVNMVGNAFFLNGDSLCHHALNVKVEDNK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCTTGACCTTTGGAAATCTGTCTCAACTAATATTAATCCTCAACCATTAGGAGAAAATCTGACGTTGAATCAGATAAGACTACACGAAGAGGAAAAATTGAAAAA
GCCGAAGGCTTTATCCGTTATTCATGCCGCTTTATCAGATCCTATTTTTGCTAGGATCATTGATTGTAAAACAACAAAACAAGCTTGGGATAAATTGCATGAGGAATTTG
AAGGAAGTGCGAGGGTGAAGGTTGTCAAATTATTGACTCTCAAGAGAGAGTTTGAGATGTTGAAAATGAGAGATTCAAATTTTGTGAGGGACTACACAACAAAAGTGATG
ACCATCGTAAATCAGATAAGACTATCTGGTGAAAATTTTCCAGATCAAAGAGTTGTGAAAAACATAGTGGTTAGTGTTTCCAATAAATTTGAATCGAAGATCTCAGCCAT
CGAGGAGTCTTCTGATTTGACTACTCTGTCTATAGCTGAGTTAATTAGCAAATTACAAGCTCAAGAACAAAGGGATACAATGCGCAATGAAGAGCATGTTGAGGGTGCAT
TTCATGCCAAGTCTAAAGGCAAGAAACCTGTTGCAGAAGATGATAGACGAGAGACCAAAGATCAGGGGAGCAAGGGAAAAGGAGGAGCATCAAAGAAAGGATGGTCATAC
AGAGAAGTTTGCTATGCCAAGAAAACCCAAACCCAACATGCTCAAGAACAAGCGAATTGTGCCCACAATAATCATGAAACAAATTTTTTGTTTATGGCATCTCATACCAA
TGACAACAAGTCAACCTCATGGATTATTGATAGTGGATGCACTACTCACATGGCTAAAGATATCGATCTTTTTAGCCAAATTGACAAATCCATACAGTATAAGGTGGTCC
TTGGACATGGCGAGACGGTACTAGCTGAAGGTAAAGGTACTGCCATTATGCATACTAAGCAAGGTGAAAAGAAAATATCCAATGTCTTATATGTTCCAAGATTATCTCAA
AAGTTGCTCAGTATTGCTCAATTGTTGCATAACAAATTTTTCGTGGTTTTCAAGGACCAAGTTTGTGTTATTTATGACCCACAAGGAGTAGAGATTGCAACGGTCAACAT
GGTTGGAAATGCTTTCTTTTTGAATGGTGACTCTTTATGTCATCATGCACTAAATGTCAAAGTAGAAGATAACAAGAATGGGCATCATCGCTTTGGTCACTATAACAACA
AACCTTTGAAAATTTTGCATTCTACTAATATGGTTGATGATTTTTCTAATTTTACCTCATTTGATCAGATTTGTGAAAGTTGCCAAGAAGGGAACATGCACAGATTACCT
TTTCCCAAAGGTGGCAGCTTTAGAGCCAAAGACAAACTTGAATTAGTGCACAGTGATGTTTGGGTCCCATGCAAAATTCATCTATTGGGAATAATAAATATTTTATTCTT
TTTATTGATGATCTTACCAGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGCTTGACCTTTGGAAATCTGTCTCAACTAATATTAATCCTCAACCATTAGGAGAAAATCTGACGTTGAATCAGATAAGACTACACGAAGAGGAAAAATTGAAAAA
GCCGAAGGCTTTATCCGTTATTCATGCCGCTTTATCAGATCCTATTTTTGCTAGGATCATTGATTGTAAAACAACAAAACAAGCTTGGGATAAATTGCATGAGGAATTTG
AAGGAAGTGCGAGGGTGAAGGTTGTCAAATTATTGACTCTCAAGAGAGAGTTTGAGATGTTGAAAATGAGAGATTCAAATTTTGTGAGGGACTACACAACAAAAGTGATG
ACCATCGTAAATCAGATAAGACTATCTGGTGAAAATTTTCCAGATCAAAGAGTTGTGAAAAACATAGTGGTTAGTGTTTCCAATAAATTTGAATCGAAGATCTCAGCCAT
CGAGGAGTCTTCTGATTTGACTACTCTGTCTATAGCTGAGTTAATTAGCAAATTACAAGCTCAAGAACAAAGGGATACAATGCGCAATGAAGAGCATGTTGAGGGTGCAT
TTCATGCCAAGTCTAAAGGCAAGAAACCTGTTGCAGAAGATGATAGACGAGAGACCAAAGATCAGGGGAGCAAGGGAAAAGGAGGAGCATCAAAGAAAGGATGGTCATAC
AGAGAAGTTTGCTATGCCAAGAAAACCCAAACCCAACATGCTCAAGAACAAGCGAATTGTGCCCACAATAATCATGAAACAAATTTTTTGTTTATGGCATCTCATACCAA
TGACAACAAGTCAACCTCATGGATTATTGATAGTGGATGCACTACTCACATGGCTAAAGATATCGATCTTTTTAGCCAAATTGACAAATCCATACAGTATAAGGTGGTCC
TTGGACATGGCGAGACGGTACTAGCTGAAGGTAAAGGTACTGCCATTATGCATACTAAGCAAGGTGAAAAGAAAATATCCAATGTCTTATATGTTCCAAGATTATCTCAA
AAGTTGCTCAGTATTGCTCAATTGTTGCATAACAAATTTTTCGTGGTTTTCAAGGACCAAGTTTGTGTTATTTATGACCCACAAGGAGTAGAGATTGCAACGGTCAACAT
GGTTGGAAATGCTTTCTTTTTGAATGGTGACTCTTTATGTCATCATGCACTAAATGTCAAAGTAGAAGATAACAAGAATGGGCATCATCGCTTTGGTCACTATAACAACA
AACCTTTGAAAATTTTGCATTCTACTAATATGGTTGATGATTTTTCTAATTTTACCTCATTTGATCAGATTTGTGAAAGTTGCCAAGAAGGGAACATGCACAGATTACCT
TTTCCCAAAGGTGGCAGCTTTAGAGCCAAAGACAAACTTGAATTAGTGCACAGTGATGTTTGGGTCCCATGCAAAATTCATCTATTGGGAATAATAAATATTTTATTCTT
TTTATTGATGATCTTACCAGGATGA
Protein sequenceShow/hide protein sequence
MALDLWKSVSTNINPQPLGENLTLNQIRLHEEEKLKKPKALSVIHAALSDPIFARIIDCKTTKQAWDKLHEEFEGSARVKVVKLLTLKREFEMLKMRDSNFVRDYTTKVM
TIVNQIRLSGENFPDQRVVKNIVVSVSNKFESKISAIEESSDLTTLSIAELISKLQAQEQRDTMRNEEHVEGAFHAKSKGKKPVAEDDRRETKDQGSKGKGGASKKGWSY
REVCYAKKTQTQHAQEQANCAHNNHETNFLFMASHTNDNKSTSWIIDSGCTTHMAKDIDLFSQIDKSIQYKVVLGHGETVLAEGKGTAIMHTKQGEKKISNVLYVPRLSQ
KLLSIAQLLHNKFFVVFKDQVCVIYDPQGVEIATVNMVGNAFFLNGDSLCHHALNVKVEDNKNGHHRFGHYNNKPLKILHSTNMVDDFSNFTSFDQICESCQEGNMHRLP
FPKGGSFRAKDKLELVHSDVWVPCKIHLLGIINILFFLLMILPG