; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021666 (gene) of Snake gourd v1 genome

Gene IDTan0021666
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationLG04:77198382..77206583
RNA-Seq ExpressionTan0021666
SyntenyTan0021666
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN81099.1 hypothetical protein VITISV_017741 [Vitis vinifera]3.5e-5138.31Show/hide
Query:  LLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTP--PSANLMIQGNR----NVKADTQKSGNQYQG----------------
        +L GL ++YES ++ +  + +   V+E+ ALL+ HE+R+E     ++S  +    S+N + +GNR       A++Q S + Y G                
Subjt:  LLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTP--PSANLMIQGNR----NVKADTQKSGNQYQG----------------

Query:  ----HHNYNSRG-----RGRFNRGG----RSWNNRN---KLQCQLCSKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKASQMSAMI
            + NYN R      RGR N+G       WN+ N   K  CQLC K  H   +CY    RFD  H+    Q+ +S    P+    ++ + + Q++ +I
Subjt:  ----HHNYNSRG-----RGRFNRGG----RSWNNRN---KLQCQLCSKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKASQMSAMI

Query:  AAPDLNQDNSWYPDSGATNHLTNDFNNLAVGTGYFGGNQVQVGNGAGLPISHFGYSSFPSP--SKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFH
           ++  D++WYPDSGA+NH+T +  NL     + G NQV VGNG GL I H G S F SP  SK L  LN+LLHVP  TKNL+SVS+FAKDN VFFEFH
Subjt:  AAPDLNQDNSWYPDSGATNHLTNDFNNLAVGTGYFGGNQVQVGNGAGLPISHFGYSSFPSP--SKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFH

Query:  SDFCCVKDRLTGKVLLQGPLHEGLYRFNLSSHHPSSTSGSMSKTCLPS-HSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCNSVLSSSVL
        SD C VKD++T  VL+ G + +GLY F+ SSH     + S+SK+  PS  +SSF S   V TT      D WH+RLGHP+   +K +   CN    + + 
Subjt:  SDFCCVKDRLTGKVLLQGPLHEGLYRFNLSSHHPSSTSGSMSKTCLPS-HSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCNSVLSSSVL

Query:  FNFCNAVLLESLMPF
         NFC++  L  +  F
Subjt:  FNFCNAVLLESLMPF

KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.2e-7046.17Show/hide
Query:  MYLLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANLMIQ-GNRNVKADTQKSGNQYQGHHNYN---SRGRGRFNRGG
        +Y+L GLGS+Y+SMISVI+A+T++  VQEVM+LLLT E++ E+K+    S+   PS N++ Q   +  ++  + + N Y  +H+YN    RG GR NRG 
Subjt:  MYLLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANLMIQ-GNRNVKADTQKSGNQYQGHHNYN---SRGRGRFNRGG

Query:  RSWNNRNKLQCQLCSKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKAS--QMSAMIAAPDLNQDNSWYPDSGATNHLTNDFNNLAV
        R   NRNK QCQ+C+K  ++A +C+                 SNS G+ P     S+    +  QMSAM+AA DLN D++WYPDSGATNHLT+  +NL++
Subjt:  RSWNNRNKLQCQLCSKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKAS--QMSAMIAAPDLNQDNSWYPDSGATNHLTNDFNNLAV

Query:  GTGYFGGNQVQVGNGAGLPISHFGYSSFPS---PSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLLQGPLHEGLYRFNL
        G+ Y GGNQ+   NG+GLPI+H+G  SF S   P K  F LNNLL VP  TKNLISVSQFAKDN VFFEFH   C VKD  TG+VLLQG L++GLY+F +
Subjt:  GTGYFGGNQVQVGNGAGLPISHFGYSSFPS---PSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLLQGPLHEGLYRFNL

Query:  SSHHPSSTSGSMSKTCLPSHSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCNSVLSSSVLFNFCNAVLL
           H         K    S+S++ P  + V        LD WHRRLGHP LP+VK +    ++   +    NFC A  L
Subjt:  SSHHPSSTSGSMSKTCLPSHSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCNSVLSSSVLFNFCNAVLL

KAF7832320.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Senna tora]1.1e-5241.65Show/hide
Query:  LLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANL-----------MIQGNRNVKADTQKSGNQYQGHHNYNSRGRGR
        +L GL  EYES ++ I  +TE   V E+  LL+  E R+E  +K   ++   PSAN+               N   ++  ++  NQ  G      +GRGR
Subjt:  LLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANL-----------MIQGNRNVKADTQKSGNQYQGHHNYNSRGRGR

Query:  FN---------RGGRSWN-NRNKLQCQLCSKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKASQMSAMIAAPDLNQDNSWYPDSGA
         N         RGG S N NR  + CQ+CSK  H A  CY    RFD+++  ++QQ S   G   QF   + P     MSA IA P++  D++W+PDSGA
Subjt:  FN---------RGGRSWN-NRNKLQCQLCSKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKASQMSAMIAAPDLNQDNSWYPDSGA

Query:  TNHLTNDFNNLAVGTGYFGGNQVQVGNGAGLPISHFGYSSFPS--PSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLLQ
        TNH+T+D NNL  G+ Y G  Q+ +GNG GL IS  G S   S  P+ HL  LN+LLHVP  TKNLISVS+FAKDN VFFEFHS++C VK ++T +VLL+
Subjt:  TNHLTNDFNNLAVGTGYFGGNQVQVGNGAGLPISHFGYSSFPS--PSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLLQ

Query:  GPL-HEGLYRF------NLSSHHPSSTSG-SMSKTC-----LPSHSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCNSVLSSSVLFNFCN
        G +  +GLY+F      +LS+ + SSTS  S S  C       S S S PST + + T       TWH RLGH    VV  + KLCN  +S+     FC+
Subjt:  GPL-HEGLYRF------NLSSHHPSSTSG-SMSKTC-----LPSHSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCNSVLSSSVLFNFCN

Query:  A
        A
Subjt:  A

KZV26181.1 hypothetical protein F511_06348 [Dorcoceras hygrometricum]9.6e-5741.54Show/hide
Query:  MYLLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANLMIQGNRNVKADTQKSGNQYQGHHNYNSRGRGRFNRGGRS-W
        +++L G+G EYES++  +T++ E+  + EV ALLL HE RIET           PS N+            +K+ N  Q    Y  RGRGR  RGGR  W
Subjt:  MYLLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANLMIQGNRNVKADTQKSGNQYQGHHNYNSRGRGRFNRGGRS-W

Query:  NNRNKLQCQLCSKFRHTALKCYSLAGRFD---------TSHSSNQQQSSNSGGFCPQFGVGSFPTKASQMSAMIAAPDLNQDNSWYPDSGATNHLTNDFN
        +N  +  CQ+C    H A  CY    RFD          S +S QQ + +S    P +   +F +  S+ ++         +  WYPDSGA++H+TND  
Subjt:  NNRNKLQCQLCSKFRHTALKCYSLAGRFD---------TSHSSNQQQSSNSGGFCPQFGVGSFPTKASQMSAMIAAPDLNQDNSWYPDSGATNHLTNDFN

Query:  NLAVGTGYFGGNQVQVGNGAGLPISHFGYSSFPS-PSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLLQGPLHEGLYRF
        NL+V + Y GG++VQVGNGAGL IS+ G S+    PS   F L NLLHVP  TKNLISVS+FA DN V+FEFH  FC VKD  T  VLL+G LH GLYRF
Subjt:  NLAVGTGYFGGNQVQVGNGAGLPISHFGYSSFPS-PSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLLQGPLHEGLYRF

Query:  NLSSHHPSSTSGSM-SKTCLPSHSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCNSVLSSSVLFNFCNAVLL--ESLMPF
        NL S      SG + S  CL S  S                LD WH RLGHP++  VKQ+   CN  +S +   +FC++  L    L+PF
Subjt:  NLSSHHPSSTSGSM-SKTCLPSHSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCNSVLSSSVLFNFCNAVLL--ESLMPF

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.2e-7046.17Show/hide
Query:  MYLLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANLMIQ-GNRNVKADTQKSGNQYQGHHNYN---SRGRGRFNRGG
        +Y+L GLGS+Y+SMISVI+A+T++  VQEVM+LLLT E++ E+K+    S+   PS N++ Q   +  ++  + + N Y  +H+YN    RG GR NRG 
Subjt:  MYLLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANLMIQ-GNRNVKADTQKSGNQYQGHHNYN---SRGRGRFNRGG

Query:  RSWNNRNKLQCQLCSKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKAS--QMSAMIAAPDLNQDNSWYPDSGATNHLTNDFNNLAV
        R   NRNK QCQ+C+K  ++A +C+                 SNS G+ P     S+    +  QMSAM+AA DLN D++WYPDSGATNHLT+  +NL++
Subjt:  RSWNNRNKLQCQLCSKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKAS--QMSAMIAAPDLNQDNSWYPDSGATNHLTNDFNNLAV

Query:  GTGYFGGNQVQVGNGAGLPISHFGYSSFPS---PSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLLQGPLHEGLYRFNL
        G+ Y GGNQ+   NG+GLPI+H+G  SF S   P K  F LNNLL VP  TKNLISVSQFAKDN VFFEFH   C VKD  TG+VLLQG L++GLY+F +
Subjt:  GTGYFGGNQVQVGNGAGLPISHFGYSSFPS---PSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLLQGPLHEGLYRFNL

Query:  SSHHPSSTSGSMSKTCLPSHSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCNSVLSSSVLFNFCNAVLL
           H         K    S+S++ P  + V        LD WHRRLGHP LP+VK +    ++   +    NFC A  L
Subjt:  SSHHPSSTSGSMSKTCLPSHSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCNSVLSSSVLFNFCNAVLL

TrEMBL top hitse value%identityAlignment
A0A2Z7AWA7 Integrase catalytic domain-containing protein4.6e-5741.54Show/hide
Query:  MYLLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANLMIQGNRNVKADTQKSGNQYQGHHNYNSRGRGRFNRGGRS-W
        +++L G+G EYES++  +T++ E+  + EV ALLL HE RIET           PS N+            +K+ N  Q    Y  RGRGR  RGGR  W
Subjt:  MYLLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANLMIQGNRNVKADTQKSGNQYQGHHNYNSRGRGRFNRGGRS-W

Query:  NNRNKLQCQLCSKFRHTALKCYSLAGRFD---------TSHSSNQQQSSNSGGFCPQFGVGSFPTKASQMSAMIAAPDLNQDNSWYPDSGATNHLTNDFN
        +N  +  CQ+C    H A  CY    RFD          S +S QQ + +S    P +   +F +  S+ ++         +  WYPDSGA++H+TND  
Subjt:  NNRNKLQCQLCSKFRHTALKCYSLAGRFD---------TSHSSNQQQSSNSGGFCPQFGVGSFPTKASQMSAMIAAPDLNQDNSWYPDSGATNHLTNDFN

Query:  NLAVGTGYFGGNQVQVGNGAGLPISHFGYSSFPS-PSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLLQGPLHEGLYRF
        NL+V + Y GG++VQVGNGAGL IS+ G S+    PS   F L NLLHVP  TKNLISVS+FA DN V+FEFH  FC VKD  T  VLL+G LH GLYRF
Subjt:  NLAVGTGYFGGNQVQVGNGAGLPISHFGYSSFPS-PSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLLQGPLHEGLYRF

Query:  NLSSHHPSSTSGSM-SKTCLPSHSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCNSVLSSSVLFNFCNAVLL--ESLMPF
        NL S      SG + S  CL S  S                LD WH RLGHP++  VKQ+   CN  +S +   +FC++  L    L+PF
Subjt:  NLSSHHPSSTSGSM-SKTCLPSHSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCNSVLSSSVLFNFCNAVLL--ESLMPF

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-945.6e-7146.17Show/hide
Query:  MYLLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANLMIQ-GNRNVKADTQKSGNQYQGHHNYN---SRGRGRFNRGG
        +Y+L GLGS+Y+SMISVI+A+T++  VQEVM+LLLT E++ E+K+    S+   PS N++ Q   +  ++  + + N Y  +H+YN    RG GR NRG 
Subjt:  MYLLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANLMIQ-GNRNVKADTQKSGNQYQGHHNYN---SRGRGRFNRGG

Query:  RSWNNRNKLQCQLCSKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKAS--QMSAMIAAPDLNQDNSWYPDSGATNHLTNDFNNLAV
        R   NRNK QCQ+C+K  ++A +C+                 SNS G+ P     S+    +  QMSAM+AA DLN D++WYPDSGATNHLT+  +NL++
Subjt:  RSWNNRNKLQCQLCSKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKAS--QMSAMIAAPDLNQDNSWYPDSGATNHLTNDFNNLAV

Query:  GTGYFGGNQVQVGNGAGLPISHFGYSSFPS---PSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLLQGPLHEGLYRFNL
        G+ Y GGNQ+   NG+GLPI+H+G  SF S   P K  F LNNLL VP  TKNLISVSQFAKDN VFFEFH   C VKD  TG+VLLQG L++GLY+F +
Subjt:  GTGYFGGNQVQVGNGAGLPISHFGYSSFPS---PSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLLQGPLHEGLYRFNL

Query:  SSHHPSSTSGSMSKTCLPSHSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCNSVLSSSVLFNFCNAVLL
           H         K    S+S++ P  + V        LD WHRRLGHP LP+VK +    ++   +    NFC A  L
Subjt:  SSHHPSSTSGSMSKTCLPSHSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCNSVLSSSVLFNFCNAVLL

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-945.6e-7146.17Show/hide
Query:  MYLLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANLMIQ-GNRNVKADTQKSGNQYQGHHNYN---SRGRGRFNRGG
        +Y+L GLGS+Y+SMISVI+A+T++  VQEVM+LLLT E++ E+K+    S+   PS N++ Q   +  ++  + + N Y  +H+YN    RG GR NRG 
Subjt:  MYLLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANLMIQ-GNRNVKADTQKSGNQYQGHHNYN---SRGRGRFNRGG

Query:  RSWNNRNKLQCQLCSKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKAS--QMSAMIAAPDLNQDNSWYPDSGATNHLTNDFNNLAV
        R   NRNK QCQ+C+K  ++A +C+                 SNS G+ P     S+    +  QMSAM+AA DLN D++WYPDSGATNHLT+  +NL++
Subjt:  RSWNNRNKLQCQLCSKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKAS--QMSAMIAAPDLNQDNSWYPDSGATNHLTNDFNNLAV

Query:  GTGYFGGNQVQVGNGAGLPISHFGYSSFPS---PSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLLQGPLHEGLYRFNL
        G+ Y GGNQ+   NG+GLPI+H+G  SF S   P K  F LNNLL VP  TKNLISVSQFAKDN VFFEFH   C VKD  TG+VLLQG L++GLY+F +
Subjt:  GTGYFGGNQVQVGNGAGLPISHFGYSSFPS---PSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLLQGPLHEGLYRFNL

Query:  SSHHPSSTSGSMSKTCLPSHSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCNSVLSSSVLFNFCNAVLL
           H         K    S+S++ P  + V        LD WHRRLGHP LP+VK +    ++   +    NFC A  L
Subjt:  SSHHPSSTSGSMSKTCLPSHSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCNSVLSSSVLFNFCNAVLL

A5BFT3 Integrase catalytic domain-containing protein1.7e-5138.31Show/hide
Query:  LLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTP--PSANLMIQGNR----NVKADTQKSGNQYQG----------------
        +L GL ++YES ++ +  + +   V+E+ ALL+ HE+R+E     ++S  +    S+N + +GNR       A++Q S + Y G                
Subjt:  LLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTP--PSANLMIQGNR----NVKADTQKSGNQYQG----------------

Query:  ----HHNYNSRG-----RGRFNRGG----RSWNNRN---KLQCQLCSKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKASQMSAMI
            + NYN R      RGR N+G       WN+ N   K  CQLC K  H   +CY    RFD  H+    Q+ +S    P+    ++ + + Q++ +I
Subjt:  ----HHNYNSRG-----RGRFNRGG----RSWNNRN---KLQCQLCSKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKASQMSAMI

Query:  AAPDLNQDNSWYPDSGATNHLTNDFNNLAVGTGYFGGNQVQVGNGAGLPISHFGYSSFPSP--SKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFH
           ++  D++WYPDSGA+NH+T +  NL     + G NQV VGNG GL I H G S F SP  SK L  LN+LLHVP  TKNL+SVS+FAKDN VFFEFH
Subjt:  AAPDLNQDNSWYPDSGATNHLTNDFNNLAVGTGYFGGNQVQVGNGAGLPISHFGYSSFPSP--SKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFH

Query:  SDFCCVKDRLTGKVLLQGPLHEGLYRFNLSSHHPSSTSGSMSKTCLPS-HSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCNSVLSSSVL
        SD C VKD++T  VL+ G + +GLY F+ SSH     + S+SK+  PS  +SSF S   V TT      D WH+RLGHP+   +K +   CN    + + 
Subjt:  SDFCCVKDRLTGKVLLQGPLHEGLYRFNLSSHHPSSTSGSMSKTCLPS-HSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCNSVLSSSVL

Query:  FNFCNAVLLESLMPF
         NFC++  L  +  F
Subjt:  FNFCNAVLLESLMPF

A5BK17 Integrase catalytic domain-containing protein5.5e-5039.89Show/hide
Query:  LLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANLMIQGNR------NVKADTQK--------SGNQYQGHH-NYNSR
        +  GL  +YE+ I  + ++ +   V+E+ ALLL  E+RIE  +K   +D + PS   +I  NR      N +A T+         SGN  Q    N+  +
Subjt:  LLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANLMIQGNR------NVKADTQK--------SGNQYQGHH-NYNSR

Query:  GRGRFNRGGRSWNNRNKLQCQLCSKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKASQM------SAMIAAPDLNQDNSWYPDSGA
        GRGR  RG  SW   NK QCQLC +  H  ++CY    RFD S +   Q   N     PQ  +     + S+       S      ++ QDN+WYPDSGA
Subjt:  GRGRFNRGGRSWNNRNKLQCQLCSKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKASQM------SAMIAAPDLNQDNSWYPDSGA

Query:  TNHLTNDFNNLAVGTGYFGGNQVQVGNGAGLPISHFGYSSFPS---PSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLL
        T+HLT + NNL   + +   ++V VGNG GLPI H G++SF S   PSK L  L  LLHVP+ TKNL+SVS+FA DN VFFEFH   C VKD  T  VL+
Subjt:  TNHLTNDFNNLAVGTGYFGGNQVQVGNGAGLPISHFGYSSFPS---PSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLL

Query:  QGPLHEGLYRF-NLSSHHPSSTSGSMSKTCLPSHSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCN
         G L  GLY F N     P   S   + T LPS   + P     ++   PF L  WH RLGHP+  +V  +   CN
Subjt:  QGPLHEGLYRF-NLSSHHPSSTSGSMSKTCLPSHSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCN

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.6e-2831.68Show/hide
Query:  LLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANLMIQGNRNVKADTQKSGNQYQGHHNYNSRGRGRFNRGGRSW---
        +L  L  EY+ +I  I AK     + E+   LL HE++I   +   ++   P +AN +   N     +   +GN+   + N N+      N   + W   
Subjt:  LLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANLMIQGNRNVKADTQKSGNQYQGHHNYNSRGRGRFNRGGRSW---

Query:  -------NNRNKL---QCQLCSKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKASQMSAMIAAPDLNQDNSWYPDSGATNHLTNDF
               NN++K    +CQ+C    H+A +C  L   F +S +S Q  S               P    Q  A +A       N+W  DSGAT+H+T+DF
Subjt:  -------NNRNKL---QCQLCSKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKASQMSAMIAAPDLNQDNSWYPDSGATNHLTNDF

Query:  NNLAVGTGYFGGNQVQVGNGAGLPISHFGYSSFPSPSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLLQGPLHEGLYRF
        NNL++   Y GG+ V V +G+ +PISH G +S  + S+ L +L+N+L+VP   KNLISV +    N V  EF      VKD  TG  LLQG   + LY +
Subjt:  NNLAVGTGYFGGNQVQVGNGAGLPISHFGYSSFPSPSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLLQGPLHEGLYRF

Query:  NLSSHHPSSTSGSMSKTCLPSHSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQ-IAKLCNSVLSSSVLFNFCNAVLL
         ++S  P S   S S     +HSS                   WH RLGHPA  ++   I+    SVL+ S  F  C+  L+
Subjt:  NLSSHHPSSTSGSMSKTCLPSHSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQ-IAKLCNSVLSSSVLFNFCNAVLL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.6e-2831.13Show/hide
Query:  LLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANLMIQGNRNVKADTQKSG-NQYQGHHNYNSRGRGRFNRGGRSWNN
        +L  L  +Y+ +I  I AK     + E+   L+  E+++   +   +++  P +AN++   N N   +    G N+   ++N  S      + G RS N 
Subjt:  LLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANLMIQGNRNVKADTQKSG-NQYQGHHNYNSRGRGRFNRGGRSWNN

Query:  RNKL---QCQLCSKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKASQMSAMIAAPDLNQDNSWYPDSGATNHLTNDFNNLAVGTGY
        + K    +CQ+CS   H+A +C  L        ++NQQQS++             P    Q  A +A       N+W  DSGAT+H+T+DFNNL+    Y
Subjt:  RNKL---QCQLCSKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKASQMSAMIAAPDLNQDNSWYPDSGATNHLTNDFNNLAVGTGY

Query:  FGGNQVQVGNGAGLPISHFGYSSFPSPSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLLQGPLHEGLYRFNLSSHHPSS
         GG+ V + +G+ +PI+H G +S P+ S+ L  LN +L+VP   KNLISV +    N V  EF      VKD  TG  LLQG   + LY +      P +
Subjt:  FGGNQVQVGNGAGLPISHFGYSSFPSPSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLLQGPLHEGLYRFNLSSHHPSS

Query:  TSGSMSKTCLPSHSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCNSVLSSSVL
        +S ++S    P       +TH            +WH RLGHP+L ++       NSV+S+  L
Subjt:  TSGSMSKTCLPSHSSSFPSTHIVSTTFKPFALDTWHRRLGHPALPVVKQIAKLCNSVLSSSVL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATCTTTTGACTGGCCTAGGGTCTGAGTATGAATCAATGATCTCAGTAATTACAGCAAAAACAGAAACACAAGATGTTCAAGAGGTTATGGCATTACTTCTTACTCA
TGAGAATAGAATTGAAACTAAAATGAAGAAAGTTAACTCTGATGGAACTCCTCCATCTGCTAACTTGATGATTCAAGGGAACAGAAATGTAAAAGCTGATACTCAGAAGT
CAGGAAATCAGTATCAAGGCCACCATAATTATAACTCTCGAGGACGAGGGCGTTTTAATCGTGGCGGTCGATCATGGAACAACCGGAACAAACTTCAATGTCAACTTTGT
TCAAAATTTAGACATACAGCACTTAAATGTTACTCACTTGCTGGGCGTTTTGATACTTCTCATAGTTCTAATCAGCAGCAATCCTCAAATTCAGGAGGTTTTTGTCCTCA
GTTTGGTGTTGGATCCTTTCCTACCAAGGCTTCTCAGATGTCTGCTATGATTGCAGCTCCTGATCTTAATCAAGACAATAGTTGGTATCCTGATTCTGGTGCTACAAATC
ATCTCACTAACGATTTCAACAATTTGGCTGTAGGTACAGGGTATTTTGGTGGTAATCAGGTGCAAGTTGGAAATGGTGCAGGTTTGCCCATATCTCACTTTGGTTATTCC
TCTTTTCCATCTCCTTCTAAACATCTTTTTCATCTTAATAATCTTCTTCATGTTCCTAAAAAAACCAAAAACTTGATCAGTGTGAGTCAATTTGCCAAAGATAATTCAGT
CTTTTTTGAATTTCACTCTGATTTTTGTTGTGTGAAGGATCGTCTTACTGGCAAAGTTCTACTCCAAGGACCACTCCATGAGGGACTGTACCGGTTCAACTTATCCTCGC
ATCATCCTTCGTCGACTAGTGGCTCTATGTCTAAAACTTGTTTGCCATCTCATTCTTCTAGTTTTCCCTCTACACACATTGTTTCTACTACTTTTAAGCCTTTTGCTCTA
GATACTTGGCATCGACGACTTGGTCACCCTGCCCTTCCAGTTGTTAAACAGATTGCTAAACTTTGTAATTCAGTTTTGTCTTCTTCTGTACTTTTCAACTTTTGTAATGC
TGTGCTGTTGGAAAGTCTCATGCCCTTCGTTTTACTCCCTCTGTTACTACTTACACTGCTCCTTTGCAATTAA
mRNA sequenceShow/hide mRNA sequence
TGAAAAGTAAATGGAAAGGGTTTCTCCAGTAACCTTCTGAGTTCTTTCTTTATTCATTCATGGTATTAGAACCCGGTCTTGCGACTGAGTTCGATCCGTCATTTGAGATT
CTTCGTTTTGTGACCGTTGTTTTTAATAAATCTCTTCGGTTAATTAGAGTTGAAAATTTCTTCAATTCTATAGAGAAATTCTATTTTAGTCACGGATTCCTCAATTAATA
CAGGAGTCGAATCTACGGAAGAAAGAGGAAATCAGGTGAATCAGGTAATTAATCCTGGTAATAAGATCTCTACTGTGAAACTAACTAACGATAATTTTCTTATGTGGAAA
GTTCAAATTGAATTTGCCCTAGAAGGTCACGATTTAGAGAATTTCATCAATGATGATACAGAACCACCACCTAAGAGAATTCCTATGACTGAAGGTTCAAATATTACTAA
ATTGAATCCTGCTTTTATAAAATGGAAACGACAAGATAGGTTGATCTCTTCTTGGTTGCTTGGATCTATGACTGAAGGAATTTCAGAACAAGTAATACATTGTAAATCTG
CTGGAGAAATTTGGAAATGTTTGCTCCAGATTTTTAATTCCAGAAATATGGCTCGAATAATGAGGATGAAGACTAAACTTCAGACTATACAAAAAGGCGGTATAAAGAAT
ATTTTGCTCAAATCAAGAAATGTGTTGATGCACTTGCTGCTATAGGAAAGGAGGTTCCAATTGAGGACCATAAATGTATCTTTTGACTGGCCTAGGGTCTGAGTATGAAT
CAATGATCTCAGTAATTACAGCAAAAACAGAAACACAAGATGTTCAAGAGGTTATGGCATTACTTCTTACTCATGAGAATAGAATTGAAACTAAAATGAAGAAAGTTAAC
TCTGATGGAACTCCTCCATCTGCTAACTTGATGATTCAAGGGAACAGAAATGTAAAAGCTGATACTCAGAAGTCAGGAAATCAGTATCAAGGCCACCATAATTATAACTC
TCGAGGACGAGGGCGTTTTAATCGTGGCGGTCGATCATGGAACAACCGGAACAAACTTCAATGTCAACTTTGTTCAAAATTTAGACATACAGCACTTAAATGTTACTCAC
TTGCTGGGCGTTTTGATACTTCTCATAGTTCTAATCAGCAGCAATCCTCAAATTCAGGAGGTTTTTGTCCTCAGTTTGGTGTTGGATCCTTTCCTACCAAGGCTTCTCAG
ATGTCTGCTATGATTGCAGCTCCTGATCTTAATCAAGACAATAGTTGGTATCCTGATTCTGGTGCTACAAATCATCTCACTAACGATTTCAACAATTTGGCTGTAGGTAC
AGGGTATTTTGGTGGTAATCAGGTGCAAGTTGGAAATGGTGCAGGTTTGCCCATATCTCACTTTGGTTATTCCTCTTTTCCATCTCCTTCTAAACATCTTTTTCATCTTA
ATAATCTTCTTCATGTTCCTAAAAAAACCAAAAACTTGATCAGTGTGAGTCAATTTGCCAAAGATAATTCAGTCTTTTTTGAATTTCACTCTGATTTTTGTTGTGTGAAG
GATCGTCTTACTGGCAAAGTTCTACTCCAAGGACCACTCCATGAGGGACTGTACCGGTTCAACTTATCCTCGCATCATCCTTCGTCGACTAGTGGCTCTATGTCTAAAAC
TTGTTTGCCATCTCATTCTTCTAGTTTTCCCTCTACACACATTGTTTCTACTACTTTTAAGCCTTTTGCTCTAGATACTTGGCATCGACGACTTGGTCACCCTGCCCTTC
CAGTTGTTAAACAGATTGCTAAACTTTGTAATTCAGTTTTGTCTTCTTCTGTACTTTTCAACTTTTGTAATGCTGTGCTGTTGGAAAGTCTCATGCCCTTCGTTTTACTC
CCTCTGTTACTACTTACACTGCTCCTTTGCAATTAATTGTAGCTGATTTGTGGGGCCCTCTTATAAAGCTTCTAGAAATGGTTTTCGATATTAAATTAGTTTTGTTGATG
TTTTTTCCTGTTATACTTGGATTTATTTTCTGAATACTAAATCAGATGCTTTCAAAGCATTTCTTTCATTTAAAGCCTCTGTTGAGAAACTTCTTGGTCTTTCTATTCTT
CGTTTTCAATCTGATGGGGGAGAGGAATTTAAAATTTTTACCTCATTTCTTCAAACACGTGGCATTGATCATAGAATTTCTTGCCCTTATACCTCTCAACAAAATGAAAT
TGTTGAGCGTAAATGTTGAGGATCCCACATTGGAAAAGTGGAGAGAAACCTCACAATATATATGATATATGGGTTACTCCTCTCATTGCCAATTGGTTTTGAGATAGAAC
CCCATATAATCTAATATGGTATCAGAGCCCATTAAACTCAAACTGGTATTCGGTCCAAAAATTGAAATTGGGATCCGATCCAAGAATGGTGAACCCAAAAAGGCACCATC
TTGAGGGGGCATGTTGAGGATCCCACATTGGAAAAGTTGAGAGAAACCTCACAATATATATGATATATGGGCTACTCCTCTCATTGCCAATTGGTTTTGAGATGGAACCC
CATATAACCTAATAGTAAACACCGACAAATTGTTAACATTGGTCTTACACTCTTGTCACAAGCTTCCATGCCACTTGGTTTTTGGGATGATGCATTCTCCTCTGTTGTTT
ACCTCATGAACCGATTGCCTTCTACATCCCTGAATGGTATTTGTCCTATGGTTAAATTGTTTAATACCCAACCCAATTATTCCTTAAAAGTTTTTGGTCGTCGTTGTTAT
CCTTCTCTTCGCTCCTATAATTCTCACAAACTTAATTTTCGTTCTAAATTGTGTACTTTCATTGGCTATAGTAATCAGTATAAGGGTTACAAATGTCTTTCTTCTAATGG
TCGAGTTTTTGTTTCTCAACATGTTAATTTTGACGAACACACCTTTTCATTTGTTGTTTCACCTACTGTTTCCTCACCTTCTGAGAGATTCGTTAACCAATGTTTACCTG
TGCTTTCACCTACTTCTTCTTCCCGAACTCTTGAGTCATCTATATCACCTGCATCTGTTTCCACTCCACCTGAATTGTCATCGAATTTTCCAATACCTAATGTTTTGCCA
ACAACAACGTTGTCATCTCCTATGTCTAATCATGATACTGATGTGCCTACTTCTGGTGAATCTATTTTACCCTTGATGTCTCCTATTGCTAATGATCATCCTTAAAACAG
TTGAAGTGGTTTCATCTGCTAAAGTTACAACATATGAATAATATCATGGATGTTACAAAATTTTTATAATATCATTCTTAAGACAGTTGAAGAAAAGAAACCAACCCCAC
AATGAGGATCTGTTTGTGCAACCGTTGTAAAAGGAAGTTGAAACAAATTTCTGTTAAAAAAAAGGCATACTTACCTCGACTGTTTTCTCAACCTCTTAACCGTGAACCCT
CTTCATCTAACAAAATTCTCTTCTATGGTTTTTTTAGAAGATGGATGATGTACCTCGAAAAATAGATAGAGATAGAGAGAGAGAGAGAGAGAGAGAGAGAAAGTTGTGTG
AGAGAGTTGTGCTTTGTAAGTTCTTTTCATCTTTGAACAACAATGAAAACAGAATACTGATTACCTTACAGTGGATGTAGGCCACCATGGCTGAACCACTTAAATCTTGT
GTGTATCCGACTCATCTCTTCTCCTTTTCTTTTTGGATTTTCTTCTTCGATCTTTAGCTGCTTCTTCTGGATTGTTTTTGTTCTTTGTGAAGCCTTCGGGAGAAAGGATT
TGTTCTTTGCATGTTTACAGAGCAATACTTGAAAAAGGAAAAAGCAATTGAAAGAAGACTGATAAAGGGCATCGGTACTTGCATTCACGTGGAGTGTTTTCTGGCCACAG
AGGCATACCTGGAGGGAGCCATTATCAACTGATGCTTACCTATTATGTCAATAATCATGCGGCAGCTTTTGGCCCCATGACCATCTTATGGGCCATAGTGATTCAGGGGT
CGTGTAGAAGGGTTCTCGCAACGGATCAAATATATGTCAGGGATAACAAGCGGATGACTCCCAAGAGCTCTTATCAACAGAGTCATTTGGCACCTCAATGTCGACTCATC
ACATCCTAAGATTGAAGAAGGTTCCAAAGGTTCAGTTGTTCATTGATTCAAGTGGTATGTTATTACCTCGATGTCCACATAGGGGCACATAAATGTAGGAAACATTTCAC
CCTCACTGGAAAAATGTTCACCAAGCCAAATTGCAGCAACTTTCCAAATGAAAAGAAACTATCTTTGTGAGTTGCAGCTTGCATGACCTCAAAGAGCCAATTGAGTTGCG
AGTTCGAGTGAAGATTAACGGAAGTGCTGCAGGAATAATCTCTTAGAATCAAGTTGTCTAAGTATCATGAATCCAAATCAGCCACCAATAATCCTTACACAAACAAAGTC
CCTGCTCAAAGTGATGGAAACTATATCGATCTCGCACCACAATTTGACGACAAAAAGCCTTGGAGATTAATTTTGCAGCAGTGTGACAGTCATTACATATTCTCAAGTTT
TTCATTATCCTGATTGGTGTTGGTGGACTAATATAATGTTAAGGCCAAATGCAATGACTA
Protein sequenceShow/hide protein sequence
MYLLTGLGSEYESMISVITAKTETQDVQEVMALLLTHENRIETKMKKVNSDGTPPSANLMIQGNRNVKADTQKSGNQYQGHHNYNSRGRGRFNRGGRSWNNRNKLQCQLC
SKFRHTALKCYSLAGRFDTSHSSNQQQSSNSGGFCPQFGVGSFPTKASQMSAMIAAPDLNQDNSWYPDSGATNHLTNDFNNLAVGTGYFGGNQVQVGNGAGLPISHFGYS
SFPSPSKHLFHLNNLLHVPKKTKNLISVSQFAKDNSVFFEFHSDFCCVKDRLTGKVLLQGPLHEGLYRFNLSSHHPSSTSGSMSKTCLPSHSSSFPSTHIVSTTFKPFAL
DTWHRRLGHPALPVVKQIAKLCNSVLSSSVLFNFCNAVLLESLMPFVLLPLLLLTLLLCN