CuGenDBv2

Tan0022255 (gene) of Snake gourd v1 genome

Gene ID: Tan0022255
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG06:29881278..29883352
RNA-Seq Expression: Tan0022255
Synteny: Tan0022255
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
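
The genome location field above follows a `SEQID:START..END` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the span of the gene on LG06 can be computed as follows. The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("LG06:29881278..29883352")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, span)
```

Under that assumption the locus spans 2075 bp, which is longer than the 1011-nt CDS listed in the Sequences section.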


Homology
GenBank top hits (e value, % identity, alignment)
KAA0025159.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 2.6e-111; % identity: 43.22)
Query:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAP---------------
        M+VKARGG+EYFISF+DDYSR+G++YL++HKSEALEKFKE+KTEVEN L K IK LRSD GGEYMDL+FQ+YMIEHGI   LS P               
Subjt:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAP---------------

Query:  ----------------------------------------------GYPKETRDGLFYDPKEDKVFVSTNATFLEEDHIRDHKQKSKIVLSELDGTIAKV
                                                      GYPKETR GLF+DP+E++VFVSTNATFLEEDH+R+HK +SK+VLSE      +V
Subjt:  ----------------------------------------------GYPKETRDGLFYDPKEDKVFVSTNATFLEEDHIRDHKQKSKIVLSELDGTIAKV

Query:  ANKTSTSTRVVDT---------------------------------------------------------------------------------NKPDGV
         ++   S+RV +T                                                                                 + P+GV
Subjt:  ANKTSTSTRVVDT---------------------------------------------------------------------------------NKPDGV

Query:  KPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIAACYDYE-----------------------------------
        KPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ E VDYEETFSPVAM+KSIRILL IA  YDYE                                   
Subjt:  KPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIAACYDYE-----------------------------------

Query:  ------------------------------------------------------------------------------------CPTTPQGVEDMRRIPY
                                                                                            CP TPQ VEDMRRIPY
Subjt:  ------------------------------------------------------------------------------------CPTTPQGVEDMRRIPY

Query:  ASVVGSPMYAMLCTRATTSLFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSRSIFTLNGGAVVW
        AS VGS MYAMLCTR     +A+G+VSRYQSNPGL+HWT VK ILKYLRRTR+YMLVYGAKDL LTGYTDSDFQTDKDSRKST  S+FTLNGGAVVW
Subjt:  ASVVGSPMYAMLCTRATTSLFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSRSIFTLNGGAVVW

KAA0045330.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa] (e value: 1.3e-115; % identity: 51.03)
Query:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAP---------------
        M+VKARGG+EYFI+F DDYSR+G++YL+QHKSEALEKFKE+K EVEN L KTIKT RSD GGEYMDLKFQNY++E GI   LSAP               
Subjt:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAP---------------

Query:  --------------------------------------------------GYPKETRDGLFYDPKEDKVFVSTNATFLEEDHIRDHKQKSKIVLSELDGT
                                                          GYPK TR G FYDPK++KVFVSTNATFLEEDHIR+HK             
Subjt:  --------------------------------------------------GYPKETRDGLFYDPKEDKVFVSTNATFLEEDHIRDHKQKSKIVLSELDGT

Query:  IAKVANKTSTSTRVVD-TNKPDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIAACYDYE-----------
                    R+ D  ++PDGVKPIG KWIYKRKRG DGKVQTFKARLVAKGYTQVEGVDYEETFSPVAM+KSI+ILL IAA +DYE           
Subjt:  IAKVANKTSTSTRVVD-TNKPDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIAACYDYE-----------

Query:  -----------------------------------------------------------------------CPTTPQGVEDMRRIPYASVVGSPMYAMLC
                                                                               CP TPQ V++MR IPYAS VGS MYAMLC
Subjt:  -----------------------------------------------------------------------CPTTPQGVEDMRRIPYASVVGSPMYAMLC

Query:  TRATTSLFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSRSIFTLNGGAVVW
         R     +A+G+VSRYQSNPGL HWT VKTILKYLRRTR+YMLVYG+K+L LTGYTD DFQTD DSRKSTS S+FTLNGGAVVW
Subjt:  TRATTSLFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSRSIFTLNGGAVVW

KAA0055498.1 gag-pol fusion protein [Cucumis melo var. makuwa] (e value: 5.1e-115; % identity: 46.49)
Query:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAP---------------
        M+VKARGG+EYFI+F DDYSR+G++YL+QHKSEALEKFKE+K EVEN L KTIK  RSD GGEYMDLKFQNY++E GI   LSAP               
Subjt:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAP---------------

Query:  -------------GYPKETRDGLFYDPKEDKVFVSTNATFLEEDHIRDHKQKSKIVLSELDGTI----AKVANKTSTSTRVVD-----------------
                     GYPK TR G FYDPK++K+FVS NATFLEEDHIR+HK +SKIVL+EL         +V    S  TRVV                  
Subjt:  -------------GYPKETRDGLFYDPKEDKVFVSTNATFLEEDHIRDHKQKSKIVLSELDGTI----AKVANKTSTSTRVVD-----------------

Query:  ----------------------------------------------------------------TNKPDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAK
                                                                         ++PDGVKPIGCKWIYKRKRG DGKVQTFKARLVAK
Subjt:  ----------------------------------------------------------------TNKPDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAK

Query:  GYTQVEGVDYEETFSPVAMVKSIRILLVIAACYDYE----------------------------------------------------------------
        GYTQVEGVDYEETFSPVAM+KSIRILL IAA +DYE                                                                
Subjt:  GYTQVEGVDYEETFSPVAMVKSIRILLVIAACYDYE----------------------------------------------------------------

Query:  ------------------------------------------CPTTPQGVEDMRRIPYASVVGSPMYAMLCTRATTSLFAIGMVSRYQSNPGLEHWTTVK
                                                  CP TPQ VE+MR IPYAS VGS MY MLCTR     +A+G+VSRYQSNP L HWT VK
Subjt:  ------------------------------------------CPTTPQGVEDMRRIPYASVVGSPMYAMLCTRATTSLFAIGMVSRYQSNPGLEHWTTVK

Query:  TILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSRSIFTLNGGAVVW
        TILKYLRRTR+YMLVYG KDL LTGYTDSDFQTD+DSRKST  S+FTLNGGAVVW
Subjt:  TILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSRSIFTLNGGAVVW

KAA0059546.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa] (e value: 2.4e-109; % identity: 41.76)
Query:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAP---------------
        M+VKARGG+EYFI+F DDYSR+G++YL+QHKSEALEKFKE+K EVEN L KTIKT RSD GGEYMDLKFQNY++E GI   LSAP               
Subjt:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAP---------------

Query:  -----------------------------------------------------------------------------------GYPKETRDGLFYDPKED
                                                                                           GYPK TR G  YDPK++
Subjt:  -----------------------------------------------------------------------------------GYPKETRDGLFYDPKED

Query:  KVFVSTNATFLEEDHIRDHKQKSKIVLSELDGTI----AKVANKTSTSTRVVD-----------------------------------------------
        KVFVSTNATFLEEDHIR+HK +SKIVL+EL         +V  + S  TRVV                                                
Subjt:  KVFVSTNATFLEEDHIRDHKQKSKIVLSELDGTI----AKVANKTSTSTRVVD-----------------------------------------------

Query:  ----------------------------------TNKPDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIA
                                           ++PDGVKPIGCKWIYKRKRG DGKVQTFKARLVAKGYTQVEGVDYEETFSPVAM+KSIRILL IA
Subjt:  ----------------------------------TNKPDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIA

Query:  ACYDYE----------------------------------------------------------------------------------------------
        A +DYE                                                                                              
Subjt:  ACYDYE----------------------------------------------------------------------------------------------

Query:  ------------CPTTPQGVEDMRRIPYASVVGSPMYAMLCTRATTSLFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSD
                    CP TPQ VE+MR IPYAS VGS +YAMLCTR     + +G+VSRYQSNPGL HWTTVK ILKYLRRTR+YMLVYG+KDL LTGYTDSD
Subjt:  ------------CPTTPQGVEDMRRIPYASVVGSPMYAMLCTRATTSLFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSD

Query:  FQTDKDSRKSTSRSIFTLNGGAVVW
        FQTD+DSRKSTS S+FTLNGGAVVW
Subjt:  FQTDKDSRKSTSRSIFTLNGGAVVW

KAA0059678.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 6.4e-110; % identity: 43.43)
Query:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAP---------------
        M+VKARGG+EYFI F+DDYSR+G++YL++HK EALEKFKE+K EVEN L K IK LRSD GGEYMDL+FQ+YMIEHGI   LS P               
Subjt:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAP---------------

Query:  -----------------------------------------------------------------------------------GYPKETRDGLFYDPKED
                                                                                           GYPKETRDGLF+DP+E+
Subjt:  -----------------------------------------------------------------------------------GYPKETRDGLFYDPKED

Query:  KVFVSTNATFLEEDHIRDHKQKSKIVLSELDGTIAKVANKTSTSTRV--------------------------VDTNK----------------------
        +VFVSTNATFLEEDH+RDHK +SK+VL+E      +V ++  +   +                          VD N+                      
Subjt:  KVFVSTNATFLEEDHIRDHKQKSKIVLSELDGTIAKVANKTSTSTRV--------------------------VDTNK----------------------

Query:  -PDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIAACYDYE------------------------------
         P+GVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EG DYEETFSPVA++KSIRI L IA  YDYE                              
Subjt:  -PDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIAACYDYE------------------------------

Query:  -----------------------------------------------------------------CPTTPQGVEDMRRIPYASVVGSPMYAMLCTRATTS
                                                                         CP TPQ VEDMR IPY S VGS MY MLCTR    
Subjt:  -----------------------------------------------------------------CPTTPQGVEDMRRIPYASVVGSPMYAMLCTRATTS

Query:  LFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSRSIFTLNGGAVVW
         +A+G+VSRYQSNPGL+HWT VK ILKYLRRTR+YMLVYGAKDL LTGYTDSDFQTDKDSRKSTS S+FTLNGGAVVW
Subjt:  LFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSRSIFTLNGGAVVW

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SIN2 Gag/pol protein (e value: 1.3e-111; % identity: 43.22)
Query:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAP---------------
        M+VKARGG+EYFISF+DDYSR+G++YL++HKSEALEKFKE+KTEVEN L K IK LRSD GGEYMDL+FQ+YMIEHGI   LS P               
Subjt:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAP---------------

Query:  ----------------------------------------------GYPKETRDGLFYDPKEDKVFVSTNATFLEEDHIRDHKQKSKIVLSELDGTIAKV
                                                      GYPKETR GLF+DP+E++VFVSTNATFLEEDH+R+HK +SK+VLSE      +V
Subjt:  ----------------------------------------------GYPKETRDGLFYDPKEDKVFVSTNATFLEEDHIRDHKQKSKIVLSELDGTIAKV

Query:  ANKTSTSTRVVDT---------------------------------------------------------------------------------NKPDGV
         ++   S+RV +T                                                                                 + P+GV
Subjt:  ANKTSTSTRVVDT---------------------------------------------------------------------------------NKPDGV

Query:  KPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIAACYDYE-----------------------------------
        KPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ E VDYEETFSPVAM+KSIRILL IA  YDYE                                   
Subjt:  KPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIAACYDYE-----------------------------------

Query:  ------------------------------------------------------------------------------------CPTTPQGVEDMRRIPY
                                                                                            CP TPQ VEDMRRIPY
Subjt:  ------------------------------------------------------------------------------------CPTTPQGVEDMRRIPY

Query:  ASVVGSPMYAMLCTRATTSLFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSRSIFTLNGGAVVW
        AS VGS MYAMLCTR     +A+G+VSRYQSNPGL+HWT VK ILKYLRRTR+YMLVYGAKDL LTGYTDSDFQTDKDSRKST  S+FTLNGGAVVW
Subjt:  ASVVGSPMYAMLCTRATTSLFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSRSIFTLNGGAVVW

A0A5A7TQ86 Retrotransposon protein, putative, Ty1-copia subclass (e value: 6.5e-116; % identity: 51.03)
Query:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAP---------------
        M+VKARGG+EYFI+F DDYSR+G++YL+QHKSEALEKFKE+K EVEN L KTIKT RSD GGEYMDLKFQNY++E GI   LSAP               
Subjt:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAP---------------

Query:  --------------------------------------------------GYPKETRDGLFYDPKEDKVFVSTNATFLEEDHIRDHKQKSKIVLSELDGT
                                                          GYPK TR G FYDPK++KVFVSTNATFLEEDHIR+HK             
Subjt:  --------------------------------------------------GYPKETRDGLFYDPKEDKVFVSTNATFLEEDHIRDHKQKSKIVLSELDGT

Query:  IAKVANKTSTSTRVVD-TNKPDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIAACYDYE-----------
                    R+ D  ++PDGVKPIG KWIYKRKRG DGKVQTFKARLVAKGYTQVEGVDYEETFSPVAM+KSI+ILL IAA +DYE           
Subjt:  IAKVANKTSTSTRVVD-TNKPDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIAACYDYE-----------

Query:  -----------------------------------------------------------------------CPTTPQGVEDMRRIPYASVVGSPMYAMLC
                                                                               CP TPQ V++MR IPYAS VGS MYAMLC
Subjt:  -----------------------------------------------------------------------CPTTPQGVEDMRRIPYASVVGSPMYAMLC

Query:  TRATTSLFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSRSIFTLNGGAVVW
         R     +A+G+VSRYQSNPGL HWT VKTILKYLRRTR+YMLVYG+K+L LTGYTD DFQTD DSRKSTS S+FTLNGGAVVW
Subjt:  TRATTSLFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSRSIFTLNGGAVVW

A0A5A7UKD1 Gag-pol fusion protein (e value: 2.5e-115; % identity: 46.49)
Query:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAP---------------
        M+VKARGG+EYFI+F DDYSR+G++YL+QHKSEALEKFKE+K EVEN L KTIK  RSD GGEYMDLKFQNY++E GI   LSAP               
Subjt:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAP---------------

Query:  -------------GYPKETRDGLFYDPKEDKVFVSTNATFLEEDHIRDHKQKSKIVLSELDGTI----AKVANKTSTSTRVVD-----------------
                     GYPK TR G FYDPK++K+FVS NATFLEEDHIR+HK +SKIVL+EL         +V    S  TRVV                  
Subjt:  -------------GYPKETRDGLFYDPKEDKVFVSTNATFLEEDHIRDHKQKSKIVLSELDGTI----AKVANKTSTSTRVVD-----------------

Query:  ----------------------------------------------------------------TNKPDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAK
                                                                         ++PDGVKPIGCKWIYKRKRG DGKVQTFKARLVAK
Subjt:  ----------------------------------------------------------------TNKPDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAK

Query:  GYTQVEGVDYEETFSPVAMVKSIRILLVIAACYDYE----------------------------------------------------------------
        GYTQVEGVDYEETFSPVAM+KSIRILL IAA +DYE                                                                
Subjt:  GYTQVEGVDYEETFSPVAMVKSIRILLVIAACYDYE----------------------------------------------------------------

Query:  ------------------------------------------CPTTPQGVEDMRRIPYASVVGSPMYAMLCTRATTSLFAIGMVSRYQSNPGLEHWTTVK
                                                  CP TPQ VE+MR IPYAS VGS MY MLCTR     +A+G+VSRYQSNP L HWT VK
Subjt:  ------------------------------------------CPTTPQGVEDMRRIPYASVVGSPMYAMLCTRATTSLFAIGMVSRYQSNPGLEHWTTVK

Query:  TILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSRSIFTLNGGAVVW
        TILKYLRRTR+YMLVYG KDL LTGYTDSDFQTD+DSRKST  S+FTLNGGAVVW
Subjt:  TILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSRSIFTLNGGAVVW

A0A5A7UWW4 Gag/pol protein (e value: 3.1e-110; % identity: 43.43)
Query:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAP---------------
        M+VKARGG+EYFI F+DDYSR+G++YL++HK EALEKFKE+K EVEN L K IK LRSD GGEYMDL+FQ+YMIEHGI   LS P               
Subjt:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAP---------------

Query:  -----------------------------------------------------------------------------------GYPKETRDGLFYDPKED
                                                                                           GYPKETRDGLF+DP+E+
Subjt:  -----------------------------------------------------------------------------------GYPKETRDGLFYDPKED

Query:  KVFVSTNATFLEEDHIRDHKQKSKIVLSELDGTIAKVANKTSTSTRV--------------------------VDTNK----------------------
        +VFVSTNATFLEEDH+RDHK +SK+VL+E      +V ++  +   +                          VD N+                      
Subjt:  KVFVSTNATFLEEDHIRDHKQKSKIVLSELDGTIAKVANKTSTSTRV--------------------------VDTNK----------------------

Query:  -PDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIAACYDYE------------------------------
         P+GVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EG DYEETFSPVA++KSIRI L IA  YDYE                              
Subjt:  -PDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIAACYDYE------------------------------

Query:  -----------------------------------------------------------------CPTTPQGVEDMRRIPYASVVGSPMYAMLCTRATTS
                                                                         CP TPQ VEDMR IPY S VGS MY MLCTR    
Subjt:  -----------------------------------------------------------------CPTTPQGVEDMRRIPYASVVGSPMYAMLCTRATTS

Query:  LFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSRSIFTLNGGAVVW
         +A+G+VSRYQSNPGL+HWT VK ILKYLRRTR+YMLVYGAKDL LTGYTDSDFQTDKDSRKSTS S+FTLNGGAVVW
Subjt:  LFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSRSIFTLNGGAVVW

A0A5A7UZE3 Retrotransposon protein, putative, Ty1-copia subclass (e value: 1.2e-109; % identity: 41.76)
Query:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAP---------------
        M+VKARGG+EYFI+F DDYSR+G++YL+QHKSEALEKFKE+K EVEN L KTIKT RSD GGEYMDLKFQNY++E GI   LSAP               
Subjt:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAP---------------

Query:  -----------------------------------------------------------------------------------GYPKETRDGLFYDPKED
                                                                                           GYPK TR G  YDPK++
Subjt:  -----------------------------------------------------------------------------------GYPKETRDGLFYDPKED

Query:  KVFVSTNATFLEEDHIRDHKQKSKIVLSELDGTI----AKVANKTSTSTRVVD-----------------------------------------------
        KVFVSTNATFLEEDHIR+HK +SKIVL+EL         +V  + S  TRVV                                                
Subjt:  KVFVSTNATFLEEDHIRDHKQKSKIVLSELDGTI----AKVANKTSTSTRVVD-----------------------------------------------

Query:  ----------------------------------TNKPDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIA
                                           ++PDGVKPIGCKWIYKRKRG DGKVQTFKARLVAKGYTQVEGVDYEETFSPVAM+KSIRILL IA
Subjt:  ----------------------------------TNKPDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIA

Query:  ACYDYE----------------------------------------------------------------------------------------------
        A +DYE                                                                                              
Subjt:  ACYDYE----------------------------------------------------------------------------------------------

Query:  ------------CPTTPQGVEDMRRIPYASVVGSPMYAMLCTRATTSLFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSD
                    CP TPQ VE+MR IPYAS VGS +YAMLCTR     + +G+VSRYQSNPGL HWTTVK ILKYLRRTR+YMLVYG+KDL LTGYTDSD
Subjt:  ------------CPTTPQGVEDMRRIPYASVVGSPMYAMLCTRATTSLFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSD

Query:  FQTDKDSRKSTSRSIFTLNGGAVVW
        FQTD+DSRKSTS S+FTLNGGAVVW
Subjt:  FQTDKDSRKSTSRSIFTLNGGAVVW

SwissProt top hits (e value, % identity, alignment)
P0CV72 Secreted RxLR effector protein 161 (e value: 7.3e-16; % identity: 42.31)
Query:  MRRIPYASVVGSPMYAMLCTRATTSLFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVY-GAKDLTLTGYTDSDFQTDKDSRKSTSRSIFTLNGG
        M+ +PY S VG+ MY M+ TR   +  A+G++S++ S+P   HW  +K +L+YL+ T+ Y L +  A    L GY+D+D+  D +SR+STS  +F LNGG
Subjt:  MRRIPYASVVGSPMYAMLCTRATTSLFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVY-GAKDLTLTGYTDSDFQTDKDSRKSTSRSIFTLNGG

Query:  AVVW
         V W
Subjt:  AVVW

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 2.3e-25; % identity: 50.44)
Query:  CPTTPQGVEDMRRIPYASVVGSPMYAMLCTRATTSLFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTS
        CPTT +   +M ++PY+S VGS MYAM+CTR   +  A+G+VSR+  NPG EHW  VK IL+YLR T    L +G  D  L GYTD+D   D D+RKS++
Subjt:  CPTTPQGVEDMRRIPYASVVGSPMYAMLCTRATTSLFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTS

Query:  RSIFTLNGGAVVW
          +FT +GGA+ W
Subjt:  RSIFTLNGGAVVW

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 9.0e-14; % identity: 38.2)
Query:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAPGYPK
        M +++ GG +YF++F+DD SR   +Y+++ K +  + F++F   VE + G+ +K LRSD GGEY   +F+ Y   HGI    + PG P+
Subjt:  MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAPGYPK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 9.9e-13; % identity: 33.07)
Query:  EDKVFVSTNATFLEEDHIRDHKQKSKIVLSELDGTIAKVANKTSTSTRVVDTNK----PDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVD
        E + + ST    + +D  R+ +   +++       + K   +   S +   T K    P G +P+ CKW++K K+  D K+  +KARLV KG+ Q +G+D
Subjt:  EDKVFVSTNATFLEEDHIRDHKQKSKIVLSELDGTIAKVANKTSTSTRVVDTNK----PDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVD

Query:  YEETFSPVAMVKSIRILLVIAACYDYE
        ++E FSPV  + SIR +L +AA  D E
Subjt:  YEETFSPVAMVKSIRILLVIAACYDYE

P92520 Uncharacterized mitochondrial protein AtMg00820 (e value: 5.1e-09; % identity: 49.12)
Query:  IGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIA
        +GCKW++K K   DG +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L +A
Subjt:  IGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 6.4e-12; % identity: 53.97)
Query:  PDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIA
        P  V  +GC+WI+ +K   DG +  +KARLVAKGY Q  G+DY ETFSPV    SIRI+L +A
Subjt:  PDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 2.5e-08; % identity: 32.76)
Query:  VKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAPGYPKETRDGLFYDPKED
        + +   Y Y++ FVD ++R+  +Y ++ KS+  E F  FK  +EN+    I T  SD GGE++ L    Y  +HGI+ HL++P +  E  +GL    ++ 
Subjt:  VKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAPGYPKETRDGLFYDPKED

Query:  KVFVSTNATFLEEDHI
        +  V T  T L    I
Subjt:  KVFVSTNATFLEEDHI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 6.4e-12; % identity: 53.97)
Query:  PDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIA
        P  V  +GC+WI+ +K   DG +  +KARLVAKGY Q  G+DY ETFSPV    SIRI+L +A
Subjt:  PDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 2.5e-08; % identity: 38.27)
Query:  YEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAPGYPK
        Y Y++ FVD ++R+  +Y ++ KS+  + F  FK+ VEN+    I TL SD GGE++ L+  +Y+ +HGI+   S P  P+
Subjt:  YEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAPGYPK

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 3.6e-18; % identity: 55.88)
Query:  PDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIAACYDY
        P   KPIGCKW+YK K   DG ++ +KARLVAKGYTQ EG+D+ ETFSPV  + S++++L I+A Y++
Subjt:  PDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIAACYDY

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 3.6e-10; % identity: 49.12)
Query:  IGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIA
        +GCKW++K K   DG +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L +A
Subjt:  IGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIA


Sequences
CDS sequence
ATGAGTGTTAAAGCCAGAGGAGGTTATGAATATTTCATCTCCTTTGTAGATGATTATTCCAGATTTGGGCATATTTACCTAATACAACATAAGTCTGAAGCACTTGAAAA
GTTCAAGGAATTCAAGACTGAGGTTGAAAATCAATTGGGTAAAACAATTAAAACGCTTCGATCAGATCTAGGTGGAGAGTACATGGATTTAAAATTCCAGAACTATATGA
TAGAACATGGAATTACATTCCACCTCTCGGCTCCCGGCTACCCAAAAGAAACAAGAGATGGTTTATTCTATGATCCTAAGGAAGATAAGGTTTTTGTGTCGACAAATGCC
ACTTTCTTAGAGGAGGACCACATAAGGGACCACAAACAAAAAAGTAAAATTGTGTTGAGTGAGTTAGACGGAACAATAGCAAAGGTTGCTAATAAAACTAGTACGTCAAC
AAGAGTTGTTGATACTAATAAGCCTGATGGGGTAAAACCTATAGGTTGTAAGTGGATCTACAAGAGAAAGCGTGGTGTAGATGGGAAGGTGCAAACCTTTAAAGCTAGAC
TAGTAGCAAAGGGTTATACCCAGGTTGAAGGGGTTGACTATGAGGAAACCTTTTCACCTGTTGCTATGGTAAAGTCTATCCGTATCCTACTTGTCATTGCCGCATGTTAT
GACTATGAGTGTCCTACGACACCTCAAGGAGTTGAGGATATGAGACGGATTCCTTATGCATCAGTTGTTGGGAGCCCGATGTACGCCATGTTGTGTACTAGGGCGACGAC
ATCTCTTTTTGCGATTGGGATGGTCAGTAGGTATCAATCCAATCCAGGACTTGAACACTGGACAACGGTTAAAACAATCCTTAAGTATCTACGAAGAACAAGGAACTACA
TGCTTGTGTATGGGGCTAAGGATTTGACCCTTACAGGATACACAGATTCTGACTTTCAGACTGATAAAGATTCTCGAAAATCTACATCAAGATCAATATTTACTCTTAAC
GGAGGAGCTGTAGTATGGTGA
mRNA sequence
ATGAGTGTTAAAGCCAGAGGAGGTTATGAATATTTCATCTCCTTTGTAGATGATTATTCCAGATTTGGGCATATTTACCTAATACAACATAAGTCTGAAGCACTTGAAAA
GTTCAAGGAATTCAAGACTGAGGTTGAAAATCAATTGGGTAAAACAATTAAAACGCTTCGATCAGATCTAGGTGGAGAGTACATGGATTTAAAATTCCAGAACTATATGA
TAGAACATGGAATTACATTCCACCTCTCGGCTCCCGGCTACCCAAAAGAAACAAGAGATGGTTTATTCTATGATCCTAAGGAAGATAAGGTTTTTGTGTCGACAAATGCC
ACTTTCTTAGAGGAGGACCACATAAGGGACCACAAACAAAAAAGTAAAATTGTGTTGAGTGAGTTAGACGGAACAATAGCAAAGGTTGCTAATAAAACTAGTACGTCAAC
AAGAGTTGTTGATACTAATAAGCCTGATGGGGTAAAACCTATAGGTTGTAAGTGGATCTACAAGAGAAAGCGTGGTGTAGATGGGAAGGTGCAAACCTTTAAAGCTAGAC
TAGTAGCAAAGGGTTATACCCAGGTTGAAGGGGTTGACTATGAGGAAACCTTTTCACCTGTTGCTATGGTAAAGTCTATCCGTATCCTACTTGTCATTGCCGCATGTTAT
GACTATGAGTGTCCTACGACACCTCAAGGAGTTGAGGATATGAGACGGATTCCTTATGCATCAGTTGTTGGGAGCCCGATGTACGCCATGTTGTGTACTAGGGCGACGAC
ATCTCTTTTTGCGATTGGGATGGTCAGTAGGTATCAATCCAATCCAGGACTTGAACACTGGACAACGGTTAAAACAATCCTTAAGTATCTACGAAGAACAAGGAACTACA
TGCTTGTGTATGGGGCTAAGGATTTGACCCTTACAGGATACACAGATTCTGACTTTCAGACTGATAAAGATTCTCGAAAATCTACATCAAGATCAATATTTACTCTTAAC
GGAGGAGCTGTAGTATGGTGA
Protein sequence
MSVKARGGYEYFISFVDDYSRFGHIYLIQHKSEALEKFKEFKTEVENQLGKTIKTLRSDLGGEYMDLKFQNYMIEHGITFHLSAPGYPKETRDGLFYDPKEDKVFVSTNA
TFLEEDHIRDHKQKSKIVLSELDGTIAKVANKTSTSTRVVDTNKPDGVKPIGCKWIYKRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLVIAACY
DYECPTTPQGVEDMRRIPYASVVGSPMYAMLCTRATTSLFAIGMVSRYQSNPGLEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSRSIFTLN
GGAVVW
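
The protein sequence above should correspond to a translation of the CDS. The sketch below shows one way to check this, assuming the standard genetic code (NCBI translation table 1); the record does not say which table was used. The names `CODON_TABLE` and `translate` are illustrative only. It is applied here to the first 15 and the last 21 nucleotides of the CDS rather than the full sequence.

```python
# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for pos in range(0, usable, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 15 nt and last 21 nt of the CDS listed in this record.
print(translate("ATGAGTGTTAAAGCC"))
print(translate("GGAGGAGCTGTAGTATGGTGA"))
```

The two fragments translate to the first five residues (MSVKA) and the last six residues (GGAVVW) of the listed protein, with the terminal TGA read as a stop codon.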