; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000743 (gene) of Snake gourd v1 genome

Gene IDTan0000743
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG08:62260735..62262723
RNA-Seq ExpressionTan0000743
SyntenyTan0000743
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]2.2e-17648.46Show/hide
Query:  MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTR-----KKRKGGKG
        MN AVIDE SQVSFILESLP+SFLQF SNAVMNKI Y LTTLLNELQ+FESL+K KGQ  GEAN+   +++F + S+SGTK    +      KK+KGG+G
Subjt:  MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTR-----KKRKGGKG

Query:  ------------KASATADI--------------------ELKEKKGKLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVG-
                    KA A   I                    + K K+GK DLLVLETCLVEN+D  WI+DSGATNHVCSSFQ  SS+++LE GEMT+RVG 
Subjt:  ------------KASATADI--------------------ELKEKKGKLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVG-

Query:  -----------------------------------------------------------------------------RETSSQ-----------------
                                                                                     R  +S+                 
Subjt:  -----------------------------------------------------------------------------RETSSQ-----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYA
            L+Q +   LEKFKEYKA+VENAL KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLSAP TPQ++GVSERRNRTLLDMVRSMMSYA LP SFWGYA
Subjt:  ----LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYA

Query:  IKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPR
        ++TAV ILN VPSKSVSETP +LW GRK SL+HFRIWGC AHVL   PKKLEPRS+LC FVGYPK TRGG FYDP++NKV VSTNATFLEEDH+R HKPR
Subjt:  IKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPR

Query:  SKLVLN----EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLI------------------------SSNEWRDKDQWIKAMDL
        SK+VLN    E T   TRVV +    +RV     +S+R+   QSL   RR     +  I                         + E  DKD+WIKAM+L
Subjt:  SKLVLN----EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLI------------------------SSNEWRDKDQWIKAMDL

Query:  EMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNL
        E+ESM FNSVW+LVDQ +GV+PIGCKWIYKRKR A GKVQTFKARLVAKG+TQ EGVDYEE F PVAMLKSIRILLSIAA++DYEIWQMDVKTAFLNGNL
Subjt:  EMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNL

Query:  DESIYMSQPEG
        +E+IYM QPEG
Subjt:  DESIYMSQPEG

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]2.2e-17648.46Show/hide
Query:  MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTR-----KKRKGGKG
        MN AVIDE SQVSFILESLP+SFLQF SNAVMNKI Y LTTLLNELQ+FESL+K KGQ  GEAN+   +++F + S+SGTK    +      KK+KGG+G
Subjt:  MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTR-----KKRKGGKG

Query:  ------------KASATADI--------------------ELKEKKGKLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVG-
                    KA A   I                    + K K+GK DLLVLETCLVEN+D  WI+DSGATNHVCSSFQ  SS+++LE GEMT+RVG 
Subjt:  ------------KASATADI--------------------ELKEKKGKLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVG-

Query:  -----------------------------------------------------------------------------RETSSQ-----------------
                                                                                     R  +S+                 
Subjt:  -----------------------------------------------------------------------------RETSSQ-----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYA
            L+Q +   LEKFKEYKA+VENAL KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLSAP TPQ++GVSERRNRTLLDMVRSMMSYA LP SFWGYA
Subjt:  ----LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYA

Query:  IKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPR
        ++TAV ILN VPSKSVSETP +LW GRK SL+HFRIWGC AHVL   PKKLEPRS+LC FVGYPK TRGG FYDP++NKV VSTNATFLEEDH+R HKPR
Subjt:  IKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPR

Query:  SKLVLN----EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLI------------------------SSNEWRDKDQWIKAMDL
        SK+VLN    E T   TRVV +    +RV     +S+R+   QSL   RR     +  I                         + E  DKD+WIKAM+L
Subjt:  SKLVLN----EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLI------------------------SSNEWRDKDQWIKAMDL

Query:  EMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNL
        E+ESM FNSVW+LVDQ +GV+PIGCKWIYKRKR A GKVQTFKARLVAKG+TQ EGVDYEE F PVAMLKSIRILLSIAA++DYEIWQMDVKTAFLNGNL
Subjt:  EMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNL

Query:  DESIYMSQPEG
        +E+IYM QPEG
Subjt:  DESIYMSQPEG

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]2.2e-17648.46Show/hide
Query:  MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTR-----KKRKGGKG
        MN AVIDE SQVSFILESLP+SFLQF SNAVMNKI Y LTTLLNELQ+FESL+K KGQ  GEAN+   +++F + S+SGTK    +      KK+KGG+G
Subjt:  MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTR-----KKRKGGKG

Query:  ------------KASATADI--------------------ELKEKKGKLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVG-
                    KA A   I                    + K K+GK DLLVLETCLVEN+D  WI+DSGATNHVCSSFQ  SS+++LE GEMT+RVG 
Subjt:  ------------KASATADI--------------------ELKEKKGKLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVG-

Query:  -----------------------------------------------------------------------------RETSSQ-----------------
                                                                                     R  +S+                 
Subjt:  -----------------------------------------------------------------------------RETSSQ-----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYA
            L+Q +   LEKFKEYKA+VENAL KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLSAP TPQ++GVSERRNRTLLDMVRSMMSYA LP SFWGYA
Subjt:  ----LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYA

Query:  IKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPR
        ++TAV ILN VPSKSVSETP +LW GRK SL+HFRIWGC AHVL   PKKLEPRS+LC FVGYPK TRGG FYDP++NKV VSTNATFLEEDH+R HKPR
Subjt:  IKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPR

Query:  SKLVLN----EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLI------------------------SSNEWRDKDQWIKAMDL
        SK+VLN    E T   TRVV +    +RV     +S+R+   QSL   RR     +  I                         + E  DKD+WIKAM+L
Subjt:  SKLVLN----EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLI------------------------SSNEWRDKDQWIKAMDL

Query:  EMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNL
        E+ESM FNSVW+LVDQ +GV+PIGCKWIYKRKR A GKVQTFKARLVAKG+TQ EGVDYEE F PVAMLKSIRILLSIAA++DYEIWQMDVKTAFLNGNL
Subjt:  EMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNL

Query:  DESIYMSQPEG
        +E+IYM QPEG
Subjt:  DESIYMSQPEG

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]2.2e-17648.46Show/hide
Query:  MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTR-----KKRKGGKG
        MN AVIDE SQVSFILESLP+SFLQF SNAVMNKI Y LTTLLNELQ+FESL+K KGQ  GEAN+   +++F + S+SGTK    +      KK+KGG+G
Subjt:  MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTR-----KKRKGGKG

Query:  ------------KASATADI--------------------ELKEKKGKLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVG-
                    KA A   I                    + K K+GK DLLVLETCLVEN+D  WI+DSGATNHVCSSFQ  SS+++LE GEMT+RVG 
Subjt:  ------------KASATADI--------------------ELKEKKGKLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVG-

Query:  -----------------------------------------------------------------------------RETSSQ-----------------
                                                                                     R  +S+                 
Subjt:  -----------------------------------------------------------------------------RETSSQ-----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYA
            L+Q +   LEKFKEYKA+VENAL KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLSAP TPQ++GVSERRNRTLLDMVRSMMSYA LP SFWGYA
Subjt:  ----LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYA

Query:  IKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPR
        ++TAV ILN VPSKSVSETP +LW GRK SL+HFRIWGC AHVL   PKKLEPRS+LC FVGYPK TRGG FYDP++NKV VSTNATFLEEDH+R HKPR
Subjt:  IKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPR

Query:  SKLVLN----EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLI------------------------SSNEWRDKDQWIKAMDL
        SK+VLN    E T   TRVV +    +RV     +S+R+   QSL   RR     +  I                         + E  DKD+WIKAM+L
Subjt:  SKLVLN----EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLI------------------------SSNEWRDKDQWIKAMDL

Query:  EMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNL
        E+ESM FNSVW+LVDQ +GV+PIGCKWIYKRKR A GKVQTFKARLVAKG+TQ EGVDYEE F PVAMLKSIRILLSIAA++DYEIWQMDVKTAFLNGNL
Subjt:  EMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNL

Query:  DESIYMSQPEG
        +E+IYM QPEG
Subjt:  DESIYMSQPEG

KAA0062993.1 gag/pol protein [Cucumis melo var. makuwa]2.2e-17648.46Show/hide
Query:  MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTR-----KKRKGGKG
        MN AVIDE SQVSFILESLP+SFLQF SNAVMNKI Y LTTLLNELQ+FESL+K KGQ  GEAN+   +++F + S+SGTK    +      KK+KGG+G
Subjt:  MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTR-----KKRKGGKG

Query:  ------------KASATADI--------------------ELKEKKGKLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVG-
                    KA A   I                    + K K+GK DLLVLETCLVEN+D  WI+DSGATNHVCSSFQ  SS+++LE GEMT+RVG 
Subjt:  ------------KASATADI--------------------ELKEKKGKLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVG-

Query:  -----------------------------------------------------------------------------RETSSQ-----------------
                                                                                     R  +S+                 
Subjt:  -----------------------------------------------------------------------------RETSSQ-----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYA
            L+Q +   LEKFKEYKA+VENAL KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLSAP TPQ++GVSERRNRTLLDMVRSMMSYA LP SFWGYA
Subjt:  ----LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYA

Query:  IKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPR
        ++TAV ILN VPSKSVSETP +LW GRK SL+HFRIWGC AHVL   PKKLEPRS+LC FVGYPK TRGG FYDP++NKV VSTNATFLEEDH+R HKPR
Subjt:  IKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPR

Query:  SKLVLN----EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLI------------------------SSNEWRDKDQWIKAMDL
        SK+VLN    E T   TRVV +    +RV     +S+R+   QSL   RR     +  I                         + E  DKD+WIKAM+L
Subjt:  SKLVLN----EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLI------------------------SSNEWRDKDQWIKAMDL

Query:  EMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNL
        E+ESM FNSVW+LVDQ +GV+PIGCKWIYKRKR A GKVQTFKARLVAKG+TQ EGVDYEE F PVAMLKSIRILLSIAA++DYEIWQMDVKTAFLNGNL
Subjt:  EMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNL

Query:  DESIYMSQPEG
        +E+IYM QPEG
Subjt:  DESIYMSQPEG

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.1e-17648.46Show/hide
Query:  MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTR-----KKRKGGKG
        MN AVIDE SQVSFILESLP+SFLQF SNAVMNKI Y LTTLLNELQ+FESL+K KGQ  GEAN+   +++F + S+SGTK    +      KK+KGG+G
Subjt:  MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTR-----KKRKGGKG

Query:  ------------KASATADI--------------------ELKEKKGKLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVG-
                    KA A   I                    + K K+GK DLLVLETCLVEN+D  WI+DSGATNHVCSSFQ  SS+++LE GEMT+RVG 
Subjt:  ------------KASATADI--------------------ELKEKKGKLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVG-

Query:  -----------------------------------------------------------------------------RETSSQ-----------------
                                                                                     R  +S+                 
Subjt:  -----------------------------------------------------------------------------RETSSQ-----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYA
            L+Q +   LEKFKEYKA+VENAL KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLSAP TPQ++GVSERRNRTLLDMVRSMMSYA LP SFWGYA
Subjt:  ----LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYA

Query:  IKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPR
        ++TAV ILN VPSKSVSETP +LW GRK SL+HFRIWGC AHVL   PKKLEPRS+LC FVGYPK TRGG FYDP++NKV VSTNATFLEEDH+R HKPR
Subjt:  IKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPR

Query:  SKLVLN----EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLI------------------------SSNEWRDKDQWIKAMDL
        SK+VLN    E T   TRVV +    +RV     +S+R+   QSL   RR     +  I                         + E  DKD+WIKAM+L
Subjt:  SKLVLN----EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLI------------------------SSNEWRDKDQWIKAMDL

Query:  EMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNL
        E+ESM FNSVW+LVDQ +GV+PIGCKWIYKRKR A GKVQTFKARLVAKG+TQ EGVDYEE F PVAMLKSIRILLSIAA++DYEIWQMDVKTAFLNGNL
Subjt:  EMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNL

Query:  DESIYMSQPEG
        +E+IYM QPEG
Subjt:  DESIYMSQPEG

A0A5A7TWB9 Gag/pol protein1.1e-17648.46Show/hide
Query:  MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTR-----KKRKGGKG
        MN AVIDE SQVSFILESLP+SFLQF SNAVMNKI Y LTTLLNELQ+FESL+K KGQ  GEAN+   +++F + S+SGTK    +      KK+KGG+G
Subjt:  MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTR-----KKRKGGKG

Query:  ------------KASATADI--------------------ELKEKKGKLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVG-
                    KA A   I                    + K K+GK DLLVLETCLVEN+D  WI+DSGATNHVCSSFQ  SS+++LE GEMT+RVG 
Subjt:  ------------KASATADI--------------------ELKEKKGKLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVG-

Query:  -----------------------------------------------------------------------------RETSSQ-----------------
                                                                                     R  +S+                 
Subjt:  -----------------------------------------------------------------------------RETSSQ-----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYA
            L+Q +   LEKFKEYKA+VENAL KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLSAP TPQ++GVSERRNRTLLDMVRSMMSYA LP SFWGYA
Subjt:  ----LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYA

Query:  IKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPR
        ++TAV ILN VPSKSVSETP +LW GRK SL+HFRIWGC AHVL   PKKLEPRS+LC FVGYPK TRGG FYDP++NKV VSTNATFLEEDH+R HKPR
Subjt:  IKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPR

Query:  SKLVLN----EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLI------------------------SSNEWRDKDQWIKAMDL
        SK+VLN    E T   TRVV +    +RV     +S+R+   QSL   RR     +  I                         + E  DKD+WIKAM+L
Subjt:  SKLVLN----EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLI------------------------SSNEWRDKDQWIKAMDL

Query:  EMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNL
        E+ESM FNSVW+LVDQ +GV+PIGCKWIYKRKR A GKVQTFKARLVAKG+TQ EGVDYEE F PVAMLKSIRILLSIAA++DYEIWQMDVKTAFLNGNL
Subjt:  EMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNL

Query:  DESIYMSQPEG
        +E+IYM QPEG
Subjt:  DESIYMSQPEG

A0A5A7TZD7 Gag/pol protein1.1e-17648.46Show/hide
Query:  MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTR-----KKRKGGKG
        MN AVIDE SQVSFILESLP+SFLQF SNAVMNKI Y LTTLLNELQ+FESL+K KGQ  GEAN+   +++F + S+SGTK    +      KK+KGG+G
Subjt:  MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTR-----KKRKGGKG

Query:  ------------KASATADI--------------------ELKEKKGKLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVG-
                    KA A   I                    + K K+GK DLLVLETCLVEN+D  WI+DSGATNHVCSSFQ  SS+++LE GEMT+RVG 
Subjt:  ------------KASATADI--------------------ELKEKKGKLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVG-

Query:  -----------------------------------------------------------------------------RETSSQ-----------------
                                                                                     R  +S+                 
Subjt:  -----------------------------------------------------------------------------RETSSQ-----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYA
            L+Q +   LEKFKEYKA+VENAL KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLSAP TPQ++GVSERRNRTLLDMVRSMMSYA LP SFWGYA
Subjt:  ----LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYA

Query:  IKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPR
        ++TAV ILN VPSKSVSETP +LW GRK SL+HFRIWGC AHVL   PKKLEPRS+LC FVGYPK TRGG FYDP++NKV VSTNATFLEEDH+R HKPR
Subjt:  IKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPR

Query:  SKLVLN----EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLI------------------------SSNEWRDKDQWIKAMDL
        SK+VLN    E T   TRVV +    +RV     +S+R+   QSL   RR     +  I                         + E  DKD+WIKAM+L
Subjt:  SKLVLN----EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLI------------------------SSNEWRDKDQWIKAMDL

Query:  EMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNL
        E+ESM FNSVW+LVDQ +GV+PIGCKWIYKRKR A GKVQTFKARLVAKG+TQ EGVDYEE F PVAMLKSIRILLSIAA++DYEIWQMDVKTAFLNGNL
Subjt:  EMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNL

Query:  DESIYMSQPEG
        +E+IYM QPEG
Subjt:  DESIYMSQPEG

A0A5A7UGV2 Gag/pol protein1.1e-17648.46Show/hide
Query:  MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTR-----KKRKGGKG
        MN AVIDE SQVSFILESLP+SFLQF SNAVMNKI Y LTTLLNELQ+FESL+K KGQ  GEAN+   +++F + S+SGTK    +      KK+KGG+G
Subjt:  MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTR-----KKRKGGKG

Query:  ------------KASATADI--------------------ELKEKKGKLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVG-
                    KA A   I                    + K K+GK DLLVLETCLVEN+D  WI+DSGATNHVCSSFQ  SS+++LE GEMT+RVG 
Subjt:  ------------KASATADI--------------------ELKEKKGKLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVG-

Query:  -----------------------------------------------------------------------------RETSSQ-----------------
                                                                                     R  +S+                 
Subjt:  -----------------------------------------------------------------------------RETSSQ-----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYA
            L+Q +   LEKFKEYKA+VENAL KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLSAP TPQ++GVSERRNRTLLDMVRSMMSYA LP SFWGYA
Subjt:  ----LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYA

Query:  IKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPR
        ++TAV ILN VPSKSVSETP +LW GRK SL+HFRIWGC AHVL   PKKLEPRS+LC FVGYPK TRGG FYDP++NKV VSTNATFLEEDH+R HKPR
Subjt:  IKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPR

Query:  SKLVLN----EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLI------------------------SSNEWRDKDQWIKAMDL
        SK+VLN    E T   TRVV +    +RV     +S+R+   QSL   RR     +  I                         + E  DKD+WIKAM+L
Subjt:  SKLVLN----EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLI------------------------SSNEWRDKDQWIKAMDL

Query:  EMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNL
        E+ESM FNSVW+LVDQ +GV+PIGCKWIYKRKR A GKVQTFKARLVAKG+TQ EGVDYEE F PVAMLKSIRILLSIAA++DYEIWQMDVKTAFLNGNL
Subjt:  EMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNL

Query:  DESIYMSQPEG
        +E+IYM QPEG
Subjt:  DESIYMSQPEG

A0A5A7V4M1 Gag/pol protein1.1e-17648.46Show/hide
Query:  MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTR-----KKRKGGKG
        MN AVIDE SQVSFILESLP+SFLQF SNAVMNKI Y LTTLLNELQ+FESL+K KGQ  GEAN+   +++F + S+SGTK    +      KK+KGG+G
Subjt:  MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTR-----KKRKGGKG

Query:  ------------KASATADI--------------------ELKEKKGKLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVG-
                    KA A   I                    + K K+GK DLLVLETCLVEN+D  WI+DSGATNHVCSSFQ  SS+++LE GEMT+RVG 
Subjt:  ------------KASATADI--------------------ELKEKKGKLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVG-

Query:  -----------------------------------------------------------------------------RETSSQ-----------------
                                                                                     R  +S+                 
Subjt:  -----------------------------------------------------------------------------RETSSQ-----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYA
            L+Q +   LEKFKEYKA+VENAL KTIKT RSDRGGEYMDL+FQ+Y++E GI SQLSAP TPQ++GVSERRNRTLLDMVRSMMSYA LP SFWGYA
Subjt:  ----LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYA

Query:  IKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPR
        ++TAV ILN VPSKSVSETP +LW GRK SL+HFRIWGC AHVL   PKKLEPRS+LC FVGYPK TRGG FYDP++NKV VSTNATFLEEDH+R HKPR
Subjt:  IKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPR

Query:  SKLVLN----EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLI------------------------SSNEWRDKDQWIKAMDL
        SK+VLN    E T   TRVV +    +RV     +S+R+   QSL   RR     +  I                         + E  DKD+WIKAM+L
Subjt:  SKLVLN----EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLI------------------------SSNEWRDKDQWIKAMDL

Query:  EMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNL
        E+ESM FNSVW+LVDQ +GV+PIGCKWIYKRKR A GKVQTFKARLVAKG+TQ EGVDYEE F PVAMLKSIRILLSIAA++DYEIWQMDVKTAFLNGNL
Subjt:  EMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNL

Query:  DESIYMSQPEG
        +E+IYM QPEG
Subjt:  DESIYMSQPEG

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.2e-4426.13Show/hide
Query:  LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYAIKTA
        L++++      F+++ AK E      +  L  D G EY+    + + ++ GI   L+ P+TPQ +GVSER  RT+ +  R+M+S A+L  SFWG A+ TA
Subjt:  LVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYAIKTA

Query:  VQILNSVPSKSV---SETPFELWKGRKPSLQHFRIWGCLAHVLVTIPK-KLEPRSRLCQFVGYPKETRGGLFYD--------------------------
          ++N +PS+++   S+TP+E+W  +KP L+H R++G   +V +   + K + +S    FVGY  E  G   +D                          
Subjt:  VQILNSVPSKSV---SETPFELWKGRKPSLQHFRIWGCLAHVLVTIPK-KLEPRSRLCQFVGYPKETRGGLFYD--------------------------

Query:  -----------------PQENKVIVST----------NATFLEE-------------------------------DHMRNHKPRSKLVLNEA--------
                         P +++ I+ T          N  FL++                                 +++ K  +K  LNE+        
Subjt:  -----------------PQENKVIVST----------NATFLEE-------------------------------DHMRNHKPRSKLVLNEA--------

Query:  -----------------THEPTRVVNQAGPSS----RVDGRASTSSRSRPSQSLGMSRRRWEKI----HCLIS-------SNEWR-DKDQWIKAMDLEME
                         T E  + +    P+      +  R S   +++P  S         K+    H + +         ++R DK  W +A++ E+ 
Subjt:  -----------------THEPTRVVNQAGPSS----RVDGRASTSSRSRPSQSLGMSRRRWEKI----HCLIS-------SNEWR-DKDQWIKAMDLEME

Query:  SMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNLDES
        +   N+ W +  + E    +  +W++  K +  G    +KARLVA+GFTQ+  +DYEE F PVA + S R +LS+   Y+ ++ QMDVKTAFLNG L E 
Subjt:  SMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNLDES

Query:  IYMSQPEGL
        IYM  P+G+
Subjt:  IYMSQPEGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.2e-6937.86Show/hide
Query:  FKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYAIKTAVQILNSVPSKS
        F+++ A VE   G+ +K LRSD GGEY    F++Y   HGI+ + + P TPQ +GV+ER NRT+++ VRSM+  A+LP SFWG A++TA  ++N  PS  
Subjt:  FKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYAIKTAVQILNSVPSKS

Query:  VS-ETPFELWKGRKPSLQHFRIWGC--LAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPRSKLVLN------
        ++ E P  +W  ++ S  H +++GC   AHV      KL+ +S  C F+GY  E  G   +DP + KVI S +  F  E  +R     S+ V N      
Subjt:  VS-ETPFELWKGRKPSLQHFRIWGC--LAHVLVTIPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPRSKLVLN------

Query:  -----------------------------------------EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQS--LGMSRRRWEKIHCLISSNEWRDK
                                                 E    PT+   Q  P  R + R    SR  PS    L    R  E +  ++S  E   K
Subjt:  -----------------------------------------EATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQS--LGMSRRRWEKIHCLISSNEWRDK

Query:  DQWIKAMDLEMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDV
        +Q +KAM  EMES+  N  ++LV+  +G RP+ CKW++K K+D   K+  +KARLV KGF Q++G+D++EIF PV  + SIR +LS+AA  D E+ Q+DV
Subjt:  DQWIKAMDLEMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDV

Query:  KTAFLNGNLDESIYMSQPEG
        KTAFL+G+L+E IYM QPEG
Subjt:  KTAFLNGNLDESIYMSQPEG

P92520 Uncharacterized mitochondrial protein AtMg008202.8e-1239.53Show/hide
Query:  WIKAMDLEMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIA
        W +AM  E++++  N  W LV        +GCKW++K K  + G +   KARLVAKGF Q EG+ + E + PV    +IR +L++A
Subjt:  WIKAMDLEMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.1e-3625.19Show/hide
Query:  EKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYAIKTAVQILNSVPS
        E F  +K  +EN     I T  SD GGE++ L   +Y  +HGI    S P+TP+ +G+SER++R +++   +++S+A +P ++W YA   AV ++N +P+
Subjt:  EKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYAIKTAVQILNSVPS

Query:  KSVS-ETPFELWKGRKPSLQHFRIWGCLAHVLVT--IPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEE-----------DHMRNHKP
          +  E+PF+   G  P+    R++GC  +  +      KL+ +SR C F+GY       L    Q +++ +S +  F E              ++  + 
Subjt:  KSVS-ETPFELWKGRKPSLQHFRIWGCLAHVLVT--IPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLEE-----------DHMRNHKP

Query:  RSKLVLNEATHEPTRVV-------------------------NQAGPSSRVDGRASTS------------------------------------------
         S  V +  T  PTR                           N    SS +D   S+S                                          
Subjt:  RSKLVLNEATHEPTRVV-------------------------NQAGPSSRVDGRASTS------------------------------------------

Query:  -------SRSRPSQSLGMS-------------------------------------------------------RRRWEKIHCLISSNEWR------DKD
               S S P+QS   S                                                         ++     L + +E R        +
Subjt:  -------SRSRPSQSLGMS-------------------------------------------------------RRRWEKIHCLISSNEWR------DKD

Query:  QWIKAMDLEMESMDFNSVWELVDQSEG-VRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDV
        +W  AM  E+ +   N  W+LV      V  +GC+WI+ +K ++ G +  +KARLVAKG+ QR G+DY E F PV    SIRI+L +A    + I Q+DV
Subjt:  QWIKAMDLEMESMDFNSVWELVDQSEG-VRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDV

Query:  KTAFLNGNLDESIYMSQPEG
          AFL G L + +YMSQP G
Subjt:  KTAFLNGNLDESIYMSQPEG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.2e-3724.25Show/hide
Query:  WQMPTLEK------FKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYAI
        W  P  +K      F  +K+ VEN     I TL SD GGE++ LR  DY+ +HGI    S P+TP+ +G+SER++R +++M  +++S+A +P ++W YA 
Subjt:  WQMPTLEK------FKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYAI

Query:  KTAVQILNSVPSKSVS-ETPFELWKGRKPSLQHFRIWGCLAHVLVT--IPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLE--------
          AV ++N +P+  +  ++PF+   G+ P+ +  +++GC  +  +      KLE +S+ C F+GY       L       ++  S +  F E        
Subjt:  KTAVQILNSVPSKSVS-ETPFELWKGRKPSLQHFRIWGCLAHVLVT--IPKKLEPRSRLCQFVGYPKETRGGLFYDPQENKVIVSTNATFLE--------

Query:  -----------------------------------------------------------------------------------------EDHMRNHKPRS
                                                                                                 + H   +   +
Subjt:  -----------------------------------------------------------------------------------------EDHMRNHKPRS

Query:  KLVLN-----------------------EATHEPTRVVNQAGPSSRVDGRAST------------------------SSRSRPSQSLGMSRRRWEKIHCL
          +LN                        + H PT   + + P+S      ST                        S  +R    +    +++     L
Subjt:  KLVLN-----------------------EATHEPTRVVNQAGPSSRVDGRAST------------------------SSRSRPSQSLGMSRRRWEKIHCL

Query:  ISSNEWR------DKDQWIKAMDLEMESMDFNSVWELV-DQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRIL
         +++E R        D+W +AM  E+ +   N  W+LV      V  +GC+WI+ +K ++ G +  +KARLVAKG+ QR G+DY E F PV    SIRI+
Subjt:  ISSNEWR------DKDQWIKAMDLEMESMDFNSVWELV-DQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRIL

Query:  LSIAAFYDYEIWQMDVKTAFLNGNLDESIYMSQPEG
        L +A    + I Q+DV  AFL G L + +YMSQP G
Subjt:  LSIAAFYDYEIWQMDVKTAFLNGNLDESIYMSQPEG

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.9e-3046.03Show/hide
Query:  NEWRDKDQWIKAMDLEMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYE
        NE ++   W  AMD E+ +M+    WE+       +PIGCKW+YK K ++ G ++ +KARLVAKG+TQ+EG+D+ E F PV  L S++++L+I+A Y++ 
Subjt:  NEWRDKDQWIKAMDLEMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYE

Query:  IWQMDVKTAFLNGNLDESIYMSQPEG
        + Q+D+  AFLNG+LDE IYM  P G
Subjt:  IWQMDVKTAFLNGNLDESIYMSQPEG

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.5e-0836.59Show/hide
Query:  NRTLLDMVRSMMSYAQLPASFWGYAIKTAVQILNSVPSKSVS-ETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSR
        NRT+++ VRSM+    LP +F   A  TAV I+N  PS +++   P E+W    P+  + R +GC+A++      KL+PR++
Subjt:  NRTLLDMVRSMMSYAQLPASFWGYAIKTAVQILNSVPSKSVS-ETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSR

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.0e-1339.53Show/hide
Query:  WIKAMDLEMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIA
        W +AM  E++++  N  W LV        +GCKW++K K  + G +   KARLVAKGF Q EG+ + E + PV    +IR +L++A
Subjt:  WIKAMDLEMESMDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGAAGCGGTCATTGACGAGCAAAGTCAGGTATCGTTTATCCTAGAATCTCTTCCGAAGAGTTTCCTGCAATTCCACAGCAATGCGGTGATGAACAAGATAGAGTA
TAACCTGACTACTCTCCTAAATGAACTACAATCTTTCGAGTCTCTTATTAAGAATAAGGGACAGGCTGATGGAGAGGCAAATCTGTTTGCCCATTCCAAAAGATTCCAGA
AGAGTTCATCCTCTGGGACTAAGCCCTGTGGTTTGACTCGGAAAAAGAGGAAAGGAGGCAAAGGGAAAGCTTCTGCCACTGCAGACATTGAGCTCAAGGAGAAGAAAGGT
AAATTAGATTTGCTTGTTCTTGAAACATGTTTAGTGGAAAATAATGATTTTACCTGGATACTTGATTCAGGAGCCACTAATCATGTTTGCTCTTCATTTCAGAAAACTAG
TTCCTTCAAGGAGCTCGAAGAGGGTGAGATGACGCTCAGGGTCGGACGGGAGACGTCGTCTCAGCTCGTGCAGTGGCAGATGCCAACTCTTGAAAAGTTCAAAGAGTATA
AGGCAAAAGTAGAGAATGCATTAGGAAAAACCATTAAAACACTTCGATCCGATCGAGGTGGAGAGTATATGGATTTGAGATTTCAGGACTATATGATAGAACATGGAATC
AAATCTCAACTCTCAGCACCAAATACACCACAAAAACATGGTGTGTCAGAAAGGAGAAATAGAACCTTGTTAGACATGGTTCGTTCTATGATGAGCTATGCTCAATTGCC
TGCCTCATTTTGGGGATACGCAATAAAGACTGCAGTTCAAATCTTGAACAGTGTTCCATCAAAAAGTGTTTCAGAAACACCTTTTGAACTTTGGAAGGGGCGTAAACCTA
GTTTACAACACTTCAGGATTTGGGGTTGTCTGGCACACGTGCTCGTGACAATCCCAAAGAAACTGGAACCTCGTTCAAGATTGTGCCAATTTGTTGGCTATCCCAAAGAA
ACGAGAGGTGGTCTTTTCTACGACCCACAAGAAAACAAGGTGATTGTATCGACAAACGCCACGTTCTTGGAGGAAGATCACATGAGGAACCACAAACCGCGTAGTAAATT
AGTACTAAATGAAGCTACACATGAACCAACAAGAGTTGTTAATCAAGCTGGACCTTCATCAAGAGTTGATGGAAGAGCCAGTACCTCAAGTCGATCTCGTCCTTCTCAAT
CGTTGGGAATGTCTCGACGCAGATGGGAGAAGATCCATTGTCTTATAAGCAGCAATGAATGGCGTGACAAGGACCAATGGATCAAAGCCATGGACCTTGAAATGGAGTCA
ATGGACTTCAATTCAGTATGGGAACTTGTAGACCAATCTGAAGGAGTTAGACCTATAGGGTGTAAATGGATCTATAAGAGAAAGAGAGATGCAACCGGAAAGGTACAGAC
CTTTAAGGCTAGGCTTGTAGCAAAGGGTTTTACCCAAAGGGAAGGAGTTGACTATGAAGAAATTTTTTTCCCTGTTGCTATGCTGAAGTCTATAAGGATACTCTTGTCCA
TAGCCGCGTTTTATGATTATGAAATTTGGCAGATGGACGTCAAGACTGCCTTTTTGAATGGTAATCTTGACGAGAGCATCTATATGTCTCAGCCCGAAGGGTTATAG
mRNA sequenceShow/hide mRNA sequence
ATGAACGAAGCGGTCATTGACGAGCAAAGTCAGGTATCGTTTATCCTAGAATCTCTTCCGAAGAGTTTCCTGCAATTCCACAGCAATGCGGTGATGAACAAGATAGAGTA
TAACCTGACTACTCTCCTAAATGAACTACAATCTTTCGAGTCTCTTATTAAGAATAAGGGACAGGCTGATGGAGAGGCAAATCTGTTTGCCCATTCCAAAAGATTCCAGA
AGAGTTCATCCTCTGGGACTAAGCCCTGTGGTTTGACTCGGAAAAAGAGGAAAGGAGGCAAAGGGAAAGCTTCTGCCACTGCAGACATTGAGCTCAAGGAGAAGAAAGGT
AAATTAGATTTGCTTGTTCTTGAAACATGTTTAGTGGAAAATAATGATTTTACCTGGATACTTGATTCAGGAGCCACTAATCATGTTTGCTCTTCATTTCAGAAAACTAG
TTCCTTCAAGGAGCTCGAAGAGGGTGAGATGACGCTCAGGGTCGGACGGGAGACGTCGTCTCAGCTCGTGCAGTGGCAGATGCCAACTCTTGAAAAGTTCAAAGAGTATA
AGGCAAAAGTAGAGAATGCATTAGGAAAAACCATTAAAACACTTCGATCCGATCGAGGTGGAGAGTATATGGATTTGAGATTTCAGGACTATATGATAGAACATGGAATC
AAATCTCAACTCTCAGCACCAAATACACCACAAAAACATGGTGTGTCAGAAAGGAGAAATAGAACCTTGTTAGACATGGTTCGTTCTATGATGAGCTATGCTCAATTGCC
TGCCTCATTTTGGGGATACGCAATAAAGACTGCAGTTCAAATCTTGAACAGTGTTCCATCAAAAAGTGTTTCAGAAACACCTTTTGAACTTTGGAAGGGGCGTAAACCTA
GTTTACAACACTTCAGGATTTGGGGTTGTCTGGCACACGTGCTCGTGACAATCCCAAAGAAACTGGAACCTCGTTCAAGATTGTGCCAATTTGTTGGCTATCCCAAAGAA
ACGAGAGGTGGTCTTTTCTACGACCCACAAGAAAACAAGGTGATTGTATCGACAAACGCCACGTTCTTGGAGGAAGATCACATGAGGAACCACAAACCGCGTAGTAAATT
AGTACTAAATGAAGCTACACATGAACCAACAAGAGTTGTTAATCAAGCTGGACCTTCATCAAGAGTTGATGGAAGAGCCAGTACCTCAAGTCGATCTCGTCCTTCTCAAT
CGTTGGGAATGTCTCGACGCAGATGGGAGAAGATCCATTGTCTTATAAGCAGCAATGAATGGCGTGACAAGGACCAATGGATCAAAGCCATGGACCTTGAAATGGAGTCA
ATGGACTTCAATTCAGTATGGGAACTTGTAGACCAATCTGAAGGAGTTAGACCTATAGGGTGTAAATGGATCTATAAGAGAAAGAGAGATGCAACCGGAAAGGTACAGAC
CTTTAAGGCTAGGCTTGTAGCAAAGGGTTTTACCCAAAGGGAAGGAGTTGACTATGAAGAAATTTTTTTCCCTGTTGCTATGCTGAAGTCTATAAGGATACTCTTGTCCA
TAGCCGCGTTTTATGATTATGAAATTTGGCAGATGGACGTCAAGACTGCCTTTTTGAATGGTAATCTTGACGAGAGCATCTATATGTCTCAGCCCGAAGGGTTATAG
Protein sequenceShow/hide protein sequence
MNEAVIDEQSQVSFILESLPKSFLQFHSNAVMNKIEYNLTTLLNELQSFESLIKNKGQADGEANLFAHSKRFQKSSSSGTKPCGLTRKKRKGGKGKASATADIELKEKKG
KLDLLVLETCLVENNDFTWILDSGATNHVCSSFQKTSSFKELEEGEMTLRVGRETSSQLVQWQMPTLEKFKEYKAKVENALGKTIKTLRSDRGGEYMDLRFQDYMIEHGI
KSQLSAPNTPQKHGVSERRNRTLLDMVRSMMSYAQLPASFWGYAIKTAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCLAHVLVTIPKKLEPRSRLCQFVGYPKE
TRGGLFYDPQENKVIVSTNATFLEEDHMRNHKPRSKLVLNEATHEPTRVVNQAGPSSRVDGRASTSSRSRPSQSLGMSRRRWEKIHCLISSNEWRDKDQWIKAMDLEMES
MDFNSVWELVDQSEGVRPIGCKWIYKRKRDATGKVQTFKARLVAKGFTQREGVDYEEIFFPVAMLKSIRILLSIAAFYDYEIWQMDVKTAFLNGNLDESIYMSQPEGL