; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002938 (gene) of Snake gourd v1 genome

Gene IDTan0002938
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG01:65068439..65070862
RNA-Seq ExpressionTan0002938
SyntenyTan0002938
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]1.7e-21760.25Show/hide
Query:  ESLKYVYNSRMNEGSSVREHVLDLMVHF-----NVAEMNGAVIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSSGTKSYGTSSGLKKTQKKKIGGKGK
        +S++  Y+         R H+L  M         +      ++D   +   +   Q  ++AN +AHS+R         +   + SG +K QK+K  GKGK
Subjt:  ESLKYVYNSRMNEGSSVREHVLDLMVHF-----NVAEMNGAVIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSSGTKSYGTSSGLKKTQKKKIGGKGK

Query:  APA-ADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELKEKK------------------------GATNHVCSSFQETSSFKELEEGEMTLRVGTGD
         P  A + KGK KVA K  CFHCNVD HWK NCPKYLV+ KEK+                        GATNHVCSS QETSSFK+LE+ EMTL+VGTGD
Subjt:  APA-ADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELKEKK------------------------GATNHVCSSFQETSSFKELEEGEMTLRVGTGD

Query:  VVSARAVGDAKL----------------------------------------------------------------------------------------
        V+SARAVGDAKL                                                                                        
Subjt:  VVSARAVGDAKL----------------------------------------------------------------------------------------

Query:  -------------LGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGY
                     LGHINL+RI RL KNGLLNKL+D SLPPCESCLEGKMTKRPF GKGYRA EPLEL+HSDLCGP+NVKARGG+EYFISFIDDY+RYGY
Subjt:  -------------LGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGY

Query:  LYLMHHKSEALEKFKEYKAEVENALGKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVE
        LYLM HKSEALEKFKEYK EVEN L K IK  RSDRGGEYMDLRFQDY+IEHGI+SQLS P TPQQNGVSERRNRTLLDMVRSMMSYAQLP+ FWGYAVE
Subjt:  LYLMHHKSEALEKFKEYKAEVENALGKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVE

Query:  TAVQILNVVPSKSVLETPFELWKGRKPSLQHFRIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMRNHKLRSK
        TAV ILN VPSKSV ETPFELW+GRKPSL HFRIWG PAH+LVTNPKKLEPRS+LCQFVGYPKETRGG F+DPQEN+V VSTNATFLEE+HMRNHK RSK
Subjt:  TAVQILNVVPSKSVLETPFELWKGRKPSLQHFRIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMRNHKLRSK

Query:  LVLNEATNKSTRVVDQAGPSSRVDGEASTSSPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIPDDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMD
        LVL+EAT++STRVVD+ GPSSRVD E +TS  + PSQSL +PRRS RVV QP+RYLGL ETQVVIPDDGVEDPLSYKQAMNDVDKDQW+K MDLEMESM 
Subjt:  LVLNEATNKSTRVVDQAGPSSRVDGEASTSSPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIPDDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMD

Query:  FNSVWEL
        FNSVWEL
Subjt:  FNSVWEL

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-23273.02Show/hide
Query:  ESLKYVYNSRMNEGSSVREHVLDLMVHF-----NVAEMNGAVIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSSGTKSYGTSSGLKKTQKKKIGGKGK
        +S++  Y+         R H+L  M         +      ++D   +   +   Q  ++AN +AHS+R         +   + SG +K QK+K  GKGK
Subjt:  ESLKYVYNSRMNEGSSVREHVLDLMVHF-----NVAEMNGAVIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSSGTKSYGTSSGLKKTQKKKIGGKGK

Query:  APA-ADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELKEKKGATNHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAKLLGHINLNRIERL
         P  A + KGK KVA K  CFHCNVD HWK NCPKYLV+ KEK+GATNHVCSS QETSSFK+LE+ EMTL+VGTGDV+SARAVGDAK LGHINL+RI RL
Subjt:  APA-ADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELKEKKGATNHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAKLLGHINLNRIERL

Query:  SKNGLLNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALEKFKEYKAEVENAL
         KNGLLNKL+D SLPPCESCLEGKMTKRPF GKGYRA EPLEL+HSDLCGP+NVKARGG+EYFISFIDDY+RYGYLYLM HKSEALEKFKEYK EVEN L
Subjt:  SKNGLLNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALEKFKEYKAEVENAL

Query:  GKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSVLETPFELWKGR
         K IK  RSDRGGEYMDLRFQDY+IEHGI+SQLS P TPQQNGVSERRNRTLLDMVRSMMSYAQLP+ FWGYAVETAV ILN VPSKSV ETPFELW+GR
Subjt:  GKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSVLETPFELWKGR

Query:  KPSLQHFRIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMRNHKLRSKLVLNEATNKSTRVVDQAGPSSRVDG
        KPSL HFRIWG PAH+LVTNPKKLEPRS+LCQFVGYPKETRGG F+DPQEN+V VSTNATFLEE+HMRNHK RSKLVL+EAT++STRVVD+ GPSSRVD 
Subjt:  KPSLQHFRIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMRNHKLRSKLVLNEATNKSTRVVDQAGPSSRVDG

Query:  EASTSSPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIPDDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMDFNSVWEL
        E +TS  + PSQSL +PRRS RVV QP+RYLGL ETQVVIPDDGVEDPLSYKQAMNDVDKDQW+K MDLEMESM FNSVWEL
Subjt:  EASTSSPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIPDDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMDFNSVWEL

KAA0065386.1 gag/pol protein [Cucumis melo var. makuwa]9.0e-21970.19Show/hide
Query:  VIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSSGTKSYGTSSGLKKTQKKKIGGKGKAPA-ADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELK
        ++D   +       Q  +QAN  AHSRRF            +SS  +K QK+K  GKGK P  A + K K KV  K  CFHCNVD HWK NCPKYLV+ K
Subjt:  VIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSSGTKSYGTSSGLKKTQKKKIGGKGKAPA-ADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELK

Query:  EKKGATNHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAK------------------------------------LLGHINLNRIERLSKNGL
        +K+GATN VCSS QET+SFK+LE+ +MTL+VGTGDV+SARAVGDAK                                     LGHINL++I RL KNGL
Subjt:  EKKGATNHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAK------------------------------------LLGHINLNRIERLSKNGL

Query:  LNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIK
        LNKLEDDSLPPCES LEGKMTKRPFIGKGYRA EPLEL+HSDL GP+NVKAR G+EYFISFIDDY+RYGYLYLM HKSEALEK KEY+ EVEN L + IK
Subjt:  LNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIK

Query:  TFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSVLETPFELWKGRKPSLQ
          RSDRGGEYMDLRFQDY+IEHGI+SQLS   TPQQNGVSERRNRTLLDMVRSMMSYAQ P+ FWGYAVETAV ILN VPSKSV E PFELW+GRKPSL 
Subjt:  TFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSVLETPFELWKGRKPSLQ

Query:  HFRIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMRNHKLRSKLVLNEATNKSTRVVDQAGPSSRVDGEASTS
        HFRIWG P HMLVTNPKKLEPRS+LCQFVGYPK+TRGG F+DPQEN+V VSTNATFLEE+HMR+HK RSKLVLNEATN+STRVVD+ GPSSRVD E +TS
Subjt:  HFRIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMRNHKLRSKLVLNEATNKSTRVVDQAGPSSRVDGEASTS

Query:  SPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIPDDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMDFNSVWEL
          + PSQ L +PR S R+V +P+RYLGL ETQVVIPDDGVEDPLSYKQAMNDVDKDQW+K MDLEMESM FNSVWEL
Subjt:  SPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIPDDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMDFNSVWEL

TYK02840.1 gag/pol protein [Cucumis melo var. makuwa]1.1e-23273.02Show/hide
Query:  ESLKYVYNSRMNEGSSVREHVLDLMVHF-----NVAEMNGAVIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSSGTKSYGTSSGLKKTQKKKIGGKGK
        +S++  Y+         R H+L  M         +      ++D   +   +   Q  ++AN +AHS+R         +   + SG +K QK+K  GKGK
Subjt:  ESLKYVYNSRMNEGSSVREHVLDLMVHF-----NVAEMNGAVIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSSGTKSYGTSSGLKKTQKKKIGGKGK

Query:  APA-ADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELKEKKGATNHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAKLLGHINLNRIERL
         P  A + KGK KVA K  CFHCNVD HWK NCPKYLV+ KEK+GATNHVCSS QETSSFK+LE+ EMTL+VGTGDV+SARAVGDAK LGHINL+RI RL
Subjt:  APA-ADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELKEKKGATNHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAKLLGHINLNRIERL

Query:  SKNGLLNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALEKFKEYKAEVENAL
         KNGLLNKL+D SLPPCESCLEGKMTKRPF GKGYRA EPLEL+HSDLCGP+NVKARGG+EYFISFIDDY+RYGYLYLM HKSEALEKFKEYK EVEN L
Subjt:  SKNGLLNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALEKFKEYKAEVENAL

Query:  GKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSVLETPFELWKGR
         K IK  RSDRGGEYMDLRFQDY+IEHGI+SQLS P TPQQNGVSERRNRTLLDMVRSMMSYAQLP+ FWGYAVETAV ILN VPSKSV ETPFELW+GR
Subjt:  GKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSVLETPFELWKGR

Query:  KPSLQHFRIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMRNHKLRSKLVLNEATNKSTRVVDQAGPSSRVDG
        KPSL HFRIWG PAH+LVTNPKKLEPRS+LCQFVGYPKETRGG F+DPQEN+V VSTNATFLEE+HMRNHK RSKLVL+EAT++STRVVD+ GPSSRVD 
Subjt:  KPSLQHFRIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMRNHKLRSKLVLNEATNKSTRVVDQAGPSSRVDG

Query:  EASTSSPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIPDDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMDFNSVWEL
        E +TS  + PSQSL +PRRS RVV QP+RYLGL ETQVVIPDDGVEDPLSYKQAMNDVDKDQW+K MDLEMESM FNSVWEL
Subjt:  EASTSSPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIPDDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMDFNSVWEL

TYK28868.1 gag/pol protein [Cucumis melo var. makuwa]4.6e-22374.86Show/hide
Query:  VIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSSGTKSYGTSSGLKKTQKKKIGGKGKAPA-ADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELK
        ++D   +       Q  +QAN  AHSRRF            +SS  +K QK+K  GKGK P  A + K K KV  K  CFHCNVD HWK NCPKYLV+ K
Subjt:  VIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSSGTKSYGTSSGLKKTQKKKIGGKGKAPA-ADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELK

Query:  EKKGATNHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAKLLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPL
        +K+GATN VCSS QET+SFK+LE+ +MTL+VGTGDV+SARAVGDAK LGHINL++I RL KNGLLNKLEDDSLPPCES LEGKMTKRPFIGKGYRA EPL
Subjt:  EKKGATNHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAKLLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPL

Query:  ELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQ
        EL+HSDL GP+NVKAR G+EYFISFIDDY+RYGYLYLM HKSEALEK KEY+ EVEN L + IK  RSDRGGEYMDLRFQDY+IEHGI+SQLS   TPQQ
Subjt:  ELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQ

Query:  NGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSVLETPFELWKGRKPSLQHFRIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETR
        NGVSERRNRTLLDMVRSMMSYAQ P+ FWGYAVETAV ILN VPSKSV E PFELW+GRKPSL HFRIWG P HMLVTNPKKLEPRS+LCQFVGYPK+TR
Subjt:  NGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSVLETPFELWKGRKPSLQHFRIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETR

Query:  GGHFYDPQENKVIVSTNATFLEENHMRNHKLRSKLVLNEATNKSTRVVDQAGPSSRVDGEASTSSPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIP
        GG F+DPQEN+V VSTNATFLEE+HMR+HK RSKLVLNEATN+STRVVD+ GPSSRVD E +TS  + PSQ L +PR S R+V +P+RYLGL ETQVVIP
Subjt:  GGHFYDPQENKVIVSTNATFLEENHMRNHKLRSKLVLNEATNKSTRVVDQAGPSSRVDGEASTSSPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIP

Query:  DDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMDFNSVWEL
        DDGVEDPLSYKQAMNDVDKDQW+K MDLEMESM FNSVWEL
Subjt:  DDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMDFNSVWEL

TrEMBL top hitse value%identityAlignment
A0A5A7TZD0 Gag/pol protein8.2e-21860.25Show/hide
Query:  ESLKYVYNSRMNEGSSVREHVLDLMVHF-----NVAEMNGAVIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSSGTKSYGTSSGLKKTQKKKIGGKGK
        +S++  Y+         R H+L  M         +      ++D   +   +   Q  ++AN +AHS+R         +   + SG +K QK+K  GKGK
Subjt:  ESLKYVYNSRMNEGSSVREHVLDLMVHF-----NVAEMNGAVIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSSGTKSYGTSSGLKKTQKKKIGGKGK

Query:  APA-ADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELKEKK------------------------GATNHVCSSFQETSSFKELEEGEMTLRVGTGD
         P  A + KGK KVA K  CFHCNVD HWK NCPKYLV+ KEK+                        GATNHVCSS QETSSFK+LE+ EMTL+VGTGD
Subjt:  APA-ADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELKEKK------------------------GATNHVCSSFQETSSFKELEEGEMTLRVGTGD

Query:  VVSARAVGDAKL----------------------------------------------------------------------------------------
        V+SARAVGDAKL                                                                                        
Subjt:  VVSARAVGDAKL----------------------------------------------------------------------------------------

Query:  -------------LGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGY
                     LGHINL+RI RL KNGLLNKL+D SLPPCESCLEGKMTKRPF GKGYRA EPLEL+HSDLCGP+NVKARGG+EYFISFIDDY+RYGY
Subjt:  -------------LGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGY

Query:  LYLMHHKSEALEKFKEYKAEVENALGKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVE
        LYLM HKSEALEKFKEYK EVEN L K IK  RSDRGGEYMDLRFQDY+IEHGI+SQLS P TPQQNGVSERRNRTLLDMVRSMMSYAQLP+ FWGYAVE
Subjt:  LYLMHHKSEALEKFKEYKAEVENALGKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVE

Query:  TAVQILNVVPSKSVLETPFELWKGRKPSLQHFRIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMRNHKLRSK
        TAV ILN VPSKSV ETPFELW+GRKPSL HFRIWG PAH+LVTNPKKLEPRS+LCQFVGYPKETRGG F+DPQEN+V VSTNATFLEE+HMRNHK RSK
Subjt:  TAVQILNVVPSKSVLETPFELWKGRKPSLQHFRIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMRNHKLRSK

Query:  LVLNEATNKSTRVVDQAGPSSRVDGEASTSSPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIPDDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMD
        LVL+EAT++STRVVD+ GPSSRVD E +TS  + PSQSL +PRRS RVV QP+RYLGL ETQVVIPDDGVEDPLSYKQAMNDVDKDQW+K MDLEMESM 
Subjt:  LVLNEATNKSTRVVDQAGPSSRVDGEASTSSPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIPDDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMD

Query:  FNSVWEL
        FNSVWEL
Subjt:  FNSVWEL

A0A5A7UYE8 Gag/pol protein5.3e-23373.02Show/hide
Query:  ESLKYVYNSRMNEGSSVREHVLDLMVHF-----NVAEMNGAVIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSSGTKSYGTSSGLKKTQKKKIGGKGK
        +S++  Y+         R H+L  M         +      ++D   +   +   Q  ++AN +AHS+R         +   + SG +K QK+K  GKGK
Subjt:  ESLKYVYNSRMNEGSSVREHVLDLMVHF-----NVAEMNGAVIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSSGTKSYGTSSGLKKTQKKKIGGKGK

Query:  APA-ADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELKEKKGATNHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAKLLGHINLNRIERL
         P  A + KGK KVA K  CFHCNVD HWK NCPKYLV+ KEK+GATNHVCSS QETSSFK+LE+ EMTL+VGTGDV+SARAVGDAK LGHINL+RI RL
Subjt:  APA-ADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELKEKKGATNHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAKLLGHINLNRIERL

Query:  SKNGLLNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALEKFKEYKAEVENAL
         KNGLLNKL+D SLPPCESCLEGKMTKRPF GKGYRA EPLEL+HSDLCGP+NVKARGG+EYFISFIDDY+RYGYLYLM HKSEALEKFKEYK EVEN L
Subjt:  SKNGLLNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALEKFKEYKAEVENAL

Query:  GKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSVLETPFELWKGR
         K IK  RSDRGGEYMDLRFQDY+IEHGI+SQLS P TPQQNGVSERRNRTLLDMVRSMMSYAQLP+ FWGYAVETAV ILN VPSKSV ETPFELW+GR
Subjt:  GKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSVLETPFELWKGR

Query:  KPSLQHFRIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMRNHKLRSKLVLNEATNKSTRVVDQAGPSSRVDG
        KPSL HFRIWG PAH+LVTNPKKLEPRS+LCQFVGYPKETRGG F+DPQEN+V VSTNATFLEE+HMRNHK RSKLVL+EAT++STRVVD+ GPSSRVD 
Subjt:  KPSLQHFRIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMRNHKLRSKLVLNEATNKSTRVVDQAGPSSRVDG

Query:  EASTSSPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIPDDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMDFNSVWEL
        E +TS  + PSQSL +PRRS RVV QP+RYLGL ETQVVIPDDGVEDPLSYKQAMNDVDKDQW+K MDLEMESM FNSVWEL
Subjt:  EASTSSPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIPDDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMDFNSVWEL

A0A5A7VGC7 Gag/pol protein4.4e-21970.19Show/hide
Query:  VIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSSGTKSYGTSSGLKKTQKKKIGGKGKAPA-ADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELK
        ++D   +       Q  +QAN  AHSRRF            +SS  +K QK+K  GKGK P  A + K K KV  K  CFHCNVD HWK NCPKYLV+ K
Subjt:  VIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSSGTKSYGTSSGLKKTQKKKIGGKGKAPA-ADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELK

Query:  EKKGATNHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAK------------------------------------LLGHINLNRIERLSKNGL
        +K+GATN VCSS QET+SFK+LE+ +MTL+VGTGDV+SARAVGDAK                                     LGHINL++I RL KNGL
Subjt:  EKKGATNHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAK------------------------------------LLGHINLNRIERLSKNGL

Query:  LNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIK
        LNKLEDDSLPPCES LEGKMTKRPFIGKGYRA EPLEL+HSDL GP+NVKAR G+EYFISFIDDY+RYGYLYLM HKSEALEK KEY+ EVEN L + IK
Subjt:  LNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIK

Query:  TFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSVLETPFELWKGRKPSLQ
          RSDRGGEYMDLRFQDY+IEHGI+SQLS   TPQQNGVSERRNRTLLDMVRSMMSYAQ P+ FWGYAVETAV ILN VPSKSV E PFELW+GRKPSL 
Subjt:  TFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSVLETPFELWKGRKPSLQ

Query:  HFRIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMRNHKLRSKLVLNEATNKSTRVVDQAGPSSRVDGEASTS
        HFRIWG P HMLVTNPKKLEPRS+LCQFVGYPK+TRGG F+DPQEN+V VSTNATFLEE+HMR+HK RSKLVLNEATN+STRVVD+ GPSSRVD E +TS
Subjt:  HFRIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMRNHKLRSKLVLNEATNKSTRVVDQAGPSSRVDGEASTS

Query:  SPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIPDDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMDFNSVWEL
          + PSQ L +PR S R+V +P+RYLGL ETQVVIPDDGVEDPLSYKQAMNDVDKDQW+K MDLEMESM FNSVWEL
Subjt:  SPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIPDDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMDFNSVWEL

A0A5D3BUN8 Gag/pol protein5.3e-23373.02Show/hide
Query:  ESLKYVYNSRMNEGSSVREHVLDLMVHF-----NVAEMNGAVIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSSGTKSYGTSSGLKKTQKKKIGGKGK
        +S++  Y+         R H+L  M         +      ++D   +   +   Q  ++AN +AHS+R         +   + SG +K QK+K  GKGK
Subjt:  ESLKYVYNSRMNEGSSVREHVLDLMVHF-----NVAEMNGAVIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSSGTKSYGTSSGLKKTQKKKIGGKGK

Query:  APA-ADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELKEKKGATNHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAKLLGHINLNRIERL
         P  A + KGK KVA K  CFHCNVD HWK NCPKYLV+ KEK+GATNHVCSS QETSSFK+LE+ EMTL+VGTGDV+SARAVGDAK LGHINL+RI RL
Subjt:  APA-ADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELKEKKGATNHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAKLLGHINLNRIERL

Query:  SKNGLLNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALEKFKEYKAEVENAL
         KNGLLNKL+D SLPPCESCLEGKMTKRPF GKGYRA EPLEL+HSDLCGP+NVKARGG+EYFISFIDDY+RYGYLYLM HKSEALEKFKEYK EVEN L
Subjt:  SKNGLLNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALEKFKEYKAEVENAL

Query:  GKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSVLETPFELWKGR
         K IK  RSDRGGEYMDLRFQDY+IEHGI+SQLS P TPQQNGVSERRNRTLLDMVRSMMSYAQLP+ FWGYAVETAV ILN VPSKSV ETPFELW+GR
Subjt:  GKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSVLETPFELWKGR

Query:  KPSLQHFRIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMRNHKLRSKLVLNEATNKSTRVVDQAGPSSRVDG
        KPSL HFRIWG PAH+LVTNPKKLEPRS+LCQFVGYPKETRGG F+DPQEN+V VSTNATFLEE+HMRNHK RSKLVL+EAT++STRVVD+ GPSSRVD 
Subjt:  KPSLQHFRIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMRNHKLRSKLVLNEATNKSTRVVDQAGPSSRVDG

Query:  EASTSSPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIPDDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMDFNSVWEL
        E +TS  + PSQSL +PRRS RVV QP+RYLGL ETQVVIPDDGVEDPLSYKQAMNDVDKDQW+K MDLEMESM FNSVWEL
Subjt:  EASTSSPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIPDDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMDFNSVWEL

A0A5D3DZH9 Gag/pol protein2.2e-22374.86Show/hide
Query:  VIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSSGTKSYGTSSGLKKTQKKKIGGKGKAPA-ADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELK
        ++D   +       Q  +QAN  AHSRRF            +SS  +K QK+K  GKGK P  A + K K KV  K  CFHCNVD HWK NCPKYLV+ K
Subjt:  VIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSSGTKSYGTSSGLKKTQKKKIGGKGKAPA-ADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELK

Query:  EKKGATNHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAKLLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPL
        +K+GATN VCSS QET+SFK+LE+ +MTL+VGTGDV+SARAVGDAK LGHINL++I RL KNGLLNKLEDDSLPPCES LEGKMTKRPFIGKGYRA EPL
Subjt:  EKKGATNHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAKLLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPL

Query:  ELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQ
        EL+HSDL GP+NVKAR G+EYFISFIDDY+RYGYLYLM HKSEALEK KEY+ EVEN L + IK  RSDRGGEYMDLRFQDY+IEHGI+SQLS   TPQQ
Subjt:  ELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQ

Query:  NGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSVLETPFELWKGRKPSLQHFRIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETR
        NGVSERRNRTLLDMVRSMMSYAQ P+ FWGYAVETAV ILN VPSKSV E PFELW+GRKPSL HFRIWG P HMLVTNPKKLEPRS+LCQFVGYPK+TR
Subjt:  NGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSVLETPFELWKGRKPSLQHFRIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETR

Query:  GGHFYDPQENKVIVSTNATFLEENHMRNHKLRSKLVLNEATNKSTRVVDQAGPSSRVDGEASTSSPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIP
        GG F+DPQEN+V VSTNATFLEE+HMR+HK RSKLVLNEATN+STRVVD+ GPSSRVD E +TS  + PSQ L +PR S R+V +P+RYLGL ETQVVIP
Subjt:  GGHFYDPQENKVIVSTNATFLEENHMRNHKLRSKLVLNEATNKSTRVVDQAGPSSRVDGEASTSSPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIP

Query:  DDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMDFNSVWEL
        DDGVEDPLSYKQAMNDVDKDQW+K MDLEMESM FNSVWEL
Subjt:  DDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMDFNSVWEL

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.9e-4331.01Show/hide
Query:  VGDAKLLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPF--IGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMH
        + D KLL    + R    S   LLN LE  S   CE CL GK  + PF  +        PL +VHSD+CGP+         YF+ F+D +T Y   YL+ 
Subjt:  VGDAKLLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPF--IGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMH

Query:  HKSEALEKFKEYKAEVENALGKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQI
        +KS+    F+++ A+ E      +     D G EY+    + + ++ GI   L+ P+TPQ NGVSER  RT+ +  R+M+S A+L   FWG AV TA  +
Subjt:  HKSEALEKFKEYKAEVENALGKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQI

Query:  LNVVPSKSVLE---TPFELWKGRKPSLQHFRIWGYPAHMLVTNPK-KLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMRNHKLRSKL
        +N +PS+++++   TP+E+W  +KP L+H R++G   ++ + N + K + +S    FVGY  E  G   +D    K IV+ +    E N + +  ++ + 
Subjt:  LNVVPSKSVLE---TPFELWKGRKPSLQHFRIWGYPAHMLVTNPK-KLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMRNHKLRSKL

Query:  VL----NEATNK-----STRVVDQAGPS-SRVDGEASTSSPARPSQSLGIPRRSERVV
        V      E+ NK     S +++    P+ S+          ++ S++   P  S +++
Subjt:  VL----NEATNK-----STRVVDQAGPS-SRVDGEASTSSPARPSQSLGIPRRSERVV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.2e-5833.02Show/hide
Query:  KLLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEAL
        K +GH++   ++ L+K  L++  +  ++ PC+ CL GK  +  F     R +  L+LV+SD+CGP+ +++ GG +YF++FIDD +R  ++Y++  K +  
Subjt:  KLLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEAL

Query:  EKFKEYKAEVENALGKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPS
        + F+++ A VE   G+ +K  RSD GGEY    F++Y   HGI+ + + P TPQ NGV+ER NRT+++ VRSM+  A+LP  FWG AV+TA  ++N  PS
Subjt:  EKFKEYKAEVENALGKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPS

Query:  KSV-LETPFELWKGRKPSLQHFRIWGYP--AHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMR-----NHKLRSKLVL
          +  E P  +W  ++ S  H +++G    AH+      KL+ +S  C F+GY  E  G   +DP + KVI S +  F  E+ +R     + K+++ ++ 
Subjt:  KSV-LETPFELWKGRKPSLQHFRIWGYP--AHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMR-----NHKLRSKLVL

Query:  NEATNKST------------RVVDQAGPSSRV--------DGEASTSSPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIPDDGVEDPLSYKQAMNDV
        N  T  ST             V +Q      V        +G      P +  +     RRSER   +  RY   +   V+I DD   +P S K+ ++  
Subjt:  NEATNKST------------RVVDQAGPSSRV--------DGEASTSSPARPSQSLGIPRRSERVVPQPDRYLGLAETQVVIPDDGVEDPLSYKQAMNDV

Query:  DKDQWIKVMDLEMESMDFNSVWEL
        +K+Q +K M  EMES+  N  ++L
Subjt:  DKDQWIKVMDLEMESMDFNSVWEL

Q07791 Transposon Ty2-DR3 Gag-Pol polyprotein5.4e-2529.55Show/hide
Query:  KLLGHINLNRIER-LSKNGLLNKLEDD------SLPPCESCLEGKMTKRPFIGKGYR-----AIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYG
        ++LGH N   I++ L KN +    E D      S   C  CL GK TK   I KG R     + EP + +H+D+ GPV+   +    YFISF D+ TR+ 
Subjt:  KLLGHINLNRIER-LSKNGLLNKLEDD------SLPPCESCLEGKMTKRPFIGKGYR-----AIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYG

Query:  YLYLMHHKSE--ALEKFKEYKAEVENALGKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGY
        ++Y +H + E   L  F    A ++N     +   + DRG EY +     +    GI +  +T    + +GV+ER NRTLL+  R+++  + LP   W  
Subjt:  YLYLMHHKSE--ALEKFKEYKAEVENALGKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGY

Query:  AVETAVQILNVVPSKSVLETPFELWKGRKPSLQHFRI----------WGYPAHMLVTNP-KKLEPRSKLCQFVGYPKETRGGH-FYDPQENKVIVSTNAT
        AVE +  I N + S           K  K + QH  +          +G P  +   NP  K+ PR  +  +  +P     G+  Y P   K + +TN  
Subjt:  AVETAVQILNVVPSKSVLETPFELWKGRKPSLQHFRI----------WGYPAHMLVTNP-KKLEPRSKLCQFVGYPKETRGGH-FYDPQENKVIVSTNAT

Query:  FLEENHMR
         L++N  +
Subjt:  FLEENHMR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.7e-3430.87Show/hide
Query:  CESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTFRSDRGGEYM
        C  CL  K  K PF      +  PLE ++SD+     + +   Y Y++ F+D +TRY +LY +  KS+  E F  +K  +EN     I TF SD GGE++
Subjt:  CESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALEKFKEYKAEVENALGKTIKTFRSDRGGEYM

Query:  DLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSV-LETPFELWKGRKPSLQHFRIWG---Y
         L   +Y  +HGI    S P+TP+ NG+SER++R +++   +++S+A +P  +W YA   AV ++N +P+  + LE+PF+   G  P+    R++G   Y
Subjt:  DLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSV-LETPFELWKGRKPSLQHFRIWG---Y

Query:  PAHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEE-----------NHMRNHKLRSKLVLNEATNKSTRVVDQAGPSSRVDGE
        P  +   N  KL+ +S+ C F+GY            Q +++ +S +  F E            + ++  +  S  V +  T   TR      PS      
Subjt:  PAHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEE-----------NHMRNHKLRSKLVLNEATNKSTRVVDQAGPSSRVDGE

Query:  AST--SSPARP
        A+T  SSP+ P
Subjt:  AST--SSPARP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.4e-3634.78Show/hide
Query:  LGHINLNRIERLSKNGLLNKLE-DDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALE
        LGH +L  +  +  N  L  L     L  C  C   K  K PF      + +PLE ++SD+     + +   Y Y++ F+D +TRY +LY +  KS+  +
Subjt:  LGHINLNRIERLSKNGLLNKLE-DDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALE

Query:  KFKEYKAEVENALGKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSK
         F  +K+ VEN     I T  SD GGE++ LR  DY+ +HGI    S P+TP+ NG+SER++R +++M  +++S+A +P  +W YA   AV ++N +P+ 
Subjt:  KFKEYKAEVENALGKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSK

Query:  SV-LETPFELWKGRKPSLQHFRIWG---YPAHMLVTNPKKLEPRSKLCQFVGY
         + L++PF+   G+ P+ +  +++G   YP  +   N  KLE +SK C F+GY
Subjt:  SV-LETPFELWKGRKPSLQHFRIWG---YPAHMLVTNPKKLEPRSKLCQFVGY

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.0e-0636.59Show/hide
Query:  NRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSV-LETPFELWKGRKPSLQHFRIWGYPAHMLVTNPKKLEPRSK
        NRT+++ VRSM+    LP  F   A  TAV I+N  PS ++    P E+W    P+  + R +G  A+ +  +  KL+PR+K
Subjt:  NRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSV-LETPFELWKGRKPSLQHFRIWGYPAHMLVTNPKKLEPRSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCTCAGGTCCCTACTCAAACGCCTTCAATCATTGGGAGGCGTCGCATCACCGGATCAAGGCCAATGATAAGGCCAAGGATATGTTTGGACAAACGTCTGGACAGCT
TCGACACGAATCCCTCAAATACGTTTATAACTCCCGTATGAATGAGGGTTCATCGGTGAGAGAACACGTTCTCGATCTGATGGTCCACTTCAACGTGGCTGAGATGAATG
GAGCGGTCATTGACGAGCAAAGTCAGTCTCTTATGAAGAATAAGGGACAGGCTGATGAACAGGCAAATCTGTTGGCCCATTCTAGAAGGTTCCAGAAGGGTTCATCCTCT
GGGACTAAGTCCTATGGTACATCTTCTGGGCTTAAGAAGACCCAAAAGAAGAAGATAGGAGGGAAAGGGAAGGCACCTGCTGCTGATAAATGCAAGGGAAAAACCAAGGT
TGCAGACAAAGAAAATTGTTTCCACTGCAACGTGGATGGGCACTGGAAGCGAAACTGCCCTAAATACCTTGTTGAGCTCAAAGAGAAGAAAGGAGCCACTAATCATGTTT
GCTCTTCGTTTCAGGAAACTAGTTCCTTCAAGGAGCTCGAAGAGGGTGAGATGACGCTCAGGGTCGGAACGGGGGACGTCGTCTCAGCTCGTGCAGTGGGAGATGCCAAG
CTACTTGGTCATATAAATCTCAACCGGATTGAGAGACTCTCTAAGAATGGACTTCTAAACAAGTTAGAAGATGATTCTTTACCTCCTTGCGAATCATGCTTGGAAGGTAA
AATGACTAAGCGACCTTTTATTGGAAAAGGTTACAGAGCCATAGAGCCCTTAGAACTTGTACATTCGGATCTTTGTGGTCCGGTGAATGTTAAAGCTCGAGGAGGGTACG
AATATTTCATCTCTTTCATAGATGATTATACCAGGTATGGTTATCTATACCTAATGCATCACAAGTCTGAGGCTCTTGAAAAGTTCAAAGAGTATAAGGCTGAAGTAGAG
AATGCATTAGGGAAAACAATTAAAACATTTCGATCCGATCGAGGTGGAGAGTATATGGATTTGAGATTCCAAGACTATATAATAGAACATGGAATTAAATCTCAACTCTC
AACACCTAATACACCACAGCAAAATGGTGTGTCGGAAAGGAGAAATAGAACCTTGTTGGACATGGTTCGTTCTATGATGAGCTATGCTCAATTGCCTGCCTTGTTTTGGG
GATATGCAGTAGAGACTGCAGTTCAAATCTTGAACGTTGTTCCATCAAAGAGTGTTTTAGAAACACCTTTTGAATTGTGGAAGGGGCGTAAACCTAGTTTACAACACTTC
AGGATTTGGGGTTATCCAGCACACATGCTGGTGACAAACCCAAAAAAACTGGAACCTCGTTCAAAATTATGCCAATTTGTTGGCTATCCCAAAGAAACGAGAGGTGGTCA
TTTCTACGACCCACAAGAAAACAAGGTGATTGTATCGACGAATGCCACTTTCTTGGAGGAAAATCACATGAGAAACCATAAACTGCGTAGTAAATTAGTGTTAAATGAAG
CTACAAATAAATCAACAAGAGTTGTTGATCAAGCTGGACCTTCATCAAGAGTTGATGGAGAAGCCAGCACCTCAAGTCCGGCTCGTCCTTCTCAATCGTTGGGAATACCT
CGACGCAGTGAGAGGGTTGTTCCCCAACCTGATCGCTACTTGGGTTTAGCTGAAACTCAAGTTGTCATACCTGATGACGGCGTAGAAGATCCATTGTCTTATAAACAGGC
GATGAATGACGTAGACAAGGACCAATGGATCAAAGTCATGGACCTTGAAATGGAGTCAATGGACTTCAATTCAGTATGGGAATTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCCTCAGGTCCCTACTCAAACGCCTTCAATCATTGGGAGGCGTCGCATCACCGGATCAAGGCCAATGATAAGGCCAAGGATATGTTTGGACAAACGTCTGGACAGCT
TCGACACGAATCCCTCAAATACGTTTATAACTCCCGTATGAATGAGGGTTCATCGGTGAGAGAACACGTTCTCGATCTGATGGTCCACTTCAACGTGGCTGAGATGAATG
GAGCGGTCATTGACGAGCAAAGTCAGTCTCTTATGAAGAATAAGGGACAGGCTGATGAACAGGCAAATCTGTTGGCCCATTCTAGAAGGTTCCAGAAGGGTTCATCCTCT
GGGACTAAGTCCTATGGTACATCTTCTGGGCTTAAGAAGACCCAAAAGAAGAAGATAGGAGGGAAAGGGAAGGCACCTGCTGCTGATAAATGCAAGGGAAAAACCAAGGT
TGCAGACAAAGAAAATTGTTTCCACTGCAACGTGGATGGGCACTGGAAGCGAAACTGCCCTAAATACCTTGTTGAGCTCAAAGAGAAGAAAGGAGCCACTAATCATGTTT
GCTCTTCGTTTCAGGAAACTAGTTCCTTCAAGGAGCTCGAAGAGGGTGAGATGACGCTCAGGGTCGGAACGGGGGACGTCGTCTCAGCTCGTGCAGTGGGAGATGCCAAG
CTACTTGGTCATATAAATCTCAACCGGATTGAGAGACTCTCTAAGAATGGACTTCTAAACAAGTTAGAAGATGATTCTTTACCTCCTTGCGAATCATGCTTGGAAGGTAA
AATGACTAAGCGACCTTTTATTGGAAAAGGTTACAGAGCCATAGAGCCCTTAGAACTTGTACATTCGGATCTTTGTGGTCCGGTGAATGTTAAAGCTCGAGGAGGGTACG
AATATTTCATCTCTTTCATAGATGATTATACCAGGTATGGTTATCTATACCTAATGCATCACAAGTCTGAGGCTCTTGAAAAGTTCAAAGAGTATAAGGCTGAAGTAGAG
AATGCATTAGGGAAAACAATTAAAACATTTCGATCCGATCGAGGTGGAGAGTATATGGATTTGAGATTCCAAGACTATATAATAGAACATGGAATTAAATCTCAACTCTC
AACACCTAATACACCACAGCAAAATGGTGTGTCGGAAAGGAGAAATAGAACCTTGTTGGACATGGTTCGTTCTATGATGAGCTATGCTCAATTGCCTGCCTTGTTTTGGG
GATATGCAGTAGAGACTGCAGTTCAAATCTTGAACGTTGTTCCATCAAAGAGTGTTTTAGAAACACCTTTTGAATTGTGGAAGGGGCGTAAACCTAGTTTACAACACTTC
AGGATTTGGGGTTATCCAGCACACATGCTGGTGACAAACCCAAAAAAACTGGAACCTCGTTCAAAATTATGCCAATTTGTTGGCTATCCCAAAGAAACGAGAGGTGGTCA
TTTCTACGACCCACAAGAAAACAAGGTGATTGTATCGACGAATGCCACTTTCTTGGAGGAAAATCACATGAGAAACCATAAACTGCGTAGTAAATTAGTGTTAAATGAAG
CTACAAATAAATCAACAAGAGTTGTTGATCAAGCTGGACCTTCATCAAGAGTTGATGGAGAAGCCAGCACCTCAAGTCCGGCTCGTCCTTCTCAATCGTTGGGAATACCT
CGACGCAGTGAGAGGGTTGTTCCCCAACCTGATCGCTACTTGGGTTTAGCTGAAACTCAAGTTGTCATACCTGATGACGGCGTAGAAGATCCATTGTCTTATAAACAGGC
GATGAATGACGTAGACAAGGACCAATGGATCAAAGTCATGGACCTTGAAATGGAGTCAATGGACTTCAATTCAGTATGGGAATTGTAG
Protein sequenceShow/hide protein sequence
MSSGPYSNAFNHWEASHHRIKANDKAKDMFGQTSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQSLMKNKGQADEQANLLAHSRRFQKGSSS
GTKSYGTSSGLKKTQKKKIGGKGKAPAADKCKGKTKVADKENCFHCNVDGHWKRNCPKYLVELKEKKGATNHVCSSFQETSSFKELEEGEMTLRVGTGDVVSARAVGDAK
LLGHINLNRIERLSKNGLLNKLEDDSLPPCESCLEGKMTKRPFIGKGYRAIEPLELVHSDLCGPVNVKARGGYEYFISFIDDYTRYGYLYLMHHKSEALEKFKEYKAEVE
NALGKTIKTFRSDRGGEYMDLRFQDYIIEHGIKSQLSTPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPALFWGYAVETAVQILNVVPSKSVLETPFELWKGRKPSLQHF
RIWGYPAHMLVTNPKKLEPRSKLCQFVGYPKETRGGHFYDPQENKVIVSTNATFLEENHMRNHKLRSKLVLNEATNKSTRVVDQAGPSSRVDGEASTSSPARPSQSLGIP
RRSERVVPQPDRYLGLAETQVVIPDDGVEDPLSYKQAMNDVDKDQWIKVMDLEMESMDFNSVWEL