CuGenDBv2

Lag0008369 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0008369
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Gag/pol protein
Genome location: chr9:19257008..19265336
RNA-Seq Expression: Lag0008369
Synteny: Lag0008369
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR001878 - Zinc finger, CCHC-type
  IPR012337 - Ribonuclease H-like superfamily
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
  IPR036875 - Zinc finger, CCHC-type superfamily
  IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
  IPR041588 - Integrase zinc-binding domain
  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
ADJ18449.1 gag/pol protein, partial [Bryonia dioica] | e value: 2.3e-209 | % identity: 55.3
Query:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEK-------------------------------E
        M++SI+ LL +E+L G+N+  WKSNLNTILVV+DLRFVLTEECP  P   A + V++AY+RW KAN+K                               E
Subjt:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEK-------------------------------E

Query:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQAS--------------------------------------------
        MFG PS+ L H+A+K+++  +M EG SVREHVLDM+  FNIAE NGG + E +Q S                                            
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQAS--------------------------------------------

Query:  ----SSKQIQKRKGDKGKAPAQAV---------QGKGKA------KVVADKGRCFHCNADGHWKRNCPRYLAEKRRGK--EGKFDLLVLETCLVEHDEFA
            ++  + KRK  +G +    V         +GKGKA      K  ADKG+CFHCN DGHWKRNCP+YLAEK+  K  +GK+DLLV+ETCLVE D   
Subjt:  ----SSKQIQKRKGDKGKAPAQAV---------QGKGKA------KVVADKGRCFHCNADGHWKRNCPRYLAEKRRGK--EGKFDLLVLETCLVEHDEFA

Query:  WILDSGATNHVCSSFQ-GNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRN-----------------------------------------------
        WILDSGATNH+C SFQ  + +++L +GE+TLKVGTG+VVSA AVG   LFF++                                               
Subjt:  WILDSGATNHVCSSFQ-GNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRN-----------------------------------------------

Query:  ------------KPTDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKMTKRPFTGKGYRA
                    +PT A  +L+ EMF+T ETQNK+QKV   S+N YLWHLRLGHIN++RI+RLVK+G+L  +ED SLPPCESCLEGKMTKR FTGKG RA
Subjt:  ------------KPTDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKMTKRPFTGKGYRA

Query:  KEPLELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPG
        K PLEL+HSDLCGPMNVKAR GYEYFISFIDD+SRYG++YL+ HKSE+ EKF+E+KAEVEN +GKTIKTLRSDRGGEY+D +FQDY+IE GIQSQLSAP 
Subjt:  KEPLELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPG

Query:  TPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLEPRTRICQFVVYP
        TPQQNGVSERRNRTLLDMVRSMMSYAQLP SFWGYA+ TA+HILNNVPSKSV ETP+ELWKGR+ SLRYFRIWGCPAHVLV NPKKLEPR+++C FV YP
Subjt:  TPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLEPRTRICQFVVYP

Query:  KETRGGL
        KE+RGGL
Subjt:  KETRGGL

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 5.7e-224 | % identity: 66.24
Query:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKEMFGLPSYQLHHDALKNVFNAK---MLEGQSV
        MSSSII+LLK +QLTGEN+  WKS LN ILV+ DL FVL EECPP P + A+Q+V+DAY+RWTKAN+K    + +      ++ ++ + K   M+  + +
Subjt:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKEMFGLPSYQLHHDALKNVFNAK---MLEGQSV

Query:  REHVLDMINQFNI---AEANGGVVCER--SQASSSKQIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRRGKEGKFDLLV
         + + +M  Q +I    EAN      R     S S++IQKRK  KGK P  AV+ KGKAK VA K +CFHCN D HWK NCP+YL +K+  KEGK+DLLV
Subjt:  REHVLDMINQFNI---AEANGGVVCER--SQASSSKQIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRRGKEGKFDLLV

Query:  LETCLVEHDEFAWILDSGATNHVCSSFQ-GNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRNK----------------------------------
        LETCLVE+D+ AWILDSGATNHVCSS Q  + F+QL D EMTLKVGTGDV+SARAVG AKLFF NK                                  
Subjt:  LETCLVEHDEFAWILDSGATNHVCSSFQ-GNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRNK----------------------------------

Query:  -------------------------PTDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKM
                                 P +AKA+L+HEMF+TA TQNKRQ++SP +NNTYLWHLRLGHIN+DRI RLVKNGLL  ++D SLPPCESCLEGKM
Subjt:  -------------------------PTDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKM

Query:  TKRPFTGKGYRAKEPLELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMI
        TKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR G+EYFISFIDDYSRYGYLYLM HKSEALEKF+E+K EVENLL K IK LRSDRGGEY+D RFQDYMI
Subjt:  TKRPFTGKGYRAKEPLELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMI

Query:  EHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLE
        EHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAV TAVHILNNVPSKSVSETPFELW+GR+PSL +FRIWGCPAHVLVTNPKKLE
Subjt:  EHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLE

Query:  PRTRICQFVVYPKETRGGL
        PR+R+CQFV YPKETRGGL
Subjt:  PRTRICQFVVYPKETRGGL

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 3.9e-220 | % identity: 65.43
Query:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKEMFGLPSYQLHHDALKNVFNAK---MLEGQSV
        MSSSII+LLK +QLTGEN+  WKS LNTILV+ DL FVL EECP  P + A+Q+V+DAY+RWTKAN+K    + +      ++ ++ + K   M+  + +
Subjt:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKEMFGLPSYQLHHDALKNVFNAK---MLEGQSV

Query:  REHVLDMINQFNI---AEANGGVVCER--SQASSSKQIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRRGKEGKFDLLV
         + + +M  Q +I    EAN      R     S S++IQKRK  KGK P  AV+ KGKAK VA K +CFHCN D HWK NCP+YL +K   KE K+DLLV
Subjt:  REHVLDMINQFNI---AEANGGVVCER--SQASSSKQIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRRGKEGKFDLLV

Query:  LETCLVEHDEFAWILDSGATNHVCSSFQ-GNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRNK----------------------------------
        LETCLVE+D+ AWILDSGATNHVCSS Q  + F+QL D EMTLKVGTGDV+SARAVG AKLFF NK                                  
Subjt:  LETCLVEHDEFAWILDSGATNHVCSSFQ-GNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRNK----------------------------------

Query:  -------------------------PTDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKM
                                 P +AKA+L+HEMF+TA TQNKRQ++SP +NNTYLWHLRLGHIN+DRI RLVK+GLL  ++D SLPPCESCLEGKM
Subjt:  -------------------------PTDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKM

Query:  TKRPFTGKGYRAKEPLELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMI
        TKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR  +EYFISFIDDYSRYGYLYLM HKSEALEKF+E+K EVENLL K IK  RSDRGGEY+D  FQDYMI
Subjt:  TKRPFTGKGYRAKEPLELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMI

Query:  EHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLE
        EHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAV TAVHILNNVPSKSVSETPFELW+GR+PSL +FRIWGCPAHVLVTNPKKLE
Subjt:  EHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLE

Query:  PRTRICQFVVYPKETRGGL
        PR+R+CQFV YPKETRGGL
Subjt:  PRTRICQFVVYPKETRGGL

KAA0060534.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 8.6e-204 | % identity: 63.56
Query:  ILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKEMFGLPSYQLHHDALKNVFNAK---MLEGQSVREHVLDMINQ--FNIAEANGGVVCERSQ
        ILV+ DLRFVL EECPP P + A+Q+V+DAY+RWTKANEK    + +      ++ ++ + K   M+  + + + + +M  Q    I +       +R  
Subjt:  ILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKEMFGLPSYQLHHDALKNVFNAK---MLEGQSVREHVLDMINQ--FNIAEANGGVVCERSQ

Query:  ASS--SKQIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRRGKEGKFDLLVLETCLVEHDEFAWILDSGATNHVCSSFQ-
         SS  S++IQKRK  KGK P  A++GKGK KVV  K + FHCN + HWK NCP+YL +K+  KEGK+DLLVLETCLVE+D+ AWILDSGATNHVCSS Q 
Subjt:  ASS--SKQIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRRGKEGKFDLLVLETCLVEHDEFAWILDSGATNHVCSSFQ-

Query:  GNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRN-----------------------------------------------------------KPTDA
         + F+QL + EM LKVGTGDV+SARAVG AKLFF N                                                           KP + 
Subjt:  GNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRN-----------------------------------------------------------KPTDA

Query:  KAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV
        KA+L+HEMF+TA TQNKRQ++S  +NNTYLWHLRLGHIN+DRI RLVKNGLL  +ED SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV
Subjt:  KAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV

Query:  KARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD
        KA  G+EYFISFIDDYS YGYLYL+ HKSEALEKF+E+K EVENLL K IK LRSDRGGEY+D RFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD
Subjt:  KARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD

Query:  MVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLEPRTRICQFVVYPKETRGGL
        MV SMMSY QLPSSFWGYAV TAVHILNNVPSK+V ETPFELW+GR+PSL +FRIW CP HVLVTNPKKLEPR+R+CQFV YPKETRGGL
Subjt:  MVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLEPRTRICQFVVYPKETRGGL

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 2.8e-202 | % identity: 53.3
Query:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEK-------------------------------E
        M+S+ +++L  ++L G N+  WK+ +NT+L+++DLRFVL EECP VP   A + V++ YERW KANEK                               E
Subjt:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEK-------------------------------E

Query:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQAS----------------------------------SSKQIQKRKG
        MFG  SYQ+ HDALK ++NA+M EG SVREHVL+M+  FN+AE NG V+ E SQ S                                  + + + K KG
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQAS----------------------------------SSKQIQKRKG

Query:  DKGKA--------------------------------------PAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRRGKEGKFDLLVLETCL
         KG+A                                       A     K   K  A KG CFHCN +GHWKRNCP+YLAEK++ K+GK+DLLVLETCL
Subjt:  DKGKA--------------------------------------PAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRRGKEGKFDLLVLETCL

Query:  VEHDEFAWILDSGATNHVCSSFQG-NDFQQLADGEMTLKVGTGDVVSARAVGAAKL-------------------------------------------F
        VE+D+ AWI+DSGATNHVCSSFQG + ++QL  GEMT++VGTG VVSA AVG  +L                                            
Subjt:  VEHDEFAWILDSGATNHVCSSFQG-NDFQQLADGEMTLKVGTGDVVSARAVGAAKL-------------------------------------------F

Query:  FRN----------------KPTDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKMTKRPF
        ++N                +   +KA+L+ EMFKTA TQNKR K+SP   N +LWHLRLGHIN++RI+RLVKNGLL+ +E+ SLP CESCLEGKMTKRPF
Subjt:  FRN----------------KPTDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKMTKRPF

Query:  TGKGYRAKEPLELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQ
        TGKG+RAKEPLEL+HSDLCGPMNVKAR G+EYFI+F DDYSRYGY+YLM HKSEALEKF+E+KAEVEN L KTIKT RSDRGGEY+D +FQ+Y++E GI 
Subjt:  TGKGYRAKEPLELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQ

Query:  SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLEPRTRI
        SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV TAV+ILN VPSKSVSETP +LW GR+ SLR+FRIWGCPAHVL  NPKKLEPR+++
Subjt:  SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLEPRTRI

Query:  CQFVVYPKETRGG
        C FV YPK TRGG
Subjt:  CQFVVYPKETRGG

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein | e value: 1.3e-202 | % identity: 53.3
Query:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEK-------------------------------E
        M+S+ +++L  ++L G N+  WK+ +NT+L+++DLRFVL EECP VP   A + V++ YERW KANEK                               E
Subjt:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEK-------------------------------E

Query:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQAS----------------------------------SSKQIQKRKG
        MFG  SYQ+ HDALK ++NA+M EG SVREHVL+M+  FN+AE NG V+ E SQ S                                  + + + K KG
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQAS----------------------------------SSKQIQKRKG

Query:  DKGKA--------------------------------------PAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRRGKEGKFDLLVLETCL
         KG+A                                       A     K   K  A KG CFHCN +GHWKRNCP+YLAEK++ K+GK+DLLVLETCL
Subjt:  DKGKA--------------------------------------PAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRRGKEGKFDLLVLETCL

Query:  VEHDEFAWILDSGATNHVCSSFQG-NDFQQLADGEMTLKVGTGDVVSARAVGAAKL-------------------------------------------F
        VE+D+ AWI+DSGATNHVCSSFQG + ++QL  GEMT++VGTG VVSA AVG  +L                                            
Subjt:  VEHDEFAWILDSGATNHVCSSFQG-NDFQQLADGEMTLKVGTGDVVSARAVGAAKL-------------------------------------------F

Query:  FRN----------------KPTDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKMTKRPF
        ++N                +   +KA+L+ EMFKTA TQNKR K+SP   N +LWHLRLGHIN++RI+RLVKNGLL+ +E+ SLP CESCLEGKMTKRPF
Subjt:  FRN----------------KPTDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKMTKRPF

Query:  TGKGYRAKEPLELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQ
        TGKG+RAKEPLEL+HSDLCGPMNVKAR G+EYFI+F DDYSRYGY+YLM HKSEALEKF+E+KAEVEN L KTIKT RSDRGGEY+D +FQ+Y++E GI 
Subjt:  TGKGYRAKEPLELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQ

Query:  SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLEPRTRI
        SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV TAV+ILN VPSKSVSETP +LW GR+ SLR+FRIWGCPAHVL  NPKKLEPR+++
Subjt:  SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLEPRTRI

Query:  CQFVVYPKETRGG
        C FV YPK TRGG
Subjt:  CQFVVYPKETRGG

A0A5A7T2V9 Gag/pol protein | e value: 1.9e-220 | % identity: 65.43
Query:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKEMFGLPSYQLHHDALKNVFNAK---MLEGQSV
        MSSSII+LLK +QLTGEN+  WKS LNTILV+ DL FVL EECP  P + A+Q+V+DAY+RWTKAN+K    + +      ++ ++ + K   M+  + +
Subjt:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKEMFGLPSYQLHHDALKNVFNAK---MLEGQSV

Query:  REHVLDMINQFNI---AEANGGVVCER--SQASSSKQIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRRGKEGKFDLLV
         + + +M  Q +I    EAN      R     S S++IQKRK  KGK P  AV+ KGKAK VA K +CFHCN D HWK NCP+YL +K   KE K+DLLV
Subjt:  REHVLDMINQFNI---AEANGGVVCER--SQASSSKQIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRRGKEGKFDLLV

Query:  LETCLVEHDEFAWILDSGATNHVCSSFQ-GNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRNK----------------------------------
        LETCLVE+D+ AWILDSGATNHVCSS Q  + F+QL D EMTLKVGTGDV+SARAVG AKLFF NK                                  
Subjt:  LETCLVEHDEFAWILDSGATNHVCSSFQ-GNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRNK----------------------------------

Query:  -------------------------PTDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKM
                                 P +AKA+L+HEMF+TA TQNKRQ++SP +NNTYLWHLRLGHIN+DRI RLVK+GLL  ++D SLPPCESCLEGKM
Subjt:  -------------------------PTDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKM

Query:  TKRPFTGKGYRAKEPLELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMI
        TKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR  +EYFISFIDDYSRYGYLYLM HKSEALEKF+E+K EVENLL K IK  RSDRGGEY+D  FQDYMI
Subjt:  TKRPFTGKGYRAKEPLELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMI

Query:  EHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLE
        EHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAV TAVHILNNVPSKSVSETPFELW+GR+PSL +FRIWGCPAHVLVTNPKKLE
Subjt:  EHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLE

Query:  PRTRICQFVVYPKETRGGL
        PR+R+CQFV YPKETRGGL
Subjt:  PRTRICQFVVYPKETRGGL

A0A5A7TZD0 Gag/pol protein | e value: 2.8e-224 | % identity: 66.24
Query:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKEMFGLPSYQLHHDALKNVFNAK---MLEGQSV
        MSSSII+LLK +QLTGEN+  WKS LN ILV+ DL FVL EECPP P + A+Q+V+DAY+RWTKAN+K    + +      ++ ++ + K   M+  + +
Subjt:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKEMFGLPSYQLHHDALKNVFNAK---MLEGQSV

Query:  REHVLDMINQFNI---AEANGGVVCER--SQASSSKQIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRRGKEGKFDLLV
         + + +M  Q +I    EAN      R     S S++IQKRK  KGK P  AV+ KGKAK VA K +CFHCN D HWK NCP+YL +K+  KEGK+DLLV
Subjt:  REHVLDMINQFNI---AEANGGVVCER--SQASSSKQIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRRGKEGKFDLLV

Query:  LETCLVEHDEFAWILDSGATNHVCSSFQ-GNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRNK----------------------------------
        LETCLVE+D+ AWILDSGATNHVCSS Q  + F+QL D EMTLKVGTGDV+SARAVG AKLFF NK                                  
Subjt:  LETCLVEHDEFAWILDSGATNHVCSSFQ-GNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRNK----------------------------------

Query:  -------------------------PTDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKM
                                 P +AKA+L+HEMF+TA TQNKRQ++SP +NNTYLWHLRLGHIN+DRI RLVKNGLL  ++D SLPPCESCLEGKM
Subjt:  -------------------------PTDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKM

Query:  TKRPFTGKGYRAKEPLELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMI
        TKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR G+EYFISFIDDYSRYGYLYLM HKSEALEKF+E+K EVENLL K IK LRSDRGGEY+D RFQDYMI
Subjt:  TKRPFTGKGYRAKEPLELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMI

Query:  EHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLE
        EHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAV TAVHILNNVPSKSVSETPFELW+GR+PSL +FRIWGCPAHVLVTNPKKLE
Subjt:  EHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLE

Query:  PRTRICQFVVYPKETRGGL
        PR+R+CQFV YPKETRGGL
Subjt:  PRTRICQFVVYPKETRGGL

A0A5D3BNE1 Gag/pol protein | e value: 4.2e-204 | % identity: 63.56
Query:  ILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKEMFGLPSYQLHHDALKNVFNAK---MLEGQSVREHVLDMINQ--FNIAEANGGVVCERSQ
        ILV+ DLRFVL EECPP P + A+Q+V+DAY+RWTKANEK    + +      ++ ++ + K   M+  + + + + +M  Q    I +       +R  
Subjt:  ILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKEMFGLPSYQLHHDALKNVFNAK---MLEGQSVREHVLDMINQ--FNIAEANGGVVCERSQ

Query:  ASS--SKQIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRRGKEGKFDLLVLETCLVEHDEFAWILDSGATNHVCSSFQ-
         SS  S++IQKRK  KGK P  A++GKGK KVV  K + FHCN + HWK NCP+YL +K+  KEGK+DLLVLETCLVE+D+ AWILDSGATNHVCSS Q 
Subjt:  ASS--SKQIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRRGKEGKFDLLVLETCLVEHDEFAWILDSGATNHVCSSFQ-

Query:  GNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRN-----------------------------------------------------------KPTDA
         + F+QL + EM LKVGTGDV+SARAVG AKLFF N                                                           KP + 
Subjt:  GNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRN-----------------------------------------------------------KPTDA

Query:  KAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV
        KA+L+HEMF+TA TQNKRQ++S  +NNTYLWHLRLGHIN+DRI RLVKNGLL  +ED SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV
Subjt:  KAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV

Query:  KARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD
        KA  G+EYFISFIDDYS YGYLYL+ HKSEALEKF+E+K EVENLL K IK LRSDRGGEY+D RFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD
Subjt:  KARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD

Query:  MVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLEPRTRICQFVVYPKETRGGL
        MV SMMSY QLPSSFWGYAV TAVHILNNVPSK+V ETPFELW+GR+PSL +FRIW CP HVLVTNPKKLEPR+R+CQFV YPKETRGGL
Subjt:  MVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLEPRTRICQFVVYPKETRGGL

E2GK51 Gag/pol protein (Fragment) | e value: 1.1e-209 | % identity: 55.3
Query:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEK-------------------------------E
        M++SI+ LL +E+L G+N+  WKSNLNTILVV+DLRFVLTEECP  P   A + V++AY+RW KAN+K                               E
Subjt:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEK-------------------------------E

Query:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQAS--------------------------------------------
        MFG PS+ L H+A+K+++  +M EG SVREHVLDM+  FNIAE NGG + E +Q S                                            
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIAEANGGVVCERSQAS--------------------------------------------

Query:  ----SSKQIQKRKGDKGKAPAQAV---------QGKGKA------KVVADKGRCFHCNADGHWKRNCPRYLAEKRRGK--EGKFDLLVLETCLVEHDEFA
            ++  + KRK  +G +    V         +GKGKA      K  ADKG+CFHCN DGHWKRNCP+YLAEK+  K  +GK+DLLV+ETCLVE D   
Subjt:  ----SSKQIQKRKGDKGKAPAQAV---------QGKGKA------KVVADKGRCFHCNADGHWKRNCPRYLAEKRRGK--EGKFDLLVLETCLVEHDEFA

Query:  WILDSGATNHVCSSFQ-GNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRN-----------------------------------------------
        WILDSGATNH+C SFQ  + +++L +GE+TLKVGTG+VVSA AVG   LFF++                                               
Subjt:  WILDSGATNHVCSSFQ-GNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRN-----------------------------------------------

Query:  ------------KPTDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKMTKRPFTGKGYRA
                    +PT A  +L+ EMF+T ETQNK+QKV   S+N YLWHLRLGHIN++RI+RLVK+G+L  +ED SLPPCESCLEGKMTKR FTGKG RA
Subjt:  ------------KPTDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKMTKRPFTGKGYRA

Query:  KEPLELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPG
        K PLEL+HSDLCGPMNVKAR GYEYFISFIDD+SRYG++YL+ HKSE+ EKF+E+KAEVEN +GKTIKTLRSDRGGEY+D +FQDY+IE GIQSQLSAP 
Subjt:  KEPLELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPG

Query:  TPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLEPRTRICQFVVYP
        TPQQNGVSERRNRTLLDMVRSMMSYAQLP SFWGYA+ TA+HILNNVPSKSV ETP+ELWKGR+ SLRYFRIWGCPAHVLV NPKKLEPR+++C FV YP
Subjt:  TPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLEPRTRICQFVVYP

Query:  KETRGGL
        KE+RGGL
Subjt:  KETRGGL

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein | e value: 2.6e-46 | % identity: 29
Query:  PAQAVQGKGKAKVVADKGRCFHCNADGHWKRNC---PRYLAEKRRGKEGKFD--------LLVLETCLVE-HDEFAWILDSGATNH--------------
        P +  +G  K KV     +C HC  +GH K++C    R L  K +  E +           +V E       D   ++LDSGA++H              
Subjt:  PAQAVQGKGKAKVVADKGRCFHCNADGHWKRNC---PRYLAEKRRGKEGKFD--------LLVLETCLVE-HDEFAWILDSGATNH--------------

Query:  -----VCSSFQG--------------NDFQQLADGEMTLKVGTGDVVSARAVGAAKL---FFRNKPTDAK----AILSHEMFKTAETQNKRQKV--SPLS
             +  + QG              ND +   +  +  K   G+++S + +  A +   F ++  T +K     + +  M       N +     +   
Subjt:  -----VCSSFQG--------------NDFQQLADGEMTLKVGTGDVVSARAVGAAKL---FFRNKPTDAK----AILSHEMFKTAETQNKRQKV--SPLS

Query:  NNTYLWHLRLGHIN------IDRIDRLVKNGLLTGIEDTSLPPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLCGPMNVKARCGYEYFISFIDDYS
        NN  LWH R GHI+      I R +      LL  +E  S   CE CL GK  + PF     +   K PL ++HSD+CGP+         YF+ F+D ++
Subjt:  NNTYLWHLRLGHIN------IDRIDRLVKNGLLTGIEDTSLPPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLCGPMNVKARCGYEYFISFIDDYS

Query:  RYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWG
         Y   YL+ +KS+    FQ+F A+ E      +  L  D G EYL    + + ++ GI   L+ P TPQ NGVSER  RT+ +  R+M+S A+L  SFWG
Subjt:  RYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWG

Query:  YAVATAVHILNNVPSKSV---SETPFELWKGRQPSLRYFRIWGCPAHVLVTNPK-KLEPRTRICQFVVY
         AV TA +++N +PS+++   S+TP+E+W  ++P L++ R++G   +V + N + K + ++    FV Y
Subjt:  YAVATAVHILNNVPSKSV---SETPFELWKGRQPSLRYFRIWGCPAHVLVTNPK-KLEPRTRICQFVVY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.0e-58 | % identity: 30.12
Query:  KQIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRRGKEG-------------KFDLLVL-----ETCL-VEHDEFAWILD
        +  Q+   + G++ A   +GK K +  +    C++CN  GH+KR+CP     K +G+                 D +VL     E C+ +   E  W++D
Subjt:  KQIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRRGKEG-------------KFDLLVL-----ETCL-VEHDEFAWILD

Query:  SGATNH------VCSSFQGNDFQQLADGEMTLK--VGTGDVVSARAVGAA---------------------------KLFFRNKP--------TDAKAIL
        + A++H      +   +   DF  +  G  +     G GD+     VG                             + +F N+           AK + 
Subjt:  SGATNH------VCSSFQGNDFQQLADGEMTLK--VGTGDVVSARAVGAA---------------------------KLFFRNKP--------TDAKAIL

Query:  SHEMFKT-AETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR
           +++T AE        +    +  LWH R+GH++   +  L K  L++  + T++ PC+ CL GK  +  F     R    L+L++SD+CGPM +++ 
Subjt:  SHEMFKT-AETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKAR

Query:  CGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVR
         G +YF++FIDD SR  ++Y++  K +  + FQ+F A VE   G+ +K LRSD GGEY  + F++Y   HGI+ + + PGTPQ NGV+ER NRT+++ VR
Subjt:  CGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVR

Query:  SMMSYAQLPSSFWGYAVATAVHILNNVPSKSVS-ETPFELWKGRQPSLRYFRIWGCP--AHVLVTNPKKLEPRTRICQFVVYPKETRG
        SM+  A+LP SFWG AV TA +++N  PS  ++ E P  +W  ++ S  + +++GC   AHV      KL+ ++  C F+ Y  E  G
Subjt:  SMMSYAQLPSSFWGYAVATAVHILNNVPSKSVS-ETPFELWKGRQPSLRYFRIWGCP--AHVLVTNPKKLEPRTRICQFVVYPKETRG

Q12491 Transposon Ty2-B Gag-Pol polyprotein | e value: 2.9e-21 | % identity: 29.55
Query:  ILSHEMFKTAETQNKRQKVSPLSNNTY---LWHLRLGHINIDRIDRLVKNGLLTGIEDTSLP-------PCESCLEGKMTKRPFTGKGYRAK-----EPL
        I SH    T    NK + V     N Y   L H  LGH N   I + +K   +T ++++ +         C  CL GK TK     KG R K     EP 
Subjt:  ILSHEMFKTAETQNKRQKVSPLSNNTY---LWHLRLGHINIDRIDRLVKNGLLTGIEDTSLP-------PCESCLEGKMTKRPFTGKGYRAK-----EPL

Query:  ELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSE--ALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTP
        + +H+D+ GP++   +    YFISF D+ +R+ ++Y +  + E   L  F    A ++N     +  ++ DRG EY ++    +    GI +  +     
Subjt:  ELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSE--ALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTP

Query:  QQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPS
        + +GV+ER NRTLL+  R+++  + LP+  W  AV  +  I N++ S
Subjt:  QQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 3.3e-41 | % identity: 29.08
Query:  WILDSGATNHVCSSFQGNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRNKPTDAKAIL---------------------SHEMF-------------
        W+LDSGAT+H+ S F      Q   G   + V  G  +     G+  L  +++P +   IL                     S E F             
Subjt:  WILDSGATNHVCSSFQGNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRNKPTDAKAIL---------------------SHEMF-------------

Query:  -----KT---------AETQNKRQKVSPLSNNTY-LWHLRLGHINIDRIDRLVKNGLLTGIEDT-SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSD
             KT         A +Q      SP S  T+  WH RLGH     ++ ++ N  L+ +  +     C  CL  K  K PF+     +  PLE I+SD
Subjt:  -----KT---------AETQNKRQKVSPLSNNTY-LWHLRLGHINIDRIDRLVKNGLLTGIEDT-SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSD

Query:  LCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSER
        +     + +   Y Y++ F+D ++RY +LY +  KS+  E F  FK  +EN     I T  SD GGE++     +Y  +HGI    S P TP+ NG+SER
Subjt:  LCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSER

Query:  RNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVS-ETPFELWKGRQPSLRYFRIWGCPAHVLVT--NPKKLEPRTRICQFVVY
        ++R +++   +++S+A +P ++W YA A AV+++N +P+  +  E+PF+   G  P+    R++GC  +  +   N  KL+ ++R C F+ Y
Subjt:  RNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVS-ETPFELWKGRQPSLRYFRIWGCPAHVLVT--NPKKLEPRTRICQFVVY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 9.4e-44 | % identity: 26.31
Query:  REHVLDMINQFNIAEANGGVVCERSQASSSKQIQKRKGDKGK-----------APAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPR----YLAEKRR
        RE  L  +N   +      VV  R+  +++ + Q  +GD               P+ +       +     GRC  C+  GH  + CP+         ++
Subjt:  REHVLDMINQFNIAEANGGVVCERSQASSSKQIQKRKGDKGK-----------APAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPR----YLAEKRR

Query:  GKEGKFDLLVLETCLVEHDEF---AWILDSGATNHVCSSFQGNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRNKPTDAKAIL--------------
             F        L  +  +    W+LDSGAT+H+ S F    F Q   G   + +  G  +     G+A L   ++  D   +L              
Subjt:  GKEGKFDLLVLETCLVEHDEF---AWILDSGATNHVCSSFQGNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRNKPTDAKAIL--------------

Query:  -------SHEMF------------------KT---------AETQNKRQKVSPLSNNTY-LWHLRLGHINIDRIDRLVKNGLLTGIEDT-SLPPCESCLE
               S E F                  KT         A +Q      SP S  T+  WH RLGH ++  ++ ++ N  L  +  +  L  C  C  
Subjt:  -------SHEMF------------------KT---------AETQNKRQKVSPLSNNTY-LWHLRLGHINIDRIDRLVKNGLLTGIEDT-SLPPCESCLE

Query:  GKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQD
         K  K PF+     + +PLE I+SD+     + +   Y Y++ F+D ++RY +LY +  KS+  + F  FK+ VEN     I TL SD GGE++  R  D
Subjt:  GKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGGEYLDQRFQD

Query:  YMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVS-ETPFELWKGRQPSLRYFRIWGCPAHVLVT--
        Y+ +HGI    S P TP+ NG+SER++R +++M  +++S+A +P ++W YA + AV+++N +P+  +  ++PF+   G+ P+    +++GC  +  +   
Subjt:  YMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVS-ETPFELWKGRQPSLRYFRIWGCPAHVLVT--

Query:  NPKKLEPRTRICQFVVY
        N  KLE +++ C F+ Y
Subjt:  NPKKLEPRTRICQFVVY

Arabidopsis top hits (e value, % identity, alignment)
ATMG00300.1 Gag-Pol-related retrotransposon family protein | e value: 5.9e-09 | % identity: 38.67
Query:  NNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV
        + T LWH RL H++   ++ LVK G L   + +SL  CE C+ GK  +  F+   +  K PL+ +HSDL G  +V
Subjt:  NNTYLWHLRLGHINIDRIDRLVKNGLLTGIEDTSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value: 3.5e-09 | % identity: 39.02
Query:  NRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVS-ETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLEPRTR
        NRT+++ VRSM+    LP +F   A  TAVHI+N  PS +++   P E+W    P+  Y R +GC A++   +  KL+PR +
Subjt:  NRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVS-ETPFELWKGRQPSLRYFRIWGCPAHVLVTNPKKLEPRTR

ATMG00750.1 GAG/POL/ENV polyprotein | e value: 4.1e-10 | % identity: 57.78
Query:  ILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTNILKL
        +L  GF+WPT FKDAH F   CDACQR+GN   R+EMP   IL++
Subjt:  ILHCGFFWPTLFKDAHWFYKQCDACQRRGNLGPRDEMPLTNILKL


Sequences
CDS sequence
ATGTCTTCCTCGATTATCTCCTTGCTTAAAAACGAACAACTAACAGGCGAAAACTTTCCACAATGGAAATCTAATCTCAACACTATACTCGTGGTAGAAGATCTT
AGGTTCGTCTTAACGGAGGAATGTCCTCCCGTTCCCCCTCGTACTGCCGCTCAGGCAGTAAAAGACGCCTACGAACGCTGGACTAAGGCCAATGAAAAGGAGATG
TTTGGACTTCCGTCCTACCAGCTCCACCACGACGCCTTGAAGAACGTCTTCAATGCCAAGATGCTGGAAGGTCAATCTGTTCGGGAACACGTCCTAGACATGATT
AACCAATTCAACATAGCTGAGGCAAATGGCGGGGTTGTCTGCGAGCGCAGTCAGGCTTCTTCATCGAAGCAGATCCAAAAGAGGAAGGGAGACAAGGGGAAGGCT
CCTGCACAGGCTGTGCAAGGGAAGGGAAAGGCCAAGGTCGTGGCCGACAAAGGCAGGTGCTTCCACTGCAATGCAGATGGTCACTGGAAGCGGAACTGTCCCCGT
TACCTTGCTGAGAAGAGAAGAGGAAAAGAAGGTAAATTCGATTTACTTGTTTTAGAGACTTGTCTCGTAGAACATGATGAGTTTGCCTGGATACTGGATTCGGGA
GCCACTAATCATGTTTGTTCTTCTTTTCAGGGAAATGATTTCCAGCAGCTGGCAGACGGTGAAATGACGCTCAAGGTTGGAACAGGAGATGTCGTTTCAGCTCGT
GCAGTGGGAGCTGCAAAATTATTTTTTAGAAATAAACCAACTGATGCTAAGGCTATTTTAAGTCATGAAATGTTTAAAACGGCCGAAACGCAAAACAAAAGGCAA
AAAGTTTCTCCTTTAAGTAACAACACGTATCTTTGGCACCTTCGTCTTGGTCATATTAACATCGATCGGATCGATCGTTTGGTTAAAAATGGACTTCTAACTGGT
ATAGAAGATACATCTTTACCACCCTGTGAATCGTGTCTCGAGGGTAAAATGACTAAGAGGCCTTTTACTGGAAAAGGTTATAGGGCCAAAGAACCACTTGAACTA
ATACATTCGGATCTTTGTGGTCCAATGAATGTAAAGGCTCGATGTGGTTATGAATATTTCATCTCTTTCATAGATGATTATTCTCGATACGGTTACTTGTACCTG
ATGGGCCATAAGTCGGAGGCTCTTGAAAAGTTCCAAGAGTTTAAGGCTGAGGTAGAAAACCTGTTAGGTAAAACGATTAAAACACTTCGATCAGATCGAGGTGGA
GAGTATTTAGATCAGCGATTCCAGGACTATATGATAGAACATGGAATCCAATCACAACTCTCAGCACCTGGTACACCTCAACAGAATGGTGTATCAGAAAGGAGA
AATAGAACCTTGTTAGACATGGTTCGATCTATGATGAGCTATGCTCAGTTGCCTAGCTCGTTTTGGGGGTATGCAGTAGCGACTGCAGTTCATATTCTTAACAAC
GTTCCCTCAAAAAGTGTTTCTGAAACTCCTTTTGAGCTATGGAAGGGGCGTCAACCTAGTTTGCGTTACTTTCGTATCTGGGGTTGTCCTGCACATGTGTTAGTG
ACAAACCCTAAGAAATTGGAACCACGTACGAGAATATGCCAATTTGTTGTGTACCCGAAAGAAACGAGAGGTGGTCTTTCTGCTATGAAGCGGTGTATGTTGGCA
ATTTTTTCTGCTATGATTGAGTCCACTGTTGAGAGAGGGGAGTTCAACTACTGGGCTATAGTGGAGTGTCCCGATTGTAGGAAGGCTTTTGAGACTTTAAAGGTT
GCTTTAATCTCAGCACCCATTCTTTGTGCACCTAATTGGAATTTACCATTTGAGGTAATGTGTGATGCGAGTGATGCTGTGGTAGGTGCTATGCTGGGGCAAAAG
CAAGGCAAATTTATCCATCCTATATATTATGCAAGCAAGGTTTTAAATGAAGCACAAGTCAACTACACAACTACTGAAAAGGAGTTGTTAGCTATGGTGTTTGCC
TTTGAAAAATTCCGACCATATTTGGTTGGATCCAAAGTCACAGTGTTCACAGATCATGCAGCAATAAGGTGTGTTTCAGGTGATGAAGCAAAGGAAATCCTGGAG
CAATGTCATTCTTCGTCGTATGGAGGTCATTTTAGCGGTCAGAGGACAGCTATGAGGATTTTACATTGTGGATTCTTTTGGCCTACCTTATTCAAGGATGCCCAT
TGGTTCTACAAGCAATGTGATGCTTGCCAAAGGAGAGGAAACTTGGGGCCTAGAGATGAAATGCCTCTTACGAATATTTTGAAATTGAATTATTCGATGTATGGG
TGGGTGGAGGCCATTGCATGCCATCAGAGTGATGCCAAGACAGTAGCAAGGTTTCTTCAATTGCACATCTTTGTGCGGTTTGGGACACCTAGGGCTCTAGGGATT
AAGCATAGGATAGCTACCCGTTATCACCCACAAGTAAATGGTCAAGCTGAGATTAGTGATAGGGAAATAAAATTGATTTTAGAGAAAATAGTCCATCCATCTAGA
AAGGATTGGTCTTTTAGGTTGGATGAGGCTCTTTGGGCCTATAGGACAACCTATAAGACTCCTCTAGATACGGACGAAGTTCAAAATCCTGAAGTAGAACCGATA
GTCACAGATACGGTTCAAGAGGAAAATGCTGAGGAGAATCAAGAACAGAAGGTTGACATGGTGAGAGACGAGCAGACAGACGTGGTGCCTGAAAGAGGGAACGAG
CAGGAGCAAGAAGCTCGTGTTGAGGTAATCATGCCCGAACCACCAAAATGTCGCCGCATTAAGCGAAAGACCGGCCGCGTTCAGGTGATTCGGACTGATACCCCA
TCACCACCATCATCAGATTCTGAGAGAGAGAAGACAGAACGAGAGGAAAGAGAGAAAAAAGAAGCTGAGGACAAATTGCGGGAAAAAGCAAAGAAGAAGATTGAG
GAAGAGCGGTTGCTCAAGCGCAGGGCGGAAAAGGGCAAAAATGTTGCTGAAGCATCAGAGGATCACGATGAAATAGAAGAAAACAACCATGGTTGGGAGTTGTTT
TGTGCGAAGCCTGAGTCTGTAAACGCGCAGGTGGTACATGAATTTTATGCTAACATTGACAAAGAAGATGGTTTCCAGGTATTGAAGGAGCACAGTGGCAGTTGT
CCAAGACTCAAAGAGGACATTCCAGGCGGCTTATTTGAAAAGGGAAGTAAACATGTGGATGGGATTTATAAGACAGAGGATGCTTCCAACAACACACGACTCGAC
AATCTCCAAGGAACGAGTTCTTCTAGCTTTTGCAATCTTGCGGTCACTCAACATTGA
mRNA sequence
ATGTCTTCCTCGATTATCTCCTTGCTTAAAAACGAACAACTAACAGGCGAAAACTTTCCACAATGGAAATCTAATCTCAACACTATACTCGTGGTAGAAGATCTT
AGGTTCGTCTTAACGGAGGAATGTCCTCCCGTTCCCCCTCGTACTGCCGCTCAGGCAGTAAAAGACGCCTACGAACGCTGGACTAAGGCCAATGAAAAGGAGATG
TTTGGACTTCCGTCCTACCAGCTCCACCACGACGCCTTGAAGAACGTCTTCAATGCCAAGATGCTGGAAGGTCAATCTGTTCGGGAACACGTCCTAGACATGATT
AACCAATTCAACATAGCTGAGGCAAATGGCGGGGTTGTCTGCGAGCGCAGTCAGGCTTCTTCATCGAAGCAGATCCAAAAGAGGAAGGGAGACAAGGGGAAGGCT
CCTGCACAGGCTGTGCAAGGGAAGGGAAAGGCCAAGGTCGTGGCCGACAAAGGCAGGTGCTTCCACTGCAATGCAGATGGTCACTGGAAGCGGAACTGTCCCCGT
TACCTTGCTGAGAAGAGAAGAGGAAAAGAAGGTAAATTCGATTTACTTGTTTTAGAGACTTGTCTCGTAGAACATGATGAGTTTGCCTGGATACTGGATTCGGGA
GCCACTAATCATGTTTGTTCTTCTTTTCAGGGAAATGATTTCCAGCAGCTGGCAGACGGTGAAATGACGCTCAAGGTTGGAACAGGAGATGTCGTTTCAGCTCGT
GCAGTGGGAGCTGCAAAATTATTTTTTAGAAATAAACCAACTGATGCTAAGGCTATTTTAAGTCATGAAATGTTTAAAACGGCCGAAACGCAAAACAAAAGGCAA
AAAGTTTCTCCTTTAAGTAACAACACGTATCTTTGGCACCTTCGTCTTGGTCATATTAACATCGATCGGATCGATCGTTTGGTTAAAAATGGACTTCTAACTGGT
ATAGAAGATACATCTTTACCACCCTGTGAATCGTGTCTCGAGGGTAAAATGACTAAGAGGCCTTTTACTGGAAAAGGTTATAGGGCCAAAGAACCACTTGAACTA
ATACATTCGGATCTTTGTGGTCCAATGAATGTAAAGGCTCGATGTGGTTATGAATATTTCATCTCTTTCATAGATGATTATTCTCGATACGGTTACTTGTACCTG
ATGGGCCATAAGTCGGAGGCTCTTGAAAAGTTCCAAGAGTTTAAGGCTGAGGTAGAAAACCTGTTAGGTAAAACGATTAAAACACTTCGATCAGATCGAGGTGGA
GAGTATTTAGATCAGCGATTCCAGGACTATATGATAGAACATGGAATCCAATCACAACTCTCAGCACCTGGTACACCTCAACAGAATGGTGTATCAGAAAGGAGA
AATAGAACCTTGTTAGACATGGTTCGATCTATGATGAGCTATGCTCAGTTGCCTAGCTCGTTTTGGGGGTATGCAGTAGCGACTGCAGTTCATATTCTTAACAAC
GTTCCCTCAAAAAGTGTTTCTGAAACTCCTTTTGAGCTATGGAAGGGGCGTCAACCTAGTTTGCGTTACTTTCGTATCTGGGGTTGTCCTGCACATGTGTTAGTG
ACAAACCCTAAGAAATTGGAACCACGTACGAGAATATGCCAATTTGTTGTGTACCCGAAAGAAACGAGAGGTGGTCTTTCTGCTATGAAGCGGTGTATGTTGGCA
ATTTTTTCTGCTATGATTGAGTCCACTGTTGAGAGAGGGGAGTTCAACTACTGGGCTATAGTGGAGTGTCCCGATTGTAGGAAGGCTTTTGAGACTTTAAAGGTT
GCTTTAATCTCAGCACCCATTCTTTGTGCACCTAATTGGAATTTACCATTTGAGGTAATGTGTGATGCGAGTGATGCTGTGGTAGGTGCTATGCTGGGGCAAAAG
CAAGGCAAATTTATCCATCCTATATATTATGCAAGCAAGGTTTTAAATGAAGCACAAGTCAACTACACAACTACTGAAAAGGAGTTGTTAGCTATGGTGTTTGCC
TTTGAAAAATTCCGACCATATTTGGTTGGATCCAAAGTCACAGTGTTCACAGATCATGCAGCAATAAGGTGTGTTTCAGGTGATGAAGCAAAGGAAATCCTGGAG
CAATGTCATTCTTCGTCGTATGGAGGTCATTTTAGCGGTCAGAGGACAGCTATGAGGATTTTACATTGTGGATTCTTTTGGCCTACCTTATTCAAGGATGCCCAT
TGGTTCTACAAGCAATGTGATGCTTGCCAAAGGAGAGGAAACTTGGGGCCTAGAGATGAAATGCCTCTTACGAATATTTTGAAATTGAATTATTCGATGTATGGG
TGGGTGGAGGCCATTGCATGCCATCAGAGTGATGCCAAGACAGTAGCAAGGTTTCTTCAATTGCACATCTTTGTGCGGTTTGGGACACCTAGGGCTCTAGGGATT
AAGCATAGGATAGCTACCCGTTATCACCCACAAGTAAATGGTCAAGCTGAGATTAGTGATAGGGAAATAAAATTGATTTTAGAGAAAATAGTCCATCCATCTAGA
AAGGATTGGTCTTTTAGGTTGGATGAGGCTCTTTGGGCCTATAGGACAACCTATAAGACTCCTCTAGATACGGACGAAGTTCAAAATCCTGAAGTAGAACCGATA
GTCACAGATACGGTTCAAGAGGAAAATGCTGAGGAGAATCAAGAACAGAAGGTTGACATGGTGAGAGACGAGCAGACAGACGTGGTGCCTGAAAGAGGGAACGAG
CAGGAGCAAGAAGCTCGTGTTGAGGTAATCATGCCCGAACCACCAAAATGTCGCCGCATTAAGCGAAAGACCGGCCGCGTTCAGGTGATTCGGACTGATACCCCA
TCACCACCATCATCAGATTCTGAGAGAGAGAAGACAGAACGAGAGGAAAGAGAGAAAAAAGAAGCTGAGGACAAATTGCGGGAAAAAGCAAAGAAGAAGATTGAG
GAAGAGCGGTTGCTCAAGCGCAGGGCGGAAAAGGGCAAAAATGTTGCTGAAGCATCAGAGGATCACGATGAAATAGAAGAAAACAACCATGGTTGGGAGTTGTTT
TGTGCGAAGCCTGAGTCTGTAAACGCGCAGGTGGTACATGAATTTTATGCTAACATTGACAAAGAAGATGGTTTCCAGGTATTGAAGGAGCACAGTGGCAGTTGT
CCAAGACTCAAAGAGGACATTCCAGGCGGCTTATTTGAAAAGGGAAGTAAACATGTGGATGGGATTTATAAGACAGAGGATGCTTCCAACAACACACGACTCGAC
AATCTCCAAGGAACGAGTTCTTCTAGCTTTTGCAATCTTGCGGTCACTCAACATTGA
Protein sequence
MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKEMFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMI
NQFNIAEANGGVVCERSQASSSKQIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGHWKRNCPRYLAEKRRGKEGKFDLLVLETCLVEHDEFAWILDSG
ATNHVCSSFQGNDFQQLADGEMTLKVGTGDVVSARAVGAAKLFFRNKPTDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDRIDRLVKNGLLTG
IEDTSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARCGYEYFISFIDDYSRYGYLYLMGHKSEALEKFQEFKAEVENLLGKTIKTLRSDRGG
EYLDQRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVATAVHILNNVPSKSVSETPFELWKGRQPSLRYFRIWGCPAHVLV
TNPKKLEPRTRICQFVVYPKETRGGLSAMKRCMLAIFSAMIESTVERGEFNYWAIVECPDCRKAFETLKVALISAPILCAPNWNLPFEVMCDASDAVVGAMLGQK
QGKFIHPIYYASKVLNEAQVNYTTTEKELLAMVFAFEKFRPYLVGSKVTVFTDHAAIRCVSGDEAKEILEQCHSSSYGGHFSGQRTAMRILHCGFFWPTLFKDAH
WFYKQCDACQRRGNLGPRDEMPLTNILKLNYSMYGWVEAIACHQSDAKTVARFLQLHIFVRFGTPRALGIKHRIATRYHPQVNGQAEISDREIKLILEKIVHPSR
KDWSFRLDEALWAYRTTYKTPLDTDEVQNPEVEPIVTDTVQEENAEENQEQKVDMVRDEQTDVVPERGNEQEQEARVEVIMPEPPKCRRIKRKTGRVQVIRTDTP
SPPSSDSEREKTEREEREKKEAEDKLREKAKKKIEEERLLKRRAEKGKNVAEASEDHDEIEENNHGWELFCAKPESVNAQVVHEFYANIDKEDGFQVLKEHSGSC
PRLKEDIPGGLFEKGSKHVDGIYKTEDASNNTRLDNLQGTSSSSFCNLAVTQH
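
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame; the names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
# Standard genetic code (NCBI translation table 1), with codons ordered
# by base T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted as-is.
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases (e.g. N) are reported as 'X'.
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons and last five codons of the CDS listed above.
print(translate("ATGTCTTCCTCGATTATCTCC"))
print(translate("GTCACTCAACATTGA"))
```

Applied to the full CDS, the result can be compared with the protein sequence above; the two fragments shown correspond to its first residues (MSSSIIS) and its last residues (VTQH).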