CuGenDBv2

Lag0018420 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0018420
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Gag/pol protein
Genome location: chr5:26462065..26463914
RNA-Seq Expression: Lag0018420
Synteny: Lag0018420
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
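
The genome location string can be split into its parts programmatically. The following is a minimal sketch, assuming the `chrom:start..end` form shown above uses 1-based inclusive coordinates (an assumption; the page does not state its coordinate convention). The names `Location` and `parse_location` are hypothetical and are not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Length in bp under the assumed 1-based, inclusive convention.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr5:26462065..26463914")
print(loc.chrom, loc.start, loc.end, loc.span)  # chr5 26462065 26463914 1850
```

Under the stated assumption the gene spans 1850 bp on chr5.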


Homology
GenBank top hits
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 3.9e-208 | % identity: 66.03
Query:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE
        MSSSII+LLK +QLTGEN+  WKS LN ILV+ DL FVL EECPP P + A+Q+V+DAY+RWTKAN+K +++IL S+S++L+K++E + TAR+IM+SL+E
Subjt:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE

Query:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQSKGKRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGD
        MFG PS Q                                                    +K EANVAHSK++F          VP  S S++IQKRK  
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQSKGKRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGD

Query:  KGKAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDLLVLETCLVEHDEFSWILDSGATNHVCSSFQ-GNNFQQLAEGEMT
        KGK P  AV+ KGKAK VA K +CFHCN   D HWK NCP+YL  K++EKEGK+DLLVLETCLVE+D+ +WILDSGATNHVCSS Q  ++F+QL + EMT
Subjt:  KGKAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDLLVLETCLVEHDEFSWILDSGATNHVCSSFQ-GNNFQQLAEGEMT

Query:  LKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSFLLNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAE
        LKVGTGDV+SARAVG AKLFF N+F+ LENLY+VP+IKRNL+SVS L+E  YS++F +NEA I KNGV+ICSAK ENNL+VLRPN+AKA+L+HEMF+TA 
Subjt:  LKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSFLLNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAE

Query:  TQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFI
        TQNKRQ++SP +NNTYLWHLRLGHIN+D I RLVKNGLL+ ++D SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGG+EYFISFI
Subjt:  TQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFI

Query:  DDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQ
        DDYSRYGYLYLM HKSEALE+FKE+K EVENLL K IK LRSDRGGEY+D RFQDYMIEHGIQSQLSAPGTPQQ
Subjt:  DDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQ

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 2.4e-205 | % identity: 60.45
Query:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE
        M+S+ +++L  ++L G N+  WK+ +NT+L+++DLRFVL EECP VP   A + V++ YERW KANEK + YIL SLSEVLAK++E++ TAREIM+SLQE
Subjt:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE

Query:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQ-------------------------------------------SKG
        MFG  SYQ+ HDALK ++NA+M EG SVREHVL+M+  FN+ E NG V+ E SQ                                            KG
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQ-------------------------------------------SKG

Query:  KRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDLL
        +   KGEANVA S +KF +GS+SGTKS+P +S +K+ +K+KG +G   A     K   K  A KG CFHCN   +GHWK NCP+YLAEK++ K+GK+DLL
Subjt:  KRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDLL

Query:  VLETCLVEHDEFSWILDSGATNHVCSSFQG-NNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSFL
        VLETCLVE+D+ +WI+DSGATNHVCSSFQG ++++QL  GEMT++VGTG VVSA AVG  +L  +  FL+LEN+Y+VP +KRNLISV  LLEQ YS++F 
Subjt:  VLETCLVEHDEFSWILDSGATNHVCSSFQG-NNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSFL

Query:  LNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEGK
        +N+  I KNGV ICSAK ENNL+VLR   +KA+L+ EMFKTA TQNKR K+SP   N +LWHLRLGHIN++ I+RLVKNGLLS++E+ SLP CESCLEGK
Subjt:  LNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEGK

Query:  MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDYM
        MTKRPFTGKG+RAKEPLEL+HSDLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALE+FKE+KAEVEN L K IKT RSDRGGEY+D +FQ+Y+
Subjt:  MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDYM

Query:  IEHGIQSQLSAPGTPQQ
        +E GI SQLSAPGTPQQ
Subjt:  IEHGIQSQLSAPGTPQQ

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 1.1e-205 | % identity: 60.84
Query:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE
        M+S+ +++L  ++L G N+  WK+ +NT+L+++DLRFVL EECP VP   A + V++ YERW KANEK + YIL SLSEVLAK++E++ TAREIM+SLQE
Subjt:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE

Query:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQ-------------------------------------------SKG
        MFG  SYQ+ HDALK ++NA+M EG SVREHVL+M+  FN+ E NG V+ E SQ                                            KG
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQ-------------------------------------------SKG

Query:  KRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGDKG-KAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDL
        +   KGEANVA S +KF +GS+SGTKS+P +S +K+ +K+KG +G KA   A +   KAK  A KG CFHCN   +GHWK NCP+YLAEK++ K+GK+DL
Subjt:  KRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGDKG-KAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDL

Query:  LVLETCLVEHDEFSWILDSGATNHVCSSFQG-NNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSF
        LVLETCLVE+D+ +WI+DSGATNHVCSSFQG +++QQL  GEMT++VGTG VVSA AVG  +L+ +  FL+LEN+Y+VP +KRNLISV  LLEQ YS++F
Subjt:  LVLETCLVEHDEFSWILDSGATNHVCSSFQG-NNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSF

Query:  LLNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEG
         +N+  I KNGV ICSAK ENNL+VLR   +KA+L+ EMFKTA TQNKR K+SP   N +LWHLRLGHIN++ I+RLVKNGLLS++E+ SLP CESCLEG
Subjt:  LLNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEG

Query:  KMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDY
        KMTKRPFTGKG+RAKEPLEL+HS+LCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALE+FKE+KAEVEN L K IKT RSDRGGEY+D +FQ+Y
Subjt:  KMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDY

Query:  MIEHGIQSQLSAPGTPQQ
        ++E GI SQLSAPGTPQQ
Subjt:  MIEHGIQSQLSAPGTPQQ

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 3.1e-205 | % identity: 60.84
Query:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE
        M+S+ +++L  ++L G N+  WK+ +NT+L+++DLRFVL EECP VP   A + V++ YERW KANEK + YIL SLSEVLAK++E++ TAREIM+SLQE
Subjt:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE

Query:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQ-------------------------------------------SKG
        MFG  SYQ+ HDALK ++NA+M EG SVREHVL+M+  FN+ E NG V+ E SQ                                            KG
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQ-------------------------------------------SKG

Query:  KRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGDKG-KAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDL
        +   KGEANVA S +KF +GS+SGTKS+P +S +K+ +K+KG +G KA   A +   KAK  A KG CFHCN   +GHWK NCP+YLAEK++ K+GK+DL
Subjt:  KRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGDKG-KAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDL

Query:  LVLETCLVEHDEFSWILDSGATNHVCSSFQG-NNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSF
        LVLETCLVE+D+ +WI+DSGATNHVCSSFQG ++++QL  GEMT++VGTG VVSA AVG  +L  +  FL+LEN+Y+VP +KRNLISV  LLEQ YS++F
Subjt:  LVLETCLVEHDEFSWILDSGATNHVCSSFQG-NNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSF

Query:  LLNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEG
         +N+  I KNGV ICSAK ENNL+VLR   +KA+L+ EMFKTA TQNKR K+SP   N +LWHLRLGHIN++ I+RLVKNGLLS++E+ SLP CESCLEG
Subjt:  LLNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEG

Query:  KMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDY
        KMTKRPFTGKG+RAKEPLEL+HSDLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALE+FKE+KAEVEN L K IKT RSDRGGEY+D +FQ+Y
Subjt:  KMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDY

Query:  MIEHGIQSQLSAPGTPQQ
        ++E GI SQLSAPGTPQQ
Subjt:  MIEHGIQSQLSAPGTPQQ

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 2.4e-205 | % identity: 60.45
Query:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE
        M+S+ +++L  ++L G N+  WK+ +NT+L+++DLRFVL EECP VP   A + V++ YERW KANEK + YIL SLSEVLAK++E++ TAREIM+SLQE
Subjt:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE

Query:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQ-------------------------------------------SKG
        MFG  SYQ+ HDALK ++NA+M EG SVREHVL+M+  FN+ E NG V+ E SQ                                            KG
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQ-------------------------------------------SKG

Query:  KRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDLL
        +   KGEANVA S +KF +GS+SGTKS+P +S +K+ +K+KG +G   A     K   K  A KG CFHCN   +GHWK NCP+YLAEK++ K+GK+DLL
Subjt:  KRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDLL

Query:  VLETCLVEHDEFSWILDSGATNHVCSSFQG-NNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSFL
        VLETCLVE+D+ +WI+DSGATNHVCSSFQG ++++QL  GEMT++VGTG VVSA AVG  +L  +  FL+LEN+Y+VP +KRNLISV  LLEQ YS++F 
Subjt:  VLETCLVEHDEFSWILDSGATNHVCSSFQG-NNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSFL

Query:  LNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEGK
        +N+  I KNGV ICSAK ENNL+VLR   +KA+L+ EMFKTA TQNKR K+SP   N +LWHLRLGHIN++ I+RLVKNGLLS++E+ SLP CESCLEGK
Subjt:  LNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEGK

Query:  MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDYM
        MTKRPFTGKG+RAKEPLEL+HSDLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALE+FKE+KAEVEN L K IKT RSDRGGEY+D +FQ+Y+
Subjt:  MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDYM

Query:  IEHGIQSQLSAPGTPQQ
        +E GI SQLSAPGTPQQ
Subjt:  IEHGIQSQLSAPGTPQQ

TrEMBL top hits
A0A5A7SMH8 Gag/pol protein | e value: 1.5e-205 | % identity: 60.84
Query:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE
        M+S+ +++L  ++L G N+  WK+ +NT+L+++DLRFVL EECP VP   A + V++ YERW KANEK + YIL SLSEVLAK++E++ TAREIM+SLQE
Subjt:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE

Query:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQ-------------------------------------------SKG
        MFG  SYQ+ HDALK ++NA+M EG SVREHVL+M+  FN+ E NG V+ E SQ                                            KG
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQ-------------------------------------------SKG

Query:  KRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGDKG-KAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDL
        +   KGEANVA S +KF +GS+SGTKS+P +S +K+ +K+KG +G KA   A +   KAK  A KG CFHCN   +GHWK NCP+YLAEK++ K+GK+DL
Subjt:  KRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGDKG-KAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDL

Query:  LVLETCLVEHDEFSWILDSGATNHVCSSFQG-NNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSF
        LVLETCLVE+D+ +WI+DSGATNHVCSSFQG ++++QL  GEMT++VGTG VVSA AVG  +L  +  FL+LEN+Y+VP +KRNLISV  LLEQ YS++F
Subjt:  LVLETCLVEHDEFSWILDSGATNHVCSSFQG-NNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSF

Query:  LLNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEG
         +N+  I KNGV ICSAK ENNL+VLR   +KA+L+ EMFKTA TQNKR K+SP   N +LWHLRLGHIN++ I+RLVKNGLLS++E+ SLP CESCLEG
Subjt:  LLNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEG

Query:  KMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDY
        KMTKRPFTGKG+RAKEPLEL+HSDLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALE+FKE+KAEVEN L K IKT RSDRGGEY+D +FQ+Y
Subjt:  KMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDY

Query:  MIEHGIQSQLSAPGTPQQ
        ++E GI SQLSAPGTPQQ
Subjt:  MIEHGIQSQLSAPGTPQQ

A0A5A7TU93 Gag/pol protein | e value: 5.2e-206 | % identity: 60.84
Query:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE
        M+S+ +++L  ++L G N+  WK+ +NT+L+++DLRFVL EECP VP   A + V++ YERW KANEK + YIL SLSEVLAK++E++ TAREIM+SLQE
Subjt:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE

Query:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQ-------------------------------------------SKG
        MFG  SYQ+ HDALK ++NA+M EG SVREHVL+M+  FN+ E NG V+ E SQ                                            KG
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQ-------------------------------------------SKG

Query:  KRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGDKG-KAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDL
        +   KGEANVA S +KF +GS+SGTKS+P +S +K+ +K+KG +G KA   A +   KAK  A KG CFHCN   +GHWK NCP+YLAEK++ K+GK+DL
Subjt:  KRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGDKG-KAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDL

Query:  LVLETCLVEHDEFSWILDSGATNHVCSSFQG-NNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSF
        LVLETCLVE+D+ +WI+DSGATNHVCSSFQG +++QQL  GEMT++VGTG VVSA AVG  +L+ +  FL+LEN+Y+VP +KRNLISV  LLEQ YS++F
Subjt:  LVLETCLVEHDEFSWILDSGATNHVCSSFQG-NNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSF

Query:  LLNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEG
         +N+  I KNGV ICSAK ENNL+VLR   +KA+L+ EMFKTA TQNKR K+SP   N +LWHLRLGHIN++ I+RLVKNGLLS++E+ SLP CESCLEG
Subjt:  LLNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEG

Query:  KMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDY
        KMTKRPFTGKG+RAKEPLEL+HS+LCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALE+FKE+KAEVEN L K IKT RSDRGGEY+D +FQ+Y
Subjt:  KMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDY

Query:  MIEHGIQSQLSAPGTPQQ
        ++E GI SQLSAPGTPQQ
Subjt:  MIEHGIQSQLSAPGTPQQ

A0A5A7TZD0 Gag/pol protein | e value: 1.9e-208 | % identity: 66.03
Query:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE
        MSSSII+LLK +QLTGEN+  WKS LN ILV+ DL FVL EECPP P + A+Q+V+DAY+RWTKAN+K +++IL S+S++L+K++E + TAR+IM+SL+E
Subjt:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE

Query:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQSKGKRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGD
        MFG PS Q                                                    +K EANVAHSK++F          VP  S S++IQKRK  
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQSKGKRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGD

Query:  KGKAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDLLVLETCLVEHDEFSWILDSGATNHVCSSFQ-GNNFQQLAEGEMT
        KGK P  AV+ KGKAK VA K +CFHCN   D HWK NCP+YL  K++EKEGK+DLLVLETCLVE+D+ +WILDSGATNHVCSS Q  ++F+QL + EMT
Subjt:  KGKAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDLLVLETCLVEHDEFSWILDSGATNHVCSSFQ-GNNFQQLAEGEMT

Query:  LKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSFLLNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAE
        LKVGTGDV+SARAVG AKLFF N+F+ LENLY+VP+IKRNL+SVS L+E  YS++F +NEA I KNGV+ICSAK ENNL+VLRPN+AKA+L+HEMF+TA 
Subjt:  LKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSFLLNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAE

Query:  TQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFI
        TQNKRQ++SP +NNTYLWHLRLGHIN+D I RLVKNGLL+ ++D SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGG+EYFISFI
Subjt:  TQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFI

Query:  DDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQ
        DDYSRYGYLYLM HKSEALE+FKE+K EVENLL K IK LRSDRGGEY+D RFQDYMIEHGIQSQLSAPGTPQQ
Subjt:  DDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAPGTPQQ

A0A5D3CPJ6 Gag/pol protein | e value: 1.2e-205 | % identity: 60.45
Query:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE
        M+S+ +++L  ++L G N+  WK+ +NT+L+++DLRFVL EECP VP   A + V++ YERW KANEK + YIL SLSEVLAK++E++ TAREIM+SLQE
Subjt:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE

Query:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQ-------------------------------------------SKG
        MFG  SYQ+ HDALK ++NA+M EG SVREHVL+M+  FN+ E NG V+ E SQ                                            KG
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQ-------------------------------------------SKG

Query:  KRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDLL
        +   KGEANVA S +KF +GS+SGTKS+P +S +K+ +K+KG +G   A     K   K  A KG CFHCN   +GHWK NCP+YLAEK++ K+GK+DLL
Subjt:  KRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDLL

Query:  VLETCLVEHDEFSWILDSGATNHVCSSFQG-NNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSFL
        VLETCLVE+D+ +WI+DSGATNHVCSSFQG ++++QL  GEMT++VGTG VVSA AVG  +L  +  FL+LEN+Y+VP +KRNLISV  LLEQ YS++F 
Subjt:  VLETCLVEHDEFSWILDSGATNHVCSSFQG-NNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSFL

Query:  LNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEGK
        +N+  I KNGV ICSAK ENNL+VLR   +KA+L+ EMFKTA TQNKR K+SP   N +LWHLRLGHIN++ I+RLVKNGLLS++E+ SLP CESCLEGK
Subjt:  LNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEGK

Query:  MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDYM
        MTKRPFTGKG+RAKEPLEL+HSDLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALE+FKE+KAEVEN L K IKT RSDRGGEY+D +FQ+Y+
Subjt:  MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDYM

Query:  IEHGIQSQLSAPGTPQQ
        +E GI SQLSAPGTPQQ
Subjt:  IEHGIQSQLSAPGTPQQ

A0A5D3CSZ6 Gag/pol protein | e value: 1.2e-205 | % identity: 60.45
Query:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE
        M+S+ +++L  ++L G N+  WK+ +NT+L+++DLRFVL EECP VP   A + V++ YERW KANEK + YIL SLSEVLAK++E++ TAREIM+SLQE
Subjt:  MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE

Query:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQ-------------------------------------------SKG
        MFG  SYQ+ HDALK ++NA+M EG SVREHVL+M+  FN+ E NG V+ E SQ                                            KG
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQ-------------------------------------------SKG

Query:  KRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDLL
        +   KGEANVA S +KF +GS+SGTKS+P +S +K+ +K+KG +G   A     K   K  A KG CFHCN   +GHWK NCP+YLAEK++ K+GK+DLL
Subjt:  KRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDLL

Query:  VLETCLVEHDEFSWILDSGATNHVCSSFQG-NNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSFL
        VLETCLVE+D+ +WI+DSGATNHVCSSFQG ++++QL  GEMT++VGTG VVSA AVG  +L  +  FL+LEN+Y+VP +KRNLISV  LLEQ YS++F 
Subjt:  VLETCLVEHDEFSWILDSGATNHVCSSFQG-NNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSFL

Query:  LNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEGK
        +N+  I KNGV ICSAK ENNL+VLR   +KA+L+ EMFKTA TQNKR K+SP   N +LWHLRLGHIN++ I+RLVKNGLLS++E+ SLP CESCLEGK
Subjt:  LNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEGK

Query:  MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDYM
        MTKRPFTGKG+RAKEPLEL+HSDLCGPMNVKARGG+EYFI+F DDYSRYGY+YLM HKSEALE+FKE+KAEVEN L K IKT RSDRGGEY+D +FQ+Y+
Subjt:  MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDYM

Query:  IEHGIQSQLSAPGTPQQ
        +E GI SQLSAPGTPQQ
Subjt:  IEHGIQSQLSAPGTPQQ

SwissProt top hits
P04146 Copia protein | e value: 1.9e-27 | % identity: 23.89
Query:  GENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALK
        GE +  WK  +  +L  +D+  V+    P            +  + W KA    K  I+  LS+       +  TAR+I+ +L  ++   S        K
Subjt:  GENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHHDALK

Query:  NVFNAKMLEGQSVREHV-------------------LDMINQFNIV--EANGGVV--CERSQSKGKRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSK
         + + K+    S+  H                    +D I+   I       G++   E    +   +   +  +   + K     +  +K V  A    
Subjt:  NVFNAKMLEGQSVREHV-------------------LDMINQFNIV--EANGGVV--CERSQSKGKRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSK

Query:  RIQKRKGDKGK----APAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNC---PRYLAEKRREKEGKFD--------LLVLETCLVE-HDEFSWILDSG
             K +  K     P +  +G  K KV     +C HC    +GH K +C    R L  K +E E +           +V E       D   ++LDSG
Subjt:  RIQKRKGDKGK----APAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNC---PRYLAEKRREKEGKFD--------LLVLETCLVE-HDEFSWILDSG

Query:  ATNHVCSSFQGNNFQQLAEGEMTLKVGT---GDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSFLLNEALISKNGVYIC-SA
        A++H+ +    + +    E    LK+     G+ + A   G  +L   +  + LE++        NL+SV  L E   S+ F  +   ISKNG+ +  ++
Subjt:  ATNHVCSSFQGNNFQQLAEGEMTLKVGT---GDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSFLLNEALISKNGVYIC-SA

Query:  KRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIE-----DTSLPPCESCLEGKMTKRPFTGKGY
           NN+ V+                A + N + K     NN  LWH R GHI+   +  + +  + SD       + S   CE CL GK  + PF     
Subjt:  KRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIE-----DTSLPPCESCLEGKMTKRPFTGKGY

Query:  RA--KEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDYMIEHGIQSQL
        +   K PL ++HSD+CGP+         YF+ F+D ++ Y   YL+ +KS+    F++F A+ E      +  L  D G EYL    + + ++ GI   L
Subjt:  RA--KEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDYMIEHGIQSQL

Query:  SAPGTPQ
        + P TPQ
Subjt:  SAPGTPQ

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 8.6e-41 | % identity: 24.96
Query:  QLTGEN-FPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHH
        +  G+N F  W+  +  +L+ + L  VL  +        A        E W   +E+    I + LS+ +     + +TAR I   L+ ++   +     
Subjt:  QLTGEN-FPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLHH

Query:  DALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQS--------------------KGKRVV--KGEANVAHSKKKFLKGSSSGTKSVPQAS
           K ++   M EG +   H L++ N      AN GV  E                         GK  +  K   +     +K  K   +  +++    
Subjt:  DALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQS--------------------KGKRVV--KGEANVAHSKKKFLKGSSSGTKSVPQAS

Query:  SSKRIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFD-------------LLVL---ETCL-VEHDEFSWI
          +  Q+   + G++ A   +GK K +  +    C++CN    GH+K +CP     K      K D             +L +   E C+ +   E  W+
Subjt:  SSKRIQKRKGDKGKAPAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFD-------------LLVL---ETCL-VEHDEFSWI

Query:  LDSGATNH------VCSSFQGNNFQQLAEGEMTLK--VGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSFLLNEALISK
        +D+ A++H      +   +   +F  +  G  +     G GD+     VG          L+L+++  VP ++ NLIS  AL    Y   F   +  ++K
Subjt:  LDSGATNH------VCSSFQGNNFQQLAEGEMTLK--VGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSFLLNEALISK

Query:  NGVYICSAKRENNLFVLRPNDAKAILSHEMFKT-AETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEGKMTKRPFT
          + I                AK +    +++T AE        +    +  LWH R+GH++   +  L K  L+S  + T++ PC+ CL GK  +  F 
Subjt:  NGVYICSAKRENNLFVLRPNDAKAILSHEMFKT-AETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEGKMTKRPFT

Query:  GKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDYMIEHGIQS
            R    L+L++SD+CGPM +++ GG +YF++FIDD SR  ++Y++  K +  + F++F A VE   G+ +K LRSD GGEY  + F++Y   HGI+ 
Subjt:  GKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDYMIEHGIQS

Query:  QLSAPGTPQ
        + + PGTPQ
Subjt:  QLSAPGTPQ

Q12491 Transposon Ty2-B Gag-Pol polyprotein | e value: 8.1e-15 | % identity: 26.38
Query:  ILDSGATNHVCSSFQGNNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSFLLNEALISKNGVYICS
        ++DSGA+  +  S    +       E+ +       +   A+G     F+N           P I  +L+S+S L  Q  +  F  N  L   +G  +  
Subjt:  ILDSGATNHVCSSFQGNNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLEQCYSVSFLLNEALISKNGVYICS

Query:  AKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTY---LWHLRLGHINIDWIDRLVKNGLL-----SDIE--DTSLPPCESCLEGKMTKRP
          +  + + L  +    I SH    T    NK + V     N Y   L H  LGH N   I + +K   +     SDIE  + S   C  CL GK TK  
Subjt:  AKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTY---LWHLRLGHINIDWIDRLVKNGLL-----SDIE--DTSLPPCESCLEGKMTKRP

Query:  FTGKGYRAK-----EPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSE--ALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQD
           KG R K     EP + +H+D+ GP++   +    YFISF D+ +R+ ++Y +  + E   L  F    A ++N     +  ++ DRG EY ++    
Subjt:  FTGKGYRAK-----EPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSE--ALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQD

Query:  YMIEHGI
        +    GI
Subjt:  YMIEHGI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 4.0e-30 | % identity: 30.23
Query:  SWILDSGATNHVCSSFQGNNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLE------QCYSVSFLLNEALIS
        +W+LDSGAT+H+ S F   +  Q   G   + V  G  +     G+  L  ++R L L N+  VP I +NLISV  L        + +  SF + +    
Subjt:  SWILDSGATNHVCSSFQGNNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLE------QCYSVSFLLNEALIS

Query:  KNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTY-LWHLRLGHINIDWIDRLVKNGLLSDIEDT-SLPPCESCLEGKMTKRP
          GV +   K ++ L+               +  A +Q      SP S  T+  WH RLGH     ++ ++ N  LS +  +     C  CL  K  K P
Subjt:  KNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTY-LWHLRLGHINIDWIDRLVKNGLLSDIEDT-SLPPCESCLEGKMTKRP

Query:  FTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDYMIEHGI
        F+     +  PLE I+SD+     + +   Y Y++ F+D ++RY +LY +  KS+  E F  FK  +EN     I T  SD GGE++     +Y  +HGI
Subjt:  FTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDYMIEHGI

Query:  QSQLSAPGTPQ
            S P TP+
Subjt:  QSQLSAPGTPQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 1.5e-32 | % identity: 24.34
Query:  QLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRT----AAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQEMFGLPSYQ
        +LT  N+  W   ++ +    +L   L +   P+PP T    A   V   Y RW + ++ +   IL ++S  +        TA +I  +L++++  PSY 
Subjt:  QLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRT----AAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQEMFGLPSYQ

Query:  LHHDALKNVFN-------AKMLEGQSVREHVLDMINQFNIVEANGGVVCE-RSQSKGKRVVKGEANVAHSKKKFLKGSSSGTKSVP------QASSSKRI
         H   L+ +          K ++     E VL+     N+ +    V+ +  ++     + +    + + + K L  +S+    +       + +++ R 
Subjt:  LHHDALKNVFN-------AKMLEGQSVREHVLDMINQFNIVEANGGVVCE-RSQSKGKRVVKGEANVAHSKKKFLKGSSSGTKSVP------QASSSKRI

Query:  QKRKGDKGK-----------APAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPR----YLAEKRREKEGKFDLLVLETCLVEHDEF---SWILDSG
        Q  +GD               P+ +       +     GRC  C+    GH    CP+         +++    F        L  +  +   +W+LDSG
Subjt:  QKRKGDKGK-----------APAQAVQGKGKAKVVADKGRCFHCNADGDGHWKHNCPR----YLAEKRREKEGKFDLLVLETCLVEHDEF---SWILDSG

Query:  ATNHVCSSFQGNNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLE------QCYSVSFLLNEALISKNGVYIC
        AT+H+ S F   +F Q   G   + +  G  +     G+A L   +R L L  +  VP I +NLISV  L        + +  SF + +      GV + 
Subjt:  ATNHVCSSFQGNNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENLYLVPRIKRNLISVSALLE------QCYSVSFLLNEALISKNGVYIC

Query:  SAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTY-LWHLRLGHINIDWIDRLVKNGLLSDIEDT-SLPPCESCLEGKMTKRPFTGKGYR
          K ++ L+               +  A +Q      SP S  T+  WH RLGH ++  ++ ++ N  L  +  +  L  C  C   K  K PF+     
Subjt:  SAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTY-LWHLRLGHINIDWIDRLVKNGLLSDIEDT-SLPPCESCLEGKMTKRPFTGKGYR

Query:  AKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAP
        + +PLE I+SD+     + +   Y Y++ F+D ++RY +LY +  KS+  + F  FK+ VEN     I TL SD GGE++  R  DY+ +HGI    S P
Subjt:  AKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQRFQDYMIEHGIQSQLSAP

Query:  GTPQ
         TP+
Subjt:  GTPQ

Arabidopsis top hits
ATMG00300.1 Gag-Pol-related retrotransposon family protein | e value: 2.3e-09 | % identity: 38.67
Query:  NNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV
        + T LWH RL H++   ++ LVK G L   + +SL  CE C+ GK  +  F+   +  K PL+ +HSDL G  +V
Subjt:  NNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV
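
The % identity values in the hit lists can be cross-checked against the alignment blocks. The following is a minimal sketch, assuming the usual BLAST-style convention that a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and that identity is the number of identical positions divided by the alignment length. The function name `percent_identity` is hypothetical. It is applied here to the Arabidopsis ATMG00300.1 block above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity of a BLAST-style pairwise alignment block.

    Letters on the midline mark identical residues; '+' marks a
    conservative substitution and a space marks a mismatch or gap.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    # The aligned query row (gaps included) gives the alignment length.
    return 100.0 * identical / len(query)


# Query row and middle row of the ATMG00300.1 alignment block.
query = "NNTYLWHLRLGHINIDWIDRLVKNGLLSDIEDTSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV"
midline = "+ T LWH RL H++   ++ LVK G L   + +SL  CE C+ GK  +  F+   +  K PL+ +HSDL G  +V"

print(round(percent_identity(query, midline), 2))  # 38.67
```

The block has 29 identical positions over 75 aligned columns, which reproduces the 38.67 listed for this hit.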


Sequences
CDS sequence
ATGTCTTCCTCTATAATTTCCCTGCTTAAAAATGAACAACTTACCGGCGAGAATTTTCCACAATGGAAATCTAACCTCAATACAATACTCGTGGTTGAGGACTTAAGATT
CGTCTTAACGGAGGAATGTCCTCCCGTTCCCCCTCGCACTGCCGCTCAGGCAGTTAAGGACGCCTACGAACGCTGGACCAAGGCCAATGAAAAGGTCAAAGTCTATATTC
TGGTCAGCTTGTCTGAAGTATTGGCCAAGCGTTACGAGAACGTGGAAACTGCCAGGGAGATTATGAATTCCCTGCAGGAGATGTTTGGACTTCCGTCCTACCAGCTCCAC
CACGACGCCTTGAAGAACGTCTTCAATGCCAAGATGCTGGAAGGTCAATCTGTTCGGGAACATGTCCTGGACATGATTAACCAATTCAACATAGTTGAGGCAAATGGCGG
GGTTGTCTGCGAGCGCAGTCAGAGCAAGGGAAAAAGGGTGGTTAAAGGAGAGGCCAACGTGGCCCATTCCAAGAAGAAGTTCCTGAAGGGTTCATCCTCAGGGACTAAAT
CTGTACCCCAGGCTTCTTCATCGAAGCGGATTCAAAAGAGGAAGGGAGACAAGGGGAAGGCTCCTGCACAGGCTGTGCAAGGGAAGGGGAAGGCCAAGGTCGTGGCCGAC
AAAGGCAGATGCTTCCACTGTAATGCAGATGGAGATGGTCACTGGAAGCACAACTGTCCCCGCTACCTTGCTGAGAAGAGAAGAGAAAAAGAAGGTAAATTCGATTTACT
TGTGTTAGAGACTTGTCTCGTTGAACATGATGAGTTTTCCTGGATACTGGATTCGGGAGCCACTAATCATGTTTGTTCTTCTTTTCAGGGAAATAATTTCCAGCAGCTGG
CAGAGGGTGAAATGACGCTCAAGGTCGGAACGGGAGACGTCGTTTCAGCTCGTGCAGTGGGAGCTGCAAAATTATTCTTTAGAAATAGGTTTTTAATTTTAGAGAACTTG
TACTTGGTTCCTAGAATTAAAAGGAACCTTATTTCTGTTTCAGCTTTGTTAGAACAATGTTATTCTGTTTCCTTCTTGCTTAATGAAGCTTTAATTTCAAAGAATGGAGT
TTACATTTGTTCAGCTAAGCGTGAGAATAATTTGTTTGTGTTAAGACCTAATGACGCTAAGGCTATTTTAAGTCATGAAATGTTTAAAACGGCTGAAACGCAAAACAAAA
GGCAAAAAGTTTCTCCTTTAAGTAACAACACGTATCTTTGGCATCTTCGTCTTGGTCATATTAACATCGATTGGATCGATCGTTTGGTTAAAAATGGACTTCTAAGTGAC
ATAGAAGATACATCTTTACCACCCTGTGAATCGTGTCTCGAGGGTAAAATGACCAAGAGGCCTTTTACTGGAAAAGGTTATAGGGCCAAAGAACCACTTGAACTAATACA
TTCGGATCTTTGTGGTCCAATGAATGTAAAGGCTCGAGGTGGTTATGAATATTTCATCTCTTTCATAGATGATTATTCTCGATACGGTTACTTGTACCTGATGGGCCATA
AGTCTGAAGCTCTTGAAAGGTTCAAAGAGTTTAAGGCTGAGGTAGAAAACCTGTTAGGTAAAATGATTAAAACACTTCGATCAGATCGAGGTGGAGAGTATTTAGATCAG
AGATTCCAGGACTATATGATAGAACATGGAATCCAATCACAACTCTCAGCACCTGGTACACCTCAGCAGTGA
mRNA sequence
ATGTCTTCCTCTATAATTTCCCTGCTTAAAAATGAACAACTTACCGGCGAGAATTTTCCACAATGGAAATCTAACCTCAATACAATACTCGTGGTTGAGGACTTAAGATT
CGTCTTAACGGAGGAATGTCCTCCCGTTCCCCCTCGCACTGCCGCTCAGGCAGTTAAGGACGCCTACGAACGCTGGACCAAGGCCAATGAAAAGGTCAAAGTCTATATTC
TGGTCAGCTTGTCTGAAGTATTGGCCAAGCGTTACGAGAACGTGGAAACTGCCAGGGAGATTATGAATTCCCTGCAGGAGATGTTTGGACTTCCGTCCTACCAGCTCCAC
CACGACGCCTTGAAGAACGTCTTCAATGCCAAGATGCTGGAAGGTCAATCTGTTCGGGAACATGTCCTGGACATGATTAACCAATTCAACATAGTTGAGGCAAATGGCGG
GGTTGTCTGCGAGCGCAGTCAGAGCAAGGGAAAAAGGGTGGTTAAAGGAGAGGCCAACGTGGCCCATTCCAAGAAGAAGTTCCTGAAGGGTTCATCCTCAGGGACTAAAT
CTGTACCCCAGGCTTCTTCATCGAAGCGGATTCAAAAGAGGAAGGGAGACAAGGGGAAGGCTCCTGCACAGGCTGTGCAAGGGAAGGGGAAGGCCAAGGTCGTGGCCGAC
AAAGGCAGATGCTTCCACTGTAATGCAGATGGAGATGGTCACTGGAAGCACAACTGTCCCCGCTACCTTGCTGAGAAGAGAAGAGAAAAAGAAGGTAAATTCGATTTACT
TGTGTTAGAGACTTGTCTCGTTGAACATGATGAGTTTTCCTGGATACTGGATTCGGGAGCCACTAATCATGTTTGTTCTTCTTTTCAGGGAAATAATTTCCAGCAGCTGG
CAGAGGGTGAAATGACGCTCAAGGTCGGAACGGGAGACGTCGTTTCAGCTCGTGCAGTGGGAGCTGCAAAATTATTCTTTAGAAATAGGTTTTTAATTTTAGAGAACTTG
TACTTGGTTCCTAGAATTAAAAGGAACCTTATTTCTGTTTCAGCTTTGTTAGAACAATGTTATTCTGTTTCCTTCTTGCTTAATGAAGCTTTAATTTCAAAGAATGGAGT
TTACATTTGTTCAGCTAAGCGTGAGAATAATTTGTTTGTGTTAAGACCTAATGACGCTAAGGCTATTTTAAGTCATGAAATGTTTAAAACGGCTGAAACGCAAAACAAAA
GGCAAAAAGTTTCTCCTTTAAGTAACAACACGTATCTTTGGCATCTTCGTCTTGGTCATATTAACATCGATTGGATCGATCGTTTGGTTAAAAATGGACTTCTAAGTGAC
ATAGAAGATACATCTTTACCACCCTGTGAATCGTGTCTCGAGGGTAAAATGACCAAGAGGCCTTTTACTGGAAAAGGTTATAGGGCCAAAGAACCACTTGAACTAATACA
TTCGGATCTTTGTGGTCCAATGAATGTAAAGGCTCGAGGTGGTTATGAATATTTCATCTCTTTCATAGATGATTATTCTCGATACGGTTACTTGTACCTGATGGGCCATA
AGTCTGAAGCTCTTGAAAGGTTCAAAGAGTTTAAGGCTGAGGTAGAAAACCTGTTAGGTAAAATGATTAAAACACTTCGATCAGATCGAGGTGGAGAGTATTTAGATCAG
AGATTCCAGGACTATATGATAGAACATGGAATCCAATCACAACTCTCAGCACCTGGTACACCTCAGCAGTGA
Protein sequence
MSSSIISLLKNEQLTGENFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLH
HDALKNVFNAKMLEGQSVREHVLDMINQFNIVEANGGVVCERSQSKGKRVVKGEANVAHSKKKFLKGSSSGTKSVPQASSSKRIQKRKGDKGKAPAQAVQGKGKAKVVAD
KGRCFHCNADGDGHWKHNCPRYLAEKRREKEGKFDLLVLETCLVEHDEFSWILDSGATNHVCSSFQGNNFQQLAEGEMTLKVGTGDVVSARAVGAAKLFFRNRFLILENL
YLVPRIKRNLISVSALLEQCYSVSFLLNEALISKNGVYICSAKRENNLFVLRPNDAKAILSHEMFKTAETQNKRQKVSPLSNNTYLWHLRLGHINIDWIDRLVKNGLLSD
IEDTSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGYLYLMGHKSEALERFKEFKAEVENLLGKMIKTLRSDRGGEYLDQ
RFQDYMIEHGIQSQLSAPGTPQQ
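
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code and translation in frame 0 from the first base; the function name `translate` is hypothetical. Only the first 45 nt and the last 12 nt of the CDS are used here to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard genetic code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 45 nt and last 12 nt of the CDS listed above.
print(translate("ATGTCTTCCTCTATAATTTCCCTGCTTAAAAATGAACAACTTACC"))  # MSSSIISLLKNEQLT
print(translate("CCTCAGCAGTGA"))  # PQQ
```

Both fragments agree with the start (`MSSSIISLLKNEQLT`) and end (`PQQ`) of the listed protein sequence; the final `TGA` codon is the stop.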