CuGenDBv2

Pay0001201 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0001201
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Gag protease polyprotein
Genome location: chr06:29665796..29667547
RNA-Seq Expression: Pay0001201
Synteny: Pay0001201
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domains:
IPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR036875 - Zinc finger, CCHC-type superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR001969 - Aspartic peptidase, active site
IPR000477 - Reverse transcriptase domain
IPR001878 - Zinc finger, CCHC-type
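The genome location above is written in the `chrom:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention, but one this page does not state), the span of the locus can be computed as below. The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(location: str) -> int:
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


print(span_length("chr06:29665796..29667547"))  # 1752
```

Under that assumption the locus spans 1752 bp, which is a whole number of codons (584).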


Homology
GenBank top hits (e value, % identity, alignment)
KAA0038231.1 gag protease polyprotein [Cucumis melo var. makuwa]; e value 0.0e+00; identity 98.79%
Query:  MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG
        MIATEAARADKFV+GLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVP+RNFRPGGEFRSFQQKPFEAGEAARG
Subjt:  MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG

Query:  KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV
        KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVV+GTLPVLGHYALVLFDSGSSHSFISSAFV
Subjt:  KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV

Query:  SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV
        SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV
Subjt:  SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV

Query:  ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
        ISAIRASKLLSQGTW ILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
Subjt:  ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP

Query:  SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPA
        SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPA
Subjt:  SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPA

Query:  VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV
        VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEH+EHLRMVLQTLRDNKLY+KFSKCEFWLKQVSFLGHVVSKAGVSV
Subjt:  VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV

KAA0043391.1 pol protein [Cucumis melo var. makuwa]; e value 0.0e+00; identity 98.62%
Query:  MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG
        MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRG TSGQKRKAEQQPVPVPQRNFR GGEFR FQQKPFEAGEAARG
Subjt:  MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG

Query:  KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV
        KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLR+TG AQNQGAGAPHQGRVFATN+TEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV
Subjt:  KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV

Query:  SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV
         HARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV
Subjt:  SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV

Query:  ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
        ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
Subjt:  ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP

Query:  SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPA
        SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPA
Subjt:  SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPA

Query:  VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV
        VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV
Subjt:  VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV

KAA0053290.1 gag protease polyprotein [Cucumis melo var. makuwa]; e value 0.0e+00; identity 98.27%
Query:  MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG
        MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGR STSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFE GEAARG
Subjt:  MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG

Query:  KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV
        KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAG PHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV
Subjt:  KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV

Query:  SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV
        +HARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLA NHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV
Subjt:  SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV

Query:  ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
        ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
Subjt:  ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP

Query:  SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPA
        SVSPWGAPVLFVKKKDGS+RLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDL SGYHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPA
Subjt:  SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPA

Query:  VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV
        VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLR+NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV
Subjt:  VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV

KAA0054678.1 gag protease polyprotein [Cucumis melo var. makuwa]; e value 0.0e+00; identity 98.96%
Query:  MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG
        MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG
Subjt:  MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG

Query:  KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV
        KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV
Subjt:  KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV

Query:  SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV
        SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVI VTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTF+PPSMASFKFKGGGSKSLP+V
Subjt:  SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV

Query:  ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
        ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
Subjt:  ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP

Query:  SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPA
        SVSPWGAPVLFVKKK+GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDED+ KTAFRSRYGHYEFIVMSFGLTNAPA
Subjt:  SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPA

Query:  VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV
        VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV
Subjt:  VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV

TYK01613.1 pol protein [Cucumis melo var. makuwa]; e value 0.0e+00; identity 99.65%
Query:  MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG
        MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG
Subjt:  MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG

Query:  KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV
        KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLR+TGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV
Subjt:  KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV

Query:  SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV
        SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV
Subjt:  SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV

Query:  ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
        ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
Subjt:  ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP

Query:  SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPA
        SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPA
Subjt:  SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPA

Query:  VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV
        VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV
Subjt:  VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T9B7 Gag protease polyprotein; e value 0.0e+00; identity 98.79%
Query:  MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG
        MIATEAARADKFV+GLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVP+RNFRPGGEFRSFQQKPFEAGEAARG
Subjt:  MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG

Query:  KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV
        KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVV+GTLPVLGHYALVLFDSGSSHSFISSAFV
Subjt:  KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV

Query:  SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV
        SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV
Subjt:  SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV

Query:  ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
        ISAIRASKLLSQGTW ILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
Subjt:  ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP

Query:  SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPA
        SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPA
Subjt:  SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPA

Query:  VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV
        VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEH+EHLRMVLQTLRDNKLY+KFSKCEFWLKQVSFLGHVVSKAGVSV
Subjt:  VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV

A0A5A7TP96 Reverse transcriptase; e value 0.0e+00; identity 98.62%
Query:  MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG
        MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRG TSGQKRKAEQQPVPVPQRNFR GGEFR FQQKPFEAGEAARG
Subjt:  MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG

Query:  KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV
        KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLR+TG AQNQGAGAPHQGRVFATN+TEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV
Subjt:  KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV

Query:  SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV
         HARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV
Subjt:  SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV

Query:  ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
        ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
Subjt:  ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP

Query:  SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPA
        SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPA
Subjt:  SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPA

Query:  VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV
        VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV
Subjt:  VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV

A0A5A7UDK9 Gag protease polyprotein; e value 0.0e+00; identity 98.27%
Query:  MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG
        MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGR STSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFE GEAARG
Subjt:  MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG

Query:  KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV
        KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAG PHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV
Subjt:  KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV

Query:  SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV
        +HARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLA NHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV
Subjt:  SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV

Query:  ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
        ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
Subjt:  ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP

Query:  SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPA
        SVSPWGAPVLFVKKKDGS+RLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDL SGYHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPA
Subjt:  SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPA

Query:  VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV
        VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLR+NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV
Subjt:  VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV

A0A5A7UI54 Gag protease polyprotein; e value 0.0e+00; identity 98.96%
Query:  MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG
        MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG
Subjt:  MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG

Query:  KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV
        KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV
Subjt:  KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV

Query:  SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV
        SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVI VTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTF+PPSMASFKFKGGGSKSLP+V
Subjt:  SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV

Query:  ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
        ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
Subjt:  ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP

Query:  SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPA
        SVSPWGAPVLFVKKK+GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDED+ KTAFRSRYGHYEFIVMSFGLTNAPA
Subjt:  SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPA

Query:  VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV
        VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV
Subjt:  VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV

A0A5D3BPI1 Reverse transcriptase; e value 0.0e+00; identity 99.65%
Query:  MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG
        MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG
Subjt:  MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARG

Query:  KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV
        KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLR+TGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV
Subjt:  KPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV

Query:  SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV
        SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV
Subjt:  SHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQV

Query:  ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
        ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
Subjt:  ISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP

Query:  SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPA
        SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPA
Subjt:  SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPA

Query:  VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV
        VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV
Subjt:  VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV

SwissProt top hits (e value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein; e value 1.7e-44; identity 38.14%
Query:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT
          ++QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT

Query:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ
        L++  L    +KCEF   QV F+G+ +S+ G +  Q
Subjt:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ

P0CT35 Transposon Tf2-2 polyprotein; e value 1.7e-44; identity 38.14%
Query:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT
          ++QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT

Query:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ
        L++  L    +KCEF   QV F+G+ +S+ G +  Q
Subjt:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ

P0CT41 Transposon Tf2-12 polyprotein; e value 1.7e-44; identity 38.14%
Query:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT
          ++QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT

Query:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ
        L++  L    +KCEF   QV F+G+ +S+ G +  Q
Subjt:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein; e value 3.7e-47; identity 40.82%
Query:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR
        Y ++   +LP  P    +  V+  IE++PG       PY +     +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + 
Subjt:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR

Query:  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE
        +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++ +D  KTAF +  G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++  EH 
Subjt:  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE

Query:  EHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ
        +HL  VL+ L++  L  K  KC+F  ++  FLG+ +    ++ +Q
Subjt:  EHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ

Q99315 Transposon Ty3-G Gag-Pol polyprotein; e value 3.7e-47; identity 40.82%
Query:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR
        Y ++   +LP  P    +  V+  IE++PG       PY +     +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + 
Subjt:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR

Query:  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE
        +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++ +D  KTAF +  G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++  EH 
Subjt:  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE

Query:  EHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ
        +HL  VL+ L++  L  K  KC+F  ++  FLG+ +    ++ +Q
Subjt:  EHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ

Arabidopsis top hits (e value, % identity, alignment)
No hits found
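In BLAST-style alignments such as those above, the unlabeled middle row carries the match information: a residue letter marks an identical column, `+` a conservative substitution, and a blank a mismatch. In this extraction the Subjt rows appear to repeat the query rows, so the midline is the row to count from. The following is a minimal sketch of recomputing percent identity as identical columns divided by alignment length; the function name `identity_from_rows` is hypothetical, and the padding step assumes that trailing blanks of a midline may have been lost in extraction.

```python
def identity_from_rows(rows: list[tuple[str, str]]) -> tuple[int, int, float]:
    """Return (identical, aligned, percent) from (query_row, midline_row) pairs."""
    identical = 0
    aligned = 0
    for query, midline in rows:
        width = len(query)
        # Restore trailing blanks that extraction may have stripped (assumption).
        padded = midline.ljust(width)[:width]
        aligned += width
        # Letters in the midline mark identical columns; '+' and ' ' do not.
        identical += sum(1 for ch in padded if ch.isalpha())
    percent = 100.0 * identical / aligned if aligned else 0.0
    return identical, aligned, percent


# Toy rows for illustration only, not copied from the hits above.
rows = [("MIATEAAR", "MIA+EA R"), ("KPLC", "KP")]
identical, aligned, percent = identity_from_rows(rows)
print(identical, aligned, f"{percent:.2f}")  # 8 12 66.67
```

Each row pair is passed with the `Query:` label and leading indentation removed, so that the columns of the two strings line up.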

Sequences
CDS sequence
ATGATAGCGACCGAGGCGGCCAGAGCTGACAAGTTTGTTAGAGGCCTCAGACTGGACATTCAGGGTTTGGTCCGAGCTTTCAGACCCGCTACTCATGCCGATGCACTGCG
CCTGGCAGTGGATCTCAGTTTACAGGAGAGGGCCAACTCGTCTAAGACCGCTGGTAGAGGTTCGACGTCGGGACAGAAGAGGAAGGCTGAGCAGCAGCCTGTTCCAGTGC
CACAGCGGAATTTCAGACCAGGTGGTGAGTTTCGCAGCTTCCAGCAGAAACCTTTTGAGGCAGGGGAGGCTGCCAGAGGGAAGCCGTTGTGTACCACTTGTGGGAAGCAC
CATCTGGGCCGTTGCTTATTCGGGACCAGGACCTGCTTTAAGTGCAGGCAAGAGGGTCATACAGCTGATAGATGCCCGTTGAGAGTCACGGGGATCGCGCAGAATCAGGG
AGCAGGTGCTCCACATCAGGGTAGAGTCTTTGCTACCAACAGGACTGAGGCTGAGAAGGCAGGCACAGTAGTGACAGGTACGCTCCCAGTGTTGGGGCATTACGCCTTAG
TTTTGTTTGATTCGGGTTCGTCACATTCTTTTATCTCTTCCGCATTTGTGTCGCATGCCCGCTTAGAGGTAGAGCCCTTACACCATGTTCTGTCAGTATCTACTCCTTCC
GGGGAATGTATGTTGTCGAAGGAAAAGGTGAAGGCATGTCAGATTGAGATAGCAGGCCATGTGATTGAGGTAACGCTGATAGTTCTGGATATGCTGGACTTTGATGTAAT
CCTGGGTATGGATTGGTTGGCCGCTAACCACGCCAGCATAGATTGTTCACGTAAGGAGGTAACGTTTAACCCTCCCTCGATGGCCAGTTTTAAATTTAAGGGAGGAGGGT
CAAAGTCGTTGCCTCAGGTAATCTCAGCCATCAGGGCCAGTAAACTGCTCAGTCAGGGTACTTGGGGTATCTTAGCGAGTGTGGTGGATACTAGAGAGGCGGATGTATCC
CTGTCATCAGAACCAGTAGTGAGGGACTATCCGGACGTTTTTCCTGAGGAACTTCCAGGGTTACCTCCGCACAGAGAGGTTGAGTTTGCCATAGAGTTGGAGCCGGGCAC
GGTTCCTATATCCAGAGCCCCTTACAGAATGGCCCCCGCAGAATTGAAAGAACTGAAGGTACAGTTACAGGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCAC
CTTGGGGTGCGCCAGTCTTATTCGTTAAGAAGAAGGATGGGTCGATGCGTCTATGCATTGACTATAGGGAGTTGAACAAAGTAACCGTAAAGAACAGATATCCCTTGCCC
AGGATTGACGATCTATTTGACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAGGATGAGGATATACCGAAGAC
AGCATTTCGATCCAGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCC
TAGATACTTTTGTGATTGTGTTTATCGACGATATCTTGATATACTCTAAGACGGAGGCCGAACACGAGGAGCATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAG
TTGTACGCAAAGTTCTCGAAGTGCGAGTTTTGGCTGAAGCAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGCTGGAGTCTCTGTGATCCAGCTAAGATAG
mRNA sequence
ATGATAGCGACCGAGGCGGCCAGAGCTGACAAGTTTGTTAGAGGCCTCAGACTGGACATTCAGGGTTTGGTCCGAGCTTTCAGACCCGCTACTCATGCCGATGCACTGCG
CCTGGCAGTGGATCTCAGTTTACAGGAGAGGGCCAACTCGTCTAAGACCGCTGGTAGAGGTTCGACGTCGGGACAGAAGAGGAAGGCTGAGCAGCAGCCTGTTCCAGTGC
CACAGCGGAATTTCAGACCAGGTGGTGAGTTTCGCAGCTTCCAGCAGAAACCTTTTGAGGCAGGGGAGGCTGCCAGAGGGAAGCCGTTGTGTACCACTTGTGGGAAGCAC
CATCTGGGCCGTTGCTTATTCGGGACCAGGACCTGCTTTAAGTGCAGGCAAGAGGGTCATACAGCTGATAGATGCCCGTTGAGAGTCACGGGGATCGCGCAGAATCAGGG
AGCAGGTGCTCCACATCAGGGTAGAGTCTTTGCTACCAACAGGACTGAGGCTGAGAAGGCAGGCACAGTAGTGACAGGTACGCTCCCAGTGTTGGGGCATTACGCCTTAG
TTTTGTTTGATTCGGGTTCGTCACATTCTTTTATCTCTTCCGCATTTGTGTCGCATGCCCGCTTAGAGGTAGAGCCCTTACACCATGTTCTGTCAGTATCTACTCCTTCC
GGGGAATGTATGTTGTCGAAGGAAAAGGTGAAGGCATGTCAGATTGAGATAGCAGGCCATGTGATTGAGGTAACGCTGATAGTTCTGGATATGCTGGACTTTGATGTAAT
CCTGGGTATGGATTGGTTGGCCGCTAACCACGCCAGCATAGATTGTTCACGTAAGGAGGTAACGTTTAACCCTCCCTCGATGGCCAGTTTTAAATTTAAGGGAGGAGGGT
CAAAGTCGTTGCCTCAGGTAATCTCAGCCATCAGGGCCAGTAAACTGCTCAGTCAGGGTACTTGGGGTATCTTAGCGAGTGTGGTGGATACTAGAGAGGCGGATGTATCC
CTGTCATCAGAACCAGTAGTGAGGGACTATCCGGACGTTTTTCCTGAGGAACTTCCAGGGTTACCTCCGCACAGAGAGGTTGAGTTTGCCATAGAGTTGGAGCCGGGCAC
GGTTCCTATATCCAGAGCCCCTTACAGAATGGCCCCCGCAGAATTGAAAGAACTGAAGGTACAGTTACAGGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCAC
CTTGGGGTGCGCCAGTCTTATTCGTTAAGAAGAAGGATGGGTCGATGCGTCTATGCATTGACTATAGGGAGTTGAACAAAGTAACCGTAAAGAACAGATATCCCTTGCCC
AGGATTGACGATCTATTTGACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAGGATGAGGATATACCGAAGAC
AGCATTTCGATCCAGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCC
TAGATACTTTTGTGATTGTGTTTATCGACGATATCTTGATATACTCTAAGACGGAGGCCGAACACGAGGAGCATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAG
TTGTACGCAAAGTTCTCGAAGTGCGAGTTTTGGCTGAAGCAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGCTGGAGTCTCTGTGATCCAGCTAAGATAG
Protein sequence
MIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKH
HLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPS
GECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVS
LSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLP
RIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
LYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQLR
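The protein sequence above can be checked against the CDS by translating it. The sketch below assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly; `translate` is a hypothetical helper, and the two inputs are the first 21 and the last 15 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (an assumption
# about which table applies to this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGATAGCGACCGAGGCGGCC"))  # MIATEAA
print(translate("ATCCAGCTAAGATAG"))        # IQLR
```

Both fragments agree with the start (`MIATEAA...`) and end (`...IQLR`) of the protein sequence listed above. Passing the full CDS, with its line breaks left in, works the same way.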