CuGenDBv2

Pay0011311 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0011311
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Gag protease polyprotein
Genome location: chr02:4119025..4121025
RNA-Seq Expression: Pay0011311
Synteny: Pay0011311
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR001969 - Aspartic peptidase, active site
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0032231.1 gag protease polyprotein [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 96.85
Query:  MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVP
        MPPRRGARRGGRGGRGRGAG VQPEVQPVAQAPD AAPVTHADLAAMEQRFRDLIMQMREQQKPAS      PAPAPAPAPAPAPAPAPAPVPVAPQFVP
Subjt:  MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVP

Query:  DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL
        DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQ+WLSSLETIFRYMKCPEDQKVQCAVFMLTD+GTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL
Subjt:  DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL

Query:  RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA
        RDAKRQEFLNLEQGDM VEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQ LVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA
Subjt:  RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA

Query:  EQQPVPAPQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK
        EQQPVP PQRNFRS GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK
Subjt:  EQQPVPAPQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK

Query:  AGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN
        AGTVV GTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKE+VKACQIEIAGHVIEVTL+VLDMLDFDVILGMDWLAAN
Subjt:  AGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN

Query:  HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG
        HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEEL GLPPHREVEFAIELEPG
Subjt:  HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG

Query:  TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT
        TVPISRAPYRMAPA   ELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT
Subjt:  TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT

KAA0043391.1 pol protein [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 98.05
Query:  MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVP
        MPPRRGARRGGRGGRGRGAG VQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPT    PA APAPAPAPAPAPAPAPVPVAPQFVP
Subjt:  MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVP

Query:  DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL
        DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTD+GTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL
Subjt:  DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL

Query:  RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA
        RDAKRQEFLN+EQG+MTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRG TSGQKRKA
Subjt:  RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA

Query:  EQQPVPAPQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK
        EQQPVP PQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK
Subjt:  EQQPVPAPQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK

Query:  AGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN
        AGTVV GTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN
Subjt:  AGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN

Query:  HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG
        HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRE DVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG
Subjt:  HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG

Query:  TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT
        TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT
Subjt:  TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT

KAA0049635.1 gag protease polyprotein [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 96.4
Query:  MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVP
        MPP+RGARRGGRGGRGRGAG VQPEVQPVAQAPDPAAPVTH DLAAMEQRFRDLIMQMREQQK ASPT    PAPAPAPAPAPAPAPAPAPVPVAPQFVP
Subjt:  MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVP

Query:  DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL
        DQLSAEAKHLRDFRKYNPTT DGSLEDPTRAQ+WLSSLETIFRYMKCPEDQKVQCAVFMLTD+ TAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL
Subjt:  DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL

Query:  RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA
        RDAKRQEFLNLEQ DMTVEQYDA+FDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA
Subjt:  RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA

Query:  EQQPVPAPQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK
        EQQPVP PQRNFRSGGEFR FQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGA APHQGRVFATNKTEAEK
Subjt:  EQQPVPAPQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK

Query:  AGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN
        AGTVV GTLPVLGHYALVLFDSGSSH FISSAFV HARLEVEPLHHVLSVSTPS ECMLSKE+VKACQIEIAGHVIEVTL+VLDMLDFDVILGMDWLAAN
Subjt:  AGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN

Query:  HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG
        HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEF+IELEPG
Subjt:  HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG

Query:  TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT
        TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT
Subjt:  TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT

KAA0054678.1 gag protease polyprotein [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 96.55
Query:  MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVP
        MPPRRGARRGGRGGRGRGAG VQPEVQPVAQAPDPAAPVTHADLAAMEQRFRD+IMQMREQQKPASPT    PAPAPAPAPAP PAPAPAPVPVAPQF+P
Subjt:  MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVP

Query:  DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL
        DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTD+GTA WETTERMLGGDVSQITWQQFKESFYAKFFSASL
Subjt:  DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL

Query:  RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA
        RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA
Subjt:  RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA

Query:  EQQPVPAPQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK
        EQQPVP PQRNFR GGEFR FQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLR+TG AQNQGAGAPHQGRVFATN+TEAEK
Subjt:  EQQPVPAPQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK

Query:  AGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN
        AGTVV GTLPVLGHYALVLFDSGSSHSFISSAFV HARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVI VTLIVLDMLDFDVILGMDWLAAN
Subjt:  AGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN

Query:  HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG
        HASIDCSRKEVTF+PPSMASFKFKGGGSKSLP+VISAIRASKLLSQGTWGILASVVDTRE DVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG
Subjt:  HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG

Query:  TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT
        TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK+GSMRLCIDYRELNKVT
Subjt:  TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT

TYK01613.1 pol protein [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 97.6
Query:  MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVP
        MPPRRGARRGGRGGRGRGAG VQPEVQPVAQAPDPAAPVTHADLAAMEQRFRD+IMQMREQQKPASPT    PAPAPAPAPAP PAPAPAPVPVAPQFVP
Subjt:  MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVP

Query:  DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL
        DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTD+GTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL
Subjt:  DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL

Query:  RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA
        RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA
Subjt:  RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA

Query:  EQQPVPAPQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK
        EQQPVP PQRNFR GGEFR FQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTG AQNQGAGAPHQGRVFATN+TEAEK
Subjt:  EQQPVPAPQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK

Query:  AGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN
        AGTVV GTLPVLGHYALVLFDSGSSHSFISSAFV HARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN
Subjt:  AGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN

Query:  HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG
        HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRE DVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG
Subjt:  HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG

Query:  TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT
        TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT
Subjt:  TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SRS1 Gag protease polyprotein | e value: 0.0e+00 | % identity: 96.85
Query:  MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVP
        MPPRRGARRGGRGGRGRGAG VQPEVQPVAQAPD AAPVTHADLAAMEQRFRDLIMQMREQQKPAS      PAPAPAPAPAPAPAPAPAPVPVAPQFVP
Subjt:  MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVP

Query:  DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL
        DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQ+WLSSLETIFRYMKCPEDQKVQCAVFMLTD+GTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL
Subjt:  DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL

Query:  RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA
        RDAKRQEFLNLEQGDM VEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQ LVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA
Subjt:  RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA

Query:  EQQPVPAPQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK
        EQQPVP PQRNFRS GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK
Subjt:  EQQPVPAPQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK

Query:  AGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN
        AGTVV GTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKE+VKACQIEIAGHVIEVTL+VLDMLDFDVILGMDWLAAN
Subjt:  AGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN

Query:  HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG
        HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEEL GLPPHREVEFAIELEPG
Subjt:  HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG

Query:  TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT
        TVPISRAPYRMAPA   ELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT
Subjt:  TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT

A0A5A7TP96 Reverse transcriptase | e value: 0.0e+00 | % identity: 98.05
Query:  MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVP
        MPPRRGARRGGRGGRGRGAG VQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPT    PA APAPAPAPAPAPAPAPVPVAPQFVP
Subjt:  MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVP

Query:  DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL
        DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTD+GTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL
Subjt:  DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL

Query:  RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA
        RDAKRQEFLN+EQG+MTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRG TSGQKRKA
Subjt:  RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA

Query:  EQQPVPAPQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK
        EQQPVP PQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK
Subjt:  EQQPVPAPQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK

Query:  AGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN
        AGTVV GTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN
Subjt:  AGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN

Query:  HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG
        HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRE DVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG
Subjt:  HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG

Query:  TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT
        TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT
Subjt:  TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT

A0A5A7U7Q0 Gag protease polyprotein | e value: 0.0e+00 | % identity: 96.4
Query:  MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVP
        MPP+RGARRGGRGGRGRGAG VQPEVQPVAQAPDPAAPVTH DLAAMEQRFRDLIMQMREQQK ASPT    PAPAPAPAPAPAPAPAPAPVPVAPQFVP
Subjt:  MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVP

Query:  DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL
        DQLSAEAKHLRDFRKYNPTT DGSLEDPTRAQ+WLSSLETIFRYMKCPEDQKVQCAVFMLTD+ TAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL
Subjt:  DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL

Query:  RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA
        RDAKRQEFLNLEQ DMTVEQYDA+FDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA
Subjt:  RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA

Query:  EQQPVPAPQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK
        EQQPVP PQRNFRSGGEFR FQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGA APHQGRVFATNKTEAEK
Subjt:  EQQPVPAPQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK

Query:  AGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN
        AGTVV GTLPVLGHYALVLFDSGSSH FISSAFV HARLEVEPLHHVLSVSTPS ECMLSKE+VKACQIEIAGHVIEVTL+VLDMLDFDVILGMDWLAAN
Subjt:  AGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN

Query:  HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG
        HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEF+IELEPG
Subjt:  HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG

Query:  TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT
        TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT
Subjt:  TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT

A0A5A7UI54 Gag protease polyprotein | e value: 0.0e+00 | % identity: 96.55
Query:  MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVP
        MPPRRGARRGGRGGRGRGAG VQPEVQPVAQAPDPAAPVTHADLAAMEQRFRD+IMQMREQQKPASPT    PAPAPAPAPAP PAPAPAPVPVAPQF+P
Subjt:  MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVP

Query:  DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL
        DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTD+GTA WETTERMLGGDVSQITWQQFKESFYAKFFSASL
Subjt:  DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL

Query:  RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA
        RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA
Subjt:  RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA

Query:  EQQPVPAPQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK
        EQQPVP PQRNFR GGEFR FQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLR+TG AQNQGAGAPHQGRVFATN+TEAEK
Subjt:  EQQPVPAPQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK

Query:  AGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN
        AGTVV GTLPVLGHYALVLFDSGSSHSFISSAFV HARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVI VTLIVLDMLDFDVILGMDWLAAN
Subjt:  AGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN

Query:  HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG
        HASIDCSRKEVTF+PPSMASFKFKGGGSKSLP+VISAIRASKLLSQGTWGILASVVDTRE DVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG
Subjt:  HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG

Query:  TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT
        TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK+GSMRLCIDYRELNKVT
Subjt:  TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT

A0A5D3BPI1 Reverse transcriptase | e value: 0.0e+00 | % identity: 97.6
Query:  MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVP
        MPPRRGARRGGRGGRGRGAG VQPEVQPVAQAPDPAAPVTHADLAAMEQRFRD+IMQMREQQKPASPT    PAPAPAPAPAP PAPAPAPVPVAPQFVP
Subjt:  MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVP

Query:  DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL
        DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTD+GTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL
Subjt:  DQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASL

Query:  RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA
        RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA
Subjt:  RDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA

Query:  EQQPVPAPQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK
        EQQPVP PQRNFR GGEFR FQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTG AQNQGAGAPHQGRVFATN+TEAEK
Subjt:  EQQPVPAPQRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEK

Query:  AGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN
        AGTVV GTLPVLGHYALVLFDSGSSHSFISSAFV HARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN
Subjt:  AGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN

Query:  HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG
        HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRE DVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG
Subjt:  HASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPG

Query:  TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT
        TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT
Subjt:  TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT

SwissProt top hits (e value, % identity, alignment)
Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | e value: 1.7e-11 | % identity: 40.62
Query:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT
        Y ++   +LP  P    +  V+  IE++PG       PY +     +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T
Subjt:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT

Q99315 Transposon Ty3-G Gag-Pol polyprotein | e value: 1.7e-11 | % identity: 40.62
Query:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT
        Y ++   +LP  P    +  V+  IE++PG       PY +     +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T
Subjt:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT

Arabidopsis top hits (e value, % identity, alignment)
No hits found
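
The % identity values above can be related to the alignment midlines. In BLAST-style output a letter on the midline marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The sketch below computes percent identity for a single alignment block from its midline. The function name `midline_identity` is hypothetical, and the example midline is the final block of the KAA0054678.1 alignment; the value reported for a whole hit is taken over all of its blocks, not one block.

```python
def midline_identity(midline: str) -> float:
    """Percent identity for one alignment block, computed from its midline.

    Letters count as identical positions; '+' and spaces do not.
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(midline)


# Midline of the last block of the KAA0054678.1 alignment (one '+' position).
block = "TVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK+GSMRLCIDYRELNKVT"
pct = midline_identity(block)
print(f"{pct:.2f}")
```

When copying midlines out of the page, keep leading and interior spaces intact, since each space stands for one alignment column.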

Sequences
CDS sequence
ATGCCGCCAAGGAGAGGTGCACGTAGGGGTGGCCGAGGAGGCCGAGGAAGGGGAGCAGGACACGTTCAGCCTGAGGTGCAGCCTGTAGCCCAAGCCCCTGACCCGGCTGC
GCCAGTTACTCATGCGGACCTAGCCGCCATGGAGCAGAGGTTTAGAGATTTGATTATGCAGATGCGGGAGCAGCAGAAGCCTGCCTCGCCAACTCCGGCGCCAGCTCCAG
CGCCAGCTCCAGCGCCAGCTCCAGCACCAGCTCCTGCTCCAGCTCCGGCTCCAGTACCAGTTGCGCCCCAGTTTGTGCCGGATCAGTTGTCAGCAGAGGCTAAGCACCTG
AGGGATTTCAGGAAGTATAATCCCACGACGTTCGATGGGTCTTTGGAGGACCCCACCAGGGCTCAGATGTGGTTATCGTCCTTGGAGACCATATTCCGTTACATGAAATG
CCCTGAGGATCAGAAAGTTCAGTGTGCTGTTTTTATGTTGACTGACAAAGGTACTGCATGGTGGGAGACTACAGAGAGAATGCTAGGTGGTGATGTGAGTCAGATCACGT
GGCAGCAGTTTAAGGAGAGTTTCTATGCGAAATTCTTCTCTGCTAGTTTGAGAGACGCCAAGCGGCAGGAGTTCCTGAACTTAGAGCAGGGTGACATGACAGTGGAGCAG
TATGATGCGGAGTTTGACATGTTATCCCGCTTCGCTCCCGAGATGATAGCGACCGAGGCGGCCAGAGCTGATAAGTTTGTTAGAGGCCTCCGACTGGACATTCAGGGTTT
GGTCCGAGCTTTCCGACCCGCTACTCATGCCGATGCACTGCGCCTGGCAGTGGATCTCAGTTTACAGGAGAGGGCTAACTCGTCTAAGACCGCTGGTAGAGGTTCGACAT
CGGGACAGAAGAGGAAGGCTGAGCAGCAGCCTGTTCCAGCGCCACAGCGGAATTTCAGATCAGGTGGTGAGTTTCGCCGCTTCCAGCAGAAACCTTTTGAGGCAGGGGAG
GCTGCCAGAGGGAAGCCGTTGTGTACCACTTGTGGGAAGCACCATCTGGGCCGTTGCTTATTCGGGACCAGGACTTGCTTTAAGTGCAGGCAAGAGGGTCATACAGCTGA
TAGATGCCCGTTGAGACTCACGGGGAACGCGCAGAATCAGGGAGCAGGTGCTCCACATCAGGGTAGAGTCTTTGCTACCAACAAAACTGAGGCTGAGAAGGCAGGCACGG
TAGTGATAGGTACGCTCCCAGTGTTGGGGCATTATGCCTTAGTTTTGTTTGATTCGGGTTCGTCACATTCTTTTATCTCTTCCGCATTTGTGTTGCATGCCCGCTTAGAG
GTAGAGCCCTTACACCATGTTCTATCAGTATCTACTCCTTCCGGGGAATGTATGTTGTCGAAGGAAAAGGTGAAAGCATGCCAGATTGAGATAGCAGGCCATGTGATTGA
AGTAACGCTGATAGTCCTGGATATGCTCGACTTTGATGTAATCCTGGGTATGGATTGGTTGGCCGCTAACCACGCCAGCATAGATTGTTCACGTAAGGAGGTAACGTTTA
ACCCTCCCTCGATGGCCAGTTTTAAATTTAAGGGAGGAGGGTCAAAGTCGTTGCCTCAGGTAATCTCAGCCATCAGGGCCAGTAAACTGCTCAGTCAGGGTACTTGGGGT
ATCTTAGCGAGCGTGGTGGATACTAGAGAGGTCGATGTATCCCTGTCGTCAGAACCGGTGGTGAGGGACTATCCGGATGTCTTTCCTGAAGAACTTCCAGGGTTACCTCC
GCACAGAGAGGTTGAGTTTGCCATAGAGTTGGAGCCGGGCACGGTTCCTATATCCAGAGCCCCTTACAGAATGGCCCCCGCAGAACTGAAAGAACTGAAGGTACAGTTAC
AGGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAGTTTTATTTGTTAAGAAGAAGGATGGATCGATGCGTCTATGCATTGACTATAGG
GAGTTGAACAAGGTAACGTAA
mRNA sequence
ATGCCGCCAAGGAGAGGTGCACGTAGGGGTGGCCGAGGAGGCCGAGGAAGGGGAGCAGGACACGTTCAGCCTGAGGTGCAGCCTGTAGCCCAAGCCCCTGACCCGGCTGC
GCCAGTTACTCATGCGGACCTAGCCGCCATGGAGCAGAGGTTTAGAGATTTGATTATGCAGATGCGGGAGCAGCAGAAGCCTGCCTCGCCAACTCCGGCGCCAGCTCCAG
CGCCAGCTCCAGCGCCAGCTCCAGCACCAGCTCCTGCTCCAGCTCCGGCTCCAGTACCAGTTGCGCCCCAGTTTGTGCCGGATCAGTTGTCAGCAGAGGCTAAGCACCTG
AGGGATTTCAGGAAGTATAATCCCACGACGTTCGATGGGTCTTTGGAGGACCCCACCAGGGCTCAGATGTGGTTATCGTCCTTGGAGACCATATTCCGTTACATGAAATG
CCCTGAGGATCAGAAAGTTCAGTGTGCTGTTTTTATGTTGACTGACAAAGGTACTGCATGGTGGGAGACTACAGAGAGAATGCTAGGTGGTGATGTGAGTCAGATCACGT
GGCAGCAGTTTAAGGAGAGTTTCTATGCGAAATTCTTCTCTGCTAGTTTGAGAGACGCCAAGCGGCAGGAGTTCCTGAACTTAGAGCAGGGTGACATGACAGTGGAGCAG
TATGATGCGGAGTTTGACATGTTATCCCGCTTCGCTCCCGAGATGATAGCGACCGAGGCGGCCAGAGCTGATAAGTTTGTTAGAGGCCTCCGACTGGACATTCAGGGTTT
GGTCCGAGCTTTCCGACCCGCTACTCATGCCGATGCACTGCGCCTGGCAGTGGATCTCAGTTTACAGGAGAGGGCTAACTCGTCTAAGACCGCTGGTAGAGGTTCGACAT
CGGGACAGAAGAGGAAGGCTGAGCAGCAGCCTGTTCCAGCGCCACAGCGGAATTTCAGATCAGGTGGTGAGTTTCGCCGCTTCCAGCAGAAACCTTTTGAGGCAGGGGAG
GCTGCCAGAGGGAAGCCGTTGTGTACCACTTGTGGGAAGCACCATCTGGGCCGTTGCTTATTCGGGACCAGGACTTGCTTTAAGTGCAGGCAAGAGGGTCATACAGCTGA
TAGATGCCCGTTGAGACTCACGGGGAACGCGCAGAATCAGGGAGCAGGTGCTCCACATCAGGGTAGAGTCTTTGCTACCAACAAAACTGAGGCTGAGAAGGCAGGCACGG
TAGTGATAGGTACGCTCCCAGTGTTGGGGCATTATGCCTTAGTTTTGTTTGATTCGGGTTCGTCACATTCTTTTATCTCTTCCGCATTTGTGTTGCATGCCCGCTTAGAG
GTAGAGCCCTTACACCATGTTCTATCAGTATCTACTCCTTCCGGGGAATGTATGTTGTCGAAGGAAAAGGTGAAAGCATGCCAGATTGAGATAGCAGGCCATGTGATTGA
AGTAACGCTGATAGTCCTGGATATGCTCGACTTTGATGTAATCCTGGGTATGGATTGGTTGGCCGCTAACCACGCCAGCATAGATTGTTCACGTAAGGAGGTAACGTTTA
ACCCTCCCTCGATGGCCAGTTTTAAATTTAAGGGAGGAGGGTCAAAGTCGTTGCCTCAGGTAATCTCAGCCATCAGGGCCAGTAAACTGCTCAGTCAGGGTACTTGGGGT
ATCTTAGCGAGCGTGGTGGATACTAGAGAGGTCGATGTATCCCTGTCGTCAGAACCGGTGGTGAGGGACTATCCGGATGTCTTTCCTGAAGAACTTCCAGGGTTACCTCC
GCACAGAGAGGTTGAGTTTGCCATAGAGTTGGAGCCGGGCACGGTTCCTATATCCAGAGCCCCTTACAGAATGGCCCCCGCAGAACTGAAAGAACTGAAGGTACAGTTAC
AGGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAGTTTTATTTGTTAAGAAGAAGGATGGATCGATGCGTCTATGCATTGACTATAGG
GAGTTGAACAAGGTAACGTAA
Protein sequence
MPPRRGARRGGRGGRGRGAGHVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMREQQKPASPTPAPAPAPAPAPAPAPAPAPAPAPVPVAPQFVPDQLSAEAKHL
RDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDKGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQ
YDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPAPQRNFRSGGEFRRFQQKPFEAGE
AARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEKAGTVVIGTLPVLGHYALVLFDSGSSHSFISSAFVLHARLE
VEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWG
ILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYR
ELNKVT
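
As a quick consistency check on this record, the genome location, the CDS and the protein should agree in length. The sketch below parses the location string and compares the locus span with the CDS length expected for the protein. It assumes the coordinates are 1-based and inclusive, and it takes the protein length as 666 residues, counted from the listing above; the helper names are hypothetical.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into chromosome, start and end."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1


def expected_cds_length(protein_length: int) -> int:
    # One codon per residue plus one stop codon.
    return 3 * protein_length + 3


chrom, start, end = parse_location("chr02:4119025..4121025")
locus_bp = span_length(start, end)
print(chrom, locus_bp, expected_cds_length(666))
```

Under these assumptions the locus span equals the expected CDS length, which fits a single-exon gene and is consistent with the CDS and mRNA listings being the same sequence.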