; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0020916 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0020916
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionReverse transcriptase
Genome locationchr06:21176230..21178670
RNA-Seq ExpressionPay0020916
SyntenyPay0020916
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domainsIPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR036875 - Zinc finger, CCHC-type superfamily
IPR032567 - LDOC1-related
IPR021109 - Aspartic peptidase domain superfamily
IPR000477 - Reverse transcriptase domain
IPR001878 - Zinc finger, CCHC-type
IPR001969 - Aspartic peptidase, active site
IPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043391.1 pol protein [Cucumis melo var. makuwa]0.0e+0092.71Show/hide
Query:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS
        MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRD+IMQMREQQKPASPTPA APAPAPAP PAPAPAPVPVAPQFVPDQLS
Subjt:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS

Query:  AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE---------------
        AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE               
Subjt:  AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE---------------

Query:  --RVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP
              +EQG+MTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRG TSGQKRKAEQQP
Subjt:  --RVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP

Query:  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV
        VPVPQRNFR GGEFR FQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLR+TG AQNQGAGAPHQGRVFATN+TEAEKAGTV
Subjt:  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV

Query:  VT-----------VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI
        VT           VLFDSGSSHSFISSAFV HARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI
Subjt:  VT-----------VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI

Query:  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI
        DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI
Subjt:  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI

Query:  SRAPYRMAPAELKELKVH-------------VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
        SRAPYRMAPAELKELKV              VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
Subjt:  SRAPYRMAPAELKELKVH-------------VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK

Query:  DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
        DED+PKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
Subjt:  DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH

Query:  VVSKAGVSV
        VVSKAGVSV
Subjt:  VVSKAGVSV

KAA0046014.1 gag protease polyprotein [Cucumis melo var. makuwa]0.0e+0092.46Show/hide
Query:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS
        MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKP SPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS
Subjt:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS

Query:  AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE---------------
        AEAKHLRDFRKYNPTTFD SLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCA+FMLT+RGTAWWETTERMLGGDVSQITWQQFKE               
Subjt:  AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE---------------

Query:  --RVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP
              LEQGD+TVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRP THADALRL VDLSLQERANSSKTAGRGSTSGQKRKAEQQP
Subjt:  --RVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP

Query:  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV
        VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHL RCLFGTRTCFKCRQEGHTAD+CPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV
Subjt:  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV

Query:  VT-----------VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI
        VT           VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEI+GHVIEVTLIVLDMLDFDVILGMDWLAANHASI
Subjt:  VT-----------VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI

Query:  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI
        DCSRKEVTFNPPS+ASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI
Subjt:  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI

Query:  SRAPYRMAPAELKELKVH-------------VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
        SRAPYRMAPAELKELKV              VSPWGAPVLFVKKKDGSMRLCID RELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
Subjt:  SRAPYRMAPAELKELKVH-------------VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK

Query:  DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
        DED+PKT FRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
Subjt:  DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH

Query:  VVSKAGVSV
        VVSK GVSV
Subjt:  VVSKAGVSV

KAA0053290.1 gag protease polyprotein [Cucumis melo var. makuwa]0.0e+0092.71Show/hide
Query:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS
        MPPRRGARRGGRGGRGRGAGRVQPEV PVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPA VPAPAPAPVPVAPQ VPDQLS
Subjt:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS

Query:  AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE---------------
        AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE               
Subjt:  AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE---------------

Query:  --RVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP
              LEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGR STSGQKRKAEQQP
Subjt:  --RVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP

Query:  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV
        VPVPQRNFRPGGEFRSFQQKPFE GEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAG PHQGRVFATNRTEAEKAGTV
Subjt:  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV

Query:  VT-----------VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI
        VT           VLFDSGSSHSFISSAFV+HARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLA NHASI
Subjt:  VT-----------VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI

Query:  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI
        DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI
Subjt:  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI

Query:  SRAPYRMAPAELKELKVH-------------VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
        SRAPYRMAPAELKELKV              VSPWGAPVLFVKKKDGS+RLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDL SGYHQLRIK
Subjt:  SRAPYRMAPAELKELKVH-------------VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK

Query:  DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
        DED+PKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLR+NKLYAKFSKCEFWLKQVSFLGH
Subjt:  DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH

Query:  VVSKAGVSV
        VVSKAGVSV
Subjt:  VVSKAGVSV

KAA0054678.1 gag protease polyprotein [Cucumis melo var. makuwa]0.0e+0093.33Show/hide
Query:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS
        MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQF+PDQLS
Subjt:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS

Query:  AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE---------------
        AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTA WETTERMLGGDVSQITWQQFKE               
Subjt:  AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE---------------

Query:  --RVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP
              LEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP
Subjt:  --RVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP

Query:  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV
        VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV
Subjt:  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV

Query:  VT-----------VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI
        VT           VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVI VTLIVLDMLDFDVILGMDWLAANHASI
Subjt:  VT-----------VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI

Query:  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI
        DCSRKEVTF+PPSMASFKFKGGGSKSLP+VISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI
Subjt:  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI

Query:  SRAPYRMAPAELKELKVH-------------VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
        SRAPYRMAPAELKELKV              VSPWGAPVLFVKKK+GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
Subjt:  SRAPYRMAPAELKELKVH-------------VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK

Query:  DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
        DED+ KTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
Subjt:  DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH

Query:  VVSKAGVSV
        VVSKAGVSV
Subjt:  VVSKAGVSV

TYK01613.1 pol protein [Cucumis melo var. makuwa]0.0e+0094.07Show/hide
Query:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS
        MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS
Subjt:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS

Query:  AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE---------------
        AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE               
Subjt:  AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE---------------

Query:  --RVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP
              LEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP
Subjt:  --RVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP

Query:  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV
        VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLR+TGIAQNQGAGAPHQGRVFATNRTEAEKAGTV
Subjt:  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV

Query:  VT-----------VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI
        VT           VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI
Subjt:  VT-----------VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI

Query:  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI
        DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI
Subjt:  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI

Query:  SRAPYRMAPAELKELKVH-------------VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
        SRAPYRMAPAELKELKV              VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
Subjt:  SRAPYRMAPAELKELKVH-------------VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK

Query:  DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
        DED+PKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
Subjt:  DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH

Query:  VVSKAGVSV
        VVSKAGVSV
Subjt:  VVSKAGVSV

TrEMBL top hitse value%identityAlignment
A0A5A7TP96 Reverse transcriptase0.0e+0092.71Show/hide
Query:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS
        MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRD+IMQMREQQKPASPTPA APAPAPAP PAPAPAPVPVAPQFVPDQLS
Subjt:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS

Query:  AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE---------------
        AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE               
Subjt:  AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE---------------

Query:  --RVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP
              +EQG+MTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRG TSGQKRKAEQQP
Subjt:  --RVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP

Query:  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV
        VPVPQRNFR GGEFR FQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLR+TG AQNQGAGAPHQGRVFATN+TEAEKAGTV
Subjt:  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV

Query:  VT-----------VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI
        VT           VLFDSGSSHSFISSAFV HARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI
Subjt:  VT-----------VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI

Query:  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI
        DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI
Subjt:  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI

Query:  SRAPYRMAPAELKELKVH-------------VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
        SRAPYRMAPAELKELKV              VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
Subjt:  SRAPYRMAPAELKELKVH-------------VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK

Query:  DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
        DED+PKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
Subjt:  DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH

Query:  VVSKAGVSV
        VVSKAGVSV
Subjt:  VVSKAGVSV

A0A5A7TSI1 Gag protease polyprotein0.0e+0092.46Show/hide
Query:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS
        MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKP SPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS
Subjt:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS

Query:  AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE---------------
        AEAKHLRDFRKYNPTTFD SLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCA+FMLT+RGTAWWETTERMLGGDVSQITWQQFKE               
Subjt:  AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE---------------

Query:  --RVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP
              LEQGD+TVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRP THADALRL VDLSLQERANSSKTAGRGSTSGQKRKAEQQP
Subjt:  --RVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP

Query:  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV
        VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHL RCLFGTRTCFKCRQEGHTAD+CPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV
Subjt:  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV

Query:  VT-----------VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI
        VT           VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEI+GHVIEVTLIVLDMLDFDVILGMDWLAANHASI
Subjt:  VT-----------VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI

Query:  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI
        DCSRKEVTFNPPS+ASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI
Subjt:  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI

Query:  SRAPYRMAPAELKELKVH-------------VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
        SRAPYRMAPAELKELKV              VSPWGAPVLFVKKKDGSMRLCID RELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
Subjt:  SRAPYRMAPAELKELKVH-------------VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK

Query:  DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
        DED+PKT FRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
Subjt:  DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH

Query:  VVSKAGVSV
        VVSK GVSV
Subjt:  VVSKAGVSV

A0A5A7UDK9 Gag protease polyprotein0.0e+0092.71Show/hide
Query:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS
        MPPRRGARRGGRGGRGRGAGRVQPEV PVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPA VPAPAPAPVPVAPQ VPDQLS
Subjt:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS

Query:  AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE---------------
        AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE               
Subjt:  AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE---------------

Query:  --RVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP
              LEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGR STSGQKRKAEQQP
Subjt:  --RVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP

Query:  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV
        VPVPQRNFRPGGEFRSFQQKPFE GEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAG PHQGRVFATNRTEAEKAGTV
Subjt:  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV

Query:  VT-----------VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI
        VT           VLFDSGSSHSFISSAFV+HARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLA NHASI
Subjt:  VT-----------VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI

Query:  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI
        DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI
Subjt:  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI

Query:  SRAPYRMAPAELKELKVH-------------VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
        SRAPYRMAPAELKELKV              VSPWGAPVLFVKKKDGS+RLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDL SGYHQLRIK
Subjt:  SRAPYRMAPAELKELKVH-------------VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK

Query:  DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
        DED+PKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLR+NKLYAKFSKCEFWLKQVSFLGH
Subjt:  DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH

Query:  VVSKAGVSV
        VVSKAGVSV
Subjt:  VVSKAGVSV

A0A5A7UI54 Gag protease polyprotein0.0e+0093.33Show/hide
Query:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS
        MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQF+PDQLS
Subjt:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS

Query:  AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE---------------
        AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTA WETTERMLGGDVSQITWQQFKE               
Subjt:  AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE---------------

Query:  --RVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP
              LEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP
Subjt:  --RVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP

Query:  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV
        VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV
Subjt:  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV

Query:  VT-----------VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI
        VT           VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVI VTLIVLDMLDFDVILGMDWLAANHASI
Subjt:  VT-----------VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI

Query:  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI
        DCSRKEVTF+PPSMASFKFKGGGSKSLP+VISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI
Subjt:  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI

Query:  SRAPYRMAPAELKELKVH-------------VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
        SRAPYRMAPAELKELKV              VSPWGAPVLFVKKK+GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
Subjt:  SRAPYRMAPAELKELKVH-------------VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK

Query:  DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
        DED+ KTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
Subjt:  DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH

Query:  VVSKAGVSV
        VVSKAGVSV
Subjt:  VVSKAGVSV

A0A5D3BPI1 Reverse transcriptase0.0e+0094.07Show/hide
Query:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS
        MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS
Subjt:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLS

Query:  AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE---------------
        AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE               
Subjt:  AEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKE---------------

Query:  --RVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP
              LEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP
Subjt:  --RVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP

Query:  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV
        VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLR+TGIAQNQGAGAPHQGRVFATNRTEAEKAGTV
Subjt:  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTV

Query:  VT-----------VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI
        VT           VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI
Subjt:  VT-----------VLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI

Query:  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI
        DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI
Subjt:  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI

Query:  SRAPYRMAPAELKELKVH-------------VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
        SRAPYRMAPAELKELKV              VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
Subjt:  SRAPYRMAPAELKELKVH-------------VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK

Query:  DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
        DED+PKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
Subjt:  DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH

Query:  VVSKAGVSV
        VVSKAGVSV
Subjt:  VVSKAGVSV

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein9.8e-4036.02Show/hide
Query:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVHVSPW-------------GAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +   ++                 PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVHVSPW-------------GAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT
          ++QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT

Query:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ
        L++  L    +KCEF   QV F+G+ +S+ G +  Q
Subjt:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ

P0CT35 Transposon Tf2-2 polyprotein9.8e-4036.02Show/hide
Query:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVHVSPW-------------GAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +   ++                 PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVHVSPW-------------GAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT
          ++QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT

Query:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ
        L++  L    +KCEF   QV F+G+ +S+ G +  Q
Subjt:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ

P0CT41 Transposon Tf2-12 polyprotein9.8e-4036.02Show/hide
Query:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVHVSPW-------------GAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +   ++                 PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVHVSPW-------------GAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT
          ++QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT

Query:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ
        L++  L    +KCEF   QV F+G+ +S+ G +  Q
Subjt:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.0e-4037.96Show/hide
Query:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKVHV-------------SPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR
        Y ++   +LP  P    +  V+  IE++PG       PY +     +E+   V             SP  +PV+ V KKDG+ RLC+DYR LNK T+ + 
Subjt:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKVHV-------------SPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR

Query:  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE
        +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++ +D  KTAF +  G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++  EH 
Subjt:  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE

Query:  EHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ
        +HL  VL+ L++  L  K  KC+F  ++  FLG+ +    ++ +Q
Subjt:  EHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.0e-4037.96Show/hide
Query:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKVHV-------------SPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR
        Y ++   +LP  P    +  V+  IE++PG       PY +     +E+   V             SP  +PV+ V KKDG+ RLC+DYR LNK T+ + 
Subjt:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKVHV-------------SPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR

Query:  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE
        +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++ +D  KTAF +  G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++  EH 
Subjt:  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE

Query:  EHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ
        +HL  VL+ L++  L  K  KC+F  ++  FLG+ +    ++ +Q
Subjt:  EHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ

Arabidopsis top hitse value%identityAlignment
AT2G15180.1 Zinc knuckle (CCHC-type) family protein8.9e-0433.73Show/hide
Query:  GRCLFGTR----TCFKCRQEGHTADRCPLRVT----GIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTVLFDSGSSHSFISSA
        G+C F T+    TC++C+QEGH A  CP R T    G+ Q +   A  +  + ++ + + E+A T  T L +S +  + +SSA
Subjt:  GRCLFGTR----TCFKCRQEGHTADRCPLRVT----GIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTVLFDSGSSHSFISSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCACCAAGGAGAGGTGCACGAAGGGGTGGCCGAGGAGGCCGAGGAAGGGGAGCAGGACGCGTTCAGCCTGAGGTGCAGCCTGTAGCCCAAGCCCCTGACCCGGCTGC
GCCAGTTACTCATGCGGACCTAGCCGCCATGGAGCAGAGGTTTAGAGATATGATTATGCAAATGCGGGAGCAGCAGAAGCCTGCCTCGCCAACTCCGGCGCCAGCTCCAG
CGCCAGCTCCAGCACCAGTTCCTGCTCCAGCTCCGGCTCCGGTACCAGTTGCACCCCAGTTTGTGCCGGATCAGTTGTCGGCAGAGGCTAAGCACCTGAGGGATTTCAGG
AAGTATAATCCCACGACGTTTGATGGGTCTTTGGAGGACCCCACCAGAGCTCAGATGTGGCTATCGTCCTTGGAGACCATATTCCGTTACATGAAATGCCCTGAGGATCA
GAAGGTTCAGTGTGCTGTTTTTATGTTGACTGACAGAGGTACTGCATGGTGGGAGACTACAGAGAGGATGCTAGGTGGTGATGTGAGTCAGATCACGTGGCAGCAGTTCA
AGGAGAGAGTTTCTGAACTAGAGCAGGGTGACATGACAGTGGAGCAGTATGATGCGGAGTTTGACATGTTATCCCGCTTCGCTCCCGAGATGATAGCGACCGAGGCGGCC
AGAGCTGATAAGTTTGTTAGAGGCCTCAGACTGGACATTCAGGGTTTGGTCCGAGCTTTCAGACCCGCTACTCATGCCGATGCACTGCGCCTGGCAGTGGATCTCAGTTT
ACAGGAGAGGGCCAACTCGTCTAAGACCGCTGGTAGAGGTTCGACGTCGGGACAGAAGAGGAAGGCTGAGCAGCAGCCTGTTCCAGTGCCACAGCGGAATTTCAGACCAG
GTGGTGAGTTTCGCAGCTTCCAGCAGAAACCTTTTGAGGCAGGGGAGGCTGCCAGAGGGAAGCCGTTGTGTACCACTTGTGGGAAGCACCATCTGGGCCGTTGCTTATTC
GGGACCAGGACCTGCTTTAAGTGCAGGCAAGAGGGTCATACAGCTGATAGATGCCCGTTGAGAGTCACGGGGATCGCGCAGAATCAGGGAGCAGGTGCTCCACATCAGGG
TAGAGTCTTTGCTACCAACAGGACTGAGGCTGAGAAGGCAGGCACAGTAGTGACAGTTTTGTTTGATTCGGGTTCGTCACATTCTTTTATCTCTTCCGCATTTGTGTCGC
ATGCCCGCTTAGAGGTAGAGCCCTTACACCATGTTCTGTCAGTATCTACTCCTTCCGGGGAATGTATGTTGTCGAAGGAAAAGGTGAAGGCATGTCAGATTGAGATAGCA
GGCCATGTGATTGAGGTAACGCTGATAGTTCTGGATATGCTGGACTTTGATGTAATCCTGGGTATGGATTGGTTGGCCGCTAACCACGCCAGCATAGATTGTTCACGTAA
GGAGGTAACGTTTAACCCTCCCTCGATGGCCAGTTTTAAATTTAAGGGAGGAGGGTCAAAGTCGTTGCCTCAGGTAATCTCAGCCATCAGGGCCAGTAAACTGCTCAGTC
AGGGTACTTGGGGTATCTTAGCGAGTGTGGTGGATACTAGAGAGGCGGATGTATCCCTGTCATCAGAACCAGTAGTGAGGGACTATCCGGACGTTTTTCCTGAGGAACTT
CCAGGGTTACCTCCGCACAGAGAGGTTGAGTTTGCCATAGAGTTGGAGCCGGGCACGGTTCCTATATCCAGAGCCCCTTACAGAATGGCCCCCGCAGAATTGAAAGAACT
GAAGGTACACGTGTCACCTTGGGGTGCGCCAGTTTTATTTGTTAAGAAGAAGGATGGATCGATGCGTCTATGCATTGACTATAGGGAGTTGAATAAGGTAACCGTAAAGA
ACAGATATCCCTTGCCCAGGATTGACGATCTATTTGACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAGGAT
GAGGATATACCGAAGACAGCATTTCGATCCAGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAACAG
AGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATCGACGATATCTTGATATACTCCAAGACGGAGGCCGAACACGAGGAGCATTTACGTATGGTTTTGCAAA
CACTTCGGGATAATAAGTTGTACGCAAAGTTCTCGAAGTGCGAGTTTTGGCTGAAGCAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGCTGGAGTCTCTGTGATCCAG
CTAAGATAG
mRNA sequenceShow/hide mRNA sequence
ATGCCACCAAGGAGAGGTGCACGAAGGGGTGGCCGAGGAGGCCGAGGAAGGGGAGCAGGACGCGTTCAGCCTGAGGTGCAGCCTGTAGCCCAAGCCCCTGACCCGGCTGC
GCCAGTTACTCATGCGGACCTAGCCGCCATGGAGCAGAGGTTTAGAGATATGATTATGCAAATGCGGGAGCAGCAGAAGCCTGCCTCGCCAACTCCGGCGCCAGCTCCAG
CGCCAGCTCCAGCACCAGTTCCTGCTCCAGCTCCGGCTCCGGTACCAGTTGCACCCCAGTTTGTGCCGGATCAGTTGTCGGCAGAGGCTAAGCACCTGAGGGATTTCAGG
AAGTATAATCCCACGACGTTTGATGGGTCTTTGGAGGACCCCACCAGAGCTCAGATGTGGCTATCGTCCTTGGAGACCATATTCCGTTACATGAAATGCCCTGAGGATCA
GAAGGTTCAGTGTGCTGTTTTTATGTTGACTGACAGAGGTACTGCATGGTGGGAGACTACAGAGAGGATGCTAGGTGGTGATGTGAGTCAGATCACGTGGCAGCAGTTCA
AGGAGAGAGTTTCTGAACTAGAGCAGGGTGACATGACAGTGGAGCAGTATGATGCGGAGTTTGACATGTTATCCCGCTTCGCTCCCGAGATGATAGCGACCGAGGCGGCC
AGAGCTGATAAGTTTGTTAGAGGCCTCAGACTGGACATTCAGGGTTTGGTCCGAGCTTTCAGACCCGCTACTCATGCCGATGCACTGCGCCTGGCAGTGGATCTCAGTTT
ACAGGAGAGGGCCAACTCGTCTAAGACCGCTGGTAGAGGTTCGACGTCGGGACAGAAGAGGAAGGCTGAGCAGCAGCCTGTTCCAGTGCCACAGCGGAATTTCAGACCAG
GTGGTGAGTTTCGCAGCTTCCAGCAGAAACCTTTTGAGGCAGGGGAGGCTGCCAGAGGGAAGCCGTTGTGTACCACTTGTGGGAAGCACCATCTGGGCCGTTGCTTATTC
GGGACCAGGACCTGCTTTAAGTGCAGGCAAGAGGGTCATACAGCTGATAGATGCCCGTTGAGAGTCACGGGGATCGCGCAGAATCAGGGAGCAGGTGCTCCACATCAGGG
TAGAGTCTTTGCTACCAACAGGACTGAGGCTGAGAAGGCAGGCACAGTAGTGACAGTTTTGTTTGATTCGGGTTCGTCACATTCTTTTATCTCTTCCGCATTTGTGTCGC
ATGCCCGCTTAGAGGTAGAGCCCTTACACCATGTTCTGTCAGTATCTACTCCTTCCGGGGAATGTATGTTGTCGAAGGAAAAGGTGAAGGCATGTCAGATTGAGATAGCA
GGCCATGTGATTGAGGTAACGCTGATAGTTCTGGATATGCTGGACTTTGATGTAATCCTGGGTATGGATTGGTTGGCCGCTAACCACGCCAGCATAGATTGTTCACGTAA
GGAGGTAACGTTTAACCCTCCCTCGATGGCCAGTTTTAAATTTAAGGGAGGAGGGTCAAAGTCGTTGCCTCAGGTAATCTCAGCCATCAGGGCCAGTAAACTGCTCAGTC
AGGGTACTTGGGGTATCTTAGCGAGTGTGGTGGATACTAGAGAGGCGGATGTATCCCTGTCATCAGAACCAGTAGTGAGGGACTATCCGGACGTTTTTCCTGAGGAACTT
CCAGGGTTACCTCCGCACAGAGAGGTTGAGTTTGCCATAGAGTTGGAGCCGGGCACGGTTCCTATATCCAGAGCCCCTTACAGAATGGCCCCCGCAGAATTGAAAGAACT
GAAGGTACACGTGTCACCTTGGGGTGCGCCAGTTTTATTTGTTAAGAAGAAGGATGGATCGATGCGTCTATGCATTGACTATAGGGAGTTGAATAAGGTAACCGTAAAGA
ACAGATATCCCTTGCCCAGGATTGACGATCTATTTGACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAGGAT
GAGGATATACCGAAGACAGCATTTCGATCCAGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAACAG
AGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATCGACGATATCTTGATATACTCCAAGACGGAGGCCGAACACGAGGAGCATTTACGTATGGTTTTGCAAA
CACTTCGGGATAATAAGTTGTACGCAAAGTTCTCGAAGTGCGAGTTTTGGCTGAAGCAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGCTGGAGTCTCTGTGATCCAG
CTAAGATAG
Protein sequenceShow/hide protein sequence
MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFR
KYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKERVSELEQGDMTVEQYDAEFDMLSRFAPEMIATEAA
RADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLF
GTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIA
GHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEEL
PGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVHVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD
EDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVIQ
LR