; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0003294 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0003294
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionReverse transcriptase
Genome locationchr11:19660962..19663442
RNA-Seq ExpressionPay0003294
SyntenyPay0003294
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domainsIPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR005162 - Retrotransposon gag domain
IPR001969 - Aspartic peptidase, active site
IPR000477 - Reverse transcriptase domain
IPR001878 - Zinc finger, CCHC-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036671.1 gag protease polyprotein [Cucumis melo var. makuwa]0.0e+0092.24Show/hide
Query:  MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV
        +IMQMREQQKPASPTPAPAP PAPA VPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRA+MWLSSLETIFRYMKCPEDQKVQCAV
Subjt:  MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV

Query:  FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL
        FMLTDRGTAWWETTE MLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL
Subjt:  FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL

Query:  VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF
        VR FRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFR GGEFRSFQQKPFEAGEAARGK LCTTCGKHHLGRCL GTRTCF
Subjt:  VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF

Query:  KCRQEGHTADRCPLR-------------------------AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC
        KCRQEGHTADRCPLR                         AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV HARLEVEPLHHVLSVSTPSGEC
Subjt:  KCRQEGHTADRCPLR-------------------------AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC

Query:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
        MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD

Query:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------
        TRE +VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP               
Subjt:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------

Query:  -------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
               VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  -------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLA YYRRFVENFSRIATPL
Subjt:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Query:  TQLTRKGAPFVWSKACEDSFQTLNR
        TQLTRKG PFVWSKACEDSFQ L +
Subjt:  TQLTRKGAPFVWSKACEDSFQTLNR

KAA0038231.1 gag protease polyprotein [Cucumis melo var. makuwa]0.0e+0092.98Show/hide
Query:  MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV
        MIMQMREQQKPASPTPAPAPAPAPA +PAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV
Subjt:  MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV

Query:  FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL
        FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFV+GLRLDIQGL
Subjt:  FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL

Query:  VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF
        VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVP+RNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF
Subjt:  VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF

Query:  KCRQEGHTADRCPLR-------------------------AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC
        KCRQEGHTADRCPLR                         AEKAGTVV+GTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC
Subjt:  KCRQEGHTADRCPLR-------------------------AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC

Query:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
        MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTW ILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD

Query:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------
        TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP               
Subjt:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------

Query:  -------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
               VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  -------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        DILIYSKTEAEH+EHLRMVLQTLRDNKLY+KFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
Subjt:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Query:  TQLTRKGAPFVWSKACEDSFQTLNRS
        TQLTRKGAPFVWSKACEDSFQTL ++
Subjt:  TQLTRKGAPFVWSKACEDSFQTLNRS

KAA0043391.1 pol protein [Cucumis melo var. makuwa]0.0e+0092.48Show/hide
Query:  MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV
        +IMQMREQQKPASPTPA APAPAPA  PAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV
Subjt:  MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV

Query:  FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL
        FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLN+EQG+MTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL
Subjt:  FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL

Query:  VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF
        VRAFRPATHADALRLAVDLSLQERANSSKTAGRG TSGQKRKAEQQPVPVPQRNFR GGEFR FQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF
Subjt:  VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF

Query:  KCRQEGHTADRCPLR-------------------------AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC
        KCRQEGHTADRCPLR                         AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV HARLEVEPLHHVLSVSTPSGEC
Subjt:  KCRQEGHTADRCPLR-------------------------AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC

Query:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
        MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD

Query:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------
        TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP               
Subjt:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------

Query:  -------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
               VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  -------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYY RFVENFS IATPL
Subjt:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Query:  TQLTRKGAPFVWSKACEDSFQTLNR
        TQLTRKGAPFVWSKACEDSFQ L +
Subjt:  TQLTRKGAPFVWSKACEDSFQTLNR

KAA0046014.1 gag protease polyprotein [Cucumis melo var. makuwa]0.0e+0092.36Show/hide
Query:  MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV
        MIMQMREQQKP SPTPAPAPAPAPA VPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFD SLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCA+
Subjt:  MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV

Query:  FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL
        FMLT+RGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGD+TVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL
Subjt:  FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL

Query:  VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF
        VRAFRP THADALRL VDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHL RCLFGTRTCF
Subjt:  VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF

Query:  KCRQEGHTADRCPLR-------------------------AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC
        KCRQEGHTAD+CPLR                         AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC
Subjt:  KCRQEGHTADRCPLR-------------------------AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC

Query:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
        MLSKEKVKACQIEI+GHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPS+ASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD

Query:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------
        TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP               
Subjt:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------

Query:  -------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
               VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKT FRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  -------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK GVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
Subjt:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Query:  TQLTRKGAPFVWSKACEDSFQTLNR
        TQLTRKGAPFVWSKACEDSFQTL +
Subjt:  TQLTRKGAPFVWSKACEDSFQTLNR

TYK01613.1 pol protein [Cucumis melo var. makuwa]0.0e+0093.94Show/hide
Query:  MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV
        MIMQMREQQKPASPTPAPAPAPAPA VPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV
Subjt:  MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV

Query:  FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL
        FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL
Subjt:  FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL

Query:  VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF
        VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF
Subjt:  VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF

Query:  KCRQEGHTADRCPLR-------------------------AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC
        KCRQEGHTADRCPLR                         AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC
Subjt:  KCRQEGHTADRCPLR-------------------------AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC

Query:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
        MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD

Query:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------
        TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP               
Subjt:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------

Query:  -------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
               VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  -------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
Subjt:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Query:  TQLTRKGAPFVWSKACEDSFQTLNR
        TQLTRKGAPFVWSKACEDSFQTL +
Subjt:  TQLTRKGAPFVWSKACEDSFQTLNR

TrEMBL top hitse value%identityAlignment
A0A5A7T538 Reverse transcriptase0.0e+0092.24Show/hide
Query:  MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV
        +IMQMREQQKPASPTPAPAP PAPA VPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRA+MWLSSLETIFRYMKCPEDQKVQCAV
Subjt:  MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV

Query:  FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL
        FMLTDRGTAWWETTE MLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL
Subjt:  FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL

Query:  VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF
        VR FRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFR GGEFRSFQQKPFEAGEAARGK LCTTCGKHHLGRCL GTRTCF
Subjt:  VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF

Query:  KCRQEGHTADRCPLR-------------------------AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC
        KCRQEGHTADRCPLR                         AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV HARLEVEPLHHVLSVSTPSGEC
Subjt:  KCRQEGHTADRCPLR-------------------------AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC

Query:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
        MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD

Query:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------
        TRE +VSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP               
Subjt:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------

Query:  -------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
               VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  -------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLA YYRRFVENFSRIATPL
Subjt:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Query:  TQLTRKGAPFVWSKACEDSFQTLNR
        TQLTRKG PFVWSKACEDSFQ L +
Subjt:  TQLTRKGAPFVWSKACEDSFQTLNR

A0A5A7T9B7 Gag protease polyprotein0.0e+0092.98Show/hide
Query:  MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV
        MIMQMREQQKPASPTPAPAPAPAPA +PAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV
Subjt:  MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV

Query:  FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL
        FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFV+GLRLDIQGL
Subjt:  FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL

Query:  VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF
        VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVP+RNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF
Subjt:  VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF

Query:  KCRQEGHTADRCPLR-------------------------AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC
        KCRQEGHTADRCPLR                         AEKAGTVV+GTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC
Subjt:  KCRQEGHTADRCPLR-------------------------AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC

Query:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
        MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTW ILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD

Query:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------
        TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP               
Subjt:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------

Query:  -------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
               VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  -------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        DILIYSKTEAEH+EHLRMVLQTLRDNKLY+KFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
Subjt:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Query:  TQLTRKGAPFVWSKACEDSFQTLNRS
        TQLTRKGAPFVWSKACEDSFQTL ++
Subjt:  TQLTRKGAPFVWSKACEDSFQTLNRS

A0A5A7TP96 Reverse transcriptase0.0e+0092.48Show/hide
Query:  MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV
        +IMQMREQQKPASPTPA APAPAPA  PAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV
Subjt:  MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV

Query:  FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL
        FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLN+EQG+MTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL
Subjt:  FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL

Query:  VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF
        VRAFRPATHADALRLAVDLSLQERANSSKTAGRG TSGQKRKAEQQPVPVPQRNFR GGEFR FQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF
Subjt:  VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF

Query:  KCRQEGHTADRCPLR-------------------------AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC
        KCRQEGHTADRCPLR                         AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFV HARLEVEPLHHVLSVSTPSGEC
Subjt:  KCRQEGHTADRCPLR-------------------------AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC

Query:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
        MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD

Query:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------
        TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP               
Subjt:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------

Query:  -------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
               VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  -------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYY RFVENFS IATPL
Subjt:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Query:  TQLTRKGAPFVWSKACEDSFQTLNR
        TQLTRKGAPFVWSKACEDSFQ L +
Subjt:  TQLTRKGAPFVWSKACEDSFQTLNR

A0A5A7TSI1 Gag protease polyprotein0.0e+0092.36Show/hide
Query:  MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV
        MIMQMREQQKP SPTPAPAPAPAPA VPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFD SLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCA+
Subjt:  MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV

Query:  FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL
        FMLT+RGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGD+TVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL
Subjt:  FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL

Query:  VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF
        VRAFRP THADALRL VDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHL RCLFGTRTCF
Subjt:  VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF

Query:  KCRQEGHTADRCPLR-------------------------AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC
        KCRQEGHTAD+CPLR                         AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC
Subjt:  KCRQEGHTADRCPLR-------------------------AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC

Query:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
        MLSKEKVKACQIEI+GHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPS+ASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD

Query:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------
        TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP               
Subjt:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------

Query:  -------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
               VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKT FRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  -------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK GVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
Subjt:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Query:  TQLTRKGAPFVWSKACEDSFQTLNR
        TQLTRKGAPFVWSKACEDSFQTL +
Subjt:  TQLTRKGAPFVWSKACEDSFQTLNR

A0A5D3BPI1 Reverse transcriptase0.0e+0093.94Show/hide
Query:  MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV
        MIMQMREQQKPASPTPAPAPAPAPA VPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV
Subjt:  MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAV

Query:  FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL
        FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL
Subjt:  FMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGL

Query:  VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF
        VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF
Subjt:  VRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCF

Query:  KCRQEGHTADRCPLR-------------------------AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC
        KCRQEGHTADRCPLR                         AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC
Subjt:  KCRQEGHTADRCPLR-------------------------AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGEC

Query:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
        MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD

Query:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------
        TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP               
Subjt:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------

Query:  -------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
               VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  -------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
Subjt:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Query:  TQLTRKGAPFVWSKACEDSFQTLNR
        TQLTRKGAPFVWSKACEDSFQTL +
Subjt:  TQLTRKGAPFVWSKACEDSFQTLNR

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.69.9e-4836.15Show/hide
Query:  YRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------------------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRI
        Y    A  +E++ Q+Q++L++G IR S SP+ +P                           +TV +R+P+P +D++  +L     F+ IDL  G+HQ+ +
Subjt:  YRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------------------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRI

Query:  KDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG
          E V KTAF +++GHYE++ M FGL NAPA F   MN + R  L+   +V++DDI+++S +  EH + L +V + L    L  +  KCEF  ++ +FLG
Subjt:  KDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG

Query:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRK
        HV++  G+  +P KIEA+  +  P+   E+++FLGL GYYR+F+ NF+ IA P+T+  +K
Subjt:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRK

P20825 Retrovirus-related Pol polyprotein from transposon 2976.2e-5036.09Show/hide
Query:  PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------------------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSG
        PI    Y +A     E++ Q+QE+L++G IR S SP+ +P                           +T+ +RYP+P +D++  +L     F+ IDL  G
Subjt:  PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP---------------------------VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSG

Query:  YHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLK
        +HQ+ + +E + KTAF ++ GHYE++ M FGL NAPA F   MN + R  L+   +V++DDI+I+S +  EH   +++V   L D  L  +  KCEF  K
Subjt:  YHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLK

Query:  QVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRK
        + +FLGH+V+  G+  +P K++A+  +  P+   E+R+FLGL GYYR+F+ N++ IA P+T   +K
Subjt:  QVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein6.6e-4435.89Show/hide
Query:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPV----------------------TVKNR
        Y ++   +LP  P    +  V+  IE++PG       PY +     +E+   +Q+LLD  FI PS SP  +PV                      T+ + 
Subjt:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPV----------------------TVKNR

Query:  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE
        +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++ +D  KTAF +  G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++  EH 
Subjt:  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE

Query:  EHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        +HL  VL+ L++  L  K  KC+F  ++  FLG+ +    ++    K  A+  +  P TV + + FLG+  YYRRF+ N S+IA P+
Subjt:  EHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus9.9e-4831.76Show/hide
Query:  GHVIEVTLIVLDML-DFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLS----QGTWGILASVVDTREADVSLSS
        G+  ++T  VL  L  FD I+G D L    A +D     +   P           G K      ++I  + LL+     GT  IL S++           
Subjt:  GHVIEVTLIVLDML-DFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLS----QGTWGILASVVDTREADVSLSS

Query:  EPVVRDYPDVFPEELPGLPPHREVEFAIELEPGT---VPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP----------------------
             ++P +F   L G+     VE A++ E  T    PI    Y        E++ Q+ ELL  G IRPS SP+ +P                      
Subjt:  EPVVRDYPDVFPEELPGLPPHREVEFAIELEPGT---VPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP----------------------

Query:  -----VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDI
             VT+ + YP+P I+     L  A  F+ +DL SG+HQ+ +K+ D+PKTAF +  G YEF+ + FGL NAPA+F  +++ + RE +     V+IDDI
Subjt:  -----VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDI

Query:  LIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQ
        +++S+    H ++LR+VL +L    L     K  F   QV FLG++V+  G+  DP K+ A++    P++V E++ FLG+  YYR+F+++++++A PLT 
Subjt:  LIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQ

Query:  LTR
        LTR
Subjt:  LTR

Q99315 Transposon Ty3-G Gag-Pol polyprotein6.6e-4435.89Show/hide
Query:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPV----------------------TVKNR
        Y ++   +LP  P    +  V+  IE++PG       PY +     +E+   +Q+LLD  FI PS SP  +PV                      T+ + 
Subjt:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPV----------------------TVKNR

Query:  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE
        +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++ +D  KTAF +  G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++  EH 
Subjt:  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE

Query:  EHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        +HL  VL+ L++  L  K  KC+F  ++  FLG+ +    ++    K  A+  +  P TV + + FLG+  YYRRF+ N S+IA P+
Subjt:  EHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein3.9e-2345.95Show/hide
Query:  HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW
        HL MVLQ    ++ YA   KC F   Q+++LG  H++S  GVS DPAK+EA+ GW  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W
Subjt:  HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW

Query:  SKACEDSFQTL
        ++    +F+ L
Subjt:  SKACEDSFQTL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTATGCAGATGCGGGAGCAGCAGAAGCCTGCCTCGCCAACTCCGGCGCCAGCTCCAGCGCCAGCTCCAGCACGAGTTCCTGCTCCAGCTCCGGCTCCAGTA
CCAGTTGCGCCCCAGTTTGTGCCGGATCAGTTGTCGGCAGAGGCTAAGCATCTGAGGGATTTCAGGAAGTATAATCCCACGACGTTCGATGGGTCTTTGGAGGAC
CCCACCAGGGCTCAGATGTGGTTATCGTCCTTGGAAACCATATTCCGTTACATGAAATGCCCTGAGGATCAGAAGGTTCAGTGTGCTGTTTTTATGTTGACTGAC
AGAGGTACTGCATGGTGGGAGACTACAGAGAGGATGCTAGGTGGTGATGTGAGTCAGATCACGTGGCAGCAGTTCAAGGAGAGTTTCTATGCGAAATTCTTCTCT
GCCAGTTTGAGAGATGCCAAGCGGCAGGAGTTTCTGAACTTAGAGCAGGGTGACATGACAGTGGAGCAGTATGATGCGGAGTTTGACATGTTATCCCGCTTCGCT
CCCGAGATGATAGCGACCGAGGCGGCCAGAGCTGATAAGTTTGTTAGAGGCCTCAGACTGGACATTCAGGGTTTGGTCCGAGCTTTCAGACCCGCTACTCATGCC
GATGCACTGCGCCTGGCAGTGGATCTCAGTTTACAGGAGAGGGCCAACTCGTCTAAGACCGCTGGTAGAGGTTCGACGTCGGGACAGAAGAGGAAGGCTGAGCAG
CAGCCTGTTCCAGTGCCACAGCGGAATTTCAGACCAGGTGGTGAGTTTCGCAGCTTCCAGCAGAAACCTTTTGAGGCAGGGGAGGCTGCCAGAGGAAAGCCGTTG
TGTACCACTTGTGGGAAGCACCATCTGGGCCGTTGCTTATTCGGGACCAGGACCTGCTTTAAGTGCAGGCAAGAGGGTCATACAGCTGATAGATGCCCGTTGAGA
GCTGAGAAGGCAGGCACAGTAGTGACAGGTACGCTCCCAGTGTTGGGGCATTACGCCTTAGTTTTGTTTGATTCGGGTTCGTCACATTCTTTTATCTCTTCCGCA
TTTGTGTCGCATGCCCGCTTAGAGGTAGAGCCCTTACACCATGTTCTATCAGTATCTACTCCTTCCGGGGAATGTATGTTGTCGAAAGAAAAGGTGAAGGCATGC
CAGATTGAGATAGCAGGCCATGTGATTGAAGTAACGCTGATAGTTCTGGATATGCTGGACTTTGATGTAATCCTGGGTATGGATTGGTTGGCCGCTAACCACGCC
AGTATAGATTGTTCACGTAAGGAGGTAACGTTTAACCCTCCCTCGATGGCCAGTTTTAAATTTAAGGGAGGAGGGTCAAAGTCGTTGCCTCAGGTAATCTCAGCC
ATCAGGGCCAGTAAACTGCTCAGTCAGGGTACTTGGGGTATCTTAGCGAGTGTGGTGGATACTAGAGAGGCGGATGTATCCCTGTCATCAGAACCAGTAGTGAGG
GACTATCCGGACGTTTTTCCTGAGGAACTTCCAGGGTTACCTCCGCACAGGGAGGTTGAGTTTGCCATAGAGTTGGAGCCGGGCACGGTCCCTATATCCAGAGCC
CCTTACAGAATGGCCCCCGCAGAACTGAAAGAACTGAAGGTACAGTTACAGGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCA
GTAACCGTAAAGAACAGATATCCCTTGCCCAGGATTGACGATCTATTTGACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCAT
CAGCTGAGAATTAAGGATGAGGATGTACCGAAGACAGCATTTCGTTCCAGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCA
GTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATCGACGATATCTTGATATACTCCAAGACGGAGGCCGAACAC
GAGGAGCATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGCGAGTTTTGGCTGAAGCAGGTGTCCTTTCTGGGCCAC
GTGGTTTCTAAGGCTGGAGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTA
GCAGGCTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGAAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAG
GACAGTTTCCAGACCTTAAACAGAAGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGATTATGCAGATGCGGGAGCAGCAGAAGCCTGCCTCGCCAACTCCGGCGCCAGCTCCAGCGCCAGCTCCAGCACGAGTTCCTGCTCCAGCTCCGGCTCCAGTA
CCAGTTGCGCCCCAGTTTGTGCCGGATCAGTTGTCGGCAGAGGCTAAGCATCTGAGGGATTTCAGGAAGTATAATCCCACGACGTTCGATGGGTCTTTGGAGGAC
CCCACCAGGGCTCAGATGTGGTTATCGTCCTTGGAAACCATATTCCGTTACATGAAATGCCCTGAGGATCAGAAGGTTCAGTGTGCTGTTTTTATGTTGACTGAC
AGAGGTACTGCATGGTGGGAGACTACAGAGAGGATGCTAGGTGGTGATGTGAGTCAGATCACGTGGCAGCAGTTCAAGGAGAGTTTCTATGCGAAATTCTTCTCT
GCCAGTTTGAGAGATGCCAAGCGGCAGGAGTTTCTGAACTTAGAGCAGGGTGACATGACAGTGGAGCAGTATGATGCGGAGTTTGACATGTTATCCCGCTTCGCT
CCCGAGATGATAGCGACCGAGGCGGCCAGAGCTGATAAGTTTGTTAGAGGCCTCAGACTGGACATTCAGGGTTTGGTCCGAGCTTTCAGACCCGCTACTCATGCC
GATGCACTGCGCCTGGCAGTGGATCTCAGTTTACAGGAGAGGGCCAACTCGTCTAAGACCGCTGGTAGAGGTTCGACGTCGGGACAGAAGAGGAAGGCTGAGCAG
CAGCCTGTTCCAGTGCCACAGCGGAATTTCAGACCAGGTGGTGAGTTTCGCAGCTTCCAGCAGAAACCTTTTGAGGCAGGGGAGGCTGCCAGAGGAAAGCCGTTG
TGTACCACTTGTGGGAAGCACCATCTGGGCCGTTGCTTATTCGGGACCAGGACCTGCTTTAAGTGCAGGCAAGAGGGTCATACAGCTGATAGATGCCCGTTGAGA
GCTGAGAAGGCAGGCACAGTAGTGACAGGTACGCTCCCAGTGTTGGGGCATTACGCCTTAGTTTTGTTTGATTCGGGTTCGTCACATTCTTTTATCTCTTCCGCA
TTTGTGTCGCATGCCCGCTTAGAGGTAGAGCCCTTACACCATGTTCTATCAGTATCTACTCCTTCCGGGGAATGTATGTTGTCGAAAGAAAAGGTGAAGGCATGC
CAGATTGAGATAGCAGGCCATGTGATTGAAGTAACGCTGATAGTTCTGGATATGCTGGACTTTGATGTAATCCTGGGTATGGATTGGTTGGCCGCTAACCACGCC
AGTATAGATTGTTCACGTAAGGAGGTAACGTTTAACCCTCCCTCGATGGCCAGTTTTAAATTTAAGGGAGGAGGGTCAAAGTCGTTGCCTCAGGTAATCTCAGCC
ATCAGGGCCAGTAAACTGCTCAGTCAGGGTACTTGGGGTATCTTAGCGAGTGTGGTGGATACTAGAGAGGCGGATGTATCCCTGTCATCAGAACCAGTAGTGAGG
GACTATCCGGACGTTTTTCCTGAGGAACTTCCAGGGTTACCTCCGCACAGGGAGGTTGAGTTTGCCATAGAGTTGGAGCCGGGCACGGTCCCTATATCCAGAGCC
CCTTACAGAATGGCCCCCGCAGAACTGAAAGAACTGAAGGTACAGTTACAGGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCA
GTAACCGTAAAGAACAGATATCCCTTGCCCAGGATTGACGATCTATTTGACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCAT
CAGCTGAGAATTAAGGATGAGGATGTACCGAAGACAGCATTTCGTTCCAGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCA
GTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATCGACGATATCTTGATATACTCCAAGACGGAGGCCGAACAC
GAGGAGCATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGCGAGTTTTGGCTGAAGCAGGTGTCCTTTCTGGGCCAC
GTGGTTTCTAAGGCTGGAGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTA
GCAGGCTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGAAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAG
GACAGTTTCCAGACCTTAAACAGAAGCTAG
Protein sequenceShow/hide protein sequence
MIMQMREQQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTD
RGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHA
DALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLR
AEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHA
SIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRA
PYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPA
VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGL
AGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLNRS