CuGenDBv2

Cmc05g0132011 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc05g0132011
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr05:11885734..11887818
RNA-Seq Expression: Cmc05g0132011
Synteny: Cmc05g0132011
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0037244.1 reverse transcriptase [Cucumis melo var. makuwa]; e-value 0.0e+00; identity 97.84%
Query:  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG
        MLDFDVILGMDWLAANHASIDCSRKEVTFNPPS+ASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRE DVSLS EPVVRDYPDVFPEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG

Query:  LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
        GATVFSKIDLRSGYHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTE EHEEHLRMVLQTLRDNK
Subjt:  GATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK

Query:  LYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        LYA F KCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKL
Subjt:  LYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VTAPVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA
        RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQL VQPTLRQRIIDAQSNDPYLVEKRGLAE GQ  
Subjt:  RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA

Query:  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE
        EFSLSSDGGLLFERRLCVPSDSA+KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKC VCQQVKAPRQKPAGLLQPLSIPE
Subjt:  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE

KAA0043391.1 pol protein [Cucumis melo var. makuwa]; e-value 0.0e+00; identity 97.41%
Query:  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG
        MLDFDVILGMDWLAANHASIDCSRKEVTFNPPS+ASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG

Query:  LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
        GATVFSKIDLRSGYHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
Subjt:  GATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK

Query:  LYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        LYA FSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYY RFVENFS IATPLTQLTRKGAPFVWSKACEDSFQ LKQKL
Subjt:  LYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VTAPVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSH+QNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA
        RWLELVKDYDCE LYHPGKANVVADALSRKVSHSAALITRQA LHRDLERA+IAVSVGAVTMQLAQL VQPTLRQRI+DAQSNDPYLVEKRGLAEAGQ  
Subjt:  RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA

Query:  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE
        EFSLSSDGGLLFERRLCVPSDSAVK ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+C VCQQVKAPRQKPAGLLQPLSIPE
Subjt:  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE

KAA0046094.1 gag protease polyprotein [Cucumis melo var. makuwa]; e-value 0.0e+00; identity 97.84%
Query:  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG
        MLDFDVILGMDWLAANHASIDCSRKEVTFNPPS+ASFKFKGGGSKSLPQVISAIRASKLLSQGTWGIL SVVDTREADVSLSSEPVVRDYPDVFPEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG

Query:  LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK TVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
        GATVFSKIDLRSGYHQLRIKDED+PKTAF SRYGHYEFIVMSFGLTNAPAVFM+LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
Subjt:  GATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK

Query:  LYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        LYA FSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKL
Subjt:  LYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VTAPVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA
        RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER EIAVSVGAVTMQLAQL VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQT 
Subjt:  RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA

Query:  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE
        EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP STK+YQDLKRVYWWRNMKREVAEFVSKC VCQQVKAPRQKPAGLLQPLSIPE
Subjt:  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE

KAA0048687.1 pol protein [Cucumis melo var. makuwa]; e-value 0.0e+00; identity 97.84%
Query:  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG
        MLDFDVILGMDWLAA+HASIDCSRKEVTFNPPS ASFKFKGGGS+SLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG

Query:  LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
        GATVFSKIDLRSGYHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
Subjt:  GATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK

Query:  LYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        LYA FSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKL
Subjt:  LYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VTAPVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA
        RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQL VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ  
Subjt:  RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA

Query:  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE
        EFSLSSDGGLLFERRLCVPSDS VKTELLSEAHSSPFSMHPGSTKMY+D+KRVYWWRNMKREVAEFVS+C VCQQVKAPRQKPAGLLQPLSIPE
Subjt:  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE

TYK01613.1 pol protein [Cucumis melo var. makuwa]; e-value 0.0e+00; identity 99.14%
Query:  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG
        MLDFDVILGMDWLAANHASIDCSRKEVTFNPPS+ASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG

Query:  LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
        GATVFSKIDLRSGYHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
Subjt:  GATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK

Query:  LYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        LYA FSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
Subjt:  LYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VTAPVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA
        RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQL VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA
Subjt:  RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA

Query:  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE
        EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKC VCQQVKAPRQKPAGLLQPLSIPE
Subjt:  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7T190 Reverse transcriptase; e-value 0.0e+00; identity 97.84%
Query:  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG
        MLDFDVILGMDWLAANHASIDCSRKEVTFNPPS+ASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRE DVSLS EPVVRDYPDVFPEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG

Query:  LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
        GATVFSKIDLRSGYHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTE EHEEHLRMVLQTLRDNK
Subjt:  GATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK

Query:  LYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        LYA F KCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKL
Subjt:  LYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VTAPVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA
        RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQL VQPTLRQRIIDAQSNDPYLVEKRGLAE GQ  
Subjt:  RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA

Query:  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE
        EFSLSSDGGLLFERRLCVPSDSA+KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKC VCQQVKAPRQKPAGLLQPLSIPE
Subjt:  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE

A0A5A7TP96 Reverse transcriptase; e-value 0.0e+00; identity 97.41%
Query:  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG
        MLDFDVILGMDWLAANHASIDCSRKEVTFNPPS+ASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG

Query:  LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
        GATVFSKIDLRSGYHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
Subjt:  GATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK

Query:  LYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        LYA FSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYY RFVENFS IATPLTQLTRKGAPFVWSKACEDSFQ LKQKL
Subjt:  LYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VTAPVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSH+QNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA
        RWLELVKDYDCE LYHPGKANVVADALSRKVSHSAALITRQA LHRDLERA+IAVSVGAVTMQLAQL VQPTLRQRI+DAQSNDPYLVEKRGLAEAGQ  
Subjt:  RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA

Query:  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE
        EFSLSSDGGLLFERRLCVPSDSAVK ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+C VCQQVKAPRQKPAGLLQPLSIPE
Subjt:  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE

A0A5A7TSQ8 Reverse transcriptase; e-value 0.0e+00; identity 97.84%
Query:  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG
        MLDFDVILGMDWLAANHASIDCSRKEVTFNPPS+ASFKFKGGGSKSLPQVISAIRASKLLSQGTWGIL SVVDTREADVSLSSEPVVRDYPDVFPEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG

Query:  LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK TVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
        GATVFSKIDLRSGYHQLRIKDED+PKTAF SRYGHYEFIVMSFGLTNAPAVFM+LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
Subjt:  GATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK

Query:  LYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        LYA FSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKL
Subjt:  LYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VTAPVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA
        RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER EIAVSVGAVTMQLAQL VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQT 
Subjt:  RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA

Query:  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE
        EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP STK+YQDLKRVYWWRNMKREVAEFVSKC VCQQVKAPRQKPAGLLQPLSIPE
Subjt:  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE

A0A5A7U330 Reverse transcriptase; e-value 0.0e+00; identity 97.84%
Query:  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG
        MLDFDVILGMDWLAA+HASIDCSRKEVTFNPPS ASFKFKGGGS+SLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG

Query:  LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
        GATVFSKIDLRSGYHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
Subjt:  GATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK

Query:  LYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        LYA FSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKL
Subjt:  LYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VTAPVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA
        RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQL VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ  
Subjt:  RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA

Query:  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE
        EFSLSSDGGLLFERRLCVPSDS VKTELLSEAHSSPFSMHPGSTKMY+D+KRVYWWRNMKREVAEFVS+C VCQQVKAPRQKPAGLLQPLSIPE
Subjt:  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE

A0A5D3BPI1 Reverse transcriptase; e-value 0.0e+00; identity 99.14%
Query:  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG
        MLDFDVILGMDWLAANHASIDCSRKEVTFNPPS+ASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG
Subjt:  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPG

Query:  LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
        LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ
Subjt:  LPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
        GATVFSKIDLRSGYHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
Subjt:  GATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK

Query:  LYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        LYA FSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
Subjt:  LYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VTAPVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA
        RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQL VQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA
Subjt:  RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTA

Query:  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE
        EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKC VCQQVKAPRQKPAGLLQPLSIPE
Subjt:  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE

SwissProt top hits (e-value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein; e-value 2.8e-94; identity 32.51%
Query:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT
          ++QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT

Query:  LRDNKLYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQT
        L++  L  N +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + + 
Subjt:  LRDNKLYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQT

Query:  LKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKY
        +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L  
Subjt:  LKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKY

Query:  FFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPY
          T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  
Subjt:  FFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPY

Query:  LVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPA
        L+    L    +  E ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP 
Subjt:  LVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPA

Query:  GLLQPL
        G LQP+
Subjt:  GLLQPL

P0CT35 Transposon Tf2-2 polyprotein; e-value 2.8e-94; identity 32.51%
Query:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT
          ++QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT

Query:  LRDNKLYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQT
        L++  L  N +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + + 
Subjt:  LRDNKLYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQT

Query:  LKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKY
        +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L  
Subjt:  LKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKY

Query:  FFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPY
          T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  
Subjt:  FFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPY

Query:  LVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPA
        L+    L    +  E ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP 
Subjt:  LVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPA

Query:  GLLQPL
        G LQP+
Subjt:  GLLQPL

P0CT36 Transposon Tf2-3 polyprotein; e-value 2.8e-94; identity 32.51%
Query:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT
          ++QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT

Query:  LRDNKLYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQT
        L++  L  N +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + + 
Subjt:  LRDNKLYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQT

Query:  LKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKY
        +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L  
Subjt:  LKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKY

Query:  FFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPY
          T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  
Subjt:  FFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPY

Query:  LVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPA
        L+    L    +  E ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP 
Subjt:  LVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPA

Query:  GLLQPL
        G LQP+
Subjt:  GLLQPL

P0CT37 Transposon Tf2-4 polyprotein; e-value 2.8e-94; identity 32.51%
Query:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT
          ++QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT

Query:  LRDNKLYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQT
        L++  L  N +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + + 
Subjt:  LRDNKLYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQT

Query:  LKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKY
        +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L  
Subjt:  LKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKY

Query:  FFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPY
          T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  
Subjt:  FFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPY

Query:  LVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPA
        L+    L    +  E ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP 
Subjt:  LVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPA

Query:  GLLQPL
        G LQP+
Subjt:  GLLQPL

P0CT41 Transposon Tf2-12 polyprotein    2.8e-94    32.51
Query:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT
          ++QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQT

Query:  LRDNKLYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQT
        L++  L  N +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + + 
Subjt:  LRDNKLYANFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQT

Query:  LKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKY
        +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L  
Subjt:  LKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKY

Query:  FFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPY
          T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  
Subjt:  FFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPY

Query:  LVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPA
        L+    L    +  E ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP 
Subjt:  LVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCFVCQQVKAPRQKPA

Query:  GLLQPL
        G LQP+
Subjt:  GLLQPL

Arabidopsis top hits    e value    %identity    Alignment
ATMG00860.1 DNA/RNA polymerases superfamily protein    2.7e-28    47.2
Query:  HLRMVLQTLRDNKLYANFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW
        HL MVLQ    ++ YAN  KC F   Q+++LG  H++S  GVS DPAK+EA+ GW  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W
Subjt:  HLRMVLQTLRDNKLYANFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW

Query:  SKACEDSFQTLKQKLVTAPVLTVPD
        ++    +F+ LK  + T PVL +PD
Subjt:  SKACEDSFQTLKQKLVTAPVLTVPD


Sequences
CDS sequence
ATGCTGGACTTTGATGTAATCCTGGGTATGGATTGGCTGGCCGCTAACCACGCTAGCATAGATTGTTCACGTAAGGAGGTAACGTTTAACCCTCCCTCGTTGGCCAGTTT
TAAATTTAAGGGAGGAGGGTCAAAGTCGTTGCCTCAGGTAATCTCAGCCATCAGGGCCAGTAAACTGCTCAGTCAGGGTACTTGGGGTATCTTAGCGAGTGTGGTGGATA
CTAGAGAGGCGGATGTATCCCTGTCATCAGAACCAGTAGTGAGGGACTATCCGGACGTTTTTCCTGAGGAACTTCCAGGGTTACCTCCGCACAGAGAGGTTGAGTTTGCC
ATAGAGTTAGAGCCGGGCACGGTTCCTATATCCAGAGCCCCTTACAGGATGGCCCCCGCAGAACTGAAAGAACTGAAGGTACAGTTACAGGAATTGCTTGACAAGGGATT
CATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAGTCTTATTCGTTAAGAAGAAGGATGGGTCGATGCGTCTATGCATTGACTATAGGGAGTTGAACAAAGTAACCGTAA
AGAACAGATATCCCTTGCCCAGGATTGACGATCTATTTGACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAG
GATGAGGATATACCGAAGACAGCATTTCGTTCCAGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAA
CAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATTGTGTTTATCGACGATATCTTGATATACTCCAAGACGGAGGCCGAACACGAGGAGCATTTACGTATGGTTTTGC
AAACACTTCGGGATAATAAGTTGTATGCAAACTTCTCGAAATGCGAGTTTTGGCTGAAGCAGGTGTCATTTCTGGGCCACGTGGTTTCCAAGGCGGGAGTCTCTGTGGAT
CCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGCTATTATCGACGGTTTGTGGAGAACTTTTC
TCGTATAGCTACTCCTCTTACTCAGTTGACCAGAAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTTCAGACCCTTAAACAGAAGTTAGTTACCGCAC
CGGTTCTTACGGTACCTGATGGTTCTGGCAATTTCGTGATTTATAGTGATGCTTCCAAGAAGGGTCTGGGTTGTGTTTTGATGCAGCAGGGTAAGGTGGTCGCTTATGCG
TCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATCTAGAGTTGGCAGCAGTGGTTTTTGCTTTGAAAATATGGAGGCATTATTTATATGGTGAAAAGAT
ACAGATATTCACAGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAAGAATTAAATATGAGACAGCGAAGGTGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATAC
TGTATCATCCAGGCAAGGCAAATGTGGTAGCCGATGCTCTTAGTAGGAAAGTGTCACATTCAGCAGCACTCATTACCCGGCAGGCCCCATTGCATCGGGATCTCGAGCGG
GCTGAGATTGCAGTGTCAGTGGGGGCAGTTACTATGCAGTTAGCCCAGTTGGCGGTACAGCCGACTTTGAGGCAGAGGATCATTGATGCTCAGAGTAACGATCCTTATCT
GGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAACGGCTGAGTTCTCGTTATCCTCTGATGGTGGACTGTTGTTTGAGAGACGCCTCTGTGTTCCGTCAGATAGTGCGG
TTAAGACTGAATTATTATCTGAGGCGCACAGTTCCCCATTTTCCATGCACCCAGGTAGCACGAAGATGTATCAGGACCTGAAGCGAGTTTATTGGTGGCGTAACATGAAG
AGGGAAGTAGCAGAATTTGTTAGTAAATGCTTTGTGTGCCAGCAGGTTAAGGCACCAAGGCAGAAACCAGCGGGTTTATTACAACCCTTGAGCATACCGGAATGA
mRNA sequence
ATGCTGGACTTTGATGTAATCCTGGGTATGGATTGGCTGGCCGCTAACCACGCTAGCATAGATTGTTCACGTAAGGAGGTAACGTTTAACCCTCCCTCGTTGGCCAGTTT
TAAATTTAAGGGAGGAGGGTCAAAGTCGTTGCCTCAGGTAATCTCAGCCATCAGGGCCAGTAAACTGCTCAGTCAGGGTACTTGGGGTATCTTAGCGAGTGTGGTGGATA
CTAGAGAGGCGGATGTATCCCTGTCATCAGAACCAGTAGTGAGGGACTATCCGGACGTTTTTCCTGAGGAACTTCCAGGGTTACCTCCGCACAGAGAGGTTGAGTTTGCC
ATAGAGTTAGAGCCGGGCACGGTTCCTATATCCAGAGCCCCTTACAGGATGGCCCCCGCAGAACTGAAAGAACTGAAGGTACAGTTACAGGAATTGCTTGACAAGGGATT
CATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAGTCTTATTCGTTAAGAAGAAGGATGGGTCGATGCGTCTATGCATTGACTATAGGGAGTTGAACAAAGTAACCGTAA
AGAACAGATATCCCTTGCCCAGGATTGACGATCTATTTGACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAG
GATGAGGATATACCGAAGACAGCATTTCGTTCCAGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAA
CAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATTGTGTTTATCGACGATATCTTGATATACTCCAAGACGGAGGCCGAACACGAGGAGCATTTACGTATGGTTTTGC
AAACACTTCGGGATAATAAGTTGTATGCAAACTTCTCGAAATGCGAGTTTTGGCTGAAGCAGGTGTCATTTCTGGGCCACGTGGTTTCCAAGGCGGGAGTCTCTGTGGAT
CCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGCTATTATCGACGGTTTGTGGAGAACTTTTC
TCGTATAGCTACTCCTCTTACTCAGTTGACCAGAAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTTCAGACCCTTAAACAGAAGTTAGTTACCGCAC
CGGTTCTTACGGTACCTGATGGTTCTGGCAATTTCGTGATTTATAGTGATGCTTCCAAGAAGGGTCTGGGTTGTGTTTTGATGCAGCAGGGTAAGGTGGTCGCTTATGCG
TCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATCTAGAGTTGGCAGCAGTGGTTTTTGCTTTGAAAATATGGAGGCATTATTTATATGGTGAAAAGAT
ACAGATATTCACAGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAAGAATTAAATATGAGACAGCGAAGGTGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATAC
TGTATCATCCAGGCAAGGCAAATGTGGTAGCCGATGCTCTTAGTAGGAAAGTGTCACATTCAGCAGCACTCATTACCCGGCAGGCCCCATTGCATCGGGATCTCGAGCGG
GCTGAGATTGCAGTGTCAGTGGGGGCAGTTACTATGCAGTTAGCCCAGTTGGCGGTACAGCCGACTTTGAGGCAGAGGATCATTGATGCTCAGAGTAACGATCCTTATCT
GGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAACGGCTGAGTTCTCGTTATCCTCTGATGGTGGACTGTTGTTTGAGAGACGCCTCTGTGTTCCGTCAGATAGTGCGG
TTAAGACTGAATTATTATCTGAGGCGCACAGTTCCCCATTTTCCATGCACCCAGGTAGCACGAAGATGTATCAGGACCTGAAGCGAGTTTATTGGTGGCGTAACATGAAG
AGGGAAGTAGCAGAATTTGTTAGTAAATGCTTTGTGTGCCAGCAGGTTAAGGCACCAAGGCAGAAACCAGCGGGTTTATTACAACCCTTGAGCATACCGGAATGA
Protein sequence
MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSLASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFA
IELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
DEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYANFSKCEFWLKQVSFLGHVVSKAGVSVD
PAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYA
SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER
AEIAVSVGAVTMQLAQLAVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMK
REVAEFVSKCFVCQQVKAPRQKPAGLLQPLSIPE