; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc06g0175181 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc06g0175181
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr06:32067339..32069407
RNA-Seq ExpressionCmc06g0175181
SyntenyCmc06g0175181
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037244.1 reverse transcriptase [Cucumis melo var. makuwa]0.0e+0094.19Show/hide
Query:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
        MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN ASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD

Query:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI
        TRE DVSLS EPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI
Subjt:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI

Query:  DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
        DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        DILIYSKTE EHEEHLRMVLQTLRDNKLYAKF KCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
Subjt:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Query:  TQLTRKGAPFVGARHVR--------------TVSRPLNRSYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY
        TQLTRKGAPFV ++                  ++ P     FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY
Subjt:  TQLTRKGAPFVGARHVR--------------TVSRPLNRSYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL
        GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL
Subjt:  GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL

Query:  RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRK
        RQRIIDAQSNDPYLVEKRGLAE GQ  EFSLSSDGGLLFERRLCVPSD A+KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKR+
Subjt:  RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRK

KAA0037291.1 pol protein [Cucumis melo var. makuwa]0.0e+0094.05Show/hide
Query:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
        MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN ASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD

Query:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI
        TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI
Subjt:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI

Query:  DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
        DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKI+AVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
Subjt:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Query:  TQLTRKGAPFVGARHVR--------------TVSRPLNRSYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY
        TQLTRKG PFV ++                  ++ P     FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY
Subjt:  TQLTRKGAPFVGARHVR--------------TVSRPLNRSYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL
        GEKIQIFTDHKS KYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRK+SHSAALITRQAPLHRDLER EIAVSVGAVTMQLAQLTVQPTL
Subjt:  GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL

Query:  RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRK
        RQRIIDAQSNDPYLVEKRGLAEAGQ  EFSLSSDGGLLFERRLCVPS  AVKTELLSEAHSSPFSMHPG+TKMYQDLKRVYWWR+MKRK
Subjt:  RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRK

KAA0046094.1 gag protease polyprotein [Cucumis melo var. makuwa]0.0e+0094.19Show/hide
Query:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
        MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN ASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGIL SVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD

Query:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI
        TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI
Subjt:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI

Query:  DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
        DYRELNK TVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAF SRYGHYEFIVMSFGLTNAPAVFM+LMNRVFREFLDTFVIVFID
Subjt:  DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
Subjt:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Query:  TQLTRKGAPFVGARHVR--------------TVSRPLNRSYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY
        TQLTRKGAPFV ++                  ++ P     FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY
Subjt:  TQLTRKGAPFVGARHVR--------------TVSRPLNRSYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL
        GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER EIAVSVGAVTMQLAQLTVQPTL
Subjt:  GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL

Query:  RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRK
        RQRIIDAQSNDPYLVEKRGLAEAGQT EFSLSSDGGLLFERRLCVPSD AVKTELLSEAHSSPFSMHP STK+YQDLKRVYWWRNMKR+
Subjt:  RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRK

KAA0048687.1 pol protein [Cucumis melo var. makuwa]0.0e+0093.9Show/hide
Query:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
        MLSKEKVK CQIEIAGHVIEVTL+VLDMLDFDVILGMDWLAA+ ASIDCSRKEVTFNPPS ASFKFKGGGS+SLPQVISAIRASKLLSQGTWGILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD

Query:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI
        TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI
Subjt:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI

Query:  DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
        DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
Subjt:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Query:  TQLTRKGAPFVGARHVR--------------TVSRPLNRSYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY
        TQLTRKGAPFV ++                  ++ P     FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY
Subjt:  TQLTRKGAPFVGARHVR--------------TVSRPLNRSYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL
        GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL
Subjt:  GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL

Query:  RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRK
        RQRIIDAQSNDPYLVEKRGLAEAGQ  EFSLSSDGGLLFERRLCVPSD  VKTELLSEAHSSPFSMHPGSTKMY+D+KRVYWWRNMKR+
Subjt:  RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRK

TYK01613.1 pol protein [Cucumis melo var. makuwa]0.0e+0095.36Show/hide
Query:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
        MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN ASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD

Query:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI
        TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI
Subjt:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI

Query:  DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
        DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
Subjt:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Query:  TQLTRKGAPFVGARHVR--------------TVSRPLNRSYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY
        TQLTRKGAPFV ++                  ++ P     FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY
Subjt:  TQLTRKGAPFVGARHVR--------------TVSRPLNRSYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL
        GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL
Subjt:  GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL

Query:  RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRK
        RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSD AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKR+
Subjt:  RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRK

TrEMBL top hitse value%identityAlignment
A0A5A7T190 Reverse transcriptase0.0e+0094.19Show/hide
Query:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
        MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN ASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD

Query:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI
        TRE DVSLS EPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI
Subjt:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI

Query:  DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
        DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        DILIYSKTE EHEEHLRMVLQTLRDNKLYAKF KCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
Subjt:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Query:  TQLTRKGAPFVGARHVR--------------TVSRPLNRSYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY
        TQLTRKGAPFV ++                  ++ P     FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY
Subjt:  TQLTRKGAPFVGARHVR--------------TVSRPLNRSYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL
        GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL
Subjt:  GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL

Query:  RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRK
        RQRIIDAQSNDPYLVEKRGLAE GQ  EFSLSSDGGLLFERRLCVPSD A+KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKR+
Subjt:  RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRK

A0A5A7T6R9 Reverse transcriptase0.0e+0094.05Show/hide
Query:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
        MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN ASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD

Query:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI
        TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI
Subjt:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI

Query:  DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
        DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKI+AVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
Subjt:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Query:  TQLTRKGAPFVGARHVR--------------TVSRPLNRSYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY
        TQLTRKG PFV ++                  ++ P     FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY
Subjt:  TQLTRKGAPFVGARHVR--------------TVSRPLNRSYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL
        GEKIQIFTDHKS KYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRK+SHSAALITRQAPLHRDLER EIAVSVGAVTMQLAQLTVQPTL
Subjt:  GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL

Query:  RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRK
        RQRIIDAQSNDPYLVEKRGLAEAGQ  EFSLSSDGGLLFERRLCVPS  AVKTELLSEAHSSPFSMHPG+TKMYQDLKRVYWWR+MKRK
Subjt:  RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRK

A0A5A7TSQ8 Reverse transcriptase0.0e+0094.19Show/hide
Query:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
        MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN ASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGIL SVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD

Query:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI
        TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI
Subjt:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI

Query:  DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
        DYRELNK TVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAF SRYGHYEFIVMSFGLTNAPAVFM+LMNRVFREFLDTFVIVFID
Subjt:  DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
Subjt:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Query:  TQLTRKGAPFVGARHVR--------------TVSRPLNRSYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY
        TQLTRKGAPFV ++                  ++ P     FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY
Subjt:  TQLTRKGAPFVGARHVR--------------TVSRPLNRSYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL
        GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER EIAVSVGAVTMQLAQLTVQPTL
Subjt:  GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL

Query:  RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRK
        RQRIIDAQSNDPYLVEKRGLAEAGQT EFSLSSDGGLLFERRLCVPSD AVKTELLSEAHSSPFSMHP STK+YQDLKRVYWWRNMKR+
Subjt:  RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRK

A0A5A7U330 Reverse transcriptase0.0e+0093.9Show/hide
Query:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
        MLSKEKVK CQIEIAGHVIEVTL+VLDMLDFDVILGMDWLAA+ ASIDCSRKEVTFNPPS ASFKFKGGGS+SLPQVISAIRASKLLSQGTWGILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD

Query:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI
        TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI
Subjt:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI

Query:  DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
        DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
Subjt:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Query:  TQLTRKGAPFVGARHVR--------------TVSRPLNRSYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY
        TQLTRKGAPFV ++                  ++ P     FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY
Subjt:  TQLTRKGAPFVGARHVR--------------TVSRPLNRSYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL
        GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL
Subjt:  GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL

Query:  RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRK
        RQRIIDAQSNDPYLVEKRGLAEAGQ  EFSLSSDGGLLFERRLCVPSD  VKTELLSEAHSSPFSMHPGSTKMY+D+KRVYWWRNMKR+
Subjt:  RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRK

A0A5D3BPI1 Reverse transcriptase0.0e+0095.36Show/hide
Query:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
        MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAAN ASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVD

Query:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI
        TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI
Subjt:  TREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI

Query:  DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
        DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Subjt:  DYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID

Query:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
        DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL
Subjt:  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL

Query:  TQLTRKGAPFVGARHVR--------------TVSRPLNRSYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY
        TQLTRKGAPFV ++                  ++ P     FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY
Subjt:  TQLTRKGAPFVGARHVR--------------TVSRPLNRSYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL
        GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL
Subjt:  GEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL

Query:  RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRK
        RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSD AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKR+
Subjt:  RQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRK

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.69.1e-8238.69Show/hide
Query:  YRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRI
        Y    A  +E++ Q+Q++L++G IR S SP+ +P+  V KK+D S     R+ IDYR+LN++TV +R+P+P +D++  +L     F+ IDL  G+HQ+ +
Subjt:  YRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRI

Query:  KDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG
          E V KTAF +++GHYE++ M FGL NAPA F   MN + R  L+   +V++DDI+++S +  EH + L +V + L    L  +  KCEF  ++ +FLG
Subjt:  KDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG

Query:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKG-----------APFVGARHVRT----VSRPLNRSYFVIY
        HV++  G+  +P KIEA+  +  P+   E+++FLGL GYYR+F+ NF+ IA P+T+  +K            + F   +++ +    +  P     F + 
Subjt:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKG-----------APFVGARHVRT----VSRPLNRSYFVIY

Query:  SDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPG
        +DAS   LG VL Q G  ++Y SR L  HE NY T + EL A+V+A K +RHYL G   +I +DH+ L + +  K+ N +  RW   + ++D +I Y  G
Subjt:  SDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPG

Query:  KANVVADALSR
        K N VADALSR
Subjt:  KANVVADALSR

P20825 Retrovirus-related Pol polyprotein from transposon 2971.5e-8138.13Show/hide
Query:  PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSG
        PI    Y +A     E++ Q+QE+L++G IR S SP+ +P   V KK         R+ IDYR+LN++T+ +RYP+P +D++  +L     F+ IDL  G
Subjt:  PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSG

Query:  YHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLK
        +HQ+ + +E + KTAF ++ GHYE++ M FGL NAPA F   MN + R  L+   +V++DDI+I+S +  EH   +++V   L D  L  +  KCEF  K
Subjt:  YHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLK

Query:  QVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPF-------------VGARHVR--TVSRPLNR
        + +FLGH+V+  G+  +P K++A+  +  P+   E+R+FLGL GYYR+F+ N++ IA P+T   +K                 + A  +R   +  P   
Subjt:  QVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPF-------------VGARHVR--TVSRPLNR

Query:  SYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCE
          FV+ +DAS   LG VL Q G  +++ SR L  HE NY   + EL A+V+A K +RHYL G +  I +DH+ L++    KE   +  RW   + +Y  +
Subjt:  SYFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCE

Query:  ILYHPGKANVVADALSR
        I Y  GK N VADALSR
Subjt:  ILYHPGKANVVADALSR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.2e-7833.33Show/hide
Query:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR
        Y ++   +LP  P    +  V+  IE++PG       PY +     +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + 
Subjt:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR

Query:  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE
        +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++ +D  KTAF +  G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++  EH 
Subjt:  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE

Query:  EHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL-------TQLTRK
        +HL  VL+ L++  L  K  KC+F  ++  FLG+ +    ++    K  A+  +  P TV + + FLG+  YYRRF+ N S+IA P+       +Q T K
Subjt:  EHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL-------TQLTRK

Query:  GAPFVGARHVRTVSRPL-----NRSYFVIYSDASKKGLGCVLMQQGK------VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTD
            +        + P+     N++ + + +DASK G+G VL +         VV Y S+ L+S ++NYP  +LEL  ++ AL  +R+ L+G+   + TD
Subjt:  GAPFVGARHVRTVSRPL-----NRSYFVIYSDASKKGLGCVLMQQGK------VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTD

Query:  HKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQS
        H SL     + E   R +RWL+ +  YD  + Y  G  NVVADA+SR      A+ T      R ++         +  +  A L     L Q  +  + 
Subjt:  HKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQS

Query:  NDPYLVEKRGLAEAGQTAEFSLS-SDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMK
           +   ++ L E  +T   + S  D  + ++ RL VP         L   H+  F  H G T     +  +Y+W  ++
Subjt:  NDPYLVEKRGLAEAGQTAEFSLS-SDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMK

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus3.6e-7832.63Show/hide
Query:  GHVIEVTLIVLDML-DFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLS----QGTWGILASVVDTREADVSLSS
        G+  ++T  VL  L  FD I+G D L    A +D     +   P           G K      ++I  + LL+     GT  IL S++           
Subjt:  GHVIEVTLIVLDML-DFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLS----QGTWGILASVVDTREADVSLSS

Query:  EPVVRDYPDVFPEELPGLPPHREVEFAIELEPGT---VPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK-----DGSMRLCIDY
             ++P +F   L G+     VE A++ E  T    PI    Y        E++ Q+ ELL  G IRPS SP+ +P+  V KK     +   R+ +D+
Subjt:  EPVVRDYPDVFPEELPGLPPHREVEFAIELEPGT---VPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK-----DGSMRLCIDY

Query:  RELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDI
        + LN VT+ + YP+P I+     L  A  F+ +DL SG+HQ+ +K+ D+PKTAF +  G YEF+ + FGL NAPA+F  +++ + RE +     V+IDDI
Subjt:  RELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDI

Query:  LIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQ
        +++S+    H ++LR+VL +L    L     K  F   QV FLG++V+  G+  DP K+ A++    P++V E++ FLG+  YYR+F+++++++A PLT 
Subjt:  LIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQ

Query:  LTRKGAPFVGARHVRTVSRPLNRSY-------------------------FVIYSDASKKGLGCVLMQ----QGKVVAYASRQLKSHEQNYPTHDLELAA
        LTR     + +     V   L+ +                          F + +DAS   +G VL Q    + + +AY SR L   E+NY T + E+ A
Subjt:  LTRKGAPFVGARHVRTVSRPLNRSY-------------------------FVIYSDASKKGLGCVLMQ----QGKVVAYASRQLKSHEQNYPTHDLELAA

Query:  VVFALKIWRHYLYGE-KIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSR
        ++++L   R YLYG   I+++TDH+ L +    +  N + +RW   +++Y+CE++Y PGK+NVVADALSR
Subjt:  VVFALKIWRHYLYGE-KIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSR

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.6e-7833.33Show/hide
Query:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR
        Y ++   +LP  P    +  V+  IE++PG       PY +     +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + 
Subjt:  YPDVFPEELPGLPP---HREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR

Query:  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE
        +PLPRID+L  ++  A +F+ +DL SGYHQ+ ++ +D  KTAF +  G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++  EH 
Subjt:  YPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE

Query:  EHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL-------TQLTRK
        +HL  VL+ L++  L  K  KC+F  ++  FLG+ +    ++    K  A+  +  P TV + + FLG+  YYRRF+ N S+IA P+       +Q T K
Subjt:  EHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPL-------TQLTRK

Query:  GAPFVGARHVRTVSRPL-----NRSYFVIYSDASKKGLGCVLMQQGK------VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTD
            +        + P+     N++ + + +DASK G+G VL +         VV Y S+ L+S ++NYP  +LEL  ++ AL  +R+ L+G+   + TD
Subjt:  GAPFVGARHVRTVSRPL-----NRSYFVIYSDASKKGLGCVLMQQGK------VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTD

Query:  HKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQS
        H SL     + E   R +RWL+ +  YD  + Y  G  NVVADA+SR V       +R           +      AV + + +LT      Q  +  + 
Subjt:  HKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQS

Query:  NDPYLVEKRGLAEAGQTAEFSLS-SDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMK
           +   ++ L E  +T   + S  D  + ++ RL VP         L   H+  F  H G T     +  +Y+W  ++
Subjt:  NDPYLVEKRGLAEAGQTAEFSLS-SDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMK

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein2.2e-2250Show/hide
Query:  HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGA
        HL MVLQ    ++ YA   KC F   Q+++LG  H++S  GVS DPAK+EA+ GW  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +
Subjt:  HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGTCGAAGGAAAAGGTGAAAGCTTGCCAGATTGAGATAGCAGGCCATGTGATTGAGGTAACGCTGATAGTTCTGGATATGCTGGACTTTGATGTAATCCTGGGTAT
GGATTGGTTGGCCGCTAACGACGCCAGCATAGATTGTTCACGTAAGGAGGTAACGTTTAACCCTCCCTCGATGGCCAGTTTTAAATTTAAGGGAGGAGGGTCAAAGTCGT
TGCCCCAGGTAATCTCAGCCATCAGGGCCAGTAAACTGCTCAGTCAGGGTACTTGGGGTATCTTAGCGAGTGTGGTGGATACTAGAGAGGCGGATGTATCCCTGTCATCA
GAACCAGTAGTGAGGGACTATCCGGACGTTTTTCCTGAGGAACTTCCAGGGTTACCTCCGCACAGAGAGGTTGAGTTTGCCATAGAGTTGGAGCCGGGCACGGTTCCTAT
ATCCAGAGCCCCTTACAGAATGGCCCCCGCAGAACTGAAAGAACTGAAGGTACAGTTGCAGGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGGGTG
CGCCAGTTTTATTTGTTAAGAAGAAGGATGGGTCGATGCGTCTATGCATTGACTATAGGGAGTTGAATAAGGTAACCGTAAAGAACAGATATCCCTTGCCCAGGATTGAC
GATCTATTTGACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGGTACCATCAGCTGAGAATTAAGGATGAGGATGTACCGAAGACAGCATTTCG
TTCTAGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTATTTATGGACTTGATGAACAGAGTGTTTAGGGAATTTCTAGACACTT
TTGTGATTGTGTTTATCGACGATATCTTGATATACTCCAAGACGGAGGCCGAACACGAGGAGCATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAGTTATATGCA
AAGTTCTCGAAATGCGAGTTTTGGCTGAAGCAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGCTGGAGTCTCTGTAGATCCAGCTAAGATAGAGGCAGTCACTGGTTG
GACCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGCTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGA
CCAGAAAGGGGGCTCCTTTTGTTGGAGCAAGGCATGTGAGGACAGTTTCCAGACCCTTAAACAGAAGCTATTTCGTGATTTATAGTGATGCTTCCAAGAAGGGTCTGGGT
TGTGTTTTGATGCAGCAGGGTAAGGTGGTCGCTTATGCGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATCTAGAGTTGGCAGCAGTGGTTTTTGC
TTTGAAAATATGGAGGCATTATTTATATGGTGAAAAGATACAGATATTCACAGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAAGAATTGAATATGAGACAGCGAA
GGTGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGGAAAGTATCACATTCAGCAGCACTC
ATTACCCGGCAGGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGAGCAGTTACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTTGAG
GCAGAGGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAACGGCTGAGTTCTCGTTATCCTCTGATGGTGGACTGT
TGTTTGAGAGACGCCTCTGTGTTCCGTCAGATTGTGCGGTTAAGACAGAATTATTATCTGAGGCGCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGAAGATGTAT
CAGGACCTGAAGCGAGTTTATTGGTGGCGTAACATGAAGAGGAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGTCGAAGGAAAAGGTGAAAGCTTGCCAGATTGAGATAGCAGGCCATGTGATTGAGGTAACGCTGATAGTTCTGGATATGCTGGACTTTGATGTAATCCTGGGTAT
GGATTGGTTGGCCGCTAACGACGCCAGCATAGATTGTTCACGTAAGGAGGTAACGTTTAACCCTCCCTCGATGGCCAGTTTTAAATTTAAGGGAGGAGGGTCAAAGTCGT
TGCCCCAGGTAATCTCAGCCATCAGGGCCAGTAAACTGCTCAGTCAGGGTACTTGGGGTATCTTAGCGAGTGTGGTGGATACTAGAGAGGCGGATGTATCCCTGTCATCA
GAACCAGTAGTGAGGGACTATCCGGACGTTTTTCCTGAGGAACTTCCAGGGTTACCTCCGCACAGAGAGGTTGAGTTTGCCATAGAGTTGGAGCCGGGCACGGTTCCTAT
ATCCAGAGCCCCTTACAGAATGGCCCCCGCAGAACTGAAAGAACTGAAGGTACAGTTGCAGGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGGGTG
CGCCAGTTTTATTTGTTAAGAAGAAGGATGGGTCGATGCGTCTATGCATTGACTATAGGGAGTTGAATAAGGTAACCGTAAAGAACAGATATCCCTTGCCCAGGATTGAC
GATCTATTTGACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGGTACCATCAGCTGAGAATTAAGGATGAGGATGTACCGAAGACAGCATTTCG
TTCTAGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTATTTATGGACTTGATGAACAGAGTGTTTAGGGAATTTCTAGACACTT
TTGTGATTGTGTTTATCGACGATATCTTGATATACTCCAAGACGGAGGCCGAACACGAGGAGCATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAGTTATATGCA
AAGTTCTCGAAATGCGAGTTTTGGCTGAAGCAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGCTGGAGTCTCTGTAGATCCAGCTAAGATAGAGGCAGTCACTGGTTG
GACCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGCTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGA
CCAGAAAGGGGGCTCCTTTTGTTGGAGCAAGGCATGTGAGGACAGTTTCCAGACCCTTAAACAGAAGCTATTTCGTGATTTATAGTGATGCTTCCAAGAAGGGTCTGGGT
TGTGTTTTGATGCAGCAGGGTAAGGTGGTCGCTTATGCGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATCTAGAGTTGGCAGCAGTGGTTTTTGC
TTTGAAAATATGGAGGCATTATTTATATGGTGAAAAGATACAGATATTCACAGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAAGAATTGAATATGAGACAGCGAA
GGTGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGGAAAGTATCACATTCAGCAGCACTC
ATTACCCGGCAGGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGAGCAGTTACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTTGAG
GCAGAGGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAACGGCTGAGTTCTCGTTATCCTCTGATGGTGGACTGT
TGTTTGAGAGACGCCTCTGTGTTCCGTCAGATTGTGCGGTTAAGACAGAATTATTATCTGAGGCGCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGAAGATGTAT
CAGGACCTGAAGCGAGTTTATTGGTGGCGTAACATGAAGAGGAAGTAG
Protein sequenceShow/hide protein sequence
MLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANDASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLSS
EPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRID
DLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYA
KFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVGARHVRTVSRPLNRSYFVIYSDASKKGLG
CVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAAL
ITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDCAVKTELLSEAHSSPFSMHPGSTKMY
QDLKRVYWWRNMKRK