CuGenDBv2

Cmc08g0222311 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc08g0222311
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr08:10872023..10873984
RNA-Seq Expression: Cmc08g0222311
Synteny: Cmc08g0222311
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
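
The genome location field above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be split into its parts and the span length computed as follows. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """One genomic interval as shown in the 'Genome location' field."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'seqid:start..end' string into its three components."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return GenomeLocation(match["seqid"], start, end)


loc = parse_location("CMiso1.1chr08:10872023..10873984")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the interval for this gene spans 1962 bases.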


Homology
GenBank top hits (e value, % identity, alignment)
KAA0026020.1 pol protein [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 100)
Query:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD
        MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD

Query:  TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI
        TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI
Subjt:  TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI

Query:  DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID
        DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID
Subjt:  DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID

Query:  DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL
        DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL
Subjt:  DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL

Query:  TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY
        TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY
Subjt:  TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL
        GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL
Subjt:  GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL

Query:  RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT
        RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT
Subjt:  RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT

KAA0031931.1 pol protein [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 92.5)
Query:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD
        MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEV FNPPSMASFKFK  GSRSLPQVISA+RASKLLSQGTW ILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD

Query:  TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI
        T+EVDV LSSEPVVRDYPDVFPEELPGLPP RE+EFAIELEP  VPISRAPYRMA AELKELKVQLQELLDKGFIRPS+SPWGAP+LFVKKKDGSMRLCI
Subjt:  TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI

Query:  DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID
        DYRELNKVTVKNRYPL RIDDLFDQLQGATVFSKIDL+SGYHQLRIKDGDV KT FRSRY HYEFIVMSFGLTNAPAVFMDLMN+VFREFLDTF+I+FID
Subjt:  DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID

Query:  DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL
        DI IYSKTEAEH+EHLR+VLQTL+DNKLYAKFSKCEFWLKQV FLGHVVSKAGVS+DPAKIEAVT W RPSTVSEVRSFLGLAGYYRRFVENF RIA PL
Subjt:  DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL

Query:  TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY
        TQLTRKGAPF WSK C+DSFQNLKQKLVTALVLT+PDGSGSF+IY DASKKGLGCVLMQQGKVVAYASRQLKSHE+NYPTHDLELAAVVFALKIWRHYLY
Subjt:  TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL
        GEKIQIFTDHKSLK+FFTQKELNMRQR WLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEI VSVGAVT QLAQLTVQ TL
Subjt:  GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL

Query:  RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT
        RQ+IIDA+ ND YLVEKRGLAEAGQAVEFS+SSDGGLLFERRLCVPSDSAVKT
Subjt:  RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT

KAA0040871.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 91.88)
Query:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD
        MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEV FNP SMASFKFK EGSRSLPQVISA+RASKLLSQGTW ILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD

Query:  TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI
        T+EVDV LSSEPVVRDYPDVFPEELPGLPP RE+EFAIELEP  VPISRAPY+MA AELKELKVQLQELLDKGFIRPSVSPWGAP+LFVKKKDGSMRLCI
Subjt:  TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI

Query:  DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID
        DYRELNKVTVKN+YPL +IDDLFDQLQGATVFSKIDL+SGYHQLRIKDGDV KT FRSRY HYEFIVMSFGLTNAPAVFMDLMN+VFREFLDTF+I+FID
Subjt:  DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID

Query:  DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL
        DI IYSKTEAEH+EHLR+VLQTL+DNKLYAKFSKCEFWLKQV FLGHVVSKAGVS+DPAKIEAVT W RPSTVSEVRSFLGLAGYYRRFVENF RIATPL
Subjt:  DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL

Query:  TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY
        TQLTRKGAPF WSK C+DSFQNLKQKLVTALVLT+PDGSGSF+IY DAS KGLGCVLMQQGKVVA+ASRQLKSHE+NYPTHDLELAAVVFALKIWRHYLY
Subjt:  TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL
        GEKIQIFTDHKSLK+FFTQKE NMRQR WLELVKDYDCEILYHP KANVVADALSRKVSHSAALITRQAPLHRDLERAEI VSVGAVTMQLAQLTVQ TL
Subjt:  GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL

Query:  RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT
        RQ+IIDA+ ND YLVEKRGLAEAGQAVEFS SSDGGLLFERRLCVPSDSAVKT
Subjt:  RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT

KAA0048687.1 pol protein [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 92.04)
Query:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD
        MLSKEKVK CQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAA+HASIDCSRKEV FNPPS ASFKFK  GSRSLPQVISA+RASKLLSQGTW ILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD

Query:  TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI
        T+E DV LSSEPVVRDYPDVFPEELPGLPP RE+EFAIELEP  VPISRAPYRMA AELKELKVQLQELLDKGFIRPSVSPWGAP+LFVKKKDGSMRLCI
Subjt:  TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI

Query:  DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID
        DYRELNKVTVKNRYPL RIDDLFDQLQGATVFSKIDL+SGYHQLRIKD DV KT FRSRY HYEFIVMSFGLTNAPAVFMDLMN+VFREFLDTF+I+FID
Subjt:  DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID

Query:  DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL
        DI IYSKTEAEH+EHLRMVLQTL+DNKLYAKFSKCEFWLKQV FLGHVVSKAGVS+DPAKIEAVT W RPSTVSEVRSFLGLAGYYRRFVENF RIATPL
Subjt:  DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL

Query:  TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY
        TQLTRKGAPF WSK C+DSFQNLKQKLVTA VLT+PDGSGSF+IY DASKKGLGCVLMQQGKVVAYASRQLKSHE+NYPTHDLELAAVVFALKIWRHYLY
Subjt:  TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL
        GEKIQIFTDHKSLK+FFTQKELNMRQR WLELVKDYDCEILYHP KANVVADALSRKVSHSAALITRQAPLHRDLERAEI VSVGAVTMQLAQLTVQ TL
Subjt:  GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL

Query:  RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT
        RQ+IIDA+SND YLVEKRGLAEAGQAVEFS+SSDGGLLFERRLCVPSDS VKT
Subjt:  RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT

TYK01613.1 pol protein [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 91.88)
Query:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD
        MLSKEKVKACQIEIAGHVIEVTL+VLDMLDFDVILGMDWLAANHASIDCSRKEV FNPPSMASFKFK  GS+SLPQVISA+RASKLLSQGTW ILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD

Query:  TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI
        T+E DV LSSEPVVRDYPDVFPEELPGLPP RE+EFAIELEP  VPISRAPYRMA AELKELKVQLQELLDKGFIRPSVSPWGAP+LFVKKKDGSMRLCI
Subjt:  TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI

Query:  DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID
        DYRELNKVTVKNRYPL RIDDLFDQLQGATVFSKIDL+SGYHQLRIKD DV KT FRSRY HYEFIVMSFGLTNAPAVFMDLMN+VFREFLDTF+I+FID
Subjt:  DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID

Query:  DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL
        DI IYSKTEAEH+EHLRMVLQTL+DNKLYAKFSKCEFWLKQV FLGHVVSKAGVS+DPAKIEAVT W RPSTVSEVRSFLGLAGYYRRFVENF RIATPL
Subjt:  DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL

Query:  TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY
        TQLTRKGAPF WSK C+DSFQ LKQKLVTA VLT+PDGSGSF+IY DASKKGLGCVLMQQGKVVAYASRQLKSHE+NYPTHDLELAAVVFALKIWRHYLY
Subjt:  TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL
        GEKIQIFTDHKSLK+FFTQKELNMRQR WLELVKDYDCEILYHP KANVVADALSRKVSHSAALITRQAPLHRDLERAEI VSVGAVTMQLAQLTVQ TL
Subjt:  GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL

Query:  RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT
        RQ+IIDA+SND YLVEKRGLAEAGQ  EFS+SSDGGLLFERRLCVPSDSAVKT
Subjt:  RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SIL8 Reverse transcriptase (e value: 0.0e+00; % identity: 100)
Query:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD
        MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD

Query:  TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI
        TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI
Subjt:  TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI

Query:  DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID
        DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID
Subjt:  DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID

Query:  DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL
        DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL
Subjt:  DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL

Query:  TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY
        TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY
Subjt:  TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL
        GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL
Subjt:  GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL

Query:  RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT
        RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT
Subjt:  RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT

A0A5A7SQU8 Reverse transcriptase (e value: 0.0e+00; % identity: 92.5)
Query:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD
        MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEV FNPPSMASFKFK  GSRSLPQVISA+RASKLLSQGTW ILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD

Query:  TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI
        T+EVDV LSSEPVVRDYPDVFPEELPGLPP RE+EFAIELEP  VPISRAPYRMA AELKELKVQLQELLDKGFIRPS+SPWGAP+LFVKKKDGSMRLCI
Subjt:  TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI

Query:  DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID
        DYRELNKVTVKNRYPL RIDDLFDQLQGATVFSKIDL+SGYHQLRIKDGDV KT FRSRY HYEFIVMSFGLTNAPAVFMDLMN+VFREFLDTF+I+FID
Subjt:  DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID

Query:  DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL
        DI IYSKTEAEH+EHLR+VLQTL+DNKLYAKFSKCEFWLKQV FLGHVVSKAGVS+DPAKIEAVT W RPSTVSEVRSFLGLAGYYRRFVENF RIA PL
Subjt:  DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL

Query:  TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY
        TQLTRKGAPF WSK C+DSFQNLKQKLVTALVLT+PDGSGSF+IY DASKKGLGCVLMQQGKVVAYASRQLKSHE+NYPTHDLELAAVVFALKIWRHYLY
Subjt:  TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL
        GEKIQIFTDHKSLK+FFTQKELNMRQR WLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEI VSVGAVT QLAQLTVQ TL
Subjt:  GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL

Query:  RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT
        RQ+IIDA+ ND YLVEKRGLAEAGQAVEFS+SSDGGLLFERRLCVPSDSAVKT
Subjt:  RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT

A0A5A7TGS7 Reverse transcriptase (e value: 0.0e+00; % identity: 91.88)
Query:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD
        MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEV FNP SMASFKFK EGSRSLPQVISA+RASKLLSQGTW ILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD

Query:  TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI
        T+EVDV LSSEPVVRDYPDVFPEELPGLPP RE+EFAIELEP  VPISRAPY+MA AELKELKVQLQELLDKGFIRPSVSPWGAP+LFVKKKDGSMRLCI
Subjt:  TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI

Query:  DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID
        DYRELNKVTVKN+YPL +IDDLFDQLQGATVFSKIDL+SGYHQLRIKDGDV KT FRSRY HYEFIVMSFGLTNAPAVFMDLMN+VFREFLDTF+I+FID
Subjt:  DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID

Query:  DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL
        DI IYSKTEAEH+EHLR+VLQTL+DNKLYAKFSKCEFWLKQV FLGHVVSKAGVS+DPAKIEAVT W RPSTVSEVRSFLGLAGYYRRFVENF RIATPL
Subjt:  DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL

Query:  TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY
        TQLTRKGAPF WSK C+DSFQNLKQKLVTALVLT+PDGSGSF+IY DAS KGLGCVLMQQGKVVA+ASRQLKSHE+NYPTHDLELAAVVFALKIWRHYLY
Subjt:  TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL
        GEKIQIFTDHKSLK+FFTQKE NMRQR WLELVKDYDCEILYHP KANVVADALSRKVSHSAALITRQAPLHRDLERAEI VSVGAVTMQLAQLTVQ TL
Subjt:  GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL

Query:  RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT
        RQ+IIDA+ ND YLVEKRGLAEAGQAVEFS SSDGGLLFERRLCVPSDSAVKT
Subjt:  RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT

A0A5A7U330 Reverse transcriptase (e value: 0.0e+00; % identity: 92.04)
Query:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD
        MLSKEKVK CQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAA+HASIDCSRKEV FNPPS ASFKFK  GSRSLPQVISA+RASKLLSQGTW ILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD

Query:  TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI
        T+E DV LSSEPVVRDYPDVFPEELPGLPP RE+EFAIELEP  VPISRAPYRMA AELKELKVQLQELLDKGFIRPSVSPWGAP+LFVKKKDGSMRLCI
Subjt:  TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI

Query:  DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID
        DYRELNKVTVKNRYPL RIDDLFDQLQGATVFSKIDL+SGYHQLRIKD DV KT FRSRY HYEFIVMSFGLTNAPAVFMDLMN+VFREFLDTF+I+FID
Subjt:  DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID

Query:  DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL
        DI IYSKTEAEH+EHLRMVLQTL+DNKLYAKFSKCEFWLKQV FLGHVVSKAGVS+DPAKIEAVT W RPSTVSEVRSFLGLAGYYRRFVENF RIATPL
Subjt:  DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL

Query:  TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY
        TQLTRKGAPF WSK C+DSFQNLKQKLVTA VLT+PDGSGSF+IY DASKKGLGCVLMQQGKVVAYASRQLKSHE+NYPTHDLELAAVVFALKIWRHYLY
Subjt:  TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL
        GEKIQIFTDHKSLK+FFTQKELNMRQR WLELVKDYDCEILYHP KANVVADALSRKVSHSAALITRQAPLHRDLERAEI VSVGAVTMQLAQLTVQ TL
Subjt:  GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL

Query:  RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT
        RQ+IIDA+SND YLVEKRGLAEAGQAVEFS+SSDGGLLFERRLCVPSDS VKT
Subjt:  RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT

A0A5D3BPI1 Reverse transcriptase (e value: 0.0e+00; % identity: 91.88)
Query:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD
        MLSKEKVKACQIEIAGHVIEVTL+VLDMLDFDVILGMDWLAANHASIDCSRKEV FNPPSMASFKFK  GS+SLPQVISA+RASKLLSQGTW ILASVVD
Subjt:  MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVD

Query:  TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI
        T+E DV LSSEPVVRDYPDVFPEELPGLPP RE+EFAIELEP  VPISRAPYRMA AELKELKVQLQELLDKGFIRPSVSPWGAP+LFVKKKDGSMRLCI
Subjt:  TKEVDVCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCI

Query:  DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID
        DYRELNKVTVKNRYPL RIDDLFDQLQGATVFSKIDL+SGYHQLRIKD DV KT FRSRY HYEFIVMSFGLTNAPAVFMDLMN+VFREFLDTF+I+FID
Subjt:  DYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFID

Query:  DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL
        DI IYSKTEAEH+EHLRMVLQTL+DNKLYAKFSKCEFWLKQV FLGHVVSKAGVS+DPAKIEAVT W RPSTVSEVRSFLGLAGYYRRFVENF RIATPL
Subjt:  DIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPL

Query:  TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY
        TQLTRKGAPF WSK C+DSFQ LKQKLVTA VLT+PDGSGSF+IY DASKKGLGCVLMQQGKVVAYASRQLKSHE+NYPTHDLELAAVVFALKIWRHYLY
Subjt:  TQLTRKGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLY

Query:  GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL
        GEKIQIFTDHKSLK+FFTQKELNMRQR WLELVKDYDCEILYHP KANVVADALSRKVSHSAALITRQAPLHRDLERAEI VSVGAVTMQLAQLTVQ TL
Subjt:  GEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTL

Query:  RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT
        RQ+IIDA+SND YLVEKRGLAEAGQ  EFS+SSDGGLLFERRLCVPSDSAVKT
Subjt:  RQKIIDAESNDSYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 1.5e-81; % identity: 36.89)
Query:  VVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFV-KKKDGS----MRLCIDYRELNK
        +++ Y D+   E   L    + +  I  + ++   S+  Y  A  +  E++ Q+Q++L++G IR S SP+ +PI  V KK+D S     R+ IDYR+LN+
Subjt:  VVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFV-KKKDGS----MRLCIDYRELNK

Query:  VTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFIDDIFIYSK
        +TV +R+P+  +D++  +L     F+ IDL  G+HQ+ +    V KT F +++ HYE++ M FGL NAPA F   MN + R  L+   ++++DDI ++S 
Subjt:  VTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFIDDIFIYSK

Query:  TEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKG
        +  EH + L +V + L    L  +  KCEF  ++  FLGHV++  G+  +P KIEA+  +P P+   E+++FLGL GYYR+F+ NF  IA P+T+  +K 
Subjt:  TEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKG

Query:  APFFWSKTCKDS-FQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLYGEKIQI
             +    DS F+ LK  +    +L +PD +  F +  DAS   LG VL Q G  ++Y SR L  HE NY T + EL A+V+A K +RHYL G   +I
Subjt:  APFFWSKTCKDS-FQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLYGEKIQI

Query:  FTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSR
         +DH+ L   +  K+ N +   W   + ++D +I Y   K N VADALSR
Subjt:  FTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSR

P0CT34 Transposon Tf2-1 polyprotein (e value: 3.2e-76; % identity: 31.97)
Query:  EELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLSRIDDL
        E+LP   P++ +EF +EL      +    Y +   +++ +  ++ + L  G IR S +    P++FV KK+G++R+ +DY+ LNK    N YPL  I+ L
Subjt:  EELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLSRIDDL

Query:  FDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFIDDIFIYSKTEAEHDEHLRMVLQT
          ++QG+T+F+K+DL+S YH +R++ GD  K  FR     +E++VM +G++ APA F   +N +  E  ++ ++ ++DDI I+SK+E+EH +H++ VLQ 
Subjt:  FDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFIDDIFIYSKTEAEHDEHLRMVLQT

Query:  LQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPFFWSKTCKDSFQN
        L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+    ++  PL  L +K   + W+ T   + +N
Subjt:  LQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPFFWSKTCKDSFQN

Query:  LKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGK-----VVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKH
        +KQ LV+  VL   D S   ++  DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L  
Subjt:  LKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGK-----VVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKH

Query:  FFTQKE--LNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTLRQKIIDAESNDSY
          T +    N R   W   ++D++ EI Y P  AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND+ 
Subjt:  FFTQKE--LNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTLRQKIIDAESNDSY

Query:  LVEKRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDS
        L+    L    + VE +I    GLL   +  + +P+D+
Subjt:  LVEKRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDS

P0CT41 Transposon Tf2-12 polyprotein (e value: 3.2e-76; % identity: 31.97)
Query:  EELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLSRIDDL
        E+LP   P++ +EF +EL      +    Y +   +++ +  ++ + L  G IR S +    P++FV KK+G++R+ +DY+ LNK    N YPL  I+ L
Subjt:  EELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLSRIDDL

Query:  FDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFIDDIFIYSKTEAEHDEHLRMVLQT
          ++QG+T+F+K+DL+S YH +R++ GD  K  FR     +E++VM +G++ APA F   +N +  E  ++ ++ ++DDI I+SK+E+EH +H++ VLQ 
Subjt:  FDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFIDDIFIYSKTEAEHDEHLRMVLQT

Query:  LQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPFFWSKTCKDSFQN
        L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+    ++  PL  L +K   + W+ T   + +N
Subjt:  LQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPFFWSKTCKDSFQN

Query:  LKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGK-----VVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKH
        +KQ LV+  VL   D S   ++  DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L  
Subjt:  LKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGK-----VVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKH

Query:  FFTQKE--LNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTLRQKIIDAESNDSY
          T +    N R   W   ++D++ EI Y P  AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND+ 
Subjt:  FFTQKE--LNMRQRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTLRQKIIDAESNDSY

Query:  LVEKRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDS
        L+    L    + VE +I    GLL   +  + +P+D+
Subjt:  LVEKRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDS

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e value: 2.0e-81; % identity: 37.65)
Query:  PISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSG
        PI    Y +A     E++ Q+QE+L++G IR S SP+ +P   V KK         R+ IDYR+LN++T+ +RYP+  +D++  +L     F+ IDL  G
Subjt:  PISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSG

Query:  YHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFIDDIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLK
        +HQ+ + +  + KT F ++  HYE++ M FGL NAPA F   MN + R  L+   ++++DDI I+S +  EH   +++V   L D  L  +  KCEF  K
Subjt:  YHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFIDDIFIYSKTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLK

Query:  QVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPFFWSK-TCKDSFQNLKQKLVTALVLTIPDGS
        +  FLGH+V+  G+  +P K++A+ S+P P+   E+R+FLGL GYYR+F+ N+  IA P+T   +K       K    ++F+ LK  ++   +L +PD  
Subjt:  QVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPFFWSK-TCKDSFQNLKQKLVTALVLTIPDGS

Query:  GSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCE
          F++  DAS   LG VL Q G  +++ SR L  HE NY   + EL A+V+A K +RHYL G +  I +DH+ L+     KE   +   W   + +Y  +
Subjt:  GSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCE

Query:  ILYHPSKANVVADALSR
        I Y   K N VADALSR
Subjt:  ILYHPSKANVVADALSR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus (e value: 4.1e-79; % identity: 32.51)
Query:  GHVIEVTLLVLDML-DFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVDTKEVDVCLSSEPVV
        G+  ++T  VL  L  FD I+G D L    A +D     +   P            S ++  +++A         GT  IL S++               
Subjt:  GHVIEVTLLVLDML-DFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVDTKEVDVCLSSEPVV

Query:  RDYPDVFPEELPGLPPLREIEFAIELE---PSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKK-----DGSMRLCIDYRELN
         ++P +F   L G+     +E A++ E    +  PI    Y        E++ Q+ ELL  G IRPS SP+ +PI  V KK     +   R+ +D++ LN
Subjt:  RDYPDVFPEELPGLPPLREIEFAIELE---PSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKK-----DGSMRLCIDYRELN

Query:  KVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFIDDIFIYS
         VT+ + YP+  I+     L  A  F+ +DL SG+HQ+ +K+ D+ KT F +    YEF+ + FGL NAPA+F  +++ + RE +     ++IDDI ++S
Subjt:  KVTVKNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFIDDIFIYS

Query:  KTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTR-
        +    H ++LR+VL +L    L     K  F   QV FLG++V+  G+  DP K+ A++  P P++V E++ FLG+  YYR+F++++ ++A PLT LTR 
Subjt:  KTEAEHDEHLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTR-

Query:  ----------KGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQ----QGKVVAYASRQLKSHEENYPTHDLELAAVVFA
                     P    +T   SF +LK  L ++ +L  P  +  F +  DAS   +G VL Q    + + +AY SR L   EENY T + E+ A++++
Subjt:  ----------KGAPFFWSKTCKDSFQNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQ----QGKVVAYASRQLKSHEENYPTHDLELAAVVFA

Query:  LKIWRHYLYGE-KIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSR
        L   R YLYG   I+++TDH+ L      +  N + + W   +++Y+CE++Y P K+NVVADALSR
Subjt:  LKIWRHYLYGE-KIQIFTDHKSLKHFFTQKELNMRQRGWLELVKDYDCEILYHPSKANVVADALSR

Arabidopsis top hits                                  e value   % identity
ATMG00860.1 DNA/RNA polymerases superfamily protein   5.4e-26   45.6
Query:  HLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLG--HVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPFFW
        HL MVLQ  + ++ YA   KC F   Q+ +LG  H++S  GVS DPAK+EA+  WP P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W
Subjt:  HLRMVLQTLQDNKLYAKFSKCEFWLKQVFFLG--HVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPFFW

Query:  SKTCKDSFQNLKQKLVTALVLTIPD
        ++    +F+ LK  + T  VL +PD
Subjt:  SKTCKDSFQNLKQKLVTALVLTIPD


Sequences
CDS sequence
ATGTTGTCGAAAGAAAAGGTGAAAGCATGCCAGATTGAGATAGCAGGCCATGTGATTGAAGTAACGTTGTTAGTCCTGGACATGCTCGACTTTGATGTAATTCTG
GGTATGGATTGGTTGGCCGCTAACCATGCCAGCATAGATTGTTCCCGTAAGGAGGTAGCGTTTAACCCTCCCTCGATGGCTAGTTTTAAATTTAAGGTAGAAGGG
TCAAGGTCGTTACCTCAGGTAATCTCAGCCATGAGGGCAAGCAAACTGCTCAGTCAGGGTACTTGGAGTATCTTAGCGAGCGTGGTGGATACTAAAGAGGTTGAT
GTATGCCTGTCATCAGAACCAGTGGTGAGGGACTATCCGGATGTCTTTCCTGAAGAACTTCCAGGGTTACCTCCTCTCAGAGAGATTGAGTTTGCCATAGAGTTG
GAACCGAGCATAGTTCCTATATCCAGAGCCCCGTACAGAATGGCCACAGCAGAGTTGAAAGAACTGAAAGTGCAGTTACAGGAATTGCTTGATAAGGGCTTCATT
CGACCGAGTGTGTCACCTTGGGGTGCACCAATTTTATTTGTTAAGAAGAAAGATGGATCAATGCGCCTATGCATTGACTATAGGGAGTTGAACAAGGTAACCGTT
AAGAACAGATATCCCTTGTCCAGGATCGATGACCTGTTTGACCAGTTACAGGGAGCTACAGTGTTCTCTAAGATTGATCTTCAGTCGGGATATCATCAGCTGAGG
ATTAAGGACGGTGATGTACTGAAGACGACCTTTCGTTCCAGATACGAACATTATGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATG
GACTTGATGAACAAAGTGTTTAGGGAGTTTCTAGACACTTTTATGATCATGTTTATTGATGACATCTTTATATATTCCAAGACGGAGGCCGAGCATGACGAACAT
TTACGTATGGTTCTGCAAACCCTTCAGGATAATAAATTATATGCAAAGTTCTCGAAATGCGAGTTTTGGCTGAAGCAAGTGTTCTTTCTAGGCCATGTGGTTTCT
AAGGCTGGAGTTTCTTTGGATCCAGCTAAGATAGAGGCAGTTACCAGCTGGCCCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGTTAT
TATCGACGGTTTGTGGAGAACTTTTTCCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTTTTTTTGGAGCAAGACATGTAAGGACAGTTTC
CAGAATCTTAAACAGAAGCTAGTTACTGCACTGGTTCTTACTATACCTGATGGTTCGGGCAGTTTTATGATTTATAGGGATGCTTCTAAGAAGGGTTTGGGTTGT
GTATTGATGCAGCAAGGTAAGGTAGTCGCTTATGCTTCTCGTCAGTTGAAGAGTCATGAGGAGAATTACCCTACACACGATTTAGAGTTGGCAGCAGTGGTTTTT
GCATTGAAGATATGGAGGCATTACTTGTATGGTGAAAAGATACAGATCTTCACGGATCATAAAAGCTTGAAACATTTCTTTACTCAGAAGGAATTGAATATGAGG
CAGCGAGGATGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCATCCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACAT
TCAGCAGCACTTATTACCCGACAGGCCCCATTGCATCGAGATCTTGAGAGGGCTGAGATTGTAGTGTCAGTAGGGGCAGTCACTATGCAATTAGCCCAGTTGACG
GTACAGCTGACTTTGAGGCAAAAGATCATTGATGCTGAGAGTAACGATTCTTATTTGGTTGAGAAGCGTGGCTTAGCAGAGGCAGGGCAAGCTGTTGAGTTCTCC
ATATCCTCTGATGGTGGACTTTTGTTTGAGAGGCGTCTCTGTGTGCCATCAGATAGTGCAGTTAAAACATAA
mRNA sequence
ATGTTGTCGAAAGAAAAGGTGAAAGCATGCCAGATTGAGATAGCAGGCCATGTGATTGAAGTAACGTTGTTAGTCCTGGACATGCTCGACTTTGATGTAATTCTG
GGTATGGATTGGTTGGCCGCTAACCATGCCAGCATAGATTGTTCCCGTAAGGAGGTAGCGTTTAACCCTCCCTCGATGGCTAGTTTTAAATTTAAGGTAGAAGGG
TCAAGGTCGTTACCTCAGGTAATCTCAGCCATGAGGGCAAGCAAACTGCTCAGTCAGGGTACTTGGAGTATCTTAGCGAGCGTGGTGGATACTAAAGAGGTTGAT
GTATGCCTGTCATCAGAACCAGTGGTGAGGGACTATCCGGATGTCTTTCCTGAAGAACTTCCAGGGTTACCTCCTCTCAGAGAGATTGAGTTTGCCATAGAGTTG
GAACCGAGCATAGTTCCTATATCCAGAGCCCCGTACAGAATGGCCACAGCAGAGTTGAAAGAACTGAAAGTGCAGTTACAGGAATTGCTTGATAAGGGCTTCATT
CGACCGAGTGTGTCACCTTGGGGTGCACCAATTTTATTTGTTAAGAAGAAAGATGGATCAATGCGCCTATGCATTGACTATAGGGAGTTGAACAAGGTAACCGTT
AAGAACAGATATCCCTTGTCCAGGATCGATGACCTGTTTGACCAGTTACAGGGAGCTACAGTGTTCTCTAAGATTGATCTTCAGTCGGGATATCATCAGCTGAGG
ATTAAGGACGGTGATGTACTGAAGACGACCTTTCGTTCCAGATACGAACATTATGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATG
GACTTGATGAACAAAGTGTTTAGGGAGTTTCTAGACACTTTTATGATCATGTTTATTGATGACATCTTTATATATTCCAAGACGGAGGCCGAGCATGACGAACAT
TTACGTATGGTTCTGCAAACCCTTCAGGATAATAAATTATATGCAAAGTTCTCGAAATGCGAGTTTTGGCTGAAGCAAGTGTTCTTTCTAGGCCATGTGGTTTCT
AAGGCTGGAGTTTCTTTGGATCCAGCTAAGATAGAGGCAGTTACCAGCTGGCCCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGTTAT
TATCGACGGTTTGTGGAGAACTTTTTCCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTTTTTTTGGAGCAAGACATGTAAGGACAGTTTC
CAGAATCTTAAACAGAAGCTAGTTACTGCACTGGTTCTTACTATACCTGATGGTTCGGGCAGTTTTATGATTTATAGGGATGCTTCTAAGAAGGGTTTGGGTTGT
GTATTGATGCAGCAAGGTAAGGTAGTCGCTTATGCTTCTCGTCAGTTGAAGAGTCATGAGGAGAATTACCCTACACACGATTTAGAGTTGGCAGCAGTGGTTTTT
GCATTGAAGATATGGAGGCATTACTTGTATGGTGAAAAGATACAGATCTTCACGGATCATAAAAGCTTGAAACATTTCTTTACTCAGAAGGAATTGAATATGAGG
CAGCGAGGATGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCATCCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACAT
TCAGCAGCACTTATTACCCGACAGGCCCCATTGCATCGAGATCTTGAGAGGGCTGAGATTGTAGTGTCAGTAGGGGCAGTCACTATGCAATTAGCCCAGTTGACG
GTACAGCTGACTTTGAGGCAAAAGATCATTGATGCTGAGAGTAACGATTCTTATTTGGTTGAGAAGCGTGGCTTAGCAGAGGCAGGGCAAGCTGTTGAGTTCTCC
ATATCCTCTGATGGTGGACTTTTGTTTGAGAGGCGTCTCTGTGTGCCATCAGATAGTGCAGTTAAAACATAA
Protein sequence
MLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASFKFKVEGSRSLPQVISAMRASKLLSQGTWSILASVVDTKEVD
VCLSSEPVVRDYPDVFPEELPGLPPLREIEFAIELEPSIVPISRAPYRMATAELKELKVQLQELLDKGFIRPSVSPWGAPILFVKKKDGSMRLCIDYRELNKVTV
KNRYPLSRIDDLFDQLQGATVFSKIDLQSGYHQLRIKDGDVLKTTFRSRYEHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFMIMFIDDIFIYSKTEAEHDEH
LRMVLQTLQDNKLYAKFSKCEFWLKQVFFLGHVVSKAGVSLDPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFFRIATPLTQLTRKGAPFFWSKTCKDSF
QNLKQKLVTALVLTIPDGSGSFMIYRDASKKGLGCVLMQQGKVVAYASRQLKSHEENYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKHFFTQKELNMR
QRGWLELVKDYDCEILYHPSKANVVADALSRKVSHSAALITRQAPLHRDLERAEIVVSVGAVTMQLAQLTVQLTLRQKIIDAESNDSYLVEKRGLAEAGQAVEFS
ISSDGGLLFERRLCVPSDSAVKT