CuGenDBv2

Cmc08g0224321 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc08g0224321
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr08:14274189..14276240
RNA-Seq Expression: Cmc08g0224321
Synteny: Cmc08g0224321
Gene Ontology terms:
    GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
    GO:0006508 - proteolysis (biological process)
    GO:0015074 - DNA integration (biological process)
    GO:0016020 - membrane (cellular component)
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
    GO:0004190 - aspartic-type endopeptidase activity (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR001584 - Integrase, catalytic core
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
    IPR041373 - Reverse transcriptase, RNase H-like domain
    IPR041588 - Integrase zinc-binding domain
    IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
    IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits | e value | % identity
KAA0037901.1 pol protein [Cucumis melo var. makuwa] | 0.0e+00 | 89.18
Query:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW
        +SFGLTNAPAVFM+LM+RVFREFL TFVI+FIDDILIYSKTEAEH+EHLRMVLQTLRDN+LYAKFSKCEFWLKQ+SFLGHVVSKAGVS+DPAK EAVT W
Subjt:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
         RP+TVSEVRSFLGLAGYYRR VE FSRIATPLTQLTRKGA F+WSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRK-AHSAALITR
        SRQLKSHEQNYPTHDLELAAVVFA KIW HYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHP KANVVADALSRK +HSA LITR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRK-AHSAALITR

Query:  QAPLHRDLERSEIAVSVGAVTMQLAQLTC----------------------GLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMH
        QAPLHRDLER+EIAVSVGAVTMQLAQLT                       GLAEAGQAVEFS+SSDGGLLFERRLCVLSD AVKTELLSEAHSSPF MH
Subjt:  QAPLHRDLERSEIAVSVGAVTMQLAQLTC----------------------GLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMH

Query:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA
        PGSTKMYQDLK VYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPL + EWKW+NVSMDFITGLPRTLRGFTVIWVVVDRLTKS HFVPGKSTYTA
Subjt:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA

Query:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI
        +KWAQLYMSEIVRLHGVPVSIVSDRDA FTSKFW GLQTAMGTRLDFS AFHPQTDGQT  LNQVLEDML+ACALEFPGSWDSHLHLMEFA NNS+QATI
Subjt:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI

Query:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMRGVL
        GMAPFEALYGKCCRS+VC  EVGEQRLMGPELVQSTNEAIQKIRSRM T Q+RQKSYADVRRKDLEF+VGDKVFLKVA MRGVL
Subjt:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMRGVL

KAA0040380.1 pol protein [Cucumis melo var. makuwa] | 0.0e+00 | 100
Query:  MNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSWPRPSTVSEVRSF
        MNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSWPRPSTVSEVRSF
Subjt:  MNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSWPRPSTVSEVRSF

Query:  LGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYP
        LGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYP
Subjt:  LGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYP

Query:  THDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKAHSAALITRQAPLHRDLERSEI
        THDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKAHSAALITRQAPLHRDLERSEI
Subjt:  THDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKAHSAALITRQAPLHRDLERSEI

Query:  AVSVGAVTMQLAQLTCGLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMHPGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQ
        AVSVGAVTMQLAQLTCGLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMHPGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQ
Subjt:  AVSVGAVTMQLAQLTCGLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMHPGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQ

Query:  VKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTANKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWN
        VKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTANKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWN
Subjt:  VKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTANKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWN

Query:  GLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQS
        GLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQS
Subjt:  GLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQS

Query:  TNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMR
        TNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMR
Subjt:  TNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMR

KAA0048687.1 pol protein [Cucumis melo var. makuwa] | 0.0e+00 | 89.62
Query:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW
        MSFGLTNAPAVFM+LM+RVFREFL TFVI+FIDDILIYSKTEAEH+EHLRMVLQTLRDNKLYAKFSKCEFWLKQ+SFLGHVVSKAGVS+DPAK EAVT W
Subjt:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
         RPSTVSEVRSFLGLAGYYRRFVE FSRIATPLTQLTRKGA F+WSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRK-AHSAALITR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHP KANVVADALSRK +HSAALITR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRK-AHSAALITR

Query:  QAPLHRDLERSEIAVSVGAVTMQLAQLTC----------------------GLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMH
        QAPLHRDLER+EIAVSVGAVTMQLAQLT                       GLAEAGQAVEFS+SSDGGLLFERRLCV SD  VKTELLSEAHSSPF MH
Subjt:  QAPLHRDLERSEIAVSVGAVTMQLAQLTC----------------------GLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMH

Query:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA
        PGSTKMY+D+K VYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLS+ EWKW+NVSMDFITGLPRTLRGFTVIWVVVDRLTKS HFVPGKSTYTA
Subjt:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA

Query:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI
        +KWAQLYMSEIVRLHGVPVSIVSDRDA FTSKFW  LQTAMGTRLDFS AFHPQTDGQTE LNQVLE ML+ACALEFPGSWDSHLHLMEF YNNS+QATI
Subjt:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI

Query:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMRGVL
        GMAPFEALYGKCCRS VC  EVGEQRLMGPELVQSTNEAIQKIRSRM T Q+RQKSYADVRRKDLEF+VGDKVFLKVA MRGVL
Subjt:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMRGVL

KAA0057672.1 pol protein [Cucumis melo var. makuwa] | 0.0e+00 | 89.33
Query:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW
        MSFGLTNAPAVFM+LM+RVFREFL TFVI+FIDDILIYSKTEAEH+EHLRMVLQTLRDNKLYAKFSKCEFWLKQ+SFLGHVVSKAGVS+DPAK EAVT W
Subjt:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
         RPSTVSEVRSFLGLAGYYRRFVE FSR ATPLTQLTRKGA F+WSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRK-AHSAALITR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHP KANVVADALSRK +HSAALITR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRK-AHSAALITR

Query:  QAPLHRDLERSEIAVSVGAVTMQLAQLTC----------------------GLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMH
        QAPLHRDLER+EIAVSVGAVTMQLAQLT                       GLAEAGQAVEFS+SSDGGLLFERRLCV SD AVKTELL+EAHSSPF MH
Subjt:  QAPLHRDLERSEIAVSVGAVTMQLAQLTC----------------------GLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMH

Query:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA
        PGSTKMYQDLK +YWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLS+ EWKW+NVSMDFI GLPRTLRGFTVIWVVVDRLTKS HFVPGKSTYT 
Subjt:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA

Query:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI
        +KWAQLYMSEIVRLHGVPVSIVSDRDA FTSKFW GLQTAMGTRLDFS AFHPQTDGQTE LN+VLEDML+ACALEFPGSWDSHLHLMEFAYNNS+QATI
Subjt:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI

Query:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMRGVL
        GMAPFEALY KCCRS +C  EVGEQRLMGPELVQSTNEAIQKIRSRM T Q+RQKSYADVRRKDLEF+VGDKVFLKVA MRGV+
Subjt:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMRGVL

TYK04048.1 pol protein [Cucumis melo var. makuwa] | 0.0e+00 | 99.85
Query:  MNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSWPRPSTVSEVRSF
        MNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSWPRPSTVSEVRSF
Subjt:  MNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSWPRPSTVSEVRSF

Query:  LGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYP
        LGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYP
Subjt:  LGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYP

Query:  THDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKAHSAALITRQAPLHRDLERSEI
        THDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRK HSAALITRQAPLHRDLERSEI
Subjt:  THDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKAHSAALITRQAPLHRDLERSEI

Query:  AVSVGAVTMQLAQLTCGLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMHPGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQ
        AVSVGAVTMQLAQLTCGLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMHPGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQ
Subjt:  AVSVGAVTMQLAQLTCGLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMHPGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQ

Query:  VKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTANKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWN
        VKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTANKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWN
Subjt:  VKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTANKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWN

Query:  GLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQS
        GLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQS
Subjt:  GLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQS

Query:  TNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMR
        TNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMR
Subjt:  TNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMR

TrEMBL top hits | e value | % identity
A0A5A7T8G8 Reverse transcriptase | 0.0e+00 | 89.18
Query:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW
        +SFGLTNAPAVFM+LM+RVFREFL TFVI+FIDDILIYSKTEAEH+EHLRMVLQTLRDN+LYAKFSKCEFWLKQ+SFLGHVVSKAGVS+DPAK EAVT W
Subjt:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
         RP+TVSEVRSFLGLAGYYRR VE FSRIATPLTQLTRKGA F+WSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRK-AHSAALITR
        SRQLKSHEQNYPTHDLELAAVVFA KIW HYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHP KANVVADALSRK +HSA LITR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRK-AHSAALITR

Query:  QAPLHRDLERSEIAVSVGAVTMQLAQLTC----------------------GLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMH
        QAPLHRDLER+EIAVSVGAVTMQLAQLT                       GLAEAGQAVEFS+SSDGGLLFERRLCVLSD AVKTELLSEAHSSPF MH
Subjt:  QAPLHRDLERSEIAVSVGAVTMQLAQLTC----------------------GLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMH

Query:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA
        PGSTKMYQDLK VYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPL + EWKW+NVSMDFITGLPRTLRGFTVIWVVVDRLTKS HFVPGKSTYTA
Subjt:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA

Query:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI
        +KWAQLYMSEIVRLHGVPVSIVSDRDA FTSKFW GLQTAMGTRLDFS AFHPQTDGQT  LNQVLEDML+ACALEFPGSWDSHLHLMEFA NNS+QATI
Subjt:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI

Query:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMRGVL
        GMAPFEALYGKCCRS+VC  EVGEQRLMGPELVQSTNEAIQKIRSRM T Q+RQKSYADVRRKDLEF+VGDKVFLKVA MRGVL
Subjt:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMRGVL

A0A5A7TAN6 Pol protein | 0.0e+00 | 100
Query:  MNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSWPRPSTVSEVRSF
        MNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSWPRPSTVSEVRSF
Subjt:  MNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSWPRPSTVSEVRSF

Query:  LGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYP
        LGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYP
Subjt:  LGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYP

Query:  THDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKAHSAALITRQAPLHRDLERSEI
        THDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKAHSAALITRQAPLHRDLERSEI
Subjt:  THDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKAHSAALITRQAPLHRDLERSEI

Query:  AVSVGAVTMQLAQLTCGLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMHPGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQ
        AVSVGAVTMQLAQLTCGLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMHPGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQ
Subjt:  AVSVGAVTMQLAQLTCGLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMHPGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQ

Query:  VKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTANKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWN
        VKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTANKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWN
Subjt:  VKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTANKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWN

Query:  GLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQS
        GLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQS
Subjt:  GLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQS

Query:  TNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMR
        TNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMR
Subjt:  TNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMR

A0A5A7U330 Reverse transcriptase | 0.0e+00 | 89.62
Query:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW
        MSFGLTNAPAVFM+LM+RVFREFL TFVI+FIDDILIYSKTEAEH+EHLRMVLQTLRDNKLYAKFSKCEFWLKQ+SFLGHVVSKAGVS+DPAK EAVT W
Subjt:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
         RPSTVSEVRSFLGLAGYYRRFVE FSRIATPLTQLTRKGA F+WSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRK-AHSAALITR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHP KANVVADALSRK +HSAALITR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRK-AHSAALITR

Query:  QAPLHRDLERSEIAVSVGAVTMQLAQLTC----------------------GLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMH
        QAPLHRDLER+EIAVSVGAVTMQLAQLT                       GLAEAGQAVEFS+SSDGGLLFERRLCV SD  VKTELLSEAHSSPF MH
Subjt:  QAPLHRDLERSEIAVSVGAVTMQLAQLTC----------------------GLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMH

Query:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA
        PGSTKMY+D+K VYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLS+ EWKW+NVSMDFITGLPRTLRGFTVIWVVVDRLTKS HFVPGKSTYTA
Subjt:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA

Query:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI
        +KWAQLYMSEIVRLHGVPVSIVSDRDA FTSKFW  LQTAMGTRLDFS AFHPQTDGQTE LNQVLE ML+ACALEFPGSWDSHLHLMEF YNNS+QATI
Subjt:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI

Query:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMRGVL
        GMAPFEALYGKCCRS VC  EVGEQRLMGPELVQSTNEAIQKIRSRM T Q+RQKSYADVRRKDLEF+VGDKVFLKVA MRGVL
Subjt:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMRGVL

A0A5A7UP94 Pol protein | 0.0e+00 | 89.33
Query:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW
        MSFGLTNAPAVFM+LM+RVFREFL TFVI+FIDDILIYSKTEAEH+EHLRMVLQTLRDNKLYAKFSKCEFWLKQ+SFLGHVVSKAGVS+DPAK EAVT W
Subjt:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
         RPSTVSEVRSFLGLAGYYRRFVE FSR ATPLTQLTRKGA F+WSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRK-AHSAALITR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHP KANVVADALSRK +HSAALITR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRK-AHSAALITR

Query:  QAPLHRDLERSEIAVSVGAVTMQLAQLTC----------------------GLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMH
        QAPLHRDLER+EIAVSVGAVTMQLAQLT                       GLAEAGQAVEFS+SSDGGLLFERRLCV SD AVKTELL+EAHSSPF MH
Subjt:  QAPLHRDLERSEIAVSVGAVTMQLAQLTC----------------------GLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMH

Query:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA
        PGSTKMYQDLK +YWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLS+ EWKW+NVSMDFI GLPRTLRGFTVIWVVVDRLTKS HFVPGKSTYT 
Subjt:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA

Query:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI
        +KWAQLYMSEIVRLHGVPVSIVSDRDA FTSKFW GLQTAMGTRLDFS AFHPQTDGQTE LN+VLEDML+ACALEFPGSWDSHLHLMEFAYNNS+QATI
Subjt:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI

Query:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMRGVL
        GMAPFEALY KCCRS +C  EVGEQRLMGPELVQSTNEAIQKIRSRM T Q+RQKSYADVRRKDLEF+VGDKVFLKVA MRGV+
Subjt:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMRGVL

A0A5D3BYE8 Pol protein | 0.0e+00 | 99.85
Query:  MNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSWPRPSTVSEVRSF
        MNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSWPRPSTVSEVRSF
Subjt:  MNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSWPRPSTVSEVRSF

Query:  LGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYP
        LGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYP
Subjt:  LGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYP

Query:  THDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKAHSAALITRQAPLHRDLERSEI
        THDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRK HSAALITRQAPLHRDLERSEI
Subjt:  THDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKAHSAALITRQAPLHRDLERSEI

Query:  AVSVGAVTMQLAQLTCGLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMHPGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQ
        AVSVGAVTMQLAQLTCGLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMHPGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQ
Subjt:  AVSVGAVTMQLAQLTCGLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMHPGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQ

Query:  VKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTANKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWN
        VKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTANKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWN
Subjt:  VKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTANKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWN

Query:  GLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQS
        GLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQS
Subjt:  GLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQS

Query:  TNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMR
        TNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMR
Subjt:  TNEAIQKIRSRMQTVQNRQKSYADVRRKDLEFDVGDKVFLKVASMR

SwissProt top hits | e value | % identity
P0CT34 Transposon Tf2-1 polyprotein | 4.2e-92 | 31.17
Query:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW
        M +G++ APA F   ++ +  E   + V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   Q+ F+G+ +S+ G +      + V  W
Subjt:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----
         +P    E+R FLG   Y R+F+ K S++  PL  L +K   + W+     + +N+KQ LV+ PVL   D S   ++ +DAS   +G VL Q+       
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----

Query:  VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKA
         V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y P  AN +ADALSR  
Subjt:  VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKA

Query:  HSAALITRQAPLHRDLERSEI-AVSVGAVTMQL-----------AQLTCGLAEAGQAVEFSISSDGGLLFERRLCVL--SDGAVKTELLSEAHSSPFFMH
            ++    P+ +D E + I  V+  ++T               +L   L    + VE +I    GLL   +  +L  +D  +   ++ + H     +H
Subjt:  HSAALITRQAPLHRDLERSEI-AVSVGAVTMQL-----------AQLTCGLAEAGQAVEFSISSDGGLLFERRLCVL--SDGAVKTELLSEAHSSPFFMH

Query:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA
        PG   +   +   + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  W+++SMDFIT LP +  G+  ++VVVDR +K    VP   + TA
Subjt:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA

Query:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI
         + A+++   ++   G P  I++D D  FTS+ W          + FS+ + PQTDGQTE  NQ +E +L+      P +W  H+ L++ +YNN+  +  
Subjt:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI

Query:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDL-EFDVGDKVFLK
         M PFE ++       +   E+        E  Q T +  Q ++  + T   + K Y D++ +++ EF  GD V +K
Subjt:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDL-EFDVGDKVFLK

P0CT35 Transposon Tf2-2 polyprotein | 4.2e-92 | 31.17
Query:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW
        M +G++ APA F   ++ +  E   + V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   Q+ F+G+ +S+ G +      + V  W
Subjt:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----
         +P    E+R FLG   Y R+F+ K S++  PL  L +K   + W+     + +N+KQ LV+ PVL   D S   ++ +DAS   +G VL Q+       
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----

Query:  VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKA
         V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y P  AN +ADALSR  
Subjt:  VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKA

Query:  HSAALITRQAPLHRDLERSEI-AVSVGAVTMQL-----------AQLTCGLAEAGQAVEFSISSDGGLLFERRLCVL--SDGAVKTELLSEAHSSPFFMH
            ++    P+ +D E + I  V+  ++T               +L   L    + VE +I    GLL   +  +L  +D  +   ++ + H     +H
Subjt:  HSAALITRQAPLHRDLERSEI-AVSVGAVTMQL-----------AQLTCGLAEAGQAVEFSISSDGGLLFERRLCVL--SDGAVKTELLSEAHSSPFFMH

Query:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA
        PG   +   +   + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  W+++SMDFIT LP +  G+  ++VVVDR +K    VP   + TA
Subjt:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA

Query:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI
         + A+++   ++   G P  I++D D  FTS+ W          + FS+ + PQTDGQTE  NQ +E +L+      P +W  H+ L++ +YNN+  +  
Subjt:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI

Query:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDL-EFDVGDKVFLK
         M PFE ++       +   E+        E  Q T +  Q ++  + T   + K Y D++ +++ EF  GD V +K
Subjt:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDL-EFDVGDKVFLK

P0CT36 Transposon Tf2-3 polyprotein | 4.2e-92 | 31.17
Query:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW
        M +G++ APA F   ++ +  E   + V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   Q+ F+G+ +S+ G +      + V  W
Subjt:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----
         +P    E+R FLG   Y R+F+ K S++  PL  L +K   + W+     + +N+KQ LV+ PVL   D S   ++ +DAS   +G VL Q+       
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----

Query:  VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKA
         V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y P  AN +ADALSR  
Subjt:  VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKA

Query:  HSAALITRQAPLHRDLERSEI-AVSVGAVTMQL-----------AQLTCGLAEAGQAVEFSISSDGGLLFERRLCVL--SDGAVKTELLSEAHSSPFFMH
            ++    P+ +D E + I  V+  ++T               +L   L    + VE +I    GLL   +  +L  +D  +   ++ + H     +H
Subjt:  HSAALITRQAPLHRDLERSEI-AVSVGAVTMQL-----------AQLTCGLAEAGQAVEFSISSDGGLLFERRLCVL--SDGAVKTELLSEAHSSPFFMH

Query:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA
        PG   +   +   + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  W+++SMDFIT LP +  G+  ++VVVDR +K    VP   + TA
Subjt:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA

Query:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI
         + A+++   ++   G P  I++D D  FTS+ W          + FS+ + PQTDGQTE  NQ +E +L+      P +W  H+ L++ +YNN+  +  
Subjt:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI

Query:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDL-EFDVGDKVFLK
         M PFE ++       +   E+        E  Q T +  Q ++  + T   + K Y D++ +++ EF  GD V +K
Subjt:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDL-EFDVGDKVFLK

P0CT37 Transposon Tf2-4 polyprotein | 4.2e-92 | 31.17
Query:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW
        M +G++ APA F   ++ +  E   + V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   Q+ F+G+ +S+ G +      + V  W
Subjt:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----
         +P    E+R FLG   Y R+F+ K S++  PL  L +K   + W+     + +N+KQ LV+ PVL   D S   ++ +DAS   +G VL Q+       
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----

Query:  VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKA
         V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y P  AN +ADALSR  
Subjt:  VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKA

Query:  HSAALITRQAPLHRDLERSEI-AVSVGAVTMQL-----------AQLTCGLAEAGQAVEFSISSDGGLLFERRLCVL--SDGAVKTELLSEAHSSPFFMH
            ++    P+ +D E + I  V+  ++T               +L   L    + VE +I    GLL   +  +L  +D  +   ++ + H     +H
Subjt:  HSAALITRQAPLHRDLERSEI-AVSVGAVTMQL-----------AQLTCGLAEAGQAVEFSISSDGGLLFERRLCVL--SDGAVKTELLSEAHSSPFFMH

Query:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA
        PG   +   +   + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  W+++SMDFIT LP +  G+  ++VVVDR +K    VP   + TA
Subjt:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA

Query:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI
         + A+++   ++   G P  I++D D  FTS+ W          + FS+ + PQTDGQTE  NQ +E +L+      P +W  H+ L++ +YNN+  +  
Subjt:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI

Query:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDL-EFDVGDKVFLK
         M PFE ++       +   E+        E  Q T +  Q ++  + T   + K Y D++ +++ EF  GD V +K
Subjt:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDL-EFDVGDKVFLK

P0CT41 Transposon Tf2-12 polyprotein | 4.2e-92 | 31.17
Query:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW
        M +G++ APA F   ++ +  E   + V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   Q+ F+G+ +S+ G +      + V  W
Subjt:  MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSW

Query:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----
         +P    E+R FLG   Y R+F+ K S++  PL  L +K   + W+     + +N+KQ LV+ PVL   D S   ++ +DAS   +G VL Q+       
Subjt:  PRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----

Query:  VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKA
         V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y P  AN +ADALSR  
Subjt:  VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKA

Query:  HSAALITRQAPLHRDLERSEI-AVSVGAVTMQL-----------AQLTCGLAEAGQAVEFSISSDGGLLFERRLCVL--SDGAVKTELLSEAHSSPFFMH
            ++    P+ +D E + I  V+  ++T               +L   L    + VE +I    GLL   +  +L  +D  +   ++ + H     +H
Subjt:  HSAALITRQAPLHRDLERSEI-AVSVGAVTMQL-----------AQLTCGLAEAGQAVEFSISSDGGLLFERRLCVL--SDGAVKTELLSEAHSSPFFMH

Query:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA
        PG   +   +   + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  W+++SMDFIT LP +  G+  ++VVVDR +K    VP   + TA
Subjt:  PGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTA

Query:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI
         + A+++   ++   G P  I++D D  FTS+ W          + FS+ + PQTDGQTE  NQ +E +L+      P +W  H+ L++ +YNN+  +  
Subjt:  NKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFSMAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATI

Query:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDL-EFDVGDKVFLK
         M PFE ++       +   E+        E  Q T +  Q ++  + T   + K Y D++ +++ EF  GD V +K
Subjt:  GMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNRQKSYADVRRKDL-EFDVGDKVFLK

Arabidopsis top hits | e value | %identity | Alignment
ATMG00860.1 DNA/RNA polymerases superfamily protein | 1.9e-26 | 45.8
Query:  HLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLG--HVVSKAGVSLDPAKKEAVTSWPRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIW
        HL MVLQ    ++ YA   KC F   Q+++LG  H++S  GVS DPAK EA+  WP P   +E+R FLGL GYYRRFV+ + +I  PLT+L +K  S  W
Subjt:  HLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLG--HVVSKAGVSLDPAKKEAVTSWPRPSTVSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIW

Query:  SKACEDSFQNLKQKLVTAPVLTVPDGSGSFV
        ++    +F+ LK  + T PVL +PD    FV
Subjt:  SKACEDSFQNLKQKLVTAPVLTVPDGSGSFV
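
The alignments above use the usual BLAST-style midline: a letter marks an identical residue, `+` a conservative substitution, and a blank a mismatch or gap. As a rough sketch of how an identity percentage could be recomputed from one such block, the helper below counts those symbols. The function name `midline_stats` and the toy strings are hypothetical, and the result need not reproduce the %identity column, which the database computes over the full alignment.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one alignment block.

    query   -- the aligned query segment (gaps written as '-')
    midline -- the line between Query and Subjt; trailing blanks may be missing
    """
    # Pad the midline so that stripped trailing spaces still count as mismatches.
    midline = midline.ljust(len(query))[: len(query)]
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    length = len(query)
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": 100.0 * identities / length if length else 0.0,
    }


# Toy example (not taken from the record above).
stats = midline_stats("MSFGLTNA", "M +G++ A")
print(stats)
```

For a multi-block alignment, the per-block counts would be summed before dividing by the total alignment length.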


Sequences
CDS sequence
ATGTCTTTTGGGTTGACGAATGCTCCGGCAGTGTTTATGAACTTGATGAGCAGAGTGTTTAGGGAGTTTCTATACACTTTCGTGATCCTGTTTATTGATGACATC
TTGATATATTCCAAGACGGAGGCCGAGCATGATGAACATTTACGTATGGTTCTACAAACCCTTCGGGATAATAAATTGTATGCAAAGTTCTCGAAATGCGAGTTT
TGGCTGAAGCAGATGTCCTTTCTAGGACATGTGGTTTCTAAGGCTGGAGTTTCTTTGGATCCAGCTAAGAAAGAGGCAGTCACCAGTTGGCCCCGACCTTCCACA
GTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGTTATTATCGGCGGTTTGTGGAGAAATTTTCCCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGA
GCTTCTTTTATTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACTGCACCGGTTCTTACTGTACCTGATGGTTCAGGCAGTTTT
GTGATTTACAGTGATGCTTCTAAGAAGGGTTTGGGTTGTGTATTGATGCAGCAAGGTAAGGTAGTCGCTTATGCTTCTCGTCAGTTGAAGAGTCATGAGCAGAAT
TACCCTACACACGATTTAGAGTTGGCAGCAGTGGTTTTTGCATTGAAGATATGGAGGCATTACTTGTATGGTGAAAAGATACAAATCTTCACAGATCATAAGAGC
TTGAAATATTTCTTTACTCAGAAGGAATTGAATATGAGACAGCGAAGATGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATATTGTATCATCCGGTCAAGGCA
AATGTGGTAGCTGATGCTCTTAGTAGGAAGGCACATTCAGCAGCACTTATTACCCGACAGGCCCCATTGCATCGAGATCTTGAGAGGTCTGAGATTGCAGTGTCA
GTGGGGGCAGTCACTATGCAGTTAGCGCAGTTGACGTGTGGCCTAGCAGAGGCAGGGCAAGCTGTTGAGTTCTCCATATCCTCTGATGGTGGACTTTTGTTTGAG
AGGCGTCTCTGTGTGCTATCAGATGGTGCGGTTAAAACAGAATTATTATCTGAGGCTCACAGTTCCCCATTCTTCATGCACCCGGGTAGTACGAAGATGTATCAG
GACCTGAAATGGGTTTATTGGTGGCGTAATATGAAGAGGGAGGTGGCAGAATTTGTTAGTAAGTGCTTGGTGTGTCAGCAGGTTAAGGCTCCAAGGCAGAAACCA
GCGGGTTTATTACAACCCTTGAGCGTACTGGAATGGAAGTGGAAAAACGTGTCGATGGATTTCATTACAGGACTGCCAAGAACTTTGAGGGGTTTTACAGTGATT
TGGGTTGTAGTTGACAGGCTTACCAAATCAACGCACTTCGTTCCGGGTAAATCCACCTATACCGCTAATAAGTGGGCACAGTTGTACATGTCTGAGATAGTAAGA
TTACATGGAGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCCATTTCACTTCCAAGTTTTGGAATGGTTTGCAGACTGCTATGGGCACGAGGTTAGACTTTAGT
ATGGCTTTCCATCCACAGACTGACGGTCAGACTGAGCATCTGAACCAAGTTTTAGAAGATATGTTGCAAGCGTGTGCATTGGAATTTCCAGGTAGCTGGGACTCC
CACTTACATTTGATGGAATTTGCTTATAATAACAGTTTTCAGGCTACTATTGGCATGGCACCATTTGAGGCCTTGTACGGCAAATGCTGTAGATCCGTCGTTTGT
AGGAGTGAGGTGGGTGAGCAGAGATTGATGGGACCTGAGTTAGTTCAGTCTACTAACGAAGCGATACAGAAAATTAGATCTCGTATGCAGACCGTACAGAATAGG
CAGAAGAGCTATGCTGATGTGAGACGGAAGGATCTTGAGTTTGATGTGGGGGACAAGGTGTTCTTGAAGGTAGCATCTATGAGAGGTGTCTTATGA
mRNA sequence
ATGTCTTTTGGGTTGACGAATGCTCCGGCAGTGTTTATGAACTTGATGAGCAGAGTGTTTAGGGAGTTTCTATACACTTTCGTGATCCTGTTTATTGATGACATC
TTGATATATTCCAAGACGGAGGCCGAGCATGATGAACATTTACGTATGGTTCTACAAACCCTTCGGGATAATAAATTGTATGCAAAGTTCTCGAAATGCGAGTTT
TGGCTGAAGCAGATGTCCTTTCTAGGACATGTGGTTTCTAAGGCTGGAGTTTCTTTGGATCCAGCTAAGAAAGAGGCAGTCACCAGTTGGCCCCGACCTTCCACA
GTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGTTATTATCGGCGGTTTGTGGAGAAATTTTCCCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGA
GCTTCTTTTATTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACTGCACCGGTTCTTACTGTACCTGATGGTTCAGGCAGTTTT
GTGATTTACAGTGATGCTTCTAAGAAGGGTTTGGGTTGTGTATTGATGCAGCAAGGTAAGGTAGTCGCTTATGCTTCTCGTCAGTTGAAGAGTCATGAGCAGAAT
TACCCTACACACGATTTAGAGTTGGCAGCAGTGGTTTTTGCATTGAAGATATGGAGGCATTACTTGTATGGTGAAAAGATACAAATCTTCACAGATCATAAGAGC
TTGAAATATTTCTTTACTCAGAAGGAATTGAATATGAGACAGCGAAGATGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATATTGTATCATCCGGTCAAGGCA
AATGTGGTAGCTGATGCTCTTAGTAGGAAGGCACATTCAGCAGCACTTATTACCCGACAGGCCCCATTGCATCGAGATCTTGAGAGGTCTGAGATTGCAGTGTCA
GTGGGGGCAGTCACTATGCAGTTAGCGCAGTTGACGTGTGGCCTAGCAGAGGCAGGGCAAGCTGTTGAGTTCTCCATATCCTCTGATGGTGGACTTTTGTTTGAG
AGGCGTCTCTGTGTGCTATCAGATGGTGCGGTTAAAACAGAATTATTATCTGAGGCTCACAGTTCCCCATTCTTCATGCACCCGGGTAGTACGAAGATGTATCAG
GACCTGAAATGGGTTTATTGGTGGCGTAATATGAAGAGGGAGGTGGCAGAATTTGTTAGTAAGTGCTTGGTGTGTCAGCAGGTTAAGGCTCCAAGGCAGAAACCA
GCGGGTTTATTACAACCCTTGAGCGTACTGGAATGGAAGTGGAAAAACGTGTCGATGGATTTCATTACAGGACTGCCAAGAACTTTGAGGGGTTTTACAGTGATT
TGGGTTGTAGTTGACAGGCTTACCAAATCAACGCACTTCGTTCCGGGTAAATCCACCTATACCGCTAATAAGTGGGCACAGTTGTACATGTCTGAGATAGTAAGA
TTACATGGAGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCCATTTCACTTCCAAGTTTTGGAATGGTTTGCAGACTGCTATGGGCACGAGGTTAGACTTTAGT
ATGGCTTTCCATCCACAGACTGACGGTCAGACTGAGCATCTGAACCAAGTTTTAGAAGATATGTTGCAAGCGTGTGCATTGGAATTTCCAGGTAGCTGGGACTCC
CACTTACATTTGATGGAATTTGCTTATAATAACAGTTTTCAGGCTACTATTGGCATGGCACCATTTGAGGCCTTGTACGGCAAATGCTGTAGATCCGTCGTTTGT
AGGAGTGAGGTGGGTGAGCAGAGATTGATGGGACCTGAGTTAGTTCAGTCTACTAACGAAGCGATACAGAAAATTAGATCTCGTATGCAGACCGTACAGAATAGG
CAGAAGAGCTATGCTGATGTGAGACGGAAGGATCTTGAGTTTGATGTGGGGGACAAGGTGTTCTTGAAGGTAGCATCTATGAGAGGTGTCTTATGA
Protein sequence
MSFGLTNAPAVFMNLMSRVFREFLYTFVILFIDDILIYSKTEAEHDEHLRMVLQTLRDNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSLDPAKKEAVTSWPRPST
VSEVRSFLGLAGYYRRFVEKFSRIATPLTQLTRKGASFIWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQN
YPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPVKANVVADALSRKAHSAALITRQAPLHRDLERSEIAVS
VGAVTMQLAQLTCGLAEAGQAVEFSISSDGGLLFERRLCVLSDGAVKTELLSEAHSSPFFMHPGSTKMYQDLKWVYWWRNMKREVAEFVSKCLVCQQVKAPRQKP
AGLLQPLSVLEWKWKNVSMDFITGLPRTLRGFTVIWVVVDRLTKSTHFVPGKSTYTANKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSKFWNGLQTAMGTRLDFS
MAFHPQTDGQTEHLNQVLEDMLQACALEFPGSWDSHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSVVCRSEVGEQRLMGPELVQSTNEAIQKIRSRMQTVQNR
QKSYADVRRKDLEFDVGDKVFLKVASMRGVL
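
The protein sequence appears to be the standard-genetic-code translation of the CDS listed above. A minimal sketch of that check follows, assuming the standard nuclear code with no ambiguity codes; the `translate` helper is illustrative and not part of CuGenDB. It is applied here to the final CDS line only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# Last line of the CDS sequence shown above.
last_cds_line = (
    "CAGAAGAGCTATGCTGATGTGAGACGGAAGGATCTTGAGTTTGATGTGGGGGACAAGGTG"
    "TTCTTGAAGGTAGCATCTATGAGAGGTGTCTTATGA"
)
print(translate(last_cds_line))  # QKSYADVRRKDLEFDVGDKVFLKVASMRGVL
```

Passing the whole CDS, with its line breaks, to `translate` should give the full protein for comparison with the listing above.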