; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0018853 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0018853
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionPol protein
Genome locationchr12:8632748..8637207
RNA-Seq ExpressionIVF0018853
SyntenyIVF0018853
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0008270 - zinc ion binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR041588 - Integrase zinc-binding domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR036397 - Ribonuclease H superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR012337 - Ribonuclease H-like superfamily
IPR001584 - Integrase, catalytic core


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040677.1 pol protein [Cucumis melo var. makuwa]0.082.03Show/hide
Query:  MGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASRK---------------------EEQSIFALKEDVNQLWHKRV
        MGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASRK                     EEQSIFALKEDVNQLWHKRV
Subjt:  MGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASRK---------------------EEQSIFALKEDVNQLWHKRV

Query:  GHYHHQGLLQLTELTLDFPSLMKKSQAAKRVILESKIGSHFPSHLGEPLKNCS-SFTQMLQF----KSEVAHVFWKFKARVENESGCKIQIVRYDNGKEY
        GHYHHQGLLQLTELTLDF  L ++  + K          HF     +     S   TQ LQ      +EVAHVFWKFKARVENESGCKIQIVRYDNGKEY
Subjt:  GHYHHQGLLQLTELTLDFPSLMKKSQAAKRVILESKIGSHFPSHLGEPLKNCS-SFTQMLQF----KSEVAHVFWKFKARVENESGCKIQIVRYDNGKEY

Query:  SSAEFDKFCEDLGIKHQLTAPYTPQQNGVSERRNRYIMEMTICMLHENSLPKKFWAEAANTVVFLQNRLPTIAVKEKTPFEAC-----------------
        SSAEFDKFCEDLGIKHQLTAPYTPQQNGVSERRNRYIMEMTICMLHENSLPKKFWAEAANTVVFLQNRLPTIAVKEKTPFEA                  
Subjt:  SSAEFDKFCEDLGIKHQLTAPYTPQQNGVSERRNRYIMEMTICMLHENSLPKKFWAEAANTVVFLQNRLPTIAVKEKTPFEAC-----------------

Query:  -----------------------IEEEDDRHNEIVDDASVRGTRLLSDIYERCNVSVCEPANYAEVKKDQRWVAAMEEELSMVEKNKTWILVDRPQDRKV
                               IEEEDDRHNEIVDDASVRGTRLLSDIYERCNVSVCEPANYAEVKKDQRWVAAMEEELSMVEKNKTWILVDRPQDRKV
Subjt:  -----------------------IEEEDDRHNEIVDDASVRGTRLLSDIYERCNVSVCEPANYAEVKKDQRWVAAMEEELSMVEKNKTWILVDRPQDRKV

Query:  IGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVARMDSFDRVQLQELLDKGFIRPSVSPW-----------------------------
        IGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVARMDSFDRVQLQELLDKGFIRPSVSPW                             
Subjt:  IGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVARMDSFDRVQLQELLDKGFIRPSVSPW-----------------------------

Query:  ----------------------------------------------------------------------------------GYYRWFVENFSHIVTPLT
                                                                                          GYYRWFVENFSHIVTPLT
Subjt:  ----------------------------------------------------------------------------------GYYRWFVENFSHIVTPLT

Query:  QLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHEQNYPTHDLELAAVVFALKICRHYLYG
        QLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHEQNYPTHDLELAAVVFALKICRHYLYG
Subjt:  QLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHEQNYPTHDLELAAVVFALKICRHYLYG

Query:  EKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVSVGAVTMQLAQLTVQPTLR
        EKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVSVGAVTMQLAQLTVQPTLR
Subjt:  EKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVSVGAVTMQLAQLTVQPTLR

Query:  QRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ
        QRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ
Subjt:  QRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ

Query:  QVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHGVPVLIVSDRDAYFTSKFW
        QVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHGVPVLIVSDRDAYFTSKFW
Subjt:  QVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHGVPVLIVSDRDAYFTSKFW

Query:  KEYYWHGTIEALYGKCCRSPVCWGEVGEQRLMGPELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRF
        KEYYWHGTIEALYGKCCRSPVCWGEVGEQRLMGPELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRF
Subjt:  KEYYWHGTIEALYGKCCRSPVCWGEVGEQRLMGPELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRF

Query:  VGTFEILERIGSVAYRLALPPSLSAVHDVFHVSMLRKYVSDPSHVVDYEPLEIDENLSYTEQLVEVLARVEEATWEIEDNMRARYPELFEE
        VGTFEILERIGSVAYRLALPPSLSAVHDVFHVSMLRKYVSDPSHVVDYEPLEIDENLSYTEQLVEVLARVEEATWEIEDNMRARYPELFEE
Subjt:  VGTFEILERIGSVAYRLALPPSLSAVHDVFHVSMLRKYVSDPSHVVDYEPLEIDENLSYTEQLVEVLARVEEATWEIEDNMRARYPELFEE

KAA0051357.1 pol protein [Cucumis melo var. makuwa]0.080.14Show/hide
Query:  IRPSVSPWGYYRWFVENFSHIVTPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHE
        +R  +   GYYR FVENFS I TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS A KKGLGCVLMQQGKVVAYAS QLKSHE
Subjt:  IRPSVSPWGYYRWFVENFSHIVTPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHE

Query:  QNYPTHDLELAAVVFALKICRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDL
        QNYPTHDLELAAVVFALKI RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALIT+QA LHRDL
Subjt:  QNYPTHDLELAAVVFALKICRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDL

Query:  ERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQ
        ERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPY VEK GLAEAGQA+EFS+SSDGG  FE RLCVPSDSAVKTELL EAHSSPFSMHPGSTKMYQ
Subjt:  ERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQ

Query:  DLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYM
        DLKRVYWWRNMKREVAEFVSKCLVCQQVK PRQKPA LLQPLS+PEWKWENVSMDFITGLPRTLRG TVIWV VD+LTKSAHFVPGKS YTAS WAQLYM
Subjt:  DLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYM

Query:  SEIVRLHGVPVLIVSDRDAYFTSKFWK-----------------------------------------------------EYYWHGT---------IEAL
        SEIVRLHGVPV IVSDRDA FTSKFWK                                                     E+ ++ +          EAL
Subjt:  SEIVRLHGVPVLIVSDRDAYFTSKFWK-----------------------------------------------------EYYWHGT---------IEAL

Query:  YGKCCRSPVCWGEVGEQRLMGPELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGS
        YGKCCRSPVCWGEVGEQRLMGPELVQS+NEAIQKIRSRM TAQSRQKSYADVRRKDLEF++ DKVFLKVAPMKGVLRFERRGKLS RFVG FEILERIG 
Subjt:  YGKCCRSPVCWGEVGEQRLMGPELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGS

Query:  VAYRLALPPSLSAVHDVFHVSMLRKYVSDPSHVVDYEPLEIDENLSYTEQLVEVLAR---------------------VEEATWEIEDNMRARYPELFEE
        VAYRLALPPSLS VHDVFHVSMLRKYV DPSHVVDYEPLEIDENLSY EQ VEVLAR                     VEEATWE ED+MR+RYPELFEE
Subjt:  VAYRLALPPSLSAVHDVFHVSMLRKYVSDPSHVVDYEPLEIDENLSYTEQLVEVLAR---------------------VEEATWEIEDNMRARYPELFEE

KAA0051368.1 pol protein [Cucumis melo var. makuwa]0.080.57Show/hide
Query:  IRPSVSPWGYYRWFVENFSHIVTPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHE
        +R  +   GYYR FVENFS I TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS A KKGLGCVLMQQGKVVAYAS QLKSHE
Subjt:  IRPSVSPWGYYRWFVENFSHIVTPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHE

Query:  QNYPTHDLELAAVVFALKICRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDL
        QNYPTHDLELAAVVFALKI RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALIT+QA LHRDL
Subjt:  QNYPTHDLELAAVVFALKICRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDL

Query:  ERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQ
        ERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPY VEK GLAEAGQ  EFS+SSDGG LFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQ
Subjt:  ERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQ

Query:  DLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYM
        DLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPA LLQPLS+PEWKWENVSMDFITGLPRTLRG TVIWV VD+LTKSAHFVPGKS YTAS WAQLYM
Subjt:  DLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYM

Query:  SEIVRLHGVPVLIVSDRDAYFTSKFWK-----------------------------------------------------EYYWHGT---------IEAL
        SEIVRLHGVPV IVSDRDA FTSKFWK                                                     E+ ++ +          EAL
Subjt:  SEIVRLHGVPVLIVSDRDAYFTSKFWK-----------------------------------------------------EYYWHGT---------IEAL

Query:  YGKCCRSPVCWGEVGEQRLMGPELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGS
        YG+CCRSPVCWGEVGEQRLMGPELVQS+NEAIQKIRSRM TAQSRQKSYADVRRKDLEF++GDKVFLKVAPMKGVLRFERRGKLS RFVG FEILERIG 
Subjt:  YGKCCRSPVCWGEVGEQRLMGPELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGS

Query:  VAYRLALPPSLSAVHDVFHVSMLRKYVSDPSHVVDYEPLEIDENLSYTEQLVEVLAR---------------------VEEATWEIEDNMRARYPELFEE
        VAYRLALPPSLS VHDVFHVSMLRKYV DPSHVVDYEPLEIDENLSY EQ VEVLAR                     VEEATWE ED+MR+RYPELFEE
Subjt:  VAYRLALPPSLSAVHDVFHVSMLRKYVSDPSHVVDYEPLEIDENLSYTEQLVEVLAR---------------------VEEATWEIEDNMRARYPELFEE

KAA0056702.1 pol protein [Cucumis melo var. makuwa]0.084.01Show/hide
Query:  IRPSVSPWGYYRWFVENFSHIVTPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHE
        +R  +   GYYR FVENFS I TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPV+TVPDGSGSFVIYS A KKGLGCVLMQQGKVVAYAS QLKSHE
Subjt:  IRPSVSPWGYYRWFVENFSHIVTPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHE

Query:  QNYPTHDLELAAVVFALKICRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDL
        QNYPTHDLELAAVVFALKI RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALIT+QA LHRDL
Subjt:  QNYPTHDLELAAVVFALKICRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDL

Query:  ERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQ
        ERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPY VEK GLAEAGQA+EFS+SSDGG LFER LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQ
Subjt:  ERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQ

Query:  DLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYM
        DLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPA LLQPLS+PEWKWENVSMDFITGLPRTLRG TVIW  VD+LTKSAHFVPGKS YTAS WAQLYM
Subjt:  DLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYM

Query:  SEIVRLHGVPVLIVSDRDAYFTSKFWK----------------------EYYWHGT---------IEALYGKCCRSPVCWGEVGEQRLMGPELVQSSNEA
        SEIVRLHGVPV IVSDRDA FTSKFWK                      E+ ++ +          EALYGKCCRSPVCWGEVGEQRLMGPELVQS+NEA
Subjt:  SEIVRLHGVPVLIVSDRDAYFTSKFWK----------------------EYYWHGT---------IEALYGKCCRSPVCWGEVGEQRLMGPELVQSSNEA

Query:  IQKIRSRMQTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALPPSLSAVHDVFHVSMLRKYVSDPS
        IQKIRSRM TAQSRQKSYADVRRKDLEF+VGDKVFLKVAPM+GVLRFERRGKLS RFVG FEILERIG VAYRLALPPSLS VHDVFHVSMLRKYV DPS
Subjt:  IQKIRSRMQTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALPPSLSAVHDVFHVSMLRKYVSDPS

Query:  HVVDYEPLEIDENLSYTEQLVEVLAR---------------------VEEATWEIEDNMRARYPELFEE
        HVVDYEPLEIDENLSYTEQ VEVLAR                     VEEATWE ED+MR+RYP+LFEE
Subjt:  HVVDYEPLEIDENLSYTEQLVEVLAR---------------------VEEATWEIEDNMRARYPELFEE

KAA0063718.1 pol protein [Cucumis melo var. makuwa]0.084.93Show/hide
Query:  IRPSVSPWGYYRWFVENFSHIVTPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHE
        +R  +   GYYR FVENFS I TPLTQLTRK APFVWSK CEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS A KKGL CVLMQQGKVVAYAS QLKSHE
Subjt:  IRPSVSPWGYYRWFVENFSHIVTPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHE

Query:  QNYPTHDLELAAVVFALKICRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDL
        QNYPTHDLELAAVVFALKI RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALIT+QA LHRDL
Subjt:  QNYPTHDLELAAVVFALKICRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDL

Query:  ERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQ
        ERAEIAVSVGAVT+QLAQLTVQ TLRQRIIDAQSNDPY  EK GLAEAGQA EFSISSDGG LFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQ
Subjt:  ERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQ

Query:  DLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYM
        DLKRVYWWRNMKREVAEFVS+CLVCQQVKAP+QKPA LLQPLS+PEWKWENVSMDFITGLPRTLRG TVIWV VD+LTKSAHFVPGKS YTAS WAQLYM
Subjt:  DLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYM

Query:  SEIVRLHGVPVLIVSDRDAYFTSKFWK----------EYYWHGT---------IEALYGKCCRSPVCWGEVGEQRLMGPELVQSSNEAIQKIRSRMQTAQ
        SEIVRLHGVPVLIVSDRDA FTSKFWK          E+ ++ +          EALYGKCCRSPVCWGEVGEQRLMGPELVQS+NEAIQKIRSRM TAQ
Subjt:  SEIVRLHGVPVLIVSDRDAYFTSKFWK----------EYYWHGT---------IEALYGKCCRSPVCWGEVGEQRLMGPELVQSSNEAIQKIRSRMQTAQ

Query:  SRQKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALPPSLSAVHDVFHVSMLRKYVSDPSHVVDYEPLEIDE
        SRQKSYADVRRKDLEF+VGDKVFLKVAPM+GVLRFER+GKLS RFVG FEILERIG VAYRLALPPSLS VHDVFHVSMLRKYV DPSHVVDY+PLEIDE
Subjt:  SRQKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALPPSLSAVHDVFHVSMLRKYVSDPSHVVDYEPLEIDE

Query:  NLSYTEQLVEVLAR---------------------VEEATWEIEDNMRARYPELFEE
        NLSYTEQ  EVLAR                     VEEATWE ED+MR+RYPELFEE
Subjt:  NLSYTEQLVEVLAR---------------------VEEATWEIEDNMRARYPELFEE

TrEMBL top hitse value%identityAlignment
A0A5A7U7V9 Reverse transcriptase0.0e+0081.36Show/hide
Query:  GYYRWFVENFSHIVTPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHEQNYPTHDL
        GYYR FVENFS I TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS A KKGLGCVLMQQGKVVAYAS QLKSHEQNYPTHDL
Subjt:  GYYRWFVENFSHIVTPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHEQNYPTHDL

Query:  ELAAVVFALKICRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVS
        ELAAVVFALKI RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALIT+QA LHRDLERAEIAVS
Subjt:  ELAAVVFALKICRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVS

Query:  VGAVTMQLAQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW
        VGAVTMQLAQLTVQPTLRQRIIDAQSNDPY VEK GLAEAGQ  EFS+SSDGG LFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW
Subjt:  VGAVTMQLAQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW

Query:  RNMKREVAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHG
        RNMKREVAEFVSKCLVCQQVKAPRQKPA LLQPLS+PEWKWENVSMDFITGLPRTLRG TVIWV VD+LTKSAHFVPGKS YTAS WAQLYMSEIVRLHG
Subjt:  RNMKREVAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHG

Query:  VPVLIVSDRDAYFTSKFWK-----------------------------------------------------EYYWHGT---------IEALYGKCCRSP
        VPV IVSDRDA FTSKFWK                                                     E+ ++ +          EALYG+CCRSP
Subjt:  VPVLIVSDRDAYFTSKFWK-----------------------------------------------------EYYWHGT---------IEALYGKCCRSP

Query:  VCWGEVGEQRLMGPELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALP
        VCWGEVGEQRLMGPELVQS+NEAIQKIRSRM TAQSRQKSYADVRRKDLEF++GDKVFLKVAPMKGVLRFERRGKLS RFVG FEILERIG VAYRLALP
Subjt:  VCWGEVGEQRLMGPELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALP

Query:  PSLSAVHDVFHVSMLRKYVSDPSHVVDYEPLEIDENLSYTEQLVEVLA---------------------RVEEATWEIEDNMRARYPELFEE
        PSLS VHDVFHVSMLRKYV DPSHVVDYEPLEIDENLSY EQ VEVLA                     RVEEATWE ED+MR+RYPELFEE
Subjt:  PSLSAVHDVFHVSMLRKYVSDPSHVVDYEPLEIDENLSYTEQLVEVLA---------------------RVEEATWEIEDNMRARYPELFEE

A0A5A7UAA8 Reverse transcriptase3.8e-31080.92Show/hide
Query:  GYYRWFVENFSHIVTPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHEQNYPTHDL
        GYYR FVENFS I TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS A KKGLGCVLMQQGKVVAYAS QLKSHEQNYPTHDL
Subjt:  GYYRWFVENFSHIVTPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHEQNYPTHDL

Query:  ELAAVVFALKICRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVS
        ELAAVVFALKI RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALIT+QA LHRDLERAEIAVS
Subjt:  ELAAVVFALKICRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVS

Query:  VGAVTMQLAQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW
        VGAVTMQLAQLTVQPTLRQRIIDAQSNDPY VEK GLAEAGQA+EFS+SSDGG  FE RLCVPSDSAVKTELL EAHSSPFSMHPGSTKMYQDLKRVYWW
Subjt:  VGAVTMQLAQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW

Query:  RNMKREVAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHG
        RNMKREVAEFVSKCLVCQQVK PRQKPA LLQPLS+PEWKWENVSMDFITGLPRTLRG TVIWV VD+LTKSAHFVPGKS YTAS WAQLYMSEIVRLHG
Subjt:  RNMKREVAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHG

Query:  VPVLIVSDRDAYFTSKFWK-----------------------------------------------------EYYWHGT---------IEALYGKCCRSP
        VPV IVSDRDA FTSKFWK                                                     E+ ++ +          EALYGKCCRSP
Subjt:  VPVLIVSDRDAYFTSKFWK-----------------------------------------------------EYYWHGT---------IEALYGKCCRSP

Query:  VCWGEVGEQRLMGPELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALP
        VCWGEVGEQRLMGPELVQS+NEAIQKIRSRM TAQSRQKSYADVRRKDLEF++ DKVFLKVAPMKGVLRFERRGKLS RFVG FEILERIG VAYRLALP
Subjt:  VCWGEVGEQRLMGPELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALP

Query:  PSLSAVHDVFHVSMLRKYVSDPSHVVDYEPLEIDENLSYTEQLVEVLA---------------------RVEEATWEIEDNMRARYPELFEE
        PSLS VHDVFHVSMLRKYV DPSHVVDYEPLEIDENLSY EQ VEVLA                     RVEEATWE ED+MR+RYPELFEE
Subjt:  PSLSAVHDVFHVSMLRKYVSDPSHVVDYEPLEIDENLSYTEQLVEVLA---------------------RVEEATWEIEDNMRARYPELFEE

A0A5A7ULI8 Pol protein0.0e+0084.87Show/hide
Query:  GYYRWFVENFSHIVTPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHEQNYPTHDL
        GYYR FVENFS I TPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPV+TVPDGSGSFVIYS A KKGLGCVLMQQGKVVAYAS QLKSHEQNYPTHDL
Subjt:  GYYRWFVENFSHIVTPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHEQNYPTHDL

Query:  ELAAVVFALKICRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVS
        ELAAVVFALKI RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALIT+QA LHRDLERAEIAVS
Subjt:  ELAAVVFALKICRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVS

Query:  VGAVTMQLAQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW
        VGAVTMQLAQLTVQPTLRQRIIDAQSNDPY VEK GLAEAGQA+EFS+SSDGG LFER LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW
Subjt:  VGAVTMQLAQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW

Query:  RNMKREVAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHG
        RNMKREVAEFVS+CLVCQQVKAPRQKPA LLQPLS+PEWKWENVSMDFITGLPRTLRG TVIW  VD+LTKSAHFVPGKS YTAS WAQLYMSEIVRLHG
Subjt:  RNMKREVAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHG

Query:  VPVLIVSDRDAYFTSKFWK----------------------EYYWHGT---------IEALYGKCCRSPVCWGEVGEQRLMGPELVQSSNEAIQKIRSRM
        VPV IVSDRDA FTSKFWK                      E+ ++ +          EALYGKCCRSPVCWGEVGEQRLMGPELVQS+NEAIQKIRSRM
Subjt:  VPVLIVSDRDAYFTSKFWK----------------------EYYWHGT---------IEALYGKCCRSPVCWGEVGEQRLMGPELVQSSNEAIQKIRSRM

Query:  QTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALPPSLSAVHDVFHVSMLRKYVSDPSHVVDYEPL
         TAQSRQKSYADVRRKDLEF+VGDKVFLKVAPM+GVLRFERRGKLS RFVG FEILERIG VAYRLALPPSLS VHDVFHVSMLRKYV DPSHVVDYEPL
Subjt:  QTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALPPSLSAVHDVFHVSMLRKYVSDPSHVVDYEPL

Query:  EIDENLSYTEQLVEVLA---------------------RVEEATWEIEDNMRARYPELFEE
        EIDENLSYTEQ VEVLA                     RVEEATWE ED+MR+RYP+LFEE
Subjt:  EIDENLSYTEQLVEVLA---------------------RVEEATWEIEDNMRARYPELFEE

A0A5A7V810 Reverse transcriptase0.0e+0085.82Show/hide
Query:  GYYRWFVENFSHIVTPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHEQNYPTHDL
        GYYR FVENFS I TPLTQLTRK APFVWSK CEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS A KKGL CVLMQQGKVVAYAS QLKSHEQNYPTHDL
Subjt:  GYYRWFVENFSHIVTPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHEQNYPTHDL

Query:  ELAAVVFALKICRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVS
        ELAAVVFALKI RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALIT+QA LHRDLERAEIAVS
Subjt:  ELAAVVFALKICRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVS

Query:  VGAVTMQLAQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW
        VGAVT+QLAQLTVQ TLRQRIIDAQSNDPY  EK GLAEAGQA EFSISSDGG LFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW
Subjt:  VGAVTMQLAQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW

Query:  RNMKREVAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHG
        RNMKREVAEFVS+CLVCQQVKAP+QKPA LLQPLS+PEWKWENVSMDFITGLPRTLRG TVIWV VD+LTKSAHFVPGKS YTAS WAQLYMSEIVRLHG
Subjt:  RNMKREVAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHG

Query:  VPVLIVSDRDAYFTSKFWK----------EYYWHGT---------IEALYGKCCRSPVCWGEVGEQRLMGPELVQSSNEAIQKIRSRMQTAQSRQKSYAD
        VPVLIVSDRDA FTSKFWK          E+ ++ +          EALYGKCCRSPVCWGEVGEQRLMGPELVQS+NEAIQKIRSRM TAQSRQKSYAD
Subjt:  VPVLIVSDRDAYFTSKFWK----------EYYWHGT---------IEALYGKCCRSPVCWGEVGEQRLMGPELVQSSNEAIQKIRSRMQTAQSRQKSYAD

Query:  VRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALPPSLSAVHDVFHVSMLRKYVSDPSHVVDYEPLEIDENLSYTEQL
        VRRKDLEF+VGDKVFLKVAPM+GVLRFER+GKLS RFVG FEILERIG VAYRLALPPSLS VHDVFHVSMLRKYV DPSHVVDY+PLEIDENLSYTEQ 
Subjt:  VRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALPPSLSAVHDVFHVSMLRKYVSDPSHVVDYEPLEIDENLSYTEQL

Query:  VEVLA---------------------RVEEATWEIEDNMRARYPELFEE
         EVLA                     RVEEATWE ED+MR+RYPELFEE
Subjt:  VEVLA---------------------RVEEATWEIEDNMRARYPELFEE

A0A5D3CBT3 Pol protein0.0e+0082.03Show/hide
Query:  MGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASRK---------------------EEQSIFALKEDVNQLWHKRV
        MGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASRK                     EEQSIFALKEDVNQLWHKRV
Subjt:  MGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASRK---------------------EEQSIFALKEDVNQLWHKRV

Query:  GHYHHQGLLQLTELTLDFPSLMKKSQAAKRVILESKIGSHFPSHLGEPLKNCS-SFTQMLQF----KSEVAHVFWKFKARVENESGCKIQIVRYDNGKEY
        GHYHHQGLLQLTELTLDF  L ++  + K          HF     +     S   TQ LQ      +EVAHVFWKFKARVENESGCKIQIVRYDNGKEY
Subjt:  GHYHHQGLLQLTELTLDFPSLMKKSQAAKRVILESKIGSHFPSHLGEPLKNCS-SFTQMLQF----KSEVAHVFWKFKARVENESGCKIQIVRYDNGKEY

Query:  SSAEFDKFCEDLGIKHQLTAPYTPQQNGVSERRNRYIMEMTICMLHENSLPKKFWAEAANTVVFLQNRLPTIAVKEKTPFEAC-----------------
        SSAEFDKFCEDLGIKHQLTAPYTPQQNGVSERRNRYIMEMTICMLHENSLPKKFWAEAANTVVFLQNRLPTIAVKEKTPFEA                  
Subjt:  SSAEFDKFCEDLGIKHQLTAPYTPQQNGVSERRNRYIMEMTICMLHENSLPKKFWAEAANTVVFLQNRLPTIAVKEKTPFEAC-----------------

Query:  -----------------------IEEEDDRHNEIVDDASVRGTRLLSDIYERCNVSVCEPANYAEVKKDQRWVAAMEEELSMVEKNKTWILVDRPQDRKV
                               IEEEDDRHNEIVDDASVRGTRLLSDIYERCNVSVCEPANYAEVKKDQRWVAAMEEELSMVEKNKTWILVDRPQDRKV
Subjt:  -----------------------IEEEDDRHNEIVDDASVRGTRLLSDIYERCNVSVCEPANYAEVKKDQRWVAAMEEELSMVEKNKTWILVDRPQDRKV

Query:  IGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVARMDSFDRVQLQELLDKGFIRPSVSPW-----------------------------
        IGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVARMDSFDRVQLQELLDKGFIRPSVSPW                             
Subjt:  IGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVARMDSFDRVQLQELLDKGFIRPSVSPW-----------------------------

Query:  ----------------------------------------------------------------------------------GYYRWFVENFSHIVTPLT
                                                                                          GYYRWFVENFSHIVTPLT
Subjt:  ----------------------------------------------------------------------------------GYYRWFVENFSHIVTPLT

Query:  QLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHEQNYPTHDLELAAVVFALKICRHYLYG
        QLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHEQNYPTHDLELAAVVFALKICRHYLYG
Subjt:  QLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHEQNYPTHDLELAAVVFALKICRHYLYG

Query:  EKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVSVGAVTMQLAQLTVQPTLR
        EKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVSVGAVTMQLAQLTVQPTLR
Subjt:  EKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVSVGAVTMQLAQLTVQPTLR

Query:  QRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ
        QRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ
Subjt:  QRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ

Query:  QVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHGVPVLIVSDRDAYFTSKFW
        QVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHGVPVLIVSDRDAYFTSKFW
Subjt:  QVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHGVPVLIVSDRDAYFTSKFW

Query:  KEYYWHGTIEALYGKCCRSPVCWGEVGEQRLMGPELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRF
        KEYYWHGTIEALYGKCCRSPVCWGEVGEQRLMGPELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRF
Subjt:  KEYYWHGTIEALYGKCCRSPVCWGEVGEQRLMGPELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRF

Query:  VGTFEILERIGSVAYRLALPPSLSAVHDVFHVSMLRKYVSDPSHVVDYEPLEIDENLSYTEQLVEVLARVEEATWEIEDNMRARYPELFEE
        VGTFEILERIGSVAYRLALPPSLSAVHDVFHVSMLRKYVSDPSHVVDYEPLEIDENLSYTEQLVEVLARVEEATWEIEDNMRARYPELFEE
Subjt:  VGTFEILERIGSVAYRLALPPSLSAVHDVFHVSMLRKYVSDPSHVVDYEPLEIDENLSYTEQLVEVLARVEEATWEIEDNMRARYPELFEE

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein4.9e-5525.56Show/hide
Query:  ILVDRPQDRKVIGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVARMDSFDRV-QLQELLDKGFIRPSVSPWGYYRWFVENFSHIVTPL
        ++  + +   V  VK V +   NA+  IN+ K +   +   +  G   S+        ++ D+V Q ++  ++  +R  +    Y R F+   S +  PL
Subjt:  ILVDRPQDRKVIGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVARMDSFDRV-QLQELLDKGFIRPSVSPWGYYRWFVENFSHIVTPL

Query:  TQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGK-----VVAYASCQLKSHEQNYPTHDLELAAVVFALKIC
          L +K   + W+     + +N+KQ LV+ PVL   D S   ++ + A    +G VL Q+        V Y S ++   + NY   D E+ A++ +LK  
Subjt:  TQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGK-----VVAYASCQLKSHEQNYPTHDLELAAVVFALKIC

Query:  RHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVSVGAVTMQL
        RHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++ +   + +D E   I          +
Subjt:  RHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVSVGAVTMQL

Query:  AQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRE
         Q+++    + +++   +ND   +    L    + +E +I    G L   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ ++++
Subjt:  AQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRE

Query:  VAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHGVPVLIV
        + E+V  C  CQ  K+   KP   LQP+   E  WE++SMDFIT LP +  G   ++V VD+ +K A  VP     TA   A+++   ++   G P  I+
Subjt:  VAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHGVPVLIV

Query:  SDRDAYFTSKFWKEY-----------------------YWHGTIEALYGKCCRS-PVCW------------GEVGEQRLMGP------------------
        +D D  FTS+ WK++                         + T+E L    C + P  W              +     M P                  
Subjt:  SDRDAYFTSKFWKEY-----------------------YWHGTIEALYGKCCRS-PVCW------------GEVGEQRLMGP------------------

Query:  ------ELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDL-EFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALPPSLSAV-
              E  Q + +  Q ++  + T   + K Y D++ +++ EF  GD V +K     G L   +  KL+  F G F +L++ G   Y L LP S+  + 
Subjt:  ------ELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDL-EFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALPPSLSAV-

Query:  HDVFHVSMLRKY
           FHVS L KY
Subjt:  HDVFHVSMLRKY

P0CT35 Transposon Tf2-2 polyprotein4.9e-5525.56Show/hide
Query:  ILVDRPQDRKVIGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVARMDSFDRV-QLQELLDKGFIRPSVSPWGYYRWFVENFSHIVTPL
        ++  + +   V  VK V +   NA+  IN+ K +   +   +  G   S+        ++ D+V Q ++  ++  +R  +    Y R F+   S +  PL
Subjt:  ILVDRPQDRKVIGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVARMDSFDRV-QLQELLDKGFIRPSVSPWGYYRWFVENFSHIVTPL

Query:  TQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGK-----VVAYASCQLKSHEQNYPTHDLELAAVVFALKIC
          L +K   + W+     + +N+KQ LV+ PVL   D S   ++ + A    +G VL Q+        V Y S ++   + NY   D E+ A++ +LK  
Subjt:  TQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGK-----VVAYASCQLKSHEQNYPTHDLELAAVVFALKIC

Query:  RHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVSVGAVTMQL
        RHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++ +   + +D E   I          +
Subjt:  RHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVSVGAVTMQL

Query:  AQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRE
         Q+++    + +++   +ND   +    L    + +E +I    G L   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ ++++
Subjt:  AQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRE

Query:  VAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHGVPVLIV
        + E+V  C  CQ  K+   KP   LQP+   E  WE++SMDFIT LP +  G   ++V VD+ +K A  VP     TA   A+++   ++   G P  I+
Subjt:  VAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHGVPVLIV

Query:  SDRDAYFTSKFWKEY-----------------------YWHGTIEALYGKCCRS-PVCW------------GEVGEQRLMGP------------------
        +D D  FTS+ WK++                         + T+E L    C + P  W              +     M P                  
Subjt:  SDRDAYFTSKFWKEY-----------------------YWHGTIEALYGKCCRS-PVCW------------GEVGEQRLMGP------------------

Query:  ------ELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDL-EFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALPPSLSAV-
              E  Q + +  Q ++  + T   + K Y D++ +++ EF  GD V +K     G L   +  KL+  F G F +L++ G   Y L LP S+  + 
Subjt:  ------ELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDL-EFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALPPSLSAV-

Query:  HDVFHVSMLRKY
           FHVS L KY
Subjt:  HDVFHVSMLRKY

P0CT36 Transposon Tf2-3 polyprotein4.9e-5525.56Show/hide
Query:  ILVDRPQDRKVIGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVARMDSFDRV-QLQELLDKGFIRPSVSPWGYYRWFVENFSHIVTPL
        ++  + +   V  VK V +   NA+  IN+ K +   +   +  G   S+        ++ D+V Q ++  ++  +R  +    Y R F+   S +  PL
Subjt:  ILVDRPQDRKVIGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVARMDSFDRV-QLQELLDKGFIRPSVSPWGYYRWFVENFSHIVTPL

Query:  TQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGK-----VVAYASCQLKSHEQNYPTHDLELAAVVFALKIC
          L +K   + W+     + +N+KQ LV+ PVL   D S   ++ + A    +G VL Q+        V Y S ++   + NY   D E+ A++ +LK  
Subjt:  TQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGK-----VVAYASCQLKSHEQNYPTHDLELAAVVFALKIC

Query:  RHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVSVGAVTMQL
        RHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++ +   + +D E   I          +
Subjt:  RHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVSVGAVTMQL

Query:  AQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRE
         Q+++    + +++   +ND   +    L    + +E +I    G L   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ ++++
Subjt:  AQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRE

Query:  VAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHGVPVLIV
        + E+V  C  CQ  K+   KP   LQP+   E  WE++SMDFIT LP +  G   ++V VD+ +K A  VP     TA   A+++   ++   G P  I+
Subjt:  VAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHGVPVLIV

Query:  SDRDAYFTSKFWKEY-----------------------YWHGTIEALYGKCCRS-PVCW------------GEVGEQRLMGP------------------
        +D D  FTS+ WK++                         + T+E L    C + P  W              +     M P                  
Subjt:  SDRDAYFTSKFWKEY-----------------------YWHGTIEALYGKCCRS-PVCW------------GEVGEQRLMGP------------------

Query:  ------ELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDL-EFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALPPSLSAV-
              E  Q + +  Q ++  + T   + K Y D++ +++ EF  GD V +K     G L   +  KL+  F G F +L++ G   Y L LP S+  + 
Subjt:  ------ELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDL-EFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALPPSLSAV-

Query:  HDVFHVSMLRKY
           FHVS L KY
Subjt:  HDVFHVSMLRKY

P0CT41 Transposon Tf2-12 polyprotein4.9e-5525.56Show/hide
Query:  ILVDRPQDRKVIGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVARMDSFDRV-QLQELLDKGFIRPSVSPWGYYRWFVENFSHIVTPL
        ++  + +   V  VK V +   NA+  IN+ K +   +   +  G   S+        ++ D+V Q ++  ++  +R  +    Y R F+   S +  PL
Subjt:  ILVDRPQDRKVIGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVARMDSFDRV-QLQELLDKGFIRPSVSPWGYYRWFVENFSHIVTPL

Query:  TQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGK-----VVAYASCQLKSHEQNYPTHDLELAAVVFALKIC
          L +K   + W+     + +N+KQ LV+ PVL   D S   ++ + A    +G VL Q+        V Y S ++   + NY   D E+ A++ +LK  
Subjt:  TQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGK-----VVAYASCQLKSHEQNYPTHDLELAAVVFALKIC

Query:  RHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVSVGAVTMQL
        RHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++ +   + +D E   I          +
Subjt:  RHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVSVGAVTMQL

Query:  AQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRE
         Q+++    + +++   +ND   +    L    + +E +I    G L   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ ++++
Subjt:  AQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRE

Query:  VAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHGVPVLIV
        + E+V  C  CQ  K+   KP   LQP+   E  WE++SMDFIT LP +  G   ++V VD+ +K A  VP     TA   A+++   ++   G P  I+
Subjt:  VAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHGVPVLIV

Query:  SDRDAYFTSKFWKEY-----------------------YWHGTIEALYGKCCRS-PVCW------------GEVGEQRLMGP------------------
        +D D  FTS+ WK++                         + T+E L    C + P  W              +     M P                  
Subjt:  SDRDAYFTSKFWKEY-----------------------YWHGTIEALYGKCCRS-PVCW------------GEVGEQRLMGP------------------

Query:  ------ELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDL-EFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALPPSLSAV-
              E  Q + +  Q ++  + T   + K Y D++ +++ EF  GD V +K     G L   +  KL+  F G F +L++ G   Y L LP S+  + 
Subjt:  ------ELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDL-EFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALPPSLSAV-

Query:  HDVFHVSMLRKY
           FHVS L KY
Subjt:  HDVFHVSMLRKY

Q9UR07 Transposon Tf2-11 polyprotein4.9e-5525.56Show/hide
Query:  ILVDRPQDRKVIGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVARMDSFDRV-QLQELLDKGFIRPSVSPWGYYRWFVENFSHIVTPL
        ++  + +   V  VK V +   NA+  IN+ K +   +   +  G   S+        ++ D+V Q ++  ++  +R  +    Y R F+   S +  PL
Subjt:  ILVDRPQDRKVIGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVARMDSFDRV-QLQELLDKGFIRPSVSPWGYYRWFVENFSHIVTPL

Query:  TQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGK-----VVAYASCQLKSHEQNYPTHDLELAAVVFALKIC
          L +K   + W+     + +N+KQ LV+ PVL   D S   ++ + A    +G VL Q+        V Y S ++   + NY   D E+ A++ +LK  
Subjt:  TQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGK-----VVAYASCQLKSHEQNYPTHDLELAAVVFALKIC

Query:  RHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVSVGAVTMQL
        RHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++ +   + +D E   I          +
Subjt:  RHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVSVGAVTMQL

Query:  AQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRE
         Q+++    + +++   +ND   +    L    + +E +I    G L   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ ++++
Subjt:  AQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKRE

Query:  VAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHGVPVLIV
        + E+V  C  CQ  K+   KP   LQP+   E  WE++SMDFIT LP +  G   ++V VD+ +K A  VP     TA   A+++   ++   G P  I+
Subjt:  VAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVFVDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHGVPVLIV

Query:  SDRDAYFTSKFWKEY-----------------------YWHGTIEALYGKCCRS-PVCW------------GEVGEQRLMGP------------------
        +D D  FTS+ WK++                         + T+E L    C + P  W              +     M P                  
Subjt:  SDRDAYFTSKFWKEY-----------------------YWHGTIEALYGKCCRS-PVCW------------GEVGEQRLMGP------------------

Query:  ------ELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDL-EFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALPPSLSAV-
              E  Q + +  Q ++  + T   + K Y D++ +++ EF  GD V +K     G L   +  KL+  F G F +L++ G   Y L LP S+  + 
Subjt:  ------ELVQSSNEAIQKIRSRMQTAQSRQKSYADVRRKDL-EFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALPPSLSAV-

Query:  HDVFHVSMLRKY
           FHVS L KY
Subjt:  HDVFHVSMLRKY

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.9e-1740.21Show/hide
Query:  CNVSVCEPANYAEVKKDQRWVAAMEEELSMVEKNKTWILVDRPQDRKVIGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVARMDS
        C     EP+ Y E K+   W  AM++E+  +E   TW +   P ++K IG KWV++ K N+DG+I ++KA+LV KGY Q  G+D+ +TF+ V ++ S
Subjt:  CNVSVCEPANYAEVKKDQRWVAAMEEELSMVEKNKTWILVDRPQDRKVIGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVARMDS

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.4e-1744.32Show/hide
Query:  EPANYAEVKKDQRWVAAMEEELSMVEKNKTWILVDRPQDRKVIGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVAR
        EP +     KD  W  AM+EEL  + +NKTWILV  P ++ ++G KWVF+TKL++DG++++ KA+LV KG+ Q  G+ + +T++ V R
Subjt:  EPANYAEVKKDQRWVAAMEEELSMVEKNKTWILVDRPQDRKVIGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVAR

ATMG00860.1 DNA/RNA polymerases superfamily protein1.9e-0643.33Show/hide
Query:  GYYRWFVENFSHIVTPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFV
        GYYR FV+N+  IV PLT+L +K +   W++    +F+ LK  + T PVL +PD    FV
Subjt:  GYYRWFVENFSHIVTPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGGTGAGTCAAATGAAAGCTGGCTGATTGACAGTGGATGTACCAACCATATGACTCATGACAAAGAGTTGTTTAAAGATCTGAAGCCAACAAATATCACCAAAGT
CAGAATTGGCAATGGAGACTACATCTCAGTAAAAGGTAAGGGGACTATTGCGATTGCAAGTCGTAAAGAGGAGCAATCTATTTTTGCTCTCAAAGAAGATGTAAACCAAC
TTTGGCACAAAAGAGTCGGTCACTATCATCATCAAGGGCTGCTGCAACTAACGGAGTTGACACTTGATTTTCCAAGCTTAATGAAGAAATCTCAAGCTGCAAAGCGTGTC
ATTTTGGAAAGCAAAATAGGAAGTCATTTTCCAAGTCATCTTGGAGAGCCACTCAAAAATTGCAGCTCATTCACACAGATGTTACAGTTTAAATCAGAGGTTGCTCATGT
TTTTTGGAAGTTCAAGGCAAGAGTTGAAAATGAAAGTGGTTGTAAAATTCAAATTGTAAGGTATGACAATGGGAAGGAGTATTCTTCGGCAGAATTTGACAAGTTTTGTG
AAGATTTAGGCATAAAACATCAACTTACAGCTCCTTACACTCCTCAACAAAATGGAGTTAGTGAGAGGAGAAATAGATACATCATGGAGATGACAATATGCATGCTGCAT
GAAAATAGTTTGCCAAAGAAGTTTTGGGCAGAGGCAGCAAATACAGTTGTATTTCTTCAAAATAGGCTTCCCACAATAGCAGTGAAGGAAAAGACACCATTTGAGGCATG
CATTGAAGAGGAAGATGACAGGCATAATGAAATAGTAGATGATGCTTCCGTGAGAGGAACAAGGTTACTTTCTGATATTTATGAAAGGTGCAATGTTTCTGTGTGTGAAC
CTGCAAACTATGCAGAAGTAAAGAAAGATCAAAGGTGGGTAGCTGCAATGGAGGAGGAGTTGTCAATGGTAGAGAAGAACAAAACTTGGATCCTTGTTGACAGACCTCAA
GATAGGAAGGTAATTGGAGTTAAATGGGTGTTTAGAACCAAGCTTAATGCTGATGGCTCGATTAACAAGCACAAAGCCAAGCTTGTGGTTAAAGGGTATGCTCAAATCTT
TGGTGTTGATTACTCTGATACTTTTGCTCATGTTGCTAGAATGGACTCATTCGACCGAGTGCAGTTACAGGAATTACTTGATAAGGGCTTCATTCGACCGAGTGTGTCAC
CTTGGGGTTATTATCGATGGTTTGTGGAGAACTTTTCCCATATAGTTACTCCTCTTACTCAGTTGACTAGGAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGAC
AGTTTCCAAAACCTTAAACAGAAGCTAGTTACTGCACCGGTTCTTACTGTACCTGATGGTTCAGGCAGTTTTGTGATTTACAGTTATGCTTTTAAGAAGGGTTTAGGTTG
TGTATTGATGCAGCAAGGTAAGGTAGTCGCTTATGCTTCTTGTCAGTTGAAGAGTCATGAGCAGAATTACCCTACACACGATTTAGAGTTGGCAGCAGTGGTTTTTGCAT
TGAAGATATGTAGGCATTACTTGTATGGTGAAAAGATACAGATCTTCACGGATCATAAAAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATATGAGACAGCGAAGA
TGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCACTTAT
TACCCAACAGGCCCTATTACATCGAGATCTTGAGAGGGCTGAGATTGCAGTGTCAGTAGGGGCAGTCACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTTGAGGC
AAAGGATCATTGATGCTCAGAGTAACGATCCTTATTTTGTTGAGAAGCATGGCCTAGCAGAGGCAGGGCAAGCTATTGAGTTCTCCATATCCTCTGATGGTGGACATTTG
TTTGAGAGGCGTCTCTGTGTGCCATCAGATAGTGCGGTTAAAACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCGGGTAGTACAAAGATGTATCA
GGACCTGAAGCGGGTTTATTGGTGGCGTAATATGAAGAGAGAGGTGGCAGAATTTGTTAGTAAATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGCAGAAACCAGCGG
ATTTATTACAACCCTTGAGCGTACCGGAATGGAAGTGGGAAAACGTGTCCATGGATTTCATTACAGGACTGCCAAGAACTCTGAGGGGTTTGACAGTGATTTGGGTTTTC
GTGGACAAGCTTACCAAATCAGCGCACTTCGTTCCGGGTAAATCCATCTATACCGCTAGTAACTGGGCACAACTATACATGTCTGAGATAGTGAGACTACATGGAGTGCC
AGTGTTGATTGTTTCTGATAGAGATGCCTATTTCACTTCCAAATTCTGGAAGGAATACTATTGGCATGGCACCATTGAGGCCTTGTACGGCAAATGTTGTAGATCCCCTG
TGTGCTGGGGTGAGGTGGGTGAGCAGAGATTGATGGGTCCTGAGTTAGTTCAGTCTTCTAACGAAGCGATACAGAAAATTAGGTCACGTATGCAGACCGCACAGAGTAGG
CAGAAGAGTTATGCAGATGTGAGACGGAAGGATCTTGAATTTGATGTGGGGGACAAGGTGTTCTTGAAGGTAGCACCTATGAAAGGTGTCTTACGATTTGAAAGGAGAGG
AAAGCTAAGTCTCCGTTTTGTTGGGACGTTTGAGATTCTGGAGCGAATTGGCTCTGTAGCGTATCGCTTGGCGTTGCCACCATCACTCTCGGCAGTTCATGATGTGTTCC
ATGTTTCTATGTTGAGGAAGTACGTGTCAGATCCATCCCATGTAGTGGATTACGAGCCACTAGAGATTGATGAAAACTTGAGCTATACAGAACAACTCGTTGAGGTGTTG
GCTAGGGTTGAAGAAGCTACATGGGAGATAGAAGATAACATGAGAGCTCGTTACCCAGAATTGTTCGAGGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGGTGAGTCAAATGAAAGCTGGCTGATTGACAGTGGATGTACCAACCATATGACTCATGACAAAGAGTTGTTTAAAGATCTGAAGCCAACAAATATCACCAAAGT
CAGAATTGGCAATGGAGACTACATCTCAGTAAAAGGTAAGGGGACTATTGCGATTGCAAGTCGTAAAGAGGAGCAATCTATTTTTGCTCTCAAAGAAGATGTAAACCAAC
TTTGGCACAAAAGAGTCGGTCACTATCATCATCAAGGGCTGCTGCAACTAACGGAGTTGACACTTGATTTTCCAAGCTTAATGAAGAAATCTCAAGCTGCAAAGCGTGTC
ATTTTGGAAAGCAAAATAGGAAGTCATTTTCCAAGTCATCTTGGAGAGCCACTCAAAAATTGCAGCTCATTCACACAGATGTTACAGTTTAAATCAGAGGTTGCTCATGT
TTTTTGGAAGTTCAAGGCAAGAGTTGAAAATGAAAGTGGTTGTAAAATTCAAATTGTAAGGTATGACAATGGGAAGGAGTATTCTTCGGCAGAATTTGACAAGTTTTGTG
AAGATTTAGGCATAAAACATCAACTTACAGCTCCTTACACTCCTCAACAAAATGGAGTTAGTGAGAGGAGAAATAGATACATCATGGAGATGACAATATGCATGCTGCAT
GAAAATAGTTTGCCAAAGAAGTTTTGGGCAGAGGCAGCAAATACAGTTGTATTTCTTCAAAATAGGCTTCCCACAATAGCAGTGAAGGAAAAGACACCATTTGAGGCATG
CATTGAAGAGGAAGATGACAGGCATAATGAAATAGTAGATGATGCTTCCGTGAGAGGAACAAGGTTACTTTCTGATATTTATGAAAGGTGCAATGTTTCTGTGTGTGAAC
CTGCAAACTATGCAGAAGTAAAGAAAGATCAAAGGTGGGTAGCTGCAATGGAGGAGGAGTTGTCAATGGTAGAGAAGAACAAAACTTGGATCCTTGTTGACAGACCTCAA
GATAGGAAGGTAATTGGAGTTAAATGGGTGTTTAGAACCAAGCTTAATGCTGATGGCTCGATTAACAAGCACAAAGCCAAGCTTGTGGTTAAAGGGTATGCTCAAATCTT
TGGTGTTGATTACTCTGATACTTTTGCTCATGTTGCTAGAATGGACTCATTCGACCGAGTGCAGTTACAGGAATTACTTGATAAGGGCTTCATTCGACCGAGTGTGTCAC
CTTGGGGTTATTATCGATGGTTTGTGGAGAACTTTTCCCATATAGTTACTCCTCTTACTCAGTTGACTAGGAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGAC
AGTTTCCAAAACCTTAAACAGAAGCTAGTTACTGCACCGGTTCTTACTGTACCTGATGGTTCAGGCAGTTTTGTGATTTACAGTTATGCTTTTAAGAAGGGTTTAGGTTG
TGTATTGATGCAGCAAGGTAAGGTAGTCGCTTATGCTTCTTGTCAGTTGAAGAGTCATGAGCAGAATTACCCTACACACGATTTAGAGTTGGCAGCAGTGGTTTTTGCAT
TGAAGATATGTAGGCATTACTTGTATGGTGAAAAGATACAGATCTTCACGGATCATAAAAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATATGAGACAGCGAAGA
TGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCACTTAT
TACCCAACAGGCCCTATTACATCGAGATCTTGAGAGGGCTGAGATTGCAGTGTCAGTAGGGGCAGTCACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTTGAGGC
AAAGGATCATTGATGCTCAGAGTAACGATCCTTATTTTGTTGAGAAGCATGGCCTAGCAGAGGCAGGGCAAGCTATTGAGTTCTCCATATCCTCTGATGGTGGACATTTG
TTTGAGAGGCGTCTCTGTGTGCCATCAGATAGTGCGGTTAAAACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCGGGTAGTACAAAGATGTATCA
GGACCTGAAGCGGGTTTATTGGTGGCGTAATATGAAGAGAGAGGTGGCAGAATTTGTTAGTAAATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGCAGAAACCAGCGG
ATTTATTACAACCCTTGAGCGTACCGGAATGGAAGTGGGAAAACGTGTCCATGGATTTCATTACAGGACTGCCAAGAACTCTGAGGGGTTTGACAGTGATTTGGGTTTTC
GTGGACAAGCTTACCAAATCAGCGCACTTCGTTCCGGGTAAATCCATCTATACCGCTAGTAACTGGGCACAACTATACATGTCTGAGATAGTGAGACTACATGGAGTGCC
AGTGTTGATTGTTTCTGATAGAGATGCCTATTTCACTTCCAAATTCTGGAAGGAATACTATTGGCATGGCACCATTGAGGCCTTGTACGGCAAATGTTGTAGATCCCCTG
TGTGCTGGGGTGAGGTGGGTGAGCAGAGATTGATGGGTCCTGAGTTAGTTCAGTCTTCTAACGAAGCGATACAGAAAATTAGGTCACGTATGCAGACCGCACAGAGTAGG
CAGAAGAGTTATGCAGATGTGAGACGGAAGGATCTTGAATTTGATGTGGGGGACAAGGTGTTCTTGAAGGTAGCACCTATGAAAGGTGTCTTACGATTTGAAAGGAGAGG
AAAGCTAAGTCTCCGTTTTGTTGGGACGTTTGAGATTCTGGAGCGAATTGGCTCTGTAGCGTATCGCTTGGCGTTGCCACCATCACTCTCGGCAGTTCATGATGTGTTCC
ATGTTTCTATGTTGAGGAAGTACGTGTCAGATCCATCCCATGTAGTGGATTACGAGCCACTAGAGATTGATGAAAACTTGAGCTATACAGAACAACTCGTTGAGGTGTTG
GCTAGGGTTGAAGAAGCTACATGGGAGATAGAAGATAACATGAGAGCTCGTTACCCAGAATTGTTCGAGGAATAA
Protein sequenceShow/hide protein sequence
MGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASRKEEQSIFALKEDVNQLWHKRVGHYHHQGLLQLTELTLDFPSLMKKSQAAKRV
ILESKIGSHFPSHLGEPLKNCSSFTQMLQFKSEVAHVFWKFKARVENESGCKIQIVRYDNGKEYSSAEFDKFCEDLGIKHQLTAPYTPQQNGVSERRNRYIMEMTICMLH
ENSLPKKFWAEAANTVVFLQNRLPTIAVKEKTPFEACIEEEDDRHNEIVDDASVRGTRLLSDIYERCNVSVCEPANYAEVKKDQRWVAAMEEELSMVEKNKTWILVDRPQ
DRKVIGVKWVFRTKLNADGSINKHKAKLVVKGYAQIFGVDYSDTFAHVARMDSFDRVQLQELLDKGFIRPSVSPWGYYRWFVENFSHIVTPLTQLTRKGAPFVWSKACED
SFQNLKQKLVTAPVLTVPDGSGSFVIYSYAFKKGLGCVLMQQGKVVAYASCQLKSHEQNYPTHDLELAAVVFALKICRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
WLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITQQALLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYFVEKHGLAEAGQAIEFSISSDGGHL
FERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPADLLQPLSVPEWKWENVSMDFITGLPRTLRGLTVIWVF
VDKLTKSAHFVPGKSIYTASNWAQLYMSEIVRLHGVPVLIVSDRDAYFTSKFWKEYYWHGTIEALYGKCCRSPVCWGEVGEQRLMGPELVQSSNEAIQKIRSRMQTAQSR
QKSYADVRRKDLEFDVGDKVFLKVAPMKGVLRFERRGKLSLRFVGTFEILERIGSVAYRLALPPSLSAVHDVFHVSMLRKYVSDPSHVVDYEPLEIDENLSYTEQLVEVL
ARVEEATWEIEDNMRARYPELFEE