CuGenDBv2

CmoCh17G013900 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh17G013900
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: Cmo_Chr17:10753315..10756482
RNA-Seq Expression: CmoCh17G013900
Synteny: CmoCh17G013900
Gene Ontology terms:
GO:0006412 - translation (biological process)
GO:0006468 - protein phosphorylation (biological process)
GO:0015074 - DNA integration (biological process)
GO:0005840 - ribosome (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0004672 - protein kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0008097 - 5S rRNA binding (molecular function)
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily
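
The genome location in this record has the form sequence:start..end. A minimal Python sketch of splitting that string and deriving the span length is given here. The function name `parse_location` is hypothetical, and the length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(loc: str) -> tuple:
    """Split 'seqid:start..end' into (seqid, start, end, length)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    # Assumption: 1-based inclusive coordinates, so both end points count.
    return seqid, start, end, end - start + 1


print(parse_location("Cmo_Chr17:10753315..10756482"))
```

Under the inclusive-coordinate assumption the gene spans 3168 bp on Cmo_Chr17.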


Homology
GenBank top hits (e value, % identity, alignment)
CAN78022.1 hypothetical protein VITISV_015518 [Vitis vinifera] (e value: 0.0e+00; % identity: 66.63)
Query:  MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIR-------------------------RVVATGKRDGGLYV
        MT DPS LDQSKNY  KDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVV HLTKNLLSI                          R VATGKRDGGLYV
Subjt:  MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIR-------------------------RVVATGKRDGGLYV

Query:  LERGNSAFISALRNKSLRASYDLWHARLGH----------------------------------------------------------------------
        LERGNSAFIS L+NKSLRASYDLWHARLGH                                                                      
Subjt:  LERGNSAFISALRNKSLRASYDLWHARLGH----------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------------------VDAF
                                                                                                        VDAF
Subjt:  ------------------------------------------------------------------------------------------------VDAF

Query:  STAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPS
        STA YIINRLPTPLLGGKS FELLYG +PHY+NFHPFGCRVYP LRDYMPNKL PRSIPCIFLGYSP+HKGFRCLDP T++LYIT HAQFDETHFP +PS
Subjt:  STAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPS

Query:  SQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPCDICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIF
        SQ QPLSS+ ISNFLEP  HHID SPP+  SP +  PRS+SSPC+ICSDLVDESVQVDTSLAGS+LPP  S+  SIE   D  SSLG+HPMITRAKAGIF
Subjt:  SQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPCDICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIF

Query:  KTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPG
        KTRHPANLG+LGSSGLL ALLASTEPKGFKSAAKNPAW+AAMDEE++ALQQN TW LVPRP NTNIVGSKWVFR KYLPDGSVER KARLVAKGYT VPG
Subjt:  KTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPG

Query:  LDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTLIERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRA
        LDYTDTFSPVVKATTVRVVLS+AVTNKWPL QLDVKNAFLNGTL E V+ME PPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRA
Subjt:  LDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTLIERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRA

Query:  DTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHL
        DTSLFVFHQQ ++IYLLLYVDDIIVTGNN SL+DSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHL
Subjt:  DTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHL

Query:  TADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYS
        T  GSPFS+ TLYRSLV ALQYL I RPDIA+AVNSVSQFLHA T +HFLAVK                                               
Subjt:  TADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYS

Query:  IYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRELVIAGKLRTQY
               +SWSAKKQPT SRS+CESEYRALA TAAELLW+THLLHDLKVPI +QPLLLCDNKSAIF SSNPVSHK+AKHVELDYHFLRELV+AGKLRTQY
Subjt:  IYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRELVIAGKLRTQY

Query:  VPSHLQVADIFTKSIS
        VPSHLQVADIFTKS S
Subjt:  VPSHLQVADIFTKSIS

RVW41798.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value: 0.0e+00; % identity: 81.77)
Query:  MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIRRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWH
        MT DPSILDQSKNY  KDSVIVGNGASLPITHTGTLSPVPNIHLLD               RVVATGKRDGGLYVLER NSAFI  L+NKSLRASYDLWH
Subjt:  MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIRRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWH

Query:  ARLGH-----------------------------------------------VDAFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPY
        ARL H                                               VDAFSTA YIIN LPTPLLGGKS FELLY Y+PHY+NFHPFGCRVYP 
Subjt:  ARLGH-----------------------------------------------VDAFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPY

Query:  LRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPC
        LRDYM NKL PRSIPCIFLGYSP+HKGFRCLDP T++LYIT HAQFDETHFP +PSSQ QPLSS+ ISNFLEP  HHID SPP+  SP +  PRS+SSPC
Subjt:  LRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPC

Query:  DICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDE
        +ICSDLVDESVQVDTSLAG +LPP  S+  SIE   D  SSLG+HPMITRAKAGIFKTRHPANLG+LGSSGLL ALLASTEPKGFKSAAKNPAW+AAMDE
Subjt:  DICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDE

Query:  EIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTL
        E++ALQQN TW LVPRP NTNIVGSKWVFR KYLPDGSVER KARLVAKGYT VPGLDYTD FSPVVKATTVRVVLS+AVTNKWPL QLDVKNAFLNGTL
Subjt:  EIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTL

Query:  IERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEF
         E V+ME PPGYID RFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQ ++IYLLLYVDDIIVTGNN SL+DSFTRKLHSEF
Subjt:  IERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEF

Query:  ATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHAL
        ATKDLGSL+YFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLT  GSPFS+ TLYRSLV ALQYL ITRPDIA+AVNSVSQFLHA 
Subjt:  ATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHAL

Query:  TANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLL
        T +HFLAVKRILRYVKGTLHFGLTFRPST+PS LVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRS+CESEYRALA TAAELLW+THLL
Subjt:  TANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLL

Query:  HDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSIS
        HDLKVPI +QPLLLCDNKSAIFFSSNPVSHK+AKHVELDYHFLRELV+AGKLRTQYVPSHLQVADIFTKS S
Subjt:  HDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSIS

RVW43615.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value: 0.0e+00; % identity: 81.38)
Query:  MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIRRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWH
        MT DPS LDQSKNY  KDSVIVGNGASLPITHTGTLSPVPNIHLLD               R+VATGKRDGGLYVLERGNSAFIS L+NKSLRASYDLWH
Subjt:  MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIRRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWH

Query:  ARLGH-----------------------------------------------VDAFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPY
        ARL H                                               VDAFSTA YIINRLPTPLLGGKS FELLYG++PHY+NFHPFGCRVYP 
Subjt:  ARLGH-----------------------------------------------VDAFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPY

Query:  LRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPC
        LRDYMPNKL PRSIPCIFLGYSP+HKGFRCLDP T++LYIT HAQFDETHFP +PSSQ QPLSS+ ISNFLEP  HHID SPP+  SP +  PRS+SSPC
Subjt:  LRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPC

Query:  DICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDE
        +ICSDLVDESV+VDTSLAGS+LPP  S+  SIE   D  SSLG+HPMITRAKAGIFKTRHPANLG+LGSSGLLSALLASTEPKGFKSAAKNPAW+AAMDE
Subjt:  DICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDE

Query:  EIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTL
        E++ALQQN TW LVPRP NTNIVGSKWVFR KYLPDGSVER KARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLS+A+TNKWPL QLDVKNAFLNGTL
Subjt:  EIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTL

Query:  IERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEF
         E V+ME PPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQ ++IYLLLYVDDIIVTGNN SL+DSFTRKLHSEF
Subjt:  IERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEF

Query:  ATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHAL
        ATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLT  GSPFS+ TLYRSLV ALQYL ITRPDIA+AVNSVSQFLHA 
Subjt:  ATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHAL

Query:  TANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLL
        T +HFLAVKRILRYVKGTLHFGLTFRPST+PS LVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTV RS+CESEYRALA TAAELLW+THLL
Subjt:  TANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLL

Query:  HDLKVPISKQP
        HDLKVPI +QP
Subjt:  HDLKVPISKQP

RVW45095.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 0.0e+00; % identity: 71.23)
Query:  MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIR-------------------------RVVATGKRDGGLYV
        MT DPSILDQSKNY  KD VIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSI                          RVVATGKRDGGLYV
Subjt:  MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIR-------------------------RVVATGKRDGGLYV

Query:  LERGNSAFISALRNKSLRASYDLWHARLGH----------------------------------------------------------------------
        LERGNSAFIS L+NKSLRASYDLWHARLGH                                                                      
Subjt:  LERGNSAFISALRNKSLRASYDLWHARLGH----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------VDAFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITC
                  VDAFST  YIINRLPTPLLGGKS FELLYGY+PHY+NFHPFGC VYP LRDYMPNKL PRSIPCIFLGYSP+HKGFRCLDP T++LYIT 
Subjt:  ----------VDAFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITC

Query:  HAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPCDICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVD-FSSL
        HAQFDETHFP +PSSQ QPLSS+ ISNFLEP  HHID SPP+  SP +  PRS+SSPC+ICSDLVDESVQVDTSLAGS+LPP  S+  SIE   D  SSL
Subjt:  HAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPCDICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVD-FSSL

Query:  GTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERF
        G+HPMITRAKAGIFKTRHPANLG+LGSSGLLSALLASTEPKGFKSAAKNPAW+AAMDEE++ALQQN TW LV RP NTNIVGSKWVFR KY PDGSVER 
Subjt:  GTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERF

Query:  KARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTLIERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRF
        KARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLS+AVTNKWPL QLDV NAFLNGTL E V+ME PPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRF
Subjt:  KARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTLIERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRF

Query:  SSFLLTLGFSCSRADTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLD
        SSF LTLGFSCSRADTSLFVFHQQ ++IYLLLYVDDIIVTGNN SL+DSFTRKLHS+FATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRA LLD
Subjt:  SSFLLTLGFSCSRADTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLD

Query:  SKPVHTPMVVSQHLTADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADW
        SKPVHTPMVVSQHLT  GSPFS+ TLYRSLV ALQYL ITRPDIA+AVNSVSQFLHALT +HFLAVKRILRYVKGTLHFGLTFRPST+PS LVAYSDADW
Subjt:  SKPVHTPMVVSQHLTADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADW

Query:  AGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHF
        AGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRS+CESEYRALA T AELLW+THLLHDLKVPI +QPLLLCDNKSAIF SSNPVSHK+AKHVELDYHF
Subjt:  AGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHF

Query:  LRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNPTLSLRG
        LRELV+AGKLRTQYVPSHLQVADIFTKS+SRPLFEFFRSKL++RSNPTLSLRG
Subjt:  LRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNPTLSLRG

RVW96109.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value: 0.0e+00; % identity: 76.23)
Query:  MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIRRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWH
        MT DPSILDQSKNY  KDSVIVGNGASLPITHT  L      H L     P          R V     + GL +L      F S L       S   W 
Subjt:  MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIRRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWH

Query:  ARLGHVDAFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFD
             VDAFSTA YIINRLPTPLLGGK+SFELLYGY+PHY+NFHPFGCRVYP LRDYMPNKL PRSIPCIFLGYSP+HKGFRCLDP T++LYIT HAQFD
Subjt:  ARLGHVDAFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFD

Query:  ETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSP--PTTSSPQTPRSSSSPCDICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVD-FSSLGTHPM
        ETHFP IPSSQ QPLSS+ ISNFLEP  HHID SP  PT+ S   PRS+SSPC+ICSDLVDESVQVDTSLAGS+ PP  S+  SIE   D  SSLG+HPM
Subjt:  ETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSP--PTTSSPQTPRSSSSPCDICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVD-FSSLGTHPM

Query:  ITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLV
        ITRAKAGIFKTRHPANLG+LGS GLLS LL STEPKGFKSAAKNP W+A MDEE++ALQQN                                       
Subjt:  ITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLV

Query:  AKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTLIERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLL
          GYTQVPGLDYTDTFS VVKATTVRVVLS+AVTNKWPL Q DVKNAFLNGTL E V+ME P GYID RFPTHVCLLKKALYGLKQAPRAWFQRFSSFLL
Subjt:  AKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTLIERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLL

Query:  TLGFSCSRADTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVH
        TLGFS SRA  SLFVFHQQ ++IYLLLYV DIIVTGNN SL+D+FTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVH
Subjt:  TLGFSCSRADTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVH

Query:  TPMVVSQHLTADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPD
        TPMVVSQHLT   SPFS+ T YRSLV ALQYL ITRPDIA+AVNSVSQFLHA T ++FLAVKRILRYVKGTLHFGLTFRPST+PS LVAYSDADWAGCPD
Subjt:  TPMVVSQHLTADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPD

Query:  TRRSTSGYSIYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRELV
        TRRSTS YSIYLGNNLVSWSAKKQPTVSRS+CESEYRALA TAAELLW+THLLHDLKVPI +Q LLLCDNKSAIF SSNPVSHK+AKHVELDYHFLRELV
Subjt:  TRRSTSGYSIYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRELV

Query:  IAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNPTLS-LRGGVKDS
        +AGKL TQYVPSHLQVADIFTKS+SRPLFEFFRSKL++RSNPTLS L GGVKD+
Subjt:  IAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNPTLS-LRGGVKDS

TrEMBL top hits (e value, % identity, alignment)
A0A438E275 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 0.0e+00; % identity: 81.77)
Query:  MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIRRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWH
        MT DPSILDQSKNY  KDSVIVGNGASLPITHTGTLSPVPNIHLLD               RVVATGKRDGGLYVLER NSAFI  L+NKSLRASYDLWH
Subjt:  MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIRRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWH

Query:  ARLGH-----------------------------------------------VDAFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPY
        ARL H                                               VDAFSTA YIIN LPTPLLGGKS FELLY Y+PHY+NFHPFGCRVYP 
Subjt:  ARLGH-----------------------------------------------VDAFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPY

Query:  LRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPC
        LRDYM NKL PRSIPCIFLGYSP+HKGFRCLDP T++LYIT HAQFDETHFP +PSSQ QPLSS+ ISNFLEP  HHID SPP+  SP +  PRS+SSPC
Subjt:  LRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPC

Query:  DICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDE
        +ICSDLVDESVQVDTSLAG +LPP  S+  SIE   D  SSLG+HPMITRAKAGIFKTRHPANLG+LGSSGLL ALLASTEPKGFKSAAKNPAW+AAMDE
Subjt:  DICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDE

Query:  EIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTL
        E++ALQQN TW LVPRP NTNIVGSKWVFR KYLPDGSVER KARLVAKGYT VPGLDYTD FSPVVKATTVRVVLS+AVTNKWPL QLDVKNAFLNGTL
Subjt:  EIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTL

Query:  IERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEF
         E V+ME PPGYID RFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQ ++IYLLLYVDDIIVTGNN SL+DSFTRKLHSEF
Subjt:  IERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEF

Query:  ATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHAL
        ATKDLGSL+YFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLT  GSPFS+ TLYRSLV ALQYL ITRPDIA+AVNSVSQFLHA 
Subjt:  ATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHAL

Query:  TANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLL
        T +HFLAVKRILRYVKGTLHFGLTFRPST+PS LVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRS+CESEYRALA TAAELLW+THLL
Subjt:  TANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLL

Query:  HDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSIS
        HDLKVPI +QPLLLCDNKSAIFFSSNPVSHK+AKHVELDYHFLRELV+AGKLRTQYVPSHLQVADIFTKS S
Subjt:  HDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSIS

A0A438E763 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 0.0e+00; % identity: 81.38)
Query:  MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIRRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWH
        MT DPS LDQSKNY  KDSVIVGNGASLPITHTGTLSPVPNIHLLD               R+VATGKRDGGLYVLERGNSAFIS L+NKSLRASYDLWH
Subjt:  MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIRRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWH

Query:  ARLGH-----------------------------------------------VDAFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPY
        ARL H                                               VDAFSTA YIINRLPTPLLGGKS FELLYG++PHY+NFHPFGCRVYP 
Subjt:  ARLGH-----------------------------------------------VDAFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPY

Query:  LRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPC
        LRDYMPNKL PRSIPCIFLGYSP+HKGFRCLDP T++LYIT HAQFDETHFP +PSSQ QPLSS+ ISNFLEP  HHID SPP+  SP +  PRS+SSPC
Subjt:  LRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPC

Query:  DICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDE
        +ICSDLVDESV+VDTSLAGS+LPP  S+  SIE   D  SSLG+HPMITRAKAGIFKTRHPANLG+LGSSGLLSALLASTEPKGFKSAAKNPAW+AAMDE
Subjt:  DICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDE

Query:  EIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTL
        E++ALQQN TW LVPRP NTNIVGSKWVFR KYLPDGSVER KARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLS+A+TNKWPL QLDVKNAFLNGTL
Subjt:  EIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTL

Query:  IERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEF
         E V+ME PPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQ ++IYLLLYVDDIIVTGNN SL+DSFTRKLHSEF
Subjt:  IERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEF

Query:  ATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHAL
        ATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLT  GSPFS+ TLYRSLV ALQYL ITRPDIA+AVNSVSQFLHA 
Subjt:  ATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHAL

Query:  TANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLL
        T +HFLAVKRILRYVKGTLHFGLTFRPST+PS LVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTV RS+CESEYRALA TAAELLW+THLL
Subjt:  TANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLL

Query:  HDLKVPISKQP
        HDLKVPI +QP
Subjt:  HDLKVPISKQP

A0A438EBA0 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 0.0e+00; % identity: 71.23)
Query:  MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIR-------------------------RVVATGKRDGGLYV
        MT DPSILDQSKNY  KD VIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSI                          RVVATGKRDGGLYV
Subjt:  MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIR-------------------------RVVATGKRDGGLYV

Query:  LERGNSAFISALRNKSLRASYDLWHARLGH----------------------------------------------------------------------
        LERGNSAFIS L+NKSLRASYDLWHARLGH                                                                      
Subjt:  LERGNSAFISALRNKSLRASYDLWHARLGH----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------VDAFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITC
                  VDAFST  YIINRLPTPLLGGKS FELLYGY+PHY+NFHPFGC VYP LRDYMPNKL PRSIPCIFLGYSP+HKGFRCLDP T++LYIT 
Subjt:  ----------VDAFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITC

Query:  HAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPCDICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVD-FSSL
        HAQFDETHFP +PSSQ QPLSS+ ISNFLEP  HHID SPP+  SP +  PRS+SSPC+ICSDLVDESVQVDTSLAGS+LPP  S+  SIE   D  SSL
Subjt:  HAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPCDICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVD-FSSL

Query:  GTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERF
        G+HPMITRAKAGIFKTRHPANLG+LGSSGLLSALLASTEPKGFKSAAKNPAW+AAMDEE++ALQQN TW LV RP NTNIVGSKWVFR KY PDGSVER 
Subjt:  GTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERF

Query:  KARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTLIERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRF
        KARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLS+AVTNKWPL QLDV NAFLNGTL E V+ME PPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRF
Subjt:  KARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTLIERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRF

Query:  SSFLLTLGFSCSRADTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLD
        SSF LTLGFSCSRADTSLFVFHQQ ++IYLLLYVDDIIVTGNN SL+DSFTRKLHS+FATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRA LLD
Subjt:  SSFLLTLGFSCSRADTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLD

Query:  SKPVHTPMVVSQHLTADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADW
        SKPVHTPMVVSQHLT  GSPFS+ TLYRSLV ALQYL ITRPDIA+AVNSVSQFLHALT +HFLAVKRILRYVKGTLHFGLTFRPST+PS LVAYSDADW
Subjt:  SKPVHTPMVVSQHLTADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADW

Query:  AGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHF
        AGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRS+CESEYRALA T AELLW+THLLHDLKVPI +QPLLLCDNKSAIF SSNPVSHK+AKHVELDYHF
Subjt:  AGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHF

Query:  LRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNPTLSLRG
        LRELV+AGKLRTQYVPSHLQVADIFTKS+SRPLFEFFRSKL++RSNPTLSLRG
Subjt:  LRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNPTLSLRG

A0A438IHQ8 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 0.0e+00; % identity: 76.23)
Query:  MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIRRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWH
        MT DPSILDQSKNY  KDSVIVGNGASLPITHT  L      H L     P          R V     + GL +L      F S L       S   W 
Subjt:  MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIRRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWH

Query:  ARLGHVDAFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFD
             VDAFSTA YIINRLPTPLLGGK+SFELLYGY+PHY+NFHPFGCRVYP LRDYMPNKL PRSIPCIFLGYSP+HKGFRCLDP T++LYIT HAQFD
Subjt:  ARLGHVDAFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFD

Query:  ETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSP--PTTSSPQTPRSSSSPCDICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVD-FSSLGTHPM
        ETHFP IPSSQ QPLSS+ ISNFLEP  HHID SP  PT+ S   PRS+SSPC+ICSDLVDESVQVDTSLAGS+ PP  S+  SIE   D  SSLG+HPM
Subjt:  ETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSP--PTTSSPQTPRSSSSPCDICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVD-FSSLGTHPM

Query:  ITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLV
        ITRAKAGIFKTRHPANLG+LGS GLLS LL STEPKGFKSAAKNP W+A MDEE++ALQQN                                       
Subjt:  ITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLV

Query:  AKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTLIERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLL
          GYTQVPGLDYTDTFS VVKATTVRVVLS+AVTNKWPL Q DVKNAFLNGTL E V+ME P GYID RFPTHVCLLKKALYGLKQAPRAWFQRFSSFLL
Subjt:  AKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTLIERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLL

Query:  TLGFSCSRADTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVH
        TLGFS SRA  SLFVFHQQ ++IYLLLYV DIIVTGNN SL+D+FTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVH
Subjt:  TLGFSCSRADTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVH

Query:  TPMVVSQHLTADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPD
        TPMVVSQHLT   SPFS+ T YRSLV ALQYL ITRPDIA+AVNSVSQFLHA T ++FLAVKRILRYVKGTLHFGLTFRPST+PS LVAYSDADWAGCPD
Subjt:  TPMVVSQHLTADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPD

Query:  TRRSTSGYSIYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRELV
        TRRSTS YSIYLGNNLVSWSAKKQPTVSRS+CESEYRALA TAAELLW+THLLHDLKVPI +Q LLLCDNKSAIF SSNPVSHK+AKHVELDYHFLRELV
Subjt:  TRRSTSGYSIYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRELV

Query:  IAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNPTLS-LRGGVKDS
        +AGKL TQYVPSHLQVADIFTKS+SRPLFEFFRSKL++RSNPTLS L GGVKD+
Subjt:  IAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNPTLS-LRGGVKDS

A5AKX4 Integrase catalytic domain-containing protein (e value: 0.0e+00; % identity: 66.63)
Query:  MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIR-------------------------RVVATGKRDGGLYV
        MT DPS LDQSKNY  KDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVV HLTKNLLSI                          R VATGKRDGGLYV
Subjt:  MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIR-------------------------RVVATGKRDGGLYV

Query:  LERGNSAFISALRNKSLRASYDLWHARLGH----------------------------------------------------------------------
        LERGNSAFIS L+NKSLRASYDLWHARLGH                                                                      
Subjt:  LERGNSAFISALRNKSLRASYDLWHARLGH----------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------------------VDAF
                                                                                                        VDAF
Subjt:  ------------------------------------------------------------------------------------------------VDAF

Query:  STAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPS
        STA YIINRLPTPLLGGKS FELLYG +PHY+NFHPFGCRVYP LRDYMPNKL PRSIPCIFLGYSP+HKGFRCLDP T++LYIT HAQFDETHFP +PS
Subjt:  STAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPS

Query:  SQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPCDICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIF
        SQ QPLSS+ ISNFLEP  HHID SPP+  SP +  PRS+SSPC+ICSDLVDESVQVDTSLAGS+LPP  S+  SIE   D  SSLG+HPMITRAKAGIF
Subjt:  SQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPCDICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIF

Query:  KTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPG
        KTRHPANLG+LGSSGLL ALLASTEPKGFKSAAKNPAW+AAMDEE++ALQQN TW LVPRP NTNIVGSKWVFR KYLPDGSVER KARLVAKGYT VPG
Subjt:  KTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPG

Query:  LDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTLIERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRA
        LDYTDTFSPVVKATTVRVVLS+AVTNKWPL QLDVKNAFLNGTL E V+ME PPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRA
Subjt:  LDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTLIERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRA

Query:  DTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHL
        DTSLFVFHQQ ++IYLLLYVDDIIVTGNN SL+DSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHL
Subjt:  DTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHL

Query:  TADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYS
        T  GSPFS+ TLYRSLV ALQYL I RPDIA+AVNSVSQFLHA T +HFLAVK                                               
Subjt:  TADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYS

Query:  IYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRELVIAGKLRTQY
               +SWSAKKQPT SRS+CESEYRALA TAAELLW+THLLHDLKVPI +QPLLLCDNKSAIF SSNPVSHK+AKHVELDYHFLRELV+AGKLRTQY
Subjt:  IYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRELVIAGKLRTQY

Query:  VPSHLQVADIFTKSIS
        VPSHLQVADIFTKS S
Subjt:  VPSHLQVADIFTKSIS

SwissProt top hits (columns: e value, % identity, alignment)
P04146 Copia protein (e value: 1.0e-90; % identity: 37.02)
Query:  AWVAAMDEEIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVK
        +W  A++ E+ A + N+TWT+  RP N NIV S+WVF +KY   G+  R+KARLVA+G+TQ   +DY +TF+PV + ++ R +LS+ +     + Q+DVK
Subjt:  AWVAAMDEEIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVK

Query:  NAFLNGTLIERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQFNI---IYLLLYVDDIIVTGNNSSLI
         AFLNGTL E ++M LP G        +VC L KA+YGLKQA R WF+ F   L    F  S  D  +++   + NI   IY+LLYVDD+++   + + +
Subjt:  NAFLNGTLIERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQFNI---IYLLLYVDDIIVTGNNSSLI

Query:  DSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDLTLYRSLVSALQYLII-TRPDIAY
        ++F R L  +F   DL  + +F+G+      D +++SQ  Y + IL++  + +   V TP+    +     S     T  RSL+  L Y+++ TRPD+  
Subjt:  DSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDLTLYRSLVSALQYLII-TRPDIAY

Query:  AVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPS-TVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGN-NLVSWSAKKQPTVSRSNCESEYRAL
        AVN +S++     +  +  +KR+LRY+KGT+   L F+ +    + ++ Y D+DWAG    R+ST+GY   + + NL+ W+ K+Q +V+ S+ E+EY AL
Subjt:  AVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPS-TVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGN-NLVSWSAKKQPTVSRSNCESEYRAL

Query:  ATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKL
             E LW+  LL  + + +     +  DN+  I  ++NP  HK+AKH+++ YHF RE V    +  +Y+P+  Q+ADIFTK +    F   R KL
Subjt:  ATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 7.9e-99; % identity: 32.39)
Query:  VLERGNSAFISALRN--KSLRASYDLWHARLGHVDAFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLCPRSIPCIFL
        V ER N   +  +R+  +  +     W       +A  TA Y+INR P+  L  +    +       Y +   FGCR + ++      KL  +SIPCIF+
Subjt:  VLERGNSAFISALRN--KSLRASYDLWHARLGHVDAFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLCPRSIPCIFL

Query:  GYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDS-SPPTTSSPQTPRSSSSPCDICSDLVDESVQVDTSLAGS
        GY     G+R  DP   K+  +    F E+        +T    S  + N + P+F  I S S   TS+  T    S   +   +++++  Q+D  +   
Subjt:  GYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDS-SPPTTSSPQTPRSSSSPCDICSDLVDESVQVDTSLAGS

Query:  TLPPSTSNSTSIEPPVDFSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNP---AWVAAMDEEIRALQQNDTWTLVPRPT
                   +E P         P+    +  +   R+P+   +L S           EP+  K    +P     + AM EE+ +LQ+N T+ LV  P 
Subjt:  TLPPSTSNSTSIEPPVDFSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNP---AWVAAMDEEIRALQQNDTWTLVPRPT

Query:  NTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTLIERVHMELPPGYIDPRFP
            +  KWVF++K   D  + R+KARLV KG+ Q  G+D+ + FSPVVK T++R +LS+A +    + QLDVK AFL+G L E ++ME P G+      
Subjt:  NTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTLIERVHMELPPGYIDPRFP

Query:  THVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSL-FVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLE--
          VC L K+LYGLKQAPR W+ +F SF+ +  +  + +D  + F    + N I LLLYVDD+++ G +  LI      L   F  KDLG     LG++  
Subjt:  THVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSL-FVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLE--

Query:  ASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFS-------DLTLYRSLVSALQY-LIITRPDIAYAVNSVSQFLHALTANHFLA
           T   L++SQ KY   +L R  + ++KPV TP+     L+    P +           Y S V +L Y ++ TRPDIA+AV  VS+FL      H+ A
Subjt:  ASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFS-------DLTLYRSLVSALQY-LIITRPDIAYAVNSVSQFLHALTANHFLA

Query:  VKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPI
        VK ILRY++GT    L F  S     L  Y+DAD AG  D R+S++GY        +SW +K Q  V+ S  E+EY A   T  E++W+   L +L +  
Subjt:  VKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPI

Query:  SKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSN
         K+ ++ CD++SAI  S N + H + KH+++ YH++RE+V    L+   + ++   AD+ TK + R  FE  +  + + SN
Subjt:  SKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSN

P92519 Uncharacterized mitochondrial protein AtMg00810 (e value: 4.7e-59; % identity: 51.33)
Query:  IYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDLTLY
        +YLLLYVDDI++TG++++L++    +L S F+ KDLG + YFLG++    P GLF+SQ KYA  IL  A +LD KP+ TP+ +  + +   + + D + +
Subjt:  IYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDLTLY

Query:  RSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAK
        RS+V ALQYL +TRPDI+YAVN V Q +H  T   F  +KR+LRYVKGT+  GL    ++    + A+ D+DWAGC  TRRST+G+  +LG N++SWSAK
Subjt:  RSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAK

Query:  KQPTVSRSNCESEYRALATTAAELLW
        +QPTVSRS+ E+EYRALA TAAEL W
Subjt:  KQPTVSRSNCESEYRALATTAAELLW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 2.2e-170; % identity: 41.9)
Query:  AFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFDETHFP--
        AF+ A Y+INRLPTPLL  +S F+ L+G +P+YD    FGC  YP+LR Y  +KL  +S  C+FLGYS     + CL   T++LYI+ H +FDE  FP  
Subjt:  AFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFDETHFP--

Query:  ---------------------------------------------AIPSSQTQPL--SSIPISNFLEPHFHHIDSSP-----------PTTSSPQTP---
                                                       PSS + P   S +  SN          SSP           PTT   QT    
Subjt:  ---------------------------------------------AIPSSQTQPL--SSIPISNFLEPHFHHIDSSP-----------PTTSSPQTP---

Query:  -----RSSSSPCDICSDLVDESVQVDTSLAGSTLPPSTSNSTS----------IEPPVDFSS---------LGTHPMITRAKAGIFKTRHPANLGILGSS
              S ++P +     + +S+      + S+  P+TS S+S          I PP   +          L TH M TRAKAGI K     +L +    
Subjt:  -----RSSSSPCDICSDLVDESVQVDTSLAGSTLPPSTSNSTS----------IEPPVDFSS---------LGTHPMITRAKAGIFKTRHPANLGILGSS

Query:  GLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLV-PRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKA
            +L A +EP+    A K+  W  AM  EI A   N TW LV P P++  IVG +W+F  KY  DGS+ R+KARLVAKGY Q PGLDY +TFSPV+K+
Subjt:  GLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLV-PRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKA

Query:  TTVRVVLSIAVTNKWPLGQLDVKNAFLNGTLIERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQFNI
        T++R+VL +AV   WP+ QLDV NAFL GTL + V+M  PPG+ID   P +VC L+KALYGLKQAPRAW+    ++LLT+GF  S +DTSLFV  +  +I
Subjt:  TTVRVVLSIAVTNKWPLGQLDVKNAFLNGTLIERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQFNI

Query:  IYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTA-DGSPFSDLTL
        +Y+L+YVDDI++TGN+ +L+ +    L   F+ KD   L YFLG+EA   P GL +SQ +Y  D+L R  ++ +KPV TPM  S  L+   G+  +D T 
Subjt:  IYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTA-DGSPFSDLTL

Query:  YRSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSA
        YR +V +LQYL  TRPDI+YAVN +SQF+H  T  H  A+KRILRY+ GT + G+  +     S L AYSDADWAG  D   ST+GY +YLG++ +SWS+
Subjt:  YRSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSA

Query:  KKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFT
        KKQ  V RS+ E+EYR++A T++E+ W+  LL +L + +++ P++ CDN  A +  +NPV H + KH+ +DYHF+R  V +G LR  +V +H Q+AD  T
Subjt:  KKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFT

Query:  KSISRPLFEFFRSKLYVRSNP
        K +SR  F+ F SK+ V   P
Subjt:  KSISRPLFEFFRSKLYVRSNP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 3.1e-164; % identity: 40.97)
Query:  AFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFDETHFP--
        AFS A Y+INRLPTPLL  +S F+ L+G  P+Y+    FGC  YP+LR Y  +KL  +S  C F+GYS     + CL   T +LY + H QFDE  FP  
Subjt:  AFSTAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFDETHFP--

Query:  ----AIPSSQTQPLSSIP--ISNFLEPHFHHIDSSPPTTSS--PQTPRSSSSPCDICSDLVDES------------------------------------
             + +SQ Q   S P   S+   P    +  +PP        +PR  SSP  +C+  V  S                                    
Subjt:  ----AIPSSQTQPLSSIP--ISNFLEPHFHHIDSSPPTTSS--PQTPRSSSSPCDICSDLVDES------------------------------------

Query:  -----------------------------------VQVDTSLAGSTLPPSTSNSTSIEPPV----------DFSSLGTHPMITRAKAGIFKTRHPANLGI
                                               TS++    P S+S ST   PPV            + + TH M TRAK GI K     +   
Subjt:  -----------------------------------VQVDTSLAGSTLPPSTSNSTSIEPPV----------DFSSLGTHPMITRAKAGIFKTRHPANLGI

Query:  LGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLV-PRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSP
               ++L A++EP+    A K+  W  AM  EI A   N TW LV P P +  IVG +W+F  K+  DGS+ R+KARLVAKGY Q PGLDY +TFSP
Subjt:  LGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLV-PRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSP

Query:  VVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTLIERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQ
        V+K+T++R+VL +AV   WP+ QLDV NAFL GTL + V+M  PPG++D   P +VC L+KA+YGLKQAPRAW+    ++LLT+GF  S +DTSLFV  +
Subjt:  VVKATTVRVVLSIAVTNKWPLGQLDVKNAFLNGTLIERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQ

Query:  QFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTA-DGSPFS
          +IIY+L+YVDDI++TGN++ L+      L   F+ K+   L YFLG+EA   P GL +SQ +Y  D+L R  +L +KPV TPM  S  LT   G+   
Subjt:  QFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTA-DGSPFS

Query:  DLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLV
        D T YR +V +LQYL  TRPD++YAVN +SQ++H  T +H+ A+KR+LRY+ GT   G+  +     S L AYSDADWAG  D   ST+GY +YLG++ +
Subjt:  DLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLV

Query:  SWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRELVIAGKLRTQYVPSHLQVA
        SWS+KKQ  V RS+ E+EYR++A T++EL W+  LL +L + +S  P++ CDN  A +  +NPV H + KH+ LDYHF+R  V +G LR  +V +H Q+A
Subjt:  SWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRELVIAGKLRTQYVPSHLQVA

Query:  DIFTKSISRPLFEFFRSKLYVRSNP
        D  TK +SR  F+ F  K+ V   P
Subjt:  DIFTKSISRPLFEFFRSKLYVRSNP

Arabidopsis top hits (columns: e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 8.0e-115; % identity: 45.4)
Query:  LSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTV
        L  +  + EP  +  A +   W  AMD+EI A++   TW +   P N   +G KWV++IKY  DG++ER+KARLVAKGYTQ  G+D+ +TFSPV K T+V
Subjt:  LSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTV

Query:  RVVLSIAVTNKWPLGQLDVKNAFLNGTLIERVHMELPPGYI----DPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQFN
        +++L+I+    + L QLD+ NAFLNG L E ++M+LPPGY     D   P  VC LKK++YGLKQA R WF +FS  L+  GF  S +D + F+      
Subjt:  RVVLSIAVTNKWPLGQLDVKNAFLNGTLIERVHMELPPGYI----DPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQFN

Query:  IIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTA-DGSPFSDLT
         + +L+YVDDII+  NN + +D    +L S F  +DLG L YFLGLE + +  G+ I Q KYA D+L    LL  KP   PM  S   +A  G  F D  
Subjt:  IIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTA-DGSPFSDLT

Query:  LYRSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWS
         YR L+  L YL ITR DI++AVN +SQF  A    H  AV +IL Y+KGT+  GL F  S     L  +SDA +  C DTRRST+GY ++LG +L+SW 
Subjt:  LYRSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWS

Query:  AKKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRE
        +KKQ  VS+S+ E+EYRAL+    E++W+     +L++P+SK  LL CDN +AI  ++N V H++ KH+E D H +RE
Subjt:  AKKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFFSSNPVSHKQAKHVELDYHFLRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein (e value: 3.6e-14; % identity: 51.28)
Query:  YLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGY
        YL ITRPD+ +AVN +SQF  A       AV ++L YVKGT+  GL F  +T    L A++D+DWA CPDTRRS +G+
Subjt:  YLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGY

ATMG00810.1 DNA/RNA polymerases superfamily protein (e value: 3.3e-60; % identity: 51.33)
Query:  IYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDLTLY
        +YLLLYVDDI++TG++++L++    +L S F+ KDLG + YFLG++    P GLF+SQ KYA  IL  A +LD KP+ TP+ +  + +   + + D + +
Subjt:  IYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDLTLY

Query:  RSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAK
        RS+V ALQYL +TRPDI+YAVN V Q +H  T   F  +KR+LRYVKGT+  GL    ++    + A+ D+DWAGC  TRRST+G+  +LG N++SWSAK
Subjt:  RSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKRILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAK

Query:  KQPTVSRSNCESEYRALATTAAELLW
        +QPTVSRS+ E+EYRALA TAAEL W
Subjt:  KQPTVSRSNCESEYRALATTAAELLW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 1.1e-26; % identity: 45.86)
Query:  MITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARL
        M+TR+KAGI K     +L I              EPK    A K+P W  AM EE+ AL +N TW LVP P N NI+G KWVF+ K   DG+++R KARL
Subjt:  MITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARL

Query:  VAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIA
        VAKG+ Q  G+ + +T+SPVV+  T+R +L++A
Subjt:  VAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIA


Sequences
CDS sequence
ATGACTGCTGACCCATCTATTTTGGATCAGTCTAAAAATTACACGAGTAAGGACTCTGTGATTGTAGGAAACGGTGCATCCCTACCCATTACCCACACCGGTACTCTTTC
TCCTGTTCCAAATATTCACTTGTTAGATGTCTTGGTTGTCCCTCACCTCACTAAAAATCTTCTTTCCATAAGAAGGGTGGTGGCAACCGGTAAAAGAGATGGAGGGCTAT
ATGTGCTGGAGCGCGGCAACTCTGCTTTTATTTCAGCCCTTAGAAACAAATCTTTACGTGCTTCATATGATTTATGGCATGCTCGTCTAGGTCATGTTGACGCCTTCAGC
ACTGCAGCTTATATTATCAACCGGTTGCCTACTCCACTTCTTGGAGGTAAGTCATCCTTTGAACTCCTTTATGGCTACACTCCACATTATGACAATTTTCATCCCTTTGG
TTGTCGTGTTTATCCTTATTTGCGTGATTATATGCCTAACAAGCTTTGTCCCCGCAGCATTCCTTGTATTTTTTTGGGTTATAGTCCTGCTCATAAAGGGTTTCGCTGTC
TTGATCCGGCCACCACTAAGCTATATATCACCTGTCATGCTCAATTTGATGAAACCCATTTTCCTGCTATCCCTAGCTCCCAGACCCAACCTCTTTCCTCTATTCCTATT
TCAAATTTCTTAGAACCACATTTTCATCATATTGATTCATCCCCCCCTACCACTTCATCACCGCAAACTCCTCGATCCAGTTCATCCCCGTGTGATATTTGTTCTGACCT
TGTAGATGAGTCTGTGCAGGTTGATACTTCTCTTGCAGGTTCCACTTTGCCACCCTCGACTTCTAATTCGACCTCTATTGAACCTCCTGTTGATTTCTCTTCTTTGGGCA
CTCATCCTATGATCACACGAGCCAAAGCTGGTATATTCAAGACTCGTCATCCAGCAAATCTTGGTATTTTGGGCTCATCTGGACTTCTTTCTGCTCTTCTTGCATCCACT
GAGCCAAAAGGATTCAAATCTGCGGCTAAGAATCCTGCTTGGGTGGCTGCCATGGATGAAGAAATTCGAGCATTACAACAAAATGATACTTGGACTTTGGTTCCTCGCCC
TACCAACACCAACATCGTGGGCTCTAAATGGGTGTTTCGTATTAAATATTTGCCTGATGGATCTGTCGAGCGTTTCAAGGCTCGTCTTGTTGCCAAAGGTTATACTCAGG
TTCCTGGTCTTGACTACACTGACACTTTCAGTCCGGTTGTCAAAGCTACCACTGTCCGTGTTGTGCTTTCTATTGCAGTCACAAATAAATGGCCTCTTGGACAACTTGAT
GTCAAGAATGCTTTTCTCAATGGAACTCTTATTGAACGTGTTCATATGGAACTACCTCCTGGGTATATTGATCCTCGATTTCCCACTCATGTTTGTCTATTAAAGAAAGC
CCTCTATGGTTTAAAGCAAGCTCCTCGTGCTTGGTTTCAGCGTTTTAGCTCATTTCTTCTCACACTTGGGTTTTCTTGCAGTCGCGCTGACACGTCCCTTTTTGTCTTTC
ATCAGCAATTTAACATTATCTATTTGCTTCTTTATGTTGATGACATTATTGTTACCGGCAACAACTCATCTCTTATTGACAGCTTTACTCGCAAGCTTCATTCTGAGTTT
GCTACCAAAGATTTGGGTTCTCTCAGTTACTTTCTTGGTCTTGAAGCTTCACCCACTCCTGATGGTCTCTTTATTAGTCAGTTAAAATATGCTCGAGATATTCTTACTCG
TGCTCAGTTGCTCGATAGCAAACCAGTTCACACTCCCATGGTTGTTTCTCAACACCTGACTGCTGATGGTTCTCCTTTCTCTGATCTTACTCTCTACAGATCTCTTGTTA
GCGCCCTTCAGTACTTGATTATTACGCGTCCAGATATTGCCTATGCTGTCAATTCTGTCAGTCAATTCTTGCATGCCCTTACTGCAAATCACTTTCTTGCTGTCAAACGT
ATTCTTCGCTATGTCAAAGGAACACTCCACTTTGGTCTTACCTTTCGTCCATCCACTGTTCCTAGTACGCTAGTCGCTTATTCGGATGCTGACTGGGCTGGTTGTCCCGA
TACTCGTCGTTCTACATCCGGCTATTCTATTTATCTTGGTAACAATCTGGTTTCTTGGAGTGCCAAAAAGCAACCTACTGTCTCACGCTCCAACTGTGAATCTGAGTATC
GTGCTCTTGCCACAACCGCTGCTGAACTTCTTTGGGTTACGCATCTTTTGCATGACCTCAAGGTCCCTATTTCAAAGCAGCCCTTACTCTTATGTGACAACAAAAGTGCT
ATTTTTTTTAGCTCTAATCCCGTTTCTCACAAGCAGGCCAAACATGTTGAACTAGATTATCATTTCCTTCGAGAACTTGTTATCGCTGGCAAACTTCGCACACAATATGT
ACCCTCTCATCTCCAAGTTGCTGACATCTTCACAAAGAGTATTTCTCGACCTCTCTTTGAATTTTTCAGATCCAAGCTTTACGTTCGTTCAAATCCGACGCTCAGCTTGC
GGGGGGGTGTTAAGGATAGTTGA
mRNA sequence
ATGACTGCTGACCCATCTATTTTGGATCAGTCTAAAAATTACACGAGTAAGGACTCTGTGATTGTAGGAAACGGTGCATCCCTACCCATTACCCACACCGGTACTCTTTC
TCCTGTTCCAAATATTCACTTGTTAGATGTCTTGGTTGTCCCTCACCTCACTAAAAATCTTCTTTCCATAAGAAGGGTGGTGGCAACCGGTAAAAGAGATGGAGGGCTAT
ATGTGCTGGAGCGCGGCAACTCTGCTTTTATTTCAGCCCTTAGAAACAAATCTTTACGTGCTTCATATGATTTATGGCATGCTCGTCTAGGTCATGTTGACGCCTTCAGC
ACTGCAGCTTATATTATCAACCGGTTGCCTACTCCACTTCTTGGAGGTAAGTCATCCTTTGAACTCCTTTATGGCTACACTCCACATTATGACAATTTTCATCCCTTTGG
TTGTCGTGTTTATCCTTATTTGCGTGATTATATGCCTAACAAGCTTTGTCCCCGCAGCATTCCTTGTATTTTTTTGGGTTATAGTCCTGCTCATAAAGGGTTTCGCTGTC
TTGATCCGGCCACCACTAAGCTATATATCACCTGTCATGCTCAATTTGATGAAACCCATTTTCCTGCTATCCCTAGCTCCCAGACCCAACCTCTTTCCTCTATTCCTATT
TCAAATTTCTTAGAACCACATTTTCATCATATTGATTCATCCCCCCCTACCACTTCATCACCGCAAACTCCTCGATCCAGTTCATCCCCGTGTGATATTTGTTCTGACCT
TGTAGATGAGTCTGTGCAGGTTGATACTTCTCTTGCAGGTTCCACTTTGCCACCCTCGACTTCTAATTCGACCTCTATTGAACCTCCTGTTGATTTCTCTTCTTTGGGCA
CTCATCCTATGATCACACGAGCCAAAGCTGGTATATTCAAGACTCGTCATCCAGCAAATCTTGGTATTTTGGGCTCATCTGGACTTCTTTCTGCTCTTCTTGCATCCACT
GAGCCAAAAGGATTCAAATCTGCGGCTAAGAATCCTGCTTGGGTGGCTGCCATGGATGAAGAAATTCGAGCATTACAACAAAATGATACTTGGACTTTGGTTCCTCGCCC
TACCAACACCAACATCGTGGGCTCTAAATGGGTGTTTCGTATTAAATATTTGCCTGATGGATCTGTCGAGCGTTTCAAGGCTCGTCTTGTTGCCAAAGGTTATACTCAGG
TTCCTGGTCTTGACTACACTGACACTTTCAGTCCGGTTGTCAAAGCTACCACTGTCCGTGTTGTGCTTTCTATTGCAGTCACAAATAAATGGCCTCTTGGACAACTTGAT
GTCAAGAATGCTTTTCTCAATGGAACTCTTATTGAACGTGTTCATATGGAACTACCTCCTGGGTATATTGATCCTCGATTTCCCACTCATGTTTGTCTATTAAAGAAAGC
CCTCTATGGTTTAAAGCAAGCTCCTCGTGCTTGGTTTCAGCGTTTTAGCTCATTTCTTCTCACACTTGGGTTTTCTTGCAGTCGCGCTGACACGTCCCTTTTTGTCTTTC
ATCAGCAATTTAACATTATCTATTTGCTTCTTTATGTTGATGACATTATTGTTACCGGCAACAACTCATCTCTTATTGACAGCTTTACTCGCAAGCTTCATTCTGAGTTT
GCTACCAAAGATTTGGGTTCTCTCAGTTACTTTCTTGGTCTTGAAGCTTCACCCACTCCTGATGGTCTCTTTATTAGTCAGTTAAAATATGCTCGAGATATTCTTACTCG
TGCTCAGTTGCTCGATAGCAAACCAGTTCACACTCCCATGGTTGTTTCTCAACACCTGACTGCTGATGGTTCTCCTTTCTCTGATCTTACTCTCTACAGATCTCTTGTTA
GCGCCCTTCAGTACTTGATTATTACGCGTCCAGATATTGCCTATGCTGTCAATTCTGTCAGTCAATTCTTGCATGCCCTTACTGCAAATCACTTTCTTGCTGTCAAACGT
ATTCTTCGCTATGTCAAAGGAACACTCCACTTTGGTCTTACCTTTCGTCCATCCACTGTTCCTAGTACGCTAGTCGCTTATTCGGATGCTGACTGGGCTGGTTGTCCCGA
TACTCGTCGTTCTACATCCGGCTATTCTATTTATCTTGGTAACAATCTGGTTTCTTGGAGTGCCAAAAAGCAACCTACTGTCTCACGCTCCAACTGTGAATCTGAGTATC
GTGCTCTTGCCACAACCGCTGCTGAACTTCTTTGGGTTACGCATCTTTTGCATGACCTCAAGGTCCCTATTTCAAAGCAGCCCTTACTCTTATGTGACAACAAAAGTGCT
ATTTTTTTTAGCTCTAATCCCGTTTCTCACAAGCAGGCCAAACATGTTGAACTAGATTATCATTTCCTTCGAGAACTTGTTATCGCTGGCAAACTTCGCACACAATATGT
ACCCTCTCATCTCCAAGTTGCTGACATCTTCACAAAGAGTATTTCTCGACCTCTCTTTGAATTTTTCAGATCCAAGCTTTACGTTCGTTCAAATCCGACGCTCAGCTTGC
GGGGGGGTGTTAAGGATAGTTGA
Protein sequence
MTADPSILDQSKNYTSKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSIRRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVDAFS
TAAYIINRLPTPLLGGKSSFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLCPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPI
SNFLEPHFHHIDSSPPTTSSPQTPRSSSSPCDICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVDFSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLAST
EPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPTNTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLGQLD
VKNAFLNGTLIERVHMELPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQFNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEF
ATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDLTLYRSLVSALQYLIITRPDIAYAVNSVSQFLHALTANHFLAVKR
ILRYVKGTLHFGLTFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSNCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSA
IFFSSNPVSHKQAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNPTLSLRGGVKDS