CuGenDBv2

CmaCh03G007770 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene ID: CmaCh03G007770
Organism: Cucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: Cma_Chr03:6040188..6048925
RNA-Seq Expression: CmaCh03G007770
Synteny: CmaCh03G007770
Gene Ontology terms:
GO:0006412 - translation (biological process)
GO:0006468 - protein phosphorylation (biological process)
GO:0015074 - DNA integration (biological process)
GO:0005840 - ribosome (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0004672 - protein kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0008097 - 5S rRNA binding (molecular function)
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits | e value | %identity | Alignment
RVW33283.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 0.0e+00 | 70.39
Query:  MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLS
        MASES  HLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQD+LGYVDGT+VPPPRFEPETS+T + KYLAW+AADQRLLCLLLS LTEEA+AVVVGLS
Subjt:  MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLS

Query:  TARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE
        TAR+VWLALE T++H SKARELRLKDDLQLMKRGTKPVAEYAR FK +C+QLHAIGRPVED DKVHWFLRGLGT+FS+FSTAQM+LTP+P FADLVSK E
Subjt:  TARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE

Query:  SFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF
        SFELFQRSLESS+ T  AF ATN  RT  SH   F    NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R DSS AHLAEAF
Subjt:  SFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF

Query:  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH---------------------------------------------
        NTSCS++GP+AADWFLDTGASAHMT DPSILDQSKNY GKDSVIVGNGASLPITH                                             
Subjt:  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH---------------------------------------------

Query:  -----------IESSNRKGG------GN------------------------------------------------------------------------
                   + +  R GG      GN                                                                        
Subjt:  -----------IESSNRKGG------GN------------------------------------------------------------------------

Query:  ----RTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETHFP
             TA YIINRLPTPLLGGKSPFELLYGY+PHY+NFHPFGCRVYP LRDYMPNKLSPRSIPCIFLGYSP HKGFRCLDP T++LYIT HAQFDETHFP
Subjt:  ----RTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETHFP

Query:  AIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSSP--HIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVD-FSSLGTHPMITRAK
         +PSSQAQPLS++ ISNFLEP LHHID SPP+++SP  HIP+S+SSPC+ICSDLVDESVQVDTSLAGS+L P  S+  SIE   D  SSLG+H MITRAK
Subjt:  AIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSSP--HIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVD-FSSLGTHPMITRAK

Query:  AGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYT
        AGIFKTRHPANLG+LGSSGLLS+LLASTEPKGFKSAAKNPAW+ AMDEE++ALQQN TW LVPRP NTNIVGSKWVFR KY PDGSVER KARLVAKGYT
Subjt:  AGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYT

Query:  QVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFS
        QVPGLDYTDTFSPVVKATTVRVVLS+ VTNKWPLRQLDVKNAFLNGTL E V+MEQPPGY+DPRFP H                                
Subjt:  QVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFS

Query:  CSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVV
                     QS+LIYLLLYVDDIIVTGNN SL++SFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVV
Subjt:  CSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVV

Query:  SQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRST
        SQHLT  GSPFS+PTLY+SLVGALQYLTITRPDIA+AVNSVSQFLHAPT DHFLAVKRILRYVKGTLHFGL FRPST+                      
Subjt:  SQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRST

Query:  SGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRELVIAGKL
                  ++SWSAKKQPTVSRSSCESEYRALA TAAELLW+ H+LHDLKVPI QQPLLLCDNKSAIF SSNPVSHKRAKHVELDYHFLRELV+AGKL
Subjt:  SGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRELVIAGKL

Query:  RTQYVPSHL
        RTQYVPSHL
Subjt:  RTQYVPSHL

RVW41798.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 0.0e+00 | 76.46
Query:  MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLS
        MASES  HLLPFNTLIHMITIKLSSSNYLLWKSQLLP LESQD+L YVDGT+VPPPRFEPETS+T + KYLAW+AA+QRLLCLLLSSLTEEA+AVVVGLS
Subjt:  MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLS

Query:  TARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE
        TAR+VWLALE T+SH SKARELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFL                         LVSK E
Subjt:  TARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE

Query:  SFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF
        SFELFQRSLESS+ T  AF ATNR RT  SH   F    NQRGRS+SH NNSSNRGRT+S  GRRPP CQIC  EGHYADRCNQRY R DSS AHLAEAF
Subjt:  SFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF

Query:  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH---------------------IESSNRKGG---------------
        NTSCS++GP+AADWFLDT ASAHMT DPSILDQSKNY GKDSVIVGNGASLPITH                     + +  R GG               
Subjt:  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH---------------------IESSNRKGG---------------

Query:  -------------------------------------------------------------------GNRTAAYIINRLPTPLLGGKSPFELLYGYTPHY
                                                                              TA YIIN LPTPLLGGKSPFELLY Y+PHY
Subjt:  -------------------------------------------------------------------GNRTAAYIINRLPTPLLGGKSPFELLYGYTPHY

Query:  DNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSS
        +NFHPFGCRVYP LRDYM NKLSPRSIPCIFLGYSP HKGFRCLDP T++LYIT HAQFDETHFP +PSSQAQPLS++ ISNFLEP LHHID SPP+  S
Subjt:  DNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSS

Query:  P--HIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKS
        P  HIP+S+SSPC+ICSDLVDESVQVDTSLAG +L P  S+  SIE   D  SSLG+HPMITRAKAGIFKTRHPANLG+LGSSGLL ALLASTEPKGFKS
Subjt:  P--HIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKS

Query:  AAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLR
        AAKNPAW+AAMDEE++ALQQN TW LVPRP NTNIVGSKWVFR KYLPDGSVER KARLVAKGYT VPGLDYTD FSPVVKATTVRVVLS+AVTNKWPLR
Subjt:  AAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLR

Query:  QLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSS
        QLDVKNAFLNGTL E V+MEQPPGY+D RFP HVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQS+LIYLLLYVDDIIVTGNN S
Subjt:  QLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSS

Query:  LINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIA
        L++SFTRKLHSEFATKDLGSL+YFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA
Subjt:  LINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIA

Query:  YAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALA
        +AVNSVSQFLHAPT DHFLAVKRILRYVKGTLHFGL FRPST+PS LVAYSDADWAGCPDTRRSTSGYSIYLGNNL+SWSAKKQPTVSRSSCESEYRALA
Subjt:  YAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALA

Query:  TTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTK--TLFSEYFELASYAL
         TAAELLW+TH+LHDLKVPI QQPLLLCDNKSAIF SSNPVSHKRAKHVELDYHFLRELV+AGKLRTQYVPSHLQVADIFTK  +LF    E+   AL
Subjt:  TTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTK--TLFSEYFELASYAL

RVW43615.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 0.0e+00 | 78.29
Query:  MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLS
        MASES  HLLPFNTLIHMI IKLSSSNYLLWKSQLLPLLESQD+L YVDGT+VPPPRFEPETS+T + KYLAW+AADQRLLCLLLSSLTEEA+AVVVGLS
Subjt:  MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLS

Query:  TARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE
        TAR+VWLALE T+SH SKARELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFL GLG +FS+FST QM+LTP+P FADLVSK E
Subjt:  TARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE

Query:  SFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF
        SFELFQRSLESS+ T  AF  TNR RT  SH   F    NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R DSS AHLAEAF
Subjt:  SFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF

Query:  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH---------------------IESSNRKGG------GN-------
        NTSCS++GP+AADWFLDTGASAHMT DPS LDQSKNY GKDSVIVGNGASLPITH                     + +  R GG      GN       
Subjt:  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH---------------------IESSNRKGG------GN-------

Query:  ---------------------------------------------------------------------RTAAYIINRLPTPLLGGKSPFELLYGYTPHY
                                                                              TA YIINRLPTPLLGGKSPFELLYG++PHY
Subjt:  ---------------------------------------------------------------------RTAAYIINRLPTPLLGGKSPFELLYGYTPHY

Query:  DNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSS
        +NFHPFGCRVYP LRDYMPNKLSPRSIPCIFLGYSP HKGFRCLDP T++LYIT HAQFDETHFP +PSSQAQPLS++ ISNFLEP LHHID SPP+  S
Subjt:  DNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSS

Query:  P--HIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKS
        P  HIP+S+SSPC+ICSDLVDESV+VDTSLAGS+L P  S+  SIE   D  SSLG+HPMITRAKAGIFKTRHPANLG+LGSSGLLSALLASTEPKGFKS
Subjt:  P--HIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKS

Query:  AAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLR
        AAKNPAW+AAMDEE++ALQQN TW LVPRP NTNIVGSKWVFR KYLPDGSVER KARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLS+A+TNKWPLR
Subjt:  AAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLR

Query:  QLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSS
        QLDVKNAFLNGTL E V+MEQPPGY+DPRFP HVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQS+LIYLLLYVDDIIVTGNN S
Subjt:  QLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSS

Query:  LINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIA
        L++SFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA
Subjt:  LINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIA

Query:  YAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALA
        +AVNSVSQFLHAPT DHFLAVKRILRYVKGTLHFGL FRPST+PS LVAYSDADWAGCPDTRRSTSGYSIYLGNNL+SWSAKKQPTV RSSCESEYRALA
Subjt:  YAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALA

Query:  TTAAELLWVTHILHDLKVPISQQP
         TAAELLW+TH+LHDLKVPI QQP
Subjt:  TTAAELLWVTHILHDLKVPISQQP

RVW45095.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | 0.0e+00 | 69
Query:  MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLS
        MASES  HLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQD+LGYVDGT+VPPPRFEPETS+T + KYLAW+AADQRLLCLLLSSLTEEA+ VVVGLS
Subjt:  MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLS

Query:  TARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE
        TAR+VWLALE T+SH SKARELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFLRGLGT+FS+FSTAQM+LTP+P FADLVSK E
Subjt:  TARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE

Query:  SFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF
        SFELFQRSLESS+ T  AF ATNR  T  SH   F    NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R DSS AHLAEAF
Subjt:  SFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF

Query:  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH---------------------------------------------
        NTSCS++GP+AADWFLDTGASAHMT DPSILDQSKNY GKD VIVGNGASLPITH                                             
Subjt:  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH---------------------------------------------

Query:  -----------IESSNRKGG------GN------------------------------------------------------------------------
                   + +  R GG      GN                                                                        
Subjt:  -----------IESSNRKGG------GN------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------RTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFL
                                              T  YIINRLPTPLLGGKSPFELLYGY+PHY+NFHPFGC VYP LRDYMPNKLSPRSIPCIFL
Subjt:  -------------------------------------RTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFL

Query:  GYSPVHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSSP--HIPQSSSSPCDICSDLVDESVQVDTSLAG
        GYSP HKGFRCLDP T++LYIT HAQFDETHFP +PSSQAQPLS++ ISNFLEP LHHID SPP+  SP  HIP+S+SSPC+ICSDLVDESVQVDTSLAG
Subjt:  GYSPVHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSSP--HIPQSSSSPCDICSDLVDESVQVDTSLAG

Query:  STLSPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPAN
        S+L P  S+  SIE   D  SSLG+HPMITRAKAGIFKTRHPANLG+LGSSGLLSALLASTEPKGFKSAAKNPAW+AAMDEE++ALQQN TW LV RP N
Subjt:  STLSPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPAN

Query:  TNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPK
        TNIVGSKWVFR KY PDGSVER KARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLS+AVTNKWPLRQLDV NAFLNGTL E V+MEQPPGY+DPRFP 
Subjt:  TNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPK

Query:  HVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPT
        HVCLLKKALYGLKQAPRAWFQRFSSF LTLGFSCSRADTSLFVFHQQS+LIYLLLYVDDIIVTGNN SL++SFTRKLHS+FATKDLGSLSYFLGLEASPT
Subjt:  HVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPT

Query:  PDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTL
        PDGLFISQLKYARDILTRA LLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA+AVNSVSQFLHA T DHFLAVKRILRYVKGTL
Subjt:  PDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTL

Query:  HFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKS
        HFGL FRPST+PS LVAYSDADWAGCPDTRRSTSGYSIYLGNNL+SWSAKKQPTVSRSSCESEYRALA T AELLW+TH+LHDLKVPI QQPLLLCDNKS
Subjt:  HFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKS

Query:  AIFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKTLFSEYFE
        AIFLSSNPVSHKRAKHVELDYHFLRELV+AGKLRTQYVPSHLQVADIFTK++    FE
Subjt:  AIFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKTLFSEYFE

RVW96109.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 0.0e+00 | 73.91
Query:  MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLS
        MASES +HLLPFNTLIHMITIKLSSSNYLLWKSQLL LLESQD+LGYVDGT+VPPPRFEPETS+T + KYLAW+A DQRLLCLLLSSLTEEA+AVVVGLS
Subjt:  MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLS

Query:  TARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE
        TAR+VWLALE T+SH  KARELRLKDDLQLMKR TKPVAEYAR FK +CDQLHAIGRPVEDIDKVH                                  
Subjt:  TARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE

Query:  SFELFQRSLESSDSTPTAFIATNRGRTHESHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNTS
                  +S  TP AF                +NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EG+YA+RCNQRY R DSS AHLA+A NTS
Subjt:  SFELFQRSLESSDSTPTAFIATNRGRTHESHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNTS

Query:  CSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHIESSNRKGGGNR------------------------------------
        CS++G +AADWFLDTGASAHMT DPSILDQSKNY GKDSVIVGNGASLPITH       G  ++                                    
Subjt:  CSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHIESSNRKGGGNR------------------------------------

Query:  -------TAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETH
               TA YIINRLPTPLLGGK+ FELLYGY+PHY+NFHPFGCRVYP LRDYMPNKLSPRSIPCIFLGYSP HKGFRCLDP T++LYIT HAQFDETH
Subjt:  -------TAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETH

Query:  FPAIPSSQAQPLSTIPISNFLEPHLHHIDSSP--PTTSSPHIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVD-FSSLGTHPMITR
        FP IPSSQAQPLS++ ISNFLEP LHHID SP  PT+ S HIP+S+SSPC+ICSDLVDESVQVDTSLAGS+  P  S+  SIE   D  SSLG+HPMITR
Subjt:  FPAIPSSQAQPLSTIPISNFLEPHLHHIDSSP--PTTSSPHIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVD-FSSLGTHPMITR

Query:  AKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKG
        AKAGIFKTRHPANLG+LGS GLLS LL STEPKGFKSAAKNP W+A MDEE++ALQQN                                         G
Subjt:  AKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKG

Query:  YTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLG
        YTQVPGLDYTDTFS VVKATTVRVVLS+AVTNKWPLRQ DVKNAFLNGTL E V+MEQP GY+D RFP HVCLLKKALYGLKQAPRAWFQRFSSFLLTLG
Subjt:  YTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLG

Query:  FSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPM
        FS SRA  SLFVFHQQS+LIYLLLYV DIIVTGNN SL+++FTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPM
Subjt:  FSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPM

Query:  VVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRR
        VVSQHLT   SPFS+PT YRSLVGALQYLTITRPDIA+AVNSVSQFLHAPT D+FLAVKRILRYVKGTLHFGL FRPST+PS LVAYSDADWAGCPDTRR
Subjt:  VVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRR

Query:  STSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRELVIAG
        STS YSIYLGNNL+SWSAKKQPTVSRSSCESEYRALA TAAELLW+TH+LHDLKVPI QQ LLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRELV+AG
Subjt:  STSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRELVIAG

Query:  KLRTQYVPSHLQVADIFTKTLFSEYFE
        KL TQYVPSHLQVADIFTK++    FE
Subjt:  KLRTQYVPSHLQVADIFTKTLFSEYFE

TrEMBL top hits | e value | %identity | Alignment
A0A2N9EEM3 Integrase catalytic domain-containing protein | 0.0e+00 | 73.99
Query:  MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLS
        MAS+S   LLPFNT+IHM+TIKLSSSNYLLWKSQLLPLLESQ++LG+VDGT+VPPP F+P TS T +PK+LAW+A DQRLL LLLSSLTEEAMA  VGLS
Subjt:  MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLS

Query:  TARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE
        T+R+VW ALE T+SH+SKARE+RLKDDLQLMKRGT+PV  YARAFK +CDQLHAIGRPV+D DK HWFLRGLG++FS+FSTAQ+ALTP+PCFADLVSK E
Subjt:  TARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE

Query:  SFELFQRSLESSDSTPTAFIATNRGR--THESHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
        SFELFQRSLE S +T  AF AT+RGR   H    ++ +NQ+GRS    N+SSNRGR++S QGRRPP CQICR EGHYADRC+QRY R DSS AHLAEAFN
Subjt:  SFELFQRSLESSDSTPTAFIATNRGR--THESHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASL-------PITHIESS------NRKGGGNR---------------------
         SCS++  + +DW+LDTGASAHMT   + LDQS  YT    ++V N  ++       P+  I +S       + G   R                     
Subjt:  TSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASL-------PITHIESS------NRKGGGNR---------------------

Query:  -------TAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETH
               TAAYIINRLPT LLGGKSPFELLYG +P+Y+NFHPFGCRVYP LRDYMPNKLSPRSIPCIFLGYSP HKGFRCLDP T+++YIT HAQFDETH
Subjt:  -------TAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETH

Query:  FPAIPSSQAQPLSTIPISNFLEPHLHHIDSSP--PTTSSPHIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIE--------PPVDFSSLG
        FP + +SQAQP+S++  SNFLEP L   D  P  P   SPHIPQS S+PCDIC+D VDES+QV+ SL G +L PS  +  S+E         PV  + + 
Subjt:  FPAIPSSQAQPLSTIPISNFLEPHLHHIDSSP--PTTSSPHIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIE--------PPVDFSSLG

Query:  THPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFK
        +HPM+TRAKAGIFKTRHPANL +LGSSGLLSALLASTEPKGFKSAAKNPAW+AAMDEEI+ALQ N TW LVPRPANTNIVGSKWVFR KYLPDGS+ER K
Subjt:  THPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFK

Query:  ARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFS
        ARLVAKGYTQVPGLDYTDTFSPV+KATTVRVVLS+AVTNKWPLRQLDVKNAFLNG+L E V+MEQPPGY+DPRFP HVC LKKALYGLKQAPRAWFQRFS
Subjt:  ARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFS

Query:  SFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDS
        SFLLTLGFSCSRADTSLFVFHQQS +IYLLLYVDDII+TGNNSSL++SFT KLHSEFATKDLGSLSYFLGLEA PTPDGLF+SQLKYARDILTRAQLLDS
Subjt:  SFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDS

Query:  KPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWA
        KPVHTPM                           YLTITRPDIA+AVNSVSQF+HAPTADHFLAVKRILRYVKGTLHFGL FRPS  P TLVAYSDADWA
Subjt:  KPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWA

Query:  GCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFL
        GCPDTRRSTSGYSIYLG+NL+SWSAKKQPTVSRSSCESEYRALA TAAELLW+TH+LHDLKVP+ QQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFL
Subjt:  GCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFL

Query:  RELVIAGKLRTQYVPSHLQVADIFTKTLFSEYFE
        RELV+AGKLRTQYVPSHLQVADIFTK++    FE
Subjt:  RELVIAGKLRTQYVPSHLQVADIFTKTLFSEYFE

A0A2N9I601 Uncharacterized protein | 0.0e+00 | 76.15
Query:  MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLS
        MAS+S   LLPFNT+IHM+TIKLSSSNYLLWKSQLLPLLESQ++LG+VDGT+VPPP F+P TS T +PK+LAW+A DQRLL LLLSSLTEEAMA  VGLS
Subjt:  MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLS

Query:  TARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE
        T+R+VW ALE T+SH+SKARE+RLKDDLQLMKRGT+PV  YARAFK +CDQLHAIGRPV+D DK HWFLRGLG++FS+FSTAQ+ALTP+PCFADLVSK E
Subjt:  TARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE

Query:  SFELFQRSLESSDSTPTAFIATNRGR--THESHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
        SFELFQRSLE S +T  AF AT+RGR   H    ++ +NQ+GRS    N+SSNRGR++S QGRRPP CQICR EGHYADRC+QRY R DSS AHLAEAFN
Subjt:  SFELFQRSLESSDSTPTAFIATNRGR--THESHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHIE-SSNR----------------------KGGGNR-----------
         SCS++  + +DW+LDTGASAHMT   + LDQS  YTGKD VIVGNGASLPITH E + NR                       G   R           
Subjt:  TSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHIE-SSNR----------------------KGGGNR-----------

Query:  -----------------TAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYI
                         TAAYIINRLPT LLGGKSPFELLYG +P+Y+NFHPFGCRVYP LRDYMPNKLSPRSIPCIFLGYSP HKGFRCLDP T+++YI
Subjt:  -----------------TAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYI

Query:  TCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSP--PTTSSPHIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIE-------
        T HAQFDETHFP + +SQAQP+S++  SNFLEP L   D  P  P   SPHIPQS S+PCDIC+D VDES+QV+ SL G +L PS  +  S+E       
Subjt:  TCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSP--PTTSSPHIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIE-------

Query:  -PPVDFSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKY
          PV  + + +HPM+TRAKAGIFKTRHPANL +LG SGLLSALLASTEPKGFKSAAKNPAW+AAMDEEI+ALQ N TW LVPRPANTNIVGSKWVFR KY
Subjt:  -PPVDFSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKY

Query:  LPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQ
        LPDGS+ER KARLVAKGYTQVPGLDYTDTFSPV+KATTVRVVLS+AVTNKWPLRQLDVKNAFLNG+L E V+MEQPPGY+DPRFP HVC LKKALYGLKQ
Subjt:  LPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQ

Query:  APRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARD
        APRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQS +IYLLLYVDDII+TGNNSSL++SFT KLHSEFATKDLGSLSYFLGLEA PTPDGLF+SQLKYARD
Subjt:  APRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARD

Query:  ILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPST
        ILTRAQLLDSKPVHTPMVVSQHL+ADG  F DPTLYRSLVGALQYLTITRPDIA+AVNSVSQF+HAPTADHFLAVKRILRYVKGTLHFGL FRPS  P T
Subjt:  ILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPST

Query:  LVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRA
        LVAYSDADWAGCPDTRRSTSGYSIYLG+NL+SWSAKKQPTVSRSSCESEYRALA TAAELLW+TH+LHDLKVP+ QQPLLLCDNKSAIFLSSNPVSHKRA
Subjt:  LVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRA

Query:  KHVELDYHFLRELVIAGKLRTQYVPSHLQVAD
        KHVELDYHFLRELV+AGKLRTQY   HL + +
Subjt:  KHVELDYHFLRELVIAGKLRTQYVPSHLQVAD

A0A438E275 Retrovirus-related Pol polyprotein from transposon RE1 | 0.0e+00 | 76.46
Query:  MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLS
        MASES  HLLPFNTLIHMITIKLSSSNYLLWKSQLLP LESQD+L YVDGT+VPPPRFEPETS+T + KYLAW+AA+QRLLCLLLSSLTEEA+AVVVGLS
Subjt:  MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLS

Query:  TARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE
        TAR+VWLALE T+SH SKARELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFL                         LVSK E
Subjt:  TARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE

Query:  SFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF
        SFELFQRSLESS+ T  AF ATNR RT  SH   F    NQRGRS+SH NNSSNRGRT+S  GRRPP CQIC  EGHYADRCNQRY R DSS AHLAEAF
Subjt:  SFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF

Query:  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH---------------------IESSNRKGG---------------
        NTSCS++GP+AADWFLDT ASAHMT DPSILDQSKNY GKDSVIVGNGASLPITH                     + +  R GG               
Subjt:  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH---------------------IESSNRKGG---------------

Query:  -------------------------------------------------------------------GNRTAAYIINRLPTPLLGGKSPFELLYGYTPHY
                                                                              TA YIIN LPTPLLGGKSPFELLY Y+PHY
Subjt:  -------------------------------------------------------------------GNRTAAYIINRLPTPLLGGKSPFELLYGYTPHY

Query:  DNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSS
        +NFHPFGCRVYP LRDYM NKLSPRSIPCIFLGYSP HKGFRCLDP T++LYIT HAQFDETHFP +PSSQAQPLS++ ISNFLEP LHHID SPP+  S
Subjt:  DNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSS

Query:  P--HIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKS
        P  HIP+S+SSPC+ICSDLVDESVQVDTSLAG +L P  S+  SIE   D  SSLG+HPMITRAKAGIFKTRHPANLG+LGSSGLL ALLASTEPKGFKS
Subjt:  P--HIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKS

Query:  AAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLR
        AAKNPAW+AAMDEE++ALQQN TW LVPRP NTNIVGSKWVFR KYLPDGSVER KARLVAKGYT VPGLDYTD FSPVVKATTVRVVLS+AVTNKWPLR
Subjt:  AAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLR

Query:  QLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSS
        QLDVKNAFLNGTL E V+MEQPPGY+D RFP HVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQS+LIYLLLYVDDIIVTGNN S
Subjt:  QLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSS

Query:  LINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIA
        L++SFTRKLHSEFATKDLGSL+YFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA
Subjt:  LINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIA

Query:  YAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALA
        +AVNSVSQFLHAPT DHFLAVKRILRYVKGTLHFGL FRPST+PS LVAYSDADWAGCPDTRRSTSGYSIYLGNNL+SWSAKKQPTVSRSSCESEYRALA
Subjt:  YAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALA

Query:  TTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTK--TLFSEYFELASYAL
         TAAELLW+TH+LHDLKVPI QQPLLLCDNKSAIF SSNPVSHKRAKHVELDYHFLRELV+AGKLRTQYVPSHLQVADIFTK  +LF    E+   AL
Subjt:  TTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTK--TLFSEYFELASYAL

A0A438E763 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 0.0e+00; %identity: 78.29)
Query:  MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLS
        MASES  HLLPFNTLIHMI IKLSSSNYLLWKSQLLPLLESQD+L YVDGT+VPPPRFEPETS+T + KYLAW+AADQRLLCLLLSSLTEEA+AVVVGLS
Subjt:  MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLS

Query:  TARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE
        TAR+VWLALE T+SH SKARELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFL GLG +FS+FST QM+LTP+P FADLVSK E
Subjt:  TARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE

Query:  SFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF
        SFELFQRSLESS+ T  AF  TNR RT  SH   F    NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R DSS AHLAEAF
Subjt:  SFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF

Query:  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH---------------------IESSNRKGG------GN-------
        NTSCS++GP+AADWFLDTGASAHMT DPS LDQSKNY GKDSVIVGNGASLPITH                     + +  R GG      GN       
Subjt:  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH---------------------IESSNRKGG------GN-------

Query:  ---------------------------------------------------------------------RTAAYIINRLPTPLLGGKSPFELLYGYTPHY
                                                                              TA YIINRLPTPLLGGKSPFELLYG++PHY
Subjt:  ---------------------------------------------------------------------RTAAYIINRLPTPLLGGKSPFELLYGYTPHY

Query:  DNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSS
        +NFHPFGCRVYP LRDYMPNKLSPRSIPCIFLGYSP HKGFRCLDP T++LYIT HAQFDETHFP +PSSQAQPLS++ ISNFLEP LHHID SPP+  S
Subjt:  DNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSS

Query:  P--HIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKS
        P  HIP+S+SSPC+ICSDLVDESV+VDTSLAGS+L P  S+  SIE   D  SSLG+HPMITRAKAGIFKTRHPANLG+LGSSGLLSALLASTEPKGFKS
Subjt:  P--HIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKS

Query:  AAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLR
        AAKNPAW+AAMDEE++ALQQN TW LVPRP NTNIVGSKWVFR KYLPDGSVER KARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLS+A+TNKWPLR
Subjt:  AAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLR

Query:  QLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSS
        QLDVKNAFLNGTL E V+MEQPPGY+DPRFP HVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQS+LIYLLLYVDDIIVTGNN S
Subjt:  QLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSS

Query:  LINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIA
        L++SFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA
Subjt:  LINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIA

Query:  YAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALA
        +AVNSVSQFLHAPT DHFLAVKRILRYVKGTLHFGL FRPST+PS LVAYSDADWAGCPDTRRSTSGYSIYLGNNL+SWSAKKQPTV RSSCESEYRALA
Subjt:  YAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALA

Query:  TTAAELLWVTHILHDLKVPISQQP
         TAAELLW+TH+LHDLKVPI QQP
Subjt:  TTAAELLWVTHILHDLKVPISQQP

A0A438EBA0 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 0.0e+00; %identity: 69)
Query:  MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLS
        MASES  HLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQD+LGYVDGT+VPPPRFEPETS+T + KYLAW+AADQRLLCLLLSSLTEEA+ VVVGLS
Subjt:  MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLS

Query:  TARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE
        TAR+VWLALE T+SH SKARELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFLRGLGT+FS+FSTAQM+LTP+P FADLVSK E
Subjt:  TARDVWLALETTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTE

Query:  SFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF
        SFELFQRSLESS+ T  AF ATNR  T  SH   F    NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R DSS AHLAEAF
Subjt:  SFELFQRSLESSDSTPTAFIATNRGRTHESHPASF---TNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAF

Query:  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH---------------------------------------------
        NTSCS++GP+AADWFLDTGASAHMT DPSILDQSKNY GKD VIVGNGASLPITH                                             
Subjt:  NTSCSIAGPDAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH---------------------------------------------

Query:  -----------IESSNRKGG------GN------------------------------------------------------------------------
                   + +  R GG      GN                                                                        
Subjt:  -----------IESSNRKGG------GN------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------RTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFL
                                              T  YIINRLPTPLLGGKSPFELLYGY+PHY+NFHPFGC VYP LRDYMPNKLSPRSIPCIFL
Subjt:  -------------------------------------RTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFL

Query:  GYSPVHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSSP--HIPQSSSSPCDICSDLVDESVQVDTSLAG
        GYSP HKGFRCLDP T++LYIT HAQFDETHFP +PSSQAQPLS++ ISNFLEP LHHID SPP+  SP  HIP+S+SSPC+ICSDLVDESVQVDTSLAG
Subjt:  GYSPVHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSSP--HIPQSSSSPCDICSDLVDESVQVDTSLAG

Query:  STLSPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPAN
        S+L P  S+  SIE   D  SSLG+HPMITRAKAGIFKTRHPANLG+LGSSGLLSALLASTEPKGFKSAAKNPAW+AAMDEE++ALQQN TW LV RP N
Subjt:  STLSPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPAN

Query:  TNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPK
        TNIVGSKWVFR KY PDGSVER KARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLS+AVTNKWPLRQLDV NAFLNGTL E V+MEQPPGY+DPRFP 
Subjt:  TNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPK

Query:  HVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPT
        HVCLLKKALYGLKQAPRAWFQRFSSF LTLGFSCSRADTSLFVFHQQS+LIYLLLYVDDIIVTGNN SL++SFTRKLHS+FATKDLGSLSYFLGLEASPT
Subjt:  HVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPT

Query:  PDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTL
        PDGLFISQLKYARDILTRA LLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA+AVNSVSQFLHA T DHFLAVKRILRYVKGTL
Subjt:  PDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTL

Query:  HFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKS
        HFGL FRPST+PS LVAYSDADWAGCPDTRRSTSGYSIYLGNNL+SWSAKKQPTVSRSSCESEYRALA T AELLW+TH+LHDLKVPI QQPLLLCDNKS
Subjt:  HFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKS

Query:  AIFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKTLFSEYFE
        AIFLSSNPVSHKRAKHVELDYHFLRELV+AGKLRTQYVPSHLQVADIFTK++    FE
Subjt:  AIFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKTLFSEYFE

SwissProt top hits (e value, %identity, alignment)
P04146 Copia protein (e value: 1.1e-96; %identity: 30.6)
Query:  TAAYIINRLPTPLL--GGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETHFPAIP
        TA Y+INR+P+  L    K+P+E+ +   P+  +   FG  VY ++++    K   +S   IF+GY P   GF+  D    K  +      DET+   + 
Subjt:  TAAYIINRLPTPLL--GGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETHFPAIP

Query:  SSQAQPLSTIPI--------SNFLEPHLHHIDSSPPTTS----------------SPHIPQSS-----------SSPCDICSDLVD------------ES
        +S+A    T+ +         NF       I +  P  S                + + P  S           S  CD    L D            + 
Subjt:  SSQAQPLSTIPI--------SNFLEPHLHHIDSSPPTTS----------------SPHIPQSS-----------SSPCDICSDLVD------------ES

Query:  VQVDTSLAGSTLSPSTSNSTSIEPPVDFSSLG----------------THPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWV
         + D  L  S  S + + S   E       +G                +  + T+ +    +  +  N  +L +  + + +  S +   ++      +W 
Subjt:  VQVDTSLAGSTLSPSTSNSTSIEPPVDFSSLG----------------THPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWV

Query:  AAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAF
         A++ E+ A + N+TWT+  RP N NIV S+WVF +KY   G+  R+KARLVA+G+TQ   +DY +TF+PV + ++ R +LS+ +     + Q+DVK AF
Subjt:  AAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAF

Query:  LNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQ--SNLIYLLLYVDDIIVTGNNSSLINSFT
        LNGTL E ++M  P G        +VC L KA+YGLKQA R WF+ F   L    F  S  D  +++  +   +  IY+LLYVDD+++   + + +N+F 
Subjt:  LNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQ--SNLIYLLLYVDDIIVTGNNSSLINSFT

Query:  RKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTI-TRPDIAYAVNS
        R L  +F   DL  + +F+G+      D +++SQ  Y + IL++  + +   V TP+    +     S     T  RSL+G L Y+ + TRPD+  AVN 
Subjt:  RKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTI-TRPDIAYAVNS

Query:  VSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPS-TVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGN-NLISWSAKKQPTVSRSSCESEYRALATTA
        +S++     ++ +  +KR+LRY+KGT+   LIF+ +    + ++ Y D+DWAG    R+ST+GY   + + NLI W+ K+Q +V+ SS E+EY AL    
Subjt:  VSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPS-TVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGN-NLISWSAKKQPTVSRSSCESEYRALATTA

Query:  AELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKTL
         E LW+  +L  + + +     +  DN+  I +++NP  HKRAKH+++ YHF RE V    +  +Y+P+  Q+ADIFTK L
Subjt:  AELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKTL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.2e-100; %identity: 33.33)
Query:  GGGNRTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETHFP
        G   +TA Y+INR P+  L  + P  +       Y +   FGCR + ++      KL  +SIPCIF+GY     G+R  DP   K+  +    F E+   
Subjt:  GGGNRTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDPATTKLYITCHAQFDETHFP

Query:  AIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSSPHIPQSS----SSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVDFSSLGTHPMITRA
           +  ++ +    I NF+        + P T+++P   +S+    S   +   +++++  Q+D  +              +E P         P+    
Subjt:  AIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSSPHIPQSS----SSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVDFSSLGTHPMITRA

Query:  KAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNP---AWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVA
        +  +   R+P+   +L S           EP+  K    +P     + AM EE+ +LQ+N T+ LV  P     +  KWVF++K   D  + R+KARLV 
Subjt:  KAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNP---AWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVA

Query:  KGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLT
        KG+ Q  G+D+ + FSPVVK T++R +LS+A +    + QLDVK AFL+G L E ++MEQP G+        VC L K+LYGLKQAPR W+ +F SF+ +
Subjt:  KGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLT

Query:  LGFSCSRADTSL-FVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLE--ASPTPDGLFISQLKYARDILTRAQLLDSKP
          +  + +D  + F    ++N I LLLYVDD+++ G +  LI      L   F  KDLG     LG++     T   L++SQ KY   +L R  + ++KP
Subjt:  LGFSCSRADTSL-FVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLE--ASPTPDGLFISQLKYARDILTRAQLLDSKP

Query:  VHTPMVVSQHLTADGSPFS-------DPTLYRSLVGALQY-LTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAY
        V TP+     L+    P +           Y S VG+L Y +  TRPDIA+AV  VS+FL  P  +H+ AVK ILRY++GT    L F  S     L  Y
Subjt:  VHTPMVVSQHLTADGSPFS-------DPTLYRSLVGALQY-LTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAY

Query:  SDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVE
        +DAD AG  D R+S++GY        ISW +K Q  V+ S+ E+EY A   T  E++W+   L +L +   ++ ++ CD++SAI LS N + H R KH++
Subjt:  SDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVE

Query:  LDYHFLRELVIAGKLRTQYVPSHLQVADIFTKTLFSEYFEL
        + YH++RE+V    L+   + ++   AD+ TK +    FEL
Subjt:  LDYHFLRELVIAGKLRTQYVPSHLQVADIFTKTLFSEYFEL

P92519 Uncharacterized mitochondrial protein AtMg00810 (e value: 2.4e-64; %identity: 54.42)
Query:  IYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLY
        +YLLLYVDDI++TG++++L+N    +L S F+ KDLG + YFLG++    P GLF+SQ KYA  IL  A +LD KP+ TP+ +  + +   + + DP+ +
Subjt:  IYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLY

Query:  RSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAK
        RS+VGALQYLT+TRPDI+YAVN V Q +H PT   F  +KR+LRYVKGT+  GL    ++    + A+ D+DWAGC  TRRST+G+  +LG N+ISWSAK
Subjt:  RSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAK

Query:  KQPTVSRSSCESEYRALATTAAELLW
        +QPTVSRSS E+EYRALA TAAEL W
Subjt:  KQPTVSRSSCESEYRALATTAAELLW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 2.7e-177; %identity: 30.46)
Query:  KLSSSNYLLWKSQLLPLLESQDMLGYVDG-TMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLSTARDVWLALETTYSHQSKAR
        KL+S+NYL+W  Q+  L +  ++ G++DG T +PP     + +   NP Y  W+  D+ +   +L +++      V   +TA  +W  L   Y++ S   
Subjt:  KLSSSNYLLWKSQLLPLLESQDMLGYVDG-TMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLSTARDVWLALETTYSHQSKAR

Query:  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTESFELFQRSLESSDSTPTAFI
          +L+  L+   +GTK + +Y +      DQL  +G+P++  ++V   L  L  E+        A    P   ++  +  + E    ++ S+   P    
Subjt:  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTESFELFQRSLESSDSTPTAFI

Query:  ATNRGRTHESHPASFTNQRGRSYSHKNNSSN-------RGRTHSSQGRRPPH---CQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNT------SCSIA
        A +   T  ++  +  N+  R Y ++NN++N           H +  +  P+   CQIC  +GH A RC+Q      S ++    +  T      + ++ 
Subjt:  ATNRGRTHESHPASFTNQRGRSYSHKNNSSN-------RGRTHSSQGRRPPH---CQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNT------SCSIA

Query:  GP-DAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH---------------------------------------------------
         P  + +W LD+GA+ H+T+D + L   + YTG D V+V +G+++PI+H                                                   
Subjt:  GP-DAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH---------------------------------------------------

Query:  ---------------------------------------------------------------------------------IESSNR-------------
                                                                                         I  SN+             
Subjt:  ---------------------------------------------------------------------------------IESSNR-------------

Query:  --------------------------------------------------------------------KGGGN---------------------------
                                                                              GG                            
Subjt:  --------------------------------------------------------------------KGGGN---------------------------

Query:  -------------------------------RTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVH
                                         A Y+INRLPTPLL  +SPF+ L+G +P+YD    FGC  YP+LR Y  +KL  +S  C+FLGYS   
Subjt:  -------------------------------RTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVH

Query:  KGFRCLDPATTKLYITCHAQFDETHFP-----------------------------------------------AIPSSQAQPL--STIPISNFLEPHLH
          + CL   T++LYI+ H +FDE  FP                                                 PSS + P   S +  SN       
Subjt:  KGFRCLDPATTKLYITCHAQFDETHFP-----------------------------------------------AIPSSQAQPL--STIPISNFLEPHLH

Query:  HIDSSP-PTTSSPHIPQ------------------SSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTS----------IEPPVDFSS---------
           SSP PT    + PQ                  S ++P +     + +S+      + S+ SP+TS S+S          I PP   +          
Subjt:  HIDSSP-PTTSSPHIPQ------------------SSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTS----------IEPPVDFSS---------

Query:  LGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLV-PRPANTNIVGSKWVFRIKYLPDGSVE
        L TH M TRAKAGI K     +L +        +L A +EP+    A K+  W  AM  EI A   N TW LV P P++  IVG +W+F  KY  DGS+ 
Subjt:  LGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLV-PRPANTNIVGSKWVFRIKYLPDGSVE

Query:  RFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQ
        R+KARLVAKGY Q PGLDY +TFSPV+K+T++R+VL +AV   WP+RQLDV NAFL GTL + V+M QPPG++D   P +VC L+KALYGLKQAPRAW+ 
Subjt:  RFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQ

Query:  RFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQL
           ++LLT+GF  S +DTSLFV  +  +++Y+L+YVDDI++TGN+ +L+++    L   F+ KD   L YFLG+EA   P GL +SQ +Y  D+L R  +
Subjt:  RFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQL

Query:  LDSKPVHTPMVVSQHLTA-DGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSD
        + +KPV TPM  S  L+   G+  +DPT YR +VG+LQYL  TRPDI+YAVN +SQF+H PT +H  A+KRILRY+ GT + G+  +     S L AYSD
Subjt:  LDSKPVHTPMVVSQHLTA-DGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSD

Query:  ADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVELD
        ADWAG  D   ST+GY +YLG++ ISWS+KKQ  V RSS E+EYR++A T++E+ W+  +L +L + +++ P++ CDN  A +L +NPV H R KH+ +D
Subjt:  ADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVELD

Query:  YHFLRELVIAGKLRTQYVPSHLQVADIFTKTLFSEYFE
        YHF+R  V +G LR  +V +H Q+AD  TK L    F+
Subjt:  YHFLRELVIAGKLRTQYVPSHLQVADIFTKTLFSEYFE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 9.5e-170; %identity: 30.7)
Query:  KLSSSNYLLWKSQLLPLLESQDMLGYVDG-TMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLSTARDVWLALETTYSHQSKAR
        KL+S+NYL+W  Q+  L +  ++ G++DG T +PP     +     NP Y  WR  D+ +   +L +++      V   +TA  +W  L   Y++ S   
Subjt:  KLSSSNYLLWKSQLLPLLESQDMLGYVDG-TMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLSTARDVWLALETTYSHQSKAR

Query:  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTESFELFQRSLESSDSTP-TAF
          +L+                   F    DQL  +G+P++  ++V   L  L  ++        A    P   ++  +  + E    +L S++  P TA 
Subjt:  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTESFELFQRSLESSDSTP-TAF

Query:  IATNRGRTHESHPASFTNQRG--RSYSHKNNSSNRGRTHSSQGR---RPP-----HCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNT------SCSI
        + T+R      +     N RG  R+Y++ NN SN  +  SS  R   R P      CQIC  +GH A RC Q +    +++   + +  T      + ++
Subjt:  IATNRGRTHESHPASFTNQRG--RSYSHKNNSSNRGRTHSSQGR---RPP-----HCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNT------SCSI

Query:  AGP-DAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH--------------------------------------------------
          P +A +W LD+GA+ H+T+D + L   + YTG D V++ +G+++PITH                                                  
Subjt:  AGP-DAADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITH--------------------------------------------------

Query:  ----------------------------------------------------------------------------------IESSNRKGGGNRT-----
                                                                                          I  S++    N T     
Subjt:  ----------------------------------------------------------------------------------IESSNRKGGGNRT-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------AAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPV
                                          A Y+INRLPTPLL  +SPF+ L+G  P+Y+    FGC  YP+LR Y  +KL  +S  C F+GYS  
Subjt:  ----------------------------------AAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPV

Query:  HKGFRCLDPATTKLYITCHAQFDETHFP------AIPSSQAQ------------PLSTIPISNFLEPHL-HHIDSSP-----------------------
           + CL   T +LY + H QFDE  FP       + +SQ Q             L T P+     P L  H+D+SP                       
Subjt:  HKGFRCLDPATTKLYITCHAQFDETHFP------AIPSSQAQ------------PLSTIPISNFLEPHL-HHIDSSP-----------------------

Query:  --------PT---------TSSPHIPQSSSSPCDICSDLVDESVQVD----------------------TSLAGSTLSPSTSNSTSIEPPV---------
                PT         T+ PH  Q+S+S   I ++    S   +                      TS++      S+S ST   PPV         
Subjt:  --------PT---------TSSPHIPQSSSSPCDICSDLVDESVQVD----------------------TSLAGSTLSPSTSNSTSIEPPV---------

Query:  -DFSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLV-PRPANTNIVGSKWVFRIKYLP
           + + TH M TRAK GI K     +          ++L A++EP+    A K+  W  AM  EI A   N TW LV P P +  IVG +W+F  K+  
Subjt:  -DFSSLGTHPMITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLV-PRPANTNIVGSKWVFRIKYLP

Query:  DGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAP
        DGS+ R+KARLVAKGY Q PGLDY +TFSPV+K+T++R+VL +AV   WP+RQLDV NAFL GTL + V+M QPPG+VD   P +VC L+KA+YGLKQAP
Subjt:  DGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAP

Query:  RAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDIL
        RAW+    ++LLT+GF  S +DTSLFV  +  ++IY+L+YVDDI++TGN++ L+      L   F+ K+   L YFLG+EA   P GL +SQ +Y  D+L
Subjt:  RAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDIL

Query:  TRAQLLDSKPVHTPMVVSQHLTA-DGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTL
         R  +L +KPV TPM  S  LT   G+   DPT YR +VG+LQYL  TRPD++YAVN +SQ++H PT DH+ A+KR+LRY+ GT   G+  +     S L
Subjt:  TRAQLLDSKPVHTPMVVSQHLTA-DGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTL

Query:  VAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAK
         AYSDADWAG  D   ST+GY +YLG++ ISWS+KKQ  V RSS E+EYR++A T++EL W+  +L +L + +S  P++ CDN  A +L +NPV H R K
Subjt:  VAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAK

Query:  HVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKTLFSEYFE
        H+ LDYHF+R  V +G LR  +V +H Q+AD  TK L    F+
Subjt:  HVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKTLFSEYFE

Arabidopsis top hits (e value, %identity, alignment)
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e value: 8.9e-14; %identity: 26.34)
Query:  ITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLT-EEAMAVVVGLSTARDVWLALETTYSHQS
        + + +  SNY  W+   L    S D++G++DGT++P            N   + W+  D  +   L  +LT ++     V  ST+RD+WL ++  + +  
Subjt:  ITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLT-EEAMAVVVGLSTARDVWLALETTYSHQS

Query:  KARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSK-TESFELFQRSLESS----
         AR LRL  +L+    G   VA+Y R  KK+ D L  +  PV D + V + L GL  +F           P P F D  +   E  +  +R+++ +    
Subjt:  KARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSK-TESFELFQRSLESS----

Query:  -DSTPTAFIATNRGRTHESHPASFTNQRGRSYSHKNNSSNRGR
          S+ +  +A +      +   S  NQ G     + N+  RGR
Subjt:  -DSTPTAFIATNRGRTHESHPASFTNQRGRSYSHKNNSSNRGR

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 2.5e-117; %identity: 45.82)
Query:  LSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTV
        L  +  + EP  +  A +   W  AMD+EI A++   TW +   P N   +G KWV++IKY  DG++ER+KARLVAKGYTQ  G+D+ +TFSPV K T+V
Subjt:  LSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTV

Query:  RVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYV----DPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSN
        +++L+I+    + L QLD+ NAFLNG L E ++M+ PPGY     D   P  VC LKK++YGLKQA R WF +FS  L+  GF  S +D + F+    + 
Subjt:  RVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYV----DPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSN

Query:  LIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTA-DGSPFSDPT
         + +L+YVDDII+  NN + ++    +L S F  +DLG L YFLGLE + +  G+ I Q KYA D+L    LL  KP   PM  S   +A  G  F D  
Subjt:  LIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTA-DGSPFSDPT

Query:  LYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWS
         YR L+G L YL ITR DI++AVN +SQF  AP   H  AV +IL Y+KGT+  GL F  S     L  +SDA +  C DTRRST+GY ++LG +LISW 
Subjt:  LYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWS

Query:  AKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRE
        +KKQ  VS+SS E+EYRAL+    E++W+     +L++P+S+  LL CDN +AI +++N V H+R KH+E D H +RE
Subjt:  AKKQPTVSRSSCESEYRALATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein (e value: 3.6e-15; %identity: 52.56)
Query:  YLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGY
        YLTITRPD+ +AVN +SQF  A       AV ++L YVKGT+  GL F  +T    L A++D+DWA CPDTRRS +G+
Subjt:  YLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGY
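
The %identity column can be cross-checked against the alignment text. The sketch below assumes the usual BLAST-style convention, in which a letter on the middle line marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap; the function name percent_identity is hypothetical and not part of the database. Applied to the ATMG00240.1 alignment above, it gives 41 identities over 78 aligned columns, which matches the listed 52.56.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity = identical columns / aligned columns * 100."""
    # Assumption: a letter on the midline marks an identical residue,
    # '+' a conservative substitution, and a space a mismatch or gap.
    identities = sum(1 for ch in midline if ch.isalpha())
    # The aligned query line (gaps included) gives the number of columns.
    return round(100.0 * identities / len(query), 2)


# Query and midline copied from the ATMG00240.1 alignment above.
query = "YLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGY"
midline = "YLTITRPD+ +AVN +SQF  A       AV ++L YVKGT+  GL F  +T    L A++D+DWA CPDTRRS +G+"

print(percent_identity(query, midline))
```

Using the aligned query length as the denominator keeps the calculation independent of trailing whitespace on the midline.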

ATMG00810.1 DNA/RNA polymerases superfamily protein (e value: 1.7e-65; %identity: 54.42)
Query:  IYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLY
        +YLLLYVDDI++TG++++L+N    +L S F+ KDLG + YFLG++    P GLF+SQ KYA  IL  A +LD KP+ TP+ +  + +   + + DP+ +
Subjt:  IYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLY

Query:  RSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAK
        RS+VGALQYLT+TRPDI+YAVN V Q +H PT   F  +KR+LRYVKGT+  GL    ++    + A+ D+DWAGC  TRRST+G+  +LG N+ISWSAK
Subjt:  RSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAK

Query:  KQPTVSRSSCESEYRALATTAAELLW
        +QPTVSRSS E+EYRALA TAAEL W
Subjt:  KQPTVSRSSCESEYRALATTAAELLW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 3.5e-26; %identity: 45.11)
Query:  MITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARL
        M+TR+KAGI K     +L +              EPK    A K+P W  AM EE+ AL +N TW LVP P N NI+G KWVF+ K   DG+++R KARL
Subjt:  MITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARL

Query:  VAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIA
        VAKG+ Q  G+ + +T+SPVV+  T+R +L++A
Subjt:  VAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIA


Sequences
CDS sequence
ATGGCTTCCGAATCTTGTTATCATCTTCTTCCTTTCAATACTCTAATCCATATGATCACCATCAAACTTTCTTCCTCCAATTATCTTCTTTGGAAAAGCCAACTTCTCCC
TCTTCTTGAGAGTCAAGACATGCTGGGCTATGTCGATGGAACTATGGTTCCACCACCTCGCTTTGAACCAGAAACCTCCTCAACATTCAACCCCAAATATTTGGCATGGA
GAGCAGCCGATCAACGACTTCTCTGTCTCCTGCTCTCCTCTCTCACTGAGGAAGCCATGGCTGTTGTCGTTGGTCTCTCTACTGCACGTGATGTTTGGCTTGCGTTGGAA
ACTACGTACAGCCATCAGTCAAAAGCTCGTGAACTGAGACTCAAGGATGACTTGCAGTTGATGAAACGTGGCACAAAACCTGTTGCTGAGTATGCCCGTGCCTTCAAAAA
AATTTGTGACCAACTTCATGCCATTGGCAGACCCGTCGAGGACATTGATAAAGTGCATTGGTTCCTTCGTGGACTCGGCACCGAATTTTCAGCTTTTTCTACTGCTCAGA
TGGCTCTCACCCCTATCCCCTGTTTTGCAGATCTAGTCTCTAAAACTGAAAGTTTTGAGTTGTTCCAGCGCTCCCTTGAGTCCTCTGACTCCACTCCTACAGCATTCATA
GCCACTAATCGTGGCCGCACCCATGAAAGTCACCCTGCTTCCTTTACCAACCAGCGAGGTCGTTCTTATTCTCACAAAAACAACTCTTCTAATCGAGGACGAACCCACTC
AAGTCAGGGTCGTCGACCACCTCATTGCCAAATATGCCGCAAAGAGGGCCATTATGCTGACCGCTGCAACCAACGGTATGTTCGACCTGATTCTTCTCATGCTCACCTTG
CTGAAGCCTTTAACACGTCATGTTCTATTGCTGGACCCGATGCTGCTGATTGGTTTTTGGACACTGGAGCTTCGGCCCATATGACTGCCGACCCATCAATTCTGGATCAG
TCTAAAAATTACACGGGTAAGGACTCTGTGATTGTAGGAAACGGTGCATCCCTACCCATTACCCACATCGAATCATCAAACAGGAAGGGTGGTGGCAACCGCACTGCAGC
TTATATTATCAACCGGTTGCCTACTCCACTTCTTGGAGGTAAGTCACCCTTTGAACTCCTTTATGGCTACACTCCACATTATGACAATTTTCATCCCTTTGGTTGTCGTG
TTTATCCTTATTTGCGTGATTATATGCCTAACAAGCTTTCTCCCCGCAGCATTCCTTGTATTTTTTTGGGTTATAGTCCTGTTCATAAAGGGTTCCGCTGTCTTGATCCC
GCCACCACTAAGCTATATATCACCTGCCATGCTCAATTTGATGAAACTCACTTTCCTGCTATCCCTAGCTCCCAAGCCCAACCTCTTTCCACTATTCCTATTTCAAATTT
CTTGGAACCACATCTTCATCATATTGATTCATCCCCCCCTACCACTTCATCACCGCACATTCCTCAATCCAGTTCATCCCCGTGTGATATTTGTTCTGACCTTGTAGATG
AGTCTGTGCAGGTTGATACTTCTCTTGCAGGTTCCACTTTGTCACCCTCGACTTCTAATTCAACCTCTATTGAACCTCCTGTTGATTTCTCTTCTTTGGGCACTCATCCT
ATGATCACACGCGCCAAAGCTGGTATATTCAAGACTCGTCATCCAGCAAATCTTGGTATGTTGGGCTCATCTGGACTTCTCTCTGCTCTTCTTGCATCCACTGAGCCAAA
AGGATTCAAATCTGCGGCTAAGAATCCTGCTTGGGTGGCTGCCATGGATGAAGAAATTCGAGCGTTACAACAAAATGATACTTGGACTTTGGTTCCTCGCCCTGCCAACA
CCAACATCGTGGGCTCTAAATGGGTGTTTCGTATTAAATATTTGCCCGATGGATCCGTCGAGCGTTTCAAGGCTCGTCTCGTTGCCAAAGGTTATACTCAGGTTCCTGGT
CTTGACTACACTGACACTTTCAGTCCAGTTGTCAAAGCTACCACTGTCCGTGTTGTGCTTTCTATTGCAGTCACAAATAAATGGCCTCTTCGACAACTTGATGTCAAGAA
TGCTTTTCTCAATGGAACGCTTATTGAACGTGTTCATATGGAACAACCTCCTGGGTATGTTGATCCTCGATTTCCAAAGCATGTTTGTCTATTAAAGAAAGCTCTCTATG
GCTTAAAGCAAGCTCCTCGTGCTTGGTTTCAGCGTTTTAGCTCATTTCTTCTCACACTTGGGTTTTCTTGTAGTCGCGCTGACACGTCCCTTTTTGTCTTTCATCAGCAA
TCTAACCTTATCTATTTGCTTCTTTATGTTGATGACATTATTGTTACCGGCAACAACTCATCTCTTATTAACAGCTTTACTCGCAAGCTTCATTCTGAGTTTGCTACCAA
AGATTTGGGTTCTCTCAGTTACTTTCTTGGTCTTGAAGCTTCACCCACTCCTGATGGTCTCTTTATCAGTCAGTTAAAATATGCTCGAGATATTCTTACTCGTGCTCAGT
TGCTCGATAGCAAACCAGTTCACACTCCCATGGTTGTTTCTCAACACCTGACTGCTGATGGTTCTCCTTTCTCTGATCCTACTCTCTACAGATCTCTTGTTGGCGCCCTT
CAGTACTTGACTATCACGCGTCCAGATATTGCCTATGCTGTCAATTCTGTCAGTCAATTCTTGCATGCCCCTACTGCAGATCACTTTCTTGCTGTCAAACGTATTCTTCG
CTATGTCAAAGGAACACTCCACTTTGGTCTTATCTTTCGTCCATCCACTGTTCCTAGTACGCTAGTCGCTTATTCGGATGCTGACTGGGCTGGTTGTCCCGATACTCGTC
GTTCTACATCCGGCTATTCTATTTATCTTGGTAACAATCTCATTTCTTGGAGTGCCAAAAAGCAACCTACTGTATCACGCTCCAGCTGTGAATCTGAGTATCGTGCTCTT
GCCACAACTGCTGCTGAACTTCTTTGGGTTACGCATATTTTGCATGACCTCAAGGTCCCTATTTCACAGCAGCCCTTACTCTTATGTGACAACAAAAGTGCTATTTTTTT
GAGCTCTAATCCTGTTTCTCACAAGCGGGCCAAGCATGTTGAACTAGATTATCATTTCCTTCGAGAACTTGTTATCGCTGGCAAACTTCGTACACAATATGTACCCTCTC
ATCTCCAAGTTGCTGACATCTTCACAAAGACTCTGTTTTCTGAGTATTTTGAGCTCGCCTCGTATGCACTCCCCAGATCATGGGATTCAGAACTAAACAGGCACCGACAA
GACAAGATCTTGAAAGCTTCTTCAGACAAAAATATTATACTGCGATGCTCGACCCTCAAGGGCATGGGTTGGCTTTCTTTGGTGCCTTTGTCGCATGCATCCATAAACAA
AATGTGGGGAGCTGATATGCGTGGAAAGTTGTCATTAATTTGGTTTTATCCTCAAGATTCTCCAAGCTGTCACTTCTCTCTTGGTCTGGTTTCTGATCCACGCTCACGTT
GCTACCCACTCGCTCACCCCCCACCGCCAGCCTACAGTCACACAGCAGCTCAACGCCAGCTGATTCCTCCTCCTTCTGCAGCCGTCATCTTCTCCATCCATCAAGAGGAA
GAAGAAGAAGAGGAAGAAGAAGAAGAGGAAGAAGAAGAAGAAGAAGCCGTCTTCGAGTTTCACTGTACTTTTTGA
mRNA sequence
ATGGCTTCCGAATCTTGTTATCATCTTCTTCCTTTCAATACTCTAATCCATATGATCACCATCAAACTTTCTTCCTCCAATTATCTTCTTTGGAAAAGCCAACTTCTCCC
TCTTCTTGAGAGTCAAGACATGCTGGGCTATGTCGATGGAACTATGGTTCCACCACCTCGCTTTGAACCAGAAACCTCCTCAACATTCAACCCCAAATATTTGGCATGGA
GAGCAGCCGATCAACGACTTCTCTGTCTCCTGCTCTCCTCTCTCACTGAGGAAGCCATGGCTGTTGTCGTTGGTCTCTCTACTGCACGTGATGTTTGGCTTGCGTTGGAA
ACTACGTACAGCCATCAGTCAAAAGCTCGTGAACTGAGACTCAAGGATGACTTGCAGTTGATGAAACGTGGCACAAAACCTGTTGCTGAGTATGCCCGTGCCTTCAAAAA
AATTTGTGACCAACTTCATGCCATTGGCAGACCCGTCGAGGACATTGATAAAGTGCATTGGTTCCTTCGTGGACTCGGCACCGAATTTTCAGCTTTTTCTACTGCTCAGA
TGGCTCTCACCCCTATCCCCTGTTTTGCAGATCTAGTCTCTAAAACTGAAAGTTTTGAGTTGTTCCAGCGCTCCCTTGAGTCCTCTGACTCCACTCCTACAGCATTCATA
GCCACTAATCGTGGCCGCACCCATGAAAGTCACCCTGCTTCCTTTACCAACCAGCGAGGTCGTTCTTATTCTCACAAAAACAACTCTTCTAATCGAGGACGAACCCACTC
AAGTCAGGGTCGTCGACCACCTCATTGCCAAATATGCCGCAAAGAGGGCCATTATGCTGACCGCTGCAACCAACGGTATGTTCGACCTGATTCTTCTCATGCTCACCTTG
CTGAAGCCTTTAACACGTCATGTTCTATTGCTGGACCCGATGCTGCTGATTGGTTTTTGGACACTGGAGCTTCGGCCCATATGACTGCCGACCCATCAATTCTGGATCAG
TCTAAAAATTACACGGGTAAGGACTCTGTGATTGTAGGAAACGGTGCATCCCTACCCATTACCCACATCGAATCATCAAACAGGAAGGGTGGTGGCAACCGCACTGCAGC
TTATATTATCAACCGGTTGCCTACTCCACTTCTTGGAGGTAAGTCACCCTTTGAACTCCTTTATGGCTACACTCCACATTATGACAATTTTCATCCCTTTGGTTGTCGTG
TTTATCCTTATTTGCGTGATTATATGCCTAACAAGCTTTCTCCCCGCAGCATTCCTTGTATTTTTTTGGGTTATAGTCCTGTTCATAAAGGGTTCCGCTGTCTTGATCCC
GCCACCACTAAGCTATATATCACCTGCCATGCTCAATTTGATGAAACTCACTTTCCTGCTATCCCTAGCTCCCAAGCCCAACCTCTTTCCACTATTCCTATTTCAAATTT
CTTGGAACCACATCTTCATCATATTGATTCATCCCCCCCTACCACTTCATCACCGCACATTCCTCAATCCAGTTCATCCCCGTGTGATATTTGTTCTGACCTTGTAGATG
AGTCTGTGCAGGTTGATACTTCTCTTGCAGGTTCCACTTTGTCACCCTCGACTTCTAATTCAACCTCTATTGAACCTCCTGTTGATTTCTCTTCTTTGGGCACTCATCCT
ATGATCACACGCGCCAAAGCTGGTATATTCAAGACTCGTCATCCAGCAAATCTTGGTATGTTGGGCTCATCTGGACTTCTCTCTGCTCTTCTTGCATCCACTGAGCCAAA
AGGATTCAAATCTGCGGCTAAGAATCCTGCTTGGGTGGCTGCCATGGATGAAGAAATTCGAGCGTTACAACAAAATGATACTTGGACTTTGGTTCCTCGCCCTGCCAACA
CCAACATCGTGGGCTCTAAATGGGTGTTTCGTATTAAATATTTGCCCGATGGATCCGTCGAGCGTTTCAAGGCTCGTCTCGTTGCCAAAGGTTATACTCAGGTTCCTGGT
CTTGACTACACTGACACTTTCAGTCCAGTTGTCAAAGCTACCACTGTCCGTGTTGTGCTTTCTATTGCAGTCACAAATAAATGGCCTCTTCGACAACTTGATGTCAAGAA
TGCTTTTCTCAATGGAACGCTTATTGAACGTGTTCATATGGAACAACCTCCTGGGTATGTTGATCCTCGATTTCCAAAGCATGTTTGTCTATTAAAGAAAGCTCTCTATG
GCTTAAAGCAAGCTCCTCGTGCTTGGTTTCAGCGTTTTAGCTCATTTCTTCTCACACTTGGGTTTTCTTGTAGTCGCGCTGACACGTCCCTTTTTGTCTTTCATCAGCAA
TCTAACCTTATCTATTTGCTTCTTTATGTTGATGACATTATTGTTACCGGCAACAACTCATCTCTTATTAACAGCTTTACTCGCAAGCTTCATTCTGAGTTTGCTACCAA
AGATTTGGGTTCTCTCAGTTACTTTCTTGGTCTTGAAGCTTCACCCACTCCTGATGGTCTCTTTATCAGTCAGTTAAAATATGCTCGAGATATTCTTACTCGTGCTCAGT
TGCTCGATAGCAAACCAGTTCACACTCCCATGGTTGTTTCTCAACACCTGACTGCTGATGGTTCTCCTTTCTCTGATCCTACTCTCTACAGATCTCTTGTTGGCGCCCTT
CAGTACTTGACTATCACGCGTCCAGATATTGCCTATGCTGTCAATTCTGTCAGTCAATTCTTGCATGCCCCTACTGCAGATCACTTTCTTGCTGTCAAACGTATTCTTCG
CTATGTCAAAGGAACACTCCACTTTGGTCTTATCTTTCGTCCATCCACTGTTCCTAGTACGCTAGTCGCTTATTCGGATGCTGACTGGGCTGGTTGTCCCGATACTCGTC
GTTCTACATCCGGCTATTCTATTTATCTTGGTAACAATCTCATTTCTTGGAGTGCCAAAAAGCAACCTACTGTATCACGCTCCAGCTGTGAATCTGAGTATCGTGCTCTT
GCCACAACTGCTGCTGAACTTCTTTGGGTTACGCATATTTTGCATGACCTCAAGGTCCCTATTTCACAGCAGCCCTTACTCTTATGTGACAACAAAAGTGCTATTTTTTT
GAGCTCTAATCCTGTTTCTCACAAGCGGGCCAAGCATGTTGAACTAGATTATCATTTCCTTCGAGAACTTGTTATCGCTGGCAAACTTCGTACACAATATGTACCCTCTC
ATCTCCAAGTTGCTGACATCTTCACAAAGACTCTGTTTTCTGAGTATTTTGAGCTCGCCTCGTATGCACTCCCCAGATCATGGGATTCAGAACTAAACAGGCACCGACAA
GACAAGATCTTGAAAGCTTCTTCAGACAAAAATATTATACTGCGATGCTCGACCCTCAAGGGCATGGGTTGGCTTTCTTTGGTGCCTTTGTCGCATGCATCCATAAACAA
AATGTGGGGAGCTGATATGCGTGGAAAGTTGTCATTAATTTGGTTTTATCCTCAAGATTCTCCAAGCTGTCACTTCTCTCTTGGTCTGGTTTCTGATCCACGCTCACGTT
GCTACCCACTCGCTCACCCCCCACCGCCAGCCTACAGTCACACAGCAGCTCAACGCCAGCTGATTCCTCCTCCTTCTGCAGCCGTCATCTTCTCCATCCATCAAGAGGAA
GAAGAAGAAGAGGAAGAAGAAGAAGAGGAAGAAGAAGAAGAAGAAGCCGTCTTCGAGTTTCACTGTACTTTTTGA
Protein sequence
MASESCYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTMVPPPRFEPETSSTFNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLSTARDVWLALE
TTYSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTPIPCFADLVSKTESFELFQRSLESSDSTPTAFI
ATNRGRTHESHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNTSCSIAGPDAADWFLDTGASAHMTADPSILDQ
SKNYTGKDSVIVGNGASLPITHIESSNRKGGGNRTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPVHKGFRCLDP
ATTKLYITCHAQFDETHFPAIPSSQAQPLSTIPISNFLEPHLHHIDSSPPTTSSPHIPQSSSSPCDICSDLVDESVQVDTSLAGSTLSPSTSNSTSIEPPVDFSSLGTHP
MITRAKAGIFKTRHPANLGMLGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIRALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPG
LDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYVDPRFPKHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQ
SNLIYLLLYVDDIIVTGNNSSLINSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGAL
QYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLIFRPSTVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLISWSAKKQPTVSRSSCESEYRAL
ATTAAELLWVTHILHDLKVPISQQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKTLFSEYFELASYALPRSWDSELNRHRQ
DKILKASSDKNIILRCSTLKGMGWLSLVPLSHASINKMWGADMRGKLSLIWFYPQDSPSCHFSLGLVSDPRSRCYPLAHPPPPAYSHTAAQRQLIPPPSAAVIFSIHQEE
EEEEEEEEEEEEEEEAVFEFHCTF