; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh20G004340 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh20G004340
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationCmo_Chr20:2033038..2037165
RNA-Seq ExpressionCmoCh20G004340
SyntenyCmoCh20G004340
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0006468 - protein phosphorylation (biological process)
GO:0015074 - DNA integration (biological process)
GO:0005840 - ribosome (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0004672 - protein kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0008097 - 5S rRNA binding (molecular function)
GO:0030247 - polysaccharide binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN69607.1 hypothetical protein VITISV_009561 [Vitis vinifera]0.0e+0076.9Show/hide
Query:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP
        MASESS HLLPFNTLIHMITI+LSSSNYLLWKSQLLPLLESQD+LGYVDGT VP PRFEP TS+ L+ KYLAW+AADQRLLCLLLSSLTEEA+A VVGL 
Subjt:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP

Query:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE
        TAR+VWLALE TFSH SKARELRLKDDLQLMKR TKPVAE+AR FK +CDQLHAIGRPVED DKVHWFLRGLGT+FS+FSTAQMALT +P FADLVSKAE
Subjt:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE

Query:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
        SFELFQ SLESS  T TAFT TNR RT  HG+  A  +NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQ+Y R DSS AHLAEAFN
Subjt:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLI
        TSCS++GP+ ADWFLDTGASAHMT DPSILDQSKNY GKDSVIVGNGASLPITH                                              
Subjt:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLI

Query:  TIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSS
              TGRVVATGKRDGGLYVLER NSAFIS L+NKSLRASYDLWHARLGHVN+SVISFLN+K  LSLTSLLPSPSLC+TCQ AK+HRLPYSRNE RSS
Subjt:  TIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSS

Query:  HVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYT
        HVLDLIHCDLWGPSP+KSNS FLYYVIFIDDYSRFTW YPLKFKSDFFDIFLQFQKFVENQ+S+RIKVFQSDG  EFTNTCFK HL  SGIHHQLS PYT
Subjt:  HVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYT

Query:  PAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPR-SIPCIFL
        PAQNGRAERKHRHVTETGLALLFHSH+S RFW DA                                       C +Y     + P   S    IPCIFL
Subjt:  PAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPR-SIPCIFL

Query:  GYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPP--TTSSPQTPRSSSSPCDICSDLVDESVQVDTSLAG
        GYSP+HKGFRCL+P T++LYIT HAQFDETHFP +PSSQ QPLSS+ ISNFLEP  HHID SPP  T  SP  PRS+SSPC                   
Subjt:  GYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPP--TTSSPQTPRSSSSPCDICSDLVDESVQVDTSLAG

Query:  STLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPAN
        S+LPP  S+  SIE  VD  SSLG+HPMITRAKAGIFKTRHP NLG+LG SGLLSA LASTEPKGFKSAAKNP W+AAMDEE+QALQQN TW LV RP N
Subjt:  STLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPAN

Query:  TNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPT
        TNI                  R KARLVAKGYTQV GLDYTDTFSPVVKATTVRVVLS+AVTNKWPLRQLDVKN FLNGTL E V+MEQPPGYID RFPT
Subjt:  TNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPT

Query:  HVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPT
        HVCLLKKALYGLKQAPRAWFQRFSSF LTLGFS SR DTSLFVFHQQS++IYLLLYVDDIIVTGNN SL+DSFT KLHSEFATKDLGSLSYFLGLEASPT
Subjt:  HVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPT

Query:  PDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTL
        PDGLFI QLKYARDILT  QLLDSKPVHTPMVVSQHLT  G PFS+PTLYRSLVG LQYL ITR DIA+AVNSVSQFLHAPT DHFLAVKRILRYVKGTL
Subjt:  PDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTL

Query:  HFGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKS
        HFGLTF PS +PS LVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALA TAAELLW+THLLHDL VPI +QPLLLCDNK 
Subjt:  HFGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKS

Query:  AIFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHL
        AIFLSSNPVSHKRAKHVELDYHFLRELV+AGKLRTQYVPSHL
Subjt:  AIFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHL

CAN73071.1 hypothetical protein VITISV_032383 [Vitis vinifera]0.0e+0078.89Show/hide
Query:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP
        MASESS HLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQD+L YVDGT VPPPRFEPETS+TL+ KYLAW+AADQRLLCLLLSSLTEEA+AVVVGL 
Subjt:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP

Query:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE
        TAR+VWLALE TFSH SKARELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWF RGLG +FS+FSTAQM+LT +P FADLVSKAE
Subjt:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE

Query:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
        SFELFQ SLESS  T  AFT TNR RT  HG+  A   NQRGRS+SH NNSSNRGRT+S              EGHYADRCNQRY R DSS AHLAEAFN
Subjt:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLI
        TSCS++GP+ ADWFLDTGASAHMT DPSILDQSKNY GKDSVIVGNG SLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNL 
Subjt:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLI

Query:  TIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSS
        T+QNRQTGRVVATGKRDGGLYVLE GNSAFIS L+NKSLRASYDLWHARLGHVN+SVISFLN+KGHLSLTSLLPSPSLC+TCQLAK+HRLPYSRNE RSS
Subjt:  TIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSS

Query:  HVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYT
        HVLDLIHCDLWGPSP+KSNSGFLYYVIFIDDYSRFTW YPLKFKSDFFDIFLQFQKFVENQ+S+RIKVFQSDGG EFTNTCFK HLR SGIHHQLSCPYT
Subjt:  HVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYT

Query:  PAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLG
         AQNGRAERKHRHVTETGLALLFH HLS RFWV+ F                                      C +Y                      
Subjt:  PAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLG

Query:  YSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQTPRSSSSPCDICSDLVDESVQVDTSLAGSTL
                                     H P   S+                                                  S  +   L G +L
Subjt:  YSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQTPRSSSSPCDICSDLVDESVQVDTSLAGSTL

Query:  PPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANTNI
        PP  S+  SIE   D  SSLG+HPMITRAKAGIFKTRHPANLG+LGSSGLLSALL+STEPKGFKSAAKNPAW+AAMDEE+QALQQN TW LVPRP NTNI
Subjt:  PPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANTNI

Query:  VGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTHVC
        VGSKWVFR KY PDGSVER KARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLS+AVTNKWPLRQLDVKNAFLNGTL E V+MEQPPGYIDPRFPTHVC
Subjt:  VGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTHVC

Query:  LLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDG
        LLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQS++IYLLLYVDDIIVTGNN SL+DSFTRKLHSEFATKDLGSLSYFLGLEASPTPDG
Subjt:  LLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDG

Query:  LFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFG
        LFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA+AVNSVSQFLHAPT DHFLAVKRILRYVKGTLHFG
Subjt:  LFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFG

Query:  LTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIF
        LTFRPS +PS LVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALA TAAELLW+THLLHDLKVPI  QPLLLCDNKSAIF
Subjt:  LTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIF

Query:  LSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNPTLSLRGG
        LSSNPVSHKRAKHVELDYHFLRELV+AGKLRT+YVPSHLQVADIFTKS+SRPLFEFFRSKL++RSNPTLSLRGG
Subjt:  LSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNPTLSLRGG

CAN78022.1 hypothetical protein VITISV_015518 [Vitis vinifera]0.0e+0082.32Show/hide
Query:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP
        MASESS HLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQD+L YVDGT VPPPRFEPETS+TL+ KYLAW+AADQRLLCLLLSSLTEEA+AVVVGL 
Subjt:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP

Query:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE
        TAR+VWLALE TFSH SKARELRLKDDLQLMK GTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFLRG    F  F                     
Subjt:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE

Query:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
         F     SLESS  T  AFT TNR RT  HG+  A   NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R DSS AHLAEAFN
Subjt:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLI
        TSCS++GP+ ADWFLDTGASAHMT DPS LDQSKNY GKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVV HLTKNLLSISKLTSDFPLSVTFTNNL 
Subjt:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLI

Query:  TIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSS
        T+QNRQTGR VATGKRDGGLYVLERGNSAFIS L+NKSLRASYDLWHARL              GHLSLTSLLPSPSLC+TCQLAK+HRLPYSRNE RSS
Subjt:  TIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSS

Query:  HVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYT
        HVLDLIHCDLWGPSP+KSNSGFLYYVIFIDDYSRFTW YPLKFKSDFFDIFLQFQKFVENQ+S+RIKVFQSDGG EFTNTCFK HLR SGIHHQLSCPYT
Subjt:  HVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYT

Query:  PAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLG
        PAQNGRAERKHRHVTETGLALLFHSHLS RFWVDAFSTA YIINRLPTPLLGGKSPFELLYG +PHY+NFHPFGCRVYP LRDYMPNKLSPRSIPCIFLG
Subjt:  PAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLG

Query:  YSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPCDICSDLVDESVQVDTSLAGS
        YSP+HKGFRCLDP T++LYIT HAQFDETHFP +PSSQ QPLSS+ ISNFLEP  HHID SPP+  SP +  PRS+SSPC+ICSDLVDESVQVDTSLAGS
Subjt:  YSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPCDICSDLVDESVQVDTSLAGS

Query:  TLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANT
        +LPP  S+  SIE   D  SSLG+HPMITRAKAGIFKTRHPANLG+LGSSGLL ALLASTEPKGFKSAAKNPAW+AAMDEE+QALQQN TW LVPRP NT
Subjt:  TLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANT

Query:  NIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTH
        NIVGSKWVFR KYLPDGSVER KARLVAKGYT VPGLDYTDTFSPVVKATTVRVVLS+AVTNKWPLRQLDVKNAFLNGTL E V+MEQPPGYIDPRFPTH
Subjt:  NIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTH

Query:  VCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTP
        VCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQS++IYLLLYVDDIIVTGNN SL+DSFTRKLHSEFATKDLGSLSYFLGLEASPTP
Subjt:  VCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTP

Query:  DGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLH
        DGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTI RPDIA+AVNSVSQFLHAPT DHFLAVK           
Subjt:  DGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLH

Query:  FGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSA
                                                   +SWSAKKQPT SRSSCESEYRALA TAAELLW+THLLHDLKVPI +QPLLLCDNKSA
Subjt:  FGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSA

Query:  IFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSIS
        IF SSNPVSHKRAKHVELDYHFLRELV+AGKLRTQYVPSHLQVADIFTKS S
Subjt:  IFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSIS

RVW41798.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]0.0e+0075.59Show/hide
Query:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP
        MASESS HLLPFNTLIHMITIKLSSSNYLLWKSQLLP LESQD+L YVDGT VPPPRFEPETS+TL+ KYLAW+AA+QRLLCLLLSSLTEEA+AVVVGL 
Subjt:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP

Query:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE
        TAR+VWLALE TFSH SKARELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFL                         LVSKAE
Subjt:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE

Query:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
        SFELFQ SLESS  T  AFT TNR RT  HG+  A   NQRGRS+SH NNSSNRGRT+S  GRRPP CQIC  EGHYADRCNQRY R DSS AHLAEAFN
Subjt:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLI
        TSCS++GP+ ADWFLDT ASAHMT DPSILDQSKNY GKDSVIVGNGASLPITHTGTLSPVPNIHLLD                                
Subjt:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLI

Query:  TIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSS
           NRQTGRVVATGKRDGGLYVLER NSAFI  L+NKSLRASYDLWHARL                                                  
Subjt:  TIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSS

Query:  HVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYT
                                                                                            HLR SGIHHQLSCPYT
Subjt:  HVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYT

Query:  PAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLG
        PAQNGRAERKHRHVTETGLALLFHSHLS RFWVDAFSTA YIIN LPTPLLGGKSPFELLY Y+PHY+NFHPFGCRVYP LRDYM NKLSPRSIPCIFLG
Subjt:  PAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLG

Query:  YSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPCDICSDLVDESVQVDTSLAGS
        YSP+HKGFRCLDP T++LYIT HAQFDETHFP +PSSQ QPLSS+ ISNFLEP  HHID SPP+  SP +  PRS+SSPC+ICSDLVDESVQVDTSLAG 
Subjt:  YSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPCDICSDLVDESVQVDTSLAGS

Query:  TLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANT
        +LPP  S+  SIE   D  SSLG+HPMITRAKAGIFKTRHPANLG+LGSSGLL ALLASTEPKGFKSAAKNPAW+AAMDEE+QALQQN TW LVPRP NT
Subjt:  TLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANT

Query:  NIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTH
        NIVGSKWVFR KYLPDGSVER KARLVAKGYT VPGLDYTD FSPVVKATTVRVVLS+AVTNKWPLRQLDVKNAFLNGTL E V+MEQPPGYID RFPTH
Subjt:  NIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTH

Query:  VCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTP
        VCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQS++IYLLLYVDDIIVTGNN SL+DSFTRKLHSEFATKDLGSL+YFLGLEASPTP
Subjt:  VCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTP

Query:  DGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLH
        DGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA+AVNSVSQFLHAPT DHFLAVKRILRYVKGTLH
Subjt:  DGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLH

Query:  FGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSA
        FGLTFRPS +PS LVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALA TAAELLW+THLLHDLKVPI +QPLLLCDNKSA
Subjt:  FGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSA

Query:  IFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSIS
        IF SSNPVSHKRAKHVELDYHFLRELV+AGKLRTQYVPSHLQVADIFTKS S
Subjt:  IFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSIS

RVW45095.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.0e+0088.73Show/hide
Query:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP
        MASESS HLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQD+LGYVDGT VPPPRFEPETS+TL+ KYLAW+AADQRLLCLLLSSLTEEA+ VVVGL 
Subjt:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP

Query:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE
        TAR+VWLALE TFSH SKARELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFLRGLGT+FS+FSTAQM+LT +P FADLVSKAE
Subjt:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE

Query:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
        SFELFQ SLESS  T  AFT TNR  T  HG+  A   NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R DSS AHLAEAFN
Subjt:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLI
        TSCS++GP+ ADWFLDTGASAHMT DPSILDQSKNY GKD VIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNL 
Subjt:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLI

Query:  TIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSS
        T+QNRQTGRVVATGKRDGGLYVLERGNSAFIS L+NKSLRASYDLWHARLGHVN+SVISF+N+KGHLSLTSLLPSPSLC+TCQLAK+HRLPYSRNE RSS
Subjt:  TIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSS

Query:  HVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYT
        HVLDLIHCDLWGPSP+KSNSGFLYYVIFIDD+SRFTW YPLKFKSDFFDIFLQFQKFVENQ+S+RIKVFQSDGG EFTNTCFK HLR SGIHHQLSCPYT
Subjt:  HVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYT

Query:  PAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLG
        PAQNGRAERKHRHVTETGLALLFHSHLS RFWVDAFST  YIINRLPTPLLGGKSPFELLYGY+PHY+NFHPFGC VYP LRDYMPNKLSPRSIPCIFLG
Subjt:  PAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLG

Query:  YSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPCDICSDLVDESVQVDTSLAGS
        YSP+HKGFRCLDP T++LYIT HAQFDETHFP +PSSQ QPLSS+ ISNFLEP  HHID SPP+  SP +  PRS+SSPC+ICSDLVDESVQVDTSLAGS
Subjt:  YSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPCDICSDLVDESVQVDTSLAGS

Query:  TLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANT
        +LPP  S+  SIE   D  SSLG+HPMITRAKAGIFKTRHPANLG+LGSSGLLSALLASTEPKGFKSAAKNPAW+AAMDEE+QALQQN TW LV RP NT
Subjt:  TLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANT

Query:  NIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTH
        NIVGSKWVFR KY PDGSVER KARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLS+AVTNKWPLRQLDV NAFLNGTL E V+MEQPPGYIDPRFPTH
Subjt:  NIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTH

Query:  VCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTP
        VCLLKKALYGLKQAPRAWFQRFSSF LTLGFSCSRADTSLFVFHQQS++IYLLLYVDDIIVTGNN SL+DSFTRKLHS+FATKDLGSLSYFLGLEASPTP
Subjt:  VCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTP

Query:  DGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLH
        DGLFISQLKYARDILTRA LLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA+AVNSVSQFLHA T DHFLAVKRILRYVKGTLH
Subjt:  DGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLH

Query:  FGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSA
        FGLTFRPS +PS LVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALA T AELLW+THLLHDLKVPI +QPLLLCDNKSA
Subjt:  FGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSA

Query:  IFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNPTLSLRG
        IFLSSNPVSHKRAKHVELDYHFLRELV+AGKLRTQYVPSHLQVADIFTKS+SRPLFEFFRSKL++RSNPTLSLRG
Subjt:  IFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNPTLSLRG

TrEMBL top hitse value%identityAlignment
A0A438E275 Retrovirus-related Pol polyprotein from transposon RE10.0e+0075.59Show/hide
Query:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP
        MASESS HLLPFNTLIHMITIKLSSSNYLLWKSQLLP LESQD+L YVDGT VPPPRFEPETS+TL+ KYLAW+AA+QRLLCLLLSSLTEEA+AVVVGL 
Subjt:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP

Query:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE
        TAR+VWLALE TFSH SKARELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFL                         LVSKAE
Subjt:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE

Query:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
        SFELFQ SLESS  T  AFT TNR RT  HG+  A   NQRGRS+SH NNSSNRGRT+S  GRRPP CQIC  EGHYADRCNQRY R DSS AHLAEAFN
Subjt:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLI
        TSCS++GP+ ADWFLDT ASAHMT DPSILDQSKNY GKDSVIVGNGASLPITHTGTLSPVPNIHLLD                                
Subjt:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLI

Query:  TIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSS
           NRQTGRVVATGKRDGGLYVLER NSAFI  L+NKSLRASYDLWHARL                                                  
Subjt:  TIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSS

Query:  HVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYT
                                                                                            HLR SGIHHQLSCPYT
Subjt:  HVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYT

Query:  PAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLG
        PAQNGRAERKHRHVTETGLALLFHSHLS RFWVDAFSTA YIIN LPTPLLGGKSPFELLY Y+PHY+NFHPFGCRVYP LRDYM NKLSPRSIPCIFLG
Subjt:  PAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLG

Query:  YSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPCDICSDLVDESVQVDTSLAGS
        YSP+HKGFRCLDP T++LYIT HAQFDETHFP +PSSQ QPLSS+ ISNFLEP  HHID SPP+  SP +  PRS+SSPC+ICSDLVDESVQVDTSLAG 
Subjt:  YSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPCDICSDLVDESVQVDTSLAGS

Query:  TLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANT
        +LPP  S+  SIE   D  SSLG+HPMITRAKAGIFKTRHPANLG+LGSSGLL ALLASTEPKGFKSAAKNPAW+AAMDEE+QALQQN TW LVPRP NT
Subjt:  TLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANT

Query:  NIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTH
        NIVGSKWVFR KYLPDGSVER KARLVAKGYT VPGLDYTD FSPVVKATTVRVVLS+AVTNKWPLRQLDVKNAFLNGTL E V+MEQPPGYID RFPTH
Subjt:  NIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTH

Query:  VCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTP
        VCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQS++IYLLLYVDDIIVTGNN SL+DSFTRKLHSEFATKDLGSL+YFLGLEASPTP
Subjt:  VCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTP

Query:  DGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLH
        DGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA+AVNSVSQFLHAPT DHFLAVKRILRYVKGTLH
Subjt:  DGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLH

Query:  FGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSA
        FGLTFRPS +PS LVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALA TAAELLW+THLLHDLKVPI +QPLLLCDNKSA
Subjt:  FGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSA

Query:  IFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSIS
        IF SSNPVSHKRAKHVELDYHFLRELV+AGKLRTQYVPSHLQVADIFTKS S
Subjt:  IFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSIS

A0A438EBA0 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0088.73Show/hide
Query:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP
        MASESS HLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQD+LGYVDGT VPPPRFEPETS+TL+ KYLAW+AADQRLLCLLLSSLTEEA+ VVVGL 
Subjt:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP

Query:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE
        TAR+VWLALE TFSH SKARELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFLRGLGT+FS+FSTAQM+LT +P FADLVSKAE
Subjt:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE

Query:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
        SFELFQ SLESS  T  AFT TNR  T  HG+  A   NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R DSS AHLAEAFN
Subjt:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLI
        TSCS++GP+ ADWFLDTGASAHMT DPSILDQSKNY GKD VIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNL 
Subjt:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLI

Query:  TIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSS
        T+QNRQTGRVVATGKRDGGLYVLERGNSAFIS L+NKSLRASYDLWHARLGHVN+SVISF+N+KGHLSLTSLLPSPSLC+TCQLAK+HRLPYSRNE RSS
Subjt:  TIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSS

Query:  HVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYT
        HVLDLIHCDLWGPSP+KSNSGFLYYVIFIDD+SRFTW YPLKFKSDFFDIFLQFQKFVENQ+S+RIKVFQSDGG EFTNTCFK HLR SGIHHQLSCPYT
Subjt:  HVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYT

Query:  PAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLG
        PAQNGRAERKHRHVTETGLALLFHSHLS RFWVDAFST  YIINRLPTPLLGGKSPFELLYGY+PHY+NFHPFGC VYP LRDYMPNKLSPRSIPCIFLG
Subjt:  PAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLG

Query:  YSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPCDICSDLVDESVQVDTSLAGS
        YSP+HKGFRCLDP T++LYIT HAQFDETHFP +PSSQ QPLSS+ ISNFLEP  HHID SPP+  SP +  PRS+SSPC+ICSDLVDESVQVDTSLAGS
Subjt:  YSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPCDICSDLVDESVQVDTSLAGS

Query:  TLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANT
        +LPP  S+  SIE   D  SSLG+HPMITRAKAGIFKTRHPANLG+LGSSGLLSALLASTEPKGFKSAAKNPAW+AAMDEE+QALQQN TW LV RP NT
Subjt:  TLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANT

Query:  NIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTH
        NIVGSKWVFR KY PDGSVER KARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLS+AVTNKWPLRQLDV NAFLNGTL E V+MEQPPGYIDPRFPTH
Subjt:  NIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTH

Query:  VCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTP
        VCLLKKALYGLKQAPRAWFQRFSSF LTLGFSCSRADTSLFVFHQQS++IYLLLYVDDIIVTGNN SL+DSFTRKLHS+FATKDLGSLSYFLGLEASPTP
Subjt:  VCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTP

Query:  DGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLH
        DGLFISQLKYARDILTRA LLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA+AVNSVSQFLHA T DHFLAVKRILRYVKGTLH
Subjt:  DGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLH

Query:  FGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSA
        FGLTFRPS +PS LVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALA T AELLW+THLLHDLKVPI +QPLLLCDNKSA
Subjt:  FGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSA

Query:  IFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNPTLSLRG
        IFLSSNPVSHKRAKHVELDYHFLRELV+AGKLRTQYVPSHLQVADIFTKS+SRPLFEFFRSKL++RSNPTLSLRG
Subjt:  IFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNPTLSLRG

A5AKX4 Integrase catalytic domain-containing protein0.0e+0082.32Show/hide
Query:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP
        MASESS HLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQD+L YVDGT VPPPRFEPETS+TL+ KYLAW+AADQRLLCLLLSSLTEEA+AVVVGL 
Subjt:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP

Query:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE
        TAR+VWLALE TFSH SKARELRLKDDLQLMK GTKPVAEYAR FK +CDQLHAIGRPVED DKVHWFLRG    F  F                     
Subjt:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE

Query:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
         F     SLESS  T  AFT TNR RT  HG+  A   NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQRY R DSS AHLAEAFN
Subjt:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLI
        TSCS++GP+ ADWFLDTGASAHMT DPS LDQSKNY GKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVV HLTKNLLSISKLTSDFPLSVTFTNNL 
Subjt:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLI

Query:  TIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSS
        T+QNRQTGR VATGKRDGGLYVLERGNSAFIS L+NKSLRASYDLWHARL              GHLSLTSLLPSPSLC+TCQLAK+HRLPYSRNE RSS
Subjt:  TIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSS

Query:  HVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYT
        HVLDLIHCDLWGPSP+KSNSGFLYYVIFIDDYSRFTW YPLKFKSDFFDIFLQFQKFVENQ+S+RIKVFQSDGG EFTNTCFK HLR SGIHHQLSCPYT
Subjt:  HVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYT

Query:  PAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLG
        PAQNGRAERKHRHVTETGLALLFHSHLS RFWVDAFSTA YIINRLPTPLLGGKSPFELLYG +PHY+NFHPFGCRVYP LRDYMPNKLSPRSIPCIFLG
Subjt:  PAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLG

Query:  YSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPCDICSDLVDESVQVDTSLAGS
        YSP+HKGFRCLDP T++LYIT HAQFDETHFP +PSSQ QPLSS+ ISNFLEP  HHID SPP+  SP +  PRS+SSPC+ICSDLVDESVQVDTSLAGS
Subjt:  YSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQT--PRSSSSPCDICSDLVDESVQVDTSLAGS

Query:  TLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANT
        +LPP  S+  SIE   D  SSLG+HPMITRAKAGIFKTRHPANLG+LGSSGLL ALLASTEPKGFKSAAKNPAW+AAMDEE+QALQQN TW LVPRP NT
Subjt:  TLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANT

Query:  NIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTH
        NIVGSKWVFR KYLPDGSVER KARLVAKGYT VPGLDYTDTFSPVVKATTVRVVLS+AVTNKWPLRQLDVKNAFLNGTL E V+MEQPPGYIDPRFPTH
Subjt:  NIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTH

Query:  VCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTP
        VCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQS++IYLLLYVDDIIVTGNN SL+DSFTRKLHSEFATKDLGSLSYFLGLEASPTP
Subjt:  VCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTP

Query:  DGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLH
        DGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTI RPDIA+AVNSVSQFLHAPT DHFLAVK           
Subjt:  DGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLH

Query:  FGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSA
                                                   +SWSAKKQPT SRSSCESEYRALA TAAELLW+THLLHDLKVPI +QPLLLCDNKSA
Subjt:  FGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSA

Query:  IFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSIS
        IF SSNPVSHKRAKHVELDYHFLRELV+AGKLRTQYVPSHLQVADIFTKS S
Subjt:  IFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSIS

A5B2A6 Integrase catalytic domain-containing protein0.0e+0076.9Show/hide
Query:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP
        MASESS HLLPFNTLIHMITI+LSSSNYLLWKSQLLPLLESQD+LGYVDGT VP PRFEP TS+ L+ KYLAW+AADQRLLCLLLSSLTEEA+A VVGL 
Subjt:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP

Query:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE
        TAR+VWLALE TFSH SKARELRLKDDLQLMKR TKPVAE+AR FK +CDQLHAIGRPVED DKVHWFLRGLGT+FS+FSTAQMALT +P FADLVSKAE
Subjt:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE

Query:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
        SFELFQ SLESS  T TAFT TNR RT  HG+  A  +NQRGRS+SH NNSSNRGRT+S  GRRPP CQICR EGHYADRCNQ+Y R DSS AHLAEAFN
Subjt:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLI
        TSCS++GP+ ADWFLDTGASAHMT DPSILDQSKNY GKDSVIVGNGASLPITH                                              
Subjt:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLI

Query:  TIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSS
              TGRVVATGKRDGGLYVLER NSAFIS L+NKSLRASYDLWHARLGHVN+SVISFLN+K  LSLTSLLPSPSLC+TCQ AK+HRLPYSRNE RSS
Subjt:  TIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSS

Query:  HVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYT
        HVLDLIHCDLWGPSP+KSNS FLYYVIFIDDYSRFTW YPLKFKSDFFDIFLQFQKFVENQ+S+RIKVFQSDG  EFTNTCFK HL  SGIHHQLS PYT
Subjt:  HVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYT

Query:  PAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPR-SIPCIFL
        PAQNGRAERKHRHVTETGLALLFHSH+S RFW DA                                       C +Y     + P   S    IPCIFL
Subjt:  PAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPR-SIPCIFL

Query:  GYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPP--TTSSPQTPRSSSSPCDICSDLVDESVQVDTSLAG
        GYSP+HKGFRCL+P T++LYIT HAQFDETHFP +PSSQ QPLSS+ ISNFLEP  HHID SPP  T  SP  PRS+SSPC                   
Subjt:  GYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPP--TTSSPQTPRSSSSPCDICSDLVDESVQVDTSLAG

Query:  STLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPAN
        S+LPP  S+  SIE  VD  SSLG+HPMITRAKAGIFKTRHP NLG+LG SGLLSA LASTEPKGFKSAAKNP W+AAMDEE+QALQQN TW LV RP N
Subjt:  STLPPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPAN

Query:  TNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPT
        TNI                  R KARLVAKGYTQV GLDYTDTFSPVVKATTVRVVLS+AVTNKWPLRQLDVKN FLNGTL E V+MEQPPGYID RFPT
Subjt:  TNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPT

Query:  HVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPT
        HVCLLKKALYGLKQAPRAWFQRFSSF LTLGFS SR DTSLFVFHQQS++IYLLLYVDDIIVTGNN SL+DSFT KLHSEFATKDLGSLSYFLGLEASPT
Subjt:  HVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPT

Query:  PDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTL
        PDGLFI QLKYARDILT  QLLDSKPVHTPMVVSQHLT  G PFS+PTLYRSLVG LQYL ITR DIA+AVNSVSQFLHAPT DHFLAVKRILRYVKGTL
Subjt:  PDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTL

Query:  HFGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKS
        HFGLTF PS +PS LVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALA TAAELLW+THLLHDL VPI +QPLLLCDNK 
Subjt:  HFGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKS

Query:  AIFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHL
        AIFLSSNPVSHKRAKHVELDYHFLRELV+AGKLRTQYVPSHL
Subjt:  AIFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHL

A5C5R8 Integrase catalytic domain-containing protein0.0e+0078.89Show/hide
Query:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP
        MASESS HLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQD+L YVDGT VPPPRFEPETS+TL+ KYLAW+AADQRLLCLLLSSLTEEA+AVVVGL 
Subjt:  MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLP

Query:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE
        TAR+VWLALE TFSH SKARELRLKDDLQLMKRGTKPVAEYAR FK +CDQLHAIGRPVED DKVHWF RGLG +FS+FSTAQM+LT +P FADLVSKAE
Subjt:  TARDVWLALETTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAE

Query:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN
        SFELFQ SLESS  T  AFT TNR RT  HG+  A   NQRGRS+SH NNSSNRGRT+S              EGHYADRCNQRY R DSS AHLAEAFN
Subjt:  SFELFQLSLESSNSTPTAFTVTNRGRT--HGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFN

Query:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLI
        TSCS++GP+ ADWFLDTGASAHMT DPSILDQSKNY GKDSVIVGNG SLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNL 
Subjt:  TSCSIAGPDTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLI

Query:  TIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSS
        T+QNRQTGRVVATGKRDGGLYVLE GNSAFIS L+NKSLRASYDLWHARLGHVN+SVISFLN+KGHLSLTSLLPSPSLC+TCQLAK+HRLPYSRNE RSS
Subjt:  TIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSS

Query:  HVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYT
        HVLDLIHCDLWGPSP+KSNSGFLYYVIFIDDYSRFTW YPLKFKSDFFDIFLQFQKFVENQ+S+RIKVFQSDGG EFTNTCFK HLR SGIHHQLSCPYT
Subjt:  HVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYT

Query:  PAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLG
         AQNGRAERKHRHVTETGLALLFH HLS RFWV+ F                                      C +Y                      
Subjt:  PAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLG

Query:  YSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQTPRSSSSPCDICSDLVDESVQVDTSLAGSTL
                                     H P   S+                                                  S  +   L G +L
Subjt:  YSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQTPRSSSSPCDICSDLVDESVQVDTSLAGSTL

Query:  PPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANTNI
        PP  S+  SIE   D  SSLG+HPMITRAKAGIFKTRHPANLG+LGSSGLLSALL+STEPKGFKSAAKNPAW+AAMDEE+QALQQN TW LVPRP NTNI
Subjt:  PPSTSNSTSIEPPVD-FSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANTNI

Query:  VGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTHVC
        VGSKWVFR KY PDGSVER KARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLS+AVTNKWPLRQLDVKNAFLNGTL E V+MEQPPGYIDPRFPTHVC
Subjt:  VGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTHVC

Query:  LLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDG
        LLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQS++IYLLLYVDDIIVTGNN SL+DSFTRKLHSEFATKDLGSLSYFLGLEASPTPDG
Subjt:  LLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDG

Query:  LFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFG
        LFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLT  GSPFS+PTLYRSLVGALQYLTITRPDIA+AVNSVSQFLHAPT DHFLAVKRILRYVKGTLHFG
Subjt:  LFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFG

Query:  LTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIF
        LTFRPS +PS LVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALA TAAELLW+THLLHDLKVPI  QPLLLCDNKSAIF
Subjt:  LTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIF

Query:  LSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNPTLSLRGG
        LSSNPVSHKRAKHVELDYHFLRELV+AGKLRT+YVPSHLQVADIFTKS+SRPLFEFFRSKL++RSNPTLSLRGG
Subjt:  LSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNPTLSLRGG

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.0e-13527.11Show/hide
Query:  YLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYL--AWRAADQRLLCLLLSSLTEEAMAVVVGLPTARDVWLALETTFSHQSKARELRLK
        Y +WK ++  LL  QD+L  VDG               L P  +  +W+ A++     ++  L++  +       TAR +   L+  +  +S A +L L+
Subjt:  YLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYL--AWRAADQRLLCLLLSSLTEEAMAVVVGLPTARDVWLALETTFSHQSKARELRLK

Query:  DDLQLMKRGTK-PVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAESFELFQLSLESSNSTPTAFTVTNR
          L  +K  ++  +  +   F ++  +L A G  +E++DK+   L  L + +    TA   L+      +L        L    ++  N           
Subjt:  DDLQLMKRGTK-PVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAESFELFQLSLESSNSTPTAFTVTNR

Query:  GRTHGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRC-------------NQRYVRPDSSHA---HLAEAFNTSCSIAGPDT
           H +      N   ++   KN  +   +      +    C  C +EGH    C             N++ V+  +SH     + E  NTS      D 
Subjt:  GRTHGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRC-------------NQRYVRPDSSHA---HLAEAFNTSCSIAGPDT

Query:  ADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVG-NGASLPITHTG--TLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLITIQNRQT
          + LD+GAS H+  D S+   S        + V   G  +  T  G   L     I L DVL       NL+S+ +L  +  +S+ F  + +TI   + 
Subjt:  ADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVG-NGASLPITHTG--TLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLITIQNRQT

Query:  GRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLP----SPSLCNTCQLAKSHRLPYS--RNERRSSH
        G +V   K  G L  +   N  F +   N   + ++ LWH R GH++   +  + RK   S  SLL     S  +C  C   K  RLP+   +++     
Subjt:  GRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLP----SPSLCNTCQLAKSHRLPYS--RNERRSSH

Query:  VLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYTP
         L ++H D+ GP    +     Y+VIF+D ++ +   Y +K+KSD F +F  F    E  ++ ++     D G E+ +   +      GI + L+ P+TP
Subjt:  VLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYTP

Query:  AQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLL--GGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFL
          NG +ER  R +TE    ++  + L   FW +A  TA Y+INR+P+  L    K+P+E+ +   P+  +   FG  VY ++++    K   +S   IF+
Subjt:  AQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLL--GGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFL

Query:  GYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQTPRSSSSPCD--------------------
        GY P   GF+  D    K  +      DET+   + +S+     ++ + +  E    +  +        + P + S  CD                    
Subjt:  GYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQTPRSSSSPCD--------------------

Query:  ---ICSDLVDESVQVD----------------------------TSLAGSTLPPSTSNSTSIE-------------PPVDFSSLGTHPMITRAKAGIFKT
           I ++  +ES + D                                GS  P  +  S + E               ++  +  +  + T+ +    + 
Subjt:  ---ICSDLVDESVQVD----------------------------TSLAGSTLPPSTSNSTSIE-------------PPVDFSSLGTHPMITRAKAGIFKT

Query:  RHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLD
         +  N  +L +  + + +  S +   ++      +W  A++ E+ A + N+TWT+  RP N NIV S+WVF +KY   G+  R+KARLVA+G+TQ   +D
Subjt:  RHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLD

Query:  YTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADT
        Y +TF+PV + ++ R +LS+ +     + Q+DVK AFLNGTL E ++M  P G        +VC L KA+YGLKQA R WF+ F   L    F  S  D 
Subjt:  YTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADT

Query:  SLFVFHQQSNI---IYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQH
         +++   + NI   IY+LLYVDD+++   + + +++F R L  +F   DL  + +F+G+      D +++SQ  Y + IL++  + +   V TP+    +
Subjt:  SLFVFHQQSNI---IYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQH

Query:  LTADGSPFSDPTLYRSLVGALQYLTI-TRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLTFRPS-NVPSTLVAYSDADWAGCPDTRRSTS
             S     T  RSL+G L Y+ + TRPD+  AVN +S++     ++ +  +KR+LRY+KGT+   L F+ +    + ++ Y D+DWAG    R+ST+
Subjt:  LTADGSPFSDPTLYRSLVGALQYLTI-TRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLTFRPS-NVPSTLVAYSDADWAGCPDTRRSTS

Query:  GYSIYLGN-NLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRELVIAGKL
        GY   + + NL+ W+ K+Q +V+ SS E+EY AL     E LW+  LL  + + +     +  DN+  I +++NP  HKRAKH+++ YHF RE V    +
Subjt:  GYSIYLGN-NLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRELVIAGKL

Query:  RTQYVPSHLQVADIFTKSISRPLFEFFRSKL
          +Y+P+  Q+ADIFTK +    F   R KL
Subjt:  RTQYVPSHLQVADIFTKSISRPLFEFFRSKL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-15530.67Show/hide
Query:  SNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLPTARDVWLALETTFSHQSKARELRLK
        + +  W+ ++  LL  Q +   +D     P   + E           W   D+R    +   L+++ +  ++   TAR +W  LE+ +  ++   +L LK
Subjt:  SNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLPTARDVWLALETTFSHQSKARELRLK

Query:  DDLQL--MKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAESFELFQLSLESSNSTPTAFTVTN
          L    M  GT     +   F  +  QL  +G  +E+ DK    L  L + +   +T  +   +     D+ S         L  E     P      N
Subjt:  DDLQL--MKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAESFELFQLSLESSNSTPTAFTVTN

Query:  RGRTHGSHPASFTNQRGRSYSHKNN----SSNRGRTHSSQGRRPPHCQICRKEGHYADRC-NQRYVRPDSSHAHLAEAFNTSCSI---------------
        +G+      A  T  RGRSY   +N    S  RG++ +    R  +C  C + GH+   C N R  + ++S     +  NT+  +               
Subjt:  RGRTHGSHPASFTNQRGRSYSHKNN----SSNRGRTHSSQGRRPPHCQICRKEGHYADRC-NQRYVRPDSSHAHLAEAFNTSCSI---------------

Query:  ----AGPDTADWFLDTGASAHMTADPSILDQSKNYTGKD--SVIVGNGASLPITHTGTLSPVPNIH----LLDVLVVPHLTKNLLSISKLTSDFPLSVTF
            +GP+ ++W +DT AS H T    + D    Y   D  +V +GN +   I   G +    N+     L DV  VP L  NL+S   L  D   S  F
Subjt:  ----AGPDTADWFLDTGASAHMTADPSILDQSKNYTGKD--SVIVGNGASLPITHTGTLSPVPNIH----LLDVLVVPHLTKNLLSISKLTSDFPLSVTF

Query:  TNNLITIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSL-RASYDLWHARLGHVNHSVISFLNRKGHLSL---TSLLPSPSLCNTCQLAKSHRLP
         N    +   +   V+A G   G LY   R N+       N +    S DLWH R+GH++   +  L +K  +S    T++ P    C+ C   K HR+ 
Subjt:  TNNLITIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSL-RASYDLWHARLGHVNHSVISFLNRKGHLSL---TSLLPSPSLCNTCQLAKSHRLP

Query:  YSRNERRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGI
        +  +  R  ++LDL++ D+ GP  ++S  G  Y+V FIDD SR  W Y LK K   F +F +F   VE +   ++K  +SD G E+T+  F+ +  + GI
Subjt:  YSRNERRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGI

Query:  HHQLSCPYTPAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSP
         H+ + P TP  NG AER +R + E   ++L  + L   FW +A  TA Y+INR P+  L  + P  +       Y +   FGCR + ++      KL  
Subjt:  HHQLSCPYTPAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSP

Query:  RSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDS-SPPTTSSPQTPRSSSSPCDICSDLVDESVQ
        +SIPCIF+GY     G+R  DP   K+  +    F E+        +T    S  + N + P+F  I S S   TS+  T    S   +   +++++  Q
Subjt:  RSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDS-SPPTTSSPQTPRSSSSPCDICSDLVDESVQ

Query:  VDTSLAGSTLPPSTSNSTSIEPPVDFSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNP---AWVAAMDEEIQALQQNDT
        +D  +              +E P         P+    +  +   R+P+   +L S           EP+  K    +P     + AM EE+++LQ+N T
Subjt:  VDTSLAGSTLPPSTSNSTSIEPPVDFSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNP---AWVAAMDEEIQALQQNDT

Query:  WTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPP
        + LV  P     +  KWVF++K   D  + R+KARLV KG+ Q  G+D+ + FSPVVK T++R +LS+A +    + QLDVK AFL+G L E ++MEQP 
Subjt:  WTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPP

Query:  GYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSL-FVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLS
        G+        VC L K+LYGLKQAPR W+ +F SF+ +  +  + +D  + F    ++N I LLLYVDD+++ G +  LI      L   F  KDLG   
Subjt:  GYIDPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSL-FVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLS

Query:  YFLGLE--ASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFS-------DPTLYRSLVGALQY-LTITRPDIAYAVNSVSQFLHA
          LG++     T   L++SQ KY   +L R  + ++KPV TP+     L+    P +           Y S VG+L Y +  TRPDIA+AV  VS+FL  
Subjt:  YFLGLE--ASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFS-------DPTLYRSLVGALQY-LTITRPDIAYAVNSVSQFLHA

Query:  PTADHFLAVKRILRYVKGTLHFGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHL
        P  +H+ AVK ILRY++GT    L F  S+    L  Y+DAD AG  D R+S++GY        +SW +K Q  V+ S+ E+EY A   T  E++W+   
Subjt:  PTADHFLAVKRILRYVKGTLHFGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHL

Query:  LHDLKVPISKQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSN
        L +L +   K+ ++ CD++SAI LS N + H R KH+++ YH++RE+V    L+   + ++   AD+ TK + R  FE  +  + + SN
Subjt:  LHDLKVPISKQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSN

P92519 Uncharacterized mitochondrial protein AtMg008107.8e-6453.98Show/hide
Query:  IYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLY
        +YLLLYVDDI++TG++++L++    +L S F+ KDLG + YFLG++    P GLF+SQ KYA  IL  A +LD KP+ TP+ +  + +   + + DP+ +
Subjt:  IYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLY

Query:  RSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAK
        RS+VGALQYLT+TRPDI+YAVN V Q +H PT   F  +KR+LRYVKGT+  GL +   N    + A+ D+DWAGC  TRRST+G+  +LG N++SWSAK
Subjt:  RSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAK

Query:  KQPTVSRSSCESEYRALATTAAELLW
        +QPTVSRSS E+EYRALA TAAEL W
Subjt:  KQPTVSRSSCESEYRALATTAAELLW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-26737.64Show/hide
Query:  KLSSSNYLLWKSQLLPLLESQDMLGYVDG-TRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLPTARDVWLALETTFSHQSKAR
        KL+S+NYL+W  Q+  L +  ++ G++DG T +PP     + +  +NP Y  W+  D+ +   +L +++      V    TA  +W  L   +++ S   
Subjt:  KLSSSNYLLWKSQLLPLLESQDMLGYVDG-TRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLPTARDVWLALETTFSHQSKAR

Query:  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAESFELFQLSLESSNSTP-TAF
          +L+  L+   +GTK + +Y +      DQL  +G+P++  ++V   L  L  E+        A  + P   ++  +  + E   L++ S+   P TA 
Subjt:  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAESFELFQLSLESSNSTP-TAF

Query:  TVTNRGRTHGSHPASFTNQRGRSYSHKNNSSN-------RGRTHSSQGRRPPH---CQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNT------SCSI
         V++R  T  ++  +    R   Y ++NN++N           H +  +  P+   CQIC  +GH A RC+Q      S ++    +  T      + ++
Subjt:  TVTNRGRTHGSHPASFTNQRGRSYSHKNNSSN-------RGRTHSSQGRRPPH---CQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNT------SCSI

Query:  AGP-DTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPN---IHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLIT
          P  + +W LD+GA+ H+T+D + L   + YTG D V+V +G+++PI+HTG+ S       ++L ++L VP++ KNL+S+ +L +   +SV F      
Subjt:  AGP-DTADWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPN---IHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLIT

Query:  IQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSL--CNTCQLAKSHRLPYSRNERRS
        +++  TG  +  GK    LY     +S  +S   + S +A++  WHARLGH   S+++ +    + SL+ L PS     C+ C + KS+++P+S++   S
Subjt:  IQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSL--CNTCQLAKSHRLPYSRNERRS

Query:  SHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPY
        +  L+ I+ D+W  SP+ S+  + YYVIF+D ++R+TW YPLK KS   + F+ F+  +EN++ +RI  F SD G EF       +    GI H  S P+
Subjt:  SHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPY

Query:  TPAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFL
        TP  NG +ERKHRH+ ETGL LL H+ +   +W  AF+ A Y+INRLPTPLL  +SPF+ L+G +P+YD    FGC  YP+LR Y  +KL  +S  C+FL
Subjt:  TPAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFL

Query:  GYSPAHKGFRCLDPATTKLYITCHAQFDETHFP-----------------------------------------------AIPSSQTQPL--SSIPISNF
        GYS     + CL   T++LYI+ H +FDE  FP                                                 PSS + P   S +  SN 
Subjt:  GYSPAHKGFRCLDPATTKLYITCHAQFDETHFP-----------------------------------------------AIPSSQTQPL--SSIPISNF

Query:  LEPHFHHIDSSP-----------PTTSSPQTP--------RSSSSPCDICSDLVDESVQVDTSLAGSTLPPSTSNSTS----------IEPPVDFSS---
                 SSP           PTT   QT          S ++P +     + +S+      + S+  P+TS S+S          I PP   +    
Subjt:  LEPHFHHIDSSP-----------PTTSSPQTP--------RSSSSPCDICSDLVDESVQVDTSLAGSTLPPSTSNSTS----------IEPPVDFSS---

Query:  ------LGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLV-PRPANTNIVGSKWVFRIKYL
              L TH M TRAKAGI K     +L +        +L A +EP+    A K+  W  AM  EI A   N TW LV P P++  IVG +W+F  KY 
Subjt:  ------LGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLV-PRPANTNIVGSKWVFRIKYL

Query:  PDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTHVCLLKKALYGLKQA
         DGS+ R+KARLVAKGY Q PGLDY +TFSPV+K+T++R+VL +AV   WP+RQLDV NAFL GTL + V+M QPPG+ID   P +VC L+KALYGLKQA
Subjt:  PDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTHVCLLKKALYGLKQA

Query:  PRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDI
        PRAW+    ++LLT+GF  S +DTSLFV  +  +I+Y+L+YVDDI++TGN+ +L+ +    L   F+ KD   L YFLG+EA   P GL +SQ +Y  D+
Subjt:  PRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDI

Query:  LTRAQLLDSKPVHTPMVVSQHLTA-DGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLTFRPSNVPST
        L R  ++ +KPV TPM  S  L+   G+  +DPT YR +VG+LQYL  TRPDI+YAVN +SQF+H PT +H  A+KRILRY+ GT + G+  +  N  S 
Subjt:  LTRAQLLDSKPVHTPMVVSQHLTA-DGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLTFRPSNVPST

Query:  LVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFLSSNPVSHKRA
        L AYSDADWAG  D   ST+GY +YLG++ +SWS+KKQ  V RSS E+EYR++A T++E+ W+  LL +L + +++ P++ CDN  A +L +NPV H R 
Subjt:  LVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFLSSNPVSHKRA

Query:  KHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNP
        KH+ +DYHF+R  V +G LR  +V +H Q+AD  TK +SR  F+ F SK+ V   P
Subjt:  KHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.9e-25837.67Show/hide
Query:  KLSSSNYLLWKSQLLPLLESQDMLGYVDG-TRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLPTARDVWLALETTFSHQSKAR
        KL+S+NYL+W  Q+  L +  ++ G++DG T +PP     +    +NP Y  WR  D+ +   +L +++      V    TA  +W  L   +++ S   
Subjt:  KLSSSNYLLWKSQLLPLLESQDMLGYVDG-TRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLPTARDVWLALETTFSHQSKAR

Query:  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAESFELFQLSLESSNSTP-TAF
          +L+                   F    DQL  +G+P++  ++V   L  L  ++        A  + P   ++  +  + E   L+L S+   P TA 
Subjt:  ELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAESFELFQLSLESSNSTP-TAF

Query:  TVTNRGRTHGSHPASFTNQRG--RSYSHKNNSSNRGRTHSSQGR---RPP-----HCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNT------SCSI
         VT+R      +     N RG  R+Y++ NN SN  +  SS  R   R P      CQIC  +GH A RC Q +    +++   + +  T      + ++
Subjt:  TVTNRGRTHGSHPASFTNQRG--RSYSHKNNSSNRGRTHSSQGR---RPP-----HCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNT------SCSI

Query:  AGPDTA-DWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLD---VLVVPHLTKNLLSISKLTSDFPLSVTFTNNLIT
          P  A +W LD+GA+ H+T+D + L   + YTG D V++ +G+++PITHTG+ S   +   LD   VL VP++ KNL+S+ +L +   +SV F      
Subjt:  AGPDTA-DWFLDTGASAHMTADPSILDQSKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLD---VLVVPHLTKNLLSISKLTSDFPLSVTFTNNLIT

Query:  IQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSL--CNTCQLAKSHRLPYSRNERRS
        +++  TG  +  GK    LY     +S  +S   +   +A++  WH+RLGH + ++++ +    + SL  L PS  L  C+ C + KSH++P+S +   S
Subjt:  IQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRASYDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSL--CNTCQLAKSHRLPYSRNERRS

Query:  SHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPY
        S  L+ I+ D+W  SP+ S   + YYVIF+D ++R+TW YPLK KS   D F+ F+  VEN++ +RI    SD G EF     + +L   GI H  S P+
Subjt:  SHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFLQFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPY

Query:  TPAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFL
        TP  NG +ERKHRH+ E GL LL H+ +   +W  AFS A Y+INRLPTPLL  +SPF+ L+G  P+Y+    FGC  YP+LR Y  +KL  +S  C F+
Subjt:  TPAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYGYTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFL

Query:  GYSPAHKGFRCLDPATTKLYITCHAQFDETHFP------AIPSSQTQPLSSIP--ISNFLEPHFHHIDSSPPTTSS--PQTPRSSSSPCDICSDLVDES-
        GYS     + CL   T +LY + H QFDE  FP       + +SQ Q   S P   S+   P    +  +PP        +PR  SSP  +C+  V  S 
Subjt:  GYSPAHKGFRCLDPATTKLYITCHAQFDETHFP------AIPSSQTQPLSSIP--ISNFLEPHFHHIDSSPPTTSS--PQTPRSSSSPCDICSDLVDES-

Query:  ----------------------------------------------------------------------VQVDTSLAGSTLPPSTSNSTSIEPPV----
                                                                                  TS++    P S+S ST   PPV    
Subjt:  ----------------------------------------------------------------------VQVDTSLAGSTLPPSTSNSTSIEPPV----

Query:  ------DFSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLV-PRPANTNIVGSKWVFR
                + + TH M TRAK GI K     +          ++L A++EP+    A K+  W  AM  EI A   N TW LV P P +  IVG +W+F 
Subjt:  ------DFSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLV-PRPANTNIVGSKWVFR

Query:  IKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTHVCLLKKALYG
         K+  DGS+ R+KARLVAKGY Q PGLDY +TFSPV+K+T++R+VL +AV   WP+RQLDV NAFL GTL + V+M QPPG++D   P +VC L+KA+YG
Subjt:  IKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDPRFPTHVCLLKKALYG

Query:  LKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKY
        LKQAPRAW+    ++LLT+GF  S +DTSLFV  +  +IIY+L+YVDDI++TGN++ L+      L   F+ K+   L YFLG+EA   P GL +SQ +Y
Subjt:  LKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKY

Query:  ARDILTRAQLLDSKPVHTPMVVSQHLTA-DGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLTFRPSN
          D+L R  +L +KPV TPM  S  LT   G+   DPT YR +VG+LQYL  TRPD++YAVN +SQ++H PT DH+ A+KR+LRY+ GT   G+  +  N
Subjt:  ARDILTRAQLLDSKPVHTPMVVSQHLTA-DGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLTFRPSN

Query:  VPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFLSSNPVS
          S L AYSDADWAG  D   ST+GY +YLG++ +SWS+KKQ  V RSS E+EYR++A T++EL W+  LL +L + +S  P++ CDN  A +L +NPV 
Subjt:  VPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFLSSNPVS

Query:  HKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNP
        H R KH+ LDYHF+R  V +G LR  +V +H Q+AD  TK +SR  F+ F  K+ V   P
Subjt:  HKRAKHVELDYHFLRELVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNP

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)7.1e-1227.6Show/hide
Query:  ITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLT-EEAMAVVVGLPTARDVWLALETTFSHQS
        + + +  SNY  W+   L    S D++G++DGT +P            N   + W+  D  +   L  +LT ++     V   T+RD+WL ++  F +  
Subjt:  ITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLT-EEAMAVVVGLPTARDVWLALETTFSHQS

Query:  KARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSK-AESFELFQLSLE------
         AR LRL  +L+    G   VA+Y R  KK+ D L  +  PV D + V + L GL  +F             P F D  +   E  +  + +++      
Subjt:  KARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSK-AESFELFQLSLE------

Query:  --SSNSTPTAFT----VTNRGRTHGSHPASFTNQRGRSYSHKNNSSNRGR
          SS+ST  A +    VTN  R+ G       NQ G     + N+  RGR
Subjt:  --SSNSTPTAFT----VTNRGRTHGSHPASFTNQRGRSYSHKNNSSNRGR

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.5e-11846.03Show/hide
Query:  LSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTV
        L  +  + EP  +  A +   W  AMD+EI A++   TW +   P N   +G KWV++IKY  DG++ER+KARLVAKGYTQ  G+D+ +TFSPV K T+V
Subjt:  LSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTV

Query:  RVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYI----DPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSN
        +++L+I+    + L QLD+ NAFLNG L E ++M+ PPGY     D   P  VC LKK++YGLKQA R WF +FS  L+  GF  S +D + F+    + 
Subjt:  RVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYI----DPRFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSN

Query:  IIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTA-DGSPFSDPT
         + +L+YVDDII+  NN + +D    +L S F  +DLG L YFLGLE + +  G+ I Q KYA D+L    LL  KP   PM  S   +A  G  F D  
Subjt:  IIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTA-DGSPFSDPT

Query:  LYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWS
         YR L+G L YL ITR DI++AVN +SQF  AP   H  AV +IL Y+KGT+  GL F  S     L  +SDA +  C DTRRST+GY ++LG +L+SW 
Subjt:  LYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWS

Query:  AKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRE
        +KKQ  VS+SS E+EYRAL+    E++W+     +L++P+SK  LL CDN +AI +++N V H+R KH+E D H +RE
Subjt:  AKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein1.2e-1451.28Show/hide
Query:  YLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGY
        YLTITRPD+ +AVN +SQF  A       AV ++L YVKGT+  GL F  +     L A++D+DWA CPDTRRS +G+
Subjt:  YLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGY

ATMG00810.1 DNA/RNA polymerases superfamily protein5.5e-6553.98Show/hide
Query:  IYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLY
        +YLLLYVDDI++TG++++L++    +L S F+ KDLG + YFLG++    P GLF+SQ KYA  IL  A +LD KP+ TP+ +  + +   + + DP+ +
Subjt:  IYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFISQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLY

Query:  RSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAK
        RS+VGALQYLT+TRPDI+YAVN V Q +H PT   F  +KR+LRYVKGT+  GL +   N    + A+ D+DWAGC  TRRST+G+  +LG N++SWSAK
Subjt:  RSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLTFRPSNVPSTLVAYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAK

Query:  KQPTVSRSSCESEYRALATTAAELLW
        +QPTVSRSS E+EYRALA TAAEL W
Subjt:  KQPTVSRSSCESEYRALATTAAELLW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.0e-2645.86Show/hide
Query:  MITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARL
        M+TR+KAGI K     +L I              EPK    A K+P W  AM EE+ AL +N TW LVP P N NI+G KWVF+ K   DG+++R KARL
Subjt:  MITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQALQQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARL

Query:  VAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIA
        VAKG+ Q  G+ + +T+SPVV+  T+R +L++A
Subjt:  VAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCGAATCTTCTTATCATCTTCTTCCTTTCAATACTCTAATCCATATGATCACCATCAAACTTTCTTCCTCCAATTATCTTCTTTGGAAAAGCCAACTTCTCCC
TCTTCTTGAGAGTCAAGACATGCTGGGCTATGTCGATGGAACCAGGGTTCCACCACCTCGCTTTGAACCAGAAACCTCTTCAACACTCAACCCCAAATATTTGGCATGGA
GAGCAGCCGATCAACGACTTCTCTGTCTCCTGCTCTCCTCTCTCACTGAGGAAGCCATGGCTGTTGTCGTTGGTCTCCCTACTGCACGTGATGTTTGGCTTGCGTTGGAA
ACTACGTTCAGCCATCAGTCGAAAGCTCGTGAACTGAGACTCAAGGATGACTTGCAGTTGATGAAACGTGGCACCAAACCTGTTGCTGAGTATGCCCGTGCCTTCAAAAA
AATTTGTGACCAACTTCATGCCATTGGCAGACCGGTCGAGGACATTGATAAAGTGCACTGGTTCCTTCGTGGACTCGGCACCGAATTTTCAGCTTTTTCTACTGCTCAGA
TGGCTCTTACCTCTATCCCCTGTTTTGCAGATTTAGTCTCTAAAGCTGAAAGTTTTGAGTTGTTTCAGCTCTCCCTTGAGTCCTCTAACTCCACTCCTACAGCATTCACA
GTCACTAATCGTGGTCGCACCCATGGAAGCCACCCTGCTTCCTTTACCAACCAGCGAGGTCGTTCTTATTCTCACAAAAATAACTCTTCTAATCGAGGACGAACCCACTC
AAGTCAAGGTCGTCGACCACCTCACTGCCAAATATGCCGCAAAGAGGGCCATTATGCTGACCGCTGCAACCAACGGTATGTTCGACCTGATTCTTCTCATGCTCACCTTG
CTGAAGCCTTTAACACGTCATGTTCTATTGCTGGACCCGATACTGCTGATTGGTTTCTGGACACTGGAGCTTCGGCCCATATGACTGCCGACCCATCTATTTTGGATCAG
TCTAAAAATTACACGGGTAAGGACTCTGTGATTGTAGGAAACGGTGCTTCCCTACCCATTACCCACACCGGTACTCTTTCACCTGTTCCAAATATTCACTTGTTAGATGT
CTTGGTTGTCCCTCACCTCACTAAAAATCTTCTTTCCATAAGTAAATTAACGTCTGATTTTCCTCTCTCCGTTACATTTACTAATAATCTTATTACTATCCAGAATCGTC
AAACAGGAAGGGTGGTGGCAACCGGTAAAAGAGATGGAGGGCTATATGTGCTGGAGCGCGGCAACTCTGCTTTTATTTCAGCCCTTAGAAACAAATCTTTACGTGCTTCA
TATGATTTATGGCATGCTCGTCTAGGTCATGTGAATCATTCTGTTATTTCTTTTTTAAATAGAAAAGGTCATCTTTCTCTTACGTCTTTATTGCCTTCTCCATCATTATG
TAATACCTGTCAGCTTGCAAAAAGTCATCGATTGCCTTATTCCCGCAATGAACGTAGGTCGTCTCATGTGTTAGATCTTATTCATTGTGATCTTTGGGGTCCTTCTCCCG
TCAAATCAAATTCGGGTTTCCTTTATTATGTTATTTTTATTGATGATTATTCTCGATTCACTTGGTTTTACCCTTTAAAATTTAAATCTGATTTTTTTGATATTTTTCTT
CAATTTCAAAAATTTGTGGAAAATCAATATTCTTCTCGTATCAAGGTATTTCAAAGTGATGGTGGTACCGAATTTACTAATACTTGTTTCAAAACTCATTTACGTAATTC
TGGCATCCACCATCAACTCTCTTGTCCATATACACCTGCTCAAAATGGTCGTGCTGAGAGAAAACATCGTCATGTGACTGAGACTGGCTTGGCCCTTCTCTTTCACTCTC
ATCTTTCTTCTCGTTTTTGGGTTGACGCCTTCAGCACTGCAGCTTATATTATCAACCGGTTGCCTACTCCACTTCTTGGAGGTAAGTCACCCTTTGAACTCCTTTATGGC
TACACTCCACATTATGACAATTTTCATCCCTTTGGTTGTCGTGTTTATCCTTATTTGCGTGATTATATGCCTAACAAGCTTTCTCCCCGCAGCATTCCTTGTATTTTTTT
GGGTTATAGTCCTGCTCATAAAGGGTTTCGCTGTCTTGATCCGGCCACCACTAAGCTATATATCACCTGTCATGCTCAATTTGATGAAACCCACTTTCCTGCTATCCCTA
GCTCCCAGACCCAACCTCTTTCCTCTATTCCTATTTCAAATTTCTTAGAACCACATTTTCATCATATTGATTCATCCCCCCCTACCACTTCATCACCGCAAACTCCTCGA
TCCAGTTCATCCCCGTGTGATATTTGTTCTGACCTTGTAGATGAGTCTGTGCAGGTTGATACTTCTCTTGCAGGTTCCACTTTGCCACCTTCGACTTCTAATTCGACCTC
TATTGAACCTCCTGTTGATTTCTCTTCTTTGGGCACTCATCCTATGATCACACGAGCCAAAGCTGGTATATTCAAGACTCGTCATCCAGCAAATCTTGGTATTTTGGGCT
CATCTGGACTTCTTTCTGCTCTTCTTGCATCCACTGAGCCAAAAGGATTCAAATCTGCGGCTAAGAATCCTGCTTGGGTGGCTGCCATGGATGAAGAAATTCAAGCATTA
CAACAAAATGATACTTGGACTTTGGTTCCTCGCCCTGCTAACACCAACATCGTGGGCTCTAAATGGGTGTTTCGTATTAAATATTTGCCTGATGGATCTGTCGAGCGTTT
CAAGGCTCGTCTTGTTGCCAAAGGTTATACTCAGGTTCCTGGTCTTGACTACACTGACACTTTCAGTCCGGTTGTCAAAGCTACCACTGTCCGTGTTGTGCTTTCTATTG
CAGTCACAAATAAATGGCCTCTTCGACAACTTGATGTCAAGAATGCTTTTCTCAATGGAACTCTTATTGAACGTGTTCATATGGAACAACCTCCTGGGTATATTGATCCT
CGATTTCCCACTCATGTTTGTCTATTAAAGAAAGCCCTCTATGGTTTAAAGCAAGCTCCTCGTGCTTGGTTTCAGCGTTTTAGCTCATTTCTTCTCACACTTGGGTTTTC
TTGCAGTCGCGCTGACACGTCCCTTTTTGTCTTTCATCAGCAATCTAACATTATCTATTTGCTTCTTTATGTTGATGACATTATTGTTACCGGCAACAACTCATCTCTTA
TTGACAGCTTTACTCGCAAGCTTCATTCTGAGTTTGCTACCAAAGATTTGGGTTCTCTCAGTTACTTTCTTGGTCTTGAAGCTTCACCCACTCCTGATGGTCTCTTTATT
AGTCAGTTAAAATATGCTCGAGATATTCTTACTCGTGCTCAGTTGCTCGATAGCAAACCAGTTCACACTCCTATGGTTGTTTCTCAACACTTGACTGCTGATGGTTCTCC
TTTCTCTGATCCTACTCTCTACAGATCTCTTGTTGGCGCCCTTCAGTACTTGACTATTACGCGTCCAGATATTGCCTATGCTGTCAATTCTGTCAGTCAATTCTTGCATG
CCCCTACTGCAGATCACTTTCTTGCTGTCAAACGTATTCTTCGCTATGTCAAAGGAACACTCCACTTTGGTCTTACCTTTCGTCCATCCAATGTTCCTAGTACGCTAGTC
GCTTATTCGGATGCTGACTGGGCTGGTTGTCCCGATACTCGTCGTTCTACATCCGGCTATTCTATTTATCTTGGTAACAATCTGGTTTCTTGGAGTGCCAAAAAGCAACC
TACTGTCTCACGCTCCAGCTGTGAATCTGAGTATCGTGCTCTTGCCACAACCGCTGCTGAACTTCTTTGGGTTACGCATCTTTTGCATGACCTCAAGGTCCCTATTTCAA
AGCAGCCCTTACTCTTATGTGACAACAAAAGTGCTATTTTTTTGAGCTCTAATCCCGTTTCTCACAAGCGGGCCAAACATGTTGAACTAGATTATCATTTCCTTCGAGAA
CTTGTTATCGCTGGCAAACTTCGCACACAATATGTACCCTCTCATCTCCAAGTTGCTGACATCTTCACAAAGAGTATTTCTCGACCTCTCTTTGAATTTTTCAGATCCAA
GCTTTACGTTCGTTCAAATCCGACGCTCAGCTTGCGGGGGGGTGTTAAGGATAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCGAATCTTCTTATCATCTTCTTCCTTTCAATACTCTAATCCATATGATCACCATCAAACTTTCTTCCTCCAATTATCTTCTTTGGAAAAGCCAACTTCTCCC
TCTTCTTGAGAGTCAAGACATGCTGGGCTATGTCGATGGAACCAGGGTTCCACCACCTCGCTTTGAACCAGAAACCTCTTCAACACTCAACCCCAAATATTTGGCATGGA
GAGCAGCCGATCAACGACTTCTCTGTCTCCTGCTCTCCTCTCTCACTGAGGAAGCCATGGCTGTTGTCGTTGGTCTCCCTACTGCACGTGATGTTTGGCTTGCGTTGGAA
ACTACGTTCAGCCATCAGTCGAAAGCTCGTGAACTGAGACTCAAGGATGACTTGCAGTTGATGAAACGTGGCACCAAACCTGTTGCTGAGTATGCCCGTGCCTTCAAAAA
AATTTGTGACCAACTTCATGCCATTGGCAGACCGGTCGAGGACATTGATAAAGTGCACTGGTTCCTTCGTGGACTCGGCACCGAATTTTCAGCTTTTTCTACTGCTCAGA
TGGCTCTTACCTCTATCCCCTGTTTTGCAGATTTAGTCTCTAAAGCTGAAAGTTTTGAGTTGTTTCAGCTCTCCCTTGAGTCCTCTAACTCCACTCCTACAGCATTCACA
GTCACTAATCGTGGTCGCACCCATGGAAGCCACCCTGCTTCCTTTACCAACCAGCGAGGTCGTTCTTATTCTCACAAAAATAACTCTTCTAATCGAGGACGAACCCACTC
AAGTCAAGGTCGTCGACCACCTCACTGCCAAATATGCCGCAAAGAGGGCCATTATGCTGACCGCTGCAACCAACGGTATGTTCGACCTGATTCTTCTCATGCTCACCTTG
CTGAAGCCTTTAACACGTCATGTTCTATTGCTGGACCCGATACTGCTGATTGGTTTCTGGACACTGGAGCTTCGGCCCATATGACTGCCGACCCATCTATTTTGGATCAG
TCTAAAAATTACACGGGTAAGGACTCTGTGATTGTAGGAAACGGTGCTTCCCTACCCATTACCCACACCGGTACTCTTTCACCTGTTCCAAATATTCACTTGTTAGATGT
CTTGGTTGTCCCTCACCTCACTAAAAATCTTCTTTCCATAAGTAAATTAACGTCTGATTTTCCTCTCTCCGTTACATTTACTAATAATCTTATTACTATCCAGAATCGTC
AAACAGGAAGGGTGGTGGCAACCGGTAAAAGAGATGGAGGGCTATATGTGCTGGAGCGCGGCAACTCTGCTTTTATTTCAGCCCTTAGAAACAAATCTTTACGTGCTTCA
TATGATTTATGGCATGCTCGTCTAGGTCATGTGAATCATTCTGTTATTTCTTTTTTAAATAGAAAAGGTCATCTTTCTCTTACGTCTTTATTGCCTTCTCCATCATTATG
TAATACCTGTCAGCTTGCAAAAAGTCATCGATTGCCTTATTCCCGCAATGAACGTAGGTCGTCTCATGTGTTAGATCTTATTCATTGTGATCTTTGGGGTCCTTCTCCCG
TCAAATCAAATTCGGGTTTCCTTTATTATGTTATTTTTATTGATGATTATTCTCGATTCACTTGGTTTTACCCTTTAAAATTTAAATCTGATTTTTTTGATATTTTTCTT
CAATTTCAAAAATTTGTGGAAAATCAATATTCTTCTCGTATCAAGGTATTTCAAAGTGATGGTGGTACCGAATTTACTAATACTTGTTTCAAAACTCATTTACGTAATTC
TGGCATCCACCATCAACTCTCTTGTCCATATACACCTGCTCAAAATGGTCGTGCTGAGAGAAAACATCGTCATGTGACTGAGACTGGCTTGGCCCTTCTCTTTCACTCTC
ATCTTTCTTCTCGTTTTTGGGTTGACGCCTTCAGCACTGCAGCTTATATTATCAACCGGTTGCCTACTCCACTTCTTGGAGGTAAGTCACCCTTTGAACTCCTTTATGGC
TACACTCCACATTATGACAATTTTCATCCCTTTGGTTGTCGTGTTTATCCTTATTTGCGTGATTATATGCCTAACAAGCTTTCTCCCCGCAGCATTCCTTGTATTTTTTT
GGGTTATAGTCCTGCTCATAAAGGGTTTCGCTGTCTTGATCCGGCCACCACTAAGCTATATATCACCTGTCATGCTCAATTTGATGAAACCCACTTTCCTGCTATCCCTA
GCTCCCAGACCCAACCTCTTTCCTCTATTCCTATTTCAAATTTCTTAGAACCACATTTTCATCATATTGATTCATCCCCCCCTACCACTTCATCACCGCAAACTCCTCGA
TCCAGTTCATCCCCGTGTGATATTTGTTCTGACCTTGTAGATGAGTCTGTGCAGGTTGATACTTCTCTTGCAGGTTCCACTTTGCCACCTTCGACTTCTAATTCGACCTC
TATTGAACCTCCTGTTGATTTCTCTTCTTTGGGCACTCATCCTATGATCACACGAGCCAAAGCTGGTATATTCAAGACTCGTCATCCAGCAAATCTTGGTATTTTGGGCT
CATCTGGACTTCTTTCTGCTCTTCTTGCATCCACTGAGCCAAAAGGATTCAAATCTGCGGCTAAGAATCCTGCTTGGGTGGCTGCCATGGATGAAGAAATTCAAGCATTA
CAACAAAATGATACTTGGACTTTGGTTCCTCGCCCTGCTAACACCAACATCGTGGGCTCTAAATGGGTGTTTCGTATTAAATATTTGCCTGATGGATCTGTCGAGCGTTT
CAAGGCTCGTCTTGTTGCCAAAGGTTATACTCAGGTTCCTGGTCTTGACTACACTGACACTTTCAGTCCGGTTGTCAAAGCTACCACTGTCCGTGTTGTGCTTTCTATTG
CAGTCACAAATAAATGGCCTCTTCGACAACTTGATGTCAAGAATGCTTTTCTCAATGGAACTCTTATTGAACGTGTTCATATGGAACAACCTCCTGGGTATATTGATCCT
CGATTTCCCACTCATGTTTGTCTATTAAAGAAAGCCCTCTATGGTTTAAAGCAAGCTCCTCGTGCTTGGTTTCAGCGTTTTAGCTCATTTCTTCTCACACTTGGGTTTTC
TTGCAGTCGCGCTGACACGTCCCTTTTTGTCTTTCATCAGCAATCTAACATTATCTATTTGCTTCTTTATGTTGATGACATTATTGTTACCGGCAACAACTCATCTCTTA
TTGACAGCTTTACTCGCAAGCTTCATTCTGAGTTTGCTACCAAAGATTTGGGTTCTCTCAGTTACTTTCTTGGTCTTGAAGCTTCACCCACTCCTGATGGTCTCTTTATT
AGTCAGTTAAAATATGCTCGAGATATTCTTACTCGTGCTCAGTTGCTCGATAGCAAACCAGTTCACACTCCTATGGTTGTTTCTCAACACTTGACTGCTGATGGTTCTCC
TTTCTCTGATCCTACTCTCTACAGATCTCTTGTTGGCGCCCTTCAGTACTTGACTATTACGCGTCCAGATATTGCCTATGCTGTCAATTCTGTCAGTCAATTCTTGCATG
CCCCTACTGCAGATCACTTTCTTGCTGTCAAACGTATTCTTCGCTATGTCAAAGGAACACTCCACTTTGGTCTTACCTTTCGTCCATCCAATGTTCCTAGTACGCTAGTC
GCTTATTCGGATGCTGACTGGGCTGGTTGTCCCGATACTCGTCGTTCTACATCCGGCTATTCTATTTATCTTGGTAACAATCTGGTTTCTTGGAGTGCCAAAAAGCAACC
TACTGTCTCACGCTCCAGCTGTGAATCTGAGTATCGTGCTCTTGCCACAACCGCTGCTGAACTTCTTTGGGTTACGCATCTTTTGCATGACCTCAAGGTCCCTATTTCAA
AGCAGCCCTTACTCTTATGTGACAACAAAAGTGCTATTTTTTTGAGCTCTAATCCCGTTTCTCACAAGCGGGCCAAACATGTTGAACTAGATTATCATTTCCTTCGAGAA
CTTGTTATCGCTGGCAAACTTCGCACACAATATGTACCCTCTCATCTCCAAGTTGCTGACATCTTCACAAAGAGTATTTCTCGACCTCTCTTTGAATTTTTCAGATCCAA
GCTTTACGTTCGTTCAAATCCGACGCTCAGCTTGCGGGGGGGTGTTAAGGATAGTTGA
Protein sequenceShow/hide protein sequence
MASESSYHLLPFNTLIHMITIKLSSSNYLLWKSQLLPLLESQDMLGYVDGTRVPPPRFEPETSSTLNPKYLAWRAADQRLLCLLLSSLTEEAMAVVVGLPTARDVWLALE
TTFSHQSKARELRLKDDLQLMKRGTKPVAEYARAFKKICDQLHAIGRPVEDIDKVHWFLRGLGTEFSAFSTAQMALTSIPCFADLVSKAESFELFQLSLESSNSTPTAFT
VTNRGRTHGSHPASFTNQRGRSYSHKNNSSNRGRTHSSQGRRPPHCQICRKEGHYADRCNQRYVRPDSSHAHLAEAFNTSCSIAGPDTADWFLDTGASAHMTADPSILDQ
SKNYTGKDSVIVGNGASLPITHTGTLSPVPNIHLLDVLVVPHLTKNLLSISKLTSDFPLSVTFTNNLITIQNRQTGRVVATGKRDGGLYVLERGNSAFISALRNKSLRAS
YDLWHARLGHVNHSVISFLNRKGHLSLTSLLPSPSLCNTCQLAKSHRLPYSRNERRSSHVLDLIHCDLWGPSPVKSNSGFLYYVIFIDDYSRFTWFYPLKFKSDFFDIFL
QFQKFVENQYSSRIKVFQSDGGTEFTNTCFKTHLRNSGIHHQLSCPYTPAQNGRAERKHRHVTETGLALLFHSHLSSRFWVDAFSTAAYIINRLPTPLLGGKSPFELLYG
YTPHYDNFHPFGCRVYPYLRDYMPNKLSPRSIPCIFLGYSPAHKGFRCLDPATTKLYITCHAQFDETHFPAIPSSQTQPLSSIPISNFLEPHFHHIDSSPPTTSSPQTPR
SSSSPCDICSDLVDESVQVDTSLAGSTLPPSTSNSTSIEPPVDFSSLGTHPMITRAKAGIFKTRHPANLGILGSSGLLSALLASTEPKGFKSAAKNPAWVAAMDEEIQAL
QQNDTWTLVPRPANTNIVGSKWVFRIKYLPDGSVERFKARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSIAVTNKWPLRQLDVKNAFLNGTLIERVHMEQPPGYIDP
RFPTHVCLLKKALYGLKQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFHQQSNIIYLLLYVDDIIVTGNNSSLIDSFTRKLHSEFATKDLGSLSYFLGLEASPTPDGLFI
SQLKYARDILTRAQLLDSKPVHTPMVVSQHLTADGSPFSDPTLYRSLVGALQYLTITRPDIAYAVNSVSQFLHAPTADHFLAVKRILRYVKGTLHFGLTFRPSNVPSTLV
AYSDADWAGCPDTRRSTSGYSIYLGNNLVSWSAKKQPTVSRSSCESEYRALATTAAELLWVTHLLHDLKVPISKQPLLLCDNKSAIFLSSNPVSHKRAKHVELDYHFLRE
LVIAGKLRTQYVPSHLQVADIFTKSISRPLFEFFRSKLYVRSNPTLSLRGGVKDS