; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G03320 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G03320
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr7:2648795..2652175
RNA-Seq ExpressionCSPI07G03320
SyntenyCSPI07G03320
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037745.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.0e+0082.35Show/hide
Query:  SKDHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGH
        SKDHLRLIKVLMGLRPEYESVRAALLHR+PLPSLDAAIQEILFEE+RLGIN +K SD VLASTY+P   ++ FCKNCKL+GHKF NCPKIECRYCHK GH
Subjt:  SKDHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGH

Query:  ILDNCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSL--IQISDLQSLLNQLISS-SSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPI
        ILDNCP +PPRP   ST+ K FTK  +SS      D S++   QISDLQSLLNQLISS SSALAVS GNRWLLDS CCNHMTSD SLM+T SPTKSLPPI
Subjt:  ILDNCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSL--IQISDLQSLLNQLISS-SSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPI

Query:  YAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTY
        YAADGNCMNI+H GTI+TPS++LPHTYCVPNLTFNLVSVGQLCDLG  VSFS NGCQVQDPQTGQTIGTGRKVGRLFE+ SL+V SP SISA VTDSDTY
Subjt:  YAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTY

Query:  QWHLRLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS
        QWHLRLGHAS EKLRHLIS+NNL ++TKFVPFNCLNCKLAKQPALSFS S S CDKPFDL+HSDIWGPAP +TVHGYRYYVLFIDD+SRFTWIYFLKHRS
Subjt:  QWHLRLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS

Query:  ELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR
        ELSRTYIEFANMIRTQFS PIK LRTDN LEYKDS LLSFLSQQGT+VQRSCPH SQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR
Subjt:  ELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR

Query:  LPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSP
        LPSSVLQN SPFE+LYG  P+YS LKVFG ACFVLL PHEH KLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFH SFSSP
Subjt:  LPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSP

Query:  QSFFTNTSVDLFPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQ
        QSFFT+TS+DLFPLSE T   ELAQS+P +A  D  S+SD  P+ PP  P RRSTRVREPP HL DYHCFSTIVSL+EPTSYQEAST+P+W+KAM++ELQ
Subjt:  QSFFTNTSVDLFPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQ

Query:  ALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEE
        ALEK HTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKN FLNG LSEE
Subjt:  ALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEE

Query:  VYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDL
        VYMKPP GTS PP+KVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSS HD ALFTR T  GIVLLLLYVDDMIITGND  AISDLQ YLGQHFEMKDL
Subjt:  VYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDL

Query:  GSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIH
        GSLNYFLGLEVS RSDGYLLSQAKYASDL+ARS IT+S T+STPLDP+VHLT +DG+PL++ SLYRQLVGSL+YLTVTRPDIAY V+IVSQFM APRTIH
Subjt:  GSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIH

Query:  FTVVLRILRYLKGTLGHGLH-------------------------------------------KKRSVISRSSTESEYRALADATAELIWLRWLLADMGV
        FT VLRILRY+KGTLGHGL                                            KK+SV+SRSSTESEYRALADATAEL+WLRWLLADMGV
Subjt:  FTVVLRILRYLKGTLGHGLH-------------------------------------------KKRSVISRSSTESEYRALADATAELIWLRWLLADMGV

Query:  PQQGPTLLHCDNRSGI
        PQQGPTLLHCDNRS I
Subjt:  PQQGPTLLHCDNRSGI

KAA0041601.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.0e+0082.44Show/hide
Query:  SKDHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGH
        SKDHLRLIKVLMGLRPEYESVRAALLHR+PLPSLDAAIQEILFEE+RLGIN +K SD VLASTY+P   ++ FCKNCKL+GHKF NCPKIECRYCHK GH
Subjt:  SKDHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGH

Query:  ILDNCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSL--IQISDLQSLLNQLISS-SSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPI
        ILDNCP +PPRP   ST+ K FTK  +SS      D S++   QISDLQSLLNQLISS SSALAVS GNRWLLDS CCNHMTSD SLM+T SPTKSLPPI
Subjt:  ILDNCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSL--IQISDLQSLLNQLISS-SSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPI

Query:  YAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTY
        YAADGNCMNI+H GTI+TPS++LPHTYCVPNLTFNLVSVGQLCDLG  VSFS NGCQVQDPQTGQTIGTGRKVGRLFEL SL+V SP SISA VTDSDTY
Subjt:  YAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTY

Query:  QWHLRLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS
        QWHLRLGHAS EKLRHLIS+NNL ++TKFVPFNCLNCKLAKQPALSFS S S CDKPFDL+HSDIWGPAP +TVHGYRYYVLFIDD+SRFTWIYFLKHRS
Subjt:  QWHLRLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS

Query:  ELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR
        ELSRTYIEFANMIRTQFS PIK LRTDN LEYKDS LLSFLSQQGT+VQRSCPH SQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR
Subjt:  ELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR

Query:  LPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSP
        LPSSVLQN SPFE+LYG  P+YS LKVFG ACFVLL PHEH KLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFH SFSSP
Subjt:  LPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSP

Query:  QSFFTNTSVDLFPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQ
        QSFFT+TS+DLFPLSE T   ELAQS+P +A  D  S+SD  P+ PP  P RRSTRVREPP HL DYHCFSTIVSL+EPTSYQEAST+P+W+KAM++ELQ
Subjt:  QSFFTNTSVDLFPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQ

Query:  ALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEE
        ALEK HTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKN FLNG LSEE
Subjt:  ALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEE

Query:  VYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDL
        VYMKPP GTS PP+KVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSS HD ALFTR T  GIVLLLLYVDDMIITGND  AISDLQ YLGQHFEMKDL
Subjt:  VYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDL

Query:  GSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIH
        GSLNYFLGLEVS RSDGYLLSQAKYASDL+ARS IT+S T+STPLDP+VHLT +DG+PL++ SLYRQLVGSL+YLTVTRPDIAY V+IVSQFM APRTIH
Subjt:  GSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIH

Query:  FTVVLRILRYLKGTLGHGLH-------------------------------------------KKRSVISRSSTESEYRALADATAELIWLRWLLADMGV
        FT VLRILRY+KGTLGHGL                                            KK+SV+SRSSTESEYRALADATAEL+WLRWLLADMGV
Subjt:  FTVVLRILRYLKGTLGHGLH-------------------------------------------KKRSVISRSSTESEYRALADATAELIWLRWLLADMGV

Query:  PQQGPTLLHCDNRSGI
        PQQGPTLLHCDNRS I
Subjt:  PQQGPTLLHCDNRSGI

TYK04714.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.0e+0082.44Show/hide
Query:  SKDHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGH
        SKDHLRLIKVLMGLRPEYESVRAALLHR+PLPSLDAAIQEILFEE+RLGIN +K SD VLASTY+P   ++ FCKNCKL+GHKF NCPKIECRYCHK GH
Subjt:  SKDHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGH

Query:  ILDNCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSL--IQISDLQSLLNQLISS-SSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPI
        ILDNCP +PPRP   ST+ K FTK  +SS      D S++   QISDLQSLLNQLISS SSALAVS GNRWLLDS CCNHMTSD SLM+T SPTKSLPPI
Subjt:  ILDNCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSL--IQISDLQSLLNQLISS-SSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPI

Query:  YAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTY
        YAADGNCMNI+H GTI+TPS++LPHTYCVPNLTFNLVSVGQLCDLG  VSFS NGCQVQDPQTGQTIGTGRKVGRLFEL SL+V SP SISA VTDSDTY
Subjt:  YAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTY

Query:  QWHLRLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS
        QWHLRLGHAS EKLRHLIS+NNL ++TKFVPFNCLNCKLAKQPALSFS S S CDKPFDL+HSDIWGPAP +TVHGYRYYVLFIDD+SRFTWIYFLKHRS
Subjt:  QWHLRLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS

Query:  ELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR
        ELSRTYIEFANMIRTQFS PIK LRTDN LEYKDS LLSFLSQQGT+VQRSCPH SQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR
Subjt:  ELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR

Query:  LPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSP
        LPSSVLQN SPFE+LYG  P+YS LKVFG ACFVLL PHEH KLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFH SFSSP
Subjt:  LPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSP

Query:  QSFFTNTSVDLFPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQ
        QSFFT+TS+DLFPLSE T   ELAQS+P +A  D  S+SD  P+ PP  P RRSTRVREPP HL DYHCFSTIVSL+EPTSYQEAST+P+W+KAM++ELQ
Subjt:  QSFFTNTSVDLFPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQ

Query:  ALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEE
        ALEK HTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKN FLNG LSEE
Subjt:  ALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEE

Query:  VYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDL
        VYMKPP GTS PP+KVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSS HD ALFTR T  GIVLLLLYVDDMIITGND  AISDLQ YLGQHFEMKDL
Subjt:  VYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDL

Query:  GSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIH
        GSLNYFLGLEVS RSDGYLLSQAKYASDL+ARS IT+S T+STPLDP+VHLT +DG+PL++ SLYRQLVGSL+YLTVTRPDIAY V+IVSQFM APRTIH
Subjt:  GSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIH

Query:  FTVVLRILRYLKGTLGHGLH-------------------------------------------KKRSVISRSSTESEYRALADATAELIWLRWLLADMGV
        FT VLRILRY+KGTLGHGL                                            KK+SV+SRSSTESEYRALADATAEL+WLRWLLADMGV
Subjt:  FTVVLRILRYLKGTLGHGLH-------------------------------------------KKRSVISRSSTESEYRALADATAELIWLRWLLADMGV

Query:  PQQGPTLLHCDNRSGI
        PQQGPTLLHCDNRS I
Subjt:  PQQGPTLLHCDNRSGI

TYK19656.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.0e+0082.14Show/hide
Query:  SKDHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGH
        SKDHLRLIKVLMGLRPEYESVRAALLHR+PLPSLDAAIQEILFEE+RLGIN +K SD VLASTY+P   ++ FCKNCKL+GHKF NCPKIECRYCHK GH
Subjt:  SKDHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGH

Query:  ILDNCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSL--IQISDLQSLLNQLISS-SSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPI
        ILDNCP +PPRP   ST+ K FTK  +SS      D S++   QISDLQSLLNQLISS SSALAVS GNRWLLDS CCNHMTSD SLM+T SPTKSLPPI
Subjt:  ILDNCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSL--IQISDLQSLLNQLISS-SSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPI

Query:  YAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTY
        YAADGNCMNI+H GTI+TPS++LPHTYCVPNLTFNLVSVGQLCDLG  VSFS NGCQVQDPQTGQTIGTGRKVGRLFEL SL+V SP SISA VTDSDTY
Subjt:  YAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTY

Query:  QWHLRLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS
        QWHLRLGHAS EKLRHLIS+NNL ++TKFVPFNCLNCKLAKQPALSFS S S CDKPFDL+HSDIWGPAP +TVHGYRYYVLFIDD+SRFTWIYFLKHRS
Subjt:  QWHLRLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS

Query:  ELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR
        ELSRTYIEFANMIRTQFS PIK LRTDN LEYKDS LLSFLSQQGT+VQRSCPH SQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR
Subjt:  ELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR

Query:  LPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSP
        LPSSVLQN SPFE+LYG  P+YS LKVFG ACFVLL PHEH KLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFH SFSSP
Subjt:  LPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSP

Query:  QSFFTNTSVDLFPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQ
        QSFFT+TS+DLFPLSE T   ELAQS+P +A  D  S+SD  P+ PP  P RRSTRVREPP HL DYHCFSTIVSL+EPTSYQEAST+P+W+KAM++ELQ
Subjt:  QSFFTNTSVDLFPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQ

Query:  ALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEE
        ALEK HTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKN FLNG LSEE
Subjt:  ALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEE

Query:  VYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDL
        VYMKPP GTS PP+KVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSS HD ALFTR T  GIVLLLLYVDDMIITGND  AISDLQ YLGQHFEMKDL
Subjt:  VYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDL

Query:  GSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIH
        GSLNYFLGLEVS RSDGYLLSQAKYASDL+ARS IT+S T+STPLDP+VHLT +DG+PL++ SLYRQLVGSL+YLTVTRPDIAY V+I +       +  
Subjt:  GSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIH

Query:  FTVVLR-------------------ILRYLKGTLGHGLHKKRSVISRSSTESEYRALADATAELIWLRWLLADMGVPQQGPTLLHCDNRSGI
         ++VL                       YL  +L     KK+SV+SRSSTESEYRALADATAEL+WLRWLLADMGVPQQGPTLLHCDNRS I
Subjt:  FTVVLR-------------------ILRYLKGTLGHGLHKKRSVISRSSTESEYRALADATAELIWLRWLLADMGVPQQGPTLLHCDNRSGI

TYK21532.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.0e+0081.99Show/hide
Query:  MGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGHILDNCPTRPPR
        MGL PEYESVRAALLHR+PLPSL AAIQEILFEE+RLGIN +K SD VLASTY+P   ++ FCKNCKL+GHKF NCPKIECRYCHK GHILDNCP +PPR
Subjt:  MGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGHILDNCPTRPPR

Query:  PPGTSTKEKIFTKHGSSSVVAATSDDSSL--IQISDLQSLLNQLISS-SSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPIYAADGNCMNIS
        P   ST+ K FTK  +SS      D+S+    QISDLQSLLNQLISS SSALAVS GNRWLLDS CCNHMTS+ SLM+T SPTKSLPPIYAADGNCMNI+
Subjt:  PPGTSTKEKIFTKHGSSSVVAATSDDSSL--IQISDLQSLLNQLISS-SSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPIYAADGNCMNIS

Query:  HTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTYQWHLRLGHASS
        HTGTI+TPS++LPHTYCVPNLTFNLVSVGQLCDL L VSFS NGCQVQDPQTGQTIGTGRKVGRLFEL SL+V SP SISA VTDSDTYQWHLRLGHAS 
Subjt:  HTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTYQWHLRLGHASS

Query:  EKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFAN
        EKLRHLIS+NNL ++TKFVPFNCLNCKLAKQPALSFS+S S CDKPFDL+HSDIWGPAP TTVHGYRYYVLFIDD+SRFTWIYFLKHRSELS TYIEFAN
Subjt:  EKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFAN

Query:  MIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINRLPSSVLQNTSP
        MIRTQFS PIK LRTDN  EYKDS LLSFLSQQGT+VQRSCPH SQQNGRAERKH HILDSVRALLLSASCPEKFWGEAALTSVYTINRLPS VLQN SP
Subjt:  MIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINRLPSSVLQNTSP

Query:  FEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSPQSFFTNTSVDL
        FEKLYG  P+YS LKVFG ACFVLLHPHEH KLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSPQSFFT+TS+DL
Subjt:  FEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSPQSFFTNTSVDL

Query:  FPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQALEKTHTWDYV
        FPL E T D ELAQS+P +A  D  S+SD  P+  P  P RRSTRVR PP HL DYHCFSTIVSL+EPTSYQEAST+P+W+KAM++ELQALEKTHTWDYV
Subjt:  FPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQALEKTHTWDYV

Query:  DLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEEVYMKPPQGTSP
        DLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQE+GIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKN FLNG LSEEVYMKPP GTS 
Subjt:  DLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEEVYMKPPQGTSP

Query:  PPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEV
        PP+KVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSS HD ALFTR T  GIVLLLLYVDDMIITGND QAISDLQ YLGQHFEMKDLGSLNYFLGLEV
Subjt:  PPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEV

Query:  SHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIHFTVVLRILRYL
        S RSDGYLLSQAKYASDL+ARS IT+S T+STPLDP+VHLT +DG+PL++ SLYRQLVGSL+YLTVTRPDIAY V+IVSQFM APRTIHFT VLRILRY+
Subjt:  SHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIHFTVVLRILRYL

Query:  KGTLGHGLH-------------------------------------------KKRSVISRSSTESEYRALADATAELIWLRWLLADMGVPQQGPTLLHCD
        KGTLGHGL                                            KK+SV+SRSSTESEYRALADATAEL+WLRWLLADMGVPQQGPTLLHCD
Subjt:  KGTLGHGLH-------------------------------------------KKRSVISRSSTESEYRALADATAELIWLRWLLADMGVPQQGPTLLHCD

Query:  NRSGI
        NRS I
Subjt:  NRSGI

TrEMBL top hitse value%identityAlignment
A0A5A7T8F2 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0082.35Show/hide
Query:  SKDHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGH
        SKDHLRLIKVLMGLRPEYESVRAALLHR+PLPSLDAAIQEILFEE+RLGIN +K SD VLASTY+P   ++ FCKNCKL+GHKF NCPKIECRYCHK GH
Subjt:  SKDHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGH

Query:  ILDNCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSL--IQISDLQSLLNQLISS-SSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPI
        ILDNCP +PPRP   ST+ K FTK  +SS      D S++   QISDLQSLLNQLISS SSALAVS GNRWLLDS CCNHMTSD SLM+T SPTKSLPPI
Subjt:  ILDNCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSL--IQISDLQSLLNQLISS-SSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPI

Query:  YAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTY
        YAADGNCMNI+H GTI+TPS++LPHTYCVPNLTFNLVSVGQLCDLG  VSFS NGCQVQDPQTGQTIGTGRKVGRLFE+ SL+V SP SISA VTDSDTY
Subjt:  YAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTY

Query:  QWHLRLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS
        QWHLRLGHAS EKLRHLIS+NNL ++TKFVPFNCLNCKLAKQPALSFS S S CDKPFDL+HSDIWGPAP +TVHGYRYYVLFIDD+SRFTWIYFLKHRS
Subjt:  QWHLRLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS

Query:  ELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR
        ELSRTYIEFANMIRTQFS PIK LRTDN LEYKDS LLSFLSQQGT+VQRSCPH SQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR
Subjt:  ELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR

Query:  LPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSP
        LPSSVLQN SPFE+LYG  P+YS LKVFG ACFVLL PHEH KLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFH SFSSP
Subjt:  LPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSP

Query:  QSFFTNTSVDLFPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQ
        QSFFT+TS+DLFPLSE T   ELAQS+P +A  D  S+SD  P+ PP  P RRSTRVREPP HL DYHCFSTIVSL+EPTSYQEAST+P+W+KAM++ELQ
Subjt:  QSFFTNTSVDLFPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQ

Query:  ALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEE
        ALEK HTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKN FLNG LSEE
Subjt:  ALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEE

Query:  VYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDL
        VYMKPP GTS PP+KVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSS HD ALFTR T  GIVLLLLYVDDMIITGND  AISDLQ YLGQHFEMKDL
Subjt:  VYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDL

Query:  GSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIH
        GSLNYFLGLEVS RSDGYLLSQAKYASDL+ARS IT+S T+STPLDP+VHLT +DG+PL++ SLYRQLVGSL+YLTVTRPDIAY V+IVSQFM APRTIH
Subjt:  GSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIH

Query:  FTVVLRILRYLKGTLGHGLH-------------------------------------------KKRSVISRSSTESEYRALADATAELIWLRWLLADMGV
        FT VLRILRY+KGTLGHGL                                            KK+SV+SRSSTESEYRALADATAEL+WLRWLLADMGV
Subjt:  FTVVLRILRYLKGTLGHGLH-------------------------------------------KKRSVISRSSTESEYRALADATAELIWLRWLLADMGV

Query:  PQQGPTLLHCDNRSGI
        PQQGPTLLHCDNRS I
Subjt:  PQQGPTLLHCDNRSGI

A0A5A7VIT8 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0082.44Show/hide
Query:  SKDHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGH
        SKDHLRLIKVLMGLRPEYESVRAALLHR+PLPSLDAAIQEILFEE+RLGIN +K SD VLASTY+P   ++ FCKNCKL+GHKF NCPKIECRYCHK GH
Subjt:  SKDHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGH

Query:  ILDNCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSL--IQISDLQSLLNQLISS-SSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPI
        ILDNCP +PPRP   ST+ K FTK  +SS      D S++   QISDLQSLLNQLISS SSALAVS GNRWLLDS CCNHMTSD SLM+T SPTKSLPPI
Subjt:  ILDNCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSL--IQISDLQSLLNQLISS-SSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPI

Query:  YAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTY
        YAADGNCMNI+H GTI+TPS++LPHTYCVPNLTFNLVSVGQLCDLG  VSFS NGCQVQDPQTGQTIGTGRKVGRLFEL SL+V SP SISA VTDSDTY
Subjt:  YAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTY

Query:  QWHLRLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS
        QWHLRLGHAS EKLRHLIS+NNL ++TKFVPFNCLNCKLAKQPALSFS S S CDKPFDL+HSDIWGPAP +TVHGYRYYVLFIDD+SRFTWIYFLKHRS
Subjt:  QWHLRLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS

Query:  ELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR
        ELSRTYIEFANMIRTQFS PIK LRTDN LEYKDS LLSFLSQQGT+VQRSCPH SQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR
Subjt:  ELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR

Query:  LPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSP
        LPSSVLQN SPFE+LYG  P+YS LKVFG ACFVLL PHEH KLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFH SFSSP
Subjt:  LPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSP

Query:  QSFFTNTSVDLFPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQ
        QSFFT+TS+DLFPLSE T   ELAQS+P +A  D  S+SD  P+ PP  P RRSTRVREPP HL DYHCFSTIVSL+EPTSYQEAST+P+W+KAM++ELQ
Subjt:  QSFFTNTSVDLFPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQ

Query:  ALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEE
        ALEK HTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKN FLNG LSEE
Subjt:  ALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEE

Query:  VYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDL
        VYMKPP GTS PP+KVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSS HD ALFTR T  GIVLLLLYVDDMIITGND  AISDLQ YLGQHFEMKDL
Subjt:  VYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDL

Query:  GSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIH
        GSLNYFLGLEVS RSDGYLLSQAKYASDL+ARS IT+S T+STPLDP+VHLT +DG+PL++ SLYRQLVGSL+YLTVTRPDIAY V+IVSQFM APRTIH
Subjt:  GSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIH

Query:  FTVVLRILRYLKGTLGHGLH-------------------------------------------KKRSVISRSSTESEYRALADATAELIWLRWLLADMGV
        FT VLRILRY+KGTLGHGL                                            KK+SV+SRSSTESEYRALADATAEL+WLRWLLADMGV
Subjt:  FTVVLRILRYLKGTLGHGLH-------------------------------------------KKRSVISRSSTESEYRALADATAELIWLRWLLADMGV

Query:  PQQGPTLLHCDNRSGI
        PQQGPTLLHCDNRS I
Subjt:  PQQGPTLLHCDNRSGI

A0A5D3C0D7 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0082.44Show/hide
Query:  SKDHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGH
        SKDHLRLIKVLMGLRPEYESVRAALLHR+PLPSLDAAIQEILFEE+RLGIN +K SD VLASTY+P   ++ FCKNCKL+GHKF NCPKIECRYCHK GH
Subjt:  SKDHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGH

Query:  ILDNCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSL--IQISDLQSLLNQLISS-SSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPI
        ILDNCP +PPRP   ST+ K FTK  +SS      D S++   QISDLQSLLNQLISS SSALAVS GNRWLLDS CCNHMTSD SLM+T SPTKSLPPI
Subjt:  ILDNCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSL--IQISDLQSLLNQLISS-SSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPI

Query:  YAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTY
        YAADGNCMNI+H GTI+TPS++LPHTYCVPNLTFNLVSVGQLCDLG  VSFS NGCQVQDPQTGQTIGTGRKVGRLFEL SL+V SP SISA VTDSDTY
Subjt:  YAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTY

Query:  QWHLRLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS
        QWHLRLGHAS EKLRHLIS+NNL ++TKFVPFNCLNCKLAKQPALSFS S S CDKPFDL+HSDIWGPAP +TVHGYRYYVLFIDD+SRFTWIYFLKHRS
Subjt:  QWHLRLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS

Query:  ELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR
        ELSRTYIEFANMIRTQFS PIK LRTDN LEYKDS LLSFLSQQGT+VQRSCPH SQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR
Subjt:  ELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR

Query:  LPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSP
        LPSSVLQN SPFE+LYG  P+YS LKVFG ACFVLL PHEH KLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFH SFSSP
Subjt:  LPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSP

Query:  QSFFTNTSVDLFPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQ
        QSFFT+TS+DLFPLSE T   ELAQS+P +A  D  S+SD  P+ PP  P RRSTRVREPP HL DYHCFSTIVSL+EPTSYQEAST+P+W+KAM++ELQ
Subjt:  QSFFTNTSVDLFPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQ

Query:  ALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEE
        ALEK HTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKN FLNG LSEE
Subjt:  ALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEE

Query:  VYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDL
        VYMKPP GTS PP+KVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSS HD ALFTR T  GIVLLLLYVDDMIITGND  AISDLQ YLGQHFEMKDL
Subjt:  VYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDL

Query:  GSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIH
        GSLNYFLGLEVS RSDGYLLSQAKYASDL+ARS IT+S T+STPLDP+VHLT +DG+PL++ SLYRQLVGSL+YLTVTRPDIAY V+IVSQFM APRTIH
Subjt:  GSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIH

Query:  FTVVLRILRYLKGTLGHGLH-------------------------------------------KKRSVISRSSTESEYRALADATAELIWLRWLLADMGV
        FT VLRILRY+KGTLGHGL                                            KK+SV+SRSSTESEYRALADATAEL+WLRWLLADMGV
Subjt:  FTVVLRILRYLKGTLGHGLH-------------------------------------------KKRSVISRSSTESEYRALADATAELIWLRWLLADMGV

Query:  PQQGPTLLHCDNRSGI
        PQQGPTLLHCDNRS I
Subjt:  PQQGPTLLHCDNRSGI

A0A5D3D7V8 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0082.14Show/hide
Query:  SKDHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGH
        SKDHLRLIKVLMGLRPEYESVRAALLHR+PLPSLDAAIQEILFEE+RLGIN +K SD VLASTY+P   ++ FCKNCKL+GHKF NCPKIECRYCHK GH
Subjt:  SKDHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGH

Query:  ILDNCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSL--IQISDLQSLLNQLISS-SSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPI
        ILDNCP +PPRP   ST+ K FTK  +SS      D S++   QISDLQSLLNQLISS SSALAVS GNRWLLDS CCNHMTSD SLM+T SPTKSLPPI
Subjt:  ILDNCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSL--IQISDLQSLLNQLISS-SSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPI

Query:  YAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTY
        YAADGNCMNI+H GTI+TPS++LPHTYCVPNLTFNLVSVGQLCDLG  VSFS NGCQVQDPQTGQTIGTGRKVGRLFEL SL+V SP SISA VTDSDTY
Subjt:  YAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTY

Query:  QWHLRLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS
        QWHLRLGHAS EKLRHLIS+NNL ++TKFVPFNCLNCKLAKQPALSFS S S CDKPFDL+HSDIWGPAP +TVHGYRYYVLFIDD+SRFTWIYFLKHRS
Subjt:  QWHLRLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRS

Query:  ELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR
        ELSRTYIEFANMIRTQFS PIK LRTDN LEYKDS LLSFLSQQGT+VQRSCPH SQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR
Subjt:  ELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINR

Query:  LPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSP
        LPSSVLQN SPFE+LYG  P+YS LKVFG ACFVLL PHEH KLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFH SFSSP
Subjt:  LPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSP

Query:  QSFFTNTSVDLFPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQ
        QSFFT+TS+DLFPLSE T   ELAQS+P +A  D  S+SD  P+ PP  P RRSTRVREPP HL DYHCFSTIVSL+EPTSYQEAST+P+W+KAM++ELQ
Subjt:  QSFFTNTSVDLFPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQ

Query:  ALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEE
        ALEK HTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKN FLNG LSEE
Subjt:  ALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEE

Query:  VYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDL
        VYMKPP GTS PP+KVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSS HD ALFTR T  GIVLLLLYVDDMIITGND  AISDLQ YLGQHFEMKDL
Subjt:  VYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDL

Query:  GSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIH
        GSLNYFLGLEVS RSDGYLLSQAKYASDL+ARS IT+S T+STPLDP+VHLT +DG+PL++ SLYRQLVGSL+YLTVTRPDIAY V+I +       +  
Subjt:  GSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIH

Query:  FTVVLR-------------------ILRYLKGTLGHGLHKKRSVISRSSTESEYRALADATAELIWLRWLLADMGVPQQGPTLLHCDNRSGI
         ++VL                       YL  +L     KK+SV+SRSSTESEYRALADATAEL+WLRWLLADMGVPQQGPTLLHCDNRS I
Subjt:  FTVVLR-------------------ILRYLKGTLGHGLHKKRSVISRSSTESEYRALADATAELIWLRWLLADMGVPQQGPTLLHCDNRSGI

A0A5D3DD70 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0081.99Show/hide
Query:  MGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGHILDNCPTRPPR
        MGL PEYESVRAALLHR+PLPSL AAIQEILFEE+RLGIN +K SD VLASTY+P   ++ FCKNCKL+GHKF NCPKIECRYCHK GHILDNCP +PPR
Subjt:  MGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGHILDNCPTRPPR

Query:  PPGTSTKEKIFTKHGSSSVVAATSDDSSL--IQISDLQSLLNQLISS-SSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPIYAADGNCMNIS
        P   ST+ K FTK  +SS      D+S+    QISDLQSLLNQLISS SSALAVS GNRWLLDS CCNHMTS+ SLM+T SPTKSLPPIYAADGNCMNI+
Subjt:  PPGTSTKEKIFTKHGSSSVVAATSDDSSL--IQISDLQSLLNQLISS-SSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPIYAADGNCMNIS

Query:  HTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTYQWHLRLGHASS
        HTGTI+TPS++LPHTYCVPNLTFNLVSVGQLCDL L VSFS NGCQVQDPQTGQTIGTGRKVGRLFEL SL+V SP SISA VTDSDTYQWHLRLGHAS 
Subjt:  HTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTYQWHLRLGHASS

Query:  EKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFAN
        EKLRHLIS+NNL ++TKFVPFNCLNCKLAKQPALSFS+S S CDKPFDL+HSDIWGPAP TTVHGYRYYVLFIDD+SRFTWIYFLKHRSELS TYIEFAN
Subjt:  EKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFAN

Query:  MIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINRLPSSVLQNTSP
        MIRTQFS PIK LRTDN  EYKDS LLSFLSQQGT+VQRSCPH SQQNGRAERKH HILDSVRALLLSASCPEKFWGEAALTSVYTINRLPS VLQN SP
Subjt:  MIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINRLPSSVLQNTSP

Query:  FEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSPQSFFTNTSVDL
        FEKLYG  P+YS LKVFG ACFVLLHPHEH KLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSPQSFFT+TS+DL
Subjt:  FEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSPQSFFTNTSVDL

Query:  FPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQALEKTHTWDYV
        FPL E T D ELAQS+P +A  D  S+SD  P+  P  P RRSTRVR PP HL DYHCFSTIVSL+EPTSYQEAST+P+W+KAM++ELQALEKTHTWDYV
Subjt:  FPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQALEKTHTWDYV

Query:  DLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEEVYMKPPQGTSP
        DLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQE+GIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKN FLNG LSEEVYMKPP GTS 
Subjt:  DLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEEVYMKPPQGTSP

Query:  PPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEV
        PP+KVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSS HD ALFTR T  GIVLLLLYVDDMIITGND QAISDLQ YLGQHFEMKDLGSLNYFLGLEV
Subjt:  PPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEV

Query:  SHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIHFTVVLRILRYL
        S RSDGYLLSQAKYASDL+ARS IT+S T+STPLDP+VHLT +DG+PL++ SLYRQLVGSL+YLTVTRPDIAY V+IVSQFM APRTIHFT VLRILRY+
Subjt:  SHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIHFTVVLRILRYL

Query:  KGTLGHGLH-------------------------------------------KKRSVISRSSTESEYRALADATAELIWLRWLLADMGVPQQGPTLLHCD
        KGTLGHGL                                            KK+SV+SRSSTESEYRALADATAEL+WLRWLLADMGVPQQGPTLLHCD
Subjt:  KGTLGHGLH-------------------------------------------KKRSVISRSSTESEYRALADATAELIWLRWLLADMGVPQQGPTLLHCD

Query:  NRSGI
        NRS I
Subjt:  NRSGI

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.3e-10126.35Show/hide
Query:  STYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGHILDNCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSLIQISDLQSLLNQLISSSSALA
        +TY  N   N   K  K+   K ++  K++C +C + GHI  +C     R      KE          V  ATS   +             ++   +  +
Subjt:  STYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHKHGHILDNCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSLIQISDLQSLLNQLISSSSALA

Query:  VSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPI---YAADGNCMNISHTGTIDTPSVH---LPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQV
        V     ++LDS   +H+ +D SL + S   + +PP+    A  G  +  +  G +   + H   L           NL+SV +L + G+++ F  +G  +
Subjt:  VSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPI---YAADGNCMNISHTGTIDTPSVH---LPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQV

Query:  QDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTYQ-WHLRLGHASSEKL-----RHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQ--S
                  +G        L ++ V +  + S +    + ++ WH R GH S  KL     +++ S  +L N  +     C  C   KQ  L F Q   
Subjt:  QDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTYQ-WHLRLGHASSEKL-----RHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQ--S

Query:  ISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQR
         ++  +P  +VHSD+ GP    T+    Y+V+F+D ++ +   Y +K++S++   + +F       F+  +  L  DN  EY  + +  F  ++G     
Subjt:  ISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQR

Query:  SCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINRLPSSVLQNTS--PFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRA
        + PH  Q NG +ER  R I +  R ++  A   + FWGEA LT+ Y INR+PS  L ++S  P+E  +   P    L+VFG+  +V +  ++  K + ++
Subjt:  SCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINRLPSSVLQNTS--PFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRA

Query:  RLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMF-SRLSSFHTSF-----SSPQSFFTNTSVDL----FPLSEPTLDT-ELAQSSPATANLDPPS
            F+GY  E  GF+ WD ++ +  ++R V   E  M  SR   F T F      S    F N S  +    FP      D  +  + S  + N + P+
Subjt:  RLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMF-SRLSSFHTSF-----SSPQSFFTNTSVDL----FPLSEPTLDT-ELAQSSPATANLDPPS

Query:  VS-----------------------------------------------------DDVPESPPATPL------------------RRSTRVREPPP----
         S                                                     ++  ES  A  L                  RRS R++  P     
Subjt:  VS-----------------------------------------------------DDVPESPPATPL------------------RRSTRVREPPP----

Query:  ---------HLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYG
                  L  +  F+ + +  +   Y++  ++  WE+A++ EL A +  +TW     P  K  +  +W++ +K +  G   RYKARLVA+G++Q+Y 
Subjt:  ---------HLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYG

Query:  IDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEEVYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDN
        IDYEETFAPVAR++S R +L++       + QMDVK  FLNG L EE+YM+ PQG S   + VC L +A+YGLKQA R WF  F   + +  F +SS D 
Subjt:  IDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEEVYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDN

Query:  ALF--TRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHL
         ++   +   +  + +LLYVDD++I   D   +++ ++YL + F M DL  + +F+G+ +  + D   LSQ+ Y   ++++  + N    STPL   ++ 
Subjt:  ALF--TRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHL

Query:  TSFDGIPLDDASLYRQLVGSLLYLTV-TRPDIAYVVYIVSQFMVAPRTIHFTVVLRILRYLKGTLGHGL-------------------------------
           +    D  +  R L+G L+Y+ + TRPD+   V I+S++     +  +  + R+LRYLKGT+   L                               
Subjt:  TSFDGIPLDDASLYRQLVGSLLYLTV-TRPDIAYVVYIVSQFMVAPRTIHFTVVLRILRYLKGTLGHGL-------------------------------

Query:  ---------------HKKRSVISRSSTESEYRALADATAELIWLRWLLADMGVPQQGPTLLHCDNRSGI
                        K+++ ++ SSTE+EY AL +A  E +WL++LL  + +  + P  ++ DN+  I
Subjt:  ---------------HKKRSVISRSSTESEYRALADATAELIWLRWLLADMGVPQQGPTLLHCDNRSGI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-11430.7Show/hide
Query:  RLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEK-RLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKI-ECRYCHKHGHILD
        + I +L  L   Y+++   +LH      L      +L  EK R    +  Q+ +      +  R +N + ++      K  +  ++  C  C++ GH   
Subjt:  RLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEK-RLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKI-ECRYCHKHGHILD

Query:  NCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSLIQISDLQSLLNQLISSSSALAVSS-GNRWLLDSACCNHMTSDVSLM---------STSSPTKS
        +CP  P +  G ++ +K              +DD++   + +  +++  +      + +S   + W++D+A  +H T    L          +      S
Subjt:  NCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSLIQISDLQSLLNQLISSSSALAVSS-GNRWLLDSACCNHMTSDVSLM---------STSSPTKS

Query:  LPPIYAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTD
           I      C+  +   T+    V       VP+L  NL+S   L   G    F+    ++   +    I  G   G L+  T+  +     ++A+  +
Subjt:  LPPIYAADGNCMNISHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTD

Query:  SDTYQWHLRLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFL
             WH R+GH S + L+ L   + ++         C  C   KQ  +SF  S        DLV+SD+ GP  I ++ G +Y+V FIDD SR  W+Y L
Subjt:  SDTYQWHLRLGHASSEKLRHLISVNNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFL

Query:  KHRSELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVY
        K + ++ + + +F  ++  +    +K LR+DN  EY       + S  G   +++ P   Q NG AER +R I++ VR++L  A  P+ FWGEA  T+ Y
Subjt:  KHRSELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVY

Query:  TINRLPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLS-SFHT
         INR PS  L    P          YS LKVFG   F  +   +  KL+ ++  C F+GYG E  G+R WDP+  ++  SR V F E  + +    S   
Subjt:  TINRLPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLS-SFHT

Query:  SFSSPQSFFTNTSVDLFPLSEPTLDTELAQ--SSPATANLDPPSVSDDVPESPPAT-------PLRRSTRVREPPPHLTDYHCFSTIVSLV----EPTSY
              +F T  S    P S  +   E+++    P         + + V E    T       PLRRS R     P +      ST   L+    EP S 
Subjt:  SFSSPQSFFTNTSVDLFPLSEPTLDTELAQ--SSPATANLDPPSVSDDVPESPPAT-------PLRRSTRVREPPPHLTDYHCFSTIVSLV----EPTSY

Query:  QEASTNPVWE---KAMDEELQALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAK
        +E  ++P      KAM EE+++L+K  T+  V+LP GKRP+ CKW++K+K   D  + RYKARLV KG+ Q+ GID++E F+PV +MTS+R++L++AA+ 
Subjt:  QEASTNPVWE---KAMDEELQALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAK

Query:  QWPLLQMDVKNVFLNGNLSEEVYMKPPQGTSPPPNK--VCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNAL-FTRQTTHGIVLLLLYVDDMII
           + Q+DVK  FL+G+L EE+YM+ P+G      K  VC L ++LYGLKQAPR W+  F S +    +  +  D  + F R + +  ++LLLYVDDM+I
Subjt:  QWPLLQMDVKNVFLNGNLSEEVYMKPPQGTSPPPNK--VCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNAL-FTRQTTHGIVLLLLYVDDMII

Query:  TGNDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEV--SHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHL------TSFDGIPLDDASLYRQL
         G D+  I+ L+  L + F+MKDLG     LG+++     S    LSQ KY   ++ R  + N+   STPL  H+ L      T+ +         Y   
Subjt:  TGNDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEV--SHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHL------TSFDGIPLDDASLYRQL

Query:  VGSLLYLTV-TRPDIAYVVYIVSQFMVAPRTIHFTVVLRILRYLKGTLG
        VGSL+Y  V TRPDIA+ V +VS+F+  P   H+  V  ILRYL+GT G
Subjt:  VGSLLYLTV-TRPDIAYVVYIVSQFMVAPRTIHFTVVLRILRYLKGTLG

P25600 Putative transposon Ty5-1 protein YCL074W2.9e-2935.19Show/hide
Query:  MDVKNVFLNGNLSEEVYMKPPQG--TSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQA
        MDV   FLN  + E +Y+K P G      P+ V  L   +YGLKQAP  W    ++T+ ++GF     ++ L+ R T+ G + + +YVDD+++     + 
Subjt:  MDVKNVFLNGNLSEEVYMKPPQG--TSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQA

Query:  ISDLQQYLGQHFEMKDLGSLNYFLGLEVSHRSDGYL-LSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVT-RPD
           ++Q L + + MKDLG ++ FLGL +   S+G + LS   Y +   + SEI     + TPL     L       L D + Y+ +VG LL+   T RPD
Subjt:  ISDLQQYLGQHFEMKDLGSLNYFLGLEVSHRSDGYL-LSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVT-RPD

Query:  IAYVVYIVSQFMVAPRTIHFTVVLRILRYLKGT
        I+Y V ++S+F+  PR IH     R+LRYL  T
Subjt:  IAYVVYIVSQFMVAPRTIHFTVVLRILRYLKGT

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.2e-16535.76Show/hide
Query:  SGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPIYAADGNCMNISHTG--TIDTPS--VHLPHTYCVPNLTFNLVSVGQLCDL-GLNVSFSPNGCQVQDP
        S N WLLDS   +H+TSD + +S   P      +  ADG+ + ISHTG  ++ T S  ++L +   VPN+  NL+SV +LC+  G++V F P   QV+D 
Subjt:  SGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPIYAADGNCMNISHTG--TIDTPS--VHLPHTYCVPNLTFNLVSVGQLCDL-GLNVSFSPNGCQVQDP

Query:  QTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTY-QWHLRLGHASSEKLRHLISVNNLTNLTKFVPF-NCLNCKLAKQPALSFSQSISNCDKPFD
         TG  +  G+    L+E   +  S P S+ AS +   T+  WH RLGH +   L  +IS  +L+ L     F +C +C + K   + FSQS  N  +P +
Subjt:  QTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTY-QWHLRLGHASSEKLRHLISVNNLTNLTKFVPF-NCLNCKLAKQPALSFSQSISNCDKPFD

Query:  LVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQN
         ++SD+W  +PI +   YRYYV+F+D ++R+TW+Y LK +S++  T+I F N++  +F + I    +DN  E+    L  + SQ G     S PH  + N
Subjt:  LVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFANMIRTQFSSPIKILRTDNVLEYKDSILLSFLSQQGTIVQRSCPHISQQN

Query:  GRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINRLPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTE
        G +ERKHRHI+++   LL  AS P+ +W  A   +VY INRLP+ +LQ  SPF+KL+G SP+Y KL+VFG AC+  L P+  +KL+ ++R C FLGY   
Subjt:  GRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINRLPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHEHNKLEPRARLCCFLGYGTE

Query:  HKGFRCWDPLSNRLRISRHVTFWEHTM------------------FSRLSSFHTSF----------------------SSPQSFFTNTSV----------
           + C    ++RL ISRHV F E+                     S + S HT+                       SSP + F N+ V          
Subjt:  HKGFRCWDPLSNRLRISRHVTFWEHTM------------------FSRLSSFHTSF----------------------SSPQSFFTNTSV----------

Query:  DLFPLS-EPTL----------------------------------DTELAQ--SSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPP----------
          FP S EPT                                    ++LAQ  S+PA ++   PS +     S   +P   S  +  PPP          
Subjt:  DLFPLS-EPTL----------------------------------DTELAQ--SSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPP----------

Query:  -----HLTDYHC----------FSTIVSLV---EPTSYQEASTNPVWEKAMDEELQALEKTHTWDYVDLPPGKRPI-GCKWIYKIKTHSDGTIERYKARL
             H                +S  VSL    EP +  +A  +  W  AM  E+ A    HTWD V  PP    I GC+WI+  K +SDG++ RYKARL
Subjt:  -----HLTDYHC----------FSTIVSLV---EPTSYQEASTNPVWEKAMDEELQALEKTHTWDYVDLPPGKRPI-GCKWIYKIKTHSDGTIERYKARL

Query:  VAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEEVYMKPPQG--TSPPPNKVCLLRRALYGLKQAPRAWFATFSSTI
        VAKGY+Q  G+DY ETF+PV + TS+R +L VA  + WP+ Q+DV N FL G L+++VYM  P G      PN VC LR+ALYGLKQAPRAW+    + +
Subjt:  VAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEEVYMKPPQG--TSPPPNKVCLLRRALYGLKQAPRAWFATFSSTI

Query:  TQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTS
          +GF +S  D +LF  Q    IV +L+YVDD++ITGND   + +    L Q F +KD   L+YFLG+E      G  LSQ +Y  DL+AR+ +  +   
Subjt:  TQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTS

Query:  STPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIHFTVVLRILRYLKGTLGHGL----------------------
        +TP+ P   L+ + G  L D + YR +VGSL YL  TRPDI+Y V  +SQFM  P   H   + RILRYL GT  HG+                      
Subjt:  STPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIHFTVVLRILRYLKGTLGHGL----------------------

Query:  ---------------------HKKRSVISRSSTESEYRALADATAELIWLRWLLADMGVPQQGPTLLHCDN
                              KK+  + RSSTE+EYR++A+ ++E+ W+  LL ++G+    P +++CDN
Subjt:  ---------------------HKKRSVISRSSTESEYRALADATAELIWLRWLLADMGVPQQGPTLLHCDN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.6e-15933.36Show/hide
Query:  QISDLQSLLNQLISSS--------SALAVSS---GNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPIYAADGNCMNISHTGTIDTP----SVHLPHTYCV
        Q+   QS  NQ  S+S        + LAV+S    N WLLDS   +H+TSD + +S   P      +  ADG+ + I+HTG+   P    S+ L     V
Subjt:  QISDLQSLLNQLISSS--------SALAVSS---GNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPIYAADGNCMNISHTGTIDTP----SVHLPHTYCV

Query:  PNLTFNLVSVGQLCDLG-LNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTYQWHLRLGHASSEKLRHLISVNNLTNLT-
        PN+  NL+SV +LC+   ++V F P   QV+D  TG  +  G+    L+E       + S  ++  + +    WH RLGH S   L  +IS ++L  L  
Subjt:  PNLTFNLVSVGQLCDLG-LNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTYQWHLRLGHASSEKLRHLISVNNLTNLT-

Query:  KFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFANMIRTQFSSPIKILRTD
             +C +C + K   + FS S     KP + ++SD+W  +PI ++  YRYYV+F+D ++R+TW+Y LK +S++  T+I F +++  +F + I  L +D
Subjt:  KFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFANMIRTQFSSPIKILRTD

Query:  NVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINRLPSSVLQNTSPFEKLYGISPDYSKLKV
        N  E+   +L  +LSQ G     S PH  + NG +ERKHRHI++    LL  AS P+ +W  A   +VY INRLP+ +LQ  SPF+KL+G  P+Y KLKV
Subjt:  NVLEYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINRLPSSVLQNTSPFEKLYGISPDYSKLKV

Query:  FGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSPQ------SFFTNTSVDLFPL-------
        FG AC+  L P+  +KLE +++ C F+GY      + C    + RL  SRHV F E       ++F  S S  Q      ++ ++T++   PL       
Subjt:  FGSACFVLLHPHEHNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSPQ------SFFTNTSVDLFPL-------

Query:  --------------------------------------SEPTLDTE---------------------LAQSSPATANLDPPSVSDDVPESP---------
                                              SEPT  +                      L   +P + + + P+ +  +P+SP         
Subjt:  --------------------------------------SEPTLDTE---------------------LAQSSPATANLDPPSVSDDVPESP---------

Query:  ------PATPLRRSTRVREPPPHL-------------------------------TDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQALEKTHTWD
              P +P   ST     PP L                                 Y   +++ +  EP +  +A  +  W +AM  E+ A    HTWD
Subjt:  ------PATPLRRSTRVREPPPHL-------------------------------TDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQALEKTHTWD

Query:  YVDLPPGKRPI-GCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEEVYMKPPQG
         V  PP    I GC+WI+  K +SDG++ RYKARLVAKGY+Q  G+DY ETF+PV + TS+R +L VA  + WP+ Q+DV N FL G L++EVYM  P G
Subjt:  YVDLPPGKRPI-GCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEEVYMKPPQG

Query:  --TSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYF
              P+ VC LR+A+YGLKQAPRAW+    + +  +GF +S  D +LF  Q    I+ +L+YVDD++ITGND   +      L Q F +K+   L+YF
Subjt:  --TSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYF

Query:  LGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIHFTVVLR
        LG+E      G  LSQ +Y  DL+AR+ +  +   +TP+     LT   G  L D + YR +VGSL YL  TRPD++Y V  +SQ+M  P   H+  + R
Subjt:  LGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIHFTVVLR

Query:  ILRYLKGTLGHGL-------------------------------------------HKKRSVISRSSTESEYRALADATAELIWLRWLLADMGVPQQGPT
        +LRYL GT  HG+                                            KK+  + RSSTE+EYR++A+ ++EL W+  LL ++G+    P 
Subjt:  ILRYLKGTLGHGL-------------------------------------------HKKRSVISRSSTESEYRALADATAELIWLRWLLADMGVPQQGPT

Query:  LLHCDN
        +++CDN
Subjt:  LLHCDN

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.1e-10540.57Show/hide
Query:  DTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFST---------------------------IVSLVEPTSYQEASTNPVWE
        D + + SS +   +   ++ +DVPE    T  RR+ +    P +L DY+C S                            I    EP++Y EA    VW 
Subjt:  DTELAQSSPATANLDPPSVSDDVPESPPATPLRRSTRVREPPPHLTDYHCFST---------------------------IVSLVEPTSYQEASTNPVWE

Query:  KAMDEELQALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVF
         AMD+E+ A+E THTW+   LPP K+PIGCKW+YKIK +SDGTIERYKARLVAKGY+Q+ GID+ ETF+PV ++TSV+ +LA++A   + L Q+D+ N F
Subjt:  KAMDEELQALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVAAAKQWPLLQMDVKNVF

Query:  LNGNLSEEVYMKPP------QGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISD
        LNG+L EE+YMK P      QG S PPN VC L++++YGLKQA R WF  FS T+   GF  S  D+  F + T    + +L+YVDD+II  N+  A+ +
Subjt:  LNGNLSEEVYMKPP------QGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLYVDDMIITGNDQQAISD

Query:  LQQYLGQHFEMKDLGSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVV
        L+  L   F+++DLG L YFLGLE++  + G  + Q KYA DL+  + +     SS P+DP V  ++  G    DA  YR+L+G L+YL +TR DI++ V
Subjt:  LQQYLGQHFEMKDLGSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRPDIAYVV

Query:  YIVSQFMVAPRTIHFTVVLRILRYLKGTLGHGL-------------------------------------------HKKRSVISRSSTESEYRALADATA
          +SQF  APR  H   V++IL Y+KGT+G GL                                            KK+ V+S+SS E+EYRAL+ AT 
Subjt:  YIVSQFMVAPRTIHFTVVLRILRYLKGTLGHGL-------------------------------------------HKKRSVISRSSTESEYRALADATA

Query:  ELIWLRWLLADMGVPQQGPTLLHCDNRSGI
        E++WL     ++ +P   PTLL CDN + I
Subjt:  ELIWLRWLLADMGVPQQGPTLLHCDNRSGI

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.3e-0537.68Show/hide
Query:  LYLTVTRPDIAYVVYIVSQFMVAPRTIHFTVVLRILRYLKGTLGHGLHKKRSVISRSSTESEYRALADA
        +YLT+TRPD+ + V  +SQF  A RT     V ++L Y+KGT+G GL         ++++ + +A AD+
Subjt:  LYLTVTRPDIAYVVYIVSQFMVAPRTIHFTVVLRILRYLKGTLGHGLHKKRSVISRSSTESEYRALADA

ATMG00810.1 DNA/RNA polymerases superfamily protein6.6e-2936.61Show/hide
Query:  LLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYR
        LLLYVDD+++TG+    ++ L   L   F MKDLG ++YFLG+++     G  LSQ KYA  ++  + + +    STPL   ++ +S       D S +R
Subjt:  LLLYVDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYR

Query:  QLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIHFTVVLRILRYLKGTLGHGLH-------------------------------------------KKR
         +VG+L YLT+TRPDI+Y V IV Q M  P    F ++ R+LRY+KGT+ HGL+                                           K++
Subjt:  QLVGSLLYLTVTRPDIAYVVYIVSQFMVAPRTIHFTVVLRILRYLKGTLGHGLH-------------------------------------------KKR

Query:  SVISRSSTESEYRALADATAELIW
          +SRSSTE+EYRALA   AEL W
Subjt:  SVISRSSTESEYRALADATAELIW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)4.6e-2250.51Show/hide
Query:  EPTSYQEASTNPVWEKAMDEELQALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVA
        EP S   A  +P W +AM EEL AL +  TW  V  P  +  +GCKW++K K HSDGT++R KARLVAKG+ QE GI + ET++PV R  ++R++L VA
Subjt:  EPTSYQEASTNPVWEKAMDEELQALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETFAPVARMTSVRSLLAVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATTTTTTGTCTACACGCTTCAAATCTATTGGTTTAGCAAAGATCATCTTCGCCTTATTAAAGTCCTTATGGGATTACGTCCAGAATATGAATCTGTTAGAGCTGC
TTTACTACACCGGAATCCCTTACCCTCATTAGATGCAGCTATTCAAGAAATTCTGTTTGAAGAAAAGCGTCTTGGCATCAACTCTACTAAACAATCTGATGTTGTCCTTG
CTAGCACATACACTCCCAACAGAGTCGCAAATATGTTTTGTAAGAATTGTAAGCTCTCTGGTCACAAATTTAGTAACTGTCCTAAAATAGAGTGCAGGTACTGCCATAAA
CATGGCCACATTCTGGATAACTGCCCTACCAGACCACCCCGACCTCCTGGCACTTCCACAAAAGAGAAAATTTTTACCAAACATGGTTCCTCATCTGTTGTTGCTGCGAC
CTCGGATGATTCATCCCTCATTCAGATAAGTGATCTTCAGAGCTTATTGAATCAACTAATTTCATCATCCTCCGCTCTGGCTGTTTCATCAGGTAATCGATGGCTTCTTG
ATTCTGCCTGTTGTAATCATATGACCTCTGACGTTTCTCTTATGTCTACTTCTAGCCCTACAAAATCTTTACCTCCTATTTATGCTGCTGATGGTAATTGTATGAACATC
TCTCATACTGGTACCATTGATACTCCCAGTGTACATCTTCCCCATACTTACTGTGTTCCTAACCTGACCTTTAATCTAGTGTCTGTTGGTCAATTATGTGATCTTGGCTT
AAATGTTTCATTTTCTCCCAATGGTTGTCAGGTTCAGGATCCGCAGACGGGACAGACGATTGGAACGGGTCGCAAAGTGGGAAGATTGTTTGAGCTCACATCACTTCGGG
TTTCATCTCCTTCTTCCATCTCTGCTTCGGTCACTGATTCTGACACATATCAGTGGCATCTTCGTCTTGGTCATGCTTCCTCTGAAAAACTTCGTCATTTAATTTCTGTT
AACAATTTGACTAATCTTACTAAGTTTGTTCCTTTTAATTGTTTGAATTGCAAACTTGCTAAACAACCTGCCTTATCTTTTTCTCAATCCATCTCTAATTGTGATAAACC
TTTTGATTTAGTGCATTCTGATATTTGGGGTCCTGCCCCAATTACTACTGTTCATGGTTATCGCTACTATGTTTTATTCATTGATGACTACTCTCGATTTACATGGATTT
ACTTTCTAAAACATCGTTCTGAATTATCTCGCACATATATTGAGTTTGCTAACATGATTCGCACTCAATTTTCCTCTCCCATCAAAATTCTTCGCACTGATAATGTTTTG
GAATATAAAGATTCCATCCTTCTTTCTTTTCTTTCCCAACAGGGCACTATTGTTCAGCGCTCTTGCCCTCATATCTCTCAACAAAATGGACGTGCTGAGCGCAAACATCG
TCACATTCTTGACTCAGTACGTGCCCTCCTTCTTTCTGCCTCTTGTCCAGAAAAATTCTGGGGTGAAGCTGCCCTTACATCAGTATATACAATCAATCGTCTCCCTTCCT
CTGTCCTTCAAAATACCTCTCCGTTCGAAAAACTATATGGTATTTCTCCCGACTATTCTAAACTCAAAGTTTTTGGTAGTGCCTGCTTCGTTCTGTTACATCCTCATGAA
CACAATAAACTTGAACCACGTGCCCGTCTCTGTTGTTTCCTTGGCTATGGCACCGAACACAAAGGATTTCGTTGTTGGGACCCTCTTTCCAACCGACTCCGGATATCTCG
GCATGTCACTTTTTGGGAACACACTATGTTCTCTCGTTTGTCCTCCTTCCACACCTCTTTCTCTAGTCCTCAATCTTTCTTTACAAATACATCTGTTGACCTTTTTCCTC
TCTCTGAACCCACCTTGGATACTGAGCTTGCACAATCTTCACCTGCTACTGCAAATCTGGATCCACCGTCTGTCTCCGATGATGTTCCTGAATCGCCACCTGCTACTCCT
CTTCGTCGCTCTACCCGGGTAAGAGAACCTCCCCCTCATCTCACTGATTACCATTGTTTTTCTACCATTGTTTCCCTTGTTGAACCCACCTCTTATCAAGAGGCCAGTAC
TAACCCAGTATGGGAGAAAGCAATGGATGAAGAATTACAGGCTCTTGAAAAGACGCACACTTGGGACTATGTTGATTTACCTCCCGGTAAAAGACCCATTGGTTGCAAAT
GGATTTACAAAATCAAAACTCACTCTGATGGAACTATTGAACGTTATAAAGCTCGGCTTGTTGCAAAAGGATACTCACAAGAATATGGGATTGACTATGAAGAAACATTT
GCCCCTGTTGCCCGGATGACATCTGTTCGCAGCTTGTTAGCTGTTGCTGCTGCCAAACAGTGGCCTCTTCTTCAGATGGATGTCAAAAATGTCTTTCTTAACGGAAACCT
CTCTGAAGAAGTGTATATGAAGCCACCTCAGGGCACTTCTCCTCCTCCCAACAAGGTGTGTCTCCTTCGTCGCGCTCTATACGGTCTAAAACAGGCTCCACGAGCTTGGT
TTGCCACGTTTAGCTCCACCATTACTCAACTTGGATTTACCTCCAGCTCTCACGACAATGCCCTTTTTACACGACAGACAACTCATGGTATTGTTCTTCTCCTTCTTTAT
GTTGATGATATGATTATTACTGGTAATGATCAACAGGCCATATCCGACCTACAACAATATCTTGGTCAACATTTTGAGATGAAAGACCTTGGATCTCTCAATTACTTTCT
CGGTCTTGAAGTCTCTCACCGTTCAGATGGTTATCTATTATCTCAAGCGAAATATGCATCTGATCTGATAGCGCGCTCAGAAATTACAAACTCCACCACATCTTCAACAC
CGTTAGATCCTCATGTCCATCTAACTTCGTTTGATGGTATTCCTCTTGACGATGCAAGCTTGTATCGACAACTTGTTGGCAGTCTTCTATACCTAACAGTGACTCGCCCA
GATATTGCATATGTTGTTTATATTGTCAGTCAATTTATGGTTGCTCCTCGAACAATTCATTTCACTGTTGTTCTACGCATACTTCGCTATCTCAAAGGCACCTTGGGACA
TGGTCTCCATAAGAAACGAAGTGTTATATCTCGTTCAAGTACGGAATCTGAATATCGTGCTCTGGCTGATGCTACAGCTGAACTTATATGGCTTCGGTGGCTCCTTGCTG
ATATGGGTGTCCCTCAACAGGGTCCTACCCTCCTCCATTGTGACAATCGTAGTGGCATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGATTTTTTGTCTACACGCTTCAAATCTATTGGTTTAGCAAAGATCATCTTCGCCTTATTAAAGTCCTTATGGGATTACGTCCAGAATATGAATCTGTTAGAGCTGC
TTTACTACACCGGAATCCCTTACCCTCATTAGATGCAGCTATTCAAGAAATTCTGTTTGAAGAAAAGCGTCTTGGCATCAACTCTACTAAACAATCTGATGTTGTCCTTG
CTAGCACATACACTCCCAACAGAGTCGCAAATATGTTTTGTAAGAATTGTAAGCTCTCTGGTCACAAATTTAGTAACTGTCCTAAAATAGAGTGCAGGTACTGCCATAAA
CATGGCCACATTCTGGATAACTGCCCTACCAGACCACCCCGACCTCCTGGCACTTCCACAAAAGAGAAAATTTTTACCAAACATGGTTCCTCATCTGTTGTTGCTGCGAC
CTCGGATGATTCATCCCTCATTCAGATAAGTGATCTTCAGAGCTTATTGAATCAACTAATTTCATCATCCTCCGCTCTGGCTGTTTCATCAGGTAATCGATGGCTTCTTG
ATTCTGCCTGTTGTAATCATATGACCTCTGACGTTTCTCTTATGTCTACTTCTAGCCCTACAAAATCTTTACCTCCTATTTATGCTGCTGATGGTAATTGTATGAACATC
TCTCATACTGGTACCATTGATACTCCCAGTGTACATCTTCCCCATACTTACTGTGTTCCTAACCTGACCTTTAATCTAGTGTCTGTTGGTCAATTATGTGATCTTGGCTT
AAATGTTTCATTTTCTCCCAATGGTTGTCAGGTTCAGGATCCGCAGACGGGACAGACGATTGGAACGGGTCGCAAAGTGGGAAGATTGTTTGAGCTCACATCACTTCGGG
TTTCATCTCCTTCTTCCATCTCTGCTTCGGTCACTGATTCTGACACATATCAGTGGCATCTTCGTCTTGGTCATGCTTCCTCTGAAAAACTTCGTCATTTAATTTCTGTT
AACAATTTGACTAATCTTACTAAGTTTGTTCCTTTTAATTGTTTGAATTGCAAACTTGCTAAACAACCTGCCTTATCTTTTTCTCAATCCATCTCTAATTGTGATAAACC
TTTTGATTTAGTGCATTCTGATATTTGGGGTCCTGCCCCAATTACTACTGTTCATGGTTATCGCTACTATGTTTTATTCATTGATGACTACTCTCGATTTACATGGATTT
ACTTTCTAAAACATCGTTCTGAATTATCTCGCACATATATTGAGTTTGCTAACATGATTCGCACTCAATTTTCCTCTCCCATCAAAATTCTTCGCACTGATAATGTTTTG
GAATATAAAGATTCCATCCTTCTTTCTTTTCTTTCCCAACAGGGCACTATTGTTCAGCGCTCTTGCCCTCATATCTCTCAACAAAATGGACGTGCTGAGCGCAAACATCG
TCACATTCTTGACTCAGTACGTGCCCTCCTTCTTTCTGCCTCTTGTCCAGAAAAATTCTGGGGTGAAGCTGCCCTTACATCAGTATATACAATCAATCGTCTCCCTTCCT
CTGTCCTTCAAAATACCTCTCCGTTCGAAAAACTATATGGTATTTCTCCCGACTATTCTAAACTCAAAGTTTTTGGTAGTGCCTGCTTCGTTCTGTTACATCCTCATGAA
CACAATAAACTTGAACCACGTGCCCGTCTCTGTTGTTTCCTTGGCTATGGCACCGAACACAAAGGATTTCGTTGTTGGGACCCTCTTTCCAACCGACTCCGGATATCTCG
GCATGTCACTTTTTGGGAACACACTATGTTCTCTCGTTTGTCCTCCTTCCACACCTCTTTCTCTAGTCCTCAATCTTTCTTTACAAATACATCTGTTGACCTTTTTCCTC
TCTCTGAACCCACCTTGGATACTGAGCTTGCACAATCTTCACCTGCTACTGCAAATCTGGATCCACCGTCTGTCTCCGATGATGTTCCTGAATCGCCACCTGCTACTCCT
CTTCGTCGCTCTACCCGGGTAAGAGAACCTCCCCCTCATCTCACTGATTACCATTGTTTTTCTACCATTGTTTCCCTTGTTGAACCCACCTCTTATCAAGAGGCCAGTAC
TAACCCAGTATGGGAGAAAGCAATGGATGAAGAATTACAGGCTCTTGAAAAGACGCACACTTGGGACTATGTTGATTTACCTCCCGGTAAAAGACCCATTGGTTGCAAAT
GGATTTACAAAATCAAAACTCACTCTGATGGAACTATTGAACGTTATAAAGCTCGGCTTGTTGCAAAAGGATACTCACAAGAATATGGGATTGACTATGAAGAAACATTT
GCCCCTGTTGCCCGGATGACATCTGTTCGCAGCTTGTTAGCTGTTGCTGCTGCCAAACAGTGGCCTCTTCTTCAGATGGATGTCAAAAATGTCTTTCTTAACGGAAACCT
CTCTGAAGAAGTGTATATGAAGCCACCTCAGGGCACTTCTCCTCCTCCCAACAAGGTGTGTCTCCTTCGTCGCGCTCTATACGGTCTAAAACAGGCTCCACGAGCTTGGT
TTGCCACGTTTAGCTCCACCATTACTCAACTTGGATTTACCTCCAGCTCTCACGACAATGCCCTTTTTACACGACAGACAACTCATGGTATTGTTCTTCTCCTTCTTTAT
GTTGATGATATGATTATTACTGGTAATGATCAACAGGCCATATCCGACCTACAACAATATCTTGGTCAACATTTTGAGATGAAAGACCTTGGATCTCTCAATTACTTTCT
CGGTCTTGAAGTCTCTCACCGTTCAGATGGTTATCTATTATCTCAAGCGAAATATGCATCTGATCTGATAGCGCGCTCAGAAATTACAAACTCCACCACATCTTCAACAC
CGTTAGATCCTCATGTCCATCTAACTTCGTTTGATGGTATTCCTCTTGACGATGCAAGCTTGTATCGACAACTTGTTGGCAGTCTTCTATACCTAACAGTGACTCGCCCA
GATATTGCATATGTTGTTTATATTGTCAGTCAATTTATGGTTGCTCCTCGAACAATTCATTTCACTGTTGTTCTACGCATACTTCGCTATCTCAAAGGCACCTTGGGACA
TGGTCTCCATAAGAAACGAAGTGTTATATCTCGTTCAAGTACGGAATCTGAATATCGTGCTCTGGCTGATGCTACAGCTGAACTTATATGGCTTCGGTGGCTCCTTGCTG
ATATGGGTGTCCCTCAACAGGGTCCTACCCTCCTCCATTGTGACAATCGTAGTGGCATTTAG
Protein sequenceShow/hide protein sequence
MGFFVYTLQIYWFSKDHLRLIKVLMGLRPEYESVRAALLHRNPLPSLDAAIQEILFEEKRLGINSTKQSDVVLASTYTPNRVANMFCKNCKLSGHKFSNCPKIECRYCHK
HGHILDNCPTRPPRPPGTSTKEKIFTKHGSSSVVAATSDDSSLIQISDLQSLLNQLISSSSALAVSSGNRWLLDSACCNHMTSDVSLMSTSSPTKSLPPIYAADGNCMNI
SHTGTIDTPSVHLPHTYCVPNLTFNLVSVGQLCDLGLNVSFSPNGCQVQDPQTGQTIGTGRKVGRLFELTSLRVSSPSSISASVTDSDTYQWHLRLGHASSEKLRHLISV
NNLTNLTKFVPFNCLNCKLAKQPALSFSQSISNCDKPFDLVHSDIWGPAPITTVHGYRYYVLFIDDYSRFTWIYFLKHRSELSRTYIEFANMIRTQFSSPIKILRTDNVL
EYKDSILLSFLSQQGTIVQRSCPHISQQNGRAERKHRHILDSVRALLLSASCPEKFWGEAALTSVYTINRLPSSVLQNTSPFEKLYGISPDYSKLKVFGSACFVLLHPHE
HNKLEPRARLCCFLGYGTEHKGFRCWDPLSNRLRISRHVTFWEHTMFSRLSSFHTSFSSPQSFFTNTSVDLFPLSEPTLDTELAQSSPATANLDPPSVSDDVPESPPATP
LRRSTRVREPPPHLTDYHCFSTIVSLVEPTSYQEASTNPVWEKAMDEELQALEKTHTWDYVDLPPGKRPIGCKWIYKIKTHSDGTIERYKARLVAKGYSQEYGIDYEETF
APVARMTSVRSLLAVAAAKQWPLLQMDVKNVFLNGNLSEEVYMKPPQGTSPPPNKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSSHDNALFTRQTTHGIVLLLLY
VDDMIITGNDQQAISDLQQYLGQHFEMKDLGSLNYFLGLEVSHRSDGYLLSQAKYASDLIARSEITNSTTSSTPLDPHVHLTSFDGIPLDDASLYRQLVGSLLYLTVTRP
DIAYVVYIVSQFMVAPRTIHFTVVLRILRYLKGTLGHGLHKKRSVISRSSTESEYRALADATAELIWLRWLLADMGVPQQGPTLLHCDNRSGI