; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G02540 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G02540
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionBeta-galactosidase
Genome locationClcChr08:5005616..5008000
RNA-Seq ExpressionClc08G02540
SyntenyClc08G02540
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031941.1 Beta-galactosidase [Cucumis melo var. makuwa]1.4e-22560Show/hide
Query:  GSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQK
        GSL  ++TG FSG   L   N  S     Q I+M LEGR++FG+LTGE  +P PGD  ERLWKGED+L+RS+LI+SMEPQIGKPLLYAATA+++WD  Q 
Subjt:  GSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQK

Query:  LYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCS
        LYSKRQNASR YTLRKQ+H CKQG++DVT+YFNKLSL+WQE+DLCRE +WD P    QY K+EE DRVYDFLA  N KFD V  RILGQRP+P+LMEVC 
Subjt:  LYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCS

Query:  EVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPN--QALVSETTSGFQSQHREICP
        EVRLEE+R++AM +  T  IDSA FSA+SS    DK NGK   VCEHCKK WH KDQ WKLHGRPP G++R  N K N  +A +SETT    SQ  +   
Subjt:  EVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPN--QALVSETTSGFQSQHREICP

Query:  ADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLS-
        + T   +LG IAQSG+ Q L L+S+ GK PWILDSGA DHLTG+S++F+SY PC  NEKI I DG+L P+ GKG I PFDG  LQNVLHVPK+S NLLS 
Subjt:  ADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLS-

Query:  ------------------------------------------------------------------------------------VSFPSQPYKPSKSFTL
                                                                                            VSFPSQPYKP++ F L
Subjt:  ------------------------------------------------------------------------------------VSFPSQPYKPSKSFTL

Query:  IHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNG
        IHSDVWGPS++TTS  K WFVTFIDDHTRLTWV+L++DKSEV  IFQ FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNG
Subjt:  IHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNG

Query:  VAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN
        VAERKNRHL+EVARSLMLSTSLPSYL GDAILTAAHLINRMPSR+L+ QTPLDCLK SYP+T L+ +VPLRVFG T +VH+F PN
Subjt:  VAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN

KAA0052172.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.3e-22459.85Show/hide
Query:  GSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQK
        GSL  ++TG FSG   L   N  S     Q I+M LEGR++FG+LTGE  +P PGD  ERLWKGED+L+RS+LI+SMEPQIGKPLLYAATA+++WD  Q 
Subjt:  GSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQK

Query:  LYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCS
        LYSKRQNASR YTLRKQ+H CKQG++DVT+YFNKLSL+WQE+DLCRE +WD P    QY K+EE DRVYDFLA  N KFD V  RILGQRP+P+LMEVC 
Subjt:  LYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCS

Query:  EVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPN--QALVSETTSGFQSQHREICP
        EVRLEE+R++AM +  T  IDSA FSA+SS    DK NGK   VCEHCKK WH KDQ WKLHGRPP G++R  N K N  +A +SET     SQ  +   
Subjt:  EVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPN--QALVSETTSGFQSQHREICP

Query:  ADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLS-
        + T   +LG IAQSG+ Q L L+SI GK PWILDSGA DHLTG+S++F+SY  C  NEKI I DG+L P+ GKG I PFDG  LQNVLHVPK+S NLLS 
Subjt:  ADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLS-

Query:  ------------------------------------------------------------------------------------VSFPSQPYKPSKSFTL
                                                                                            VSFPSQPYKP++ F L
Subjt:  ------------------------------------------------------------------------------------VSFPSQPYKPSKSFTL

Query:  IHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNG
        IHSDVWGPS++TTS  K WFVTFIDDHTRLTWV+L++DKSEV  IFQ FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNG
Subjt:  IHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNG

Query:  VAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN
        VAERKNRHL+EVARSLMLSTSLPSYL GDAILTAAHLINRMPSR+L+ QTPLDCLK SYP+T L+ +VPLRVFG T +VH+F PN
Subjt:  VAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN

KAA0061447.1 Beta-galactosidase [Cucumis melo var. makuwa]1.4e-22560Show/hide
Query:  GSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQK
        GSL  ++TG FSG   L   N  S     Q I+M LEGR++FG+LTGE  +P PGD  ERLWKGED+L+RS+LI+SMEPQIGKPLLYAATA+++WD  Q 
Subjt:  GSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQK

Query:  LYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCS
        LYSKRQNASR YTLRKQ+H CKQG++DVT+YFNKLSL+WQE+DLCRE +WD P    QY K+EE DRVYDFLA  N KFD V  RILGQRP+P+LMEVC 
Subjt:  LYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCS

Query:  EVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPN--QALVSETTSGFQSQHREICP
        EVRLEE+R++AM +  T  IDSA FSA+SS    DK NGK   VCEHCKK WH KDQ WKLHGRPP G++R  N K N  +A +SETT    SQ  +   
Subjt:  EVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPN--QALVSETTSGFQSQHREICP

Query:  ADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLS-
        + T   +LG IAQSG+ Q L L+S+ GK PWILDSGA DHLTG+S++F+SY PC  NEKI I DG+L P+ GKG I PFDG  LQNVLHVPK+S NLLS 
Subjt:  ADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLS-

Query:  ------------------------------------------------------------------------------------VSFPSQPYKPSKSFTL
                                                                                            VSFPSQPYKP++ F L
Subjt:  ------------------------------------------------------------------------------------VSFPSQPYKPSKSFTL

Query:  IHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNG
        IHSDVWGPS++TTS  K WFVTFIDDHTRLTWV+L++DKSEV  IFQ FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNG
Subjt:  IHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNG

Query:  VAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN
        VAERKNRHL+EVARSLMLSTSLPSYL GDAILTAAHLINRMPSR+L+ QTPLDCLK SYP+T L+ +VPLRVFG T +VH+F PN
Subjt:  VAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN

TYK31050.1 Beta-galactosidase [Cucumis melo var. makuwa]1.0e-22560Show/hide
Query:  GSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQK
        GSL  ++TG FSG   L   N  S     Q I+M LEGR++FG+LTGEI +P PGD  ERLWKGED+L+RS+LI+SMEPQIGKPLLYA TA+++WD  Q 
Subjt:  GSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQK

Query:  LYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCS
        LYSKRQNASR YTLRKQ+H CKQG++DVT+YFNKLSL+WQE+DLCRE +WD P    QY K+EE DRVYDFLA  N KFD V  RILGQRP+P+LMEVC 
Subjt:  LYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCS

Query:  EVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPN--QALVSETTSGFQSQHREICP
        EVRLEE+R++AM +  T  IDSA FSA+SS    DK NGK   VCEHCKK WH KDQ WKLHGRPP G++R  N K N  +A +SETT    SQ  +   
Subjt:  EVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPN--QALVSETTSGFQSQHREICP

Query:  ADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLS-
        + T   +LG IAQSG+ Q L L+S+ GK PWILDSGA DHLTG+S++F+SY PC  NEKI I DG+L P+ GKG I PFDG  LQNVLHVPK+S NLLS 
Subjt:  ADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLS-

Query:  ------------------------------------------------------------------------------------VSFPSQPYKPSKSFTL
                                                                                            VSFPSQPYKP++ F L
Subjt:  ------------------------------------------------------------------------------------VSFPSQPYKPSKSFTL

Query:  IHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNG
        IHSDVWGPS++TTS  K WFVTFIDDHTRLTWV+L++DKSEV  IFQ FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNG
Subjt:  IHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNG

Query:  VAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN
        VAERKNRHL+EVARSLMLSTSLPSYL GDAILTAAHLINRMPSR+L+ QTPLDCLK SYP+T L+ +VPLRVFG T +VH+F PN
Subjt:  VAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN

TYK31717.1 Beta-galactosidase [Cucumis melo var. makuwa]1.0e-22560Show/hide
Query:  GSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQK
        GSL  ++TG FSG   L   N  S     Q I+M LEGR++FG+LTGEI +P PGD  ERLWKGED+L+RS+LI+SMEPQIGKPLLYA TA+++WD  Q 
Subjt:  GSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQK

Query:  LYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCS
        LYSKRQNASR YTLRKQ+H CKQG++DVT+YFNKLSL+WQE+DLCRE +WD P    QY K+EE DRVYDFLA  N KFD V  RILGQRP+P+LMEVC 
Subjt:  LYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCS

Query:  EVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPN--QALVSETTSGFQSQHREICP
        EVRLEE+R++AM +  T  IDSA FSA+SS    DK NGK   VCEHCKK WH KDQ WKLHGRPP G++R  N K N  +A +SETT    SQ  +   
Subjt:  EVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPN--QALVSETTSGFQSQHREICP

Query:  ADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLS-
        + T   +LG IAQSG+ Q L L+S+ GK PWILDSGA DHLTG+S++F+SY PC  NEKI I DG+L P+ GKG I PFDG  LQNVLHVPK+S NLLS 
Subjt:  ADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLS-

Query:  ------------------------------------------------------------------------------------VSFPSQPYKPSKSFTL
                                                                                            VSFPSQPYKP++ F L
Subjt:  ------------------------------------------------------------------------------------VSFPSQPYKPSKSFTL

Query:  IHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNG
        IHSDVWGPS++TTS  K WFVTFIDDHTRLTWV+L++DKSEV  IFQ FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNG
Subjt:  IHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNG

Query:  VAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN
        VAERKNRHL+EVARSLMLSTSLPSYL GDAILTAAHLINRMPSR+L+ QTPLDCLK SYP+T L+ +VPLRVFG T +VH+F PN
Subjt:  VAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN

TrEMBL top hitse value%identityAlignment
A0A5A7SQW1 Beta-galactosidase6.6e-22660Show/hide
Query:  GSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQK
        GSL  ++TG FSG   L   N  S     Q I+M LEGR++FG+LTGE  +P PGD  ERLWKGED+L+RS+LI+SMEPQIGKPLLYAATA+++WD  Q 
Subjt:  GSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQK

Query:  LYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCS
        LYSKRQNASR YTLRKQ+H CKQG++DVT+YFNKLSL+WQE+DLCRE +WD P    QY K+EE DRVYDFLA  N KFD V  RILGQRP+P+LMEVC 
Subjt:  LYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCS

Query:  EVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPN--QALVSETTSGFQSQHREICP
        EVRLEE+R++AM +  T  IDSA FSA+SS    DK NGK   VCEHCKK WH KDQ WKLHGRPP G++R  N K N  +A +SETT    SQ  +   
Subjt:  EVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPN--QALVSETTSGFQSQHREICP

Query:  ADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLS-
        + T   +LG IAQSG+ Q L L+S+ GK PWILDSGA DHLTG+S++F+SY PC  NEKI I DG+L P+ GKG I PFDG  LQNVLHVPK+S NLLS 
Subjt:  ADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLS-

Query:  ------------------------------------------------------------------------------------VSFPSQPYKPSKSFTL
                                                                                            VSFPSQPYKP++ F L
Subjt:  ------------------------------------------------------------------------------------VSFPSQPYKPSKSFTL

Query:  IHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNG
        IHSDVWGPS++TTS  K WFVTFIDDHTRLTWV+L++DKSEV  IFQ FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNG
Subjt:  IHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNG

Query:  VAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN
        VAERKNRHL+EVARSLMLSTSLPSYL GDAILTAAHLINRMPSR+L+ QTPLDCLK SYP+T L+ +VPLRVFG T +VH+F PN
Subjt:  VAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN

A0A5A7U8U2 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-22459.85Show/hide
Query:  GSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQK
        GSL  ++TG FSG   L   N  S     Q I+M LEGR++FG+LTGE  +P PGD  ERLWKGED+L+RS+LI+SMEPQIGKPLLYAATA+++WD  Q 
Subjt:  GSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQK

Query:  LYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCS
        LYSKRQNASR YTLRKQ+H CKQG++DVT+YFNKLSL+WQE+DLCRE +WD P    QY K+EE DRVYDFLA  N KFD V  RILGQRP+P+LMEVC 
Subjt:  LYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCS

Query:  EVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPN--QALVSETTSGFQSQHREICP
        EVRLEE+R++AM +  T  IDSA FSA+SS    DK NGK   VCEHCKK WH KDQ WKLHGRPP G++R  N K N  +A +SET     SQ  +   
Subjt:  EVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPN--QALVSETTSGFQSQHREICP

Query:  ADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLS-
        + T   +LG IAQSG+ Q L L+SI GK PWILDSGA DHLTG+S++F+SY  C  NEKI I DG+L P+ GKG I PFDG  LQNVLHVPK+S NLLS 
Subjt:  ADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLS-

Query:  ------------------------------------------------------------------------------------VSFPSQPYKPSKSFTL
                                                                                            VSFPSQPYKP++ F L
Subjt:  ------------------------------------------------------------------------------------VSFPSQPYKPSKSFTL

Query:  IHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNG
        IHSDVWGPS++TTS  K WFVTFIDDHTRLTWV+L++DKSEV  IFQ FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNG
Subjt:  IHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNG

Query:  VAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN
        VAERKNRHL+EVARSLMLSTSLPSYL GDAILTAAHLINRMPSR+L+ QTPLDCLK SYP+T L+ +VPLRVFG T +VH+F PN
Subjt:  VAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN

A0A5A7V3J5 Beta-galactosidase6.6e-22660Show/hide
Query:  GSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQK
        GSL  ++TG FSG   L   N  S     Q I+M LEGR++FG+LTGE  +P PGD  ERLWKGED+L+RS+LI+SMEPQIGKPLLYAATA+++WD  Q 
Subjt:  GSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQK

Query:  LYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCS
        LYSKRQNASR YTLRKQ+H CKQG++DVT+YFNKLSL+WQE+DLCRE +WD P    QY K+EE DRVYDFLA  N KFD V  RILGQRP+P+LMEVC 
Subjt:  LYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCS

Query:  EVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPN--QALVSETTSGFQSQHREICP
        EVRLEE+R++AM +  T  IDSA FSA+SS    DK NGK   VCEHCKK WH KDQ WKLHGRPP G++R  N K N  +A +SETT    SQ  +   
Subjt:  EVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPN--QALVSETTSGFQSQHREICP

Query:  ADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLS-
        + T   +LG IAQSG+ Q L L+S+ GK PWILDSGA DHLTG+S++F+SY PC  NEKI I DG+L P+ GKG I PFDG  LQNVLHVPK+S NLLS 
Subjt:  ADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLS-

Query:  ------------------------------------------------------------------------------------VSFPSQPYKPSKSFTL
                                                                                            VSFPSQPYKP++ F L
Subjt:  ------------------------------------------------------------------------------------VSFPSQPYKPSKSFTL

Query:  IHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNG
        IHSDVWGPS++TTS  K WFVTFIDDHTRLTWV+L++DKSEV  IFQ FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNG
Subjt:  IHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNG

Query:  VAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN
        VAERKNRHL+EVARSLMLSTSLPSYL GDAILTAAHLINRMPSR+L+ QTPLDCLK SYP+T L+ +VPLRVFG T +VH+F PN
Subjt:  VAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN

A0A5D3E603 Beta-galactosidase5.0e-22660Show/hide
Query:  GSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQK
        GSL  ++TG FSG   L   N  S     Q I+M LEGR++FG+LTGEI +P PGD  ERLWKGED+L+RS+LI+SMEPQIGKPLLYA TA+++WD  Q 
Subjt:  GSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQK

Query:  LYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCS
        LYSKRQNASR YTLRKQ+H CKQG++DVT+YFNKLSL+WQE+DLCRE +WD P    QY K+EE DRVYDFLA  N KFD V  RILGQRP+P+LMEVC 
Subjt:  LYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCS

Query:  EVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPN--QALVSETTSGFQSQHREICP
        EVRLEE+R++AM +  T  IDSA FSA+SS    DK NGK   VCEHCKK WH KDQ WKLHGRPP G++R  N K N  +A +SETT    SQ  +   
Subjt:  EVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPN--QALVSETTSGFQSQHREICP

Query:  ADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLS-
        + T   +LG IAQSG+ Q L L+S+ GK PWILDSGA DHLTG+S++F+SY PC  NEKI I DG+L P+ GKG I PFDG  LQNVLHVPK+S NLLS 
Subjt:  ADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLS-

Query:  ------------------------------------------------------------------------------------VSFPSQPYKPSKSFTL
                                                                                            VSFPSQPYKP++ F L
Subjt:  ------------------------------------------------------------------------------------VSFPSQPYKPSKSFTL

Query:  IHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNG
        IHSDVWGPS++TTS  K WFVTFIDDHTRLTWV+L++DKSEV  IFQ FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNG
Subjt:  IHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNG

Query:  VAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN
        VAERKNRHL+EVARSLMLSTSLPSYL GDAILTAAHLINRMPSR+L+ QTPLDCLK SYP+T L+ +VPLRVFG T +VH+F PN
Subjt:  VAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN

A0A5D3E6F8 Beta-galactosidase5.0e-22660Show/hide
Query:  GSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQK
        GSL  ++TG FSG   L   N  S     Q I+M LEGR++FG+LTGEI +P PGD  ERLWKGED+L+RS+LI+SMEPQIGKPLLYA TA+++WD  Q 
Subjt:  GSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQK

Query:  LYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCS
        LYSKRQNASR YTLRKQ+H CKQG++DVT+YFNKLSL+WQE+DLCRE +WD P    QY K+EE DRVYDFLA  N KFD V  RILGQRP+P+LMEVC 
Subjt:  LYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCS

Query:  EVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPN--QALVSETTSGFQSQHREICP
        EVRLEE+R++AM +  T  IDSA FSA+SS    DK NGK   VCEHCKK WH KDQ WKLHGRPP G++R  N K N  +A +SETT    SQ  +   
Subjt:  EVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPN--QALVSETTSGFQSQHREICP

Query:  ADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLS-
        + T   +LG IAQSG+ Q L L+S+ GK PWILDSGA DHLTG+S++F+SY PC  NEKI I DG+L P+ GKG I PFDG  LQNVLHVPK+S NLLS 
Subjt:  ADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLS-

Query:  ------------------------------------------------------------------------------------VSFPSQPYKPSKSFTL
                                                                                            VSFPSQPYKP++ F L
Subjt:  ------------------------------------------------------------------------------------VSFPSQPYKPSKSFTL

Query:  IHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNG
        IHSDVWGPS++TTS  K WFVTFIDDHTRLTWV+L++DKSEV  IFQ FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNG
Subjt:  IHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNG

Query:  VAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN
        VAERKNRHL+EVARSLMLSTSLPSYL GDAILTAAHLINRMPSR+L+ QTPLDCLK SYP+T L+ +VPLRVFG T +VH+F PN
Subjt:  VAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.3e-2936.08Show/hide
Query:  DNEKIW------IVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLSVSFPSQPYKPSKSFT-------LIHSDVWGPSRITTSLCKCWFVTFIDDH
        +N ++W      I DG L+ +  K   S  D  +L N+    +I    L+      P+K  K  T       ++HSDV GP    T   K +FV F+D  
Subjt:  DNEKIW------IVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLLSVSFPSQPYKPSKSFT-------LIHSDVWGPSRITTSLCKCWFVTFIDDH

Query:  TRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYLL
        T     +L+  KS+V  +FQ F A  E  FN K+  L  DNGRE+L+N +R+F   KGI +  +  +TPQ NGV+ER  R + E AR+++    L     
Subjt:  TRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYLL

Query:  GDAILTAAHLINRMPSRVL--NFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVH
        G+A+LTA +LINR+PSR L  + +TP +      P         LRVFG T +VH
Subjt:  GDAILTAAHLINRMPSRVL--NFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVH

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-2939.29Show/hide
Query:  VSFPSQPYKPSKSFTLIHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIK
        VSF +   +      L++SDV GP  I +     +FVTFIDD +R  WV++L  K +V  +FQ+F+A +E +   K+  LRSDNG E+ +    E+ S  
Subjt:  VSFPSQPYKPSKSFTLIHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIK

Query:  GIVHQSSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVH
        GI H+ +   TPQ NGVAER NR ++E  RS++    LP    G+A+ TA +LINR PS  L F+ P         T   +    L+VFG   F H
Subjt:  GIVHQSSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVH

Q12491 Transposon Ty2-B Gag-Pol polyprotein9.7e-1732.05Show/hide
Query:  QPYKPSKSFTLIHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSF--IFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIV
        + Y+P   F  +H+D++GP          +F++F D+ TR  WV+ L D+ E S   +F    A I+ QFNA++ +++ D G E+   TL +F + +GI 
Subjt:  QPYKPSKSFTLIHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSF--IFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIV

Query:  HQSSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPS
           +     + +GVAER NR LL   R+L+  + LP++L   A+  +  + N + S
Subjt:  HQSSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-3423.12Show/hide
Query:  QRIRMVLEGRHKFGYLTGEIPKPRPG---------DPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQKLYSKRQNASRHYT-LRKQIH
        +++  + +G    G+L G    P            +P    WK +D L+ S ++ ++   +   +  A TA +IW+ ++K+Y+       H T LR Q+ 
Subjt:  QRIRMVLEGRHKFGYLTGEIPKPRPG---------DPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQKLYSKRQNASRHYT-LRKQIH

Query:  ECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCSEVRLEEERSSAMNITATFI
        +  +G+  +  Y   L   + ++ L  +              ++  ++V   L +   ++  V  +I  +   PTL E+   +   E +  A++      
Subjt:  ECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCSEVRLEEERSSAMNITATFI

Query:  IDSATFSAKSSGTTGDKQNGKPPLVCE-----HCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPNQALVSETTSGFQSQHREICPADTSIVSLGVIAQSGI
        I +   S +++ TT +  NG      +     +  KPW      +            P NN+    L      G Q    + C      +S  V +Q   
Subjt:  IDSATFSAKSSGTTGDKQNGKPPLVCE-----HCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPNQALVSETTSGFQSQHREICPADTSIVSLGVIAQSGI

Query:  SQF--------LSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHIS---PFDGLILQNVLHVPKISNNLLSV-----
        S F        L+L S      W+LDSGA  H+T   +N   + P    + + + DG+ +P++  G  S       L L N+L+VP I  NL+SV     
Subjt:  SQF--------LSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHIS---PFDGLILQNVLHVPKISNNLLSV-----

Query:  -------SFP------------------------------SQP---------------------------------------YKPSKSF-----------
                FP                              SQP                                         PS  F           
Subjt:  -------SFP------------------------------SQP---------------------------------------YKPSKSF-----------

Query:  ----------------TLIHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLS
                          I+SDVW  S I +     ++V F+D  TR TW++ L  KS+V   F  F   +E +F  +I    SDNG EF+   L E+ S
Subjt:  ----------------TLIHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLS

Query:  IKGIVHQSSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFG
          GI H +S  +TP+ NG++ERK+RH++E   +L+   S+P      A   A +LINR+P+ +L  ++P   L  + P         LRVFG
Subjt:  IKGIVHQSSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.2e-3222.29Show/hide
Query:  QRIRMVLEGRHKFGYLTGEIPKPRPG---------DPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQKLYSKRQNASRHYTLRKQIHE
        +++  + +G    G+L G  P P            +P    W+ +D L+ S ++ ++   +   +  A TA +IW+ ++K+Y+       H T  + I  
Subjt:  QRIRMVLEGRHKFGYLTGEIPKPRPG---------DPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQKLYSKRQNASRHYTLRKQIHE

Query:  CKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCSEVRLEEERSSAMN------I
                 + F++L+L+ + +D                   E+ +RV + L      +  V  +I  +   P+L E+   +   E +  A+N      I
Subjt:  CKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPNSKFDAVRSRILGQRPIPTLMEVCSEVRLEEERSSAMN------I

Query:  TATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPNQALVSETTSGFQSQHREICPADTSIVSLGVIAQSGI
        TA  +    T + ++    GD +N              + +   W    +P +   R  N +P   L        Q    + CP      S     Q   
Subjt:  TATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPNQALVSETTSGFQSQHREICPADTSIVSLGVIAQSGI

Query:  SQF--------LSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHIS---PFDGLILQNVLHVPKISNNLLS------
        S F        L++ S      W+LDSGA  H+T   +N   + P    + + I DG+ +P+T  G  S       L L  VL+VP I  NL+S      
Subjt:  SQF--------LSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHIS---PFDGLILQNVLHVPKISNNLLS------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --VSFPSQPYKPSKSFTLIHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLS
          V F +     SK    I+SDVW    ++    + ++V F+D  TR TW++ L  KS+V   F  F + +E +F  +I  L SDNG EF+   LR++LS
Subjt:  --VSFPSQPYKPSKSFTLIHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLS

Query:  IKGIVHQSSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCL
          GI H +S  +TP+ NG++ERK+RH++E+  +L+   S+P      A   A +LINR+P+ +L  Q+P   L
Subjt:  IKGIVHQSSCAYTPQQNGVAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCL

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).7.4e-2028.38Show/hide
Query:  SATTTGGFSGTYHLPPDNVSSDHRCIQ--------------RIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAA
        S + T      Y+LPPD        IQ              R R  L    KFG++ G +PKP P  P  + W+  + ++   L++SM  ++ + ++YA 
Subjt:  SATTTGGFSGTYHLPPDNVSSDHRCIQ--------------RIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGEDTLLRSLLIHSMEPQIGKPLLYAA

Query:  TAREIWDAVQKLYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGV------QYYKIEEGDRVYDFL--ASPNSKFDA
        TA ++W+ +++++    +  + Y LR+++   +QG   V  YF KLS +W E+      I +C CGG       +  +  E ++ Y+FL     N  F+A
Subjt:  TAREIWDAVQKLYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGV------QYYKIEEGDRVYDFL--ASPNSKFDA

Query:  VRSRILGQRPIPTLMEVCSEVR
        V ++I+ Q+P P+L E  + V+
Subjt:  VRSRILGQRPIPTLMEVCSEVR

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.3e-0536Show/hide
Query:  NRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVH
        NR ++E  RS++    LP     DA  TA H+IN+ PS  +NF  P +    S PT        LR FG   ++H
Subjt:  NRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGCTACCGCAGCTGCTGGATCTAGTTTCTTCGTCGAACGCTGTGGGCTGACTACCGTCAGATTTGTGGGAGCTACTGTAGAACGAAGGGGTATTCTCTCTGATCG
AGTGGGTGATGTCTTAGCTGCTGGAAGTTTGTCTGCAACCACCACTGGGGGTTTTTCCGGCACTTATCACTTGCCTCCGGACAATGTTTCGTCAGATCACCGATGTATCC
AGAGAATTCGAATGGTGCTTGAAGGGCGCCATAAGTTCGGATATTTGACTGGCGAAATACCTAAACCTAGACCCGGAGATCCTCAAGAGCGCCTCTGGAAGGGAGAAGAT
ACATTACTCCGGTCTCTGTTAATTCACAGCATGGAACCTCAAATTGGGAAGCCTTTGTTGTATGCAGCTACGGCTCGGGAAATTTGGGATGCTGTTCAGAAACTGTACTC
CAAGAGGCAGAATGCATCACGACACTACACTCTGCGAAAACAAATCCATGAATGCAAACAAGGGTCCATGGATGTCACGTCATATTTTAATAAATTATCTCTTATCTGGC
AGGAGATAGACTTATGTCGAGAACTTATTTGGGACTGCCCTTGTGGAGGAGTCCAATATTATAAAATTGAAGAGGGCGACCGTGTATATGATTTTTTAGCAAGTCCAAAT
TCCAAGTTTGATGCTGTGCGAAGCCGAATACTGGGACAGAGGCCAATACCGACCCTGATGGAAGTATGTTCAGAGGTTCGTCTGGAGGAAGAAAGATCAAGTGCCATGAA
TATCACTGCTACCTTCATAATTGATTCTGCTACCTTCAGTGCAAAATCATCTGGAACTACTGGAGACAAGCAGAATGGGAAACCACCTCTAGTATGTGAACACTGTAAGA
AACCATGGCACATGAAGGATCAGTTTTGGAAGCTACATGGTCGACCCCCGAATGGCAGACGACGACCTCCAAACAACAAACCAAACCAAGCTCTAGTGAGTGAGACTACT
AGTGGCTTTCAATCACAACATCGGGAGATCTGCCCAGCTGACACCAGTATTGTCTCTCTTGGGGTGATTGCACAATCAGGTATTTCTCAATTCTTGAGTCTCCTCAGTAT
TACTGGCAAGAAACCTTGGATCCTTGATTCAGGAGCCATAGACCACCTAACTGGAACTTCTGACAATTTCCTCTCTTATCTTCCGTGTGTCGATAATGAAAAAATCTGGA
TTGTTGACGGGACACTTGTCCCTGTTACTGGCAAGGGTCACATTTCTCCATTCGATGGTCTGATATTACAGAATGTGCTGCATGTTCCTAAAATCTCCAATAATTTACTG
TCTGTCTCATTTCCGTCTCAACCTTACAAACCTTCGAAATCTTTCACTCTTATTCATAGCGATGTTTGGGGTCCCTCACGCATTACCACTTCTTTGTGTAAGTGTTGGTT
TGTTACCTTCATTGATGACCACACTCGCCTTACTTGGGTCTTCCTTCTAACAGATAAGTCGGAGGTCTCATTTATTTTTCAACAGTTTTACGCCACCATTGAAACTCAGT
TTAATGCCAAAATTGCCATCCTTCGAAGTGACAATGGTCGTGAATTCCTCACTAATACTCTTCGTGAGTTCTTATCCATTAAAGGTATTGTTCACCAGAGCTCGTGTGCC
TATACACCCCAACAAAATGGAGTGGCTGAAAGAAAAAACCGTCATCTCCTCGAAGTTGCTCGGTCTCTCATGCTGTCAACCTCTCTTCCGTCTTACCTGTTGGGGGATGC
AATCTTGACTGCAGCTCATCTTATAAATCGGATGCCTTCTCGGGTTCTCAACTTCCAAACTCCTCTTGATTGTCTTAAACTGTCTTACCCTACCACTTGCCTTATACCTG
ATGTCCCTCTCCGAGTATTTGGGCGTACAACATTTGTCCATAGCTTCGATCCCAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATGCTACCGCAGCTGCTGGATCTAGTTTCTTCGTCGAACGCTGTGGGCTGACTACCGTCAGATTTGTGGGAGCTACTGTAGAACGAAGGGGTATTCTCTCTGATCG
AGTGGGTGATGTCTTAGCTGCTGGAAGTTTGTCTGCAACCACCACTGGGGGTTTTTCCGGCACTTATCACTTGCCTCCGGACAATGTTTCGTCAGATCACCGATGTATCC
AGAGAATTCGAATGGTGCTTGAAGGGCGCCATAAGTTCGGATATTTGACTGGCGAAATACCTAAACCTAGACCCGGAGATCCTCAAGAGCGCCTCTGGAAGGGAGAAGAT
ACATTACTCCGGTCTCTGTTAATTCACAGCATGGAACCTCAAATTGGGAAGCCTTTGTTGTATGCAGCTACGGCTCGGGAAATTTGGGATGCTGTTCAGAAACTGTACTC
CAAGAGGCAGAATGCATCACGACACTACACTCTGCGAAAACAAATCCATGAATGCAAACAAGGGTCCATGGATGTCACGTCATATTTTAATAAATTATCTCTTATCTGGC
AGGAGATAGACTTATGTCGAGAACTTATTTGGGACTGCCCTTGTGGAGGAGTCCAATATTATAAAATTGAAGAGGGCGACCGTGTATATGATTTTTTAGCAAGTCCAAAT
TCCAAGTTTGATGCTGTGCGAAGCCGAATACTGGGACAGAGGCCAATACCGACCCTGATGGAAGTATGTTCAGAGGTTCGTCTGGAGGAAGAAAGATCAAGTGCCATGAA
TATCACTGCTACCTTCATAATTGATTCTGCTACCTTCAGTGCAAAATCATCTGGAACTACTGGAGACAAGCAGAATGGGAAACCACCTCTAGTATGTGAACACTGTAAGA
AACCATGGCACATGAAGGATCAGTTTTGGAAGCTACATGGTCGACCCCCGAATGGCAGACGACGACCTCCAAACAACAAACCAAACCAAGCTCTAGTGAGTGAGACTACT
AGTGGCTTTCAATCACAACATCGGGAGATCTGCCCAGCTGACACCAGTATTGTCTCTCTTGGGGTGATTGCACAATCAGGTATTTCTCAATTCTTGAGTCTCCTCAGTAT
TACTGGCAAGAAACCTTGGATCCTTGATTCAGGAGCCATAGACCACCTAACTGGAACTTCTGACAATTTCCTCTCTTATCTTCCGTGTGTCGATAATGAAAAAATCTGGA
TTGTTGACGGGACACTTGTCCCTGTTACTGGCAAGGGTCACATTTCTCCATTCGATGGTCTGATATTACAGAATGTGCTGCATGTTCCTAAAATCTCCAATAATTTACTG
TCTGTCTCATTTCCGTCTCAACCTTACAAACCTTCGAAATCTTTCACTCTTATTCATAGCGATGTTTGGGGTCCCTCACGCATTACCACTTCTTTGTGTAAGTGTTGGTT
TGTTACCTTCATTGATGACCACACTCGCCTTACTTGGGTCTTCCTTCTAACAGATAAGTCGGAGGTCTCATTTATTTTTCAACAGTTTTACGCCACCATTGAAACTCAGT
TTAATGCCAAAATTGCCATCCTTCGAAGTGACAATGGTCGTGAATTCCTCACTAATACTCTTCGTGAGTTCTTATCCATTAAAGGTATTGTTCACCAGAGCTCGTGTGCC
TATACACCCCAACAAAATGGAGTGGCTGAAAGAAAAAACCGTCATCTCCTCGAAGTTGCTCGGTCTCTCATGCTGTCAACCTCTCTTCCGTCTTACCTGTTGGGGGATGC
AATCTTGACTGCAGCTCATCTTATAAATCGGATGCCTTCTCGGGTTCTCAACTTCCAAACTCCTCTTGATTGTCTTAAACTGTCTTACCCTACCACTTGCCTTATACCTG
ATGTCCCTCTCCGAGTATTTGGGCGTACAACATTTGTCCATAGCTTCGATCCCAACTAG
Protein sequenceShow/hide protein sequence
MDATAAAGSSFFVERCGLTTVRFVGATVERRGILSDRVGDVLAAGSLSATTTGGFSGTYHLPPDNVSSDHRCIQRIRMVLEGRHKFGYLTGEIPKPRPGDPQERLWKGED
TLLRSLLIHSMEPQIGKPLLYAATAREIWDAVQKLYSKRQNASRHYTLRKQIHECKQGSMDVTSYFNKLSLIWQEIDLCRELIWDCPCGGVQYYKIEEGDRVYDFLASPN
SKFDAVRSRILGQRPIPTLMEVCSEVRLEEERSSAMNITATFIIDSATFSAKSSGTTGDKQNGKPPLVCEHCKKPWHMKDQFWKLHGRPPNGRRRPPNNKPNQALVSETT
SGFQSQHREICPADTSIVSLGVIAQSGISQFLSLLSITGKKPWILDSGAIDHLTGTSDNFLSYLPCVDNEKIWIVDGTLVPVTGKGHISPFDGLILQNVLHVPKISNNLL
SVSFPSQPYKPSKSFTLIHSDVWGPSRITTSLCKCWFVTFIDDHTRLTWVFLLTDKSEVSFIFQQFYATIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCA
YTPQQNGVAERKNRHLLEVARSLMLSTSLPSYLLGDAILTAAHLINRMPSRVLNFQTPLDCLKLSYPTTCLIPDVPLRVFGRTTFVHSFDPN