; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0099601 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0099601
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionBeta-galactosidase
Genome locationCMiso1.1chr04:15382722..15384583
RNA-Seq ExpressionCmc04g0099601
SyntenyCmc04g0099601
Gene Ontology termsGO:0006116 - NADH oxidation (biological process)
GO:0015031 - protein transport (biological process)
GO:0015074 - DNA integration (biological process)
GO:0032456 - endocytic recycling (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0005768 - endosome (cellular component)
GO:0005777 - peroxisome (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003954 - NADH dehydrogenase activity (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025138.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.3e-29582.74Show/hide
Query:  MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASL
        MENPT+PCTNNTMSENDKSD AVLENMEEKN  D+TEVRIETSNDE +QGHTRK DEYDPSLD+PIALRKGTRSCTKH I NYVSY+NLSPQFRAFTA L
Subjt:  MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASL

Query:  DSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL
        DST IP+ IY ALECPEWKN VMEE+KALEKN   EICALPKGHK VGCKWVF+LKYKADGTLDRHK RLVAKGFTQTYG+DYSETFSPVAKLN VRVLL
Subjt:  DSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL

Query:  SVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV
        SVAVNKDWPLYQ D+KNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTK S+T KIA+LIVYV
Subjt:  SVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV

Query:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP------------
        DDIVL+GDDQ EISQLKQR+G+EFEI+DLGNLKYFLGMEVARSKE ISVSQRKYTLDLLTET MLGC   DTPIEFNCKLGNSDDQVP            
Subjt:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP------------

Query:  --------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQSVVAR
                            FMQAPYE+HMEAV RILRYLK TPGKGLMFRKT++KTIEAYTDSDWAGSI+DRKSTSGYCTFVWGNLVTWRSKKQ+VV+R
Subjt:  --------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQSVVAR

Query:  SSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLFVIIKSLLVLLTTQFNMIELNMLRLIGISSKKDLTVEAYAFRTFLQTNRLLMFLPRGFSDHTS
        SS EAEY+AMSLGICEEIWLQKVLS+LHQECETPLKLF   K+ + +  TQF+MIELNMLRLIGISSKKDLTV AYAFRT L+ NRLLM LPRGFSD TS
Subjt:  SSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLFVIIKSLLVLLTTQFNMIELNMLRLIGISSKKDLTVEAYAFRTFLQTNRLLMFLPRGFSDHTS

Query:  TFALASWDSLIFTSQLEGKC
        TF LASW SLIFT QLEG+C
Subjt:  TFALASWDSLIFTSQLEGKC

KAA0025363.1 Beta-galactosidase [Cucumis melo var. makuwa]2.0e-28490.89Show/hide
Query:  MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASL
        MENPTKPCTNNTMSENDKSD+AVLENMEEKN DDETEVRIETSNDE +QGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTA+L
Subjt:  MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASL

Query:  DSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL
        DSTIIPKNIYTALECPEWKN VMEE+KALEKN+TWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL
Subjt:  DSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL

Query:  SVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV
        SVAVNKDWPLYQ DVKNAFLNGDLVEEVYMSPPPGFEAQFGQ+VCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKT KIAILIVYV
Subjt:  SVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV

Query:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP------------
        DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE ISVSQRKYTLDLLTET MLGCRP DTPIEFNCKLGNSDDQVP            
Subjt:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP------------

Query:  --------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQSVVAR
                            FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGS+IDRKSTSGYCTFVWGNLVTWRSKKQSVVAR
Subjt:  --------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQSVVAR

Query:  SSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLF
        SS EAEYRAMSLGICEEIWLQKVLS+LHQECETPLKLF
Subjt:  SSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLF

KAA0034386.1 Beta-galactosidase [Cucumis melo var. makuwa]2.0e-28490.89Show/hide
Query:  MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASL
        MENPTKPCTNNTMSENDKSD+AVLENMEEKN DDETEVRIETSNDE +QGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTA+L
Subjt:  MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASL

Query:  DSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL
        DSTIIPKNIYTALECPEWKN VMEE+KALEKN+TWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL
Subjt:  DSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL

Query:  SVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV
        SVAVNKDWPLYQ DVKNAFLNGDLVEEVYMSPPPGFEAQFGQ+VCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKT KIAILIVYV
Subjt:  SVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV

Query:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP------------
        DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE ISVSQRKYTLDLLTET MLGCRP DTPIEFNCKLGNSDDQVP            
Subjt:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP------------

Query:  --------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQSVVAR
                            FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGS+IDRKSTSGYCTFVWGNLVTWRSKKQSVVAR
Subjt:  --------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQSVVAR

Query:  SSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLF
        SS EAEYRAMSLGICEEIWLQKVLS+LHQECETPLKLF
Subjt:  SSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLF

KAA0048203.1 Beta-galactosidase [Cucumis melo var. makuwa]2.0e-28490.89Show/hide
Query:  MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASL
        MENPTKPCTNNTMSENDKSD+AVLENMEEKN DDETEVRIETSNDE +QGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTA+L
Subjt:  MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASL

Query:  DSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL
        DSTIIPKNIYTALECPEWKN VMEE+KALEKN+TWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL
Subjt:  DSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL

Query:  SVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV
        SVAVNKDWPLYQ DVKNAFLNGDLVEEVYMSPPPGFEAQFGQ+VCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKT KIAILIVYV
Subjt:  SVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV

Query:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP------------
        DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE ISVSQRKYTLDLLTET MLGCRP DTPIEFNCKLGNSDDQVP            
Subjt:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP------------

Query:  --------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQSVVAR
                            FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGS+IDRKSTSGYCTFVWGNLVTWRSKKQSVVAR
Subjt:  --------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQSVVAR

Query:  SSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLF
        SS EAEYRAMSLGICEEIWLQKVLS+LHQECETPLKLF
Subjt:  SSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLF

TYK23097.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]4.3e-29582.58Show/hide
Query:  MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASL
        MENPT+PCTNNTMSENDKSD AVLENMEEKN  D+TEVRIETSNDE +QGHTRK DEYDPSLD+PIALRKGTRSCTKH I NYVSY+NLSPQFRAFTA L
Subjt:  MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASL

Query:  DSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL
        DST IP+ IY ALECPEWKN VMEE+KALEKN   EICALPKGHK VGCKWVF+LKYKADGTLDRHK RLVAKGFTQTYG+DYSETFSPVAKLN VRVLL
Subjt:  DSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL

Query:  SVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV
        SVAVNKDWPLYQ D+KNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTK S+T KIA+LIVYV
Subjt:  SVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV

Query:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP------------
        DDIVL+GDDQ EISQLKQR+G+EFEI+DLGNLKYFLGMEVARSKE ISVSQRKYTLDLLTET MLGC   DTPIEFNCKLGNSDDQVP            
Subjt:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP------------

Query:  --------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQSVVAR
                            FMQAPYE+HMEAV RILRYLK TPGKGLMFRKT++KTIEAYTDSDWAGS++DRKSTSGYCTFVWGNLVTWRSKKQ+VV+R
Subjt:  --------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQSVVAR

Query:  SSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLFVIIKSLLVLLTTQFNMIELNMLRLIGISSKKDLTVEAYAFRTFLQTNRLLMFLPRGFSDHTS
        SS EAEY+AMSLGICEEIWLQKVLS+LHQECETPLKLF   K+ + +  TQF+MIELNMLRLIGISSKKDLTV AYAFRT L+ NRLLM LPRGFSD TS
Subjt:  SSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLFVIIKSLLVLLTTQFNMIELNMLRLIGISSKKDLTVEAYAFRTFLQTNRLLMFLPRGFSDHTS

Query:  TFALASWDSLIFTSQLEGKC
        TF LASW SLIFT QLEG+C
Subjt:  TFALASWDSLIFTSQLEGKC

TrEMBL top hitse value%identityAlignment
A0A5A7SG80 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-29582.74Show/hide
Query:  MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASL
        MENPT+PCTNNTMSENDKSD AVLENMEEKN  D+TEVRIETSNDE +QGHTRK DEYDPSLD+PIALRKGTRSCTKH I NYVSY+NLSPQFRAFTA L
Subjt:  MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASL

Query:  DSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL
        DST IP+ IY ALECPEWKN VMEE+KALEKN   EICALPKGHK VGCKWVF+LKYKADGTLDRHK RLVAKGFTQTYG+DYSETFSPVAKLN VRVLL
Subjt:  DSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL

Query:  SVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV
        SVAVNKDWPLYQ D+KNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTK S+T KIA+LIVYV
Subjt:  SVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV

Query:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP------------
        DDIVL+GDDQ EISQLKQR+G+EFEI+DLGNLKYFLGMEVARSKE ISVSQRKYTLDLLTET MLGC   DTPIEFNCKLGNSDDQVP            
Subjt:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP------------

Query:  --------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQSVVAR
                            FMQAPYE+HMEAV RILRYLK TPGKGLMFRKT++KTIEAYTDSDWAGSI+DRKSTSGYCTFVWGNLVTWRSKKQ+VV+R
Subjt:  --------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQSVVAR

Query:  SSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLFVIIKSLLVLLTTQFNMIELNMLRLIGISSKKDLTVEAYAFRTFLQTNRLLMFLPRGFSDHTS
        SS EAEY+AMSLGICEEIWLQKVLS+LHQECETPLKLF   K+ + +  TQF+MIELNMLRLIGISSKKDLTV AYAFRT L+ NRLLM LPRGFSD TS
Subjt:  SSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLFVIIKSLLVLLTTQFNMIELNMLRLIGISSKKDLTVEAYAFRTFLQTNRLLMFLPRGFSDHTS

Query:  TFALASWDSLIFTSQLEGKC
        TF LASW SLIFT QLEG+C
Subjt:  TFALASWDSLIFTSQLEGKC

A0A5A7SM64 Beta-galactosidase9.8e-28590.89Show/hide
Query:  MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASL
        MENPTKPCTNNTMSENDKSD+AVLENMEEKN DDETEVRIETSNDE +QGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTA+L
Subjt:  MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASL

Query:  DSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL
        DSTIIPKNIYTALECPEWKN VMEE+KALEKN+TWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL
Subjt:  DSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL

Query:  SVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV
        SVAVNKDWPLYQ DVKNAFLNGDLVEEVYMSPPPGFEAQFGQ+VCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKT KIAILIVYV
Subjt:  SVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV

Query:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP------------
        DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE ISVSQRKYTLDLLTET MLGCRP DTPIEFNCKLGNSDDQVP            
Subjt:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP------------

Query:  --------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQSVVAR
                            FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGS+IDRKSTSGYCTFVWGNLVTWRSKKQSVVAR
Subjt:  --------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQSVVAR

Query:  SSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLF
        SS EAEYRAMSLGICEEIWLQKVLS+LHQECETPLKLF
Subjt:  SSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLF

A0A5A7V0Y9 Beta-galactosidase9.8e-28590.89Show/hide
Query:  MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASL
        MENPTKPCTNNTMSENDKSD+AVLENMEEKN DDETEVRIETSNDE +QGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTA+L
Subjt:  MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASL

Query:  DSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL
        DSTIIPKNIYTALECPEWKN VMEE+KALEKN+TWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL
Subjt:  DSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL

Query:  SVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV
        SVAVNKDWPLYQ DVKNAFLNGDLVEEVYMSPPPGFEAQFGQ+VCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKT KIAILIVYV
Subjt:  SVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV

Query:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP------------
        DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE ISVSQRKYTLDLLTET MLGCRP DTPIEFNCKLGNSDDQVP            
Subjt:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP------------

Query:  --------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQSVVAR
                            FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGS+IDRKSTSGYCTFVWGNLVTWRSKKQSVVAR
Subjt:  --------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQSVVAR

Query:  SSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLF
        SS EAEYRAMSLGICEEIWLQKVLS+LHQECETPLKLF
Subjt:  SSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLF

A0A5A7VLQ7 Beta-galactosidase9.8e-28590.89Show/hide
Query:  MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASL
        MENPTKPCTNNTMSENDKSD+AVLENMEEKN DDETEVRIETSNDE +QGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTA+L
Subjt:  MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASL

Query:  DSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL
        DSTIIPKNIYTALECPEWKN VMEE+KALEKN+TWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL
Subjt:  DSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL

Query:  SVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV
        SVAVNKDWPLYQ DVKNAFLNGDLVEEVYMSPPPGFEAQFGQ+VCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKT KIAILIVYV
Subjt:  SVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV

Query:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP------------
        DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKE ISVSQRKYTLDLLTET MLGCRP DTPIEFNCKLGNSDDQVP            
Subjt:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP------------

Query:  --------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQSVVAR
                            FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGS+IDRKSTSGYCTFVWGNLVTWRSKKQSVVAR
Subjt:  --------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQSVVAR

Query:  SSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLF
        SS EAEYRAMSLGICEEIWLQKVLS+LHQECETPLKLF
Subjt:  SSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLF

A0A5D3DHC0 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-29582.58Show/hide
Query:  MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASL
        MENPT+PCTNNTMSENDKSD AVLENMEEKN  D+TEVRIETSNDE +QGHTRK DEYDPSLD+PIALRKGTRSCTKH I NYVSY+NLSPQFRAFTA L
Subjt:  MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASL

Query:  DSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL
        DST IP+ IY ALECPEWKN VMEE+KALEKN   EICALPKGHK VGCKWVF+LKYKADGTLDRHK RLVAKGFTQTYG+DYSETFSPVAKLN VRVLL
Subjt:  DSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLL

Query:  SVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV
        SVAVNKDWPLYQ D+KNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTK S+T KIA+LIVYV
Subjt:  SVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV

Query:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP------------
        DDIVL+GDDQ EISQLKQR+G+EFEI+DLGNLKYFLGMEVARSKE ISVSQRKYTLDLLTET MLGC   DTPIEFNCKLGNSDDQVP            
Subjt:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP------------

Query:  --------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQSVVAR
                            FMQAPYE+HMEAV RILRYLK TPGKGLMFRKT++KTIEAYTDSDWAGS++DRKSTSGYCTFVWGNLVTWRSKKQ+VV+R
Subjt:  --------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQSVVAR

Query:  SSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLFVIIKSLLVLLTTQFNMIELNMLRLIGISSKKDLTVEAYAFRTFLQTNRLLMFLPRGFSDHTS
        SS EAEY+AMSLGICEEIWLQKVLS+LHQECETPLKLF   K+ + +  TQF+MIELNMLRLIGISSKKDLTV AYAFRT L+ NRLLM LPRGFSD TS
Subjt:  SSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLFVIIKSLLVLLTTQFNMIELNMLRLIGISSKKDLTVEAYAFRTFLQTNRLLMFLPRGFSDHTS

Query:  TFALASWDSLIFTSQLEGKC
        TF LASW SLIFT QLEG+C
Subjt:  TFALASWDSLIFTSQLEGKC

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.1e-7333.39Show/hide
Query:  NPTKPCTN-NTMSENDKSDVAVLENMEEKNRDDE-TEVRIETSNDEGKQGHTRK------LDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQ--
        N +K C N   + ++ +S+   L   +++ RDD   E +   + +E ++  T +      +D    +  I I  R+  R  TK P  +Y   DN   +  
Subjt:  NPTKPCTN-NTMSENDKSDVAVLENMEEKNRDDE-TEVRIETSNDEGKQGHTRK------LDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQ--

Query:  FRAFTASLDSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAK
          A T   D       I    +   W+  +  E+ A + N TW I   P+    V  +WVFS+KY   G   R+KARLVA+GFTQ Y IDY ETF+PVA+
Subjt:  FRAFTASLDSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAK

Query:  LNTVRVLLSVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLF--TKASKT
        +++ R +LS+ +  +  ++Q DVK AFLNG L EE+YM  P G        VCKL K++YGLKQ+ R WF+ F   +K   +     D  ++   K +  
Subjt:  LNTVRVLLSVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLF--TKASKT

Query:  EKIAILIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPI--EFNCKLGNSDDQ--
        E I +L+ YVDD+V+   D T ++  K+ + ++F + DL  +K+F+G+ +   ++ I +SQ  Y   +L++ +M  C  V TP+  + N +L NSD+   
Subjt:  EKIAILIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPI--EFNCKLGNSDDQ--

Query:  ----------VPFMQAPYEKHMEAVN------------------RILRYLKNTPGKGLMFRK--TNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWG-NL
                  +  M         AVN                  R+LRYLK T    L+F+K       I  Y DSDWAGS IDRKST+GY   ++  NL
Subjt:  ----------VPFMQAPYEKHMEAVN------------------RILRYLKNTPGKGLMFRK--TNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWG-NL

Query:  VTWRSKKQSVVARSSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLF
        + W +K+Q+ VA SS EAEY A+   + E +WL+ +L++++ + E P+K++
Subjt:  VTWRSKKQSVVARSSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-7839.91Show/hide
Query:  PKNIYTALECPEWKNTVM----EEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLLS
        P+++   L  PE KN +M    EE+++L+KN T+++  LPKG + + CKWVF LK   D  L R+KARLV KGF Q  GID+ E FSPV K+ ++R +LS
Subjt:  PKNIYTALECPEWKNTVM----EEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLLS

Query:  VAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFE-AQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV
        +A + D  + Q DVK AFL+GDL EE+YM  P GFE A     VCKL KSLYGLKQ+PR W+ +F +F+KSQ Y + +SD  ++ K        IL++YV
Subjt:  VAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFE-AQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYV

Query:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKES--ISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLG-----------NSDDQV
        DD+++ G D+  I++LK  +   F++KDLG  +  LGM++ R + S  + +SQ KY   +L   +M   +PV TP+  + KL             +  +V
Subjt:  DDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKES--ISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLG-----------NSDDQV

Query:  P----------------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWR
        P                            F++ P ++H EAV  ILRYL+ T G  L F  ++   ++ YTD+D AG I +RKS++GY     G  ++W+
Subjt:  P----------------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWR

Query:  SKKQSVVARSSVEAEYRAMSLGICEEIWLQKVLS--NLHQE
        SK Q  VA S+ EAEY A +    E IWL++ L    LHQ+
Subjt:  SKKQSVVARSSVEAEYRAMSLGICEEIWLQKVLS--NLHQE

P92519 Uncharacterized mitochondrial protein AtMg008102.6e-3233.18Show/hide
Query:  LIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP-------
        L++YVDDI+LTG   T ++ L  ++   F +KDLG + YFLG+++      + +SQ KY   +L    ML C+P+ TP+        S  + P       
Subjt:  LIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP-------

Query:  ------------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQS
                                 M  P     + + R+LRY+K T   GL   K ++  ++A+ DSDWAG    R+ST+G+CTF+  N+++W +K+Q 
Subjt:  ------------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQS

Query:  VVARSSVEAEYRAMSLGICEEIW
         V+RSS E EYRA++L   E  W
Subjt:  VVARSSVEAEYRAMSLGICEEIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.0e-8940.27Show/hide
Query:  AFTASLDSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKT-VGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKL
        +   SL +   P+    AL+   W+N +  EI A   N TW++   P  H T VGC+W+F+ KY +DG+L+R+KARLVAKG+ Q  G+DY+ETFSPV K 
Subjt:  AFTASLDSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKT-VGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKL

Query:  NTVRVLLSVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGF-EAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEK
         ++R++L VAV++ WP+ Q DV NAFL G L ++VYMS PPGF +      VCKL+K+LYGLKQ+PRAW+     ++ + G+    SD +LF    + + 
Subjt:  NTVRVLLSVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGF-EAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEK

Query:  IAILIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLG-----------
        I  ++VYVDDI++TG+D T +      +   F +KD   L YFLG+E  R    + +SQR+Y LDLL  T+M+  +PV TP+  + KL            
Subjt:  IAILIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLG-----------

Query:  -----------------------NSDDQVPFMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTW
                               N   Q  FM  P E+H++A+ RILRYL  TP  G+  +K N  ++ AY+D+DWAG   D  ST+GY  ++  + ++W
Subjt:  -----------------------NSDDQVPFMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTW

Query:  RSKKQSVVARSSVEAEYRAMSLGICEEIWLQKVLSNL
         SKKQ  V RSS EAEYR+++    E  W+  +L+ L
Subjt:  RSKKQSVVARSSVEAEYRAMSLGICEEIWLQKVLSNL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.6e-8838.46Show/hide
Query:  AFTASLDSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKT-VGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKL
        ++  SL +   P+    A++   W+  +  EI A   N TW++   P    T VGC+W+F+ K+ +DG+L+R+KARLVAKG+ Q  G+DY+ETFSPV K 
Subjt:  AFTASLDSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKT-VGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKL

Query:  NTVRVLLSVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGF-EAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEK
         ++R++L VAV++ WP+ Q DV NAFL G L +EVYMS PPGF +      VC+L+K++YGLKQ+PRAW+    T++ + G+    SD +LF    +   
Subjt:  NTVRVLLSVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGF-EAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEK

Query:  IAILIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLG-NSDDQVP---
        I  ++VYVDDI++TG+D   +      +   F +K+  +L YFLG+E  R  + + +SQR+YTLDLL  T+ML  +PV TP+  + KL  +S  ++P   
Subjt:  IAILIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLG-NSDDQVP---

Query:  ----------------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRS
                                    +M  P + H  A+ R+LRYL  TP  G+  +K N  ++ AY+D+DWAG   D  ST+GY  ++  + ++W S
Subjt:  ----------------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRS

Query:  KKQSVVARSSVEAEYRAMSLGICEEIWLQKVLSNLHQECETP
        KKQ  V RSS EAEYR+++    E  W+  +L+ L  +   P
Subjt:  KKQSVVARSSVEAEYRAMSLGICEEIWLQKVLSNLHQECETP

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.9e-10340.43Show/hide
Query:  SCTKHPICNYVSYDNLSPQFRAFTASLDSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAK
        S T H I  ++SY+ +SP + +F   +     P     A E   W   + +EI A+E   TWEIC LP   K +GCKWV+ +KY +DGT++R+KARLVAK
Subjt:  SCTKHPICNYVSYDNLSPQFRAFTASLDSTIIPKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAK

Query:  GFTQTYGIDYSETFSPVAKLNTVRVLLSVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQ-----VCKLQKSLYGLKQSPRAWFDRFTTF
        G+TQ  GID+ ETFSPV KL +V+++L+++   ++ L+Q D+ NAFLNGDL EE+YM  PPG+ A+ G       VC L+KS+YGLKQ+ R WF +F+  
Subjt:  GFTQTYGIDYSETFSPVAKLNTVRVLLSVAVNKDWPLYQPDVKNAFLNGDLVEEVYMSPPPGFEAQFGQQ-----VCKLQKSLYGLKQSPRAWFDRFTTF

Query:  VKSQGYSQGHSDHTLFTKASKTEKIAILIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCR
        +   G+ Q HSDHT F K + T  + +L VYVDDI++  ++   + +LK ++   F+++DLG LKYFLG+E+ARS   I++ QRKY LDLL ET +LGC+
Subjt:  VKSQGYSQGHSDHTLFTKASKTEKIAILIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCR

Query:  PVDTPIE----FNCKLGNS----------------------------DDQVPFMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAG
        P   P++    F+   G                              +    F +AP   H +AV +IL Y+K T G+GL +       ++ ++D+ +  
Subjt:  PVDTPIE----FNCKLGNS----------------------------DDQVPFMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAG

Query:  SIIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLF
            R+ST+GYC F+  +L++W+SKKQ VV++SS EAEYRA+S    E +WL +    L      P  LF
Subjt:  SIIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLF

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.9e-0738.1Show/hide
Query:  FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFV
        F  A     M+AV ++L Y+K T G+GL +  T+   ++A+ DSDWA     R+S +G+C+ V
Subjt:  FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFV

ATMG00810.1 DNA/RNA polymerases superfamily protein1.8e-3333.18Show/hide
Query:  LIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP-------
        L++YVDDI+LTG   T ++ L  ++   F +KDLG + YFLG+++      + +SQ KY   +L    ML C+P+ TP+        S  + P       
Subjt:  LIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKDLGNLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVP-------

Query:  ------------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQS
                                 M  P     + + R+LRY+K T   GL   K ++  ++A+ DSDWAG    R+ST+G+CTF+  N+++W +K+Q 
Subjt:  ------------------------FMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIIDRKSTSGYCTFVWGNLVTWRSKKQS

Query:  VVARSSVEAEYRAMSLGICEEIW
         V+RSS E EYRA++L   E  W
Subjt:  VVARSSVEAEYRAMSLGICEEIW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.5e-2351.02Show/hide
Query:  PKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLLSVA
        PK++  AL+ P W   + EE+ AL +NKTW +   P     +GCKWVF  K  +DGTLDR KARLVAKGF Q  GI + ET+SPV +  T+R +L+VA
Subjt:  PKNIYTALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLLSVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAACCCTACAAAACCTTGTACTAATAATACAATGAGTGAGAATGATAAGTCTGATGTTGCTGTTCTTGAAAATATGGAAGAAAAGAACCGTGATGATGAGACTGA
GGTTAGAATAGAAACCAGTAACGATGAAGGTAAACAGGGTCATACAAGGAAACTTGATGAGTATGATCCCTCTCTTGACATTCCAATTGCATTGAGAAAAGGTACCAGAT
CATGCACTAAACATCCCATTTGCAACTATGTTTCCTATGATAATCTCTCTCCACAGTTTAGAGCGTTTACAGCAAGCCTTGACTCTACCATAATACCGAAAAATATCTAC
ACTGCTCTAGAGTGTCCTGAATGGAAGAATACTGTTATGGAAGAGATAAAGGCTCTCGAAAAGAATAAAACTTGGGAGATCTGTGCTTTACCCAAGGGACATAAAACTGT
AGGATGCAAATGGGTATTCTCTCTCAAATACAAAGCAGATGGTACACTTGATAGACACAAGGCAAGGTTAGTGGCAAAGGGATTCACTCAAACCTATGGTATTGACTATT
CAGAAACTTTTTCTCCAGTTGCTAAATTGAATACTGTTAGAGTTCTGCTATCTGTTGCTGTGAACAAAGATTGGCCTCTATACCAGCCGGATGTTAAGAATGCTTTTTTG
AATGGAGACCTTGTGGAGGAAGTCTACATGAGCCCCCCACCAGGATTTGAAGCCCAATTTGGTCAGCAGGTGTGTAAACTCCAAAAATCTCTATATGGTTTGAAACAGTC
TCCGAGAGCATGGTTTGACAGATTCACTACCTTTGTCAAGTCCCAAGGGTACAGTCAAGGGCACTCTGACCATACTTTATTTACAAAGGCTTCCAAGACAGAAAAGATTG
CTATTCTAATAGTGTATGTGGATGACATTGTTTTGACTGGAGATGATCAAACAGAAATCAGTCAACTAAAGCAGAGAATGGGTGATGAATTTGAAATTAAAGATTTGGGA
AATCTGAAATATTTCCTTGGAATGGAGGTGGCTAGATCAAAAGAAAGTATTTCCGTGTCTCAAAGAAAATACACCCTTGATTTGCTAACCGAGACAGATATGTTGGGATG
TCGTCCTGTTGATACTCCTATTGAATTCAACTGTAAACTAGGAAACTCTGATGATCAAGTTCCATTTATGCAGGCTCCCTATGAGAAACATATGGAAGCTGTTAACAGAA
TCCTAAGATACTTGAAAAATACACCTGGTAAAGGGTTGATGTTTAGAAAAACAAATAGAAAGACCATTGAGGCATATACTGACTCAGATTGGGCAGGATCTATTATTGAT
AGAAAGTCTACCTCCGGTTATTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGTGTTGAGGCTGAATACAGAGC
TATGAGTCTGGGAATATGTGAGGAAATTTGGCTCCAGAAAGTCTTGTCAAATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTTGTGATAATAAAGTCGCTATTAG
TATTGCTAACAACCCAGTTCAACATGATAGAACTAAACATGTTGAGATTGATCGGCATTTCATCAAAGAAAGACTTGACAGTGGAAGCATATGCATTCCGTACATTCCTT
CAAACCAACAGATTGCTGATGTTCTTACCAAGGGGCTTCTCAGACCACACTTCGACCTTTGCGTTAGCAAGTTGGGACTCATTGATATTTACCTCCCAACTTGAGGGGAA
GTGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAAACCCTACAAAACCTTGTACTAATAATACAATGAGTGAGAATGATAAGTCTGATGTTGCTGTTCTTGAAAATATGGAAGAAAAGAACCGTGATGATGAGACTGA
GGTTAGAATAGAAACCAGTAACGATGAAGGTAAACAGGGTCATACAAGGAAACTTGATGAGTATGATCCCTCTCTTGACATTCCAATTGCATTGAGAAAAGGTACCAGAT
CATGCACTAAACATCCCATTTGCAACTATGTTTCCTATGATAATCTCTCTCCACAGTTTAGAGCGTTTACAGCAAGCCTTGACTCTACCATAATACCGAAAAATATCTAC
ACTGCTCTAGAGTGTCCTGAATGGAAGAATACTGTTATGGAAGAGATAAAGGCTCTCGAAAAGAATAAAACTTGGGAGATCTGTGCTTTACCCAAGGGACATAAAACTGT
AGGATGCAAATGGGTATTCTCTCTCAAATACAAAGCAGATGGTACACTTGATAGACACAAGGCAAGGTTAGTGGCAAAGGGATTCACTCAAACCTATGGTATTGACTATT
CAGAAACTTTTTCTCCAGTTGCTAAATTGAATACTGTTAGAGTTCTGCTATCTGTTGCTGTGAACAAAGATTGGCCTCTATACCAGCCGGATGTTAAGAATGCTTTTTTG
AATGGAGACCTTGTGGAGGAAGTCTACATGAGCCCCCCACCAGGATTTGAAGCCCAATTTGGTCAGCAGGTGTGTAAACTCCAAAAATCTCTATATGGTTTGAAACAGTC
TCCGAGAGCATGGTTTGACAGATTCACTACCTTTGTCAAGTCCCAAGGGTACAGTCAAGGGCACTCTGACCATACTTTATTTACAAAGGCTTCCAAGACAGAAAAGATTG
CTATTCTAATAGTGTATGTGGATGACATTGTTTTGACTGGAGATGATCAAACAGAAATCAGTCAACTAAAGCAGAGAATGGGTGATGAATTTGAAATTAAAGATTTGGGA
AATCTGAAATATTTCCTTGGAATGGAGGTGGCTAGATCAAAAGAAAGTATTTCCGTGTCTCAAAGAAAATACACCCTTGATTTGCTAACCGAGACAGATATGTTGGGATG
TCGTCCTGTTGATACTCCTATTGAATTCAACTGTAAACTAGGAAACTCTGATGATCAAGTTCCATTTATGCAGGCTCCCTATGAGAAACATATGGAAGCTGTTAACAGAA
TCCTAAGATACTTGAAAAATACACCTGGTAAAGGGTTGATGTTTAGAAAAACAAATAGAAAGACCATTGAGGCATATACTGACTCAGATTGGGCAGGATCTATTATTGAT
AGAAAGTCTACCTCCGGTTATTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGGAGCAGTGTTGAGGCTGAATACAGAGC
TATGAGTCTGGGAATATGTGAGGAAATTTGGCTCCAGAAAGTCTTGTCAAATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTTGTGATAATAAAGTCGCTATTAG
TATTGCTAACAACCCAGTTCAACATGATAGAACTAAACATGTTGAGATTGATCGGCATTTCATCAAAGAAAGACTTGACAGTGGAAGCATATGCATTCCGTACATTCCTT
CAAACCAACAGATTGCTGATGTTCTTACCAAGGGGCTTCTCAGACCACACTTCGACCTTTGCGTTAGCAAGTTGGGACTCATTGATATTTACCTCCCAACTTGAGGGGAA
GTGTTAG
Protein sequenceShow/hide protein sequence
MENPTKPCTNNTMSENDKSDVAVLENMEEKNRDDETEVRIETSNDEGKQGHTRKLDEYDPSLDIPIALRKGTRSCTKHPICNYVSYDNLSPQFRAFTASLDSTIIPKNIY
TALECPEWKNTVMEEIKALEKNKTWEICALPKGHKTVGCKWVFSLKYKADGTLDRHKARLVAKGFTQTYGIDYSETFSPVAKLNTVRVLLSVAVNKDWPLYQPDVKNAFL
NGDLVEEVYMSPPPGFEAQFGQQVCKLQKSLYGLKQSPRAWFDRFTTFVKSQGYSQGHSDHTLFTKASKTEKIAILIVYVDDIVLTGDDQTEISQLKQRMGDEFEIKDLG
NLKYFLGMEVARSKESISVSQRKYTLDLLTETDMLGCRPVDTPIEFNCKLGNSDDQVPFMQAPYEKHMEAVNRILRYLKNTPGKGLMFRKTNRKTIEAYTDSDWAGSIID
RKSTSGYCTFVWGNLVTWRSKKQSVVARSSVEAEYRAMSLGICEEIWLQKVLSNLHQECETPLKLFVIIKSLLVLLTTQFNMIELNMLRLIGISSKKDLTVEAYAFRTFL
QTNRLLMFLPRGFSDHTSTFALASWDSLIFTSQLEGKC