; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0004464 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0004464
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationchr10:9064135..9067270
RNA-Seq ExpressionPay0004464
SyntenyPay0004464
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037097.1 Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa]0.0e+0061.05Show/hide
Query:  MVQTRIEERLEVIDQEIALMKKELGKMPIIELSLNDIAKNMQTMRAQSDKQEQMLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAV--
        MVQTRIEERL+V+DQEIAL++KELGKMP IEL+LNDIAKNMQTMR QSDKQEQM++MIMETIAKDRTTT ERN+EPN HMSVTNKGKEKEA SSKS V  
Subjt:  MVQTRIEERLEVIDQEIALMKKELGKMPIIELSLNDIAKNMQTMRAQSDKQEQMLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAV--

Query:  --------WGKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE-------------------------------------------------------
                 GKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE                                                       
Subjt:  --------WGKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE-------------------------------------------------------

Query:  --------------------------------SDVPDPVVEDTFMNGLFPWIRAEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTT
                                        SDVPDPVV+DTFMNGLFPWIRAEV +CRPKGLAEMME AQLVENREI RNE NLN+FAGGKY  QNT 
Subjt:  --------------------------------SDVPDPVVEDTFMNGLFPWIRAEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTT

Query:  SSRGTTNTISDNKANTTFPMRTITMRSSNNAEIRKDTNSRRLPDAEFQARKEKGLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEK
        ++R   NT SD+K NT FPMRTIT+RSSNNAEI                                   C                               
Subjt:  SSRGTTNTISDNKANTTFPMRTITMRSSNNAEIRKDTNSRRLPDAEFQARKEKGLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEK

Query:  TLALLQVEEEQKTFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHNFISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWK
                                     TMKVKGK+QEREVIILID GATHNFIS+KLVESLQL +KET HYGVILGSGT VQGKG+CENVEIQL+NWK
Subjt:  TLALLQVEEEQKTFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHNFISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWK

Query:  VKEEFLPLELGGVDVVL-----------VVDWKNFTLTFPNEGKQISIKGDPSLTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIED
        VKEEFLPLELGGVDVVL           VVDWKN TLTF +EGKQI IKGDPSLT SRISLKSM KTW++QDE FLIECRAIQVH ENEQ+ TA ATI D
Subjt:  VKEEFLPLELGGVDVVL-----------VVDWKNFTLTFPNEGKQISIKGDPSLTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIED

Query:  ESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLKEGTNPINVRPYKYG-----------------------------PVLLVKKKDGSWRFCVNYRAVNNT
        E LQN+LKQF DVFDWPE+LPPRRGIEHQIHLKEGTNPINVRPY+YG                             PVLLVKKKDGSWRFCV+YRAVNN 
Subjt:  ESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLKEGTNPINVRPYKYG-----------------------------PVLLVKKKDGSWRFCVNYRAVNNT

Query:  TIPDKFPTPVAEELFDELNGATI---------------------------------------------------------------------------KN
        TIPDKFP PVAEELFDELNGAT+                                                                           KN
Subjt:  TIPDKFPTPVAEELFDELNGATI---------------------------------------------------------------------------KN

Query:  KKDHVEHIEKVFLALRKHTLFTNKKSVTLPEED-RVLGAHHIGEGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGG
        +KDH+EHIEKVFLALRKH+LF NKK     ++    LG    GEGVEVDS+KIKAVADWPCPTNIREVRGFLGL GYYRRFVQHYGSIAAPLTQLLKKGG
Subjt:  KKDHVEHIEKVFLALRKHTLFTNKKSVTLPEED-RVLGAHHIGEGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGG

Query:  FKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTD
        FKWNE+++ESF KLK AMMSLPTLALPNF LPFEIETDASGFGV AVLIQSKRPIAFYSHTLSMRDRARP+YERELMAVVLSVQRWRPYLLGAKFVVKTD
Subjt:  FKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTD

Query:  QKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL
        QKSLKFLLEQ+VIQPQYQKWLSKLLGYSFEVVYKP LENKA  AL
Subjt:  QKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL

KAA0046241.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0067.04Show/hide
Query:  MLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAV----------WGKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE--------
        MLLMIMETIAKDRTT  ERN EPNVH SVTNKGK+KEATSSK A+           GKTET+E   DRNKFKK+EMPVFA EDPDSWLFRAE        
Subjt:  MLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAV----------WGKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE--------

Query:  -------------------------------------------------------------------------------SDVPDPVVEDTFMNGLFPWIR
                                                                                       SDV D VVED FMNGLFPWIR
Subjt:  -------------------------------------------------------------------------------SDVPDPVVEDTFMNGLFPWIR

Query:  AEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTTSSRGTTNTISDNKANTTFPMRTITMRSSNNAEIRKDTNSRRLPDAEFQARKEK
        AEV+ CRPKGLAEMME+AQLVENRE+IRNEANLN FAGGKY PQNT ++RGTTN +SDNK N TFPMRTIT+RSSNNAE+RKDTNSRRLPDAEFQARKEK
Subjt:  AEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTTSSRGTTNTISDNKANTTFPMRTITMRSSNNAEIRKDTNSRRLPDAEFQARKEK

Query:  GLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEKTLALLQVEEEQKTFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHN
        GLCFRCNE YSADHKCKMKELRELKMFVVIKEGEEYEIIEEN  EEKTLA+LQVEEE K FAELSLNSVVGLNDPGT+KV+GKLQ REVIILIDCGATHN
Subjt:  GLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEKTLALLQVEEEQKTFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHN

Query:  FISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWKVKEEFLPLELGGVDVVL-----------VVDWKNFTLTFPNEGKQISIKGDPS
        FIS+KLV SLQLSIKETAHYGVIL SGT +QGKGVCE+VEIQL NW+VKEEFLPLELGGVDVVL           VVDWKN TLTF +EGKQI +KGDPS
Subjt:  FISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWKVKEEFLPLELGGVDVVL-----------VVDWKNFTLTFPNEGKQISIKGDPS

Query:  LTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIEDESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLKEGTNPINVRPYKYG------
        LT SRISLKSMFKTW+DQDE FLIECRAIQV +ENEQ++T I   E ESLQ +LKQF DVFDWPEKL PRR IEHQIHLKEGTNPINVRPY+YG      
Subjt:  LTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIEDESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLKEGTNPINVRPYKYG------

Query:  -----------------PVLLVKKKDGSWRFCVNYRAVNNTTIPDKFPTPVAEELFDELNGATI------------------------------------
                          VLLVKKKDGSWRFCV+Y AVNN TIPDKFP PV EELFDELNGAT+                                    
Subjt:  -----------------PVLLVKKKDGSWRFCVNYRAVNNTTIPDKFPTPVAEELFDELNGATI------------------------------------

Query:  ---------------------------------------KNKKDHVEHIEKVFLALRKHTLFTNKKSVTLPEED-RVLGAHHIGEGVEVDSEKIKAVADW
                                               KN+KDHVEHIEKVFL LR+H LF NKK  +  ++    LG    GEGVEV SEKIKA ADW
Subjt:  ---------------------------------------KNKKDHVEHIEKVFLALRKHTLFTNKKSVTLPEED-RVLGAHHIGEGVEVDSEKIKAVADW

Query:  PCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGGFKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVLIQSKRPIAFYS
        PCPTNIREVRGFLGL GYYRRFVQHYGSIAAPLTQLLKKGGFKW+EE+E++FLKLK+AMMSLP LALP+FEL FEIETDASGFGVGAVL+QS+RPIAFYS
Subjt:  PCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGGFKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVLIQSKRPIAFYS

Query:  HTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL
        HTLSMRDRARP+YERELMAVVLSVQRWR YLLGAKF+VKTDQKSLKFLLEQRVIQPQYQKW+SKLLGYSFEVVYKP LENKA  AL
Subjt:  HTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL

TYK02195.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0070.05Show/hide
Query:  MVQTRIEERLEVIDQEIALMKKELGKMPIIELSLNDIAKNMQTMRAQSDKQEQMLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAV--
        MVQTRIEERLEV+DQEIAL+KKELGKMP I+L+LNDIAKNMQTMR QSDKQEQM+LMIMETIAKDRTT  ERN++PN HMSVTNKGKEKEA SSKSAV  
Subjt:  MVQTRIEERLEVIDQEIALMKKELGKMPIIELSLNDIAKNMQTMRAQSDKQEQMLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAV--

Query:  --------WGKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE-------------------------------------------------------
                 GKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE                                                       
Subjt:  --------WGKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE-------------------------------------------------------

Query:  --------------------------------SDVPDPVVEDTFMNGLFPWIRAEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTT
                                        SDVPDPVVEDTFMNGLFPWIRAEV +CRPKGLA+ ME AQLVENREI RNEANLN+FAGGKY  QNT 
Subjt:  --------------------------------SDVPDPVVEDTFMNGLFPWIRAEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTT

Query:  SSRGTTNTISDNKANTTFPMRTITMRSSNNAEIRKDTNSRRLPDAEFQARKEKGLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEK
        +SR   NT SD+K NT FPMRTIT+RSSNNAEIRKD NSRRLPDAEFQARKEKGLCFRCNEKYSADH+CKMKELRELKMFVV KEGEEYEIIEENT EEK
Subjt:  SSRGTTNTISDNKANTTFPMRTITMRSSNNAEIRKDTNSRRLPDAEFQARKEKGLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEK

Query:  TLALLQVEEEQKTFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHNFISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWK
        TL L Q EE+QKTFAELSLNSVVGLNDPGTMKVKGK+QEREVIILIDCGATHNFIS+KLV+SLQL +KET HYGVILGSGTAVQGKG+CENVEIQL+NWK
Subjt:  TLALLQVEEEQKTFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHNFISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWK

Query:  VKEEFLPLELGGVDVVL-----------VVDWKNFTLTFPNEGKQISIKGDPSLTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIED
        VKEEFLPLELGGVDVVL           VVDWKN TLTF +EGKQISIKGDPSLT SRISLKSM KTW++QDE FLIECRA+QVH ENEQ+ TA+ TI D
Subjt:  VKEEFLPLELGGVDVVL-----------VVDWKNFTLTFPNEGKQISIKGDPSLTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIED

Query:  ESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLKEGTNPINVRPYKYG-----------------------------PVLLVKKKDGSWRFCVNYRAVNNT
        E LQN+LKQF DVFDWPE+LPPRRGIEHQIHLKEGTNPINVRPY+YG                             PVLLVKKKDGSWRFCV+YRAVNN 
Subjt:  ESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLKEGTNPINVRPYKYG-----------------------------PVLLVKKKDGSWRFCVNYRAVNNT

Query:  TIPDKFPTPVAEELFDELNGATI---------------------------------------------------------------------------KN
        TIPDKFP PVAEELFDELNGAT+                                                                           KN
Subjt:  TIPDKFPTPVAEELFDELNGATI---------------------------------------------------------------------------KN

Query:  KKDHVEHIEKVFLALRKHTLFTNKKSVTLPEED-RVLGAHHIGEGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGG
        +KDH+EHIEKVFLALRKH+LF NKK     ++    LG    GEGVEVDSEKIKAVADWPCPTNIREVRGFLGL GYYRRFVQHYGSIAAPLTQLLKKGG
Subjt:  KKDHVEHIEKVFLALRKHTLFTNKKSVTLPEED-RVLGAHHIGEGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGG

Query:  FKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTD
        FKWNE++EESF KLK AMMSLPTLALPNF LPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARP+YERELMAVVLSVQRWRPYLLGAKFVVKTD
Subjt:  FKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTD

Query:  QKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL
        QKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKA  AL
Subjt:  QKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL

TYK18957.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0067.04Show/hide
Query:  MLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAV----------WGKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE--------
        MLLMIMETIAKDRTT  ERN EPNVH SVTNKGK+KEATSSK A+           GKTET+E   DRNKFKK+EMPVFA EDPDSWLFRAE        
Subjt:  MLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAV----------WGKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE--------

Query:  -------------------------------------------------------------------------------SDVPDPVVEDTFMNGLFPWIR
                                                                                       SDV D VVED FMNGLFPWIR
Subjt:  -------------------------------------------------------------------------------SDVPDPVVEDTFMNGLFPWIR

Query:  AEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTTSSRGTTNTISDNKANTTFPMRTITMRSSNNAEIRKDTNSRRLPDAEFQARKEK
        AEV+ CRPKGLAEMME+AQLVENRE+IRNEANLN FAGGKY PQNT ++RGTTN +SDNK N TFPMRTIT+RSSNNAE+RKDTNSRRLPDAEFQARKEK
Subjt:  AEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTTSSRGTTNTISDNKANTTFPMRTITMRSSNNAEIRKDTNSRRLPDAEFQARKEK

Query:  GLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEKTLALLQVEEEQKTFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHN
        GLCFRCNE YSADHKCKMKELRELKMFVVIKEGEEYEIIEEN  EEKTLA+LQVEEE K FAELSLNSVVGLNDPGT+KV+GKLQ REVIILIDCGATHN
Subjt:  GLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEKTLALLQVEEEQKTFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHN

Query:  FISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWKVKEEFLPLELGGVDVVL-----------VVDWKNFTLTFPNEGKQISIKGDPS
        FIS+KLV SLQLSIKETAHYGVIL SGT +QGKGVCE+VEIQL NW+VKEEFLPLELGGVDVVL           VVDWKN TLTF +EGKQI +KGDPS
Subjt:  FISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWKVKEEFLPLELGGVDVVL-----------VVDWKNFTLTFPNEGKQISIKGDPS

Query:  LTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIEDESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLKEGTNPINVRPYKYG------
        LT SRISLKSMFKTW+DQDE FLIECRAIQV +ENEQ++T I   E ESLQ +LKQF DVFDWPEKL PRR IEHQIHLKEGTNPINVRPY+YG      
Subjt:  LTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIEDESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLKEGTNPINVRPYKYG------

Query:  -----------------PVLLVKKKDGSWRFCVNYRAVNNTTIPDKFPTPVAEELFDELNGATI------------------------------------
                          VLLVKKKDGSWRFCV+Y AVNN TIPDKFP PV EELFDELNGAT+                                    
Subjt:  -----------------PVLLVKKKDGSWRFCVNYRAVNNTTIPDKFPTPVAEELFDELNGATI------------------------------------

Query:  ---------------------------------------KNKKDHVEHIEKVFLALRKHTLFTNKKSVTLPEED-RVLGAHHIGEGVEVDSEKIKAVADW
                                               KN+KDHVEHIEKVFL LR+H LF NKK  +  ++    LG    GEGVEV SEKIKA ADW
Subjt:  ---------------------------------------KNKKDHVEHIEKVFLALRKHTLFTNKKSVTLPEED-RVLGAHHIGEGVEVDSEKIKAVADW

Query:  PCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGGFKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVLIQSKRPIAFYS
        PCPTNIREVRGFLGL GYYRRFVQHYGSIAAPLTQLLKKGGFKW+EE+E++FLKLK+AMMSLP LALP+FEL FEIETDASGFGVGAVL+QS+RPIAFYS
Subjt:  PCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGGFKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVLIQSKRPIAFYS

Query:  HTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL
        HTLSMRDRARP+YERELMAVVLSVQRWR YLLGAKF+VKTDQKSLKFLLEQRVIQPQYQKW+SKLLGYSFEVVYKP LENKA  AL
Subjt:  HTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL

TYK23779.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0066.73Show/hide
Query:  MVQTRIEERLEVIDQEIALMKKELGKMPIIELSLNDIAKNMQTMRAQSDKQEQMLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAV--
        MVQTRIEERLEV+DQEIAL+KKELGKMP IEL+LNDIAKNMQTMR QSDKQEQM+LMIMETIAKDRTTT ERN+EPN HMSVTNKGKEKEA SSKSA+  
Subjt:  MVQTRIEERLEVIDQEIALMKKELGKMPIIELSLNDIAKNMQTMRAQSDKQEQMLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAV--

Query:  --------WGKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE-------------------------------------------------------
                 GKTET+EAAADRNKFKKVEMPVFA EDPDSWLFRAE                                                       
Subjt:  --------WGKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE-------------------------------------------------------

Query:  -----SDVPDPVVEDTFMNGLFPWIRAEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTTSSRGTTNTISDNKANTTFPMRTITMRS
             SDVPDPVVEDTFMNGLFPWIRAEV +CRPK LA+ ME AQLVENREI RNEANLN+FAGGKY  QNT +SR   NT SD+K NT FPMRTIT+RS
Subjt:  -----SDVPDPVVEDTFMNGLFPWIRAEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTTSSRGTTNTISDNKANTTFPMRTITMRS

Query:  SNNAEIRKDTNSRRLPDAEFQARKEKGLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEKTLALLQVEEEQKTFAELSLNSVVGLND
        SNNAEI KD NSRRLPDAEFQA+KEKGLCFRCNEKYSADH+CKMKELRELKMFVV KEGEEYEIIEEN  EEKTL   Q EE+QKTFAELSLNSVVGLND
Subjt:  SNNAEIRKDTNSRRLPDAEFQARKEKGLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEKTLALLQVEEEQKTFAELSLNSVVGLND

Query:  PGTMKVKGKLQEREVIILIDCGATHNFISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWKVKEEFLPLELGGVDVVLVVDWKNFTLT
        PGTMKVK K+QEREVIILIDCGATHNFIS+KLVESLQL +KET HYGVILGSGTAVQGKG+CENVEIQL+NWKVKEEF+PLELGGVDVVL          
Subjt:  PGTMKVKGKLQEREVIILIDCGATHNFISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWKVKEEFLPLELGGVDVVLVVDWKNFTLT

Query:  FPNEGKQISIKGDPSLTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIEDESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLKEGTNP
                                  +K                 + K N+ A     +  DE LQN+LKQF DVFDWPE+LPPRRGIEHQIHLKEGT+P
Subjt:  FPNEGKQISIKGDPSLTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIEDESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLKEGTNP

Query:  INVRPYKYG-----------------------------PVLLVKKKDGSWRFCVNYRAVNNTTIPDKFPTPVAEELFDELNGATI---------------
        INVRPY+YG                             PVLLVKKKDGSWRFCV+YR VNN TIPDKFP PVAEELFDELNGATI               
Subjt:  INVRPYKYG-----------------------------PVLLVKKKDGSWRFCVNYRAVNNTTIPDKFPTPVAEELFDELNGATI---------------

Query:  ------------------------------------------------------------KNKKDHVEHIEKVFLALRKHTLFTNKKSVTLPEED-RVLG
                                                                    KN+KDH+EHIEKVFLALRKH+LF NKK     ++    LG
Subjt:  ------------------------------------------------------------KNKKDHVEHIEKVFLALRKHTLFTNKKSVTLPEED-RVLG

Query:  AHHIGEGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGGFKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETD
            GEGVEVDSEKIK VADWPCPTNIREVRGFLGL GYYRRFVQHYGSIAAPLTQLLKKGGFKWNE++EESF KLK AMMSLPTLALPNF LPFEIETD
Subjt:  AHHIGEGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGGFKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETD

Query:  ASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLE
        ASGFGVGAVLIQSKRPIAFYSHTLSMRDRARP+YERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLE
Subjt:  ASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLE

Query:  NKAVGAL
        NKA  AL
Subjt:  NKAVGAL

TrEMBL top hitse value%identityAlignment
A0A5A7TY09 Ty3/gypsy retrotransposon protein0.0e+0067.04Show/hide
Query:  MLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAV----------WGKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE--------
        MLLMIMETIAKDRTT  ERN EPNVH SVTNKGK+KEATSSK A+           GKTET+E   DRNKFKK+EMPVFA EDPDSWLFRAE        
Subjt:  MLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAV----------WGKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE--------

Query:  -------------------------------------------------------------------------------SDVPDPVVEDTFMNGLFPWIR
                                                                                       SDV D VVED FMNGLFPWIR
Subjt:  -------------------------------------------------------------------------------SDVPDPVVEDTFMNGLFPWIR

Query:  AEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTTSSRGTTNTISDNKANTTFPMRTITMRSSNNAEIRKDTNSRRLPDAEFQARKEK
        AEV+ CRPKGLAEMME+AQLVENRE+IRNEANLN FAGGKY PQNT ++RGTTN +SDNK N TFPMRTIT+RSSNNAE+RKDTNSRRLPDAEFQARKEK
Subjt:  AEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTTSSRGTTNTISDNKANTTFPMRTITMRSSNNAEIRKDTNSRRLPDAEFQARKEK

Query:  GLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEKTLALLQVEEEQKTFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHN
        GLCFRCNE YSADHKCKMKELRELKMFVVIKEGEEYEIIEEN  EEKTLA+LQVEEE K FAELSLNSVVGLNDPGT+KV+GKLQ REVIILIDCGATHN
Subjt:  GLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEKTLALLQVEEEQKTFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHN

Query:  FISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWKVKEEFLPLELGGVDVVL-----------VVDWKNFTLTFPNEGKQISIKGDPS
        FIS+KLV SLQLSIKETAHYGVIL SGT +QGKGVCE+VEIQL NW+VKEEFLPLELGGVDVVL           VVDWKN TLTF +EGKQI +KGDPS
Subjt:  FISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWKVKEEFLPLELGGVDVVL-----------VVDWKNFTLTFPNEGKQISIKGDPS

Query:  LTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIEDESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLKEGTNPINVRPYKYG------
        LT SRISLKSMFKTW+DQDE FLIECRAIQV +ENEQ++T I   E ESLQ +LKQF DVFDWPEKL PRR IEHQIHLKEGTNPINVRPY+YG      
Subjt:  LTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIEDESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLKEGTNPINVRPYKYG------

Query:  -----------------PVLLVKKKDGSWRFCVNYRAVNNTTIPDKFPTPVAEELFDELNGATI------------------------------------
                          VLLVKKKDGSWRFCV+Y AVNN TIPDKFP PV EELFDELNGAT+                                    
Subjt:  -----------------PVLLVKKKDGSWRFCVNYRAVNNTTIPDKFPTPVAEELFDELNGATI------------------------------------

Query:  ---------------------------------------KNKKDHVEHIEKVFLALRKHTLFTNKKSVTLPEED-RVLGAHHIGEGVEVDSEKIKAVADW
                                               KN+KDHVEHIEKVFL LR+H LF NKK  +  ++    LG    GEGVEV SEKIKA ADW
Subjt:  ---------------------------------------KNKKDHVEHIEKVFLALRKHTLFTNKKSVTLPEED-RVLGAHHIGEGVEVDSEKIKAVADW

Query:  PCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGGFKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVLIQSKRPIAFYS
        PCPTNIREVRGFLGL GYYRRFVQHYGSIAAPLTQLLKKGGFKW+EE+E++FLKLK+AMMSLP LALP+FEL FEIETDASGFGVGAVL+QS+RPIAFYS
Subjt:  PCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGGFKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVLIQSKRPIAFYS

Query:  HTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL
        HTLSMRDRARP+YERELMAVVLSVQRWR YLLGAKF+VKTDQKSLKFLLEQRVIQPQYQKW+SKLLGYSFEVVYKP LENKA  AL
Subjt:  HTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL

A0A5D3BSP2 Ty3/gypsy retrotransposon protein0.0e+0070.05Show/hide
Query:  MVQTRIEERLEVIDQEIALMKKELGKMPIIELSLNDIAKNMQTMRAQSDKQEQMLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAV--
        MVQTRIEERLEV+DQEIAL+KKELGKMP I+L+LNDIAKNMQTMR QSDKQEQM+LMIMETIAKDRTT  ERN++PN HMSVTNKGKEKEA SSKSAV  
Subjt:  MVQTRIEERLEVIDQEIALMKKELGKMPIIELSLNDIAKNMQTMRAQSDKQEQMLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAV--

Query:  --------WGKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE-------------------------------------------------------
                 GKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE                                                       
Subjt:  --------WGKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE-------------------------------------------------------

Query:  --------------------------------SDVPDPVVEDTFMNGLFPWIRAEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTT
                                        SDVPDPVVEDTFMNGLFPWIRAEV +CRPKGLA+ ME AQLVENREI RNEANLN+FAGGKY  QNT 
Subjt:  --------------------------------SDVPDPVVEDTFMNGLFPWIRAEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTT

Query:  SSRGTTNTISDNKANTTFPMRTITMRSSNNAEIRKDTNSRRLPDAEFQARKEKGLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEK
        +SR   NT SD+K NT FPMRTIT+RSSNNAEIRKD NSRRLPDAEFQARKEKGLCFRCNEKYSADH+CKMKELRELKMFVV KEGEEYEIIEENT EEK
Subjt:  SSRGTTNTISDNKANTTFPMRTITMRSSNNAEIRKDTNSRRLPDAEFQARKEKGLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEK

Query:  TLALLQVEEEQKTFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHNFISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWK
        TL L Q EE+QKTFAELSLNSVVGLNDPGTMKVKGK+QEREVIILIDCGATHNFIS+KLV+SLQL +KET HYGVILGSGTAVQGKG+CENVEIQL+NWK
Subjt:  TLALLQVEEEQKTFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHNFISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWK

Query:  VKEEFLPLELGGVDVVL-----------VVDWKNFTLTFPNEGKQISIKGDPSLTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIED
        VKEEFLPLELGGVDVVL           VVDWKN TLTF +EGKQISIKGDPSLT SRISLKSM KTW++QDE FLIECRA+QVH ENEQ+ TA+ TI D
Subjt:  VKEEFLPLELGGVDVVL-----------VVDWKNFTLTFPNEGKQISIKGDPSLTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIED

Query:  ESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLKEGTNPINVRPYKYG-----------------------------PVLLVKKKDGSWRFCVNYRAVNNT
        E LQN+LKQF DVFDWPE+LPPRRGIEHQIHLKEGTNPINVRPY+YG                             PVLLVKKKDGSWRFCV+YRAVNN 
Subjt:  ESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLKEGTNPINVRPYKYG-----------------------------PVLLVKKKDGSWRFCVNYRAVNNT

Query:  TIPDKFPTPVAEELFDELNGATI---------------------------------------------------------------------------KN
        TIPDKFP PVAEELFDELNGAT+                                                                           KN
Subjt:  TIPDKFPTPVAEELFDELNGATI---------------------------------------------------------------------------KN

Query:  KKDHVEHIEKVFLALRKHTLFTNKKSVTLPEED-RVLGAHHIGEGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGG
        +KDH+EHIEKVFLALRKH+LF NKK     ++    LG    GEGVEVDSEKIKAVADWPCPTNIREVRGFLGL GYYRRFVQHYGSIAAPLTQLLKKGG
Subjt:  KKDHVEHIEKVFLALRKHTLFTNKKSVTLPEED-RVLGAHHIGEGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGG

Query:  FKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTD
        FKWNE++EESF KLK AMMSLPTLALPNF LPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARP+YERELMAVVLSVQRWRPYLLGAKFVVKTD
Subjt:  FKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTD

Query:  QKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL
        QKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKA  AL
Subjt:  QKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL

A0A5D3C468 Peroxidase 640.0e+0063.98Show/hide
Query:  MLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAV----------WGKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE--------
        M+LMIMETIAKDRTTT ERN+EPN HMSVTNKGKEKEA SSKSAV           GKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE        
Subjt:  MLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAV----------WGKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE--------

Query:  -------------------------------------------------------------------------------SDVPDPVVEDTFMNGLFPWIR
                                                                                       SDVPDPV+EDTFMNGLFPWIR
Subjt:  -------------------------------------------------------------------------------SDVPDPVVEDTFMNGLFPWIR

Query:  AEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTTSSRGTTNTISDNKANTTFPMRTITMRSSNNAEIRKDTNSRRLPDAEFQARKEK
        AEVILCRPKGLAEMME AQLVENREI+RNEANLN+ A GKY  QNT ++R + NT SDNK NT FPM+TIT+RSSNNAEIR                   
Subjt:  AEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTTSSRGTTNTISDNKANTTFPMRTITMRSSNNAEIRKDTNSRRLPDAEFQARKEK

Query:  GLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEKTLALLQVEEEQKTFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHN
                                         EEY+IIEEN TEEKTL  LQ EE+QKTFAELSLNSVVGLNDPGTMKVKGKLQE+EVIILIDCGATHN
Subjt:  GLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEKTLALLQVEEEQKTFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHN

Query:  FISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWKVKEEFLPLELGGVDVVLVVDWKNFTLTFPNEGKQISIKGDPSLTISRISLKSM
        FIS+KLVESLQL +KETAHYGVILGSGT VQGKG+CENVEIQLTNWKVKEEFLPLELGGVDVVL ++                 KG PSLT SRISLKSM
Subjt:  FISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWKVKEEFLPLELGGVDVVLVVDWKNFTLTFPNEGKQISIKGDPSLTISRISLKSM

Query:  FKTWLDQDEEFLIECRAIQVHKENEQADTAIATIEDESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLKEGTNPINVRPYKY------------------
         KTW++QDE FLIECRA+QVHKENEQ++TA+ TIED  LQN+LKQF DVFDW EKLPPRR IEHQIHLKEGTNPINVRPY+Y                  
Subjt:  FKTWLDQDEEFLIECRAIQVHKENEQADTAIATIEDESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLKEGTNPINVRPYKY------------------

Query:  -----------GPVLLVKKKDGSWRFCVNYRAVNNTTIPDKFPTPVAEELFDELNGATIKNKKD----------------------HVEHIE--------
                    PVLLVKKKDGSWRFCV+YRAVNN TIPDKFP PVAEELFDELNGAT+ +K D                      H  H E        
Subjt:  -----------GPVLLVKKKDGSWRFCVNYRAVNNTTIPDKFPTPVAEELFDELNGATIKNKKD----------------------HVEHIE--------

Query:  -------------------KVFLALRKHTLFTNKKSVTLPEED-RVLGAHHIGEGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAA
                           +  LALRKHTLF NKK     ++    LG    GEGVEVDSEKIKAVADWPCPTNIREVRGFLGL GYYRRFVQHYGSIAA
Subjt:  -------------------KVFLALRKHTLFTNKKSVTLPEED-RVLGAHHIGEGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAA

Query:  PLTQLLKKGGFKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPIYERELMAVVLSVQRWRPYL
        PLTQLLKK GFKWNE++EESF KLK AMMSLPTLALPNF LPFEIET+ASGFGVGAVLIQ+KRPIAFYSHTLSMRDRARP+YERELMAVVLSVQRWRPYL
Subjt:  PLTQLLKKGGFKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPIYERELMAVVLSVQRWRPYL

Query:  LGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL
        LGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKA  AL
Subjt:  LGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL

A0A5D3D5W7 Ty3/gypsy retrotransposon protein0.0e+0067.04Show/hide
Query:  MLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAV----------WGKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE--------
        MLLMIMETIAKDRTT  ERN EPNVH SVTNKGK+KEATSSK A+           GKTET+E   DRNKFKK+EMPVFA EDPDSWLFRAE        
Subjt:  MLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAV----------WGKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE--------

Query:  -------------------------------------------------------------------------------SDVPDPVVEDTFMNGLFPWIR
                                                                                       SDV D VVED FMNGLFPWIR
Subjt:  -------------------------------------------------------------------------------SDVPDPVVEDTFMNGLFPWIR

Query:  AEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTTSSRGTTNTISDNKANTTFPMRTITMRSSNNAEIRKDTNSRRLPDAEFQARKEK
        AEV+ CRPKGLAEMME+AQLVENRE+IRNEANLN FAGGKY PQNT ++RGTTN +SDNK N TFPMRTIT+RSSNNAE+RKDTNSRRLPDAEFQARKEK
Subjt:  AEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTTSSRGTTNTISDNKANTTFPMRTITMRSSNNAEIRKDTNSRRLPDAEFQARKEK

Query:  GLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEKTLALLQVEEEQKTFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHN
        GLCFRCNE YSADHKCKMKELRELKMFVVIKEGEEYEIIEEN  EEKTLA+LQVEEE K FAELSLNSVVGLNDPGT+KV+GKLQ REVIILIDCGATHN
Subjt:  GLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEKTLALLQVEEEQKTFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHN

Query:  FISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWKVKEEFLPLELGGVDVVL-----------VVDWKNFTLTFPNEGKQISIKGDPS
        FIS+KLV SLQLSIKETAHYGVIL SGT +QGKGVCE+VEIQL NW+VKEEFLPLELGGVDVVL           VVDWKN TLTF +EGKQI +KGDPS
Subjt:  FISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWKVKEEFLPLELGGVDVVL-----------VVDWKNFTLTFPNEGKQISIKGDPS

Query:  LTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIEDESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLKEGTNPINVRPYKYG------
        LT SRISLKSMFKTW+DQDE FLIECRAIQV +ENEQ++T I   E ESLQ +LKQF DVFDWPEKL PRR IEHQIHLKEGTNPINVRPY+YG      
Subjt:  LTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIEDESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLKEGTNPINVRPYKYG------

Query:  -----------------PVLLVKKKDGSWRFCVNYRAVNNTTIPDKFPTPVAEELFDELNGATI------------------------------------
                          VLLVKKKDGSWRFCV+Y AVNN TIPDKFP PV EELFDELNGAT+                                    
Subjt:  -----------------PVLLVKKKDGSWRFCVNYRAVNNTTIPDKFPTPVAEELFDELNGATI------------------------------------

Query:  ---------------------------------------KNKKDHVEHIEKVFLALRKHTLFTNKKSVTLPEED-RVLGAHHIGEGVEVDSEKIKAVADW
                                               KN+KDHVEHIEKVFL LR+H LF NKK  +  ++    LG    GEGVEV SEKIKA ADW
Subjt:  ---------------------------------------KNKKDHVEHIEKVFLALRKHTLFTNKKSVTLPEED-RVLGAHHIGEGVEVDSEKIKAVADW

Query:  PCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGGFKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVLIQSKRPIAFYS
        PCPTNIREVRGFLGL GYYRRFVQHYGSIAAPLTQLLKKGGFKW+EE+E++FLKLK+AMMSLP LALP+FEL FEIETDASGFGVGAVL+QS+RPIAFYS
Subjt:  PCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGGFKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVLIQSKRPIAFYS

Query:  HTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL
        HTLSMRDRARP+YERELMAVVLSVQRWR YLLGAKF+VKTDQKSLKFLLEQRVIQPQYQKW+SKLLGYSFEVVYKP LENKA  AL
Subjt:  HTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL

A0A5D3DKH5 Ty3/gypsy retrotransposon protein0.0e+0066.73Show/hide
Query:  MVQTRIEERLEVIDQEIALMKKELGKMPIIELSLNDIAKNMQTMRAQSDKQEQMLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAV--
        MVQTRIEERLEV+DQEIAL+KKELGKMP IEL+LNDIAKNMQTMR QSDKQEQM+LMIMETIAKDRTTT ERN+EPN HMSVTNKGKEKEA SSKSA+  
Subjt:  MVQTRIEERLEVIDQEIALMKKELGKMPIIELSLNDIAKNMQTMRAQSDKQEQMLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAV--

Query:  --------WGKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE-------------------------------------------------------
                 GKTET+EAAADRNKFKKVEMPVFA EDPDSWLFRAE                                                       
Subjt:  --------WGKTETEEAAADRNKFKKVEMPVFAGEDPDSWLFRAE-------------------------------------------------------

Query:  -----SDVPDPVVEDTFMNGLFPWIRAEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTTSSRGTTNTISDNKANTTFPMRTITMRS
             SDVPDPVVEDTFMNGLFPWIRAEV +CRPK LA+ ME AQLVENREI RNEANLN+FAGGKY  QNT +SR   NT SD+K NT FPMRTIT+RS
Subjt:  -----SDVPDPVVEDTFMNGLFPWIRAEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTTSSRGTTNTISDNKANTTFPMRTITMRS

Query:  SNNAEIRKDTNSRRLPDAEFQARKEKGLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEKTLALLQVEEEQKTFAELSLNSVVGLND
        SNNAEI KD NSRRLPDAEFQA+KEKGLCFRCNEKYSADH+CKMKELRELKMFVV KEGEEYEIIEEN  EEKTL   Q EE+QKTFAELSLNSVVGLND
Subjt:  SNNAEIRKDTNSRRLPDAEFQARKEKGLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEKTLALLQVEEEQKTFAELSLNSVVGLND

Query:  PGTMKVKGKLQEREVIILIDCGATHNFISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWKVKEEFLPLELGGVDVVLVVDWKNFTLT
        PGTMKVK K+QEREVIILIDCGATHNFIS+KLVESLQL +KET HYGVILGSGTAVQGKG+CENVEIQL+NWKVKEEF+PLELGGVDVVL          
Subjt:  PGTMKVKGKLQEREVIILIDCGATHNFISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWKVKEEFLPLELGGVDVVLVVDWKNFTLT

Query:  FPNEGKQISIKGDPSLTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIEDESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLKEGTNP
                                  +K                 + K N+ A     +  DE LQN+LKQF DVFDWPE+LPPRRGIEHQIHLKEGT+P
Subjt:  FPNEGKQISIKGDPSLTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIEDESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLKEGTNP

Query:  INVRPYKYG-----------------------------PVLLVKKKDGSWRFCVNYRAVNNTTIPDKFPTPVAEELFDELNGATI---------------
        INVRPY+YG                             PVLLVKKKDGSWRFCV+YR VNN TIPDKFP PVAEELFDELNGATI               
Subjt:  INVRPYKYG-----------------------------PVLLVKKKDGSWRFCVNYRAVNNTTIPDKFPTPVAEELFDELNGATI---------------

Query:  ------------------------------------------------------------KNKKDHVEHIEKVFLALRKHTLFTNKKSVTLPEED-RVLG
                                                                    KN+KDH+EHIEKVFLALRKH+LF NKK     ++    LG
Subjt:  ------------------------------------------------------------KNKKDHVEHIEKVFLALRKHTLFTNKKSVTLPEED-RVLG

Query:  AHHIGEGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGGFKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETD
            GEGVEVDSEKIK VADWPCPTNIREVRGFLGL GYYRRFVQHYGSIAAPLTQLLKKGGFKWNE++EESF KLK AMMSLPTLALPNF LPFEIETD
Subjt:  AHHIGEGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGGFKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETD

Query:  ASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLE
        ASGFGVGAVLIQSKRPIAFYSHTLSMRDRARP+YERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLE
Subjt:  ASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLE

Query:  NKAVGAL
        NKA  AL
Subjt:  NKAVGAL

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.0e-3434.69Show/hide
Query:  DHVEHIEKVFLALRKHTL-FTNKKSVTLPEEDRVLGAHHIGEGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGG--
        +H++ +  VF  L K  L     K   L +E   LG     +G++ + EKI+A+  +P PT  +E++ FLGL GYYR+F+ ++  IA P+T+ LKK    
Subjt:  DHVEHIEKVFLALRKHTL-FTNKKSVTLPEEDRVLGAHHIGEGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGG--

Query:  FKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTD
           N E + +F KLKY +   P L +P+F   F + TDAS   +GAVL Q   P+++ S TL+  +      E+EL+A+V + + +R YLLG  F + +D
Subjt:  FKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTD

Query:  QKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL
         + L +L   +    +  +W  KL  + F++ Y  G EN    AL
Subjt:  QKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL

P10401 Retrovirus-related Pol polyprotein from transposon gypsy9.6e-3333.33Show/hide
Query:  KNKKDHVEHIEKVFLALRKHTL-FTNKKSVTLPEEDRVLGAHHIGEGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLL--
        +N+ DHV HI+ V   L    +  + +K+    E    LG     +G + D EK+KA+ ++P P  + +VR FLGL  YYR F++ + +IA P+T +L  
Subjt:  KNKKDHVEHIEKVFLALRKHTL-FTNKKSVTLPEEDRVLGAHHIGEGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLL--

Query:  ----------KKGGFKWNEESEESFLKLKYAMMSLPT-LALPNFELPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPIYERELMAVVLSVQR
                  KK   ++NE    +F +L+  + S    L  P+F+ PF++ TDAS  G+GAVL Q  RPI   S TL   ++     EREL+A+V ++ +
Subjt:  ----------KKGGFKWNEESEESFLKLKYAMMSLPT-LALPNFELPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPIYERELMAVVLSVQR

Query:  WRPYLLGAKFV-VKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL
         + +L G++ + + TD + L F +  R    + ++W S +  ++ +V YKPG EN    AL
Subjt:  WRPYLLGAKFV-VKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL

P20825 Retrovirus-related Pol polyprotein from transposon 2974.4e-3828.57Show/hide
Query:  IEHQIH--LKEGTNPINVRPYKYGPVLLVKKKDGS----WRFCVNYRAVNNTTIPDKFPTPVAEELF-------------------------DELNGATI
        +E+Q+   L +G    +  PY     ++ KK D S    +R  ++YR +N  TIPD++P P  +E+                          + ++    
Subjt:  IEHQIH--LKEGTNPINVRPYKYGPVLLVKKKDGS----WRFCVNYRAVNNTTIPDKFPTPVAEELF-------------------------DELNGATI

Query:  KNKKDHVEHIEKVF------------------LALRKHTL-----------------------FTN----------KKSVTLPEEDRVLGAHHIGEGVEV
          K  H E++   F                    L KH L                       FT            K   L +E   LG     +G++ 
Subjt:  KNKKDHVEHIEKVF------------------LALRKHTL-----------------------FTN----------KKSVTLPEEDRVLGAHHIGEGVEV

Query:  DSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGGFKWNEESE--ESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGA
        +  K+KA+  +P PT  +E+R FLGL GYYR+F+ +Y  IA P+T  LKK      ++ E  E+F KLK  ++  P L LP+FE  F + TDAS   +GA
Subjt:  DSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGGFKWNEESE--ESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGA

Query:  VLIQSKRPIAFYSHTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL
        VL Q+  PI+F S TL+  +      E+EL+A+V + + +R YLLG +F++ +D + L++L   +    + ++W  +L  Y F++ Y  G EN    AL
Subjt:  VLIQSKRPIAFYSHTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.4e-3125.74Show/hide
Query:  PVLLVKKKDGSWRFCVNYRAVNNTTIPDKFPTPVAEELFDELNGATI-----------------------------------------------------
        PV+LV KKDG++R CV+YR +N  TI D FP P  + L   +  A I                                                     
Subjt:  PVLLVKKKDGSWRFCVNYRAVNNTTIPDKFPTPVAEELFDELNGATI-----------------------------------------------------

Query:  --------------------KNKKDHVEHIEKVFLALRKHTLFTNKKSVTLPEEDRVLGAHHIG-EGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYY
                            ++ ++H +H++ V   L+   L   KK      E+     + IG + +     K  A+ D+P P  +++ + FLG+I YY
Subjt:  --------------------KNKKDHVEHIEKVFLALRKHTLFTNKKSVTLPEEDRVLGAHHIG-EGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYY

Query:  RRFVQHYGSIAAPLTQLLKKGGFKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVL--IQSKRP----IAFYSHTLSMRDRARPIY
        RRF+ +   IA P+ QL      +W E+ +++  KLK A+ + P L   N +  + + TDAS  G+GAVL  + +K      + ++S +L    +  P  
Subjt:  RRFVQHYGSIAAPLTQLLKKGGFKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVL--IQSKRP----IAFYSHTLSMRDRARPIY

Query:  ERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL
        E EL+ ++ ++  +R  L G  F ++TD  SL  L  +     + Q+WL  L  Y F + Y  G +N    A+
Subjt:  ERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus9.3e-3630.03Show/hide
Query:  HQIHLKEGTNPINVRPYKYGPVLLVKKKDGSWRFCVNYRAVNNTTIPDKFPTPVAEELFDELNGATI--KNKKDHVEHIEKVFLALRKHTLFTN-KKSVT
        HQIH+KE   P        G    ++   G      N  A+    I D     + +  +  ++   +  ++   H +++  V  +L K  L  N +KS  
Subjt:  HQIHLKEGTNPINVRPYKYGPVLLVKKKDGSWRFCVNYRAVNNTTIPDKFPTPVAEELFDELNGATI--KNKKDHVEHIEKVFLALRKHTLFTN-KKSVT

Query:  LPEEDRVLGAHHIGEGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLK------------KGGFKWNEESEESFLKLKYA
        L  +   LG     +G++ D +K++A+++ P PT+++E++ FLG+  YYR+F+Q Y  +A PLT L +            K     +E + +SF  LK  
Subjt:  LPEEDRVLGAHHIGEGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLK------------KGGFKWNEESEESFLKLKYA

Query:  MMSLPTLALPNFELPFEIETDASGFGVGAVLIQS----KRPIAFYSHTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFV-VKTDQKSLKFLLEQRV
        + S   LA P F  PF + TDAS + +GAVL Q      RPIA+ S +L+  +      E+E++A++ S+   R YL GA  + V TD + L F L  R 
Subjt:  MMSLPTLALPNFELPFEIETDASGFGVGAVLIQS----KRPIAFYSHTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFV-VKTDQKSLKFLLEQRV

Query:  IQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL
           + ++W +++  Y+ E++YKPG  N    AL
Subjt:  IQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL

Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein4.0e-1024Show/hide
Query:  LLQVEEEQKTFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHNFISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWKVKE
        L ++E++  T  +     V+ L     M+  G + + +V++ ID GAT NFI  +L  SL+L    T    V+LG    +Q  G C  + + +   ++ E
Subjt:  LLQVEEEQKTFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHNFISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWKVKE

Query:  EFLPLELG--GVDVVL-----------VVDWKNFTLTFPNEGKQISIKGDPSLTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIEDE
         FL L+L    VDV+L           +V+W+N   +F +  + I++  +    + +++ K   K+  +Q++        I+  + N+     ++ +ED+
Subjt:  EFLPLELG--GVDVVL-----------VVDWKNFTLTFPNEGKQISIKGDPSLTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIEDE

AT3G30770.1 Eukaryotic aspartyl protease family protein5.8e-0925.13Show/hide
Query:  EEQKTFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHNFISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWKVKEEFLPL
        E+ KT  ++   S         M+  G +   +V+++ID GAT+NFIS +L   L+L    T    V+LG    +Q  G C  + + +   ++ E FL L
Subjt:  EEQKTFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHNFISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWKVKEEFLPL

Query:  ELGGVDVVLVVDWKNFTLTFPNEGKQISIKGDPSLTISRISLKSMFKTWLDQDEEFLIECRAIQV---HKENEQADTAI---ATIEDESLQNILK
        +L   DV +++ +          G Q              +L+  +  WL+QD  F    + + +    KE EQ  T +   +  E E + + L+
Subjt:  ELGGVDVVLVVDWKNFTLTFPNEGKQISIKGDPSLTISRISLKSMFKTWLDQDEEFLIECRAIQV---HKENEQADTAI---ATIEDESLQNILK

ATMG00860.1 DNA/RNA polymerases superfamily protein6.2e-2745.8Show/hide
Query:  VEHIEKVFLALRKHTLFTNKKSVTLPE-EDRVLGAHHI--GEGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGGFK
        + H+  V     +H  + N+K     + +   LG  HI  GEGV  D  K++A+  WP P N  E+RGFLGL GYYRRFV++YG I  PLT+LLKK   K
Subjt:  VEHIEKVFLALRKHTLFTNKKSVTLPE-EDRVLGAHHI--GEGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGGFK

Query:  WNEESEESFLKLKYAMMSLPTLALPNFELPF
        W E +  +F  LK A+ +LP LALP+ +LPF
Subjt:  WNEESEESFLKLKYAMMSLPTLALPNFELPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTACAAACTAGGATTGAAGAAAGACTCGAAGTAATCGATCAAGAAATAGCACTGATGAAGAAAGAACTCGGGAAGATGCCGATCATCGAATTAAGCCTGAAC
GATATTGCGAAGAATATGCAGACGATGCGAGCCCAGTCGGACAAACAAGAGCAGATGCTGTTGATGATAATGGAAACTATTGCCAAGGATCGTACCACAACCGGT
GAACGCAACAAAGAACCTAATGTGCACATGTCTGTGACGAATAAAGGAAAAGAAAAGGAAGCAACCTCCAGCAAATCGGCCGTATGGGGAAAAACAGAGACCGAA
GAAGCTGCAGCAGACCGGAATAAATTCAAGAAAGTTGAAATGCCCGTGTTCGCCGGAGAAGACCCAGACTCATGGCTCTTCCGCGCTGAAAGTGATGTCCCGGAT
CCTGTTGTCGAAGACACTTTCATGAATGGCTTGTTCCCATGGATAAGAGCAGAGGTGATACTTTGTCGACCAAAAGGATTGGCCGAAATGATGGAAATCGCGCAA
TTGGTTGAGAACAGAGAGATCATACGAAACGAAGCCAATTTGAATAGCTTCGCGGGCGGGAAATACCTTCCGCAAAATACAACAAGTAGCAGAGGAACGACAAAC
ACTATCAGCGATAACAAAGCGAATACCACCTTCCCAATGCGAACCATCACTATGAGGAGCTCTAATAATGCCGAAATCCGTAAAGACACCAACTCAAGAAGATTG
CCCGACGCAGAATTCCAAGCTCGAAAAGAAAAAGGACTCTGTTTTAGATGCAACGAAAAATACTCAGCTGATCATAAATGCAAAATGAAGGAACTACGCGAGTTG
AAGATGTTTGTTGTGATAAAAGAAGGCGAAGAATATGAGATAATTGAAGAAAATACTACCGAAGAAAAAACACTAGCACTACTGCAAGTAGAAGAAGAGCAGAAG
ACTTTTGCAGAATTATCCCTAAACTCTGTTGTTGGATTGAACGATCCTGGCACAATGAAAGTTAAAGGAAAGCTTCAAGAGAGAGAGGTGATCATTCTCATAGAT
TGTGGGGCCACTCACAATTTTATTTCCAAAAAATTAGTAGAATCACTACAATTGTCGATCAAGGAAACTGCCCACTATGGTGTTATACTAGGATCCGGAACAGCG
GTACAAGGTAAAGGAGTGTGTGAAAATGTAGAAATTCAACTAACGAATTGGAAGGTGAAGGAAGAATTTCTACCGTTAGAACTCGGAGGGGTTGATGTTGTGCTG
GTGGTTGATTGGAAAAACTTTACACTTACGTTTCCTAACGAAGGGAAACAGATCTCAATAAAAGGAGACCCTAGCCTTACTATATCACGAATCAGTCTCAAAAGT
ATGTTCAAAACTTGGCTTGATCAAGATGAAGAATTTTTGATCGAATGTAGAGCAATACAAGTTCATAAAGAAAATGAACAGGCCGATACAGCAATTGCAACCATT
GAAGATGAATCCCTGCAAAACATACTGAAACAGTTTGGAGATGTGTTTGACTGGCCGGAAAAGCTCCCACCAAGAAGAGGAATTGAACATCAAATACATCTCAAA
GAAGGGACTAACCCAATAAACGTTAGACCCTATAAATATGGCCCAGTACTGTTGGTTAAGAAGAAAGATGGCAGCTGGCGTTTCTGTGTCAATTATCGGGCAGTA
AATAACACTACAATTCCAGACAAGTTCCCAACACCAGTGGCTGAAGAACTATTTGATGAGCTAAATGGCGCAACGATCAAGAATAAGAAAGACCACGTAGAGCAT
ATTGAAAAGGTCTTCCTCGCATTGAGAAAACATACCCTATTTACCAATAAAAAAAGTGTAACTTTGCCAGAAGAAGATAGAGTACTTGGGGCACATCATATCGGA
GAAGGAGTAGAAGTAGATTCTGAAAAAATCAAAGCAGTTGCGGATTGGCCATGCCCAACAAACATAAGAGAAGTCCGGGGATTCCTTGGATTAATAGGCTACTAT
AGGAGATTTGTCCAACATTATGGGTCAATAGCGGCCCCATTAACTCAACTCCTCAAAAAGGGAGGGTTCAAATGGAATGAAGAATCTGAAGAATCATTTCTAAAG
TTGAAATATGCAATGATGTCTCTACCAACATTGGCCTTACCCAACTTTGAACTTCCGTTTGAAATAGAAACAGATGCATCCGGTTTTGGAGTGGGAGCCGTGTTA
ATCCAGTCAAAGAGACCCATTGCCTTCTATAGCCATACACTATCCATGAGAGATAGAGCCCGACCTATATATGAACGCGAGTTGATGGCTGTTGTACTATCCGTA
CAAAGATGGAGACCATACCTACTTGGGGCAAAGTTCGTAGTAAAAACAGATCAGAAATCACTAAAATTCTTGCTGGAACAGAGGGTCATCCAACCACAATATCAA
AAATGGCTATCTAAACTACTGGGATATTCGTTTGAAGTGGTTTACAAACCGGGCTTAGAAAATAAGGCAGTCGGTGCCTTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTACAAACTAGGATTGAAGAAAGACTCGAAGTAATCGATCAAGAAATAGCACTGATGAAGAAAGAACTCGGGAAGATGCCGATCATCGAATTAAGCCTGAAC
GATATTGCGAAGAATATGCAGACGATGCGAGCCCAGTCGGACAAACAAGAGCAGATGCTGTTGATGATAATGGAAACTATTGCCAAGGATCGTACCACAACCGGT
GAACGCAACAAAGAACCTAATGTGCACATGTCTGTGACGAATAAAGGAAAAGAAAAGGAAGCAACCTCCAGCAAATCGGCCGTATGGGGAAAAACAGAGACCGAA
GAAGCTGCAGCAGACCGGAATAAATTCAAGAAAGTTGAAATGCCCGTGTTCGCCGGAGAAGACCCAGACTCATGGCTCTTCCGCGCTGAAAGTGATGTCCCGGAT
CCTGTTGTCGAAGACACTTTCATGAATGGCTTGTTCCCATGGATAAGAGCAGAGGTGATACTTTGTCGACCAAAAGGATTGGCCGAAATGATGGAAATCGCGCAA
TTGGTTGAGAACAGAGAGATCATACGAAACGAAGCCAATTTGAATAGCTTCGCGGGCGGGAAATACCTTCCGCAAAATACAACAAGTAGCAGAGGAACGACAAAC
ACTATCAGCGATAACAAAGCGAATACCACCTTCCCAATGCGAACCATCACTATGAGGAGCTCTAATAATGCCGAAATCCGTAAAGACACCAACTCAAGAAGATTG
CCCGACGCAGAATTCCAAGCTCGAAAAGAAAAAGGACTCTGTTTTAGATGCAACGAAAAATACTCAGCTGATCATAAATGCAAAATGAAGGAACTACGCGAGTTG
AAGATGTTTGTTGTGATAAAAGAAGGCGAAGAATATGAGATAATTGAAGAAAATACTACCGAAGAAAAAACACTAGCACTACTGCAAGTAGAAGAAGAGCAGAAG
ACTTTTGCAGAATTATCCCTAAACTCTGTTGTTGGATTGAACGATCCTGGCACAATGAAAGTTAAAGGAAAGCTTCAAGAGAGAGAGGTGATCATTCTCATAGAT
TGTGGGGCCACTCACAATTTTATTTCCAAAAAATTAGTAGAATCACTACAATTGTCGATCAAGGAAACTGCCCACTATGGTGTTATACTAGGATCCGGAACAGCG
GTACAAGGTAAAGGAGTGTGTGAAAATGTAGAAATTCAACTAACGAATTGGAAGGTGAAGGAAGAATTTCTACCGTTAGAACTCGGAGGGGTTGATGTTGTGCTG
GTGGTTGATTGGAAAAACTTTACACTTACGTTTCCTAACGAAGGGAAACAGATCTCAATAAAAGGAGACCCTAGCCTTACTATATCACGAATCAGTCTCAAAAGT
ATGTTCAAAACTTGGCTTGATCAAGATGAAGAATTTTTGATCGAATGTAGAGCAATACAAGTTCATAAAGAAAATGAACAGGCCGATACAGCAATTGCAACCATT
GAAGATGAATCCCTGCAAAACATACTGAAACAGTTTGGAGATGTGTTTGACTGGCCGGAAAAGCTCCCACCAAGAAGAGGAATTGAACATCAAATACATCTCAAA
GAAGGGACTAACCCAATAAACGTTAGACCCTATAAATATGGCCCAGTACTGTTGGTTAAGAAGAAAGATGGCAGCTGGCGTTTCTGTGTCAATTATCGGGCAGTA
AATAACACTACAATTCCAGACAAGTTCCCAACACCAGTGGCTGAAGAACTATTTGATGAGCTAAATGGCGCAACGATCAAGAATAAGAAAGACCACGTAGAGCAT
ATTGAAAAGGTCTTCCTCGCATTGAGAAAACATACCCTATTTACCAATAAAAAAAGTGTAACTTTGCCAGAAGAAGATAGAGTACTTGGGGCACATCATATCGGA
GAAGGAGTAGAAGTAGATTCTGAAAAAATCAAAGCAGTTGCGGATTGGCCATGCCCAACAAACATAAGAGAAGTCCGGGGATTCCTTGGATTAATAGGCTACTAT
AGGAGATTTGTCCAACATTATGGGTCAATAGCGGCCCCATTAACTCAACTCCTCAAAAAGGGAGGGTTCAAATGGAATGAAGAATCTGAAGAATCATTTCTAAAG
TTGAAATATGCAATGATGTCTCTACCAACATTGGCCTTACCCAACTTTGAACTTCCGTTTGAAATAGAAACAGATGCATCCGGTTTTGGAGTGGGAGCCGTGTTA
ATCCAGTCAAAGAGACCCATTGCCTTCTATAGCCATACACTATCCATGAGAGATAGAGCCCGACCTATATATGAACGCGAGTTGATGGCTGTTGTACTATCCGTA
CAAAGATGGAGACCATACCTACTTGGGGCAAAGTTCGTAGTAAAAACAGATCAGAAATCACTAAAATTCTTGCTGGAACAGAGGGTCATCCAACCACAATATCAA
AAATGGCTATCTAAACTACTGGGATATTCGTTTGAAGTGGTTTACAAACCGGGCTTAGAAAATAAGGCAGTCGGTGCCTTATGA
Protein sequenceShow/hide protein sequence
MVQTRIEERLEVIDQEIALMKKELGKMPIIELSLNDIAKNMQTMRAQSDKQEQMLLMIMETIAKDRTTTGERNKEPNVHMSVTNKGKEKEATSSKSAVWGKTETE
EAAADRNKFKKVEMPVFAGEDPDSWLFRAESDVPDPVVEDTFMNGLFPWIRAEVILCRPKGLAEMMEIAQLVENREIIRNEANLNSFAGGKYLPQNTTSSRGTTN
TISDNKANTTFPMRTITMRSSNNAEIRKDTNSRRLPDAEFQARKEKGLCFRCNEKYSADHKCKMKELRELKMFVVIKEGEEYEIIEENTTEEKTLALLQVEEEQK
TFAELSLNSVVGLNDPGTMKVKGKLQEREVIILIDCGATHNFISKKLVESLQLSIKETAHYGVILGSGTAVQGKGVCENVEIQLTNWKVKEEFLPLELGGVDVVL
VVDWKNFTLTFPNEGKQISIKGDPSLTISRISLKSMFKTWLDQDEEFLIECRAIQVHKENEQADTAIATIEDESLQNILKQFGDVFDWPEKLPPRRGIEHQIHLK
EGTNPINVRPYKYGPVLLVKKKDGSWRFCVNYRAVNNTTIPDKFPTPVAEELFDELNGATIKNKKDHVEHIEKVFLALRKHTLFTNKKSVTLPEEDRVLGAHHIG
EGVEVDSEKIKAVADWPCPTNIREVRGFLGLIGYYRRFVQHYGSIAAPLTQLLKKGGFKWNEESEESFLKLKYAMMSLPTLALPNFELPFEIETDASGFGVGAVL
IQSKRPIAFYSHTLSMRDRARPIYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAVGAL