; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0108531 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0108531
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr04:28080261..28081388
RNA-Seq ExpressionCmc04g0108531
SyntenyCmc04g0108531
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]2.5e-17982.37Show/hide
Query:  MTKSIRILLSIATFYDYEIW-----TIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKT
        M KSIRILLSIATFY+YEIW     T F NGNLEESIYMVQ EGF+ + QEQKVCKLQ  IYGLKQ SRSWNIRFD AIKSYGFEQNV+ PCVYK+I+ +
Subjt:  MTKSIRILLSIATFYDYEIW-----TIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKT

Query:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTP
         V FL+LYVDDILLIGND+ +LTD+K+WL TQFQMK+LG AQY+LG QIV NRKNKTLAMSQ SYIDK+LSRYKM NSKKG L +R+GIHLSK+QCP TP
Subjt:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTP

Query:  QEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL
        QEVEDM NI Y+SAVGSLMYAMLCTRPDICY VGIVSRYQSNP RDHWT VKNILKYLRRT++YMLVYG+KD ILTGYTDSDFQ+DKDARKSTSGS+FTL
Subjt:  QEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL

Query:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE
        NGGAVVWRS+KQ+CI +STMEAEYVAACEAAKE VWL+KFLTDLEVVPNMHLPITLYCDNSGAVANS+E RSHKRGKHIE
Subjt:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]7.8e-17380.26Show/hide
Query:  MTKSIRILLSIATFYDYEIW-----TIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKT
        M KSIRILLSIATFYDYEIW     T F NGNLEESI+M Q EGF+ +GQEQKVCKL   IYGLKQ SRSWNIRFD AIKSYGF+QNV+ PCVYK+I K 
Subjt:  MTKSIRILLSIATFYDYEIW-----TIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKT

Query:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTP
         V FLVLYVDDILLIGND+G+LTD+K WLA QFQMK+LG AQYVLG QI+ +RKNKTLA+SQ +YIDK+L RY M NSKKGLL +R+G+HLSK+Q P TP
Subjt:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTP

Query:  QEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL
        QEVEDM  I YASAVGSLMYAMLCTRPDICY VGIVSRYQSNP  DHWT VK +LKYLRRT+DYMLVYG+KD ILTGYTDSDFQTDKD+RKSTSGS+FTL
Subjt:  QEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL

Query:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE
        NGGAVVWRSIKQ CI +STMEAEYVAACEAAKE VWL+KFL DLEVVPNM+LPITLYCDNSGAVANS+E RSHKRGKHIE
Subjt:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-17079.74Show/hide
Query:  MTKSIRILLSIATFYDYEIW-----TIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKT
        M KSIRILLSIA FYDYEIW     T F NGNLEESI+M Q EGF+ +GQEQKVCKL   IYGLKQ SRSWNIRFD AIKSYGF+QNV+ PCVYK+I K 
Subjt:  MTKSIRILLSIATFYDYEIW-----TIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKT

Query:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTP
         V FLVLYVDDILLIGND+G+LTD+K WLA QFQMK+LG  QYVLG QI+ +RKNKTLA+SQ +YIDK+L RY M NSKKGLL +R+G+HLSK+Q P TP
Subjt:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTP

Query:  QEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL
        QEVEDM  I YASAVGSLMYAMLCTRPDICY VGIVSRYQSNP  DHWT VK ILKYLRRT+DYMLVYG+KD ILTGYT+SDFQTDKD+RKSTS S+FTL
Subjt:  QEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL

Query:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE
        NGGAVVWRSIKQ CI +STMEAEYVAACEAAKE VWLKKFL DLEVVPNM+LPITLYCDNSGAVANS+E RSHKRGKHIE
Subjt:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]7.8e-17380.26Show/hide
Query:  MTKSIRILLSIATFYDYEIW-----TIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKT
        M KSIRILLSIATFYDYEIW     T F NGNLEESI+M Q EGF+ +GQEQKVCKL   IYGLKQ SRSWNIRFD AIKSYGF+QNV+ PCVYK+I K 
Subjt:  MTKSIRILLSIATFYDYEIW-----TIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKT

Query:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTP
         V FLVLYVDDILLIGND+G+LTD+K WLA QFQMK+LG AQYVLG QI+ +RKNKTLA+SQ +YIDK+L RY M NSKKGLL +R+G+HLSK+Q P TP
Subjt:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTP

Query:  QEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL
        QEVEDM  I YASAVGSLMYAMLCTRPDICY VGIVSRYQSNP  DHWT VK +LKYLRRT+DYMLVYG+KD ILTGYTDSDFQTDKD+RKSTSGS+FTL
Subjt:  QEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL

Query:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE
        NGGAVVWRSIKQ CI +STMEAEYVAACEAAKE VWL+KFL DLEVVPNM+LPITLYCDNSGAVANS+E RSHKRGKHIE
Subjt:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa]7.0e-18285.53Show/hide
Query:  MTKSIRILLSIATFYDYEIW-----TIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKT
        M KSIRILLSIATFYDYEIW     T F N NLEESIYMVQ E F+QKGQEQK+CKLQ  IYGLKQ SRS NIRFD AIKSYG EQNV+ PCVYKRI+ +
Subjt:  MTKSIRILLSIATFYDYEIW-----TIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKT

Query:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTP
        TV FLVLYVDDILLIGND+GHL DIK+WLA QFQMK+LGNAQYVLG QIV NRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLL YRYGIHLSK+QCP TP
Subjt:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTP

Query:  QEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL
        QEVEDMSNI YASAVGSLMY MLCTRP+ICY VGIVSR QS P RDHWTTVKNILKYLRRTKDYMLVYGSKD ILTGYTD  FQTDKDARKSTSG +FT+
Subjt:  QEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL

Query:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE
        NGGAVVWRSIKQSCI +STMEAEYVA CEAAKE VWLKKFLTDLEVVPNMHLP TLYCDNSGAV NSRE RSHKRGKHIE
Subjt:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein6.0e-17179.74Show/hide
Query:  MTKSIRILLSIATFYDYEIW-----TIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKT
        M KSIRILLSIA FYDYEIW     T F NGNLEESI+M Q EGF+ +GQEQKVCKL   IYGLKQ SRSWNIRFD AIKSYGF+QNV+ PCVYK+I K 
Subjt:  MTKSIRILLSIATFYDYEIW-----TIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKT

Query:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTP
         V FLVLYVDDILLIGND+G+LTD+K WLA QFQMK+LG  QYVLG QI+ +RKNKTLA+SQ +YIDK+L RY M NSKKGLL +R+G+HLSK+Q P TP
Subjt:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTP

Query:  QEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL
        QEVEDM  I YASAVGSLMYAMLCTRPDICY VGIVSRYQSNP  DHWT VK ILKYLRRT+DYMLVYG+KD ILTGYT+SDFQTDKD+RKSTS S+FTL
Subjt:  QEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL

Query:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE
        NGGAVVWRSIKQ CI +STMEAEYVAACEAAKE VWLKKFL DLEVVPNM+LPITLYCDNSGAVANS+E RSHKRGKHIE
Subjt:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE

A0A5A7TZD0 Gag/pol protein3.8e-17380.26Show/hide
Query:  MTKSIRILLSIATFYDYEIW-----TIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKT
        M KSIRILLSIATFYDYEIW     T F NGNLEESI+M Q EGF+ +GQEQKVCKL   IYGLKQ SRSWNIRFD AIKSYGF+QNV+ PCVYK+I K 
Subjt:  MTKSIRILLSIATFYDYEIW-----TIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKT

Query:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTP
         V FLVLYVDDILLIGND+G+LTD+K WLA QFQMK+LG AQYVLG QI+ +RKNKTLA+SQ +YIDK+L RY M NSKKGLL +R+G+HLSK+Q P TP
Subjt:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTP

Query:  QEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL
        QEVEDM  I YASAVGSLMYAMLCTRPDICY VGIVSRYQSNP  DHWT VK +LKYLRRT+DYMLVYG+KD ILTGYTDSDFQTDKD+RKSTSGS+FTL
Subjt:  QEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL

Query:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE
        NGGAVVWRSIKQ CI +STMEAEYVAACEAAKE VWL+KFL DLEVVPNM+LPITLYCDNSGAVANS+E RSHKRGKHIE
Subjt:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE

A0A5A7UYE8 Gag/pol protein3.8e-17380.26Show/hide
Query:  MTKSIRILLSIATFYDYEIW-----TIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKT
        M KSIRILLSIATFYDYEIW     T F NGNLEESI+M Q EGF+ +GQEQKVCKL   IYGLKQ SRSWNIRFD AIKSYGF+QNV+ PCVYK+I K 
Subjt:  MTKSIRILLSIATFYDYEIW-----TIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKT

Query:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTP
         V FLVLYVDDILLIGND+G+LTD+K WLA QFQMK+LG AQYVLG QI+ +RKNKTLA+SQ +YIDK+L RY M NSKKGLL +R+G+HLSK+Q P TP
Subjt:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTP

Query:  QEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL
        QEVEDM  I YASAVGSLMYAMLCTRPDICY VGIVSRYQSNP  DHWT VK +LKYLRRT+DYMLVYG+KD ILTGYTDSDFQTDKD+RKSTSGS+FTL
Subjt:  QEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL

Query:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE
        NGGAVVWRSIKQ CI +STMEAEYVAACEAAKE VWL+KFL DLEVVPNM+LPITLYCDNSGAVANS+E RSHKRGKHIE
Subjt:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE

A0A5D3BX45 Gag/pol protein3.4e-18285.53Show/hide
Query:  MTKSIRILLSIATFYDYEIW-----TIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKT
        M KSIRILLSIATFYDYEIW     T F N NLEESIYMVQ E F+QKGQEQK+CKLQ  IYGLKQ SRS NIRFD AIKSYG EQNV+ PCVYKRI+ +
Subjt:  MTKSIRILLSIATFYDYEIW-----TIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKT

Query:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTP
        TV FLVLYVDDILLIGND+GHL DIK+WLA QFQMK+LGNAQYVLG QIV NRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLL YRYGIHLSK+QCP TP
Subjt:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTP

Query:  QEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL
        QEVEDMSNI YASAVGSLMY MLCTRP+ICY VGIVSR QS P RDHWTTVKNILKYLRRTKDYMLVYGSKD ILTGYTD  FQTDKDARKSTSG +FT+
Subjt:  QEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL

Query:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE
        NGGAVVWRSIKQSCI +STMEAEYVA CEAAKE VWLKKFLTDLEVVPNMHLP TLYCDNSGAV NSRE RSHKRGKHIE
Subjt:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE

E2GK51 Gag/pol protein (Fragment)1.2e-17982.37Show/hide
Query:  MTKSIRILLSIATFYDYEIW-----TIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKT
        M KSIRILLSIATFY+YEIW     T F NGNLEESIYMVQ EGF+ + QEQKVCKLQ  IYGLKQ SRSWNIRFD AIKSYGFEQNV+ PCVYK+I+ +
Subjt:  MTKSIRILLSIATFYDYEIW-----TIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKT

Query:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTP
         V FL+LYVDDILLIGND+ +LTD+K+WL TQFQMK+LG AQY+LG QIV NRKNKTLAMSQ SYIDK+LSRYKM NSKKG L +R+GIHLSK+QCP TP
Subjt:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTP

Query:  QEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL
        QEVEDM NI Y+SAVGSLMYAMLCTRPDICY VGIVSRYQSNP RDHWT VKNILKYLRRT++YMLVYG+KD ILTGYTDSDFQ+DKDARKSTSGS+FTL
Subjt:  QEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL

Query:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE
        NGGAVVWRS+KQ+CI +STMEAEYVAACEAAKE VWL+KFLTDLEVVPNMHLPITLYCDNSGAVANS+E RSHKRGKHIE
Subjt:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.9e-5032.99Show/hide
Query:  SIRILLSIATFYD-----YEIWTIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVY---KRIIKT
        S R +LS+   Y+      ++ T F NG L+E IYM   +G         VCKL   IYGLKQ +R W   F+ A+K   F  +    C+Y   K  I  
Subjt:  SIRILLSIATFYD-----YEIWTIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVY---KRIIKT

Query:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHN----SKKGLLLYRYGIHLSKKQC
         + +++LYVDD+++   D+  + + K +L  +F+M +L   ++ +G +I    +   + +SQ++Y+ K+LS++ M N    S        Y +  S + C
Subjt:  TVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHN----SKKGLLLYRYGIHLSKKQC

Query:  PNTPQEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSK---DPILTGYTDSDFQTDKDARKST
         NTP            S +G LMY MLCTRPD+   V I+SRY S    + W  +K +L+YL+ T D  L++      +  + GY DSD+   +  RKST
Subjt:  PNTPQEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSK---DPILTGYTDSDFQTDKDARKST

Query:  SGSIFTL-NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE
        +G +F + +   + W + +Q+ +  S+ EAEY+A  EA +E +WLK  LT + +   +  PI +Y DN G ++ +     HKR KHI+
Subjt:  SGSIFTL-NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-8946.03Show/hide
Query:  SIRILLSIATFYDYE-----IWTIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVY-KRIIKTTV
        SIR +LS+A   D E     + T F +G+LEE IYM Q EGF   G++  VCKL   +YGLKQ  R W ++FD  +KS  + +  + PCVY KR  +   
Subjt:  SIRILLSIATFYDYE-----IWTIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVY-KRIIKTTV

Query:  TFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTPQE
          L+LYVDD+L++G D G +  +K  L+  F MK+LG AQ +LG +IV  R ++ L +SQ  YI+++L R+ M N+K         + LSKK CP T +E
Subjt:  TFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTPQE

Query:  VEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTLNG
          +M+ + Y+SAVGSLMYAM+CTRPDI + VG+VSR+  NP ++HW  VK IL+YLR T    L +G  DPIL GYTD+D   D D RKS++G +FT +G
Subjt:  VEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTLNG

Query:  GAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE
        GA+ W+S  Q C+  ST EAEY+AA E  KE++WLK+FL +L +    ++   +YCD+  A+  S+ S  H R KHI+
Subjt:  GAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE

P25600 Putative transposon Ty5-1 protein YCL074W6.5e-2929.94Show/hide
Query:  EIWTIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKTTVTFLVLYVDDILLIGNDIGHL
        ++ T F N  ++E IY+ Q  GF+ +     V +L   +YGLKQ    WN   +  +K  GF ++     +Y R       ++ +YVDD+L+        
Subjt:  EIWTIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKTTVTFLVLYVDDILLIGNDIGHL

Query:  TDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTPQEVEDMSNILYASAVGSLMYAM
          +K+ L   + MK+LG     LG  I     N  + +S   YI K  S  +++  K    L +  +  SK     T   ++D++   Y S VG L++  
Subjt:  TDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTPQEVEDMSNILYASAVGSLMYAM

Query:  LCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPI-LTGYTDSDFQTDKDARKSTSGSIFTLNGGAVVWRSIK-QSCIVESTM
           RPDI Y V ++SR+   PR  H  + + +L+YL  T+   L Y S   + LT Y D+      D   ST G +  L G  V W S K +  I   + 
Subjt:  LCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPI-LTGYTDSDFQTDKDARKSTSGSIFTLNGGAVVWRSIK-QSCIVESTM

Query:  EAEYVAACEAAKEV
        EAEY+ A E   E+
Subjt:  EAEYVAACEAAKEV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-4131.83Show/hide
Query:  SIRILLSIATFYDYEIWTI-----FFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKTTVT
        SIRI+L +A    + I  +     F  G L + +YM Q  GF+ K +   VCKL+  +YGLKQ  R+W +     + + GF  +V+   ++      ++ 
Subjt:  SIRILLSIATFYDYEIWTI-----FFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKTTVT

Query:  FLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTPQEV
        ++++YVDDIL+ GND   L +  + L+ +F +K+     Y LG +    R    L +SQ  YI  +L+R  M  +K           LS           
Subjt:  FLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTPQEV

Query:  EDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPI-LTGYTDSDFQTDKDARKSTSGSIFTLNG
        E      Y   VGSL Y +  TRPDI Y V  +S++   P  +H   +K IL+YL  T ++ +     + + L  Y+D+D+  DKD   ST+G I  L  
Subjt:  EDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPI-LTGYTDSDFQTDKDARKSTSGSIFTLNG

Query:  GAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHI
          + W S KQ  +V S+ EAEY +    + E+ W+   LT+L +   +  P  +YCDN GA         H R KHI
Subjt:  GAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.3e-4431.56Show/hide
Query:  SIRILLSIATFYDYEIWTI-----FFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKTTVT
        SIRI+L +A    + I  +     F  G L + +YM Q  GF+ K +   VC+L+  IYGLKQ  R+W +     + + GF  +++   ++      ++ 
Subjt:  SIRILLSIATFYDYEIWTI-----FFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKTTVT

Query:  FLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTPQEV
        ++++YVDDIL+ GND   L    + L+ +F +K   +  Y LG +    R  + L +SQ  Y   +L+R  M  +K           L+       P   
Subjt:  FLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTPQEV

Query:  EDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPI-LTGYTDSDFQTDKDARKSTSGSIFTLNG
        E      Y   VGSL Y +  TRPD+ Y V  +S+Y   P  DHW  +K +L+YL  T D+ +     + + L  Y+D+D+  D D   ST+G I  L  
Subjt:  EDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPI-LTGYTDSDFQTDKDARKSTSGSIFTLNG

Query:  GAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHI
          + W S KQ  +V S+ EAEY +    + E+ W+   LT+L +   +  P  +YCDN GA         H R KHI
Subjt:  GAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 87.3e-4430.1Show/hide
Query:  SIRILLSIATFYDY-----EIWTIFFNGNLEESIYMVQSEGFLQKGQE----QKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIK
        S++++L+I+  Y++     +I   F NG+L+E IYM    G+  +  +      VC L+  IYGLKQ SR W ++F + +  +GF Q+ +    + +I  
Subjt:  SIRILLSIATFYDY-----EIWTIFFNGNLEESIYMVQSEGFLQKGQE----QKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIK

Query:  TTVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNT
        T    +++YVDDI++  N+   + ++K  L + F++++LG  +Y LG +I   R    + + Q  Y   +L    +   K   +     +  S     ++
Subjt:  TTVTFLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNT

Query:  PQEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPI-LTGYTDSDFQTDKDARKSTSGSIF
          +  D     Y   +G LMY  + TR DI + V  +S++   PR  H   V  IL Y++ T    L Y S+  + L  ++D+ FQ+ KD R+ST+G   
Subjt:  PQEVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPI-LTGYTDSDFQTDKDARKSTSGSIF

Query:  TLNGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE
         L    + W+S KQ  + +S+ EAEY A   A  E++WL +F  +L++   +  P  L+CDN+ A+  +  +  H+R KHIE
Subjt:  TLNGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVWLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE

ATMG00810.1 DNA/RNA polymerases superfamily protein2.4e-1830.51Show/hide
Query:  FLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSK--KGLLLYRYGIHLSKKQCPNTPQ
        +L+LYVDDILL G+    L  +   L++ F MK+LG   Y LG QI  +     L +SQT Y +++L+   M + K     L  +    +S  + P+ P 
Subjt:  FLVLYVDDILLIGNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSK--KGLLLYRYGIHLSKKQCPNTPQ

Query:  EVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDY-MLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL
        +        + S VG+L Y  L TRPDI Y V IV +    P    +  +K +L+Y++ T  + + ++ +    +  + DSD+      R+ST+G    L
Subjt:  EVEDMSNILYASAVGSLMYAMLCTRPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDY-MLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTL

Query:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVW
            + W + +Q  +  S+ E EY A    A E+ W
Subjt:  NGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVVW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAAAGTCAATAAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGACAATCTTTTTTAATGGTAATCTTGAAGAGAGTATTTATATGGTCCAATC
AGAGGGATTTTTACAAAAAGGTCAAGAACAAAAGGTTTGTAAGCTTCAAAATTTTATTTATGGATTAAAACAAACTTCTAGATCCTGGAATATAAGGTTTGATATTGCGA
TCAAATCTTATGGTTTTGAACAAAATGTTAATGGACCTTGTGTTTACAAAAGGATCATCAAAACTACTGTAACATTCTTAGTTCTGTATGTAGATGACATTCTACTCATT
GGGAATGATATAGGTCATCTAACTGATATTAAGGAATGGCTAGCTACGCAATTCCAAATGAAAAATTTAGGAAATGCACAATATGTTCTTGGTTTCCAAATAGTTTGGAA
CCGAAAGAACAAGACACTAGCCATGTCTCAAACATCTTATATAGACAAAATGTTGTCAAGATATAAGATGCATAATTCCAAAAAAGGTCTGCTGCTGTACAGATATGGAA
TTCATTTATCAAAAAAACAATGTCCAAATACACCTCAAGAAGTTGAGGATATGAGTAACATTCTCTATGCTTCTGCTGTTGGGAGCCTGATGTATGCAATGTTATGTACT
AGACCTGACATTTGCTATTTAGTGGGAATAGTTAGTAGATATCAGTCCAATCCTAGACGTGATCATTGGACAACCGTTAAGAATATTCTAAAATATCTTAGAAGAACAAA
AGATTACATGCTTGTGTATGGTTCTAAGGATCCGATCCTTACTGGATACACTGACTCCGATTTTCAAACTGATAAAGATGCTAGAAAGTCTACATCAGGATCAATTTTCA
CTTTGAACGGAGGAGCAGTAGTATGGAGAAGCATAAAACAATCTTGTATTGTCGAATCCACTATGGAAGCTGAATATGTAGCTGCCTGTGAAGCTGCGAAAGAAGTAGTA
TGGCTTAAAAAGTTCTTAACAGATTTGGAAGTTGTTCCAAATATGCATCTGCCAATCACCTTATATTGTGATAATAGTGGTGCAGTTGCAAATTCACGAGAATCTAGAAG
TCATAAACGAGGAAAGCACATTGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGACAAAGTCAATAAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGACAATCTTTTTTAATGGTAATCTTGAAGAGAGTATTTATATGGTCCAATC
AGAGGGATTTTTACAAAAAGGTCAAGAACAAAAGGTTTGTAAGCTTCAAAATTTTATTTATGGATTAAAACAAACTTCTAGATCCTGGAATATAAGGTTTGATATTGCGA
TCAAATCTTATGGTTTTGAACAAAATGTTAATGGACCTTGTGTTTACAAAAGGATCATCAAAACTACTGTAACATTCTTAGTTCTGTATGTAGATGACATTCTACTCATT
GGGAATGATATAGGTCATCTAACTGATATTAAGGAATGGCTAGCTACGCAATTCCAAATGAAAAATTTAGGAAATGCACAATATGTTCTTGGTTTCCAAATAGTTTGGAA
CCGAAAGAACAAGACACTAGCCATGTCTCAAACATCTTATATAGACAAAATGTTGTCAAGATATAAGATGCATAATTCCAAAAAAGGTCTGCTGCTGTACAGATATGGAA
TTCATTTATCAAAAAAACAATGTCCAAATACACCTCAAGAAGTTGAGGATATGAGTAACATTCTCTATGCTTCTGCTGTTGGGAGCCTGATGTATGCAATGTTATGTACT
AGACCTGACATTTGCTATTTAGTGGGAATAGTTAGTAGATATCAGTCCAATCCTAGACGTGATCATTGGACAACCGTTAAGAATATTCTAAAATATCTTAGAAGAACAAA
AGATTACATGCTTGTGTATGGTTCTAAGGATCCGATCCTTACTGGATACACTGACTCCGATTTTCAAACTGATAAAGATGCTAGAAAGTCTACATCAGGATCAATTTTCA
CTTTGAACGGAGGAGCAGTAGTATGGAGAAGCATAAAACAATCTTGTATTGTCGAATCCACTATGGAAGCTGAATATGTAGCTGCCTGTGAAGCTGCGAAAGAAGTAGTA
TGGCTTAAAAAGTTCTTAACAGATTTGGAAGTTGTTCCAAATATGCATCTGCCAATCACCTTATATTGTGATAATAGTGGTGCAGTTGCAAATTCACGAGAATCTAGAAG
TCATAAACGAGGAAAGCACATTGAATGA
Protein sequenceShow/hide protein sequence
MTKSIRILLSIATFYDYEIWTIFFNGNLEESIYMVQSEGFLQKGQEQKVCKLQNFIYGLKQTSRSWNIRFDIAIKSYGFEQNVNGPCVYKRIIKTTVTFLVLYVDDILLI
GNDIGHLTDIKEWLATQFQMKNLGNAQYVLGFQIVWNRKNKTLAMSQTSYIDKMLSRYKMHNSKKGLLLYRYGIHLSKKQCPNTPQEVEDMSNILYASAVGSLMYAMLCT
RPDICYLVGIVSRYQSNPRRDHWTTVKNILKYLRRTKDYMLVYGSKDPILTGYTDSDFQTDKDARKSTSGSIFTLNGGAVVWRSIKQSCIVESTMEAEYVAACEAAKEVV
WLKKFLTDLEVVPNMHLPITLYCDNSGAVANSRESRSHKRGKHIE