; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc02g0051991 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc02g0051991
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCMiso1.1chr02:19067879..19069091
RNA-Seq ExpressionCmc02g0051991
SyntenyCmc02g0051991
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU21337.1 hypothetical protein TSUD_189240 [Trifolium subterraneum]6.8e-16974.06Show/hide
Query:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV
        MDVK AFLN DLEEEIYMEQPEGF++H QE+KVCKLDKSLYGLKQAPKQWHEKFDNL++S GF++NESDKCIYYK +G +C IICLYVDDMLIFGSN   
Subjt:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV

Query:  INDVKSMLSANFDMKDLGEAD----------------------------YNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIA
        +N+VKS+L  NFDMKDLGEA                             Y Y+D KP CTPYD SVKLFKNT DSV Q+EYASIIGSLRYA DCTRPDIA
Subjt:  INDVKSMLSANFDMKDLGEAD----------------------------YNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIA

Query:  YAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGA
        Y VGLLCRFTSRPS EH +AIERVMRYLK+T NLGLHY +FP VLEGYSDADWN+LSDDSKATSG+IF+I GGAV+WKSKKQTILAQSTMESEMIALA A
Subjt:  YAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGA

Query:  SEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSERMRLKP
        SEEASWLR LLSEIP W++P+PA+LIHC+STA IAK +NR+YNGKRRQIRRKHST+REL+TTGAV VD+V +++NLADPLTKGL REKV  +S +M LKP
Subjt:  SEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSERMRLKP

Query:  I
        +
Subjt:  I

GAU47690.1 hypothetical protein TSUD_245810 [Trifolium subterraneum]2.1e-17074.19Show/hide
Query:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV
        MDVK AFLN DLEEEIYMEQPEGF++H QE+KVCKLDKSLYGLKQAPKQWHEKFDNL++S GF++NESDKCIYYK +G +C IICLYVDDMLIFGSN   
Subjt:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV

Query:  INDVKSMLSANFDMKDLGEAD----------------------------YNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIA
        +N+VKS+L  NFDMKDLGEA                             Y YFD KP CTPYD SVKLFKNTSDSV Q+EYASIIGSLRYA DCTRPDIA
Subjt:  INDVKSMLSANFDMKDLGEAD----------------------------YNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIA

Query:  YAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGA
        Y VGLLCRFTSRPS EHW+AIERVMRYLK+T NLGLHY +FP VLEGYSDADWN+LSDDSKATSG+IF+I GGAV+WKSKKQTILAQSTME+EMIALA A
Subjt:  YAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGA

Query:  SEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSERMRLKP
        SEEASWLR LLSEIP W++P+PA+LIHCDSTA IAK +NR+YNGKRRQIRRKH+T+REL+TTGAV VD+V +++ LADPLTKGLAREKV  +S +M L P
Subjt:  SEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSERMRLKP

Query:  IKT
          T
Subjt:  IKT

GAU49932.1 hypothetical protein TSUD_408340 [Trifolium subterraneum]4.9e-16774.55Show/hide
Query:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV
        MDVK AFLN DLEEEIYMEQPEGF++H QE+KVCKLDKSLYGLKQAPKQWHEKFDNL++S GF++NESD CIYYK +G +C IICLYVDDMLIFGSN   
Subjt:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV

Query:  INDVKSMLSANFDMKDLGEAD----------------------------YNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIA
        +N+VKS+L  NFDMKDLGEA                             Y YFD KP CTPYD SVKLFKN  DSV Q+EYASIIGSLRYA DCTRPDIA
Subjt:  INDVKSMLSANFDMKDLGEAD----------------------------YNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIA

Query:  YAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGA
        Y VGLLCRFTSRPS EHW+AIERVMRYLK+T NLGLHY +FP VLEGYSDADWN+LSDDSKATSG+IF+I GGAV+WKSKKQTILAQSTME EMIALA A
Subjt:  YAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGA

Query:  SEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSS
        SEEASWLR LLSEIP W++P+PA+LIHCDSTA IAK +NR+YNGKRRQIRRKHST+REL+TTGAV VD+V +++NLAD LTKGLAREKV  +S
Subjt:  SEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSS

KAA0058878.1 putative polyprotein [Cucumis melo var. makuwa]3.5e-18181.84Show/hide
Query:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV
        MDVK  FLN DLEEEIYMEQPE FIVH QE KVCKLDKSLYGL+QAPKQ HEKFD LLMSKGFKVNESDKC+YYKT+GRLCIIICLYVDDML F SN HV
Subjt:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV

Query:  INDVKSMLSANFDMKDLGEAD----------------------------YNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIA
        INDVKSMLS NFDMKDLGEAD                            YNYFDSKP CTPYDSSVKLFKNT D+VNQSEY SIIGSLRY ADCTRPDIA
Subjt:  INDVKSMLSANFDMKDLGEAD----------------------------YNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIA

Query:  YAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGA
        YAVGLLCRFTSRP LEHWNAIER+MRYLKKTQNLGLHYNKF TVLEGY+D DWNSLSDDSKATSGYIFNI GG VAWKSKKQTILAQS MESEMIALA A
Subjt:  YAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGA

Query:  SEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSERMRLKP
        S+EASWL+SLLSEIPTW+R IPAILIHCDST  IAK QN YYNGKRRQIRRKHSTIRELLT GAVIVDYV SDNNLADPLTK LAREKVFK+SERM L  
Subjt:  SEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSERMRLKP

Query:  IK
        +K
Subjt:  IK

TYJ98069.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.3e-18388.8Show/hide
Query:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV
        MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTE +       Y++  L        
Subjt:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV

Query:  INDVKSMLSANFDMKDLGEADYNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIAYAVGLLCRFTSRPSLEHWNAIERVMRYL
                             YNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIAYAVGLLCRFTSRPSLEHWNAIERVMRYL
Subjt:  INDVKSMLSANFDMKDLGEADYNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIAYAVGLLCRFTSRPSLEHWNAIERVMRYL

Query:  KKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGASEEASWLRSLLSEIPTWKRPIPAILIHC
        KKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGASEEASWLRSLLSEIPTWKRPIPAILIHC
Subjt:  KKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGASEEASWLRSLLSEIPTWKRPIPAILIHC

Query:  DSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSERMRLKPIKT
        DSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSERMRLKPIKT
Subjt:  DSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSERMRLKPIKT

TrEMBL top hitse value%identityAlignment
A0A2Z6MWZ1 Reverse transcriptase Ty1/copia-type domain-containing protein3.3e-16974.06Show/hide
Query:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV
        MDVK AFLN DLEEEIYMEQPEGF++H QE+KVCKLDKSLYGLKQAPKQWHEKFDNL++S GF++NESDKCIYYK +G +C IICLYVDDMLIFGSN   
Subjt:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV

Query:  INDVKSMLSANFDMKDLGEAD----------------------------YNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIA
        +N+VKS+L  NFDMKDLGEA                             Y Y+D KP CTPYD SVKLFKNT DSV Q+EYASIIGSLRYA DCTRPDIA
Subjt:  INDVKSMLSANFDMKDLGEAD----------------------------YNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIA

Query:  YAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGA
        Y VGLLCRFTSRPS EH +AIERVMRYLK+T NLGLHY +FP VLEGYSDADWN+LSDDSKATSG+IF+I GGAV+WKSKKQTILAQSTMESEMIALA A
Subjt:  YAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGA

Query:  SEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSERMRLKP
        SEEASWLR LLSEIP W++P+PA+LIHC+STA IAK +NR+YNGKRRQIRRKHST+REL+TTGAV VD+V +++NLADPLTKGL REKV  +S +M LKP
Subjt:  SEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSERMRLKP

Query:  I
        +
Subjt:  I

A0A2Z6PC97 CCHC-type domain-containing protein1.0e-17074.19Show/hide
Query:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV
        MDVK AFLN DLEEEIYMEQPEGF++H QE+KVCKLDKSLYGLKQAPKQWHEKFDNL++S GF++NESDKCIYYK +G +C IICLYVDDMLIFGSN   
Subjt:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV

Query:  INDVKSMLSANFDMKDLGEAD----------------------------YNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIA
        +N+VKS+L  NFDMKDLGEA                             Y YFD KP CTPYD SVKLFKNTSDSV Q+EYASIIGSLRYA DCTRPDIA
Subjt:  INDVKSMLSANFDMKDLGEAD----------------------------YNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIA

Query:  YAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGA
        Y VGLLCRFTSRPS EHW+AIERVMRYLK+T NLGLHY +FP VLEGYSDADWN+LSDDSKATSG+IF+I GGAV+WKSKKQTILAQSTME+EMIALA A
Subjt:  YAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGA

Query:  SEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSERMRLKP
        SEEASWLR LLSEIP W++P+PA+LIHCDSTA IAK +NR+YNGKRRQIRRKH+T+REL+TTGAV VD+V +++ LADPLTKGLAREKV  +S +M L P
Subjt:  SEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSERMRLKP

Query:  IKT
          T
Subjt:  IKT

A0A2Z6PHW1 CCHC-type domain-containing protein2.4e-16774.55Show/hide
Query:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV
        MDVK AFLN DLEEEIYMEQPEGF++H QE+KVCKLDKSLYGLKQAPKQWHEKFDNL++S GF++NESD CIYYK +G +C IICLYVDDMLIFGSN   
Subjt:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV

Query:  INDVKSMLSANFDMKDLGEAD----------------------------YNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIA
        +N+VKS+L  NFDMKDLGEA                             Y YFD KP CTPYD SVKLFKN  DSV Q+EYASIIGSLRYA DCTRPDIA
Subjt:  INDVKSMLSANFDMKDLGEAD----------------------------YNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIA

Query:  YAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGA
        Y VGLLCRFTSRPS EHW+AIERVMRYLK+T NLGLHY +FP VLEGYSDADWN+LSDDSKATSG+IF+I GGAV+WKSKKQTILAQSTME EMIALA A
Subjt:  YAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGA

Query:  SEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSS
        SEEASWLR LLSEIP W++P+PA+LIHCDSTA IAK +NR+YNGKRRQIRRKHST+REL+TTGAV VD+V +++NLAD LTKGLAREKV  +S
Subjt:  SEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSS

A0A5D3BGA0 Retrovirus-related Pol polyprotein from transposon TNT 1-946.2e-18488.8Show/hide
Query:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV
        MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTE +       Y++  L        
Subjt:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV

Query:  INDVKSMLSANFDMKDLGEADYNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIAYAVGLLCRFTSRPSLEHWNAIERVMRYL
                             YNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIAYAVGLLCRFTSRPSLEHWNAIERVMRYL
Subjt:  INDVKSMLSANFDMKDLGEADYNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIAYAVGLLCRFTSRPSLEHWNAIERVMRYL

Query:  KKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGASEEASWLRSLLSEIPTWKRPIPAILIHC
        KKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGASEEASWLRSLLSEIPTWKRPIPAILIHC
Subjt:  KKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGASEEASWLRSLLSEIPTWKRPIPAILIHC

Query:  DSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSERMRLKPIKT
        DSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSERMRLKPIKT
Subjt:  DSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSERMRLKPIKT

A0A5D3DJH9 Putative polyprotein1.7e-18181.84Show/hide
Query:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV
        MDVK  FLN DLEEEIYMEQPE FIVH QE KVCKLDKSLYGL+QAPKQ HEKFD LLMSKGFKVNESDKC+YYKT+GRLCIIICLYVDDML F SN HV
Subjt:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV

Query:  INDVKSMLSANFDMKDLGEAD----------------------------YNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIA
        INDVKSMLS NFDMKDLGEAD                            YNYFDSKP CTPYDSSVKLFKNT D+VNQSEY SIIGSLRY ADCTRPDIA
Subjt:  INDVKSMLSANFDMKDLGEAD----------------------------YNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIA

Query:  YAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGA
        YAVGLLCRFTSRP LEHWNAIER+MRYLKKTQNLGLHYNKF TVLEGY+D DWNSLSDDSKATSGYIFNI GG VAWKSKKQTILAQS MESEMIALA A
Subjt:  YAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGA

Query:  SEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSERMRLKP
        S+EASWL+SLLSEIPTW+R IPAILIHCDST  IAK QN YYNGKRRQIRRKHSTIRELLT GAVIVDYV SDNNLADPLTK LAREKVFK+SERM L  
Subjt:  SEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSERMRLKP

Query:  IK
        +K
Subjt:  IK

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.7e-5033.17Show/hide
Query:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRL--CIIICLYVDDMLIFGSNF
        MDVK AFLN  L+EEIYM  P+G  +      VCKL+K++YGLKQA + W E F+  L    F  +  D+CIY   +G +   I + LYVDD++I   + 
Subjt:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRL--CIIICLYVDDMLIFGSNF

Query:  HVINDVKSMLSANFDMKDLGE----------------------------ADYNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPD
          +N+ K  L   F M DL E                            + +N  +     TP  S +      SD    +   S+IG L Y   CTRPD
Subjt:  HVINDVKSMLSANFDMKDLGE----------------------------ADYNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPD

Query:  IAYAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNK---FPTVLEGYSDADWNSLSDDSKATSGYIFNI-EGGAVAWKSKKQTILAQSTMESEM
        +  AV +L R++S+ + E W  ++RV+RYLK T ++ L + K   F   + GY D+DW     D K+T+GY+F + +   + W +K+Q  +A S+ E+E 
Subjt:  IAYAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNK---FPTVLEGYSDADWNSLSDDSKATSGYIFNI-EGGAVAWKSKKQTILAQSTMESEM

Query:  IALAGASEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSE
        +AL  A  EA WL+ LL+ I   K   P I I+ D+   I+ A N   + + + I  K+   RE +    + ++Y+ ++N LAD  TK L   +  +  +
Subjt:  IALAGASEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSE

Query:  RMRL
        ++ L
Subjt:  RMRL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.1e-7139.1Show/hide
Query:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYK--TEGRLCIIICLYVDDMLIFGSNF
        +DVK AFL+ DLEEEIYMEQPEGF V  ++  VCKL+KSLYGLKQAP+QW+ KFD+ + S+ +    SD C+Y+K  +E    II+ LYVDDMLI G + 
Subjt:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYK--TEGRLCIIICLYVDDMLIFGSNF

Query:  HVINDVKSMLSANFDMKDLGEAD------------------------------YNYFDSKPTCTPYDSSVKLFKNTSDSVNQSE-------YASIIGSLR
         +I  +K  LS +FDMKDLG A                               +N  ++KP  TP    +KL K    +  + +       Y+S +GSL 
Subjt:  HVINDVKSMLSANFDMKDLGEAD------------------------------YNYFDSKPTCTPYDSSVKLFKNTSDSVNQSE-------YASIIGSLR

Query:  YAADCTRPDIAYAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQST
        YA  CTRPDIA+AVG++ RF   P  EHW A++ ++RYL+ T    L +     +L+GY+DAD     D+ K+++GY+F   GGA++W+SK Q  +A ST
Subjt:  YAADCTRPDIAYAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQST

Query:  MESEMIALAGASEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREK
         E+E IA     +E  WL+  L E+   ++     +++CDS + I  ++N  Y+ + + I  ++  IRE++   ++ V  + ++ N AD LTK + R K
Subjt:  MESEMIALAGASEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAREK

P25600 Putative transposon Ty5-1 protein YCL074W2.1e-3534.11Show/hide
Query:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV
        MDV  AFLNS ++E IY++QP GF+       V +L   +YGLKQAP  W+E  +N L   GF  +E +  +Y+++     I I +YVDD+L+   +  +
Subjt:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV

Query:  INDVKSMLSANFDMKDLGEAD----YNYFDS-------------------------KPTCTPYDSSVKLFKNTSDSVNQ-SEYASIIGSLRYAADCTRPD
         + VK  L+  + MKDLG+ D     N   S                         K T TP  +S  LF+ TS  +   + Y SI+G L + A+  RPD
Subjt:  INDVKSMLSANFDMKDLGEAD----YNYFDS-------------------------KPTCTPYDSSVKLFKNTSDSVNQ-SEYASIIGSLRYAADCTRPD

Query:  IAYAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNK-FPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKK-QTILAQSTMESEMI
        I+Y V LL RF   P   H  +  RV+RYL  T+++ L Y       L  Y DA   ++ D   +T GY+  + G  V W SKK + ++   + E+E I
Subjt:  IAYAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNK-FPTVLEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKK-QTILAQSTMESEMI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.2e-5635.31Show/hide
Query:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV
        +DV  AFL   L +++YM QP GFI  D+ + VCKL K+LYGLKQAP+ W+ +  N L++ GF  + SD  ++    G+  + + +YVDD+LI G++  +
Subjt:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV

Query:  INDVKSMLSANFDMKDLGEADY----------------------------NYFDSKPTCTPYDSSVKL-FKNTSDSVNQSEYASIIGSLRYAADCTRPDI
        +++    LS  F +KD  E  Y                            N   +KP  TP   S KL   + +   + +EY  I+GSL+Y A  TRPDI
Subjt:  INDVKSMLSANFDMKDLGEADY----------------------------NYFDSKPTCTPYDSSVKL-FKNTSDSVNQSEYASIIGSLRYAADCTRPDI

Query:  AYAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTV-LEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALA
        +YAV  L +F   P+ EH  A++R++RYL  T N G+   K  T+ L  YSDADW    DD  +T+GYI  +    ++W SKKQ  + +S+ E+E  ++A
Subjt:  AYAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTV-LEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALA

Query:  GASEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAR
          S E  W+ SLL+E+       P  +I+CD+        N  ++ + + I   +  IR  + +GA+ V +V + + LAD LTK L+R
Subjt:  GASEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.7e-5634.28Show/hide
Query:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV
        +DV  AFL   L +E+YM QP GF+  D+   VC+L K++YGLKQAP+ W+ +    L++ GF  + SD  ++    GR  I + +YVDD+LI G++  +
Subjt:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHV

Query:  INDVKSMLSANFDMKDLGEADY----------------------------NYFDSKPTCTPYDSSVKLFKNTSDSV-NQSEYASIIGSLRYAADCTRPDI
        +      LS  F +K+  +  Y                            N   +KP  TP  +S KL  ++   + + +EY  I+GSL+Y A  TRPD+
Subjt:  INDVKSMLSANFDMKDLGEADY----------------------------NYFDSKPTCTPYDSSVKLFKNTSDSV-NQSEYASIIGSLRYAADCTRPDI

Query:  AYAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTV-LEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALA
        +YAV  L ++   P+ +HWNA++RV+RYL  T + G+   K  T+ L  YSDADW   +DD  +T+GYI  +    ++W SKKQ  + +S+ E+E  ++A
Subjt:  AYAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTV-LEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALA

Query:  GASEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAR
          S E  W+ SLL+E+       P  +I+CD+        N  ++ + + I   +  IR  + +GA+ V +V + + LAD LTK L+R
Subjt:  GASEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRELLTTGAVIVDYVWSDNNLADPLTKGLAR

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.4e-4832.6Show/hide
Query:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQES----KVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGS
        +D+  AFLN DL+EEIYM+ P G+     +S     VC L KS+YGLKQA +QW  KF   L+  GF  + SD   + K    L + + +YVDD++I  +
Subjt:  MDVKIAFLNSDLEEEIYMEQPEGFIVHDQES----KVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGS

Query:  NFHVINDVKSMLSANFDMKDLGEADY----------------------------NYFDSKPTCTPYDSSVKLFKNT-SDSVNQSEYASIIGSLRYAADCT
        N   ++++KS L + F ++DLG   Y                                 KP+  P D SV    ++  D V+   Y  +IG L Y    T
Subjt:  NFHVINDVKSMLSANFDMKDLGEADY----------------------------NYFDSKPTCTPYDSSVKLFKNT-SDSVNQSEYASIIGSLRYAADCT

Query:  RPDIAYAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTV-LEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEM
        R DI++AV  L +F+  P L H  A+ +++ Y+K T   GL Y+    + L+ +SDA + S  D  ++T+GY   +    ++WKSKKQ ++++S+ E+E 
Subjt:  RPDIAYAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTV-LEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEM

Query:  IALAGASEEASWLRSLLSEIPTWKRPI-PAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRE
         AL+ A++E  WL     E+   + P+    L+ CD+TA I  A N  ++ + + I     ++RE
Subjt:  IALAGASEEASWLRSLLSEIPTWKRPI-PAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.7e-0632.88Show/hide
Query:  TRPDIAYAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTV-LEGYSDADWNSLSDDSKATSGY
        TRPD+ +AV  L +F+S        A+ +V+ Y+K T   GL Y+    + L+ ++D+DW S  D  ++ +G+
Subjt:  TRPDIAYAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTV-LEGYSDADWNSLSDDSKATSGY

ATMG00810.1 DNA/RNA polymerases superfamily protein6.4e-2429.06Show/hide
Query:  ICLYVDDMLIFGSNFHVINDVKSMLSANFDMKDLGEADY----------------------------NYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYAS
        + LYVDD+L+ GS+  ++N +   LS+ F MKDLG   Y                               D KP  TP    +    +T+   + S++ S
Subjt:  ICLYVDDMLIFGSNFHVINDVKSMLSANFDMKDLGEADY----------------------------NYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYAS

Query:  IIGSLRYAADCTRPDIAYAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTV-LEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQ
        I+G+L+Y    TRPDI+YAV ++C+    P+L  ++ ++RV+RY+K T   GL+ +K   + ++ + D+DW   +   ++T+G+   +    ++W +K+Q
Subjt:  IIGSLRYAADCTRPDIAYAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTV-LEGYSDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQ

Query:  TILAQSTMESEMIALAGASEEASWLRSLLSEIPT
          +++S+ E+E  ALA  + E +W  +  S  P+
Subjt:  TILAQSTMESEMIALAGASEEASWLRSLLSEIPT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTTAAAATAGCTTTCCTAAATAGTGATTTAGAAGAAGAGATTTACATGGAACAACCTGAAGGTTTCATAGTTCACGACCAAGAATCCAAAGTTTGCAAACTAGA
TAAATCCCTCTATGGCCTAAAACAAGCTCCCAAGCAATGGCACGAAAAGTTTGACAACTTACTCATGTCAAAAGGATTCAAAGTAAATGAGAGTGACAAATGTATCTACT
ATAAGACTGAAGGTAGACTATGTATTATCATATGCCTATACGTAGATGACATGTTAATCTTTGGATCAAACTTTCACGTCATAAATGATGTAAAATCTATGTTGAGTGCA
AATTTTGACATGAAAGACCTAGGTGAAGCTGATTACAACTACTTCGATAGTAAACCGACTTGTACACCTTATGACTCTAGTGTGAAATTATTCAAGAACACTAGTGACAG
TGTTAACCAATCTGAGTATGCTAGTATCATAGGTAGTTTGAGGTATGCTGCTGATTGCACTAGACCAGACATAGCTTACGCCGTAGGATTACTATGTAGGTTTACCAGCA
GACCCAGTCTAGAACATTGGAATGCGATAGAGAGAGTAATGAGATACCTTAAGAAAACTCAAAACCTAGGATTACATTATAACAAGTTTCCCACTGTACTTGAAGGTTAC
AGTGATGCTGATTGGAACTCCCTCTCAGATGACTCAAAGGCTACAAGTGGCTATATTTTTAATATAGAAGGAGGAGCTGTGGCTTGGAAATCCAAGAAACAGACAATCTT
AGCCCAGTCAACGATGGAGTCAGAGATGATAGCACTAGCTGGTGCTAGTGAAGAAGCAAGCTGGCTTCGAAGCTTGTTATCAGAGATTCCCACATGGAAAAGACCGATAC
CAGCCATACTAATCCACTGTGATAGTACTGCAACTATTGCAAAAGCTCAAAACCGTTACTATAATGGAAAGAGACGACAGATACGTCGTAAGCACAGTACCATTAGAGAA
TTGCTCACTACTGGTGCAGTGATAGTGGATTATGTATGGTCTGACAATAACTTGGCTGATCCTTTGACGAAAGGACTTGCTCGAGAAAAGGTTTTTAAATCCTCAGAAAG
AATGAGACTCAAGCCTATCAAAACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGTTAAAATAGCTTTCCTAAATAGTGATTTAGAAGAAGAGATTTACATGGAACAACCTGAAGGTTTCATAGTTCACGACCAAGAATCCAAAGTTTGCAAACTAGA
TAAATCCCTCTATGGCCTAAAACAAGCTCCCAAGCAATGGCACGAAAAGTTTGACAACTTACTCATGTCAAAAGGATTCAAAGTAAATGAGAGTGACAAATGTATCTACT
ATAAGACTGAAGGTAGACTATGTATTATCATATGCCTATACGTAGATGACATGTTAATCTTTGGATCAAACTTTCACGTCATAAATGATGTAAAATCTATGTTGAGTGCA
AATTTTGACATGAAAGACCTAGGTGAAGCTGATTACAACTACTTCGATAGTAAACCGACTTGTACACCTTATGACTCTAGTGTGAAATTATTCAAGAACACTAGTGACAG
TGTTAACCAATCTGAGTATGCTAGTATCATAGGTAGTTTGAGGTATGCTGCTGATTGCACTAGACCAGACATAGCTTACGCCGTAGGATTACTATGTAGGTTTACCAGCA
GACCCAGTCTAGAACATTGGAATGCGATAGAGAGAGTAATGAGATACCTTAAGAAAACTCAAAACCTAGGATTACATTATAACAAGTTTCCCACTGTACTTGAAGGTTAC
AGTGATGCTGATTGGAACTCCCTCTCAGATGACTCAAAGGCTACAAGTGGCTATATTTTTAATATAGAAGGAGGAGCTGTGGCTTGGAAATCCAAGAAACAGACAATCTT
AGCCCAGTCAACGATGGAGTCAGAGATGATAGCACTAGCTGGTGCTAGTGAAGAAGCAAGCTGGCTTCGAAGCTTGTTATCAGAGATTCCCACATGGAAAAGACCGATAC
CAGCCATACTAATCCACTGTGATAGTACTGCAACTATTGCAAAAGCTCAAAACCGTTACTATAATGGAAAGAGACGACAGATACGTCGTAAGCACAGTACCATTAGAGAA
TTGCTCACTACTGGTGCAGTGATAGTGGATTATGTATGGTCTGACAATAACTTGGCTGATCCTTTGACGAAAGGACTTGCTCGAGAAAAGGTTTTTAAATCCTCAGAAAG
AATGAGACTCAAGCCTATCAAAACTTGA
Protein sequenceShow/hide protein sequence
MDVKIAFLNSDLEEEIYMEQPEGFIVHDQESKVCKLDKSLYGLKQAPKQWHEKFDNLLMSKGFKVNESDKCIYYKTEGRLCIIICLYVDDMLIFGSNFHVINDVKSMLSA
NFDMKDLGEADYNYFDSKPTCTPYDSSVKLFKNTSDSVNQSEYASIIGSLRYAADCTRPDIAYAVGLLCRFTSRPSLEHWNAIERVMRYLKKTQNLGLHYNKFPTVLEGY
SDADWNSLSDDSKATSGYIFNIEGGAVAWKSKKQTILAQSTMESEMIALAGASEEASWLRSLLSEIPTWKRPIPAILIHCDSTATIAKAQNRYYNGKRRQIRRKHSTIRE
LLTTGAVIVDYVWSDNNLADPLTKGLAREKVFKSSERMRLKPIKT