; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0006341 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0006341
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionGag-pol polyprotein
Genome locationchr01:10821120..10828689
RNA-Seq ExpressionPay0006341
SyntenyPay0006341
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033527.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]1.7e-13354.48Show/hide
Query:  MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLE-----------------
        MIFFIKMLDGKAWRALV  YDPPMI VNGVSI K EVDWTDAEEQASVGNARA N IFN VDLNVFKLINSC+ AKEAWKTLE                 
Subjt:  MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLE-----------------

Query:  ---EVYEG----TSKVKFSRLYE------YNERVLEIENESLLLGEKIPDSKIVQK-----------------EAHDITTLKLDELFGSLLTFEMATVDR
           E +E     TSK +  ++ E      YN+RVLEI NESLLLGEKIPDSKIVQK                 EAHDITTL+LDELFGSLLTFEMAT DR
Subjt:  ---EVYEG----TSKVKFSRLYE------YNERVLEIENESLLLGEKIPDSKIVQK-----------------EAHDITTLKLDELFGSLLTFEMATVDR

Query:  DSKKGKRIAFKSTHVDEEAVSDTEANMD-------ERNDDSLTRRNNEN-SDRRSDGYIKKKDGDRRIFKCRECGGVGHYQVECPTFMRKQKKNFCVTLS
        +SKKGK IAFKSTHVDEEAVSDTEANMD       E+  ++  +  N N ++RRSDGYIKKK+GDRRIF+CRECG +GHYQ ECPTF+RKQ KNFCVTLS
Subjt:  DSKKGKRIAFKSTHVDEEAVSDTEANMD-------ERNDDSLTRRNNEN-SDRRSDGYIKKKDGDRRIFKCRECGGVGHYQVECPTFMRKQKKNFCVTLS

Query:  DEESGDSRDDDDNINAFTIRITDENTDDERECSEERKNDELTIEKLEALWKEDCKTYTVEIYKNLCLKLQREKGKKIIRIRSDHGKEFDNESFNSFCLLE
        DEESGDSRDDDDNINAFTIRITDEN DDE ECSEE K+DELTIEKLEALWKEDC+   +          Q+E+ + +I          +NE         
Subjt:  DEESGDSRDDDDNINAFTIRITDENTDDERECSEERKNDELTIEKLEALWKEDCKTYTVEIYKNLCLKLQREKGKKIIRIRSDHGKEFDNESFNSFCLLE

Query:  GIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYH-IHNRVTIRTETIVILYELWKGRKPNVKYFNVTEMTVTLYELCKERKP
         I +   + +    NG        +    R+   ++   + F + ++   +   H   +IRT                     V  +  T Y  C  + P
Subjt:  GIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYH-IHNRVTIRTETIVILYELWKGRKPNVKYFNVTEMTVTLYELCKERKP

Query:  NVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK
                  T  +   +EYRQK DA+SEQ IFLGYSQNS  YRV+NNRS SV++TINVVINDLDS IK
Subjt:  NVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK

KAA0042995.1 gag-pol polyprotein [Cucumis melo var. makuwa]3.7e-14942.52Show/hide
Query:  MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLEEVYEGTSKVKFSRLY--
        MIFFIK LDGKAWRALV GYDP MI VNGVSI KPEVDWTD EEQASVGNARA N IFNGVDLNVFKLIN CSTAKEAWKTLE  YEGTSKVK SRL   
Subjt:  MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLEEVYEGTSKVKFSRLY--

Query:  ----------------EYNERVLEIENESLLLGEKIPDSKIVQKEAHDITTLKLDELFGSLLTFEMATVDRDSKKGKRIAFKSTHVDEEAVSDTEANMDE
                        +YN+ VLEI NESLLL   I       +EAHDITTLKLDELFGSLLTFEM T +R+SKKGK IAFKSTHV+EEA  DTEANMDE
Subjt:  ----------------EYNERVLEIENESLLLGEKIPDSKIVQKEAHDITTLKLDELFGSLLTFEMATVDRDSKKGKRIAFKSTHVDEEAVSDTEANMDE

Query:  RNDDSLTRRNNENSDRRSDGYIKKKDGDRRIFKCRECGGVGHYQVECPTFMRKQKKNFCVTLSDEESGDSRDDDDNINAFTIRITDENTDDERECSEERK
                                         CRECGGVGHYQ EC TF+RKQKKNF VTLSD+E  DSRDDD NINAFTIRIT++NTDD+ ECS E K
Subjt:  RNDDSLTRRNNENSDRRSDGYIKKKDGDRRIFKCRECGGVGHYQVECPTFMRKQKKNFCVTLSDEESGDSRDDDDNINAFTIRITDENTDDERECSEERK

Query:  NDELTIEKLEALWKEDC-----------------------------------------------------------------------------------
        NDEL+IEKLE LWKEDC                                                                                   
Subjt:  NDELTIEKLEALWKEDC-----------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------KTYTVEIYKNLCLKLQREKGKKIIRIRSDHGK
                                                                            KT TVEI KNLCLKLQRE+ KKI RIRSDHGK
Subjt:  --------------------------------------------------------------------KTYTVEIYKNLCLKLQREKGKKIIRIRSDHGK

Query:  EFDNESFNSFCLLEGIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYHIHNRVTIRTETIVILYELWKGRKPNVKYFNVTEM
        EFDNE FNSFCLLEG H EFSAPITPQQNGVVE+KN  LQEM RVMIHAKNLPLCF+AEAVNTA HIHNRVTIRT T + LYE W               
Subjt:  EFDNESFNSFCLLEGIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYHIHNRVTIRTETIVILYELWKGRKPNVKYFNVTEM

Query:  TVTLYELCKERKPNVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK
                KERK NVKYFHVFGSTCYILAD+EY +K DARSEQGIFL YSQ SRAYRVYNNRSDSVM+TIN  INDLDS IK
Subjt:  TVTLYELCKERKPNVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK

TYJ95661.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]9.7e-13454.48Show/hide
Query:  MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLE-----------------
        MIFFIKMLDGKAWRALV  YDPPMI VNGVSI K EVDWTDAEEQASVGNARA N IFN VDLNVFKLINSC+ AKEAWKTLE                 
Subjt:  MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLE-----------------

Query:  ---EVYEG----TSKVKFSRLYE------YNERVLEIENESLLLGEKIPDSKIVQK-----------------EAHDITTLKLDELFGSLLTFEMATVDR
           E +E     TSK +  ++ E      YN+RVLEI NESLLLGEKIPDSKIVQK                 EAHDITTL+LDELFGSLLTFEMAT DR
Subjt:  ---EVYEG----TSKVKFSRLYE------YNERVLEIENESLLLGEKIPDSKIVQK-----------------EAHDITTLKLDELFGSLLTFEMATVDR

Query:  DSKKGKRIAFKSTHVDEEAVSDTEANMD-------ERNDDSLTRRNNEN-SDRRSDGYIKKKDGDRRIFKCRECGGVGHYQVECPTFMRKQKKNFCVTLS
        +SKKGK IAFKSTHVDEEAVSDTEANMD       E+  ++  +  N N ++RRSDGYIKKK+GDRRIF+CRECG +GHYQ ECPTF+RKQ KNFCVTLS
Subjt:  DSKKGKRIAFKSTHVDEEAVSDTEANMD-------ERNDDSLTRRNNEN-SDRRSDGYIKKKDGDRRIFKCRECGGVGHYQVECPTFMRKQKKNFCVTLS

Query:  DEESGDSRDDDDNINAFTIRITDENTDDERECSEERKNDELTIEKLEALWKEDCKTYTVEIYKNLCLKLQREKGKKIIRIRSDHGKEFDNESFNSFCLLE
        DEESGDSRDDDDNINAFTIRITDEN DDE ECSEE K+DELTIEKLEALWKEDC+   +          Q+E+ + +I          +NE         
Subjt:  DEESGDSRDDDDNINAFTIRITDENTDDERECSEERKNDELTIEKLEALWKEDCKTYTVEIYKNLCLKLQREKGKKIIRIRSDHGKEFDNESFNSFCLLE

Query:  GIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYH-IHNRVTIRTETIVILYELWKGRKPNVKYFNVTEMTVTLYELCKERKP
         I +   + +    NG        +    R+   ++   + F ++++   +   H   +IRT                     V  +  T Y  C  + P
Subjt:  GIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYH-IHNRVTIRTETIVILYELWKGRKPNVKYFNVTEMTVTLYELCKERKP

Query:  NVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK
                  T  +   +EYRQK DA+SEQ IFLGYSQNS  YRV+NNRS SV++TINVVINDLDS IK
Subjt:  NVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK

TYJ98791.1 gag-pol polyprotein [Cucumis melo var. makuwa]6.5e-15456.56Show/hide
Query:  MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLEEVYEGTSKVKFSRLY--
        MIFFIK LDGKAWRALV GYDPPMITVNGVSI KPEVDWT+ EEQA+VGNARA N IFNG+DLNVFKLINSCSTAKEAWKTL+  YEGTSKVK SRL   
Subjt:  MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLEEVYEGTSKVKFSRLY--

Query:  ----------------EYNERVLEIENESLLLGEKIPDSKIVQK-----------------EAHDITTLKLDELFGSLLTFEMATVDRDSKKGKRIAFKS
                        +YN+RVLEI NESLLL EKIPDSKIV K                 EAHDITTL+LDELFGSLLTFEMATVDR+S          
Subjt:  ----------------EYNERVLEIENESLLLGEKIPDSKIVQK-----------------EAHDITTLKLDELFGSLLTFEMATVDRDSKKGKRIAFKS

Query:  THVDEEAVSDTEANMDERNDDSLTRRNNENSDRRSDGYIKKKDGDRRIFKCRECGGVGHYQVECPTFMRKQKKNFCVTLSDEESGDSRDDDDNINAFTIR
                + T      RNDDSLT+RNNENSDRRSDGYIKKK+GDRRIF+CRECGGV HYQ ECPTF+RKQK NFCVTLSD ESGD RDDD NINAFTI 
Subjt:  THVDEEAVSDTEANMDERNDDSLTRRNNENSDRRSDGYIKKKDGDRRIFKCRECGGVGHYQVECPTFMRKQKKNFCVTLSDEESGDSRDDDDNINAFTIR

Query:  ITDENTDDERECSEERKNDELTIEKLEALWKEDCKTYTVEIYKNLCLKLQREKGKKIIRIRSDHGKEFDNES---FNSFCLLEGIHDEFSAPITPQQNGV
        ITDENT+DE ECSEE K+DELTIEKLEALWKEDC+  T++  +   L  + E    +I       +EF NE+     S  +L    +   + +    NG 
Subjt:  ITDENTDDERECSEERKNDELTIEKLEALWKEDCKTYTVEIYKNLCLKLQREKGKKIIRIRSDHGKEFDNES---FNSFCLLEGIHDEFSAPITPQQNGV

Query:  VEKKN----MMLQEMTRVMIHAKNLPLCFWAEAVNTAYHIHNRVTIRTETIVILYELWKGRK-------PNVKYFNVTE---------------------
          K++    +      +     K +P     E   T      R T++T   +  Y   K ++       P +    V +                     
Subjt:  VEKKN----MMLQEMTRVMIHAKNLPLCFWAEAVNTAYHIHNRVTIRTETIVILYELWKGRK-------PNVKYFNVTE---------------------

Query:  ----MTVTLYELCKERKPNVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK
            MTVTL+EL K+RKPNVKYFHVFGSTCYILAD+EYRQK DA+SEQGIFLGYSQNSRAYRV+NNR  SVM+TINVVIND+DS IK
Subjt:  ----MTVTLYELCKERKPNVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK

TYK02457.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]3.1e-13253.63Show/hide
Query:  MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLEEVYEGTSKVKFSRLY--
        MIFFIK LDGKAWRALV GYDPPMITVNGVS+ KPEVDWTDAEEQASVGNARA NTIFNGVDLNVFKLINSCSTAKEAWKTLE   EGTSKVK SRL   
Subjt:  MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLEEVYEGTSKVKFSRLY--

Query:  ----------------EYNERVLEIENESLLLGEKIPDSKIVQK-----------------EAHDITTLKLDELFGSLLTFEMATVDRDSKKGKRIAFKS
                        +YN+RVLEI NESLLLGEKIPDSK+V+K                 EAHDITTLKLDELFGSLLTFEMAT +R+SKKGK IAFKS
Subjt:  ----------------EYNERVLEIENESLLLGEKIPDSKIVQK-----------------EAHDITTLKLDELFGSLLTFEMATVDRDSKKGKRIAFKS

Query:  THVDEEAVSDTEANMDERNDDSLTRRNNENSDRRSDGYI---KKKDGDRRIFKCRECGGVGHYQVECPTFMRKQKKNFCVTLSDEESGDSRDDDDNINAF
        THV  EAVS+TE+NMD+  ++ +            +  +   +K+      FKCRECGGVGHYQ ECPT++RKQKKNF VTLSDEESGDSRDDD NINAF
Subjt:  THVDEEAVSDTEANMDERNDDSLTRRNNENSDRRSDGYI---KKKDGDRRIFKCRECGGVGHYQVECPTFMRKQKKNFCVTLSDEESGDSRDDDDNINAF

Query:  TIRITDENTDDERECSEERKNDELTIEKLEALWKEDCKTYTVEIYKNLCLKLQREKGKKIIRIRSDHGKEFDNES---FNSFCLLEGIHDEFSAPITPQQ
        TIRITD N++D+ ECSEE KND+LTIEK EALWKEDC+   ++  +   L  + E    +I       +EF NE+     S  +L    +     +    
Subjt:  TIRITDENTDDERECSEERKNDELTIEKLEALWKEDCKTYTVEIYKNLCLKLQREKGKKIIRIRSDHGKEFDNES---FNSFCLLEGIHDEFSAPITPQQ

Query:  NGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYH-IHNRVTIRTETIVILYE--LWKGRKPNVKYFNVTEMTVTLYELCKERKPNVKYFHVFGSTC
        NG        L  +  V        + F   ++   +  IH    IRT T+  L     + GRK +++           Y+L +E+    KY++   +  
Subjt:  NGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYH-IHNRVTIRTETIVILYE--LWKGRKPNVKYFNVTEMTVTLYELCKERKPNVKYFHVFGSTC

Query:  YIL--------ADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK
         ++        +D+EYRQK DA+SEQGIFLGYSQN R YRV+NN S SVM+TINVVIND DS IK
Subjt:  YIL--------ADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK

TrEMBL top hitse value%identityAlignment
A0A5A7SWB7 Gag-proteinase polyprotein8.1e-13454.48Show/hide
Query:  MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLE-----------------
        MIFFIKMLDGKAWRALV  YDPPMI VNGVSI K EVDWTDAEEQASVGNARA N IFN VDLNVFKLINSC+ AKEAWKTLE                 
Subjt:  MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLE-----------------

Query:  ---EVYEG----TSKVKFSRLYE------YNERVLEIENESLLLGEKIPDSKIVQK-----------------EAHDITTLKLDELFGSLLTFEMATVDR
           E +E     TSK +  ++ E      YN+RVLEI NESLLLGEKIPDSKIVQK                 EAHDITTL+LDELFGSLLTFEMAT DR
Subjt:  ---EVYEG----TSKVKFSRLYE------YNERVLEIENESLLLGEKIPDSKIVQK-----------------EAHDITTLKLDELFGSLLTFEMATVDR

Query:  DSKKGKRIAFKSTHVDEEAVSDTEANMD-------ERNDDSLTRRNNEN-SDRRSDGYIKKKDGDRRIFKCRECGGVGHYQVECPTFMRKQKKNFCVTLS
        +SKKGK IAFKSTHVDEEAVSDTEANMD       E+  ++  +  N N ++RRSDGYIKKK+GDRRIF+CRECG +GHYQ ECPTF+RKQ KNFCVTLS
Subjt:  DSKKGKRIAFKSTHVDEEAVSDTEANMD-------ERNDDSLTRRNNEN-SDRRSDGYIKKKDGDRRIFKCRECGGVGHYQVECPTFMRKQKKNFCVTLS

Query:  DEESGDSRDDDDNINAFTIRITDENTDDERECSEERKNDELTIEKLEALWKEDCKTYTVEIYKNLCLKLQREKGKKIIRIRSDHGKEFDNESFNSFCLLE
        DEESGDSRDDDDNINAFTIRITDEN DDE ECSEE K+DELTIEKLEALWKEDC+   +          Q+E+ + +I          +NE         
Subjt:  DEESGDSRDDDDNINAFTIRITDENTDDERECSEERKNDELTIEKLEALWKEDCKTYTVEIYKNLCLKLQREKGKKIIRIRSDHGKEFDNESFNSFCLLE

Query:  GIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYH-IHNRVTIRTETIVILYELWKGRKPNVKYFNVTEMTVTLYELCKERKP
         I +   + +    NG        +    R+   ++   + F + ++   +   H   +IRT                     V  +  T Y  C  + P
Subjt:  GIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYH-IHNRVTIRTETIVILYELWKGRKPNVKYFNVTEMTVTLYELCKERKP

Query:  NVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK
                  T  +   +EYRQK DA+SEQ IFLGYSQNS  YRV+NNRS SV++TINVVINDLDS IK
Subjt:  NVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK

A0A5A7TNK7 Gag-pol polyprotein1.8e-14942.52Show/hide
Query:  MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLEEVYEGTSKVKFSRLY--
        MIFFIK LDGKAWRALV GYDP MI VNGVSI KPEVDWTD EEQASVGNARA N IFNGVDLNVFKLIN CSTAKEAWKTLE  YEGTSKVK SRL   
Subjt:  MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLEEVYEGTSKVKFSRLY--

Query:  ----------------EYNERVLEIENESLLLGEKIPDSKIVQKEAHDITTLKLDELFGSLLTFEMATVDRDSKKGKRIAFKSTHVDEEAVSDTEANMDE
                        +YN+ VLEI NESLLL   I       +EAHDITTLKLDELFGSLLTFEM T +R+SKKGK IAFKSTHV+EEA  DTEANMDE
Subjt:  ----------------EYNERVLEIENESLLLGEKIPDSKIVQKEAHDITTLKLDELFGSLLTFEMATVDRDSKKGKRIAFKSTHVDEEAVSDTEANMDE

Query:  RNDDSLTRRNNENSDRRSDGYIKKKDGDRRIFKCRECGGVGHYQVECPTFMRKQKKNFCVTLSDEESGDSRDDDDNINAFTIRITDENTDDERECSEERK
                                         CRECGGVGHYQ EC TF+RKQKKNF VTLSD+E  DSRDDD NINAFTIRIT++NTDD+ ECS E K
Subjt:  RNDDSLTRRNNENSDRRSDGYIKKKDGDRRIFKCRECGGVGHYQVECPTFMRKQKKNFCVTLSDEESGDSRDDDDNINAFTIRITDENTDDERECSEERK

Query:  NDELTIEKLEALWKEDC-----------------------------------------------------------------------------------
        NDEL+IEKLE LWKEDC                                                                                   
Subjt:  NDELTIEKLEALWKEDC-----------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------KTYTVEIYKNLCLKLQREKGKKIIRIRSDHGK
                                                                            KT TVEI KNLCLKLQRE+ KKI RIRSDHGK
Subjt:  --------------------------------------------------------------------KTYTVEIYKNLCLKLQREKGKKIIRIRSDHGK

Query:  EFDNESFNSFCLLEGIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYHIHNRVTIRTETIVILYELWKGRKPNVKYFNVTEM
        EFDNE FNSFCLLEG H EFSAPITPQQNGVVE+KN  LQEM RVMIHAKNLPLCF+AEAVNTA HIHNRVTIRT T + LYE W               
Subjt:  EFDNESFNSFCLLEGIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYHIHNRVTIRTETIVILYELWKGRKPNVKYFNVTEM

Query:  TVTLYELCKERKPNVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK
                KERK NVKYFHVFGSTCYILAD+EY +K DARSEQGIFL YSQ SRAYRVYNNRSDSVM+TIN  INDLDS IK
Subjt:  TVTLYELCKERKPNVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK

A0A5D3BAM0 Gag-proteinase polyprotein4.7e-13454.48Show/hide
Query:  MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLE-----------------
        MIFFIKMLDGKAWRALV  YDPPMI VNGVSI K EVDWTDAEEQASVGNARA N IFN VDLNVFKLINSC+ AKEAWKTLE                 
Subjt:  MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLE-----------------

Query:  ---EVYEG----TSKVKFSRLYE------YNERVLEIENESLLLGEKIPDSKIVQK-----------------EAHDITTLKLDELFGSLLTFEMATVDR
           E +E     TSK +  ++ E      YN+RVLEI NESLLLGEKIPDSKIVQK                 EAHDITTL+LDELFGSLLTFEMAT DR
Subjt:  ---EVYEG----TSKVKFSRLYE------YNERVLEIENESLLLGEKIPDSKIVQK-----------------EAHDITTLKLDELFGSLLTFEMATVDR

Query:  DSKKGKRIAFKSTHVDEEAVSDTEANMD-------ERNDDSLTRRNNEN-SDRRSDGYIKKKDGDRRIFKCRECGGVGHYQVECPTFMRKQKKNFCVTLS
        +SKKGK IAFKSTHVDEEAVSDTEANMD       E+  ++  +  N N ++RRSDGYIKKK+GDRRIF+CRECG +GHYQ ECPTF+RKQ KNFCVTLS
Subjt:  DSKKGKRIAFKSTHVDEEAVSDTEANMD-------ERNDDSLTRRNNEN-SDRRSDGYIKKKDGDRRIFKCRECGGVGHYQVECPTFMRKQKKNFCVTLS

Query:  DEESGDSRDDDDNINAFTIRITDENTDDERECSEERKNDELTIEKLEALWKEDCKTYTVEIYKNLCLKLQREKGKKIIRIRSDHGKEFDNESFNSFCLLE
        DEESGDSRDDDDNINAFTIRITDEN DDE ECSEE K+DELTIEKLEALWKEDC+   +          Q+E+ + +I          +NE         
Subjt:  DEESGDSRDDDDNINAFTIRITDENTDDERECSEERKNDELTIEKLEALWKEDCKTYTVEIYKNLCLKLQREKGKKIIRIRSDHGKEFDNESFNSFCLLE

Query:  GIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYH-IHNRVTIRTETIVILYELWKGRKPNVKYFNVTEMTVTLYELCKERKP
         I +   + +    NG        +    R+   ++   + F ++++   +   H   +IRT                     V  +  T Y  C  + P
Subjt:  GIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYH-IHNRVTIRTETIVILYELWKGRKPNVKYFNVTEMTVTLYELCKERKP

Query:  NVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK
                  T  +   +EYRQK DA+SEQ IFLGYSQNS  YRV+NNRS SV++TINVVINDLDS IK
Subjt:  NVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK

A0A5D3BJA9 Gag-pol polyprotein3.1e-15456.56Show/hide
Query:  MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLEEVYEGTSKVKFSRLY--
        MIFFIK LDGKAWRALV GYDPPMITVNGVSI KPEVDWT+ EEQA+VGNARA N IFNG+DLNVFKLINSCSTAKEAWKTL+  YEGTSKVK SRL   
Subjt:  MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLEEVYEGTSKVKFSRLY--

Query:  ----------------EYNERVLEIENESLLLGEKIPDSKIVQK-----------------EAHDITTLKLDELFGSLLTFEMATVDRDSKKGKRIAFKS
                        +YN+RVLEI NESLLL EKIPDSKIV K                 EAHDITTL+LDELFGSLLTFEMATVDR+S          
Subjt:  ----------------EYNERVLEIENESLLLGEKIPDSKIVQK-----------------EAHDITTLKLDELFGSLLTFEMATVDRDSKKGKRIAFKS

Query:  THVDEEAVSDTEANMDERNDDSLTRRNNENSDRRSDGYIKKKDGDRRIFKCRECGGVGHYQVECPTFMRKQKKNFCVTLSDEESGDSRDDDDNINAFTIR
                + T      RNDDSLT+RNNENSDRRSDGYIKKK+GDRRIF+CRECGGV HYQ ECPTF+RKQK NFCVTLSD ESGD RDDD NINAFTI 
Subjt:  THVDEEAVSDTEANMDERNDDSLTRRNNENSDRRSDGYIKKKDGDRRIFKCRECGGVGHYQVECPTFMRKQKKNFCVTLSDEESGDSRDDDDNINAFTIR

Query:  ITDENTDDERECSEERKNDELTIEKLEALWKEDCKTYTVEIYKNLCLKLQREKGKKIIRIRSDHGKEFDNES---FNSFCLLEGIHDEFSAPITPQQNGV
        ITDENT+DE ECSEE K+DELTIEKLEALWKEDC+  T++  +   L  + E    +I       +EF NE+     S  +L    +   + +    NG 
Subjt:  ITDENTDDERECSEERKNDELTIEKLEALWKEDCKTYTVEIYKNLCLKLQREKGKKIIRIRSDHGKEFDNES---FNSFCLLEGIHDEFSAPITPQQNGV

Query:  VEKKN----MMLQEMTRVMIHAKNLPLCFWAEAVNTAYHIHNRVTIRTETIVILYELWKGRK-------PNVKYFNVTE---------------------
          K++    +      +     K +P     E   T      R T++T   +  Y   K ++       P +    V +                     
Subjt:  VEKKN----MMLQEMTRVMIHAKNLPLCFWAEAVNTAYHIHNRVTIRTETIVILYELWKGRK-------PNVKYFNVTE---------------------

Query:  ----MTVTLYELCKERKPNVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK
            MTVTL+EL K+RKPNVKYFHVFGSTCYILAD+EYRQK DA+SEQGIFLGYSQNSRAYRV+NNR  SVM+TINVVIND+DS IK
Subjt:  ----MTVTLYELCKERKPNVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK

A0A5D3BSP9 Gag-proteinase polyprotein1.5e-13253.63Show/hide
Query:  MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLEEVYEGTSKVKFSRLY--
        MIFFIK LDGKAWRALV GYDPPMITVNGVS+ KPEVDWTDAEEQASVGNARA NTIFNGVDLNVFKLINSCSTAKEAWKTLE   EGTSKVK SRL   
Subjt:  MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLEEVYEGTSKVKFSRLY--

Query:  ----------------EYNERVLEIENESLLLGEKIPDSKIVQK-----------------EAHDITTLKLDELFGSLLTFEMATVDRDSKKGKRIAFKS
                        +YN+RVLEI NESLLLGEKIPDSK+V+K                 EAHDITTLKLDELFGSLLTFEMAT +R+SKKGK IAFKS
Subjt:  ----------------EYNERVLEIENESLLLGEKIPDSKIVQK-----------------EAHDITTLKLDELFGSLLTFEMATVDRDSKKGKRIAFKS

Query:  THVDEEAVSDTEANMDERNDDSLTRRNNENSDRRSDGYI---KKKDGDRRIFKCRECGGVGHYQVECPTFMRKQKKNFCVTLSDEESGDSRDDDDNINAF
        THV  EAVS+TE+NMD+  ++ +            +  +   +K+      FKCRECGGVGHYQ ECPT++RKQKKNF VTLSDEESGDSRDDD NINAF
Subjt:  THVDEEAVSDTEANMDERNDDSLTRRNNENSDRRSDGYI---KKKDGDRRIFKCRECGGVGHYQVECPTFMRKQKKNFCVTLSDEESGDSRDDDDNINAF

Query:  TIRITDENTDDERECSEERKNDELTIEKLEALWKEDCKTYTVEIYKNLCLKLQREKGKKIIRIRSDHGKEFDNES---FNSFCLLEGIHDEFSAPITPQQ
        TIRITD N++D+ ECSEE KND+LTIEK EALWKEDC+   ++  +   L  + E    +I       +EF NE+     S  +L    +     +    
Subjt:  TIRITDENTDDERECSEERKNDELTIEKLEALWKEDCKTYTVEIYKNLCLKLQREKGKKIIRIRSDHGKEFDNES---FNSFCLLEGIHDEFSAPITPQQ

Query:  NGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYH-IHNRVTIRTETIVILYE--LWKGRKPNVKYFNVTEMTVTLYELCKERKPNVKYFHVFGSTC
        NG        L  +  V        + F   ++   +  IH    IRT T+  L     + GRK +++           Y+L +E+    KY++   +  
Subjt:  NGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYH-IHNRVTIRTETIVILYE--LWKGRKPNVKYFNVTEMTVTLYELCKERKPNVKYFHVFGSTC

Query:  YIL--------ADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK
         ++        +D+EYRQK DA+SEQGIFLGYSQN R YRV+NN S SVM+TINVVIND DS IK
Subjt:  YIL--------ADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.6e-1625.46Show/hide
Query:  CKTYTVE-------IYKNLCLKLQREKGKKIIRIRSDHGKEFDNESFNSFCLLEGIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEA
        C TY ++       ++++   K +     K++ +  D+G+E+ +     FC+ +GI    + P TPQ NGV E+    + E  R M+    L   FW EA
Subjt:  CKTYTVE-------IYKNLCLKLQREKGKKIIRIRSDHGKEFDNESFNSFCLLEGIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEA

Query:  VNTAYHIHNRVTIRTETIVILYELWKGRKPNVKYFNVTEMTVTLYELCKERKPNVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYN
        V TA ++ NR+  R                      + + + T YE+   +KP +K+  VFG+T Y+    + + K D +S + IF+GY  N   +++++
Subjt:  VNTAYHIHNRVTIRTETIVILYELWKGRKPNVKYFNVTEMTVTLYELCKERKPNVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYN

Query:  NRSDSVMKTINVVIND
          ++  +   +VV+++
Subjt:  NRSDSVMKTINVVIND

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.2e-1929.19Show/hide
Query:  KTYTVEIYKNLCLKLQREKGKKIIRIRSDHGKEFDNESFNSFCLLEGIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYHIH
        K    ++++     ++RE G+K+ R+RSD+G E+ +  F  +C   GI  E + P TPQ NGV E+ N  + E  R M+    LP  FW EAV TA ++ 
Subjt:  KTYTVEIYKNLCLKLQREKGKKIIRIRSDHGKEFDNESFNSFCLLEGIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYHIH

Query:  NRVTIRTETIVILYELWKGRKPNVKY-FNVTEMTVTLYELCKERKPNVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVM
        N                  R P+V   F + E   T       ++ +  +  VFG   +    +E R K D +S   IF+GY      YR+++     V+
Subjt:  NRVTIRTETIVILYELWKGRKPNVKY-FNVTEMTVTLYELCKERKPNVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVM

Query:  KTINVVIND
        ++ +VV  +
Subjt:  KTINVVIND

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.6e-0627.07Show/hide
Query:  YKNLCLKLQREKGKKIIRIRSDHGKEFDNESFNSFCLLEGIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYHIHNRVTIRT
        +KNL   L+     +I    SD+G EF   +   +    GI    S P TP+ NG+ E+K+  + E    ++   ++P  +W  A   A ++ NR+    
Subjt:  YKNLCLKLQREKGKKIIRIRSDHGKEFDNESFNSFCLLEGIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYHIHNRVTIRT

Query:  ETIVILYELWKGRKPNVKYFNVTEMTVTLYELCKERKPNVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAY
                L +   P  K F  +              PN     VFG  CY       + K D +S Q +FLGYS    AY
Subjt:  ETIVILYELWKGRKPNVKYFNVTEMTVTLYELCKERKPNVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.3e-0726.49Show/hide
Query:  TVEIYKNLCLKLQREKGKKIIRIRSDHGKEFDNESFNSFCLLEGIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYHIHNRV
        T  I+K+L   ++     +I  + SD+G EF       +    GI    S P TP+ NG+ E+K+  + EM   ++   ++P  +W  A + A ++ NR+
Subjt:  TVEIYKNLCLKLQREKGKKIIRIRSDHGKEFDNESFNSFCLLEGIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYHIHNRV

Query:  TIRTETIVILYELWKGRKPNVKYFNVTEMTVTLYELCKERKPNVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAY
                    L + + P  K F               + PN +   VFG  CY       R K + +S+Q  F+GYS    AY
Subjt:  TIRTETIVILYELWKGRKPNVKYFNVTEMTVTLYELCKERKPNVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAY

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATATTCTTCATTAAGATGTTAGATGGAAAAGCTTGGAGAGCTCTTGTGGAGGGTTATGATCCTCCTATGATTACAGTGAATGGTGTATCAATTTTGAAACCTGAGGT
TGATTGGACCGATGCTGAAGAGCAAGCATCCGTTGGGAATGCCAGAGCACCTAACACGATATTCAATGGTGTTGACCTGAACGTTTTCAAGTTAATAAATTCTTGCAGTA
CAGCCAAAGAAGCTTGGAAAACCTTGGAGGAAGTGTATGAAGGTACTTCCAAAGTAAAGTTTTCAAGATTATATGAATATAATGAGAGAGTTCTTGAAATCGAAAATGAA
TCCTTGTTGCTCGGTGAAAAGATACCTGACTCTAAAATAGTGCAGAAAGAAGCCCATGATATTACTACACTGAAACTTGATGAATTGTTTGGTTCGTTGCTTACGTTTGA
AATGGCCACTGTTGATAGAGATAGTAAGAAAGGCAAGAGAATTGCTTTTAAATCCACACATGTAGATGAGGAGGCTGTAAGTGATACTGAAGCAAACATGGATGAAAGAA
ATGATGACAGTCTTACCAGGAGGAACAATGAAAATTCTGATAGAAGAAGTGATGGTTATATCAAGAAAAAGGATGGTGATAGAAGGATTTTCAAGTGTAGAGAATGTGGA
GGAGTTGGTCATTATCAGGTAGAATGTCCCACATTCATGAGAAAACAGAAGAAAAACTTTTGTGTCACACTGTCAGATGAAGAATCTGGTGATAGTAGAGATGACGATGA
CAACATAAATGCCTTCACAATACGAATTACTGATGAGAACACTGATGATGAAAGGGAATGTTCTGAAGAAAGAAAAAATGATGAACTGACAATTGAGAAACTTGAAGCTT
TATGGAAAGAAGATTGCAAAACATATACTGTTGAAATATATAAAAATCTGTGTTTGAAGCTACAACGTGAAAAAGGGAAGAAAATAATCAGGATCCGAAGTGATCATGGT
AAAGAATTTGATAATGAAAGCTTTAACAGTTTTTGTCTATTAGAAGGAATACACGATGAATTCTCTGCACCTATAACTCCTCAACAAAATGGTGTAGTAGAAAAAAAGAA
CATGATGTTACAAGAAATGACACGTGTTATGATACATGCCAAAAATTTACCTCTATGTTTTTGGGCAGAAGCTGTAAATACTGCCTATCACATTCATAACAGGGTAACTA
TTAGAACCGAAACGATTGTTATACTTTATGAACTTTGGAAAGGTAGAAAGCCAAATGTTAAATACTTCAATGTAACTGAAATGACTGTTACTCTTTATGAACTTTGCAAA
GAGAGAAAACCAAATGTCAAATACTTCCATGTGTTTGGAAGTACATGTTATATCTTAGCTGATCAAGAATACCGTCAGAAAAGGGATGCAAGGTCGGAACAAGGAATCTT
TCTCGGGTACTCTCAAAACAGTCGGGCCTATAGAGTCTACAATAACAGATCTGACAGTGTTATGAAAACAATCAATGTAGTTATAAATGATCTCGATTCTAATATCAAAT
AA
mRNA sequenceShow/hide mRNA sequence
ATGATATTCTTCATTAAGATGTTAGATGGAAAAGCTTGGAGAGCTCTTGTGGAGGGTTATGATCCTCCTATGATTACAGTGAATGGTGTATCAATTTTGAAACCTGAGGT
TGATTGGACCGATGCTGAAGAGCAAGCATCCGTTGGGAATGCCAGAGCACCTAACACGATATTCAATGGTGTTGACCTGAACGTTTTCAAGTTAATAAATTCTTGCAGTA
CAGCCAAAGAAGCTTGGAAAACCTTGGAGGAAGTGTATGAAGGTACTTCCAAAGTAAAGTTTTCAAGATTATATGAATATAATGAGAGAGTTCTTGAAATCGAAAATGAA
TCCTTGTTGCTCGGTGAAAAGATACCTGACTCTAAAATAGTGCAGAAAGAAGCCCATGATATTACTACACTGAAACTTGATGAATTGTTTGGTTCGTTGCTTACGTTTGA
AATGGCCACTGTTGATAGAGATAGTAAGAAAGGCAAGAGAATTGCTTTTAAATCCACACATGTAGATGAGGAGGCTGTAAGTGATACTGAAGCAAACATGGATGAAAGAA
ATGATGACAGTCTTACCAGGAGGAACAATGAAAATTCTGATAGAAGAAGTGATGGTTATATCAAGAAAAAGGATGGTGATAGAAGGATTTTCAAGTGTAGAGAATGTGGA
GGAGTTGGTCATTATCAGGTAGAATGTCCCACATTCATGAGAAAACAGAAGAAAAACTTTTGTGTCACACTGTCAGATGAAGAATCTGGTGATAGTAGAGATGACGATGA
CAACATAAATGCCTTCACAATACGAATTACTGATGAGAACACTGATGATGAAAGGGAATGTTCTGAAGAAAGAAAAAATGATGAACTGACAATTGAGAAACTTGAAGCTT
TATGGAAAGAAGATTGCAAAACATATACTGTTGAAATATATAAAAATCTGTGTTTGAAGCTACAACGTGAAAAAGGGAAGAAAATAATCAGGATCCGAAGTGATCATGGT
AAAGAATTTGATAATGAAAGCTTTAACAGTTTTTGTCTATTAGAAGGAATACACGATGAATTCTCTGCACCTATAACTCCTCAACAAAATGGTGTAGTAGAAAAAAAGAA
CATGATGTTACAAGAAATGACACGTGTTATGATACATGCCAAAAATTTACCTCTATGTTTTTGGGCAGAAGCTGTAAATACTGCCTATCACATTCATAACAGGGTAACTA
TTAGAACCGAAACGATTGTTATACTTTATGAACTTTGGAAAGGTAGAAAGCCAAATGTTAAATACTTCAATGTAACTGAAATGACTGTTACTCTTTATGAACTTTGCAAA
GAGAGAAAACCAAATGTCAAATACTTCCATGTGTTTGGAAGTACATGTTATATCTTAGCTGATCAAGAATACCGTCAGAAAAGGGATGCAAGGTCGGAACAAGGAATCTT
TCTCGGGTACTCTCAAAACAGTCGGGCCTATAGAGTCTACAATAACAGATCTGACAGTGTTATGAAAACAATCAATGTAGTTATAAATGATCTCGATTCTAATATCAAAT
AA
Protein sequenceShow/hide protein sequence
MIFFIKMLDGKAWRALVEGYDPPMITVNGVSILKPEVDWTDAEEQASVGNARAPNTIFNGVDLNVFKLINSCSTAKEAWKTLEEVYEGTSKVKFSRLYEYNERVLEIENE
SLLLGEKIPDSKIVQKEAHDITTLKLDELFGSLLTFEMATVDRDSKKGKRIAFKSTHVDEEAVSDTEANMDERNDDSLTRRNNENSDRRSDGYIKKKDGDRRIFKCRECG
GVGHYQVECPTFMRKQKKNFCVTLSDEESGDSRDDDDNINAFTIRITDENTDDERECSEERKNDELTIEKLEALWKEDCKTYTVEIYKNLCLKLQREKGKKIIRIRSDHG
KEFDNESFNSFCLLEGIHDEFSAPITPQQNGVVEKKNMMLQEMTRVMIHAKNLPLCFWAEAVNTAYHIHNRVTIRTETIVILYELWKGRKPNVKYFNVTEMTVTLYELCK
ERKPNVKYFHVFGSTCYILADQEYRQKRDARSEQGIFLGYSQNSRAYRVYNNRSDSVMKTINVVINDLDSNIK