CuGenDBv2

IVF0015774 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0015774
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: ULP_PROTEASE domain-containing protein
Genome location: chr08:15645091..15650961
RNA-Seq Expression: IVF0015774
Synteny: IVF0015774
Gene Ontology terms: NA
InterPro domains: IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily
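
The genome location above uses a `chrom:start..end` notation. As a minimal sketch, the string can be split into its parts and the span length computed as follows. The helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location: str):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr08:15645091..15650961")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 5871 bp on chr08.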


Homology
GenBank top hits (e value, % identity, alignment)
KAA0031708.1 hypothetical protein E6C27_scaffold139G004940 [Cucumis melo var. makuwa] | e value: 7.24e-256 | % identity: 64.79
Query:  MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLI
        MTLGKQARLE GCSSVPS EGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGT V++HVPISYQSWKDVPTELKDKIY+LI
Subjt:  MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLI

Query:  E----------------------------------------------MVSNKGREWRKNNKYNHRMLRKGYANFVKEMKESTSNGELIDRVLVWKKARTT
        E                                              MVSNKGRE RKNNKYNHRM +KGYAN  +EMK STSNGELIDR LVWKKARTT
Subjt:  E----------------------------------------------MVSNKGREWRKNNKYNHRMLRKGYANFVKEMKESTSNGELIDRVLVWKKARTT

Query:  KDGEIPDIDTKEVANKIDSR---------------------------------VGQYVTPSKYFHIAREKRKKVGKEEDYAEERARMAARILELEAELMK
        KDGEIPDIDTKEVANKID+                                  VGQYVTP KYFH AREKRKKVGKEEDYAEERARMAARILELEAELMK
Subjt:  KDGEIPDIDTKEVANKIDSR---------------------------------VGQYVTPSKYFHIAREKRKKVGKEEDYAEERARMAARILELEAELMK

Query:  HKRVPEVAIKGETDESKIKSQMASKSIDTSNDANDHDAKEENRQVLEDLTIEDLTIEKQDKVGDENKYVCASIETLTKVKDGTSCRLAIGTKDTVVSAGT
        HK+VPEVA KGETDESKIKSQMASKSIDTS+DANDHDAKEENRQVLEDLTIEDLTIEKQDKVG+ENKYVCASIETLTKVKDGTSCRLAIGTKD VV AGT
Subjt:  HKRVPEVAIKGETDESKIKSQMASKSIDTSNDANDHDAKEENRQVLEDLTIEDLTIEKQDKVGDENKYVCASIETLTKVKDGTSCRLAIGTKDTVVSAGT

Query:  IFDYDMDGDNVKVSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLWPRHLVIPLDEK--SPLQSDGR---------------------------------
        IFDY MDGDNVKVSVDMVTDGNCFV VPTREGRTMLSQEVGSQLLWPRHLVIPLDEK  S  Q+D R                                 
Subjt:  IFDYDMDGDNVKVSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLWPRHLVIPLDEK--SPLQSDGR---------------------------------

Query:  ---------------------------------------------EWNVGILQVCRRGSVSVGISKEDRAQILNARLLGTYHRQILMFPYNSG-------
                                                        +G  +    GS+SVGISKEDRAQILNARLLGT HRQILMFPYNSG       
Subjt:  ---------------------------------------------EWNVGILQVCRRGSVSVGISKEDRAQILNARLLGTYHRQILMFPYNSG-------

Query:  ----------------------------AFDISNKKKPVWRIIKCPKQGGIVECEYYVMRFMRDIILSSNRTIIEV
                                    AFDISNKKKPVWRIIKCPKQGGIVEC YYVMRFMRDIILSSNRTIIEV
Subjt:  ----------------------------AFDISNKKKPVWRIIKCPKQGGIVECEYYVMRFMRDIILSSNRTIIEV

KAA0041518.1 uncharacterized protein E6C27_scaffold6G001110 [Cucumis melo var. makuwa] | e value: 5.06e-242 | % identity: 67.39
Query:  MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLI
        MTLGKQARLE GCSSVPS EGVINKGKGTRGPT MSEITRVSCD HKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSW+DVP ELKDKIY+LI
Subjt:  MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLI

Query:  E-----------------------------------------------------------------------MVSNKGREWRKNNKYNHRMLRKGYANFV
        E                                                                       MVSNKGRE RKNNKYNHRM RKGYAN  
Subjt:  E-----------------------------------------------------------------------MVSNKGREWRKNNKYNHRMLRKGYANFV

Query:  KEMKESTSNGELIDRVLVWKKARTTKDGEIPDIDTKEVANKIDSR---------------------------------VGQYVTPSKYFHIAREKRKKVG
        +EMK +TSNGELI+R LVWKKARTTKDGEIPDIDTKEVANKID+                                  VGQYVTPSKYFH AREKRKKVG
Subjt:  KEMKESTSNGELIDRVLVWKKARTTKDGEIPDIDTKEVANKIDSR---------------------------------VGQYVTPSKYFHIAREKRKKVG

Query:  KEEDYAEERARMAARILELEAELMKHKRVPEVAIKGETDESKIKSQMASKSIDTSNDANDHDAKEENRQVLEDLTIEDLTIEKQDKVGDENKYVCASIET
        KEEDY EERARM ARILELEAELMKHKRVPEVA KGETDESKIKSQMASKSIDTS+DANDHDAKEENRQVLEDLTIEDLTIEKQDKVG+ENKYV ASIET
Subjt:  KEEDYAEERARMAARILELEAELMKHKRVPEVAIKGETDESKIKSQMASKSIDTSNDANDHDAKEENRQVLEDLTIEDLTIEKQDKVGDENKYVCASIET

Query:  LTKVKDGTSCRLAIGTKDTVVSAGTIFDYDMDGDNVKVSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLWPRHLVIPLDEKSPL--QSDGREWNVGILQ
        LTKVKDGTSCRLAIGTKD VV AGTIFDYDMDGDNVKVSVDMVTDGNCFV VPT+EGRTMLSQEVGSQLLWPRHLVIPLDEK  +   + G  W +  + 
Subjt:  LTKVKDGTSCRLAIGTKDTVVSAGTIFDYDMDGDNVKVSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLWPRHLVIPLDEKSPL--QSDGREWNVGILQ

Query:  VCRRGSV-------SVGISKEDRAQILNARLLGTYHRQILMFPYNSGAFDISNKKKPVWRIIKCPKQGGIVECEYYVMRFMRDIILSSNRTIIEV
          R  +         +     D  +IL +                   FDISNKKKPVWRIIKCPKQGGIVEC YYVMRFMRDIILSSNRTIIEV
Subjt:  VCRRGSV-------SVGISKEDRAQILNARLLGTYHRQILMFPYNSGAFDISNKKKPVWRIIKCPKQGGIVECEYYVMRFMRDIILSSNRTIIEV

KAA0058341.1 uncharacterized protein E6C27_scaffold409G00270 [Cucumis melo var. makuwa] | e value: 4.35e-251 | % identity: 66.3
Query:  MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLI
        MTLGKQARL+ GCSS+PS EGVI KGKGTRGPT MSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIY+LI
Subjt:  MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLI

Query:  E----------------------------------MVSNKGREWRKNNKYNHRMLRKGYANFVKEMKESTSNGELIDRVLVWKKARTTKDGEIPDIDTKE
        E                                  MVSNKGRE RKNNKYNHRM RKGYAN  +EMK STSNGELIDR LVWKKARTTKDGEIPD DTKE
Subjt:  E----------------------------------MVSNKGREWRKNNKYNHRMLRKGYANFVKEMKESTSNGELIDRVLVWKKARTTKDGEIPDIDTKE

Query:  VANKIDSR---------------------------------VGQYVTPSKYFHIAREKRKKVGKEEDYAEERARMAARILELEAELMKHKRVPEVAIKGE
        VANKID+                                  VGQYVTPSKYFHIAREKRKKVGKEEDYAEERARMAARILELEAELMKHK V EVA KGE
Subjt:  VANKIDSR---------------------------------VGQYVTPSKYFHIAREKRKKVGKEEDYAEERARMAARILELEAELMKHKRVPEVAIKGE

Query:  TDESKIKSQMASKSIDTSNDANDHDAKEENRQVLEDLTIEDLTIEKQDKVGDENKYVCASIETLTKVKDGTSCRLAIGTKDTVVSAGTIFDYDMDGDNVK
        +DESKIKSQMASKSIDTS+DANDHD KEENRQVLEDLTIEDLTIEKQDKVG ENKYVCASIETLTKVKD             VV AGTIFDYDMD +NVK
Subjt:  TDESKIKSQMASKSIDTSNDANDHDAKEENRQVLEDLTIEDLTIEKQDKVGDENKYVCASIETLTKVKDGTSCRLAIGTKDTVVSAGTIFDYDMDGDNVK

Query:  VSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLWPRHLVIPLDEK--SPLQSDGR---------------------------------------------
        VSVD VT GNCFV VPTREGRTMLSQEVGSQLLWPRHLVIP DEK  S  Q+D R                                             
Subjt:  VSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLWPRHLVIPLDEK--SPLQSDGR---------------------------------------------

Query:  ---------------------------------EWNVGILQVCRRGSVSVGISKEDRAQILNARLLGTYHRQILMFPYNSGAFDISNKKKPVWRIIKCPK
                                            +G  +    GS+SVGISKE+RAQILNARLLGT HRQILMFPYNSGAFDISNKKKPVW+IIKCPK
Subjt:  ---------------------------------EWNVGILQVCRRGSVSVGISKEDRAQILNARLLGTYHRQILMFPYNSGAFDISNKKKPVWRIIKCPK

Query:  QGGIVECEYYVMRFMRDIILSSNRTIIEV
        QGGIVEC YYVMRFMRDIIL SNRTIIEV
Subjt:  QGGIVECEYYVMRFMRDIILSSNRTIIEV
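
In these alignment blocks the middle line carries a letter where query and subject residues are identical, a `+` for a conservative substitution, and a space otherwise. The sketch below counts identities from that middle line for a single segment. The function name `midline_identity` is hypothetical, and the count covers only the segment passed in, so it is not expected to reproduce the whole-alignment % identity reported for a hit.

```python
def midline_identity(midline: str):
    """Return (identities, columns) for one alignment midline segment.

    A column counts as an identity when the midline shows a letter;
    '+' (conservative substitution) and ' ' (mismatch or gap) do not count.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    return identities, len(midline)


# Final segment of the KAA0058341.1 alignment shown above.
segment = "QGGIVEC YYVMRFMRDIIL SNRTIIEV"
identities, columns = midline_identity(segment)
print(f"{identities}/{columns} identical columns")
```

For this segment, 27 of 29 columns are identical.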

TYK22488.1 uncharacterized protein E5676_scaffold19523G00250 [Cucumis melo var. makuwa] | e value: 6.13e-250 | % identity: 66.51
Query:  MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLI
        MTLGKQARL+ GCSS+PS EGVI KGKGTRGPT MSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIY+LI
Subjt:  MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLI

Query:  EMVSNKGREWRKNNKYNHRMLRKGYANFVKEMKESTSNGELIDRVLVWKKARTTKDGEIPDIDTKEVANKIDSR--------------------------
        EMVSNKGRE RKNNKYNHRM RKGYAN  +EMK STSNGELIDR LVWKKARTTKDGEIPD DTKEVANKID+                           
Subjt:  EMVSNKGREWRKNNKYNHRMLRKGYANFVKEMKESTSNGELIDRVLVWKKARTTKDGEIPDIDTKEVANKIDSR--------------------------

Query:  -------VGQYVTPSKYFHIAREKRKKVGKEEDYAEERARMAARILELEAELMKHKRVPEVAIKGETDESKIKSQMASKSIDTSNDANDHDAKEENRQVL
               VGQYVTPSKYFHIAREKRKKVGKEEDYAEERARMAARILELEAELMKHK V EVA KGE+DESKIKSQMASKSIDTS+DANDHD KEENRQVL
Subjt:  -------VGQYVTPSKYFHIAREKRKKVGKEEDYAEERARMAARILELEAELMKHKRVPEVAIKGETDESKIKSQMASKSIDTSNDANDHDAKEENRQVL

Query:  EDLTIEDLTIEKQDKVGDENKYVCASIETLTKVKDGTSCRLAIGTKDTVVSAGTIFDYDMDGDNVKVSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLW
        EDLTIEDLTIEKQDKVG ENKYVCASIETLTKVKD             VV AGTIFDYDMD +NVKVSVD VT GNCFV VPTREGRTMLSQEVGSQLLW
Subjt:  EDLTIEDLTIEKQDKVGDENKYVCASIETLTKVKDGTSCRLAIGTKDTVVSAGTIFDYDMDGDNVKVSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLW

Query:  PRHLVIPLDEK--SPLQSDGR------------------------------------------------------------------------------E
        PRHLVIP DEK  S  Q+D R                                                                               
Subjt:  PRHLVIPLDEK--SPLQSDGR------------------------------------------------------------------------------E

Query:  WNVGILQVCRRGSVSVGISKEDRAQILNARLLGTYHRQILMFPYNSG-----------------------------------AFDISNKKKPVWRIIKCP
          +G  +    GSVSVGISKEDRAQILNARLLG  HRQILMFPYNS                                    AFDISNKKKPVWRIIKCP
Subjt:  WNVGILQVCRRGSVSVGISKEDRAQILNARLLGTYHRQILMFPYNSG-----------------------------------AFDISNKKKPVWRIIKCP

Query:  KQGGIVECEYYVMRFMRDIILSSNRTIIEV
        KQGGIVEC YYVMRFMRDIILSSNRTIIEV
Subjt:  KQGGIVECEYYVMRFMRDIILSSNRTIIEV

TYK24391.1 uncharacterized protein E5676_scaffold205G001770 [Cucumis melo var. makuwa] | e value: 2.63e-244 | % identity: 67.56
Query:  MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLI
        MTLGKQARLE GCSSVPS EGVINKGKGTRGPT MSEITRVSCD HKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSW+DVP ELKDKIY+LI
Subjt:  MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLI

Query:  E-----------------------------------------------------------------------MVSNKGREWRKNNKYNHRMLRKGYANFV
        E                                                                       MVSNKGRE RKNNKYNHRM RKGYAN  
Subjt:  E-----------------------------------------------------------------------MVSNKGREWRKNNKYNHRMLRKGYANFV

Query:  KEMKESTSNGELIDRVLVWKKARTTKDGEIPDIDTKEVANKIDSR---------------------------------VGQYVTPSKYFHIAREKRKKVG
        +EMK +TSNGELIDR LVWKKARTTKDGEIPDIDTKEVANKID+                                  VGQYVTPSKYFH AREKRKKVG
Subjt:  KEMKESTSNGELIDRVLVWKKARTTKDGEIPDIDTKEVANKIDSR---------------------------------VGQYVTPSKYFHIAREKRKKVG

Query:  KEEDYAEERARMAARILELEAELMKHKRVPEVAIKGETDESKIKSQMASKSIDTSNDANDHDAKEENRQVLEDLTIEDLTIEKQDKVGDENKYVCASIET
        KEEDY EERARM ARILELEAELMKHKRVPEVA KGETDESKIKSQMASKSIDTS+DANDHDAKEENRQVLEDLTIEDLTIEKQDKVG+ENKYV ASIET
Subjt:  KEEDYAEERARMAARILELEAELMKHKRVPEVAIKGETDESKIKSQMASKSIDTSNDANDHDAKEENRQVLEDLTIEDLTIEKQDKVGDENKYVCASIET

Query:  LTKVKDGTSCRLAIGTKDTVVSAGTIFDYDMDGDNVKVSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLWPRHLVIPLDEKSPL--QSDGREWNVGILQ
        LTKVKDGTSCRLAIGTKD VV AGTIFDYDMDGDNVKVSVDMVTDGNCFV VPT+EGRTMLSQEVGSQLLWPRHLVIPLDEK  +   + G  W +  + 
Subjt:  LTKVKDGTSCRLAIGTKDTVVSAGTIFDYDMDGDNVKVSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLWPRHLVIPLDEKSPL--QSDGREWNVGILQ

Query:  VCRRGSV-------SVGISKEDRAQILNARLLGTYHRQILMFPYNSGAFDISNKKKPVWRIIKCPKQGGIVECEYYVMRFMRDIILSSNRTIIEV
          R  +         +     D  +IL +                   FDISNKKKPVWRIIKCPKQGGIVEC YYVMRFMRDIILSSNRTIIEV
Subjt:  VCRRGSV-------SVGISKEDRAQILNARLLGTYHRQILMFPYNSGAFDISNKKKPVWRIIKCPKQGGIVECEYYVMRFMRDIILSSNRTIIEV

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SM56 ULP_PROTEASE domain-containing protein | e value: 2.7e-217 | % identity: 64.79
Query:  MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLI
        MTLGKQARLE GCSSVPS EGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGT V++HVPISYQSWKDVPTELKDKIY+LI
Subjt:  MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLI

Query:  ----------------------------------------------EMVSNKGREWRKNNKYNHRMLRKGYANFVKEMKESTSNGELIDRVLVWKKARTT
                                                      EMVSNKGRE RKNNKYNHRM +KGYAN  +EMK STSNGELIDR LVWKKARTT
Subjt:  ----------------------------------------------EMVSNKGREWRKNNKYNHRMLRKGYANFVKEMKESTSNGELIDRVLVWKKARTT

Query:  KDGEIPDIDTKEVANKIDS---------------------------------RVGQYVTPSKYFHIAREKRKKVGKEEDYAEERARMAARILELEAELMK
        KDGEIPDIDTKEVANKID+                                  VGQYVTP KYFH AREKRKKVGKEEDYAEERARMAARILELEAELMK
Subjt:  KDGEIPDIDTKEVANKIDS---------------------------------RVGQYVTPSKYFHIAREKRKKVGKEEDYAEERARMAARILELEAELMK

Query:  HKRVPEVAIKGETDESKIKSQMASKSIDTSNDANDHDAKEENRQVLEDLTIEDLTIEKQDKVGDENKYVCASIETLTKVKDGTSCRLAIGTKDTVVSAGT
        HK+VPEVA KGETDESKIKSQMASKSIDTS+DANDHDAKEENRQVLEDLTIEDLTIEKQDKVG+ENKYVCASIETLTKVKDGTSCRLAIGTKD VV AGT
Subjt:  HKRVPEVAIKGETDESKIKSQMASKSIDTSNDANDHDAKEENRQVLEDLTIEDLTIEKQDKVGDENKYVCASIETLTKVKDGTSCRLAIGTKDTVVSAGT

Query:  IFDYDMDGDNVKVSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLWPRHLVIPLDE--KSPLQSDGR---------------------------------
        IFDY MDGDNVKVSVDMVTDGNCFV VPTREGRTMLSQEVGSQLLWPRHLVIPLDE  KS  Q+D R                                 
Subjt:  IFDYDMDGDNVKVSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLWPRHLVIPLDE--KSPLQSDGR---------------------------------

Query:  ---------------------------------------------EWNVGILQVCRRGSVSVGISKEDRAQILNARLLGTYHRQILMFPYNSG-------
                                                        +G  +    GS+SVGISKEDRAQILNARLLGT HRQILMFPYNSG       
Subjt:  ---------------------------------------------EWNVGILQVCRRGSVSVGISKEDRAQILNARLLGTYHRQILMFPYNSG-------

Query:  ----------------------------AFDISNKKKPVWRIIKCPKQGGIVECEYYVMRFMRDIILSSNRTIIEV
                                    AFDISNKKKPVWRIIKCPKQGGIVEC YYVMRFMRDIILSSNRTIIEV
Subjt:  ----------------------------AFDISNKKKPVWRIIKCPKQGGIVECEYYVMRFMRDIILSSNRTIIEV

A0A5A7TF26 ULP_PROTEASE domain-containing protein | e value: 2.3e-200 | % identity: 67.39
Query:  MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLI
        MTLGKQARLE GCSSVPS EGVINKGKGTRGPT MSEITRVSCD HKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSW+DVP ELKDKIY+LI
Subjt:  MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLI

Query:  E-----------------------------------------------------------------------MVSNKGREWRKNNKYNHRMLRKGYANFV
        E                                                                       MVSNKGRE RKNNKYNHRM RKGYAN  
Subjt:  E-----------------------------------------------------------------------MVSNKGREWRKNNKYNHRMLRKGYANFV

Query:  KEMKESTSNGELIDRVLVWKKARTTKDGEIPDIDTKEVANKIDS---------------------------------RVGQYVTPSKYFHIAREKRKKVG
        +EMK +TSNGELI+R LVWKKARTTKDGEIPDIDTKEVANKID+                                  VGQYVTPSKYFH AREKRKKVG
Subjt:  KEMKESTSNGELIDRVLVWKKARTTKDGEIPDIDTKEVANKIDS---------------------------------RVGQYVTPSKYFHIAREKRKKVG

Query:  KEEDYAEERARMAARILELEAELMKHKRVPEVAIKGETDESKIKSQMASKSIDTSNDANDHDAKEENRQVLEDLTIEDLTIEKQDKVGDENKYVCASIET
        KEEDY EERARM ARILELEAELMKHKRVPEVA KGETDESKIKSQMASKSIDTS+DANDHDAKEENRQVLEDLTIEDLTIEKQDKVG+ENKYV ASIET
Subjt:  KEEDYAEERARMAARILELEAELMKHKRVPEVAIKGETDESKIKSQMASKSIDTSNDANDHDAKEENRQVLEDLTIEDLTIEKQDKVGDENKYVCASIET

Query:  LTKVKDGTSCRLAIGTKDTVVSAGTIFDYDMDGDNVKVSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLWPRHLVIPLDEKSPL--QSDGREWNVGILQ
        LTKVKDGTSCRLAIGTKD VV AGTIFDYDMDGDNVKVSVDMVTDGNCFV VPT+EGRTMLSQEVGSQLLWPRHLVIPLDEK  +   + G  W +  + 
Subjt:  LTKVKDGTSCRLAIGTKDTVVSAGTIFDYDMDGDNVKVSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLWPRHLVIPLDEKSPL--QSDGREWNVGILQ

Query:  VCRRGSV-------SVGISKEDRAQILNARLLGTYHRQILMFPYNSGAFDISNKKKPVWRIIKCPKQGGIVECEYYVMRFMRDIILSSNRTIIEV
          R  +         +     D  +IL                    +FDISNKKKPVWRIIKCPKQGGIVEC YYVMRFMRDIILSSNRTIIEV
Subjt:  VCRRGSV-------SVGISKEDRAQILNARLLGTYHRQILMFPYNSGAFDISNKKKPVWRIIKCPKQGGIVECEYYVMRFMRDIILSSNRTIIEV

A0A5A7UXY5 Uncharacterized protein | e value: 9.7e-207 | % identity: 66.3
Query:  MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLI
        MTLGKQARL+ GCSS+PS EGVI KGKGTRGPT MSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIY+LI
Subjt:  MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLI

Query:  ----------------------------------EMVSNKGREWRKNNKYNHRMLRKGYANFVKEMKESTSNGELIDRVLVWKKARTTKDGEIPDIDTKE
                                          EMVSNKGRE RKNNKYNHRM RKGYAN  +EMK STSNGELIDR LVWKKARTTKDGEIPD DTKE
Subjt:  ----------------------------------EMVSNKGREWRKNNKYNHRMLRKGYANFVKEMKESTSNGELIDRVLVWKKARTTKDGEIPDIDTKE

Query:  VANKIDS---------------------------------RVGQYVTPSKYFHIAREKRKKVGKEEDYAEERARMAARILELEAELMKHKRVPEVAIKGE
        VANKID+                                  VGQYVTPSKYFHIAREKRKKVGKEEDYAEERARMAARILELEAELMKHK V EVA KGE
Subjt:  VANKIDS---------------------------------RVGQYVTPSKYFHIAREKRKKVGKEEDYAEERARMAARILELEAELMKHKRVPEVAIKGE

Query:  TDESKIKSQMASKSIDTSNDANDHDAKEENRQVLEDLTIEDLTIEKQDKVGDENKYVCASIETLTKVKDGTSCRLAIGTKDTVVSAGTIFDYDMDGDNVK
        +DESKIKSQMASKSIDTS+DANDHD KEENRQVLEDLTIEDLTIEKQDKVG ENKYVCASIETLTKV            KD VV AGTIFDYDMD +NVK
Subjt:  TDESKIKSQMASKSIDTSNDANDHDAKEENRQVLEDLTIEDLTIEKQDKVGDENKYVCASIETLTKVKDGTSCRLAIGTKDTVVSAGTIFDYDMDGDNVK

Query:  VSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLWPRHLVIPLDE--KSPLQSDGR---------------------------------------------
        VSVD VT GNCFV VPTREGRTMLSQEVGSQLLWPRHLVIP DE  KS  Q+D R                                             
Subjt:  VSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLWPRHLVIPLDE--KSPLQSDGR---------------------------------------------

Query:  ---------------------------------EWNVGILQVCRRGSVSVGISKEDRAQILNARLLGTYHRQILMFPYNSGAFDISNKKKPVWRIIKCPK
                                            +G  +    GS+SVGISKE+RAQILNARLLGT HRQILMFPYNSGAFDISNKKKPVW+IIKCPK
Subjt:  ---------------------------------EWNVGILQVCRRGSVSVGISKEDRAQILNARLLGTYHRQILMFPYNSGAFDISNKKKPVWRIIKCPK

Query:  QGGIVECEYYVMRFMRDIILSSNRTIIEV
        QGGIVEC YYVMRFMRDIIL SNRTIIEV
Subjt:  QGGIVECEYYVMRFMRDIILSSNRTIIEV

A0A5D3DGA5 ULP_PROTEASE domain-containing protein | e value: 9.7e-207 | % identity: 66.51
Query:  MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLI
        MTLGKQARL+ GCSS+PS EGVI KGKGTRGPT MSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIY+LI
Subjt:  MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLI

Query:  EMVSNKGREWRKNNKYNHRMLRKGYANFVKEMKESTSNGELIDRVLVWKKARTTKDGEIPDIDTKEVANKIDS---------------------------
        EMVSNKGRE RKNNKYNHRM RKGYAN  +EMK STSNGELIDR LVWKKARTTKDGEIPD DTKEVANKID+                           
Subjt:  EMVSNKGREWRKNNKYNHRMLRKGYANFVKEMKESTSNGELIDRVLVWKKARTTKDGEIPDIDTKEVANKIDS---------------------------

Query:  ------RVGQYVTPSKYFHIAREKRKKVGKEEDYAEERARMAARILELEAELMKHKRVPEVAIKGETDESKIKSQMASKSIDTSNDANDHDAKEENRQVL
               VGQYVTPSKYFHIAREKRKKVGKEEDYAEERARMAARILELEAELMKHK V EVA KGE+DESKIKSQMASKSIDTS+DANDHD KEENRQVL
Subjt:  ------RVGQYVTPSKYFHIAREKRKKVGKEEDYAEERARMAARILELEAELMKHKRVPEVAIKGETDESKIKSQMASKSIDTSNDANDHDAKEENRQVL

Query:  EDLTIEDLTIEKQDKVGDENKYVCASIETLTKVKDGTSCRLAIGTKDTVVSAGTIFDYDMDGDNVKVSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLW
        EDLTIEDLTIEKQDKVG ENKYVCASIETLTKV            KD VV AGTIFDYDMD +NVKVSVD VT GNCFV VPTREGRTMLSQEVGSQLLW
Subjt:  EDLTIEDLTIEKQDKVGDENKYVCASIETLTKVKDGTSCRLAIGTKDTVVSAGTIFDYDMDGDNVKVSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLW

Query:  PRHLVIPLDE--KSPLQSDGR------------------------------------------------------------------------------E
        PRHLVIP DE  KS  Q+D R                                                                               
Subjt:  PRHLVIPLDE--KSPLQSDGR------------------------------------------------------------------------------E

Query:  WNVGILQVCRRGSVSVGISKEDRAQILNARLLGTYHRQILMFPYNSG-----------------------------------AFDISNKKKPVWRIIKCP
          +G  +    GSVSVGISKEDRAQILNARLLG  HRQILMFPYNS                                    AFDISNKKKPVWRIIKCP
Subjt:  WNVGILQVCRRGSVSVGISKEDRAQILNARLLGTYHRQILMFPYNSG-----------------------------------AFDISNKKKPVWRIIKCP

Query:  KQGGIVECEYYVMRFMRDIILSSNRTIIEV
        KQGGIVEC YYVMRFMRDIILSSNRTIIEV
Subjt:  KQGGIVECEYYVMRFMRDIILSSNRTIIEV

A0A5D3DL96 ULP_PROTEASE domain-containing protein | e value: 8.0e-201 | % identity: 67.56
Query:  MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLI
        MTLGKQARLE GCSSVPS EGVINKGKGTRGPT MSEITRVSCD HKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSW+DVP ELKDKIY+LI
Subjt:  MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLI

Query:  E-----------------------------------------------------------------------MVSNKGREWRKNNKYNHRMLRKGYANFV
        E                                                                       MVSNKGRE RKNNKYNHRM RKGYAN  
Subjt:  E-----------------------------------------------------------------------MVSNKGREWRKNNKYNHRMLRKGYANFV

Query:  KEMKESTSNGELIDRVLVWKKARTTKDGEIPDIDTKEVANKIDS---------------------------------RVGQYVTPSKYFHIAREKRKKVG
        +EMK +TSNGELIDR LVWKKARTTKDGEIPDIDTKEVANKID+                                  VGQYVTPSKYFH AREKRKKVG
Subjt:  KEMKESTSNGELIDRVLVWKKARTTKDGEIPDIDTKEVANKIDS---------------------------------RVGQYVTPSKYFHIAREKRKKVG

Query:  KEEDYAEERARMAARILELEAELMKHKRVPEVAIKGETDESKIKSQMASKSIDTSNDANDHDAKEENRQVLEDLTIEDLTIEKQDKVGDENKYVCASIET
        KEEDY EERARM ARILELEAELMKHKRVPEVA KGETDESKIKSQMASKSIDTS+DANDHDAKEENRQVLEDLTIEDLTIEKQDKVG+ENKYV ASIET
Subjt:  KEEDYAEERARMAARILELEAELMKHKRVPEVAIKGETDESKIKSQMASKSIDTSNDANDHDAKEENRQVLEDLTIEDLTIEKQDKVGDENKYVCASIET

Query:  LTKVKDGTSCRLAIGTKDTVVSAGTIFDYDMDGDNVKVSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLWPRHLVIPLDEKSPL--QSDGREWNVGILQ
        LTKVKDGTSCRLAIGTKD VV AGTIFDYDMDGDNVKVSVDMVTDGNCFV VPT+EGRTMLSQEVGSQLLWPRHLVIPLDEK  +   + G  W +  + 
Subjt:  LTKVKDGTSCRLAIGTKDTVVSAGTIFDYDMDGDNVKVSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLWPRHLVIPLDEKSPL--QSDGREWNVGILQ

Query:  VCRRGSV-------SVGISKEDRAQILNARLLGTYHRQILMFPYNSGAFDISNKKKPVWRIIKCPKQGGIVECEYYVMRFMRDIILSSNRTIIEV
          R  +         +     D  +IL                    +FDISNKKKPVWRIIKCPKQGGIVEC YYVMRFMRDIILSSNRTIIEV
Subjt:  VCRRGSV-------SVGISKEDRAQILNARLLGTYHRQILMFPYNSGAFDISNKKKPVWRIIKCPKQGGIVECEYYVMRFMRDIILSSNRTIIEV

SwissProt top hits (e value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein | e value: 3.2e-13 | % identity: 26.44
Query:  ALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGA
        +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I      
Subjt:  ALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGA

Query:  VTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWR
            + Q+++    + +++   +ND  L+    L    + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+
Subjt:  VTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWR

Query:  NMKREVAE
         +++++ E
Subjt:  NMKREVAE

P0CT35 Transposon Tf2-2 polyprotein | e value: 3.2e-13 | % identity: 26.44
Query:  ALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGA
        +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I      
Subjt:  ALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGA

Query:  VTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWR
            + Q+++    + +++   +ND  L+    L    + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+
Subjt:  VTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWR

Query:  NMKREVAE
         +++++ E
Subjt:  NMKREVAE

P0CT36 Transposon Tf2-3 polyprotein | e value: 3.2e-13 | % identity: 26.44
Query:  ALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGA
        +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I      
Subjt:  ALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGA

Query:  VTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWR
            + Q+++    + +++   +ND  L+    L    + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+
Subjt:  VTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWR

Query:  NMKREVAE
         +++++ E
Subjt:  NMKREVAE

P0CT41 Transposon Tf2-12 polyprotein | e value: 3.2e-13 | % identity: 26.44
Query:  ALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGA
        +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I      
Subjt:  ALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGA

Query:  VTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWR
            + Q+++    + +++   +ND  L+    L    + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+
Subjt:  VTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWR

Query:  NMKREVAE
         +++++ E
Subjt:  NMKREVAE

Q9UR07 Transposon Tf2-11 polyprotein | e value: 3.2e-13 | % identity: 26.44
Query:  ALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGA
        +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I      
Subjt:  ALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGA

Query:  VTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWR
            + Q+++    + +++   +ND  L+    L    + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+
Subjt:  VTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWR

Query:  NMKREVAE
         +++++ E
Subjt:  NMKREVAE

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGACACTTGGAAAACAAGCTAGATTAGAAGTTGGTTGCTCGAGCGTACCGAGTATCGAGGGAGTCATTAATAAAGGTAAAGGTACACGTGGACCTACGGGAATGTCTGA
GATCACACGGGTTAGTTGTGATGGACACAAGAGAGTGGTTGAATACAATGAGCTCGGTCAGCCCATTGGTGAAAGTGCAACCAAGTTAAAGAGTTTCATTGGAACAACCG
TACGGGTTCATGTTCCGATCAGTTACCAATCCTGGAAGGATGTACCTACAGAGCTCAAAGATAAAATTTACAAACTAATTGAGATGGTTAGCAACAAGGGTCGTGAGTGG
AGAAAGAACAACAAGTACAACCACCGAATGTTGCGAAAAGGTTATGCAAACTTTGTCAAAGAGATGAAAGAAAGCACATCTAATGGTGAGTTAATTGATCGTGTTTTAGT
ATGGAAGAAAGCACGAACTACTAAAGATGGAGAGATTCCAGACATAGATACAAAGGAGGTGGCCAATAAAATCGATTCGAGGGTGGGTCAATACGTCACACCGAGTAAAT
ACTTCCACATTGCAAGAGAAAAACGAAAGAAGGTCGGTAAAGAAGAAGATTATGCTGAAGAGCGAGCGAGAATGGCTGCTCGTATCTTAGAACTGGAAGCGGAATTAATG
AAGCATAAGAGGGTTCCTGAAGTGGCCATAAAGGGAGAAACTGACGAGAGCAAGATCAAGAGTCAAATGGCTTCTAAAAGCATTGACACGTCAAATGATGCAAATGACCA
TGACGCAAAAGAAGAAAATAGACAAGTTCTAGAAGACTTGACAATAGAGGACTTGACAATAGAAAAACAAGATAAGGTTGGTGACGAGAACAAATATGTTTGTGCATCGA
TTGAGACTTTGACAAAAGTGAAGGATGGCACTTCTTGTCGACTTGCAATTGGGACTAAGGACACTGTTGTCAGTGCTGGAACTATTTTCGACTACGACATGGACGGTGAC
AACGTGAAGGTATCAGTGGACATGGTTACCGATGGTAATTGCTTCGTTCTTGTTCCAACAAGGGAAGGTAGGACTATGCTTTCCCAAGAAGTTGGTTCACAGTTGTTATG
GCCTCGTCATTTAGTCATTCCTCTAGACGAGAAGTCACCTCTACAAAGTGATGGAAGAGAATGGAACGTTGGGATCCTACAAGTTTGCAGACGTGGCTCTGTTTCAGTTG
GTATAAGTAAAGAAGATCGAGCACAGATCCTAAATGCTAGATTGCTTGGGACCTACCATCGTCAGATTTTGATGTTTCCATACAATTCAGGGGCTTTCGACATTTCGAAC
AAGAAGAAGCCTGTTTGGAGGATAATCAAGTGTCCGAAGCAAGGTGGCATTGTGGAGTGCGAGTATTATGTGATGCGATTCATGCGTGATATAATATTGTCAAGCAACAG
GACAATCATTGAAGTAAGTTGGCAGCAGTGGTTTTTTGCTTTGAAAATATGGAGGCATTACTTATATGGTGAAAAGATACAGATCTTCACGGATCATAAGAGCCTGAAAT
ACTTCTTTACTCAGAAAGAATTGAATATGAGACAGCGGAGATGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCGGGCAAGGCAAATGTGGTAGCT
GATGCTCTTAGTAGGAAGGTGTCACATTCAGCAGCACTTATTACCCGGCAGGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTTAC
TATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTTGAGGCAAAGGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGC
AAGCGGTTGAGTTCTCATTATCCTCTGATGGTGGACTGTTGTTTGAGAGACGCCTCTGTGTTCCGTCAGATAGTGCGGTTAAGACAGAATTATTATCTGAGGCGCACAGT
TCCCCATTTTCCATGCACCCAGGTAGTACGAAGATGTATCAGGACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAAGTAGCAGAATTGTTAGTAATGCCTGG
TGTGTCAGCAGGTTAA
mRNA sequence
ATGACACTTGGAAAACAAGCTAGATTAGAAGTTGGTTGCTCGAGCGTACCGAGTATCGAGGGAGTCATTAATAAAGGTAAAGGTACACGTGGACCTACGGGAATGTCTGA
GATCACACGGGTTAGTTGTGATGGACACAAGAGAGTGGTTGAATACAATGAGCTCGGTCAGCCCATTGGTGAAAGTGCAACCAAGTTAAAGAGTTTCATTGGAACAACCG
TACGGGTTCATGTTCCGATCAGTTACCAATCCTGGAAGGATGTACCTACAGAGCTCAAAGATAAAATTTACAAACTAATTGAGATGGTTAGCAACAAGGGTCGTGAGTGG
AGAAAGAACAACAAGTACAACCACCGAATGTTGCGAAAAGGTTATGCAAACTTTGTCAAAGAGATGAAAGAAAGCACATCTAATGGTGAGTTAATTGATCGTGTTTTAGT
ATGGAAGAAAGCACGAACTACTAAAGATGGAGAGATTCCAGACATAGATACAAAGGAGGTGGCCAATAAAATCGATTCGAGGGTGGGTCAATACGTCACACCGAGTAAAT
ACTTCCACATTGCAAGAGAAAAACGAAAGAAGGTCGGTAAAGAAGAAGATTATGCTGAAGAGCGAGCGAGAATGGCTGCTCGTATCTTAGAACTGGAAGCGGAATTAATG
AAGCATAAGAGGGTTCCTGAAGTGGCCATAAAGGGAGAAACTGACGAGAGCAAGATCAAGAGTCAAATGGCTTCTAAAAGCATTGACACGTCAAATGATGCAAATGACCA
TGACGCAAAAGAAGAAAATAGACAAGTTCTAGAAGACTTGACAATAGAGGACTTGACAATAGAAAAACAAGATAAGGTTGGTGACGAGAACAAATATGTTTGTGCATCGA
TTGAGACTTTGACAAAAGTGAAGGATGGCACTTCTTGTCGACTTGCAATTGGGACTAAGGACACTGTTGTCAGTGCTGGAACTATTTTCGACTACGACATGGACGGTGAC
AACGTGAAGGTATCAGTGGACATGGTTACCGATGGTAATTGCTTCGTTCTTGTTCCAACAAGGGAAGGTAGGACTATGCTTTCCCAAGAAGTTGGTTCACAGTTGTTATG
GCCTCGTCATTTAGTCATTCCTCTAGACGAGAAGTCACCTCTACAAAGTGATGGAAGAGAATGGAACGTTGGGATCCTACAAGTTTGCAGACGTGGCTCTGTTTCAGTTG
GTATAAGTAAAGAAGATCGAGCACAGATCCTAAATGCTAGATTGCTTGGGACCTACCATCGTCAGATTTTGATGTTTCCATACAATTCAGGGGCTTTCGACATTTCGAAC
AAGAAGAAGCCTGTTTGGAGGATAATCAAGTGTCCGAAGCAAGGTGGCATTGTGGAGTGCGAGTATTATGTGATGCGATTCATGCGTGATATAATATTGTCAAGCAACAG
GACAATCATTGAAGTAAGTTGGCAGCAGTGGTTTTTTGCTTTGAAAATATGGAGGCATTACTTATATGGTGAAAAGATACAGATCTTCACGGATCATAAGAGCCTGAAAT
ACTTCTTTACTCAGAAAGAATTGAATATGAGACAGCGGAGATGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCGGGCAAGGCAAATGTGGTAGCT
GATGCTCTTAGTAGGAAGGTGTCACATTCAGCAGCACTTATTACCCGGCAGGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTTAC
TATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTTGAGGCAAAGGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGC
AAGCGGTTGAGTTCTCATTATCCTCTGATGGTGGACTGTTGTTTGAGAGACGCCTCTGTGTTCCGTCAGATAGTGCGGTTAAGACAGAATTATTATCTGAGGCGCACAGT
TCCCCATTTTCCATGCACCCAGGTAGTACGAAGATGTATCAGGACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAAGTAGCAGAATTGTTAGTAATGCCTGG
TGTGTCAGCAGGTTAA
Protein sequence
MTLGKQARLEVGCSSVPSIEGVINKGKGTRGPTGMSEITRVSCDGHKRVVEYNELGQPIGESATKLKSFIGTTVRVHVPISYQSWKDVPTELKDKIYKLIEMVSNKGREW
RKNNKYNHRMLRKGYANFVKEMKESTSNGELIDRVLVWKKARTTKDGEIPDIDTKEVANKIDSRVGQYVTPSKYFHIAREKRKKVGKEEDYAEERARMAARILELEAELM
KHKRVPEVAIKGETDESKIKSQMASKSIDTSNDANDHDAKEENRQVLEDLTIEDLTIEKQDKVGDENKYVCASIETLTKVKDGTSCRLAIGTKDTVVSAGTIFDYDMDGD
NVKVSVDMVTDGNCFVLVPTREGRTMLSQEVGSQLLWPRHLVIPLDEKSPLQSDGREWNVGILQVCRRGSVSVGISKEDRAQILNARLLGTYHRQILMFPYNSGAFDISN
KKKPVWRIIKCPKQGGIVECEYYVMRFMRDIILSSNRTIIEVSWQQWFFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVA
DALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHS
SPFSMHPGSTKMYQDLKRVYWWRNMKREVAELLVMPGVSAG
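
The protein sequence above should correspond to a translation of the CDS. As a minimal sketch, the check below translates the first and last codons of the CDS with the standard genetic code and compares them with the ends of the listed protein. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration, and the use of the standard code (NCBI translation table 1) is an assumption, since the record does not name the table used.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS listed above.
print(translate("ATGACACTTGGAAAACAAGCTAGATTAGAA"))  # MTLGKQARLE
print(translate("GTGTCAGCAGGTTAA"))                 # VSAG
```

Both fragments agree with the start (`MTLGKQARLE`) and end (`VSAG`) of the listed protein sequence. Passing the full CDS to `translate` would extend the same check to the whole record.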