; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G08370 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G08370
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionULP_PROTEASE domain-containing protein
Genome locationClcChr07:21986635..21993754
RNA-Seq ExpressionClc07G08370
SyntenyClc07G08370
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR004252 - Probable transposase, Ptta/En/Spm, plant
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031647.1 uncharacterized protein E6C27_scaffold139G004200 [Cucumis melo var. makuwa]2.7e-15748.62Show/hide
Query:  RKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNSWKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQTAGICFRQF
        +++ RGP  M EITRA SEG + V+QYNE GQ IG NA KLKSFIGT+VR HVPI Y+ W  VP EIKEKI++LI+ GFVVD RSKK ++Q AG+CFRQF
Subjt:  RKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNSWKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQTAGICFRQF

Query:  KYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRKGYANLSEELKESLPNEDLTKRCVLWKRARID
        KY LTT ++ PF DD++KLK PP EYS I+Q+HW  FV+ RL+E+F+  S   + KRKKH YNHR SRKGYANL EELK S  + D   R + WK+AR+D
Subjt:  KYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRKGYANLSEELKESLPNEDLTKRCVLWKRARID

Query:  REGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQVK----KEDKVTFDAEEKAKMTARIKELEEE
        R+G+I D++T+ VV+ IDE++ T N  + N     DIL++ LG KD P ++RG+GK++   KYFHT  Q K    ++DK T  ++E  +M  RIKELEEE
Subjt:  REGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQVK----KEDKVTFDAEEKAKMTARIKELEEE

Query:  LRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDIDKSGEAENKNDQAKAKLLKHTGSDQYVEVKVEKEE---VGER
        L K K         D    + KE  G                     +G K+ S  +      AEN ND     L     S++ VE  VE  E   V   
Subjt:  LRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDIDKSGEAENKNDQAKAKLLKHTGSDQYVEVKVEKEE---VGER

Query:  KQQNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVIIDNEKEEGKNSNK
        K     + V  + +K GT C LA  +K++VV   TI D DV  DNVKV++DVVV GDC++PIP +     +SQEVGSH+LWPRDLVI +N K +     K
Subjt:  KQQNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVIIDNEKEEGKNSNK

Query:  CLGVLSMTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQVLYLYKLMEKSKTLSLYKFLDAGSISISTNKESRAQLLSSRMAGLD-NQLLIFPY
         +   + T IQ++PVALR LLR +E++GS+IQIT P +V  IH         ME S+TL+LYKF+D+GSIS  + KE RAQLL++R+ G D + LL+ PY
Subjt:  CLGVLSMTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQVLYLYKLMEKSKTLSLYKFLDAGSISISTNKESRAQLLSSRMAGLD-NQLLIFPY

Query:  NSGNHWTLIIIDHSKSSIFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYVMRYMREIILSGTTSILDIV
        NSGNHWTL++I+ +K + FW+D LKNR+D D+  V+         ++    +  ++CPKQSGVVECGYYVMR+MR+II+S +TSI+ I+
Subjt:  NSGNHWTLIIIDHSKSSIFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYVMRYMREIILSGTTSILDIV

KAA0035941.1 uncharacterized protein E6C27_scaffold56G001300 [Cucumis melo var. makuwa]6.7e-16145.48Show/hide
Query:  PVIDGEI-KRKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNSWKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQ
        P  +G I K K TRGPTGM EITR   +GH+RVV+YNE GQPIG +A KLKSFIGT+VR HVPI+Y SWK VP E+K+KI++LI+GGFVVD RSKK+++Q
Subjt:  PVIDGEI-KRKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNSWKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQ

Query:  TAGICFRQFKYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRKGYANLSEELKESLPNEDLTKRC
         A +CFR FK +LTTK++ P+K DLEKLK PP EYS I++EHW+ FV+ RL ++FE+VS+K R +RK + YNHRMSRKGYANL+EE+K S  N +L  R 
Subjt:  TAGICFRQFKYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRKGYANLSEELKESLPNEDLTKRC

Query:  VLWKRARIDREGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQVKKEDKVTFDAEEKAKMTARIK
        ++WK+AR  ++G+IPD  T+ V ++ID +L++        +   DILSQ +G  DP   +RG+G+++ P                               
Subjt:  VLWKRARIDREGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQVKKEDKVTFDAEEKAKMTARIK

Query:  ELEEELRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDIDKSGEAENKNDQAKAKLLKHTGSDQYVE-VKVEKEE-
        +LE EL K K + ++ +             GE  +   K+K ++              + + ID S +A   ND    +  +    D  +E + +EK++ 
Subjt:  ELEEELRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDIDKSGEAENKNDQAKAKLLKHTGSDQYVE-VKVEKEE-

Query:  VGERKQQNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVI-IDNEKEEG
        VGE+ +   +       +KDGTSC LAIG+K+NVVGAGTI DY ++ DNVKVS+D+V  G+C VPIP +    +LSQEVGS LLWPR LVI +D + +  
Subjt:  VGERKQQNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVI-IDNEKEEG

Query:  KNSNKCLGVLSMTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQ-------------------------VLYLYKLMEKSKTLSLYKFLDAGSI
          ++  L  L++   + +PV LR LL EL+YIGS IQI VP +VFG  R+                         + +LYK+ME++ TL  YKF DAGS+
Subjt:  KNSNKCLGVLSMTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQ-------------------------VLYLYKLMEKSKTLSLYKFLDAGSI

Query:  SISTNKESRAQLLSSRMAGLDN-QLLIFPYNSGNHWTLIIIDHSKSSIFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYV
        S+  +KE RAQ+L++R+ G D+ Q+LIFPYNSGNHW LI I+ S+ + +WMDPL+NR++ D   V+ MA  I    KKK  W+ IKCPKQ G+VECGYYV
Subjt:  SISTNKESRAQLLSSRMAGLDN-QLLIFPYNSGNHWTLIIIDHSKSSIFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYV

Query:  MRYMREIILSGTTSILDIV
        MR+MR+IILS   +I++++
Subjt:  MRYMREIILSGTTSILDIV

TYJ96009.1 uncharacterized protein E5676_scaffold2612G00150 [Cucumis melo var. makuwa]5.0e-17249.3Show/hide
Query:  RKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNSWKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQTAGICFRQF
        +K+ RGPT M EITRA SEG + V+QYNE GQ IG NA KLKSFIGT+VR HVPI Y+ W  VP EIK+KI++LI+ GFVVD RSKK ++Q AG+CFRQF
Subjt:  RKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNSWKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQTAGICFRQF

Query:  KYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRKGYANLSEELKESLPNEDLTKRCVLWKRARID
        KY LTT Y+ PF DD+EKLK PP EYS I+Q+HW  FV+ RL+E+F+  S   + KRKKH YNHR SRKGYANL EELK S  + D   R ++WK+AR+D
Subjt:  KYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRKGYANLSEELKESLPNEDLTKRCVLWKRARID

Query:  REGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQ----VKKEDKVTFDAEEKAKMTARIKELEEE
        R+G+IPD++T+ VV+ IDE++ T N  + N     DIL++ LG KD P ++RG+GK++   KYFHT  Q     K++DK T  ++E  +M  RIKELEEE
Subjt:  REGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQ----VKKEDKVTFDAEEKAKMTARIKELEEE

Query:  LRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDI-DKSGEAENKNDQAKAKLLKHTGSDQYVEVKVEKEEVGERKQ
        L K K         D    + KE  G  + S EK   E             D++  D+ D S + E++ D            +  VE+  E  +V   K 
Subjt:  LRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDI-DKSGEAENKNDQAKAKLLKHTGSDQYVEVKVEKEEVGERKQ

Query:  QNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVIIDNEKEEGKNSNKCL
            + V  + +K GT C LA  +K++VV  GTI D D   DNVKV++DVVV GDC++PIP +     +SQEVGSH+LWPRDLVI +N K +     K +
Subjt:  QNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVIIDNEKEEGKNSNKCL

Query:  GVLSMTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQ-------------------------VLYLYKLMEKSKTLSLYKFLDAGSISISTNKE
           + TPIQ++PVALR LLR +E++GS+IQIT P +VFG+ R+                         ++YLY  ME ++TL+LYKFLDAGSIS  ++KE
Subjt:  GVLSMTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQ-------------------------VLYLYKLMEKSKTLSLYKFLDAGSISISTNKE

Query:  SRAQLLSSRMAGLD-NQLLIFPYNSGNHWTLIIIDHSKSSIFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYVMRYMREI
         R QLL++R+ G D +QLL+FPYNSGNHWTL++I+ +K + FW+DPLKNR+D D+  V+  +  I++  KKK +W+ +KCPKQSGVVECGYYVMR+MR+I
Subjt:  SRAQLLSSRMAGLD-NQLLIFPYNSGNHWTLIIIDHSKSSIFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYVMRYMREI

Query:  ILSGTTSILDIV
        I+S +TSI+ I+
Subjt:  ILSGTTSILDIV

TYK04463.1 uncharacterized protein E5676_scaffold409G00690 [Cucumis melo var. makuwa]9.1e-15848.77Show/hide
Query:  RKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNSWKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQTAGICFRQF
        +++ RGP  M EITRA SEG + V+QYNE GQ IG NA KLKSFIGT+VR HVPI Y+ W  VP EIKEKI++LI+ GFVVD RSKK ++Q AG+CFRQF
Subjt:  RKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNSWKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQTAGICFRQF

Query:  KYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRKGYANLSEELKESLPNEDLTKRCVLWKRARID
        KY LTT ++ PF DD+EKLK PP EYS I+Q+HW  FV+ RL+E+F+  S   + KRKKH YNHR SRKGYANL EELK S  + D   R + WK+AR+D
Subjt:  KYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRKGYANLSEELKESLPNEDLTKRCVLWKRARID

Query:  REGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQVK----KEDKVTFDAEEKAKMTARIKELEEE
        R+G+I D++T+ VV+ IDE++ T N  + N     DIL++ LG KD P ++RG+GK++   KYFHT  Q K    ++DK T  ++E  +M  RIKELEEE
Subjt:  REGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQVK----KEDKVTFDAEEKAKMTARIKELEEE

Query:  LRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDIDKSGEAENKNDQAKAKLLKHTGSDQYVEVKVEKEE---VGER
        L K K         D    + KE  G                     +G K+ S  +      AEN ND     L     S++ VE  VE  E   V   
Subjt:  LRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDIDKSGEAENKNDQAKAKLLKHTGSDQYVEVKVEKEE---VGER

Query:  KQQNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVIIDNEKEEGKNSNK
        K     + V  + +K GT C LA  +K++VV   TI D DV  DNVKV++DVVV GDC++PIP +     +SQEVGSH+LWPRDLVI +N K +     K
Subjt:  KQQNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVIIDNEKEEGKNSNK

Query:  CLGVLSMTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQVLYLYKLMEKSKTLSLYKFLDAGSISISTNKESRAQLLSSRMAGLD-NQLLIFPY
         +   + T IQ++PVALR LLR +E++GS+IQIT P +V  IH         ME S+TL+LYKF+D+GSIS  + KE RAQLL++R+ G D + LL+ PY
Subjt:  CLGVLSMTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQVLYLYKLMEKSKTLSLYKFLDAGSISISTNKESRAQLLSSRMAGLD-NQLLIFPY

Query:  NSGNHWTLIIIDHSKSSIFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYVMRYMREIILSGTTSILDIV
        NSGNHWTL++I+ +K + FW+D LKNR+D D+  V+         ++    +  ++CPKQSGVVECGYYVMR+MR+II+S +TSI+ I+
Subjt:  NSGNHWTLIIIDHSKSSIFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYVMRYMREIILSGTTSILDIV

TYK08419.1 uncharacterized protein E5676_scaffold654G00340 [Cucumis melo var. makuwa]5.0e-17249.3Show/hide
Query:  RKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNSWKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQTAGICFRQF
        +K+ RGPT M EITRA SEG + V+QYNE GQ IG NA KLKSFIGT+VR HVPI Y+ W  VP EIK+KI++LI+ GFVVD RSKK ++Q AG+CFRQF
Subjt:  RKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNSWKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQTAGICFRQF

Query:  KYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRKGYANLSEELKESLPNEDLTKRCVLWKRARID
        KY LTT Y+ PF DD+EKLK PP EYS I+Q+HW  FV+ RL+E+F+  S   + KRKKH YNHR SRKGYANL EELK S  + D   R ++WK+AR+D
Subjt:  KYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRKGYANLSEELKESLPNEDLTKRCVLWKRARID

Query:  REGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQ----VKKEDKVTFDAEEKAKMTARIKELEEE
        R+G+IPD++T+ VV+ IDE++ T N  + N     DIL++ LG KD P ++RG+GK++   KYFHT  Q     K++DK T  ++E  +M  RIKELEEE
Subjt:  REGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQ----VKKEDKVTFDAEEKAKMTARIKELEEE

Query:  LRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDI-DKSGEAENKNDQAKAKLLKHTGSDQYVEVKVEKEEVGERKQ
        L K K         D    + KE  G  + S EK   E             D++  D+ D S + E++ D            +  VE+  E  +V   K 
Subjt:  LRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDI-DKSGEAENKNDQAKAKLLKHTGSDQYVEVKVEKEEVGERKQ

Query:  QNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVIIDNEKEEGKNSNKCL
            + V  + +K GT C LA  +K++VV  GTI D D   DNVKV++DVVV GDC++PIP +     +SQEVGSH+LWPRDLVI +N K +     K +
Subjt:  QNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVIIDNEKEEGKNSNKCL

Query:  GVLSMTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQ-------------------------VLYLYKLMEKSKTLSLYKFLDAGSISISTNKE
           + TPIQ++PVALR LLR +E++GS+IQIT P +VFG+ R+                         ++YLY  ME ++TL+LYKFLDAGSIS  ++KE
Subjt:  GVLSMTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQ-------------------------VLYLYKLMEKSKTLSLYKFLDAGSISISTNKE

Query:  SRAQLLSSRMAGLD-NQLLIFPYNSGNHWTLIIIDHSKSSIFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYVMRYMREI
         R QLL++R+ G D +QLL+FPYNSGNHWTL++I+ +K + FW+DPLKNR+D D+  V+  +  I++  KKK +W+ +KCPKQSGVVECGYYVMR+MR+I
Subjt:  SRAQLLSSRMAGLD-NQLLIFPYNSGNHWTLIIIDHSKSSIFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYVMRYMREI

Query:  ILSGTTSILDIV
        I+S +TSI+ I+
Subjt:  ILSGTTSILDIV

TrEMBL top hitse value%identityAlignment
A0A5A7SQ96 ULP_PROTEASE domain-containing protein1.3e-15748.62Show/hide
Query:  RKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNSWKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQTAGICFRQF
        +++ RGP  M EITRA SEG + V+QYNE GQ IG NA KLKSFIGT+VR HVPI Y+ W  VP EIKEKI++LI+ GFVVD RSKK ++Q AG+CFRQF
Subjt:  RKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNSWKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQTAGICFRQF

Query:  KYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRKGYANLSEELKESLPNEDLTKRCVLWKRARID
        KY LTT ++ PF DD++KLK PP EYS I+Q+HW  FV+ RL+E+F+  S   + KRKKH YNHR SRKGYANL EELK S  + D   R + WK+AR+D
Subjt:  KYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRKGYANLSEELKESLPNEDLTKRCVLWKRARID

Query:  REGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQVK----KEDKVTFDAEEKAKMTARIKELEEE
        R+G+I D++T+ VV+ IDE++ T N  + N     DIL++ LG KD P ++RG+GK++   KYFHT  Q K    ++DK T  ++E  +M  RIKELEEE
Subjt:  REGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQVK----KEDKVTFDAEEKAKMTARIKELEEE

Query:  LRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDIDKSGEAENKNDQAKAKLLKHTGSDQYVEVKVEKEE---VGER
        L K K         D    + KE  G                     +G K+ S  +      AEN ND     L     S++ VE  VE  E   V   
Subjt:  LRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDIDKSGEAENKNDQAKAKLLKHTGSDQYVEVKVEKEE---VGER

Query:  KQQNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVIIDNEKEEGKNSNK
        K     + V  + +K GT C LA  +K++VV   TI D DV  DNVKV++DVVV GDC++PIP +     +SQEVGSH+LWPRDLVI +N K +     K
Subjt:  KQQNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVIIDNEKEEGKNSNK

Query:  CLGVLSMTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQVLYLYKLMEKSKTLSLYKFLDAGSISISTNKESRAQLLSSRMAGLD-NQLLIFPY
         +   + T IQ++PVALR LLR +E++GS+IQIT P +V  IH         ME S+TL+LYKF+D+GSIS  + KE RAQLL++R+ G D + LL+ PY
Subjt:  CLGVLSMTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQVLYLYKLMEKSKTLSLYKFLDAGSISISTNKESRAQLLSSRMAGLD-NQLLIFPY

Query:  NSGNHWTLIIIDHSKSSIFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYVMRYMREIILSGTTSILDIV
        NSGNHWTL++I+ +K + FW+D LKNR+D D+  V+         ++    +  ++CPKQSGVVECGYYVMR+MR+II+S +TSI+ I+
Subjt:  NSGNHWTLIIIDHSKSSIFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYVMRYMREIILSGTTSILDIV

A0A5A7T2U8 ULP_PROTEASE domain-containing protein3.3e-16145.48Show/hide
Query:  PVIDGEI-KRKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNSWKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQ
        P  +G I K K TRGPTGM EITR   +GH+RVV+YNE GQPIG +A KLKSFIGT+VR HVPI+Y SWK VP E+K+KI++LI+GGFVVD RSKK+++Q
Subjt:  PVIDGEI-KRKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNSWKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQ

Query:  TAGICFRQFKYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRKGYANLSEELKESLPNEDLTKRC
         A +CFR FK +LTTK++ P+K DLEKLK PP EYS I++EHW+ FV+ RL ++FE+VS+K R +RK + YNHRMSRKGYANL+EE+K S  N +L  R 
Subjt:  TAGICFRQFKYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRKGYANLSEELKESLPNEDLTKRC

Query:  VLWKRARIDREGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQVKKEDKVTFDAEEKAKMTARIK
        ++WK+AR  ++G+IPD  T+ V ++ID +L++        +   DILSQ +G  DP   +RG+G+++ P                               
Subjt:  VLWKRARIDREGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQVKKEDKVTFDAEEKAKMTARIK

Query:  ELEEELRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDIDKSGEAENKNDQAKAKLLKHTGSDQYVE-VKVEKEE-
        +LE EL K K + ++ +             GE  +   K+K ++              + + ID S +A   ND    +  +    D  +E + +EK++ 
Subjt:  ELEEELRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDIDKSGEAENKNDQAKAKLLKHTGSDQYVE-VKVEKEE-

Query:  VGERKQQNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVI-IDNEKEEG
        VGE+ +   +       +KDGTSC LAIG+K+NVVGAGTI DY ++ DNVKVS+D+V  G+C VPIP +    +LSQEVGS LLWPR LVI +D + +  
Subjt:  VGERKQQNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVI-IDNEKEEG

Query:  KNSNKCLGVLSMTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQ-------------------------VLYLYKLMEKSKTLSLYKFLDAGSI
          ++  L  L++   + +PV LR LL EL+YIGS IQI VP +VFG  R+                         + +LYK+ME++ TL  YKF DAGS+
Subjt:  KNSNKCLGVLSMTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQ-------------------------VLYLYKLMEKSKTLSLYKFLDAGSI

Query:  SISTNKESRAQLLSSRMAGLDN-QLLIFPYNSGNHWTLIIIDHSKSSIFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYV
        S+  +KE RAQ+L++R+ G D+ Q+LIFPYNSGNHW LI I+ S+ + +WMDPL+NR++ D   V+ MA  I    KKK  W+ IKCPKQ G+VECGYYV
Subjt:  SISTNKESRAQLLSSRMAGLDN-QLLIFPYNSGNHWTLIIIDHSKSSIFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYV

Query:  MRYMREIILSGTTSILDIV
        MR+MR+IILS   +I++++
Subjt:  MRYMREIILSGTTSILDIV

A0A5D3BXK2 ULP_PROTEASE domain-containing protein4.4e-15848.77Show/hide
Query:  RKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNSWKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQTAGICFRQF
        +++ RGP  M EITRA SEG + V+QYNE GQ IG NA KLKSFIGT+VR HVPI Y+ W  VP EIKEKI++LI+ GFVVD RSKK ++Q AG+CFRQF
Subjt:  RKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNSWKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQTAGICFRQF

Query:  KYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRKGYANLSEELKESLPNEDLTKRCVLWKRARID
        KY LTT ++ PF DD+EKLK PP EYS I+Q+HW  FV+ RL+E+F+  S   + KRKKH YNHR SRKGYANL EELK S  + D   R + WK+AR+D
Subjt:  KYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRKGYANLSEELKESLPNEDLTKRCVLWKRARID

Query:  REGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQVK----KEDKVTFDAEEKAKMTARIKELEEE
        R+G+I D++T+ VV+ IDE++ T N  + N     DIL++ LG KD P ++RG+GK++   KYFHT  Q K    ++DK T  ++E  +M  RIKELEEE
Subjt:  REGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQVK----KEDKVTFDAEEKAKMTARIKELEEE

Query:  LRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDIDKSGEAENKNDQAKAKLLKHTGSDQYVEVKVEKEE---VGER
        L K K         D    + KE  G                     +G K+ S  +      AEN ND     L     S++ VE  VE  E   V   
Subjt:  LRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDIDKSGEAENKNDQAKAKLLKHTGSDQYVEVKVEKEE---VGER

Query:  KQQNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVIIDNEKEEGKNSNK
        K     + V  + +K GT C LA  +K++VV   TI D DV  DNVKV++DVVV GDC++PIP +     +SQEVGSH+LWPRDLVI +N K +     K
Subjt:  KQQNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVIIDNEKEEGKNSNK

Query:  CLGVLSMTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQVLYLYKLMEKSKTLSLYKFLDAGSISISTNKESRAQLLSSRMAGLD-NQLLIFPY
         +   + T IQ++PVALR LLR +E++GS+IQIT P +V  IH         ME S+TL+LYKF+D+GSIS  + KE RAQLL++R+ G D + LL+ PY
Subjt:  CLGVLSMTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQVLYLYKLMEKSKTLSLYKFLDAGSISISTNKESRAQLLSSRMAGLD-NQLLIFPY

Query:  NSGNHWTLIIIDHSKSSIFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYVMRYMREIILSGTTSILDIV
        NSGNHWTL++I+ +K + FW+D LKNR+D D+  V+         ++    +  ++CPKQSGVVECGYYVMR+MR+II+S +TSI+ I+
Subjt:  NSGNHWTLIIIDHSKSSIFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYVMRYMREIILSGTTSILDIV

A0A5D3CDJ5 ULP_PROTEASE domain-containing protein2.4e-17249.3Show/hide
Query:  RKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNSWKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQTAGICFRQF
        +K+ RGPT M EITRA SEG + V+QYNE GQ IG NA KLKSFIGT+VR HVPI Y+ W  VP EIK+KI++LI+ GFVVD RSKK ++Q AG+CFRQF
Subjt:  RKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNSWKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQTAGICFRQF

Query:  KYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRKGYANLSEELKESLPNEDLTKRCVLWKRARID
        KY LTT Y+ PF DD+EKLK PP EYS I+Q+HW  FV+ RL+E+F+  S   + KRKKH YNHR SRKGYANL EELK S  + D   R ++WK+AR+D
Subjt:  KYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRKGYANLSEELKESLPNEDLTKRCVLWKRARID

Query:  REGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQ----VKKEDKVTFDAEEKAKMTARIKELEEE
        R+G+IPD++T+ VV+ IDE++ T N  + N     DIL++ LG KD P ++RG+GK++   KYFHT  Q     K++DK T  ++E  +M  RIKELEEE
Subjt:  REGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQ----VKKEDKVTFDAEEKAKMTARIKELEEE

Query:  LRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDI-DKSGEAENKNDQAKAKLLKHTGSDQYVEVKVEKEEVGERKQ
        L K K         D    + KE  G  + S EK   E             D++  D+ D S + E++ D            +  VE+  E  +V   K 
Subjt:  LRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDI-DKSGEAENKNDQAKAKLLKHTGSDQYVEVKVEKEEVGERKQ

Query:  QNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVIIDNEKEEGKNSNKCL
            + V  + +K GT C LA  +K++VV  GTI D D   DNVKV++DVVV GDC++PIP +     +SQEVGSH+LWPRDLVI +N K +     K +
Subjt:  QNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVIIDNEKEEGKNSNKCL

Query:  GVLSMTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQ-------------------------VLYLYKLMEKSKTLSLYKFLDAGSISISTNKE
           + TPIQ++PVALR LLR +E++GS+IQIT P +VFG+ R+                         ++YLY  ME ++TL+LYKFLDAGSIS  ++KE
Subjt:  GVLSMTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQ-------------------------VLYLYKLMEKSKTLSLYKFLDAGSISISTNKE

Query:  SRAQLLSSRMAGLD-NQLLIFPYNSGNHWTLIIIDHSKSSIFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYVMRYMREI
         R QLL++R+ G D +QLL+FPYNSGNHWTL++I+ +K + FW+DPLKNR+D D+  V+  +  I++  KKK +W+ +KCPKQSGVVECGYYVMR+MR+I
Subjt:  SRAQLLSSRMAGLD-NQLLIFPYNSGNHWTLIIIDHSKSSIFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYVMRYMREI

Query:  ILSGTTSILDIV
        I+S +TSI+ I+
Subjt:  ILSGTTSILDIV

A0A5D3D5Q6 ULP_PROTEASE domain-containing protein2.4e-17249.3Show/hide
Query:  RKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNSWKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQTAGICFRQF
        +K+ RGPT M EITRA SEG + V+QYNE GQ IG NA KLKSFIGT+VR HVPI Y+ W  VP EIK+KI++LI+ GFVVD RSKK ++Q AG+CFRQF
Subjt:  RKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNSWKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQTAGICFRQF

Query:  KYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRKGYANLSEELKESLPNEDLTKRCVLWKRARID
        KY LTT Y+ PF DD+EKLK PP EYS I+Q+HW  FV+ RL+E+F+  S   + KRKKH YNHR SRKGYANL EELK S  + D   R ++WK+AR+D
Subjt:  KYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRKGYANLSEELKESLPNEDLTKRCVLWKRARID

Query:  REGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQ----VKKEDKVTFDAEEKAKMTARIKELEEE
        R+G+IPD++T+ VV+ IDE++ T N  + N     DIL++ LG KD P ++RG+GK++   KYFHT  Q     K++DK T  ++E  +M  RIKELEEE
Subjt:  REGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQ----VKKEDKVTFDAEEKAKMTARIKELEEE

Query:  LRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDI-DKSGEAENKNDQAKAKLLKHTGSDQYVEVKVEKEEVGERKQ
        L K K         D    + KE  G  + S EK   E             D++  D+ D S + E++ D            +  VE+  E  +V   K 
Subjt:  LRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDI-DKSGEAENKNDQAKAKLLKHTGSDQYVEVKVEKEEVGERKQ

Query:  QNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVIIDNEKEEGKNSNKCL
            + V  + +K GT C LA  +K++VV  GTI D D   DNVKV++DVVV GDC++PIP +     +SQEVGSH+LWPRDLVI +N K +     K +
Subjt:  QNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVIIDNEKEEGKNSNKCL

Query:  GVLSMTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQ-------------------------VLYLYKLMEKSKTLSLYKFLDAGSISISTNKE
           + TPIQ++PVALR LLR +E++GS+IQIT P +VFG+ R+                         ++YLY  ME ++TL+LYKFLDAGSIS  ++KE
Subjt:  GVLSMTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQ-------------------------VLYLYKLMEKSKTLSLYKFLDAGSISISTNKE

Query:  SRAQLLSSRMAGLD-NQLLIFPYNSGNHWTLIIIDHSKSSIFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYVMRYMREI
         R QLL++R+ G D +QLL+FPYNSGNHWTL++I+ +K + FW+DPLKNR+D D+  V+  +  I++  KKK +W+ +KCPKQSGVVECGYYVMR+MR+I
Subjt:  SRAQLLSSRMAGLD-NQLLIFPYNSGNHWTLIIIDHSKSSIFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYVMRYMREI

Query:  ILSGTTSILDIV
        I+S +TSI+ I+
Subjt:  ILSGTTSILDIV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGATACGATTGATGATACACAAGACGAGCAGTTGGATCGTTCGATGGATGATACATTCGACGATGCACAAGCCAATGACTTAGACGATATAGTTGACAATGAACA
TGATAGTGTTGCAGATTCACCGGTTATAGATGGTGAAATTAAGAGAAAGAAAACCAGAGGACCTACTGGAATGCATGAGATCACTCGTGCGTGCAGTGAAGGTCATAGAA
GAGTGGTGCAATACAATGAGAATGGTCAGCCAATTGGGTCTAATGCAGTAAAATTGAAAAGTTTCATAGGAACGTCCGTCCGATCCCATGTTCCAATTACATACAATAGT
TGGAAAGCAGTGCCATTGGAGATAAAAGAAAAGATTTGGGACCTCATTCAGGGTGGATTTGTTGTTGACCATAGATCGAAGAAGAATCTCATGCAAACTGCGGGCATTTG
TTTCCGTCAATTCAAATATGCTTTGACTACGAAGTATATAAGGCCATTTAAGGATGATCTAGAGAAATTAAAATGTCCCCCAATTGAGTACTCCCATATTGAACAAGAAC
ATTGGGATGCCTTTGTCAGTTTTCGGCTTAGGGAAGAGTTCGAAGTGGTCAGTAGTAAGAACCGAGCAAAGAGGAAAAAACACATGTATAATCATCGAATGTCGCGGAAG
GGGTATGCAAATCTTTCGGAAGAGCTGAAGGAGTCGTTACCAAATGAAGATCTAACTAAGCGATGCGTTTTATGGAAAAGGGCTCGAATAGATCGAGAGGGAAAAATTCC
AGACAAGCAAACGCAAGGAGTGGTGGATCGTATAGATGAAATGTTGATCACTCATAATGTCGAAAGTGACAACATATCTACGGGTGGAGATATCCTATCACAAGGGTTGG
GGAGCAAAGATCCTCCAAGAGTAGTTAGAGGATTGGGTAAGTTTATCATGCCCAGTAAATATTTTCATACACCAAAACAAGTGAAGAAGGAAGACAAAGTTACATTTGAC
GCGGAAGAGAAGGCGAAGATGACCGCACGAATAAAAGAGTTGGAAGAAGAACTTAGAAAACAAAAAAATCTAACCAAGATGGAATCAAACAAGGATGGTTCTTCAGAAGA
CATTAAAGAAGGAGACGGTGAACCTGTTTCCAGCAATGAAAAGGTGAAGAAAGAATTATTGGACCAGAGGAAGACTTCACGGATAGGAAAAAAGGATGATTCTCCAAGAG
ATATTGATAAAAGTGGTGAAGCAGAAAATAAAAATGACCAGGCGAAGGCAAAATTGTTGAAGCACACGGGAAGTGATCAATATGTTGAAGTGAAGGTTGAAAAAGAAGAG
GTGGGTGAACGTAAACAACAGAACTCAAGCCAATCTGTCGCCCCAAAGCCCATTAAGGATGGAACTTCTTGTTTGTTGGCGATCGGATCTAAAAATAATGTAGTCGGAGC
GGGGACTATATTTGACTATGACGTTAATAGTGATAATGTGAAGGTTTCACTTGACGTAGTTGTTCATGGTGATTGTTCTGTCCCAATACCAAGAAAGGGAGAAGAAAATT
TGTTGTCTCAAGAAGTAGGCTCACACTTGCTATGGCCACGGGATCTTGTGATAATTGATAATGAAAAGGAGGAGGGAAAAAATTCAAATAAATGTTTAGGAGTTCTATCA
ATGACTCCCATTCAAAGTTCACCGGTTGCTTTACGCTGTTTACTCCGTGAACTTGAATACATTGGATCATCAATTCAAATCACTGTACCGATGGAAGTATTTGGAATCCA
CCGCCAAGTGCTGTACTTATACAAACTAATGGAGAAATCAAAGACTTTGAGTTTGTATAAGTTTTTAGATGCTGGTTCCATCTCAATCTCAACCAACAAAGAATCTCGAG
CTCAATTGTTGTCCTCGAGAATGGCAGGATTGGATAATCAACTTCTCATCTTCCCTTACAATTCTGGGAATCATTGGACTTTGATCATTATTGATCATTCGAAAAGTAGC
ATATTTTGGATGGACCCTTTAAAAAATCGACTCGACCAAGATATCATCAGTGTGATAGGCATGGCTTGCAAGATTGTTGACCCTAAGAAGAAAAAGCTTAGTTGGAAATC
AATTAAGTGCCCTAAGCAATCTGGTGTGGTGGAATGCGGATATTATGTCATGCGATACATGCGTGAGATAATATTATCAGGAACCACATCAATTTTAGATATAGTTTTGA
AACTCTCATATTATTCACCTAAGACTACTTCACATTCACGCCACCTTTCTCCATCGTACCATCGTCTGCTAGATCTCGCCTTACTGTCATTCGTCTGTCATAGTGGTAGT
AGTAGTTCGTCCGTCATTCGTGGTAGCAGCCGTTCGTCCACTGTTCGTGGTAGCCGTTCGTCCGCTAGTTGTGTGTCTGCCATTCGTAGGGTTTCGCATCTGGCCAATTT
AACACTAGAAGCATCACGTCTGACTCACACACCGCCCACCTATTATAGGCGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACGATACGATTGATGATACACAAGACGAGCAGTTGGATCGTTCGATGGATGATACATTCGACGATGCACAAGCCAATGACTTAGACGATATAGTTGACAATGAACA
TGATAGTGTTGCAGATTCACCGGTTATAGATGGTGAAATTAAGAGAAAGAAAACCAGAGGACCTACTGGAATGCATGAGATCACTCGTGCGTGCAGTGAAGGTCATAGAA
GAGTGGTGCAATACAATGAGAATGGTCAGCCAATTGGGTCTAATGCAGTAAAATTGAAAAGTTTCATAGGAACGTCCGTCCGATCCCATGTTCCAATTACATACAATAGT
TGGAAAGCAGTGCCATTGGAGATAAAAGAAAAGATTTGGGACCTCATTCAGGGTGGATTTGTTGTTGACCATAGATCGAAGAAGAATCTCATGCAAACTGCGGGCATTTG
TTTCCGTCAATTCAAATATGCTTTGACTACGAAGTATATAAGGCCATTTAAGGATGATCTAGAGAAATTAAAATGTCCCCCAATTGAGTACTCCCATATTGAACAAGAAC
ATTGGGATGCCTTTGTCAGTTTTCGGCTTAGGGAAGAGTTCGAAGTGGTCAGTAGTAAGAACCGAGCAAAGAGGAAAAAACACATGTATAATCATCGAATGTCGCGGAAG
GGGTATGCAAATCTTTCGGAAGAGCTGAAGGAGTCGTTACCAAATGAAGATCTAACTAAGCGATGCGTTTTATGGAAAAGGGCTCGAATAGATCGAGAGGGAAAAATTCC
AGACAAGCAAACGCAAGGAGTGGTGGATCGTATAGATGAAATGTTGATCACTCATAATGTCGAAAGTGACAACATATCTACGGGTGGAGATATCCTATCACAAGGGTTGG
GGAGCAAAGATCCTCCAAGAGTAGTTAGAGGATTGGGTAAGTTTATCATGCCCAGTAAATATTTTCATACACCAAAACAAGTGAAGAAGGAAGACAAAGTTACATTTGAC
GCGGAAGAGAAGGCGAAGATGACCGCACGAATAAAAGAGTTGGAAGAAGAACTTAGAAAACAAAAAAATCTAACCAAGATGGAATCAAACAAGGATGGTTCTTCAGAAGA
CATTAAAGAAGGAGACGGTGAACCTGTTTCCAGCAATGAAAAGGTGAAGAAAGAATTATTGGACCAGAGGAAGACTTCACGGATAGGAAAAAAGGATGATTCTCCAAGAG
ATATTGATAAAAGTGGTGAAGCAGAAAATAAAAATGACCAGGCGAAGGCAAAATTGTTGAAGCACACGGGAAGTGATCAATATGTTGAAGTGAAGGTTGAAAAAGAAGAG
GTGGGTGAACGTAAACAACAGAACTCAAGCCAATCTGTCGCCCCAAAGCCCATTAAGGATGGAACTTCTTGTTTGTTGGCGATCGGATCTAAAAATAATGTAGTCGGAGC
GGGGACTATATTTGACTATGACGTTAATAGTGATAATGTGAAGGTTTCACTTGACGTAGTTGTTCATGGTGATTGTTCTGTCCCAATACCAAGAAAGGGAGAAGAAAATT
TGTTGTCTCAAGAAGTAGGCTCACACTTGCTATGGCCACGGGATCTTGTGATAATTGATAATGAAAAGGAGGAGGGAAAAAATTCAAATAAATGTTTAGGAGTTCTATCA
ATGACTCCCATTCAAAGTTCACCGGTTGCTTTACGCTGTTTACTCCGTGAACTTGAATACATTGGATCATCAATTCAAATCACTGTACCGATGGAAGTATTTGGAATCCA
CCGCCAAGTGCTGTACTTATACAAACTAATGGAGAAATCAAAGACTTTGAGTTTGTATAAGTTTTTAGATGCTGGTTCCATCTCAATCTCAACCAACAAAGAATCTCGAG
CTCAATTGTTGTCCTCGAGAATGGCAGGATTGGATAATCAACTTCTCATCTTCCCTTACAATTCTGGGAATCATTGGACTTTGATCATTATTGATCATTCGAAAAGTAGC
ATATTTTGGATGGACCCTTTAAAAAATCGACTCGACCAAGATATCATCAGTGTGATAGGCATGGCTTGCAAGATTGTTGACCCTAAGAAGAAAAAGCTTAGTTGGAAATC
AATTAAGTGCCCTAAGCAATCTGGTGTGGTGGAATGCGGATATTATGTCATGCGATACATGCGTGAGATAATATTATCAGGAACCACATCAATTTTAGATATAGTTTTGA
AACTCTCATATTATTCACCTAAGACTACTTCACATTCACGCCACCTTTCTCCATCGTACCATCGTCTGCTAGATCTCGCCTTACTGTCATTCGTCTGTCATAGTGGTAGT
AGTAGTTCGTCCGTCATTCGTGGTAGCAGCCGTTCGTCCACTGTTCGTGGTAGCCGTTCGTCCGCTAGTTGTGTGTCTGCCATTCGTAGGGTTTCGCATCTGGCCAATTT
AACACTAGAAGCATCACGTCTGACTCACACACCGCCCACCTATTATAGGCGCTGA
Protein sequenceShow/hide protein sequence
MDDTIDDTQDEQLDRSMDDTFDDAQANDLDDIVDNEHDSVADSPVIDGEIKRKKTRGPTGMHEITRACSEGHRRVVQYNENGQPIGSNAVKLKSFIGTSVRSHVPITYNS
WKAVPLEIKEKIWDLIQGGFVVDHRSKKNLMQTAGICFRQFKYALTTKYIRPFKDDLEKLKCPPIEYSHIEQEHWDAFVSFRLREEFEVVSSKNRAKRKKHMYNHRMSRK
GYANLSEELKESLPNEDLTKRCVLWKRARIDREGKIPDKQTQGVVDRIDEMLITHNVESDNISTGGDILSQGLGSKDPPRVVRGLGKFIMPSKYFHTPKQVKKEDKVTFD
AEEKAKMTARIKELEEELRKQKNLTKMESNKDGSSEDIKEGDGEPVSSNEKVKKELLDQRKTSRIGKKDDSPRDIDKSGEAENKNDQAKAKLLKHTGSDQYVEVKVEKEE
VGERKQQNSSQSVAPKPIKDGTSCLLAIGSKNNVVGAGTIFDYDVNSDNVKVSLDVVVHGDCSVPIPRKGEENLLSQEVGSHLLWPRDLVIIDNEKEEGKNSNKCLGVLS
MTPIQSSPVALRCLLRELEYIGSSIQITVPMEVFGIHRQVLYLYKLMEKSKTLSLYKFLDAGSISISTNKESRAQLLSSRMAGLDNQLLIFPYNSGNHWTLIIIDHSKSS
IFWMDPLKNRLDQDIISVIGMACKIVDPKKKKLSWKSIKCPKQSGVVECGYYVMRYMREIILSGTTSILDIVLKLSYYSPKTTSHSRHLSPSYHRLLDLALLSFVCHSGS
SSSSVIRGSSRSSTVRGSRSSASCVSAIRRVSHLANLTLEASRLTHTPPTYYRR