; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g16580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g16580
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionULP_PROTEASE domain-containing protein
Genome locationchr5:12451036..12464494
RNA-Seq ExpressionMoc05g16580
SyntenyMoc05g16580
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053269.1 transposase [Cucumis melo var. makuwa]8.6e-8829.84Show/hide
Query:  LAPVGLGSLCRREESLIDRPLSTGCYVRPTVQHLKQAHLYVLENTDVVQPYISHHMESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNIS
        L  +GLGSL  REE   DRPLS G YVRP +Q LKQAHL++L+NT+ V PYI  HM  L ++ P RA NE+WL  EHNRSFG+WIR++VM EL  GK +S
Subjt:  LAPVGLGSLCRREESLIDRPLSTGCYVRPTVQHLKQAHLYVLENTDVVQPYISHHMESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNIS

Query:  KTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSANDKNPVVADMTFFG-------------------------------
         T+RW+AH PN  V+TYEGY VNGC Y+TKSRD  +T+QNSGI LVATTMQVSSA DKNP++ DM+F+G                               
Subjt:  KTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSANDKNPVVADMTFFG-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------WRLTSTYINTCHDDPTKLKVIPKDESHIPQQH
                                                                            +RLT+TY+    DD  KLK    + S I QQH
Subjt:  --------------------------------------------------------------------WRLTSTYINTCHDDPTKLKVIPKDESHIPQQH

Query:  WDRFFHSRLSMEFQALSVRNRELQGMHVYNHRLSQKGYANLSEEL---------KNSQCEEHPRDRGTLWKKARMDKDGNIREACTRVVVERIIR--IEI
        W  F  SRL  +F+  S   +E +    YNH  S+KGYANL EEL          N+  EE    R    K    D+ G +R     V  ++      + 
Subjt:  WDRFFHSRLSMEFQALSVRNRELQGMHVYNHRLSQKGYANLSEEL---------KNSQCEEHPRDRGTLWKKARMDKDGNIREACTRVVVERIIR--IEI

Query:  KVTQGENDSNNSPGEDALSLALGTRDCPGWLTCARRIKELEEELAKYKQNASIPSDSKLQETMKREKDVVMKEGETGEEDSKEGEKTENIKEIETGEKKV
        K  + E+D   S   D +                +RIKELEEEL K K+N     +SK ++ M             G ++    E  EN+ ++E     +
Subjt:  KVTQGENDSNNSPGEDALSLALGTRDCPGWLTCARRIKELEEELAKYKQNASIPSDSKLQETMKREKDVVMKEGETGEEDSKEGEKTENIKEIETGEKKV

Query:  KIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNASFKEG---DSCQLSFESSDNVVALEMIFECNLEGGNVK-------------------EVATR
        + ++              DV +V   +E+   N+ K+    EG   +  +L+FE  D+VVA   I + ++EG NVK                   E+   
Subjt:  KIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNASFKEG---DSCQLSFESSDNVVALEMIFECNLEGGNVK-------------------EVATR

Query:  EEVLQLYPFEHDGTT---------PIALRCLLHVVQ-----------------------------TFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKF
         + +    F  D +T         P+ALR +L +V+                              F  MRP+   C+DAYI+YLY ++      ++YKF
Subjt:  EEVLQLYPFEHDGTT---------PIALRCLLHVVQ-----------------------------TFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKF

Query:  LNDGSISTSNLLKEARSEKLTKRLMQMESGQLLLAPYNSGSHWILIVIDYFKTMVYSINSLINRLDNDIMDVVNRAFNKCDKVKVQWNCVKCSRQPGSTE
        ++ GSIS  +  KE R++ LT RL+  +  QLLL PYN  +HW L+VI+  K   + I+ L NR+D D+ +VV R              VKC +Q G  E
Subjt:  LNDGSISTSNLLKEARSEKLTKRLMQMESGQLLLAPYNSGSHWILIVIDYFKTMVYSINSLINRLDNDIMDVVNRAFNKCDKVKVQWNCVKCSRQPGSTE

Query:  YGYYIMQFMRAIVNQQSTSIPDVTK
         GYY+M+FMR I+   STSI  + K
Subjt:  YGYYIMQFMRAIVNQQSTSIPDVTK

KAA0056083.1 hypothetical protein E6C27_scaffold323G00380 [Cucumis melo var. makuwa]7.3e-8731.42Show/hide
Query:  MESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNISKTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSA
        M  L ++ P RA NE+WL  EHNRSFG+WIR++VM EL  GK +S T+RW+AH PN  V+TYEGY VNGC Y+TKSRD  Q+IQNSGI LVATT QVSSA
Subjt:  MESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNISKTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSA

Query:  NDKNPVVADMTFFG--------------------------------------------------------------------------------------
         DKNPV+ DM+F+G                                                                                      
Subjt:  NDKNPVVADMTFFG--------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------WRLTSTYINTCHDDPTKLKVIPKDESHI
                                                                                +RLT+TY+    DD  KLK  P + S I
Subjt:  ------------------------------------------------------------------------WRLTSTYINTCHDDPTKLKVIPKDESHI

Query:  PQQHWDRFFHSRLSMEFQALSVRN-RELQGMHVYNHRLSQKGYANLSEELKNSQCEEHPRDRGTLWKKARMDKDGNIREACTRVVVERIIRIEIKVTQGE
         QQHW  F  SRL  +F+     N +E +  H YNHR S+KGY NL EELK S  ++   DR  +WK+ARMD+ G I +  T+ VV  I   E+  TQ  
Subjt:  PQQHWDRFFHSRLSMEFQALSVRN-RELQGMHVYNHRLSQKGYANLSEELKNSQCEEHPRDRGTLWKKARMDKDGNIREACTRVVVERIIRIEIKVTQGE

Query:  NDSNNSPGEDALSLALGTRDCPGWL-----------------------------------TCARRIKELEEELAKYKQNASIPSDSKLQETMKREKDVVM
        N +N    +D L+ ALG +D PG L                                     A+RIKELEEEL K K+N     +SK +  M  ++   M
Subjt:  NDSNNSPGEDALSLALGTRDCPGWL-----------------------------------TCARRIKELEEELAKYKQNASIPSDSKLQETMKREKDVVM

Query:  KEGETGEEDSKEGEKTENIKEIETGEKKVKIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNASFKEGDSCQLSFESSDNVVALEMIFECNLEGGN
        +    G E+  + +   N  E E   + V    E+I    AK  K+   + +E                K G  C+L+FE+ D++VA   I + + +G N
Subjt:  KEGETGEEDSKEGEKTENIKEIETGEKKVKIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNASFKEGDSCQLSFESSDNVVALEMIFECNLEGGN

Query:  VKE-------------------------------------VATREEVLQLYPFEHD----GTTPI-----ALRCLLHVVQ--------------------
        VK                                      V T    +    F  D      TPI     ALR LL +V+                    
Subjt:  VKE-------------------------------------VATREEVLQLYPFEHD----GTTPI-----ALRCLLHVVQ--------------------

Query:  ---------TFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKFLNDGSISTSNLLKEARSEKLTKRLMQMESGQLLLAPYNSGSHWILIVIDYFKTMVY
                  F  MRP+   C+DAYI+YLY ++      ++YKF++ GSIS  +  KE R++ LT RL+  +  QLLL PYNSG+HW L+VI+  K   +
Subjt:  ---------TFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKFLNDGSISTSNLLKEARSEKLTKRLMQMESGQLLLAPYNSGSHWILIVIDYFKTMVY

Query:  SINSLINRLDNDIMDVVNRAFNKCDKVKVQWNCVKCSRQPGSTEYGYYIMQFMRAIVNQQSTSIPDVTK
         I+ L NR+D D+ +VV R+FN  +K K  W  VKC +Q G  EYGYY+M FMR ++   STSI  + K
Subjt:  SINSLINRLDNDIMDVVNRAFNKCDKVKVQWNCVKCSRQPGSTEYGYYIMQFMRAIVNQQSTSIPDVTK

KAA0060105.1 hypothetical protein E6C27_scaffold39G00240 [Cucumis melo var. makuwa]8.6e-9631.93Show/hide
Query:  LAPVGLGSLCRREESLIDRPLSTGCYVRPTVQHLKQAHLYVLENTDVVQPYISHHMESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNIS
        L  +GLGSL  REE   DRPLS G Y RP +Q LKQA+L++L+NT+ V PYI  HM  L ++ P RA NE+WL  EHNRSFG+WIR++VM EL  GK +S
Subjt:  LAPVGLGSLCRREESLIDRPLSTGCYVRPTVQHLKQAHLYVLENTDVVQPYISHHMESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNIS

Query:  KTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSANDKNPVVADMTFFG-----WRLTSTYINT----CH----------
         T+RW+AH PN  V+TYEGY VNGC Y+TKSRD  + +QNSGI LVATTMQVSSA DKNPV+ DM+F+G     W ++    NT    C           
Subjt:  KTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSANDKNPVVADMTFFG-----WRLTSTYINT----CH----------

Query:  DD--------------------------------------------------------------------------------------------------
        DD                                                                                                  
Subjt:  DD--------------------------------------------------------------------------------------------------

Query:  -------PTKLKVIPKDES---------------------------------HIP--QQHW-------DRFFHSRLSMEFQALSVRNRELQGMHVYNHRL
               PT++K I +  S                                 H+P     W       D      +       S   +E +  H YNHR 
Subjt:  -------PTKLKVIPKDES---------------------------------HIP--QQHW-------DRFFHSRLSMEFQALSVRNRELQGMHVYNHRL

Query:  SQKGYANLSEELKNSQCEEHPRDRGTLWKKARMDKDGNIREACTRVVVERIIRIEIKVTQGENDSNNSPGEDALSLALGTRDCPGWL------TCARRIK
        S+KGYANL EELK S  ++   DR  +WK+ARMD+ G I +      ++ ++ +  ++   +N +N    ED L+ ALG +D PG L         ++IK
Subjt:  SQKGYANLSEELKNSQCEEHPRDRGTLWKKARMDKDGNIREACTRVVVERIIRIEIKVTQGENDSNNSPGEDALSLALGTRDCPGWL------TCARRIK

Query:  ELEEELAKYKQNASIPSDSKLQETMKREKDVVMKEGETGEEDSKEGEKTENIKEIETGEKKVKIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNA
        ELEEEL K K+N     +SK +  M  ++   M+    G E+  + E   N  E E   + V    E+I    AK  K+ + + +E              
Subjt:  ELEEELAKYKQNASIPSDSKLQETMKREKDVVMKEGETGEEDSKEGEKTENIKEIETGEKKVKIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNA

Query:  SFKEGDSCQLSFESSDNVVALEMIFECNLEGGNVKE-------------------------------------VATREEVLQLYPFEHDGTT--------
          K G  C+L+FE+ D+VVA   I + + EG NVK                                      V T    +    F  D +T        
Subjt:  SFKEGDSCQLSFESSDNVVALEMIFECNLEGGNVKE-------------------------------------VATREEVLQLYPFEHDGTT--------

Query:  -PIALRCLLHVVQ-----------------------------TFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKFLNDGSISTSNLLKEARSEKLTKR
         P+ALR LL +V+                              F  MRP+   C+DAYI+YLY ++      ++YKF++ GSIS  +  KE R++ LT R
Subjt:  -PIALRCLLHVVQ-----------------------------TFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKFLNDGSISTSNLLKEARSEKLTKR

Query:  LMQMESGQLLLAPYNSGSHWILIVIDYFKTMVYSINSLINRLDNDIMDVVNRAFNKCDKVKVQWNCVKCSRQPGSTEYGYYIMQFMRAIVNQQSTSIPDV
        L+     QLLL PYNSG+HW L+VI+  K   + I+ L NR+D D+ +VV R              VKC +Q G  E GYY+M+FMR I+   STSI  +
Subjt:  LMQMESGQLLLAPYNSGSHWILIVIDYFKTMVYSINSLINRLDNDIMDVVNRAFNKCDKVKVQWNCVKCSRQPGSTEYGYYIMQFMRAIVNQQSTSIPDV

Query:  TK
        TK
Subjt:  TK

KAA0064179.1 hypothetical protein E6C27_scaffold548G001100 [Cucumis melo var. makuwa]7.0e-9836.5Show/hide
Query:  LAPVGLGSLCRREESLIDRPLSTGCYVRPTVQHLKQAHLYVLENTDVVQPYISHHMESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNIS
        L P+GLGSL  R++   DR LS   YVRP +Q LKQAHL++L+NT+ V PYI  HM  L ++ P RA NE+WL  EHNRSFG+WIR++VM EL  GK +S
Subjt:  LAPVGLGSLCRREESLIDRPLSTGCYVRPTVQHLKQAHLYVLENTDVVQPYISHHMESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNIS

Query:  KTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSANDKNPVVADMTFFG-------------------------------
         T+RW+AH PN  V+TY GY VNGC Y+TKSRD  +T+QNSGI LVATTMQVSSANDKN V+ DM+F+G                               
Subjt:  KTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSANDKNPVVADMTFFG-------------------------------

Query:  -WRLTSTYINTCHDDPTKLKVIPKDESHIPQQHWDRFFHSRLSMEFQALSVRNRELQGMHVYNHRLSQKGYANLSEELKNSQCEEHPRDRGTLWKKARMD
         +RLT+TY+    DD  KLK    + S I QQHW  F  SRL  +F+  S   +E +  H YNHR  +KGYANL EELK S  ++   DR  +WK+A MD
Subjt:  -WRLTSTYINTCHDDPTKLKVIPKDESHIPQQHWDRFFHSRLSMEFQALSVRNRELQGMHVYNHRLSQKGYANLSEELKNSQCEEHPRDRGTLWKKARMD

Query:  KDGNIREACTRVVVERIIRIEIKVTQGENDSNNSPGEDALSLALGTRDCPGWLTCARRIKELEEELAKYKQNASIPSDSKLQETMKREKDVVMKEGETGE
        + G I +  T+ VV  I   E+  TQ  N +N    ED L+ ALG RD PG L    +IKELEEEL K K+      D K +  +   +DVV    +  +
Subjt:  KDGNIREACTRVVVERIIRIEIKVTQGENDSNNSPGEDALSLALGTRDCPGWLTCARRIKELEEELAKYKQNASIPSDSKLQETMKREKDVVMKEGETGE

Query:  EDSKEGEKTENIKEIETGEKKVKIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNASF---KEGDSCQLSFESSDNVV-ALEMIFECNLE---GGN
        +   EG   + +K     E   K+  E      A G   D  +E           V+KN       E    ++S E   +++   +++   N++   G  
Subjt:  EDSKEGEKTENIKEIETGEKKVKIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNASF---KEGDSCQLSFESSDNVV-ALEMIFECNLE---GGN

Query:  VKEVAT-----REEVLQLYPF-----EHDG-----TTP-----IALRC--LLHVVQTFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKFLNDGSISTS
         K+++T      +  L   PF     EH G     TTP     +  +C  ++  ++ F  MRP+   C+DAYI+YLY  +      ++YKF+        
Subjt:  VKEVAT-----REEVLQLYPF-----EHDG-----TTP-----IALRC--LLHVVQTFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKFLNDGSISTS

Query:  NLLKEARSEKLTKRLMQMESGQLLLAPYNSGSHWILIVIDYFKTMVYSINSLINRLDNDIMDVVNRAFNKCDKVKVQWNCVKCSRQPGSTEYGYYIMQFM
                                    ++G+HW L+VI+  K   + I+ L NR+D ++ +VV R+ +        +  ++C +Q G  E GYY+MQFM
Subjt:  NLLKEARSEKLTKRLMQMESGQLLLAPYNSGSHWILIVIDYFKTMVYSINSLINRLDNDIMDVVNRAFNKCDKVKVQWNCVKCSRQPGSTEYGYYIMQFM

Query:  RAIVNQQSTSIPDVTKTLWFISTNAP
          I+   ST + + T  ++ I    P
Subjt:  RAIVNQQSTSIPDVTKTLWFISTNAP

TYK02851.1 hypothetical protein E5676_scaffold218G00510 [Cucumis melo var. makuwa]4.6e-9735.95Show/hide
Query:  LAPVGLGSLCRREESLIDRPLSTGCYVRPTVQHLKQAHLYVLENTDVVQPYISHHMESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNIS
        L P+GLGSL  R++   DR LS   YVRP +Q LKQAHL++L+NT+ V PYI  HM  L ++ P RA NE+WL  EHNRSFG+WIR++VM EL  GK +S
Subjt:  LAPVGLGSLCRREESLIDRPLSTGCYVRPTVQHLKQAHLYVLENTDVVQPYISHHMESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNIS

Query:  KTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSANDKNPVVADMTFFG-------------------------------
         T+RW+AH PN  V+TY GY VNGC Y+TKSRD  +T+QNSGI LVATTMQVSSANDKN V+ DM+F+G                               
Subjt:  KTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSANDKNPVVADMTFFG-------------------------------

Query:  -WRLTSTYINTCHDDPTKLKVIPKDESHIPQQHWDRFFHSRLSMEFQALSVRNRELQGMHVYNHRLSQKGYANLSEELKNSQCEEHPRDRGTLWKKARMD
         +RLT+TY+    DD  KLK    + S I QQHW  F  SRL  +F+  S   +E +  H YNHR  +KGYANL EELK S  ++   DR  +WK+A MD
Subjt:  -WRLTSTYINTCHDDPTKLKVIPKDESHIPQQHWDRFFHSRLSMEFQALSVRNRELQGMHVYNHRLSQKGYANLSEELKNSQCEEHPRDRGTLWKKARMD

Query:  KDGNIREACTRVVVERIIRIEIKVTQGENDSNNSPGEDALSLALGTRDCPGWLTCARRIKELEEELAKYKQNASIPSDSKLQETMKREKDVVMKEGETGE
        + G I +  T+ V E +          +N +N    ED L+ ALG RD PG L    +IKELEEEL K K+      D K +  +   +DVV    +  +
Subjt:  KDGNIREACTRVVVERIIRIEIKVTQGENDSNNSPGEDALSLALGTRDCPGWLTCARRIKELEEELAKYKQNASIPSDSKLQETMKREKDVVMKEGETGE

Query:  EDSKEGEKTENIKEIETGEKKVKIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNASF---KEGDSCQLSFESSDNVV-ALEMIFECNLE---GGN
        +   EG   + +K     E   K+  E      A G   D  +E           V+KN       E    ++S E   +++   +++   N++   G  
Subjt:  EDSKEGEKTENIKEIETGEKKVKIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNASF---KEGDSCQLSFESSDNVV-ALEMIFECNLE---GGN

Query:  VKEVAT-----REEVLQLYPF-----EHDG-----TTP-----IALRC--LLHVVQTFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKFLNDGSISTS
         K+++T      +  L   PF     EH G     TTP     +  +C  ++  ++ F  MRP+   C+DAYI+YLY  +      ++YKF+        
Subjt:  VKEVAT-----REEVLQLYPF-----EHDG-----TTP-----IALRC--LLHVVQTFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKFLNDGSISTS

Query:  NLLKEARSEKLTKRLMQMESGQLLLAPYNSGSHWILIVIDYFKTMVYSINSLINRLDNDIMDVVNRAFNKCDKVKVQWNCVKCSRQPGSTEYGYYIMQFM
                                    ++G+HW L+VI+  K   + I+ L NR+D ++ +VV R+ +        +  ++C +Q G  E GYY+MQFM
Subjt:  NLLKEARSEKLTKRLMQMESGQLLLAPYNSGSHWILIVIDYFKTMVYSINSLINRLDNDIMDVVNRAFNKCDKVKVQWNCVKCSRQPGSTEYGYYIMQFM

Query:  RAIVNQQSTSIPDVTKTLWFISTNAP
          I+   ST + + T  ++ I    P
Subjt:  RAIVNQQSTSIPDVTKTLWFISTNAP

TrEMBL top hitse value%identityAlignment
A0A5A7UIG5 Transposase4.2e-8829.84Show/hide
Query:  LAPVGLGSLCRREESLIDRPLSTGCYVRPTVQHLKQAHLYVLENTDVVQPYISHHMESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNIS
        L  +GLGSL  REE   DRPLS G YVRP +Q LKQAHL++L+NT+ V PYI  HM  L ++ P RA NE+WL  EHNRSFG+WIR++VM EL  GK +S
Subjt:  LAPVGLGSLCRREESLIDRPLSTGCYVRPTVQHLKQAHLYVLENTDVVQPYISHHMESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNIS

Query:  KTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSANDKNPVVADMTFFG-------------------------------
         T+RW+AH PN  V+TYEGY VNGC Y+TKSRD  +T+QNSGI LVATTMQVSSA DKNP++ DM+F+G                               
Subjt:  KTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSANDKNPVVADMTFFG-------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------WRLTSTYINTCHDDPTKLKVIPKDESHIPQQH
                                                                            +RLT+TY+    DD  KLK    + S I QQH
Subjt:  --------------------------------------------------------------------WRLTSTYINTCHDDPTKLKVIPKDESHIPQQH

Query:  WDRFFHSRLSMEFQALSVRNRELQGMHVYNHRLSQKGYANLSEEL---------KNSQCEEHPRDRGTLWKKARMDKDGNIREACTRVVVERIIR--IEI
        W  F  SRL  +F+  S   +E +    YNH  S+KGYANL EEL          N+  EE    R    K    D+ G +R     V  ++      + 
Subjt:  WDRFFHSRLSMEFQALSVRNRELQGMHVYNHRLSQKGYANLSEEL---------KNSQCEEHPRDRGTLWKKARMDKDGNIREACTRVVVERIIR--IEI

Query:  KVTQGENDSNNSPGEDALSLALGTRDCPGWLTCARRIKELEEELAKYKQNASIPSDSKLQETMKREKDVVMKEGETGEEDSKEGEKTENIKEIETGEKKV
        K  + E+D   S   D +                +RIKELEEEL K K+N     +SK ++ M             G ++    E  EN+ ++E     +
Subjt:  KVTQGENDSNNSPGEDALSLALGTRDCPGWLTCARRIKELEEELAKYKQNASIPSDSKLQETMKREKDVVMKEGETGEEDSKEGEKTENIKEIETGEKKV

Query:  KIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNASFKEG---DSCQLSFESSDNVVALEMIFECNLEGGNVK-------------------EVATR
        + ++              DV +V   +E+   N+ K+    EG   +  +L+FE  D+VVA   I + ++EG NVK                   E+   
Subjt:  KIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNASFKEG---DSCQLSFESSDNVVALEMIFECNLEGGNVK-------------------EVATR

Query:  EEVLQLYPFEHDGTT---------PIALRCLLHVVQ-----------------------------TFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKF
         + +    F  D +T         P+ALR +L +V+                              F  MRP+   C+DAYI+YLY ++      ++YKF
Subjt:  EEVLQLYPFEHDGTT---------PIALRCLLHVVQ-----------------------------TFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKF

Query:  LNDGSISTSNLLKEARSEKLTKRLMQMESGQLLLAPYNSGSHWILIVIDYFKTMVYSINSLINRLDNDIMDVVNRAFNKCDKVKVQWNCVKCSRQPGSTE
        ++ GSIS  +  KE R++ LT RL+  +  QLLL PYN  +HW L+VI+  K   + I+ L NR+D D+ +VV R              VKC +Q G  E
Subjt:  LNDGSISTSNLLKEARSEKLTKRLMQMESGQLLLAPYNSGSHWILIVIDYFKTMVYSINSLINRLDNDIMDVVNRAFNKCDKVKVQWNCVKCSRQPGSTE

Query:  YGYYIMQFMRAIVNQQSTSIPDVTK
         GYY+M+FMR I+   STSI  + K
Subjt:  YGYYIMQFMRAIVNQQSTSIPDVTK

A0A5A7UJF8 ULP_PROTEASE domain-containing protein3.5e-8731.42Show/hide
Query:  MESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNISKTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSA
        M  L ++ P RA NE+WL  EHNRSFG+WIR++VM EL  GK +S T+RW+AH PN  V+TYEGY VNGC Y+TKSRD  Q+IQNSGI LVATT QVSSA
Subjt:  MESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNISKTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSA

Query:  NDKNPVVADMTFFG--------------------------------------------------------------------------------------
         DKNPV+ DM+F+G                                                                                      
Subjt:  NDKNPVVADMTFFG--------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------WRLTSTYINTCHDDPTKLKVIPKDESHI
                                                                                +RLT+TY+    DD  KLK  P + S I
Subjt:  ------------------------------------------------------------------------WRLTSTYINTCHDDPTKLKVIPKDESHI

Query:  PQQHWDRFFHSRLSMEFQALSVRN-RELQGMHVYNHRLSQKGYANLSEELKNSQCEEHPRDRGTLWKKARMDKDGNIREACTRVVVERIIRIEIKVTQGE
         QQHW  F  SRL  +F+     N +E +  H YNHR S+KGY NL EELK S  ++   DR  +WK+ARMD+ G I +  T+ VV  I   E+  TQ  
Subjt:  PQQHWDRFFHSRLSMEFQALSVRN-RELQGMHVYNHRLSQKGYANLSEELKNSQCEEHPRDRGTLWKKARMDKDGNIREACTRVVVERIIRIEIKVTQGE

Query:  NDSNNSPGEDALSLALGTRDCPGWL-----------------------------------TCARRIKELEEELAKYKQNASIPSDSKLQETMKREKDVVM
        N +N    +D L+ ALG +D PG L                                     A+RIKELEEEL K K+N     +SK +  M  ++   M
Subjt:  NDSNNSPGEDALSLALGTRDCPGWL-----------------------------------TCARRIKELEEELAKYKQNASIPSDSKLQETMKREKDVVM

Query:  KEGETGEEDSKEGEKTENIKEIETGEKKVKIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNASFKEGDSCQLSFESSDNVVALEMIFECNLEGGN
        +    G E+  + +   N  E E   + V    E+I    AK  K+   + +E                K G  C+L+FE+ D++VA   I + + +G N
Subjt:  KEGETGEEDSKEGEKTENIKEIETGEKKVKIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNASFKEGDSCQLSFESSDNVVALEMIFECNLEGGN

Query:  VKE-------------------------------------VATREEVLQLYPFEHD----GTTPI-----ALRCLLHVVQ--------------------
        VK                                      V T    +    F  D      TPI     ALR LL +V+                    
Subjt:  VKE-------------------------------------VATREEVLQLYPFEHD----GTTPI-----ALRCLLHVVQ--------------------

Query:  ---------TFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKFLNDGSISTSNLLKEARSEKLTKRLMQMESGQLLLAPYNSGSHWILIVIDYFKTMVY
                  F  MRP+   C+DAYI+YLY ++      ++YKF++ GSIS  +  KE R++ LT RL+  +  QLLL PYNSG+HW L+VI+  K   +
Subjt:  ---------TFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKFLNDGSISTSNLLKEARSEKLTKRLMQMESGQLLLAPYNSGSHWILIVIDYFKTMVY

Query:  SINSLINRLDNDIMDVVNRAFNKCDKVKVQWNCVKCSRQPGSTEYGYYIMQFMRAIVNQQSTSIPDVTK
         I+ L NR+D D+ +VV R+FN  +K K  W  VKC +Q G  EYGYY+M FMR ++   STSI  + K
Subjt:  SINSLINRLDNDIMDVVNRAFNKCDKVKVQWNCVKCSRQPGSTEYGYYIMQFMRAIVNQQSTSIPDVTK

A0A5A7V2K1 ULP_PROTEASE domain-containing protein4.2e-9631.93Show/hide
Query:  LAPVGLGSLCRREESLIDRPLSTGCYVRPTVQHLKQAHLYVLENTDVVQPYISHHMESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNIS
        L  +GLGSL  REE   DRPLS G Y RP +Q LKQA+L++L+NT+ V PYI  HM  L ++ P RA NE+WL  EHNRSFG+WIR++VM EL  GK +S
Subjt:  LAPVGLGSLCRREESLIDRPLSTGCYVRPTVQHLKQAHLYVLENTDVVQPYISHHMESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNIS

Query:  KTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSANDKNPVVADMTFFG-----WRLTSTYINT----CH----------
         T+RW+AH PN  V+TYEGY VNGC Y+TKSRD  + +QNSGI LVATTMQVSSA DKNPV+ DM+F+G     W ++    NT    C           
Subjt:  KTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSANDKNPVVADMTFFG-----WRLTSTYINT----CH----------

Query:  DD--------------------------------------------------------------------------------------------------
        DD                                                                                                  
Subjt:  DD--------------------------------------------------------------------------------------------------

Query:  -------PTKLKVIPKDES---------------------------------HIP--QQHW-------DRFFHSRLSMEFQALSVRNRELQGMHVYNHRL
               PT++K I +  S                                 H+P     W       D      +       S   +E +  H YNHR 
Subjt:  -------PTKLKVIPKDES---------------------------------HIP--QQHW-------DRFFHSRLSMEFQALSVRNRELQGMHVYNHRL

Query:  SQKGYANLSEELKNSQCEEHPRDRGTLWKKARMDKDGNIREACTRVVVERIIRIEIKVTQGENDSNNSPGEDALSLALGTRDCPGWL------TCARRIK
        S+KGYANL EELK S  ++   DR  +WK+ARMD+ G I +      ++ ++ +  ++   +N +N    ED L+ ALG +D PG L         ++IK
Subjt:  SQKGYANLSEELKNSQCEEHPRDRGTLWKKARMDKDGNIREACTRVVVERIIRIEIKVTQGENDSNNSPGEDALSLALGTRDCPGWL------TCARRIK

Query:  ELEEELAKYKQNASIPSDSKLQETMKREKDVVMKEGETGEEDSKEGEKTENIKEIETGEKKVKIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNA
        ELEEEL K K+N     +SK +  M  ++   M+    G E+  + E   N  E E   + V    E+I    AK  K+ + + +E              
Subjt:  ELEEELAKYKQNASIPSDSKLQETMKREKDVVMKEGETGEEDSKEGEKTENIKEIETGEKKVKIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNA

Query:  SFKEGDSCQLSFESSDNVVALEMIFECNLEGGNVKE-------------------------------------VATREEVLQLYPFEHDGTT--------
          K G  C+L+FE+ D+VVA   I + + EG NVK                                      V T    +    F  D +T        
Subjt:  SFKEGDSCQLSFESSDNVVALEMIFECNLEGGNVKE-------------------------------------VATREEVLQLYPFEHDGTT--------

Query:  -PIALRCLLHVVQ-----------------------------TFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKFLNDGSISTSNLLKEARSEKLTKR
         P+ALR LL +V+                              F  MRP+   C+DAYI+YLY ++      ++YKF++ GSIS  +  KE R++ LT R
Subjt:  -PIALRCLLHVVQ-----------------------------TFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKFLNDGSISTSNLLKEARSEKLTKR

Query:  LMQMESGQLLLAPYNSGSHWILIVIDYFKTMVYSINSLINRLDNDIMDVVNRAFNKCDKVKVQWNCVKCSRQPGSTEYGYYIMQFMRAIVNQQSTSIPDV
        L+     QLLL PYNSG+HW L+VI+  K   + I+ L NR+D D+ +VV R              VKC +Q G  E GYY+M+FMR I+   STSI  +
Subjt:  LMQMESGQLLLAPYNSGSHWILIVIDYFKTMVYSINSLINRLDNDIMDVVNRAFNKCDKVKVQWNCVKCSRQPGSTEYGYYIMQFMRAIVNQQSTSIPDV

Query:  TK
        TK
Subjt:  TK

A0A5A7VF70 DUF4218 domain-containing protein3.4e-9836.5Show/hide
Query:  LAPVGLGSLCRREESLIDRPLSTGCYVRPTVQHLKQAHLYVLENTDVVQPYISHHMESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNIS
        L P+GLGSL  R++   DR LS   YVRP +Q LKQAHL++L+NT+ V PYI  HM  L ++ P RA NE+WL  EHNRSFG+WIR++VM EL  GK +S
Subjt:  LAPVGLGSLCRREESLIDRPLSTGCYVRPTVQHLKQAHLYVLENTDVVQPYISHHMESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNIS

Query:  KTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSANDKNPVVADMTFFG-------------------------------
         T+RW+AH PN  V+TY GY VNGC Y+TKSRD  +T+QNSGI LVATTMQVSSANDKN V+ DM+F+G                               
Subjt:  KTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSANDKNPVVADMTFFG-------------------------------

Query:  -WRLTSTYINTCHDDPTKLKVIPKDESHIPQQHWDRFFHSRLSMEFQALSVRNRELQGMHVYNHRLSQKGYANLSEELKNSQCEEHPRDRGTLWKKARMD
         +RLT+TY+    DD  KLK    + S I QQHW  F  SRL  +F+  S   +E +  H YNHR  +KGYANL EELK S  ++   DR  +WK+A MD
Subjt:  -WRLTSTYINTCHDDPTKLKVIPKDESHIPQQHWDRFFHSRLSMEFQALSVRNRELQGMHVYNHRLSQKGYANLSEELKNSQCEEHPRDRGTLWKKARMD

Query:  KDGNIREACTRVVVERIIRIEIKVTQGENDSNNSPGEDALSLALGTRDCPGWLTCARRIKELEEELAKYKQNASIPSDSKLQETMKREKDVVMKEGETGE
        + G I +  T+ VV  I   E+  TQ  N +N    ED L+ ALG RD PG L    +IKELEEEL K K+      D K +  +   +DVV    +  +
Subjt:  KDGNIREACTRVVVERIIRIEIKVTQGENDSNNSPGEDALSLALGTRDCPGWLTCARRIKELEEELAKYKQNASIPSDSKLQETMKREKDVVMKEGETGE

Query:  EDSKEGEKTENIKEIETGEKKVKIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNASF---KEGDSCQLSFESSDNVV-ALEMIFECNLE---GGN
        +   EG   + +K     E   K+  E      A G   D  +E           V+KN       E    ++S E   +++   +++   N++   G  
Subjt:  EDSKEGEKTENIKEIETGEKKVKIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNASF---KEGDSCQLSFESSDNVV-ALEMIFECNLE---GGN

Query:  VKEVAT-----REEVLQLYPF-----EHDG-----TTP-----IALRC--LLHVVQTFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKFLNDGSISTS
         K+++T      +  L   PF     EH G     TTP     +  +C  ++  ++ F  MRP+   C+DAYI+YLY  +      ++YKF+        
Subjt:  VKEVAT-----REEVLQLYPF-----EHDG-----TTP-----IALRC--LLHVVQTFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKFLNDGSISTS

Query:  NLLKEARSEKLTKRLMQMESGQLLLAPYNSGSHWILIVIDYFKTMVYSINSLINRLDNDIMDVVNRAFNKCDKVKVQWNCVKCSRQPGSTEYGYYIMQFM
                                    ++G+HW L+VI+  K   + I+ L NR+D ++ +VV R+ +        +  ++C +Q G  E GYY+MQFM
Subjt:  NLLKEARSEKLTKRLMQMESGQLLLAPYNSGSHWILIVIDYFKTMVYSINSLINRLDNDIMDVVNRAFNKCDKVKVQWNCVKCSRQPGSTEYGYYIMQFM

Query:  RAIVNQQSTSIPDVTKTLWFISTNAP
          I+   ST + + T  ++ I    P
Subjt:  RAIVNQQSTSIPDVTKTLWFISTNAP

A0A5D3BSX7 DUF4218 domain-containing protein2.2e-9735.95Show/hide
Query:  LAPVGLGSLCRREESLIDRPLSTGCYVRPTVQHLKQAHLYVLENTDVVQPYISHHMESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNIS
        L P+GLGSL  R++   DR LS   YVRP +Q LKQAHL++L+NT+ V PYI  HM  L ++ P RA NE+WL  EHNRSFG+WIR++VM EL  GK +S
Subjt:  LAPVGLGSLCRREESLIDRPLSTGCYVRPTVQHLKQAHLYVLENTDVVQPYISHHMESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNIS

Query:  KTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSANDKNPVVADMTFFG-------------------------------
         T+RW+AH PN  V+TY GY VNGC Y+TKSRD  +T+QNSGI LVATTMQVSSANDKN V+ DM+F+G                               
Subjt:  KTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSANDKNPVVADMTFFG-------------------------------

Query:  -WRLTSTYINTCHDDPTKLKVIPKDESHIPQQHWDRFFHSRLSMEFQALSVRNRELQGMHVYNHRLSQKGYANLSEELKNSQCEEHPRDRGTLWKKARMD
         +RLT+TY+    DD  KLK    + S I QQHW  F  SRL  +F+  S   +E +  H YNHR  +KGYANL EELK S  ++   DR  +WK+A MD
Subjt:  -WRLTSTYINTCHDDPTKLKVIPKDESHIPQQHWDRFFHSRLSMEFQALSVRNRELQGMHVYNHRLSQKGYANLSEELKNSQCEEHPRDRGTLWKKARMD

Query:  KDGNIREACTRVVVERIIRIEIKVTQGENDSNNSPGEDALSLALGTRDCPGWLTCARRIKELEEELAKYKQNASIPSDSKLQETMKREKDVVMKEGETGE
        + G I +  T+ V E +          +N +N    ED L+ ALG RD PG L    +IKELEEEL K K+      D K +  +   +DVV    +  +
Subjt:  KDGNIREACTRVVVERIIRIEIKVTQGENDSNNSPGEDALSLALGTRDCPGWLTCARRIKELEEELAKYKQNASIPSDSKLQETMKREKDVVMKEGETGE

Query:  EDSKEGEKTENIKEIETGEKKVKIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNASF---KEGDSCQLSFESSDNVV-ALEMIFECNLE---GGN
        +   EG   + +K     E   K+  E      A G   D  +E           V+KN       E    ++S E   +++   +++   N++   G  
Subjt:  EDSKEGEKTENIKEIETGEKKVKIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNASF---KEGDSCQLSFESSDNVV-ALEMIFECNLE---GGN

Query:  VKEVAT-----REEVLQLYPF-----EHDG-----TTP-----IALRC--LLHVVQTFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKFLNDGSISTS
         K+++T      +  L   PF     EH G     TTP     +  +C  ++  ++ F  MRP+   C+DAYI+YLY  +      ++YKF+        
Subjt:  VKEVAT-----REEVLQLYPF-----EHDG-----TTP-----IALRC--LLHVVQTFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKFLNDGSISTS

Query:  NLLKEARSEKLTKRLMQMESGQLLLAPYNSGSHWILIVIDYFKTMVYSINSLINRLDNDIMDVVNRAFNKCDKVKVQWNCVKCSRQPGSTEYGYYIMQFM
                                    ++G+HW L+VI+  K   + I+ L NR+D ++ +VV R+ +        +  ++C +Q G  E GYY+MQFM
Subjt:  NLLKEARSEKLTKRLMQMESGQLLLAPYNSGSHWILIVIDYFKTMVYSINSLINRLDNDIMDVVNRAFNKCDKVKVQWNCVKCSRQPGSTEYGYYIMQFM

Query:  RAIVNQQSTSIPDVTKTLWFISTNAP
          I+   ST + + T  ++ I    P
Subjt:  RAIVNQQSTSIPDVTKTLWFISTNAP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCAT
ACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGCACGGTGAAGACTTAGGAAATATCAGAGTCGAACAGACGTACAATAATGATGCACTCGAGAGTGATG
AAGATGATCTCGACTTTTTTGAGGTGGGTGACTTGGTTCAATCTGTTCAGGATGAATTCTCTGATGTACCAAAATCATTTGAAACTATGTTTCATAATGCTAAGAAGCCG
TTGTACCCCGGATGTGGAAAGTTTACAAAAGTATCAACTCTTGTGAGATTGTACAATTTGAAGGTGAGATTTGGGTGGAGTAACTCCAGTTTTTCTGAACTTCTGTCCAT
AATAAGTAATATATTACCAGGCCCTAATGAAATGCTGGATTCTATGAAAGAAGCAAAGAAAACGTTACTAGCACCAGTTGGGCTAGGCTCACTTTGCAGGAGAGAAGAAT
CACTTATTGATAGACCTCTATCAACTGGATGTTACGTGAGACCTACTGTACAACATCTCAAACAAGCTCATCTTTACGTGTTGGAGAACACTGATGTGGTACAACCATAT
ATAAGTCATCATATGGAGTCTCTAACATTGAAATATCCAAGTAGAGCGACGAATGAGAAGTGGTTGCATGATGAACACAATCGATCATTTGGTTCTTGGATCCGTGAGGA
GGTTATGGGCGAATTGCGTAATGGTAAAAACATTTCAAAGACTTTAAGATGGATAGCACATAGCCCCAATCCAGATGTTCTGACTTATGAGGGTTATAAGGTTAATGGAT
GTTGCTACAACACTAAATCTCGTGACTCTACTCAAACTATTCAGAATAGTGGAATTAGTTTGGTTGCTACAACCATGCAAGTGTCTAGTGCGAATGATAAGAACCCTGTT
GTTGCAGACATGACGTTCTTTGGATGGAGGTTGACTTCCACCTACATAAACACTTGTCACGATGATCCGACAAAATTGAAGGTGATTCCGAAGGATGAATCCCATATTCC
ACAACAACACTGGGACCGGTTTTTCCATTCTAGACTTAGTATGGAATTCCAGGCCTTAAGTGTACGGAATAGAGAATTGCAGGGAATGCATGTATACAACCACAGGTTGT
CACAAAAGGGGTACGCGAATCTCTCCGAAGAATTGAAAAATTCCCAATGTGAGGAACACCCGAGGGATCGAGGCACGTTATGGAAGAAAGCGCGTATGGACAAAGATGGA
AATATCAGGGAAGCATGTACTAGAGTCGTAGTCGAACGCATAATACGTATTGAGATAAAAGTTACACAAGGTGAGAATGATAGCAACAACTCTCCAGGAGAAGATGCACT
ATCACTTGCTTTGGGTACCCGAGACTGTCCTGGATGGCTTACATGTGCTAGAAGGATTAAGGAGTTGGAAGAGGAATTGGCGAAATATAAACAAAATGCTTCAATACCAA
GTGATAGTAAGTTGCAAGAGACAATGAAACGAGAAAAAGATGTGGTGATGAAGGAAGGAGAAACAGGAGAGGAGGACAGTAAGGAGGGAGAAAAAACAGAGAATATAAAG
GAAATAGAAACAGGAGAAAAGAAAGTTAAGATAAAGGAAGAAGAAATAGGAAGGCAGACTGCGAAAGGAGGAAAAAGAGATGACGTTATTGAAGTGGAGACAAGTCACGA
AGAGGCAACTAGTAATGTTCAGAAAAATGCGTCCTTTAAGGAGGGAGATTCTTGTCAATTGTCGTTCGAGTCAAGCGACAATGTCGTGGCATTAGAAATGATATTCGAAT
GCAACTTGGAAGGAGGCAATGTGAAGGAGGTTGCAACCAGAGAGGAGGTGCTTCAGCTGTACCCGTTCGAACATGATGGAACGACCCCAATTGCTCTGCGATGCCTACTT
CACGTTGTACAAACCTTTTATGAGATGAGGCCATTAACTGATCCGTGCATAGATGCATATATCATATACTTATACAAGAAGTTGGTGGAGCAAGATGGATCTCATATGTA
CAAATTTCTTAATGATGGATCAATATCGACATCCAATCTTTTGAAAGAAGCACGTTCGGAGAAGTTGACAAAACGACTTATGCAAATGGAGTCAGGTCAATTGCTTCTTG
CTCCTTACAACAGTGGATCTCATTGGATATTGATAGTCATTGATTACTTCAAGACCATGGTATATTCAATAAACTCTTTGATAAATCGTTTGGACAATGATATAATGGAT
GTCGTCAACCGGGCTTTCAATAAATGCGACAAAGTCAAGGTGCAATGGAACTGTGTAAAGTGCTCAAGACAACCTGGATCAACAGAATATGGGTATTACATTATGCAATT
CATGCGGGCCATAGTAAACCAACAAAGCACTTCGATCCCAGATGTAACTAAGACACTTTGGTTCATATCAACGAATGCCCCTGACACGATCATCGAACAAGAAAAGGGCA
TTCCATCTGCCGAAGAATTGGAGGTTGTGCCCGTCTTGGGGCCAGGAAAGACCACCAACGTGGGAGATGTGTTATCGGGATCCGTCTCATTCGGATGTGGTATCAGTTCA
TTCATGTACCCCTCTTCTCTAGTTGGGCGTTACATGCCCACTAGCTTCCGCTTGGTTCGTCCCCGAACCACATCGTACTGGAAGAAGTTCTGCTTTGATACATTTTGTAA
CTCCCCAGGTCTAGGATTCAGACAAGAATTTGGAATCCAGATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCAT
ACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGCACGGTGAAGACTTAGGAAATATCAGAGTCGAACAGACGTACAATAATGATGCACTCGAGAGTGATG
AAGATGATCTCGACTTTTTTGAGGTGGGTGACTTGGTTCAATCTGTTCAGGATGAATTCTCTGATGTACCAAAATCATTTGAAACTATGTTTCATAATGCTAAGAAGCCG
TTGTACCCCGGATGTGGAAAGTTTACAAAAGTATCAACTCTTGTGAGATTGTACAATTTGAAGGTGAGATTTGGGTGGAGTAACTCCAGTTTTTCTGAACTTCTGTCCAT
AATAAGTAATATATTACCAGGCCCTAATGAAATGCTGGATTCTATGAAAGAAGCAAAGAAAACGTTACTAGCACCAGTTGGGCTAGGCTCACTTTGCAGGAGAGAAGAAT
CACTTATTGATAGACCTCTATCAACTGGATGTTACGTGAGACCTACTGTACAACATCTCAAACAAGCTCATCTTTACGTGTTGGAGAACACTGATGTGGTACAACCATAT
ATAAGTCATCATATGGAGTCTCTAACATTGAAATATCCAAGTAGAGCGACGAATGAGAAGTGGTTGCATGATGAACACAATCGATCATTTGGTTCTTGGATCCGTGAGGA
GGTTATGGGCGAATTGCGTAATGGTAAAAACATTTCAAAGACTTTAAGATGGATAGCACATAGCCCCAATCCAGATGTTCTGACTTATGAGGGTTATAAGGTTAATGGAT
GTTGCTACAACACTAAATCTCGTGACTCTACTCAAACTATTCAGAATAGTGGAATTAGTTTGGTTGCTACAACCATGCAAGTGTCTAGTGCGAATGATAAGAACCCTGTT
GTTGCAGACATGACGTTCTTTGGATGGAGGTTGACTTCCACCTACATAAACACTTGTCACGATGATCCGACAAAATTGAAGGTGATTCCGAAGGATGAATCCCATATTCC
ACAACAACACTGGGACCGGTTTTTCCATTCTAGACTTAGTATGGAATTCCAGGCCTTAAGTGTACGGAATAGAGAATTGCAGGGAATGCATGTATACAACCACAGGTTGT
CACAAAAGGGGTACGCGAATCTCTCCGAAGAATTGAAAAATTCCCAATGTGAGGAACACCCGAGGGATCGAGGCACGTTATGGAAGAAAGCGCGTATGGACAAAGATGGA
AATATCAGGGAAGCATGTACTAGAGTCGTAGTCGAACGCATAATACGTATTGAGATAAAAGTTACACAAGGTGAGAATGATAGCAACAACTCTCCAGGAGAAGATGCACT
ATCACTTGCTTTGGGTACCCGAGACTGTCCTGGATGGCTTACATGTGCTAGAAGGATTAAGGAGTTGGAAGAGGAATTGGCGAAATATAAACAAAATGCTTCAATACCAA
GTGATAGTAAGTTGCAAGAGACAATGAAACGAGAAAAAGATGTGGTGATGAAGGAAGGAGAAACAGGAGAGGAGGACAGTAAGGAGGGAGAAAAAACAGAGAATATAAAG
GAAATAGAAACAGGAGAAAAGAAAGTTAAGATAAAGGAAGAAGAAATAGGAAGGCAGACTGCGAAAGGAGGAAAAAGAGATGACGTTATTGAAGTGGAGACAAGTCACGA
AGAGGCAACTAGTAATGTTCAGAAAAATGCGTCCTTTAAGGAGGGAGATTCTTGTCAATTGTCGTTCGAGTCAAGCGACAATGTCGTGGCATTAGAAATGATATTCGAAT
GCAACTTGGAAGGAGGCAATGTGAAGGAGGTTGCAACCAGAGAGGAGGTGCTTCAGCTGTACCCGTTCGAACATGATGGAACGACCCCAATTGCTCTGCGATGCCTACTT
CACGTTGTACAAACCTTTTATGAGATGAGGCCATTAACTGATCCGTGCATAGATGCATATATCATATACTTATACAAGAAGTTGGTGGAGCAAGATGGATCTCATATGTA
CAAATTTCTTAATGATGGATCAATATCGACATCCAATCTTTTGAAAGAAGCACGTTCGGAGAAGTTGACAAAACGACTTATGCAAATGGAGTCAGGTCAATTGCTTCTTG
CTCCTTACAACAGTGGATCTCATTGGATATTGATAGTCATTGATTACTTCAAGACCATGGTATATTCAATAAACTCTTTGATAAATCGTTTGGACAATGATATAATGGAT
GTCGTCAACCGGGCTTTCAATAAATGCGACAAAGTCAAGGTGCAATGGAACTGTGTAAAGTGCTCAAGACAACCTGGATCAACAGAATATGGGTATTACATTATGCAATT
CATGCGGGCCATAGTAAACCAACAAAGCACTTCGATCCCAGATGTAACTAAGACACTTTGGTTCATATCAACGAATGCCCCTGACACGATCATCGAACAAGAAAAGGGCA
TTCCATCTGCCGAAGAATTGGAGGTTGTGCCCGTCTTGGGGCCAGGAAAGACCACCAACGTGGGAGATGTGTTATCGGGATCCGTCTCATTCGGATGTGGTATCAGTTCA
TTCATGTACCCCTCTTCTCTAGTTGGGCGTTACATGCCCACTAGCTTCCGCTTGGTTCGTCCCCGAACCACATCGTACTGGAAGAAGTTCTGCTTTGATACATTTTGTAA
CTCCCCAGGTCTAGGATTCAGACAAGAATTTGGAATCCAGATTTAG
Protein sequenceShow/hide protein sequence
MGGARRLGSLQKNWFSSNFALNETRLPMRFGGSNRCIRVEEVFHYQFEHDLHGEDLGNIRVEQTYNNDALESDEDDLDFFEVGDLVQSVQDEFSDVPKSFETMFHNAKKP
LYPGCGKFTKVSTLVRLYNLKVRFGWSNSSFSELLSIISNILPGPNEMLDSMKEAKKTLLAPVGLGSLCRREESLIDRPLSTGCYVRPTVQHLKQAHLYVLENTDVVQPY
ISHHMESLTLKYPSRATNEKWLHDEHNRSFGSWIREEVMGELRNGKNISKTLRWIAHSPNPDVLTYEGYKVNGCCYNTKSRDSTQTIQNSGISLVATTMQVSSANDKNPV
VADMTFFGWRLTSTYINTCHDDPTKLKVIPKDESHIPQQHWDRFFHSRLSMEFQALSVRNRELQGMHVYNHRLSQKGYANLSEELKNSQCEEHPRDRGTLWKKARMDKDG
NIREACTRVVVERIIRIEIKVTQGENDSNNSPGEDALSLALGTRDCPGWLTCARRIKELEEELAKYKQNASIPSDSKLQETMKREKDVVMKEGETGEEDSKEGEKTENIK
EIETGEKKVKIKEEEIGRQTAKGGKRDDVIEVETSHEEATSNVQKNASFKEGDSCQLSFESSDNVVALEMIFECNLEGGNVKEVATREEVLQLYPFEHDGTTPIALRCLL
HVVQTFYEMRPLTDPCIDAYIIYLYKKLVEQDGSHMYKFLNDGSISTSNLLKEARSEKLTKRLMQMESGQLLLAPYNSGSHWILIVIDYFKTMVYSINSLINRLDNDIMD
VVNRAFNKCDKVKVQWNCVKCSRQPGSTEYGYYIMQFMRAIVNQQSTSIPDVTKTLWFISTNAPDTIIEQEKGIPSAEELEVVPVLGPGKTTNVGDVLSGSVSFGCGISS
FMYPSSLVGRYMPTSFRLVRPRTTSYWKKFCFDTFCNSPGLGFRQEFGIQI