; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g26340 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g26340
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionULP_PROTEASE domain-containing protein
Genome locationchr9:19678490..19691124
RNA-Seq ExpressionMoc09g26340
SyntenyMoc09g26340
Gene Ontology termsNA
InterPro domainsIPR004252 - Probable transposase, Ptta/En/Spm, plant
IPR004332 - Transposase, MuDR, plant
IPR029480 - Transposase-associated domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035941.1 uncharacterized protein E6C27_scaffold56G001300 [Cucumis melo var. makuwa]2.8e-11540.52Show/hide
Query:  GETSKTRKKRGPTGLHKITRISSEGHRRVIKYNVKGQPIGHNATKLKTFIGISVQHHIPTIYETWKSVPSETKEKIYDLIQGGFVVDHRTKKNLIQVAGI
        G  +K +  RGPTG+ +ITR+S +GH+RV++YN  GQPIG +ATKLK+FIG +V+ H+P  Y++WK VP+E K+KIY+LI+GGFVVD R+KK+++Q A +
Subjt:  GETSKTRKKRGPTGLHKITRISSEGHRRVIKYNVKGQPIGHNATKLKTFIGISVQHHIPTIYETWKSVPSETKEKIYDLIQGGFVVDHRTKKNLIQVAGI

Query:  AFRNFKYRLTNKFIKPFINDKDKLKAPPEDYSHISHEHWEVFVKSRLTKDFERQSETNREKRKKHVYNHCMLRKGYANLAEELKIDASHERPSDRCILWK
         FRNFK  LT K + P+  D +KLK PP +YS I  EHW +FV SRLTK FE  S   RE+RK + YNH M RKGYANL EE+K   S+    DR ++WK
Subjt:  AFRNFKYRLTNKFIKPFINDKDKLKAPPEDYSHISHEHWEVFVKSRLTKDFERQSETNREKRKKHVYNHCMLRKGYANLAEELKIDASHERPSDRCILWK

Query:  KARTNREGKIVHGPTQKVVERIDDLMITQDLQEENVDKYGDLLSIALGSRDRPGLVKAV------ELEVKLKKHEKGSPKSKHGTPAKKTPKKSPKLKRT
        KART ++G+I    T++V  +ID+L++++       +   D+LS A+G  D  G ++ V      +LE +L KH+K    +  G   +   K     K  
Subjt:  KARTNREGKIVHGPTQKVVERIDDLMITQDLQEENVDKYGDLLSIALGSRDRPGLVKAV------ELEVKLKKHEKGSPKSKHGTPAKKTPKKSPKLKRT

Query:  TPSKNAPKKSPRSKRITPSKNDPKKSSQSMHTTPSKNASKKSPQSKRGTLSKHDYRYLTDVTVCVHLKILDKEETLEFKEGTHCRLALGSIDNVVAADTI
          S +A            + +D K+ ++ +    +        Q K G  +K+         VC  ++ L K      K+GT CRLA+G+ DNVV A TI
Subjt:  TPSKNAPKKSPRSKRITPSKNDPKKSSQSMHTTPSKNASKKSPQSKRGTLSKHDYRYLTDVTVCVHLKILDKEETLEFKEGTHCRLALGSIDNVVAADTI

Query:  FESGRKDGNVKVSIDVVVDDDSRLPIPTHGGNNILSQEIGSHILWPQNLVISDNEKEIKHSRELFSFASGKSPTQPAPVSLTCLTREINYIGRAIQMIIS
         +      NVKVS+D+V D +  +PIPT  G  +LSQE+GS +LWP++LVI  +EK     +      S    T+ APV+L  L  E++YIG  IQ+ + 
Subjt:  FESGRKDGNVKVSIDVVVDDDSRLPIPTHGGNNILSQEIGSHILWPQNLVISDNEKEIKHSRELFSFASGKSPTQPAPVSLTCLTREINYIGRAIQMIIS

Query:  KDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTTFLHRSLGNEYESSPYKFLDAGATSITNLSKENRVQVLTKRLSELELNQLLMFPYHSGDHWTLIV
          VFG+E K  I +E +Q+   M+P +T CIDA+   L++ +        YKF DAG+ S+  +SKE+R Q+L  RL   +  Q+L+FPY+SG+HW LI 
Subjt:  KDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTTFLHRSLGNEYESSPYKFLDAGATSITNLSKENRVQVLTKRLSELELNQLLMFPYHSGDHWTLIV

Query:  MSPAKNMTFFLD
        ++ ++   +++D
Subjt:  MSPAKNMTFFLD

TYJ96009.1 uncharacterized protein E5676_scaffold2612G00150 [Cucumis melo var. makuwa]2.7e-11842.07Show/hide
Query:  GETSKTRKKRGPTGLHKITRISSEGHRRVIKYNVKGQPIGHNATKLKTFIGISVQHHIPTIYETWKSVPSETKEKIYDLIQGGFVVDHRTKKNLIQVAGI
        GE+S  ++KRGPT + +ITR  SEG + VI+YN  GQ IG NATKLK+FIG +V+ H+P IY  W +VP E K+KI++LI+ GFVVD R+KK +IQ AG+
Subjt:  GETSKTRKKRGPTGLHKITRISSEGHRRVIKYNVKGQPIGHNATKLKTFIGISVQHHIPTIYETWKSVPSETKEKIYDLIQGGFVVDHRTKKNLIQVAGI

Query:  AFRNFKYRLTNKFIKPFINDKDKLKAPPEDYSHISHEHWEVFVKSRLTKDFERQSETNREKRKKHVYNHCMLRKGYANLAEELKIDASHERPSDRCILWK
         FR FKYRLT  ++ PF++D +KLK PP +YS I  +HW  FV SRL +DF+++SE  +EKRKKH YNH   RKGYANL EELK  +S +   DR I+WK
Subjt:  AFRNFKYRLTNKFIKPFINDKDKLKAPPEDYSHISHEHWEVFVKSRLTKDFERQSETNREKRKKHVYNHCMLRKGYANLAEELKIDASHERPSDRCILWK

Query:  KARTNREGKIVHGPTQKVVERIDDLMITQDLQEENVDKYGDLLSIALGSRDRPGLVKAVELEVKLKK--HEKGSPKSKHGTPAKKTPKKSPKLKRTTP--
        +AR +R+G+I    T++VV  ID+L+ TQ+      ++  D+L+ ALG +DRPG+++ V   V  KK  H     K+      K T K+  ++ +     
Subjt:  KARTNREGKIVHGPTQKVVERIDDLMITQDLQEENVDKYGDLLSIALGSRDRPGLVKAVELEVKLKK--HEKGSPKSKHGTPAKKTPKKSPKLKRTTP--

Query:  SKNAPKKSPRSKRITPSKNDPKKSSQSMHTTPSKNASKKSPQSKRGTLSKHDYRYLTDVTVCVHLKILDKEETLE------FKEGTHCRLALGSIDNVVA
         +   K       +  SK +P   S+                      S+ D   + ++   + + I   ++ +E       K GT C+LA  + D+VVA
Subjt:  SKNAPKKSPRSKRITPSKNDPKKSSQSMHTTPSKNASKKSPQSKRGTLSKHDYRYLTDVTVCVHLKILDKEETLE------FKEGTHCRLALGSIDNVVA

Query:  ADTIFESGRKDGNVKVSIDVVVDDDSRLPIPTHGGNNILSQEIGSHILWPQNLVISDNEKEI--KHSRELFSFASGKSPTQPAPVSLTCLTREINYIGRA
          TI +S  +  NVKV+IDVVVD D  +PIP+  G   +SQE+GSHILWP++LVI++N K    + ++++ +FA   +P Q APV+L  L R + ++G A
Subjt:  ADTIFESGRKDGNVKVSIDVVVDDDSRLPIPTHGGNNILSQEIGSHILWPQNLVISDNEKEI--KHSRELFSFASGKSPTQPAPVSLTCLTREINYIGRA

Query:  IQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTTFLHRSLGNEYESSPYKFLDAGATSITNLSKENRVQVLTKRLSELELNQLLMFPYHSGD
        IQ+    DVFG   K  IM+E ++    M P  T C+DAY  +L+  + +    + YKFLDAG+ S  + SKE RVQ+LT RL   + +QLL+FPY+SG+
Subjt:  IQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTTFLHRSLGNEYESSPYKFLDAGATSITNLSKENRVQVLTKRLSELELNQLLMFPYHSGD

Query:  HWTLIVMSPAKNMTFFLD
        HWTL+V++  K   F++D
Subjt:  HWTLIVMSPAKNMTFFLD

TYK08419.1 uncharacterized protein E5676_scaffold654G00340 [Cucumis melo var. makuwa]2.7e-11842.07Show/hide
Query:  GETSKTRKKRGPTGLHKITRISSEGHRRVIKYNVKGQPIGHNATKLKTFIGISVQHHIPTIYETWKSVPSETKEKIYDLIQGGFVVDHRTKKNLIQVAGI
        GE+S  ++KRGPT + +ITR  SEG + VI+YN  GQ IG NATKLK+FIG +V+ H+P IY  W +VP E K+KI++LI+ GFVVD R+KK +IQ AG+
Subjt:  GETSKTRKKRGPTGLHKITRISSEGHRRVIKYNVKGQPIGHNATKLKTFIGISVQHHIPTIYETWKSVPSETKEKIYDLIQGGFVVDHRTKKNLIQVAGI

Query:  AFRNFKYRLTNKFIKPFINDKDKLKAPPEDYSHISHEHWEVFVKSRLTKDFERQSETNREKRKKHVYNHCMLRKGYANLAEELKIDASHERPSDRCILWK
         FR FKYRLT  ++ PF++D +KLK PP +YS I  +HW  FV SRL +DF+++SE  +EKRKKH YNH   RKGYANL EELK  +S +   DR I+WK
Subjt:  AFRNFKYRLTNKFIKPFINDKDKLKAPPEDYSHISHEHWEVFVKSRLTKDFERQSETNREKRKKHVYNHCMLRKGYANLAEELKIDASHERPSDRCILWK

Query:  KARTNREGKIVHGPTQKVVERIDDLMITQDLQEENVDKYGDLLSIALGSRDRPGLVKAVELEVKLKK--HEKGSPKSKHGTPAKKTPKKSPKLKRTTP--
        +AR +R+G+I    T++VV  ID+L+ TQ+      ++  D+L+ ALG +DRPG+++ V   V  KK  H     K+      K T K+  ++ +     
Subjt:  KARTNREGKIVHGPTQKVVERIDDLMITQDLQEENVDKYGDLLSIALGSRDRPGLVKAVELEVKLKK--HEKGSPKSKHGTPAKKTPKKSPKLKRTTP--

Query:  SKNAPKKSPRSKRITPSKNDPKKSSQSMHTTPSKNASKKSPQSKRGTLSKHDYRYLTDVTVCVHLKILDKEETLE------FKEGTHCRLALGSIDNVVA
         +   K       +  SK +P   S+                      S+ D   + ++   + + I   ++ +E       K GT C+LA  + D+VVA
Subjt:  SKNAPKKSPRSKRITPSKNDPKKSSQSMHTTPSKNASKKSPQSKRGTLSKHDYRYLTDVTVCVHLKILDKEETLE------FKEGTHCRLALGSIDNVVA

Query:  ADTIFESGRKDGNVKVSIDVVVDDDSRLPIPTHGGNNILSQEIGSHILWPQNLVISDNEKEI--KHSRELFSFASGKSPTQPAPVSLTCLTREINYIGRA
          TI +S  +  NVKV+IDVVVD D  +PIP+  G   +SQE+GSHILWP++LVI++N K    + ++++ +FA   +P Q APV+L  L R + ++G A
Subjt:  ADTIFESGRKDGNVKVSIDVVVDDDSRLPIPTHGGNNILSQEIGSHILWPQNLVISDNEKEI--KHSRELFSFASGKSPTQPAPVSLTCLTREINYIGRA

Query:  IQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTTFLHRSLGNEYESSPYKFLDAGATSITNLSKENRVQVLTKRLSELELNQLLMFPYHSGD
        IQ+    DVFG   K  IM+E ++    M P  T C+DAY  +L+  + +    + YKFLDAG+ S  + SKE RVQ+LT RL   + +QLL+FPY+SG+
Subjt:  IQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTTFLHRSLGNEYESSPYKFLDAGATSITNLSKENRVQVLTKRLSELELNQLLMFPYHSGD

Query:  HWTLIVMSPAKNMTFFLD
        HWTL+V++  K   F++D
Subjt:  HWTLIVMSPAKNMTFFLD

XP_022156873.1 uncharacterized protein LOC111023710 [Momordica charantia]3.6e-14794.37Show/hide
Query:  MTDVGETSKTRKKRGPTGLHKITRISSEGHRRVIKYNVKGQPIGHNATKLKTFIGISVQHHIPTIYETWKSVPSETKEKIYDLIQGGFVVDHRTKKNLIQ
        MTDVGETSKTRKKRGPTGLHKITRISSEGHRRVIKYNVKGQPIGHNATKLKTFIGISVQHHIPTIYETWKSVPSETKEKIYDLIQGGFVVDHRTKKNLIQ
Subjt:  MTDVGETSKTRKKRGPTGLHKITRISSEGHRRVIKYNVKGQPIGHNATKLKTFIGISVQHHIPTIYETWKSVPSETKEKIYDLIQGGFVVDHRTKKNLIQ

Query:  VAGIAFRNFKYRLTNKFIKPFINDKDKLKAPPEDYSHISHEHWEVFVKSRLTKDFERQSETNREKRKKHVYNHCMLRKGYANLAEELKIDASHERPSDRC
        VAGIAFRNFKYRLTNKFIKPFINDKDKLKAPPEDYSHISHEHWEVFVKSRLTKDFERQSETNREKRKKHVYNHCMLRKGYANLAEELKIDASHERPSDRC
Subjt:  VAGIAFRNFKYRLTNKFIKPFINDKDKLKAPPEDYSHISHEHWEVFVKSRLTKDFERQSETNREKRKKHVYNHCMLRKGYANLAEELKIDASHERPSDRC

Query:  ILWKKARTNREGKIVHGPTQKVVERIDDLMITQDLQEENVDKYGDLLSIALGSRDRPGLVKAVELEVKLKKHEKGSPKSKHGTP
        ILWKKARTNREGKIVHGPTQKVVERIDDLMITQDLQEENVDKYGDLLSIALGSRDRPGLVKAV   +   K +     +KH TP
Subjt:  ILWKKARTNREGKIVHGPTQKVVERIDDLMITQDLQEENVDKYGDLLSIALGSRDRPGLVKAVELEVKLKKHEKGSPKSKHGTP

XP_022156878.1 uncharacterized protein LOC111023711 [Momordica charantia]2.5e-132100Show/hide
Query:  YRYLTDVTVCVHLKILDKEETLEFKEGTHCRLALGSIDNVVAADTIFESGRKDGNVKVSIDVVVDDDSRLPIPTHGGNNILSQEIGSHILWPQNLVISDN
        YRYLTDVTVCVHLKILDKEETLEFKEGTHCRLALGSIDNVVAADTIFESGRKDGNVKVSIDVVVDDDSRLPIPTHGGNNILSQEIGSHILWPQNLVISDN
Subjt:  YRYLTDVTVCVHLKILDKEETLEFKEGTHCRLALGSIDNVVAADTIFESGRKDGNVKVSIDVVVDDDSRLPIPTHGGNNILSQEIGSHILWPQNLVISDN

Query:  EKEIKHSRELFSFASGKSPTQPAPVSLTCLTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTTFLHRSLGNEYESSPYKFL
        EKEIKHSRELFSFASGKSPTQPAPVSLTCLTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTTFLHRSLGNEYESSPYKFL
Subjt:  EKEIKHSRELFSFASGKSPTQPAPVSLTCLTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTTFLHRSLGNEYESSPYKFL

Query:  DAGATSITNLSKENRVQVLTKRLSELELNQLLMFPYHSG
        DAGATSITNLSKENRVQVLTKRLSELELNQLLMFPYHSG
Subjt:  DAGATSITNLSKENRVQVLTKRLSELELNQLLMFPYHSG

TrEMBL top hitse value%identityAlignment
A0A5A7T2U8 ULP_PROTEASE domain-containing protein1.3e-11540.52Show/hide
Query:  GETSKTRKKRGPTGLHKITRISSEGHRRVIKYNVKGQPIGHNATKLKTFIGISVQHHIPTIYETWKSVPSETKEKIYDLIQGGFVVDHRTKKNLIQVAGI
        G  +K +  RGPTG+ +ITR+S +GH+RV++YN  GQPIG +ATKLK+FIG +V+ H+P  Y++WK VP+E K+KIY+LI+GGFVVD R+KK+++Q A +
Subjt:  GETSKTRKKRGPTGLHKITRISSEGHRRVIKYNVKGQPIGHNATKLKTFIGISVQHHIPTIYETWKSVPSETKEKIYDLIQGGFVVDHRTKKNLIQVAGI

Query:  AFRNFKYRLTNKFIKPFINDKDKLKAPPEDYSHISHEHWEVFVKSRLTKDFERQSETNREKRKKHVYNHCMLRKGYANLAEELKIDASHERPSDRCILWK
         FRNFK  LT K + P+  D +KLK PP +YS I  EHW +FV SRLTK FE  S   RE+RK + YNH M RKGYANL EE+K   S+    DR ++WK
Subjt:  AFRNFKYRLTNKFIKPFINDKDKLKAPPEDYSHISHEHWEVFVKSRLTKDFERQSETNREKRKKHVYNHCMLRKGYANLAEELKIDASHERPSDRCILWK

Query:  KARTNREGKIVHGPTQKVVERIDDLMITQDLQEENVDKYGDLLSIALGSRDRPGLVKAV------ELEVKLKKHEKGSPKSKHGTPAKKTPKKSPKLKRT
        KART ++G+I    T++V  +ID+L++++       +   D+LS A+G  D  G ++ V      +LE +L KH+K    +  G   +   K     K  
Subjt:  KARTNREGKIVHGPTQKVVERIDDLMITQDLQEENVDKYGDLLSIALGSRDRPGLVKAV------ELEVKLKKHEKGSPKSKHGTPAKKTPKKSPKLKRT

Query:  TPSKNAPKKSPRSKRITPSKNDPKKSSQSMHTTPSKNASKKSPQSKRGTLSKHDYRYLTDVTVCVHLKILDKEETLEFKEGTHCRLALGSIDNVVAADTI
          S +A            + +D K+ ++ +    +        Q K G  +K+         VC  ++ L K      K+GT CRLA+G+ DNVV A TI
Subjt:  TPSKNAPKKSPRSKRITPSKNDPKKSSQSMHTTPSKNASKKSPQSKRGTLSKHDYRYLTDVTVCVHLKILDKEETLEFKEGTHCRLALGSIDNVVAADTI

Query:  FESGRKDGNVKVSIDVVVDDDSRLPIPTHGGNNILSQEIGSHILWPQNLVISDNEKEIKHSRELFSFASGKSPTQPAPVSLTCLTREINYIGRAIQMIIS
         +      NVKVS+D+V D +  +PIPT  G  +LSQE+GS +LWP++LVI  +EK     +      S    T+ APV+L  L  E++YIG  IQ+ + 
Subjt:  FESGRKDGNVKVSIDVVVDDDSRLPIPTHGGNNILSQEIGSHILWPQNLVISDNEKEIKHSRELFSFASGKSPTQPAPVSLTCLTREINYIGRAIQMIIS

Query:  KDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTTFLHRSLGNEYESSPYKFLDAGATSITNLSKENRVQVLTKRLSELELNQLLMFPYHSGDHWTLIV
          VFG+E K  I +E +Q+   M+P +T CIDA+   L++ +        YKF DAG+ S+  +SKE+R Q+L  RL   +  Q+L+FPY+SG+HW LI 
Subjt:  KDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTTFLHRSLGNEYESSPYKFLDAGATSITNLSKENRVQVLTKRLSELELNQLLMFPYHSGDHWTLIV

Query:  MSPAKNMTFFLD
        ++ ++   +++D
Subjt:  MSPAKNMTFFLD

A0A5D3CDJ5 ULP_PROTEASE domain-containing protein1.3e-11842.07Show/hide
Query:  GETSKTRKKRGPTGLHKITRISSEGHRRVIKYNVKGQPIGHNATKLKTFIGISVQHHIPTIYETWKSVPSETKEKIYDLIQGGFVVDHRTKKNLIQVAGI
        GE+S  ++KRGPT + +ITR  SEG + VI+YN  GQ IG NATKLK+FIG +V+ H+P IY  W +VP E K+KI++LI+ GFVVD R+KK +IQ AG+
Subjt:  GETSKTRKKRGPTGLHKITRISSEGHRRVIKYNVKGQPIGHNATKLKTFIGISVQHHIPTIYETWKSVPSETKEKIYDLIQGGFVVDHRTKKNLIQVAGI

Query:  AFRNFKYRLTNKFIKPFINDKDKLKAPPEDYSHISHEHWEVFVKSRLTKDFERQSETNREKRKKHVYNHCMLRKGYANLAEELKIDASHERPSDRCILWK
         FR FKYRLT  ++ PF++D +KLK PP +YS I  +HW  FV SRL +DF+++SE  +EKRKKH YNH   RKGYANL EELK  +S +   DR I+WK
Subjt:  AFRNFKYRLTNKFIKPFINDKDKLKAPPEDYSHISHEHWEVFVKSRLTKDFERQSETNREKRKKHVYNHCMLRKGYANLAEELKIDASHERPSDRCILWK

Query:  KARTNREGKIVHGPTQKVVERIDDLMITQDLQEENVDKYGDLLSIALGSRDRPGLVKAVELEVKLKK--HEKGSPKSKHGTPAKKTPKKSPKLKRTTP--
        +AR +R+G+I    T++VV  ID+L+ TQ+      ++  D+L+ ALG +DRPG+++ V   V  KK  H     K+      K T K+  ++ +     
Subjt:  KARTNREGKIVHGPTQKVVERIDDLMITQDLQEENVDKYGDLLSIALGSRDRPGLVKAVELEVKLKK--HEKGSPKSKHGTPAKKTPKKSPKLKRTTP--

Query:  SKNAPKKSPRSKRITPSKNDPKKSSQSMHTTPSKNASKKSPQSKRGTLSKHDYRYLTDVTVCVHLKILDKEETLE------FKEGTHCRLALGSIDNVVA
         +   K       +  SK +P   S+                      S+ D   + ++   + + I   ++ +E       K GT C+LA  + D+VVA
Subjt:  SKNAPKKSPRSKRITPSKNDPKKSSQSMHTTPSKNASKKSPQSKRGTLSKHDYRYLTDVTVCVHLKILDKEETLE------FKEGTHCRLALGSIDNVVA

Query:  ADTIFESGRKDGNVKVSIDVVVDDDSRLPIPTHGGNNILSQEIGSHILWPQNLVISDNEKEI--KHSRELFSFASGKSPTQPAPVSLTCLTREINYIGRA
          TI +S  +  NVKV+IDVVVD D  +PIP+  G   +SQE+GSHILWP++LVI++N K    + ++++ +FA   +P Q APV+L  L R + ++G A
Subjt:  ADTIFESGRKDGNVKVSIDVVVDDDSRLPIPTHGGNNILSQEIGSHILWPQNLVISDNEKEI--KHSRELFSFASGKSPTQPAPVSLTCLTREINYIGRA

Query:  IQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTTFLHRSLGNEYESSPYKFLDAGATSITNLSKENRVQVLTKRLSELELNQLLMFPYHSGD
        IQ+    DVFG   K  IM+E ++    M P  T C+DAY  +L+  + +    + YKFLDAG+ S  + SKE RVQ+LT RL   + +QLL+FPY+SG+
Subjt:  IQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTTFLHRSLGNEYESSPYKFLDAGATSITNLSKENRVQVLTKRLSELELNQLLMFPYHSGD

Query:  HWTLIVMSPAKNMTFFLD
        HWTL+V++  K   F++D
Subjt:  HWTLIVMSPAKNMTFFLD

A0A5D3D5Q6 ULP_PROTEASE domain-containing protein1.3e-11842.07Show/hide
Query:  GETSKTRKKRGPTGLHKITRISSEGHRRVIKYNVKGQPIGHNATKLKTFIGISVQHHIPTIYETWKSVPSETKEKIYDLIQGGFVVDHRTKKNLIQVAGI
        GE+S  ++KRGPT + +ITR  SEG + VI+YN  GQ IG NATKLK+FIG +V+ H+P IY  W +VP E K+KI++LI+ GFVVD R+KK +IQ AG+
Subjt:  GETSKTRKKRGPTGLHKITRISSEGHRRVIKYNVKGQPIGHNATKLKTFIGISVQHHIPTIYETWKSVPSETKEKIYDLIQGGFVVDHRTKKNLIQVAGI

Query:  AFRNFKYRLTNKFIKPFINDKDKLKAPPEDYSHISHEHWEVFVKSRLTKDFERQSETNREKRKKHVYNHCMLRKGYANLAEELKIDASHERPSDRCILWK
         FR FKYRLT  ++ PF++D +KLK PP +YS I  +HW  FV SRL +DF+++SE  +EKRKKH YNH   RKGYANL EELK  +S +   DR I+WK
Subjt:  AFRNFKYRLTNKFIKPFINDKDKLKAPPEDYSHISHEHWEVFVKSRLTKDFERQSETNREKRKKHVYNHCMLRKGYANLAEELKIDASHERPSDRCILWK

Query:  KARTNREGKIVHGPTQKVVERIDDLMITQDLQEENVDKYGDLLSIALGSRDRPGLVKAVELEVKLKK--HEKGSPKSKHGTPAKKTPKKSPKLKRTTP--
        +AR +R+G+I    T++VV  ID+L+ TQ+      ++  D+L+ ALG +DRPG+++ V   V  KK  H     K+      K T K+  ++ +     
Subjt:  KARTNREGKIVHGPTQKVVERIDDLMITQDLQEENVDKYGDLLSIALGSRDRPGLVKAVELEVKLKK--HEKGSPKSKHGTPAKKTPKKSPKLKRTTP--

Query:  SKNAPKKSPRSKRITPSKNDPKKSSQSMHTTPSKNASKKSPQSKRGTLSKHDYRYLTDVTVCVHLKILDKEETLE------FKEGTHCRLALGSIDNVVA
         +   K       +  SK +P   S+                      S+ D   + ++   + + I   ++ +E       K GT C+LA  + D+VVA
Subjt:  SKNAPKKSPRSKRITPSKNDPKKSSQSMHTTPSKNASKKSPQSKRGTLSKHDYRYLTDVTVCVHLKILDKEETLE------FKEGTHCRLALGSIDNVVA

Query:  ADTIFESGRKDGNVKVSIDVVVDDDSRLPIPTHGGNNILSQEIGSHILWPQNLVISDNEKEI--KHSRELFSFASGKSPTQPAPVSLTCLTREINYIGRA
          TI +S  +  NVKV+IDVVVD D  +PIP+  G   +SQE+GSHILWP++LVI++N K    + ++++ +FA   +P Q APV+L  L R + ++G A
Subjt:  ADTIFESGRKDGNVKVSIDVVVDDDSRLPIPTHGGNNILSQEIGSHILWPQNLVISDNEKEI--KHSRELFSFASGKSPTQPAPVSLTCLTREINYIGRA

Query:  IQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTTFLHRSLGNEYESSPYKFLDAGATSITNLSKENRVQVLTKRLSELELNQLLMFPYHSGD
        IQ+    DVFG   K  IM+E ++    M P  T C+DAY  +L+  + +    + YKFLDAG+ S  + SKE RVQ+LT RL   + +QLL+FPY+SG+
Subjt:  IQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTTFLHRSLGNEYESSPYKFLDAGATSITNLSKENRVQVLTKRLSELELNQLLMFPYHSGD

Query:  HWTLIVMSPAKNMTFFLD
        HWTL+V++  K   F++D
Subjt:  HWTLIVMSPAKNMTFFLD

A0A6J1DRS8 uncharacterized protein LOC1110237101.7e-14794.37Show/hide
Query:  MTDVGETSKTRKKRGPTGLHKITRISSEGHRRVIKYNVKGQPIGHNATKLKTFIGISVQHHIPTIYETWKSVPSETKEKIYDLIQGGFVVDHRTKKNLIQ
        MTDVGETSKTRKKRGPTGLHKITRISSEGHRRVIKYNVKGQPIGHNATKLKTFIGISVQHHIPTIYETWKSVPSETKEKIYDLIQGGFVVDHRTKKNLIQ
Subjt:  MTDVGETSKTRKKRGPTGLHKITRISSEGHRRVIKYNVKGQPIGHNATKLKTFIGISVQHHIPTIYETWKSVPSETKEKIYDLIQGGFVVDHRTKKNLIQ

Query:  VAGIAFRNFKYRLTNKFIKPFINDKDKLKAPPEDYSHISHEHWEVFVKSRLTKDFERQSETNREKRKKHVYNHCMLRKGYANLAEELKIDASHERPSDRC
        VAGIAFRNFKYRLTNKFIKPFINDKDKLKAPPEDYSHISHEHWEVFVKSRLTKDFERQSETNREKRKKHVYNHCMLRKGYANLAEELKIDASHERPSDRC
Subjt:  VAGIAFRNFKYRLTNKFIKPFINDKDKLKAPPEDYSHISHEHWEVFVKSRLTKDFERQSETNREKRKKHVYNHCMLRKGYANLAEELKIDASHERPSDRC

Query:  ILWKKARTNREGKIVHGPTQKVVERIDDLMITQDLQEENVDKYGDLLSIALGSRDRPGLVKAVELEVKLKKHEKGSPKSKHGTP
        ILWKKARTNREGKIVHGPTQKVVERIDDLMITQDLQEENVDKYGDLLSIALGSRDRPGLVKAV   +   K +     +KH TP
Subjt:  ILWKKARTNREGKIVHGPTQKVVERIDDLMITQDLQEENVDKYGDLLSIALGSRDRPGLVKAVELEVKLKKHEKGSPKSKHGTP

A0A6J1DRT3 uncharacterized protein LOC1110237111.2e-132100Show/hide
Query:  YRYLTDVTVCVHLKILDKEETLEFKEGTHCRLALGSIDNVVAADTIFESGRKDGNVKVSIDVVVDDDSRLPIPTHGGNNILSQEIGSHILWPQNLVISDN
        YRYLTDVTVCVHLKILDKEETLEFKEGTHCRLALGSIDNVVAADTIFESGRKDGNVKVSIDVVVDDDSRLPIPTHGGNNILSQEIGSHILWPQNLVISDN
Subjt:  YRYLTDVTVCVHLKILDKEETLEFKEGTHCRLALGSIDNVVAADTIFESGRKDGNVKVSIDVVVDDDSRLPIPTHGGNNILSQEIGSHILWPQNLVISDN

Query:  EKEIKHSRELFSFASGKSPTQPAPVSLTCLTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTTFLHRSLGNEYESSPYKFL
        EKEIKHSRELFSFASGKSPTQPAPVSLTCLTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTTFLHRSLGNEYESSPYKFL
Subjt:  EKEIKHSRELFSFASGKSPTQPAPVSLTCLTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTTFLHRSLGNEYESSPYKFL

Query:  DAGATSITNLSKENRVQVLTKRLSELELNQLLMFPYHSG
        DAGATSITNLSKENRVQVLTKRLSELELNQLLMFPYHSG
Subjt:  DAGATSITNLSKENRVQVLTKRLSELELNQLLMFPYHSG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTAGCTAGTCTTTTAAGTGATGGAATGACATTTGAGATATACGTAGAACATAAATGCGAAGATGAAGCAATATTGTCAGACAATGATAGTGATGCATTACCTTA
TATGAGTGGCTGTGAATCTGAATTTAAAGATGATGATAATACATATTTTGATAAGAGTGTAGAGGGAACTTCAAAATCATTGGATATCATAAGAACATATGAGAATGTTG
CGAAGAATGATGGAAGTGACATTGGCTTGAGTGACCTAATCTCATTGAAGGACTCGTCGATTCTGAAGAATGTCCTAGATGATTACGCTGTTAAGGGAGGATGGCATATA
CGATTTGTGAAAAATGATAAGATAAGAGTTAGAGCCAAATGCATGGATGGTTGTAAATGGTTGGCTTATGTAGCAAAGGTAAAAGGAGAAATGACTTATCAATTGAAGAC
CTATGTTGATGACCACTCATGTAGTAGATGCTTCAACAATCCGCGCCGAACATCTAGGCCATTTATTGGTTTAGATGCTTGCCATCTGAAAGGTCCTTCTGGTGGACAAC
TAATTGTAGCAGTTGGAAGAGATCCTAATGACCAATACTTTCCCTTAGCCTTTTCAGTTATTGAGGCAGAGACTAAGGACTCATGGACTTGTGTAGAGCTCGTTACAACG
ATTCCAACGACTGTTAGATCATCCAAATTGGAGCTCAAATGGAGAAGTTACGATCGAAACACTACCCGAGAAGATGACGTGGACAGTTCATGGTCCTCCCTACGCTGCCT
TTCATGGGATGATCTCATGCTCTTGAAGTTTGACAACAAGTTTAATCATACTGACAATGGACAAATCATGGATGAAAAAAATAGATTGTCTGAAGATTATGAGTTAGGAG
TCGAGTCATTCATTGAATTTGGGTTCCATCATGCAAATGGTTCTAAGGTTATACGTTGTCCCTGCTTGAAATGTGGAAACCGTATAGATAAGGATGCTGCAACCATTAGA
AATCACTTGTACGAACACGGTATTGACCAGAGCTATAGGGTATGGTTCTGGCATGGTGAAGAACATGCACCTAGAACAGATGAGGATAGGTTGAATGGTGAGTTCAATAA
GAATCATGAAGATGATAATGATTTGTTCGATGTGATTAACATGGTTCAGACTGTTTATGATGGAATTTCACATGTACCGAAATCGTTTGAAAATATGTTCGATAATGCTA
AGAAACCATTATACCCTGGATATGCAGTTGGACTTGGCTTGCAAAACAATGTAGAAGAGAGTCTCCGTGATAGGCCTTTATCAGCTGGATTATACGTAAAGCCGAGTGTT
GAACATTTAAAGCAAGCTCATATTTACGTATTGGGGAACACTGAAGAAGTGGAACCATATCAAAGACAACACATGAAACACTTGAAAGAGGAAAATCCGAATAGGTCAAA
TAATATGAAGTGGCTTCAGAATGAACACACCAGAAGTTCAGCAACTGGATACGTGATGAGCGAAATTACGGACGGTAAATGCATTTCGGAAACGTTGCGCTGGATGGCTC
ATGGGCCGAACCCAGAGGTTAGGATTTACACTTATTCCGATGATGATCTTGATGATTCTTCGTTGCACTATACTTCAATCTCGAAGGACACACCATCAGCTGAAGTTGAA
GAAAATGGCCCACTATATACAAGAGAAGATTGTGAAGGCACCTGGAACGACAAAATGACAGATGTAGGCGAAACCAGCAAGACGAGGAAGAAGCGTGGACCTACGGGGTT
GCACAAGATTACACGAATTAGCAGCGAAGGTCACCGTCGGGTTATTAAGTATAACGTGAAAGGGCAACCGATTGGACATAATGCGACAAAATTGAAGACTTTTATTGGAA
TAAGTGTGCAACACCATATTCCCACAATATACGAAACATGGAAATCGGTTCCATCCGAGACGAAGGAGAAAATCTATGACTTGATACAGGGTGGATTTGTAGTTGATCAT
AGAACCAAAAAAAATCTAATTCAAGTGGCTGGGATTGCTTTTAGAAACTTTAAGTACAGGCTCACGAACAAGTTCATTAAGCCATTCATAAACGACAAGGATAAATTAAA
AGCTCCTCCGGAAGACTACTCGCATATTTCGCATGAACATTGGGAGGTGTTCGTGAAATCTAGATTGACTAAAGACTTTGAGAGACAAAGTGAGACGAATAGAGAAAAAC
GTAAAAAGCACGTGTATAACCATTGCATGTTGCGGAAGGGTTATGCAAACCTTGCTGAGGAATTGAAAATTGATGCATCACATGAGAGACCATCAGATCGTTGCATCTTA
TGGAAGAAGGCTCGAACGAATCGCGAGGGGAAGATTGTGCATGGACCGACACAGAAAGTCGTAGAGCGCATAGATGATCTCATGATAACGCAAGACTTGCAAGAAGAGAA
TGTGGACAAATATGGGGATTTGTTATCCATCGCACTGGGCAGTCGGGATCGGCCCGGACTTGTTAAAGCAGTAGAACTTGAAGTGAAACTGAAGAAGCATGAAAAAGGTT
CCCCAAAATCAAAACATGGCACGCCAGCTAAGAAGACTCCAAAAAAATCTCCTAAACTGAAGCGTACCACACCATCTAAAAATGCTCCAAAGAAATCTCCTCGGTCGAAG
CGTATCACACCATCAAAAAACGATCCAAAGAAATCTTCTCAGTCAATGCATACCACACCATCTAAAAATGCTTCAAAAAAATCTCCACAATCAAAGCGTGGTACTTTATC
GAAGCATGATTACCGATACTTAACTGATGTTACTGTATGTGTTCACTTGAAGATATTGGACAAGGAAGAAACTTTGGAGTTCAAAGAGGGAACTCATTGTCGCCTGGCAC
TTGGATCCATCGATAATGTTGTCGCTGCGGACACTATATTTGAATCTGGGAGGAAGGATGGAAACGTGAAAGTGTCCATAGACGTGGTGGTTGATGACGACTCCCGACTT
CCAATTCCGACGCATGGAGGAAACAATATTCTCTCACAAGAAATAGGTTCACATATATTATGGCCACAAAATCTAGTCATATCCGATAATGAGAAGGAAATAAAACATTC
GAGGGAGCTGTTCAGTTTCGCAAGTGGAAAATCTCCAACACAACCTGCGCCCGTTAGTCTAACATGTTTAACTCGTGAGATAAACTACATTGGAAGGGCAATTCAAATGA
TTATATCGAAGGATGTGTTCGGTCATGAACATAAGTTGTTTATCATGGTGGAGGATGTACAGAAGTTGTTTCATATGGAACCGACAACTACTCCGTGCATTGATGCCTAC
ACGACGTTCTTACATAGATCGTTGGGCAACGAATATGAATCAAGCCCGTACAAGTTTCTAGATGCTGGGGCCACTTCCATAACTAACCTGTCTAAAGAAAACCGCGTGCA
AGTATTGACTAAAAGACTCTCAGAATTGGAGTTGAACCAACTGCTGATGTTTCCATATCATTCCGGAGATCATTGGACGTTAATAGTGATGTCTCCCGCAAAGAATATGA
CATTTTTTCTTGACTCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTAGCTAGTCTTTTAAGTGATGGAATGACATTTGAGATATACGTAGAACATAAATGCGAAGATGAAGCAATATTGTCAGACAATGATAGTGATGCATTACCTTA
TATGAGTGGCTGTGAATCTGAATTTAAAGATGATGATAATACATATTTTGATAAGAGTGTAGAGGGAACTTCAAAATCATTGGATATCATAAGAACATATGAGAATGTTG
CGAAGAATGATGGAAGTGACATTGGCTTGAGTGACCTAATCTCATTGAAGGACTCGTCGATTCTGAAGAATGTCCTAGATGATTACGCTGTTAAGGGAGGATGGCATATA
CGATTTGTGAAAAATGATAAGATAAGAGTTAGAGCCAAATGCATGGATGGTTGTAAATGGTTGGCTTATGTAGCAAAGGTAAAAGGAGAAATGACTTATCAATTGAAGAC
CTATGTTGATGACCACTCATGTAGTAGATGCTTCAACAATCCGCGCCGAACATCTAGGCCATTTATTGGTTTAGATGCTTGCCATCTGAAAGGTCCTTCTGGTGGACAAC
TAATTGTAGCAGTTGGAAGAGATCCTAATGACCAATACTTTCCCTTAGCCTTTTCAGTTATTGAGGCAGAGACTAAGGACTCATGGACTTGTGTAGAGCTCGTTACAACG
ATTCCAACGACTGTTAGATCATCCAAATTGGAGCTCAAATGGAGAAGTTACGATCGAAACACTACCCGAGAAGATGACGTGGACAGTTCATGGTCCTCCCTACGCTGCCT
TTCATGGGATGATCTCATGCTCTTGAAGTTTGACAACAAGTTTAATCATACTGACAATGGACAAATCATGGATGAAAAAAATAGATTGTCTGAAGATTATGAGTTAGGAG
TCGAGTCATTCATTGAATTTGGGTTCCATCATGCAAATGGTTCTAAGGTTATACGTTGTCCCTGCTTGAAATGTGGAAACCGTATAGATAAGGATGCTGCAACCATTAGA
AATCACTTGTACGAACACGGTATTGACCAGAGCTATAGGGTATGGTTCTGGCATGGTGAAGAACATGCACCTAGAACAGATGAGGATAGGTTGAATGGTGAGTTCAATAA
GAATCATGAAGATGATAATGATTTGTTCGATGTGATTAACATGGTTCAGACTGTTTATGATGGAATTTCACATGTACCGAAATCGTTTGAAAATATGTTCGATAATGCTA
AGAAACCATTATACCCTGGATATGCAGTTGGACTTGGCTTGCAAAACAATGTAGAAGAGAGTCTCCGTGATAGGCCTTTATCAGCTGGATTATACGTAAAGCCGAGTGTT
GAACATTTAAAGCAAGCTCATATTTACGTATTGGGGAACACTGAAGAAGTGGAACCATATCAAAGACAACACATGAAACACTTGAAAGAGGAAAATCCGAATAGGTCAAA
TAATATGAAGTGGCTTCAGAATGAACACACCAGAAGTTCAGCAACTGGATACGTGATGAGCGAAATTACGGACGGTAAATGCATTTCGGAAACGTTGCGCTGGATGGCTC
ATGGGCCGAACCCAGAGGTTAGGATTTACACTTATTCCGATGATGATCTTGATGATTCTTCGTTGCACTATACTTCAATCTCGAAGGACACACCATCAGCTGAAGTTGAA
GAAAATGGCCCACTATATACAAGAGAAGATTGTGAAGGCACCTGGAACGACAAAATGACAGATGTAGGCGAAACCAGCAAGACGAGGAAGAAGCGTGGACCTACGGGGTT
GCACAAGATTACACGAATTAGCAGCGAAGGTCACCGTCGGGTTATTAAGTATAACGTGAAAGGGCAACCGATTGGACATAATGCGACAAAATTGAAGACTTTTATTGGAA
TAAGTGTGCAACACCATATTCCCACAATATACGAAACATGGAAATCGGTTCCATCCGAGACGAAGGAGAAAATCTATGACTTGATACAGGGTGGATTTGTAGTTGATCAT
AGAACCAAAAAAAATCTAATTCAAGTGGCTGGGATTGCTTTTAGAAACTTTAAGTACAGGCTCACGAACAAGTTCATTAAGCCATTCATAAACGACAAGGATAAATTAAA
AGCTCCTCCGGAAGACTACTCGCATATTTCGCATGAACATTGGGAGGTGTTCGTGAAATCTAGATTGACTAAAGACTTTGAGAGACAAAGTGAGACGAATAGAGAAAAAC
GTAAAAAGCACGTGTATAACCATTGCATGTTGCGGAAGGGTTATGCAAACCTTGCTGAGGAATTGAAAATTGATGCATCACATGAGAGACCATCAGATCGTTGCATCTTA
TGGAAGAAGGCTCGAACGAATCGCGAGGGGAAGATTGTGCATGGACCGACACAGAAAGTCGTAGAGCGCATAGATGATCTCATGATAACGCAAGACTTGCAAGAAGAGAA
TGTGGACAAATATGGGGATTTGTTATCCATCGCACTGGGCAGTCGGGATCGGCCCGGACTTGTTAAAGCAGTAGAACTTGAAGTGAAACTGAAGAAGCATGAAAAAGGTT
CCCCAAAATCAAAACATGGCACGCCAGCTAAGAAGACTCCAAAAAAATCTCCTAAACTGAAGCGTACCACACCATCTAAAAATGCTCCAAAGAAATCTCCTCGGTCGAAG
CGTATCACACCATCAAAAAACGATCCAAAGAAATCTTCTCAGTCAATGCATACCACACCATCTAAAAATGCTTCAAAAAAATCTCCACAATCAAAGCGTGGTACTTTATC
GAAGCATGATTACCGATACTTAACTGATGTTACTGTATGTGTTCACTTGAAGATATTGGACAAGGAAGAAACTTTGGAGTTCAAAGAGGGAACTCATTGTCGCCTGGCAC
TTGGATCCATCGATAATGTTGTCGCTGCGGACACTATATTTGAATCTGGGAGGAAGGATGGAAACGTGAAAGTGTCCATAGACGTGGTGGTTGATGACGACTCCCGACTT
CCAATTCCGACGCATGGAGGAAACAATATTCTCTCACAAGAAATAGGTTCACATATATTATGGCCACAAAATCTAGTCATATCCGATAATGAGAAGGAAATAAAACATTC
GAGGGAGCTGTTCAGTTTCGCAAGTGGAAAATCTCCAACACAACCTGCGCCCGTTAGTCTAACATGTTTAACTCGTGAGATAAACTACATTGGAAGGGCAATTCAAATGA
TTATATCGAAGGATGTGTTCGGTCATGAACATAAGTTGTTTATCATGGTGGAGGATGTACAGAAGTTGTTTCATATGGAACCGACAACTACTCCGTGCATTGATGCCTAC
ACGACGTTCTTACATAGATCGTTGGGCAACGAATATGAATCAAGCCCGTACAAGTTTCTAGATGCTGGGGCCACTTCCATAACTAACCTGTCTAAAGAAAACCGCGTGCA
AGTATTGACTAAAAGACTCTCAGAATTGGAGTTGAACCAACTGCTGATGTTTCCATATCATTCCGGAGATCATTGGACGTTAATAGTGATGTCTCCCGCAAAGAATATGA
CATTTTTTCTTGACTCGTGA
Protein sequenceShow/hide protein sequence
MELASLLSDGMTFEIYVEHKCEDEAILSDNDSDALPYMSGCESEFKDDDNTYFDKSVEGTSKSLDIIRTYENVAKNDGSDIGLSDLISLKDSSILKNVLDDYAVKGGWHI
RFVKNDKIRVRAKCMDGCKWLAYVAKVKGEMTYQLKTYVDDHSCSRCFNNPRRTSRPFIGLDACHLKGPSGGQLIVAVGRDPNDQYFPLAFSVIEAETKDSWTCVELVTT
IPTTVRSSKLELKWRSYDRNTTREDDVDSSWSSLRCLSWDDLMLLKFDNKFNHTDNGQIMDEKNRLSEDYELGVESFIEFGFHHANGSKVIRCPCLKCGNRIDKDAATIR
NHLYEHGIDQSYRVWFWHGEEHAPRTDEDRLNGEFNKNHEDDNDLFDVINMVQTVYDGISHVPKSFENMFDNAKKPLYPGYAVGLGLQNNVEESLRDRPLSAGLYVKPSV
EHLKQAHIYVLGNTEEVEPYQRQHMKHLKEENPNRSNNMKWLQNEHTRSSATGYVMSEITDGKCISETLRWMAHGPNPEVRIYTYSDDDLDDSSLHYTSISKDTPSAEVE
ENGPLYTREDCEGTWNDKMTDVGETSKTRKKRGPTGLHKITRISSEGHRRVIKYNVKGQPIGHNATKLKTFIGISVQHHIPTIYETWKSVPSETKEKIYDLIQGGFVVDH
RTKKNLIQVAGIAFRNFKYRLTNKFIKPFINDKDKLKAPPEDYSHISHEHWEVFVKSRLTKDFERQSETNREKRKKHVYNHCMLRKGYANLAEELKIDASHERPSDRCIL
WKKARTNREGKIVHGPTQKVVERIDDLMITQDLQEENVDKYGDLLSIALGSRDRPGLVKAVELEVKLKKHEKGSPKSKHGTPAKKTPKKSPKLKRTTPSKNAPKKSPRSK
RITPSKNDPKKSSQSMHTTPSKNASKKSPQSKRGTLSKHDYRYLTDVTVCVHLKILDKEETLEFKEGTHCRLALGSIDNVVAADTIFESGRKDGNVKVSIDVVVDDDSRL
PIPTHGGNNILSQEIGSHILWPQNLVISDNEKEIKHSRELFSFASGKSPTQPAPVSLTCLTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAY
TTFLHRSLGNEYESSPYKFLDAGATSITNLSKENRVQVLTKRLSELELNQLLMFPYHSGDHWTLIVMSPAKNMTFFLDS