; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g17730 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g17730
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionBromo domain-containing protein
Genome locationchr3:11753392..11771570
RNA-Seq ExpressionMoc03g17730
SyntenyMoc03g17730
Gene Ontology termsNA
InterPro domainsIPR025312 - Domain of unknown function DUF4216
IPR036008 - Aconitase, iron-sulfur domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3956246.1 hypothetical protein CMV_018610 [Castanea mollissima]5.7e-8031.65Show/hide
Query:  SRRLTTKEWRQAHLYILKNCDDVLPYIGEHIPALQHVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEVDKQLLDLARGPSQEVMYYKGYIVNGFRFRIK
        SR+L+ +EW QAHLY+LKNCD+VLPYI EH   +Q    +N + +H K+F EWFESH+T+LY++   +V KQL DLARGP ++ + Y GYIVNGFRFR  
Subjt:  SRRLTTKEWRQAHLYILKNCDDVLPYIGEHIPALQHVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEVDKQLLDLARGPSQEVMYYKGYIVNGFRFRIK

Query:  DADDLRKTQNSGVVVRG---TNNREYFGVLHEIIKLQYMRGNNVVLFKCKWWDINSRGKGIKVDDHGLTSVNMTYTFAVNEPFVLACQSEQVVYLEDRRN
        + D  R+TQ+ GV+V+G   T NR+Y+GVL +II+L YM GN + +FKC+W D++  G+GI VD +G T VN+T +   NEPFVLACQ+EQV Y      
Subjt:  DADDLRKTQNSGVVVRG---TNNREYFGVLHEIIKLQYMRGNNVVLFKCKWWDINSRGKGIKVDDHGLTSVNMTYTFAVNEPFVLACQSEQVVYLEDRRN

Query:  PTWYFVLKVDPRDYYKIPVIKDVDCQEEDGEKMKNFSSHRAKIKFQDRNSINSHENSTVGATISTASTKSKTQVGLEASSSFTGEIGSTTSYERTVNGGP
                                         K  +     +KF                                            T++++      
Subjt:  PTWYFVLKVDPRDYYKIPVIKDVDCQEEDGEKMKNFSSHRAKIKFQDRNSINSHENSTVGATISTASTKSKTQVGLEASSSFTGEIGSTTSYERTVNGGP

Query:  RKVRGPTRNLKLVRLPHGIRFEVSWRNKRPVGDNADIFKSQCTILARQVNFTPLQERFKVEGHETSVLRQLNRSYNNWRDSLKKKWLYKY-DTVAEALAN
                              V + NKR +                   F  + E FKVEG E  V RQ+  SY+N+RDSLKKKW   Y + + EA  N
Subjt:  RKVRGPTRNLKLVRLPHGIRFEVSWRNKRPVGDNADIFKSQCTILARQVNFTPLQERFKVEGHETSVLRQLNRSYNNWRDSLKKKWLYKY-DTVAEALAN

Query:  IPPEITREDAEYLANLWKSTEYQEMCERNKVNRSKLNVYHTCGSQSIQQRVEKESE--------------------------------------------
        +PP + ++D   L +LW   +Y ++C +NK NR K ++ HT GS+S QQR  +E E                                            
Subjt:  IPPEITREDAEYLANLWKSTEYQEMCERNKVNRSKLNVYHTCGSQSIQQRVEKESE--------------------------------------------

Query:  --------------------------------------------------------------------------------LKSYASQVMDGTLDMNNDEI
                                                                                        L++  +QV +G L ++ D++
Subjt:  --------------------------------------------------------------------------------LKSYASQVMDGTLDMNNDEI

Query:  FVQVFGPEKHGRVRGYGAGVTPSELFGSSS-KVRDLERRLNESEQRLQESKRQR
        FV+VFGPE+HGRVRGYGAGVTP+ L+GSSS ++ DLE+RL ESEQ+  ES+ +R
Subjt:  FVQVFGPEKHGRVRGYGAGVTPSELFGSSS-KVRDLERRLNESEQRLQESKRQR

RWR98116.1 hypothetical protein CKAN_02761400 [Cinnamomum micranthum f. kanehirae]3.8e-7631.82Show/hide
Query:  RRLTTKEWRQAHLYILKNCDDVLPYIGEHIPALQ-----HVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEVDKQLLDLARGPSQEVMYYKGYIVNGFR
        R L+T EW QAHLY+L NCDDV P++  H   ++      V  R+  RKH KEF +WFE H+          V++QL  LA  P + V  +KGYI+NGFR
Subjt:  RRLTTKEWRQAHLYILKNCDDVLPYIGEHIPALQ-----HVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEVDKQLLDLARGPSQEVMYYKGYIVNGFR

Query:  FRIKDADDLRKTQNSGVVVRG--------------TNNREYFGVLHEIIKLQYMRGNNVVLFKCKWWDI-NSRGKGIKVDDHGLTSVNMTYTFAVNEPFV
        F  +D +  RKTQNSGV +                T N  Y+GVL ++I+LQY+ GN +VLFKC WWD+ N  G+GIK D++G T VN T T   NEPF+
Subjt:  FRIKDADDLRKTQNSGVVVRG--------------TNNREYFGVLHEIIKLQYMRGNNVVLFKCKWWDI-NSRGKGIKVDDHGLTSVNMTYTFAVNEPFV

Query:  LACQSEQVVYLEDRRNPTWYFVLKVDPRDYYKIPVIKDVDCQEEDGEKMKNFSSHRAKIKFQDRN-SINSHENSTVGATISTASTKSKTQVGLEASSSFT
        LA Q++QV Y++D   P W+  +K+ PRD Y +   + V CQ+ + +       H       D N  + +     +   +  A+     Q        F 
Subjt:  LACQSEQVVYLEDRRNPTWYFVLKVDPRDYYKIPVIKDVDCQEEDGEKMKNFSSHRAKIKFQDRN-SINSHENSTVGATISTASTKSKTQVGLEASSSFT

Query:  GEIGSTTSYERTVNG-------GPRKVRGPTRNLKLV-RLPHGIRFEVSWRNKRPVGDNADIFKSQCTILARQVNFTPLQERFKVEGHETS---------
         +     ++    +G       G ++ RG +R +KL  RLP            +P+GDNA  F + C I+ R     PLQ     +  E S         
Subjt:  GEIGSTTSYERTVNG-------GPRKVRGPTRNLKLV-RLPHGIRFEVSWRNKRPVGDNADIFKSQCTILARQVNFTPLQERFKVEGHETS---------

Query:  ------------VLRQLNRSYNNWRDSLKKKWLYKYDTVAEALANIPPEITREDAEYLANLWKSTEYQEMCERNKVNRSKLNVYHTCGSQSIQQRVEKE-
                    VL+ L + Y ++R  +K K+   ++T  E L + PP+++ ED  +L   W S E Q +  +NK NRS   + HT G++S  +  ++E 
Subjt:  ------------VLRQLNRSYNNWRDSLKKKWLYKYDTVAEALANIPPEITREDAEYLANLWKSTEYQEMCERNKVNRSKLNVYHTCGSQSIQQRVEKE-

Query:  -----------------------------------SELKSYASQVMDGTLDMNNDEIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSK-----VRDLERR
                                           S+ K  ++    G+     D+I+ QV G E+HGRVRGYG G TP+ +FGS+S+       +L+  
Subjt:  -----------------------------------SELKSYASQVMDGTLDMNNDEIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSK-----VRDLERR

Query:  LNESEQRLQESKRQRK
        L+   ++LQ+ +  +K
Subjt:  LNESEQRLQESKRQRK

XP_016180369.1 uncharacterized protein LOC107622839 isoform X2 [Arachis ipaensis]1.2e-6447.13Show/hide
Query:  HGYNEEPNEEVSKFYTLLNDAEKELYPGCKQSRRLTTKEWRQAHLYILKNCDDVLPYIGEHIPALQHVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEV
        HGY  +  ++V   +       +     CK +RRL+ +E +Q+HLYILKNCD V P+I +H   L+  + RN Q++HD+EF +WFESHVT LY K   +V
Subjt:  HGYNEEPNEEVSKFYTLLNDAEKELYPGCKQSRRLTTKEWRQAHLYILKNCDDVLPYIGEHIPALQHVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEV

Query:  DKQLLDLARGPSQEVMYYKGYIVNGFRFRIKDADDLRKTQNSGVVVRGTNNREYFGVLHEIIKLQYMRGNNVVLFKCKWWDINSRGKGIKVDDHGLTSVN
          QLL LARGP++E + YKGY  NGF F  KD ++ RKTQ+SGV+V+    +EY+GV+ +I++L YM  N VV+FKC WWD+++ G+G+KVD++G+T VN
Subjt:  DKQLLDLARGPSQEVMYYKGYIVNGFRFRIKDADDLRKTQNSGVVVRGTNNREYFGVLHEIIKLQYMRGNNVVLFKCKWWDINSRGKGIKVDDHGLTSVN

Query:  MTYTFAVNEPFVLACQSEQVVYLEDRRNPTWYFVLKVDPRDYYKIPVIKDVDCQEEDGEKM
           T    E FV+ACQ EQV Y+ED  N  W  V+KV PRDY+ +P +++ + +EE  E M
Subjt:  MTYTFAVNEPFVLACQSEQVVYLEDRRNPTWYFVLKVDPRDYYKIPVIKDVDCQEEDGEKM

XP_020967551.1 uncharacterized protein LOC107618200 [Arachis ipaensis]5.7e-7237.78Show/hide
Query:  EHIPALQHVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEVDKQLLDLARGPSQEVMYYKGYIVNGFRFRIKDADDLRKTQNSGVVVRGTNNREYFGVLH
        +H   L+  + RN Q++HD+EF +WFESHVT LY +   +V  QLL LARGP++E + YKGY  NGF F  KD ++ RKTQ+SGV+V+    +EY+GV+ 
Subjt:  EHIPALQHVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEVDKQLLDLARGPSQEVMYYKGYIVNGFRFRIKDADDLRKTQNSGVVVRGTNNREYFGVLH

Query:  EIIKLQYMRGNNVVLFKCKWWDINSRGKGIKVDDHGLTSVNMTYTFAVNEPFVLACQSEQVVYLEDRRNPTWYFVLKVDPRDYYKIPVIKDVD-CQEEDG
        +I++L YM  N VV+FKC WWD+++ G+G+KVD++G+T    +    +N   V A +++ +  L  RRN        + P  +     ++  D     DG
Subjt:  EIIKLQYMRGNNVVLFKCKWWDINSRGKGIKVDDHGLTSVNMTYTFAVNEPFVLACQSEQVVYLEDRRNPTWYFVLKVDPRDYYKIPVIKDVD-CQEEDG

Query:  EKMKNFSSHRAKIKFQDRNSINSHENSTVGATISTASTKSKTQVGLEASSSFTGEIGSTTSYERTVNGGPRKVRGPTRNLKLVRLPHGIRFEVSWRNKRP
           K  SS       Q R   +S  N   G     AS+ S +Q+  +  S    +      +E       +KVRGPTRNL+L +LP G R +++WR  RP
Subjt:  EKMKNFSSHRAKIKFQDRNSINSHENSTVGATISTASTKSKTQVGLEASSSFTGEIGSTTSYERTVNGGPRKVRGPTRNLKLVRLPHGIRFEVSWRNKRP

Query:  VGDNADIFKSQCTILARQVNFTPLQER--------------------FKVEGHETSVLRQLNRSYNNWRDSLKKKWLYKYDTVAEALANIPPEITREDAE
        VG NA +FKS+CT+L R V   PL+ +                    FKVEG +  V  Q+N SY ++R  LKK++   YD    A ANIP E+ +ED +
Subjt:  VGDNADIFKSQCTILARQVNFTPLQER--------------------FKVEGHETSVLRQLNRSYNNWRDSLKKKWLYKYDTVAEALANIPPEITREDAE

Query:  YLANLWKSTEYQEMCERNKVNRSKLNVYHTCGSQSIQQRVEK
        YL NLW    +Q++ ++NK++R+  ++ HT G++  QQR E+
Subjt:  YLANLWKSTEYQEMCERNKVNRSKLNVYHTCGSQSIQQRVEK

XP_029151584.1 uncharacterized protein LOC112778502 [Arachis hypogaea]4.4e-7237.75Show/hide
Query:  EHIPALQHVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEVDKQLLDLARGPSQEVMYYKGYIVNGFRFRIKDADDLRKTQNSGVVVRGTNNREYFGVLH
        +H   L+  + RN Q++HD+EF +WFESHVT LY +   +V  QLL LARGP++E + YKGY  NGF F  KD ++ RKTQ+SGV+V+    +EY+GV+ 
Subjt:  EHIPALQHVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEVDKQLLDLARGPSQEVMYYKGYIVNGFRFRIKDADDLRKTQNSGVVVRGTNNREYFGVLH

Query:  EIIKLQYMRGNNVVLFKCKWWDINSRGKGIKVDDHGLTSVNMTYTFAVNEPFVLACQSEQVVYLEDRRNPTWYFVLKVDPRDYYKIPVIKDVD-CQEEDG
        +I++L YM  N VV+FKC WWD+++ G+G+KVD++G+T    +    +N   V A +++ +  L  RRN        + P  +     ++  D     DG
Subjt:  EIIKLQYMRGNNVVLFKCKWWDINSRGKGIKVDDHGLTSVNMTYTFAVNEPFVLACQSEQVVYLEDRRNPTWYFVLKVDPRDYYKIPVIKDVD-CQEEDG

Query:  EKMKNFSSHRAKIKFQDRNSINSHENSTVGATISTASTKSKTQVGLEASSSFTGEIGSTTSYERTVNGGPRKVRGPTRNLKLVRLPHGIRFEVSWRNKRP
           K  SS       Q R   +S  N   G     AS+ S +Q+  +  S    +      +E       +KVRGPTRNL+L +LP G R +++WR  RP
Subjt:  EKMKNFSSHRAKIKFQDRNSINSHENSTVGATISTASTKSKTQVGLEASSSFTGEIGSTTSYERTVNGGPRKVRGPTRNLKLVRLPHGIRFEVSWRNKRP

Query:  VGDNADIFKSQCTILARQVNFTPLQER--------------------FKVEGHETSVLRQLNRSYNNWRDSLKKKWLYKYDTVAEALANIPPEITREDAE
        VG NA +FKS+CT+L R V   PL+ +                    FKVEG +  V  Q+N SY ++R  LKK++   YD    A ANIP E+ +ED +
Subjt:  VGDNADIFKSQCTILARQVNFTPLQER--------------------FKVEGHETSVLRQLNRSYNNWRDSLKKKWLYKYDTVAEALANIPPEITREDAE

Query:  YLANLWKSTEYQEMCERNKVNRSKLNVYHTCGSQSIQQRVEKESE
        YL NLW    +Q++ ++NK++R+  ++ HT G++  QQR E+  E
Subjt:  YLANLWKSTEYQEMCERNKVNRSKLNVYHTCGSQSIQQRVEKESE

TrEMBL top hitse value%identityAlignment
A0A2N9GUI6 Uncharacterized protein1.8e-7156.52Show/hide
Query:  SRRLTTKEWRQAHLYILKNCDDVLPYIGEHIPALQHVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEVDKQLLDLARGPSQEVMYYKGYIVNGFRFRIK
        SR+L+ +EW QAH Y+LKNCD+V  +I EH   ++   ARN + +H K+FIEWFESH+TKLY++ + +V KQLLDLARGPSQE   Y GYIVNGFRFR  
Subjt:  SRRLTTKEWRQAHLYILKNCDDVLPYIGEHIPALQHVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEVDKQLLDLARGPSQEVMYYKGYIVNGFRFRIK

Query:  DADDLRKTQNSGVVVRG---TNNREYFGVLHEIIKLQYMRGNNVVLFKCKWWDINSRGKGIKVDDHGLTSVNMTYTFAVNEPFVLACQSEQVVYLEDRRN
        + D  RKTQ+ GV+V+G   T N +Y+GVL +II+L+YM GN +V+FKC+WWD+N+ G+GI VD++G T VN+T     NEPFVLACQ EQV Y++D +N
Subjt:  DADDLRKTQNSGVVVRG---TNNREYFGVLHEIIKLQYMRGNNVVLFKCKWWDINSRGKGIKVDDHGLTSVNMTYTFAVNEPFVLACQSEQVVYLEDRRN

Query:  PTWYFVLKVDPRDYYKIPVIKD-VDCQEED
        P W FV+K +PR+YY +P  +D  D +EED
Subjt:  PTWYFVLKVDPRDYYKIPVIKD-VDCQEED

A0A438CRS2 DUF4216 domain-containing protein3.4e-6229.47Show/hide
Query:  GCKQSRRLTTKEWRQAHLYILKNCDDVLPYIGEHIPALQ---HVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEVDKQLLDLARGPSQEVMYYKGYIVN
        G   SR L+T+EW QAHLY+L NC +V+ +I EH  +++    + AR+    H +EFI WFE  + ++ ++G   + + +L L+RGPS  V  Y+GYI+N
Subjt:  GCKQSRRLTTKEWRQAHLYILKNCDDVLPYIGEHIPALQ---HVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEVDKQLLDLARGPSQEVMYYKGYIVN

Query:  GFRFRIKDADDLRKTQNSGVVV----------RGTN----NREYFGVLHEIIKLQYMRGNNVVLFKCKWWDINSRGKGIKVDDHGLTSVNMTYTFAVNEP
        GFRF  ++ +  +KT NSGVVV          R  N    +  Y+GVL ++I+L Y+ GN V+LFKC WWD+ + G+G+K D++G T +N   T   +EP
Subjt:  GFRFRIKDADDLRKTQNSGVVV----------RGTN----NREYFGVLHEIIKLQYMRGNNVVLFKCKWWDINSRGKGIKVDDHGLTSVNMTYTFAVNEP

Query:  FVLACQSEQVVYLEDRRNPTWYFVLKVDPRDYYKIPVIKDVDCQEEDGEKMKNFSSHRAKIKFQDRNSINSHENSTVGATIST---ASTKSKTQV-----
        FVLA Q++QV Y+++      + V+++  R  Y +   K      E  +++    S R   +  + + IN   N   G TI T   +  + + Q+     
Subjt:  FVLACQSEQVVYLEDRRNPTWYFVLKVDPRDYYKIPVIKDVDCQEEDGEKMKNFSSHRAKIKFQDRNSINSHENSTVGATIST---ASTKSKTQV-----

Query:  ---------------------GLEASSSFTGEIGSTTSYERTVNGGPRKVRGPTRNLKLVRLPHGIRFEVSWRNKRPVGDNADIFKSQCTILARQVNFTP
                                + SS    +GS++S +RT        RG TRNL L+ +  G         ++ + D+  I   +  ++ +  +   
Subjt:  ---------------------GLEASSSFTGEIGSTTSYERTVNGGPRKVRGPTRNLKLVRLPHGIRFEVSWRNKRPVGDNADIFKSQCTILARQVNFTP

Query:  LQERFKVEGHETSVLRQLNRSYNN-------WRDSLKKKWLYKYDTVAEALANIPPEITREDAEYLANLWKSTEYQEMCERNKVNRSKLNVYHTCGSQSI
        + E +            LN  Y +       +R+ +K K+   Y+T  E L + PP ++ +D  +L + W + E +++ E+NK NR+K  + HT GS+S 
Subjt:  LQERFKVEGHETSVLRQLNRSYNN-------WRDSLKKKWLYKYDTVAEALANIPPEITREDAEYLANLWKSTEYQEMCERNKVNRSKLNVYHTCGSQSI

Query:  QQRVEKESELKSYASQV------------MDGT-LDMNN------------------------------DEIFVQVFGPEKHGRVRGYGAGVTPSELFGS
         Q   ++++ K   S+              DGT +D ++                              DEI+ QV GPE+HGRVRGYG G T + +FGS
Subjt:  QQRVEKESELKSYASQV------------MDGT-LDMNN------------------------------DEIFVQVFGPEKHGRVRGYGAGVTPSELFGS

Query:  SSKVRD---LERRLNESEQRL
        +S+ R    L  +L  +++ L
Subjt:  SSKVRD---LERRLNESEQRL

A0A438HJE6 Uncharacterized protein1.5e-6230.49Show/hide
Query:  GCKQSRRLTTKEWRQAHLYILKNCDDVLPYIGEHIPALQ---HVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEVDKQLLDLARGPSQEVMYYKGYIVN
        G   SR L+T+EW QAHLY+L NC++V  +I EH  +++    + AR+    H +EFI WFE  + ++ ++G   + + +L L+RGPS  V  Y+GYI+N
Subjt:  GCKQSRRLTTKEWRQAHLYILKNCDDVLPYIGEHIPALQ---HVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEVDKQLLDLARGPSQEVMYYKGYIVN

Query:  GFRFRIKDADDLRKTQNSGVVV----------RGTN----NREYFGVLHEIIKLQYMRGNNVVLFKCKWWDINSRGKGIKVDDHGLTSVNMTYTFAVNEP
        GFRF  ++ +  +KTQNSGVVV          R  N    +  Y+GVL ++I+L Y+ GN V+LFKC WWD+ + G+GIK D++G T +N   T   +EP
Subjt:  GFRFRIKDADDLRKTQNSGVVV----------RGTN----NREYFGVLHEIIKLQYMRGNNVVLFKCKWWDINSRGKGIKVDDHGLTSVNMTYTFAVNEP

Query:  FVLACQSEQVVYLEDRRNPTWYFVLKVDPRDYYKIPVIKDVDCQEEDGEKMKNFSSHRAKIKFQDRNSINSHENSTVGATIST---ASTKSKTQV-----
        FVLA Q++QV Y+++     W+ V+++  R  Y +   K      E  +++    S R   +  + + IN   N   G TI T   +  + + Q+     
Subjt:  FVLACQSEQVVYLEDRRNPTWYFVLKVDPRDYYKIPVIKDVDCQEEDGEKMKNFSSHRAKIKFQDRNSINSHENSTVGATIST---ASTKSKTQV-----

Query:  ---------------------GLEASSSFTGEIGSTTSYERTVNGGPRKVRGPTRNLKLVRLPHGIRFEVSWRNKRPVGDNADIFKSQCTILARQVNFTP
                                + SS    +GS++S +RT        RG TRNL L+ +  G         K+ + D+  I + +  ++ +  +   
Subjt:  ---------------------GLEASSSFTGEIGSTTSYERTVNGGPRKVRGPTRNLKLVRLPHGIRFEVSWRNKRPVGDNADIFKSQCTILARQVNFTP

Query:  LQERFKVEGHETSVLRQLNRSYNN-------WRDSLKKKWLYKYDTVAEALANIPPEITREDAEYLANLW-----KSTEYQEMCERNKVNRSKLNVYHTC
        + E +            LN  Y +       +R+ +K K+   Y+T  E L + PP ++ +D  +L + W     K+ + ++  E N++    L      
Subjt:  LQERFKVEGHETSVLRQLNRSYNN-------WRDSLKKKWLYKYDTVAEALANIPPEITREDAEYLANLW-----KSTEYQEMCERNKVNRSKLNVYHTC

Query:  GSQSIQQRVEKESELKSYASQVMDGTLDMNN---------DEIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSK
        G+       E   + +   SQ  +GT    +         DEI+ QV GPE+HGRVRGYG G TP+ +FGS+S+
Subjt:  GSQSIQQRVEKESELKSYASQVMDGTLDMNN---------DEIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSK

A0A443Q533 Uncharacterized protein1.8e-7631.82Show/hide
Query:  RRLTTKEWRQAHLYILKNCDDVLPYIGEHIPALQ-----HVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEVDKQLLDLARGPSQEVMYYKGYIVNGFR
        R L+T EW QAHLY+L NCDDV P++  H   ++      V  R+  RKH KEF +WFE H+          V++QL  LA  P + V  +KGYI+NGFR
Subjt:  RRLTTKEWRQAHLYILKNCDDVLPYIGEHIPALQ-----HVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEVDKQLLDLARGPSQEVMYYKGYIVNGFR

Query:  FRIKDADDLRKTQNSGVVVRG--------------TNNREYFGVLHEIIKLQYMRGNNVVLFKCKWWDI-NSRGKGIKVDDHGLTSVNMTYTFAVNEPFV
        F  +D +  RKTQNSGV +                T N  Y+GVL ++I+LQY+ GN +VLFKC WWD+ N  G+GIK D++G T VN T T   NEPF+
Subjt:  FRIKDADDLRKTQNSGVVVRG--------------TNNREYFGVLHEIIKLQYMRGNNVVLFKCKWWDI-NSRGKGIKVDDHGLTSVNMTYTFAVNEPFV

Query:  LACQSEQVVYLEDRRNPTWYFVLKVDPRDYYKIPVIKDVDCQEEDGEKMKNFSSHRAKIKFQDRN-SINSHENSTVGATISTASTKSKTQVGLEASSSFT
        LA Q++QV Y++D   P W+  +K+ PRD Y +   + V CQ+ + +       H       D N  + +     +   +  A+     Q        F 
Subjt:  LACQSEQVVYLEDRRNPTWYFVLKVDPRDYYKIPVIKDVDCQEEDGEKMKNFSSHRAKIKFQDRN-SINSHENSTVGATISTASTKSKTQVGLEASSSFT

Query:  GEIGSTTSYERTVNG-------GPRKVRGPTRNLKLV-RLPHGIRFEVSWRNKRPVGDNADIFKSQCTILARQVNFTPLQERFKVEGHETS---------
         +     ++    +G       G ++ RG +R +KL  RLP            +P+GDNA  F + C I+ R     PLQ     +  E S         
Subjt:  GEIGSTTSYERTVNG-------GPRKVRGPTRNLKLV-RLPHGIRFEVSWRNKRPVGDNADIFKSQCTILARQVNFTPLQERFKVEGHETS---------

Query:  ------------VLRQLNRSYNNWRDSLKKKWLYKYDTVAEALANIPPEITREDAEYLANLWKSTEYQEMCERNKVNRSKLNVYHTCGSQSIQQRVEKE-
                    VL+ L + Y ++R  +K K+   ++T  E L + PP+++ ED  +L   W S E Q +  +NK NRS   + HT G++S  +  ++E 
Subjt:  ------------VLRQLNRSYNNWRDSLKKKWLYKYDTVAEALANIPPEITREDAEYLANLWKSTEYQEMCERNKVNRSKLNVYHTCGSQSIQQRVEKE-

Query:  -----------------------------------SELKSYASQVMDGTLDMNNDEIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSK-----VRDLERR
                                           S+ K  ++    G+     D+I+ QV G E+HGRVRGYG G TP+ +FGS+S+       +L+  
Subjt:  -----------------------------------SELKSYASQVMDGTLDMNNDEIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSK-----VRDLERR

Query:  LNESEQRLQESKRQRK
        L+   ++LQ+ +  +K
Subjt:  LNESEQRLQESKRQRK

A0A6V7QH08 Uncharacterized protein2.3e-6339.33Show/hide
Query:  LGENDNDSGEEDIFEILEDHFGVFNTNNWTKKGESSKHGYNEEPNEEVSKFYTLLNDAEKELYPGC-KQSRRLTTKEWRQAHLYILKNCDDVLPYIGEHI
        +G +D DS  E + +ILE HFG  N   W  +   +    +EEPNE  +KF+ LL D  ++L     K+ R+LT +EW  A L++L+N ++V P++ E  
Subjt:  LGENDNDSGEEDIFEILEDHFGVFNTNNWTKKGESSKHGYNEEPNEEVSKFYTLLNDAEKELYPGC-KQSRRLTTKEWRQAHLYILKNCDDVLPYIGEHI

Query:  PALQHVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEVDKQLLDLARGPSQEVMYYKGYIVNGFRFRIKDADDLRKTQNSGVVVRGTNNR---EYFGVLH
          +     +N + K DK+F EWFE+ + +++ +G  +V  QLL LA GP QEV  Y GYIVNGFRF+ K+ +  +K+QNSGVVV+G +     +++G+L 
Subjt:  PALQHVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEVDKQLLDLARGPSQEVMYYKGYIVNGFRFRIKDADDLRKTQNSGVVVRGTNNR---EYFGVLH

Query:  EIIKLQYMRGNNVVLFKCKWWDINSRGKGIKVDDHGLTSVNMTYT--FAVNEPFVLACQSEQVVYLEDRRNPTWYFVLKVDPRDYYKIPVIKDVDCQEED
         I++L+YM  + VVLFKC WWD+++  +GIK D++G  ++N + T  +  ++PFVLAC SEQV Y+ D R   W+ V++ +PRD+Y +P       +EE 
Subjt:  EIIKLQYMRGNNVVLFKCKWWDINSRGKGIKVDDHGLTSVNMTYT--FAVNEPFVLACQSEQVVYLEDRRNPTWYFVLKVDPRDYYKIPVIKDVDCQEED

Query:  GEKMKNFSSHRAKIKFQDRNSINSHENS
         +K        A   FQ   S + ++NS
Subjt:  GEKMKNFSSHRAKIKFQDRNSINSHENS

SwissProt top hitse value%identityAlignment
Q94AR8 3-isopropylmalate dehydratase large subunit, chloroplastic1.5e-0657.14Show/hide
Query:  NSHTCTDGAFGRFASGIGNSNAGFVLGTRKSLLKFKISFCFLGENDNDS
        +SHTCT GAFG+FA+GIGN++AGFVLGT K LLK   +  F+ + +  S
Subjt:  NSHTCTDGAFGRFASGIGNSNAGFVLGTRKSLLKFKISFCFLGENDNDS

Arabidopsis top hitse value%identityAlignment
AT1G40087.1 Plant transposase (Ptta/En/Spm family)5.8e-0626.23Show/hide
Query:  YKYDTVAEALANIPPEITREDAEYLANLWKSTEYQEMCERNKVNRSKLNVYHTCGSQSIQQR-----------------------------VEKESELKS
        YK + + E L N P ++  +   +L +L  + ++++M ERN  N+    + H CG +S  ++                             V  E++L++
Subjt:  YKYDTVAEALANIPPEITREDAEYLANLWKSTEYQEMCERNKVNRSKLNVYHTCGSQSIQQR-----------------------------VEKESELKS

Query:  YASQVM---------DGTLDMNNDEIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSKVRDLERRLNESEQRLQESKRQRKYEG
         A   +         +GT  +  D+ + QVFGPE+ GRVR  G G TPS L   S+  R   R+  E+ + + + K Q K  G
Subjt:  YASQVM---------DGTLDMNNDEIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSKVRDLERRLNESEQRLQESKRQRKYEG

AT3G30200.1 Plant transposase (Ptta/En/Spm family)4.9e-0525.14Show/hide
Query:  YKYDTVAEALANIPPEITREDAEYLANLWKSTEYQEMCERNKVNRSKLNVYHTCGSQSIQQR-----------------------------VEKESELKS
        +K + + E L N P ++  +   +L +L  + ++++M ERN  N+    + H CG +S  ++                             V  E++L++
Subjt:  YKYDTVAEALANIPPEITREDAEYLANLWKSTEYQEMCERNKVNRSKLNVYHTCGSQSIQQR-----------------------------VEKESELKS

Query:  YASQVM---------DGTLDMNNDEIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSKVRDLERRLNESEQRLQESKRQRKYEG
         A   +         +GT  +  D+ + QVFGPE+ GRV   G G TPS L   S+  R   R+  E+ + + + K Q K  G
Subjt:  YASQVM---------DGTLDMNNDEIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSKVRDLERRLNESEQRLQESKRQRKYEG

AT4G13430.1 isopropyl malate isomerase large subunit 11.1e-0757.14Show/hide
Query:  NSHTCTDGAFGRFASGIGNSNAGFVLGTRKSLLKFKISFCFLGENDNDS
        +SHTCT GAFG+FA+GIGN++AGFVLGT K LLK   +  F+ + +  S
Subjt:  NSHTCTDGAFGRFASGIGNSNAGFVLGTRKSLLKFKISFCFLGENDNDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGAAATTGGTTGTCGTCGGAGAATGTAGAAGTTTTAATCGTCACGCCGTCGTTCAGCCGCTGGGTATCGTCGAAAAGGAAACCGACAGAGGATGTTTGCTTCAAAA
CTTATGCGCCAGTCGTCGGGCTTCCCGTTCGCTTAGAATCACCACGCTGTTGCACACAGATACGTCGAAGGAGGACTCGTGCGACCTGCTTTGTGTCGCTGAACATCGTC
GTCACAGATCTCGGCTCGAGCTGTCGTCGTGCGTCGGGACAAGCGTGACGCCGTTGTTGTTGAAGACCACGGGAGGGGCTGCGCACCGATCGGAAACCATGGAGACACGC
CGCTGCTATCTGGCCGTCACTGCTGCTGCCGACCGCCAATGTCTAAGGCGTTGCCGGAGGGAGCCGTGCCGGTGCTGCTTCGCCGGAGAAGAAAACTCTCATACCTGTAC
AGATGGTGCATTTGGTCGATTTGCTAGTGGAATTGGTAACAGTAATGCAGGATTTGTACTAGGCACTCGGAAATCATTGCTCAAGTTTAAGATCTCTTTTTGTTTTCTAG
GAGAAAATGATAATGACTCTGGTGAAGAAGATATTTTTGAAATATTAGAGGATCACTTTGGTGTTTTTAACACCAACAATTGGACCAAGAAAGGAGAATCAAGTAAACAT
GGTTATAATGAAGAACCAAATGAGGAAGTTTCCAAGTTTTATACATTGTTAAATGACGCAGAAAAGGAACTTTATCCTGGGTGTAAACAATCAAGAAGGCTCACGACAAA
AGAGTGGAGACAAGCTCATCTTTATATTTTAAAGAATTGTGATGATGTCCTACCGTATATTGGTGAGCACATACCAGCATTACAACATGTTGATGCCAGAAATGCACAGA
GAAAGCACGATAAAGAGTTTATAGAATGGTTTGAAAGTCATGTTACTAAACTATATAATAAAGGAAGCAATGAAGTTGACAAACAGCTATTAGATTTAGCTCGAGGTCCA
TCACAAGAAGTAATGTACTACAAGGGATATATTGTTAATGGTTTTAGGTTTCGTATAAAGGATGCTGATGATTTAAGAAAAACACAAAATAGCGGGGTAGTAGTGAGAGG
AACTAACAATCGAGAGTACTTTGGTGTTTTGCATGAAATTATTAAATTGCAGTATATGAGAGGAAATAATGTTGTTTTGTTTAAATGTAAATGGTGGGATATCAATAGTC
GTGGTAAAGGAATTAAAGTTGATGATCATGGATTAACTAGTGTGAATATGACCTACACATTTGCTGTAAATGAGCCATTTGTATTGGCATGTCAATCTGAACAAGTTGTT
TATCTTGAGGATAGAAGAAATCCAACTTGGTATTTTGTGTTGAAGGTTGATCCAAGAGATTATTACAAGATTCCTGTAATCAAAGATGTAGATTGTCAAGAAGAAGATGG
GGAAAAGATGAAAAACTTTTCATCTCATAGAGCAAAGATCAAATTCCAAGATAGAAATAGTATAAATAGTCATGAGAATAGTACTGTAGGGGCGACTATATCTACAGCTT
CAACAAAATCTAAAACACAAGTTGGTCTAGAAGCTTCATCATCTTTTACAGGAGAAATTGGATCAACTACTAGTTACGAAAGGACGGTTAATGGAGGGCCAAGAAAAGTG
CGTGGTCCCACACGCAATCTTAAACTAGTTAGATTACCTCACGGGATTAGATTTGAAGTATCATGGAGGAACAAAAGACCTGTTGGAGATAATGCTGATATTTTCAAAAG
CCAATGCACTATTTTGGCTCGACAAGTCAACTTTACACCTCTACAAGAGCGATTTAAAGTCGAAGGCCATGAGACTTCTGTTTTACGTCAACTTAATCGATCTTATAATA
ATTGGAGAGATAGCTTGAAGAAAAAATGGTTGTACAAATATGATACAGTTGCAGAAGCTTTAGCTAATATTCCACCAGAAATTACAAGAGAAGATGCAGAATATCTTGCA
AACTTGTGGAAGTCAACTGAATATCAGGAGATGTGTGAAAGGAACAAAGTGAACCGTTCTAAATTGAATGTATATCACACATGCGGTTCACAAAGTATACAACAAAGAGT
TGAGAAAGAGTCGGAACTCAAATCATATGCCTCTCAAGTGATGGATGGTACACTTGACATGAACAACGACGAGATTTTTGTGCAAGTCTTTGGACCAGAGAAACATGGGC
GTGTTCGAGGTTATGGAGCCGGTGTTACTCCTTCTGAGTTGTTTGGATCATCTTCCAAAGTTCGTGATCTTGAGCGACGCCTTAACGAGTCAGAACAACGTCTTCAAGAA
TCTAAACGACAAAGAAAATATGAGGGGAGTGGGGGCAGTTATGACGTGGGGAGTTATCCCCACGTCATATCAGTTTTTTTGCAGAAGTGTTCTGCAACAGGAAAAAGGCC
CCCAACGAAGAACAGAGTGATATTCTCTCAATCAAGCTCTCTCCCTCAACTCTCCCTCACATTGAAAACGAAATGCTCCCACAAGCGTGTTCTCGAAACCGAAGAGGATA
GCACGGAAGACTCGGTGGTGCTGTTCTTGTGGAAACCATTGAAGAAAAGTTCTTCAAAGCATCTGCTCTTTTCTAAGGTTAAGTTCGCTGCTATGATAATTGGCAATGTC
TCTGATAAACCCAGCATTTGCTCTAGAGCTTCTACCAACGGATATGGCGGCGCTGGTCAATATTCTACTAGAGCATTTACATGCTCTGCTTGTTTTCTTTTCTCGAGCAT
TATATTTTATGTAGACCCTGTCTCTGGTGCTCTAGCAGTTACTGGATTGACTGATTTATTGGATGCGTTTTTACCATCCCTAGGCTGTCGAGCTATTTACTCTGATGGAG
TTGTTACGATTACGTCTGCTGGCTCACTGCCCTATCCGTATCTAACTGGCCTACTTGCAATTCCAGATTCCTTAATGATGCGGCTTGGCTTTGCACAGTGGCATCATTAT
TGGCCACGTATTGCTTCATTAGCTTTTCCAGTGATGCAAAAGATCCTTCAGATTGCCTCTGTGCTACTACCTGTCCTTGATTTGCAAAACCAAGAGGATAACTTACCTTC
TGCTGAAATGTCGAGGCATTGGATGTCCCTGAATTGTTTCCTCATTGAGTTCCACTCCAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTGAAATTGGTTGTCGTCGGAGAATGTAGAAGTTTTAATCGTCACGCCGTCGTTCAGCCGCTGGGTATCGTCGAAAAGGAAACCGACAGAGGATGTTTGCTTCAAAA
CTTATGCGCCAGTCGTCGGGCTTCCCGTTCGCTTAGAATCACCACGCTGTTGCACACAGATACGTCGAAGGAGGACTCGTGCGACCTGCTTTGTGTCGCTGAACATCGTC
GTCACAGATCTCGGCTCGAGCTGTCGTCGTGCGTCGGGACAAGCGTGACGCCGTTGTTGTTGAAGACCACGGGAGGGGCTGCGCACCGATCGGAAACCATGGAGACACGC
CGCTGCTATCTGGCCGTCACTGCTGCTGCCGACCGCCAATGTCTAAGGCGTTGCCGGAGGGAGCCGTGCCGGTGCTGCTTCGCCGGAGAAGAAAACTCTCATACCTGTAC
AGATGGTGCATTTGGTCGATTTGCTAGTGGAATTGGTAACAGTAATGCAGGATTTGTACTAGGCACTCGGAAATCATTGCTCAAGTTTAAGATCTCTTTTTGTTTTCTAG
GAGAAAATGATAATGACTCTGGTGAAGAAGATATTTTTGAAATATTAGAGGATCACTTTGGTGTTTTTAACACCAACAATTGGACCAAGAAAGGAGAATCAAGTAAACAT
GGTTATAATGAAGAACCAAATGAGGAAGTTTCCAAGTTTTATACATTGTTAAATGACGCAGAAAAGGAACTTTATCCTGGGTGTAAACAATCAAGAAGGCTCACGACAAA
AGAGTGGAGACAAGCTCATCTTTATATTTTAAAGAATTGTGATGATGTCCTACCGTATATTGGTGAGCACATACCAGCATTACAACATGTTGATGCCAGAAATGCACAGA
GAAAGCACGATAAAGAGTTTATAGAATGGTTTGAAAGTCATGTTACTAAACTATATAATAAAGGAAGCAATGAAGTTGACAAACAGCTATTAGATTTAGCTCGAGGTCCA
TCACAAGAAGTAATGTACTACAAGGGATATATTGTTAATGGTTTTAGGTTTCGTATAAAGGATGCTGATGATTTAAGAAAAACACAAAATAGCGGGGTAGTAGTGAGAGG
AACTAACAATCGAGAGTACTTTGGTGTTTTGCATGAAATTATTAAATTGCAGTATATGAGAGGAAATAATGTTGTTTTGTTTAAATGTAAATGGTGGGATATCAATAGTC
GTGGTAAAGGAATTAAAGTTGATGATCATGGATTAACTAGTGTGAATATGACCTACACATTTGCTGTAAATGAGCCATTTGTATTGGCATGTCAATCTGAACAAGTTGTT
TATCTTGAGGATAGAAGAAATCCAACTTGGTATTTTGTGTTGAAGGTTGATCCAAGAGATTATTACAAGATTCCTGTAATCAAAGATGTAGATTGTCAAGAAGAAGATGG
GGAAAAGATGAAAAACTTTTCATCTCATAGAGCAAAGATCAAATTCCAAGATAGAAATAGTATAAATAGTCATGAGAATAGTACTGTAGGGGCGACTATATCTACAGCTT
CAACAAAATCTAAAACACAAGTTGGTCTAGAAGCTTCATCATCTTTTACAGGAGAAATTGGATCAACTACTAGTTACGAAAGGACGGTTAATGGAGGGCCAAGAAAAGTG
CGTGGTCCCACACGCAATCTTAAACTAGTTAGATTACCTCACGGGATTAGATTTGAAGTATCATGGAGGAACAAAAGACCTGTTGGAGATAATGCTGATATTTTCAAAAG
CCAATGCACTATTTTGGCTCGACAAGTCAACTTTACACCTCTACAAGAGCGATTTAAAGTCGAAGGCCATGAGACTTCTGTTTTACGTCAACTTAATCGATCTTATAATA
ATTGGAGAGATAGCTTGAAGAAAAAATGGTTGTACAAATATGATACAGTTGCAGAAGCTTTAGCTAATATTCCACCAGAAATTACAAGAGAAGATGCAGAATATCTTGCA
AACTTGTGGAAGTCAACTGAATATCAGGAGATGTGTGAAAGGAACAAAGTGAACCGTTCTAAATTGAATGTATATCACACATGCGGTTCACAAAGTATACAACAAAGAGT
TGAGAAAGAGTCGGAACTCAAATCATATGCCTCTCAAGTGATGGATGGTACACTTGACATGAACAACGACGAGATTTTTGTGCAAGTCTTTGGACCAGAGAAACATGGGC
GTGTTCGAGGTTATGGAGCCGGTGTTACTCCTTCTGAGTTGTTTGGATCATCTTCCAAAGTTCGTGATCTTGAGCGACGCCTTAACGAGTCAGAACAACGTCTTCAAGAA
TCTAAACGACAAAGAAAATATGAGGGGAGTGGGGGCAGTTATGACGTGGGGAGTTATCCCCACGTCATATCAGTTTTTTTGCAGAAGTGTTCTGCAACAGGAAAAAGGCC
CCCAACGAAGAACAGAGTGATATTCTCTCAATCAAGCTCTCTCCCTCAACTCTCCCTCACATTGAAAACGAAATGCTCCCACAAGCGTGTTCTCGAAACCGAAGAGGATA
GCACGGAAGACTCGGTGGTGCTGTTCTTGTGGAAACCATTGAAGAAAAGTTCTTCAAAGCATCTGCTCTTTTCTAAGGTTAAGTTCGCTGCTATGATAATTGGCAATGTC
TCTGATAAACCCAGCATTTGCTCTAGAGCTTCTACCAACGGATATGGCGGCGCTGGTCAATATTCTACTAGAGCATTTACATGCTCTGCTTGTTTTCTTTTCTCGAGCAT
TATATTTTATGTAGACCCTGTCTCTGGTGCTCTAGCAGTTACTGGATTGACTGATTTATTGGATGCGTTTTTACCATCCCTAGGCTGTCGAGCTATTTACTCTGATGGAG
TTGTTACGATTACGTCTGCTGGCTCACTGCCCTATCCGTATCTAACTGGCCTACTTGCAATTCCAGATTCCTTAATGATGCGGCTTGGCTTTGCACAGTGGCATCATTAT
TGGCCACGTATTGCTTCATTAGCTTTTCCAGTGATGCAAAAGATCCTTCAGATTGCCTCTGTGCTACTACCTGTCCTTGATTTGCAAAACCAAGAGGATAACTTACCTTC
TGCTGAAATGTCGAGGCATTGGATGTCCCTGAATTGTTTCCTCATTGAGTTCCACTCCAACTAA
Protein sequenceShow/hide protein sequence
MLKLVVVGECRSFNRHAVVQPLGIVEKETDRGCLLQNLCASRRASRSLRITTLLHTDTSKEDSCDLLCVAEHRRHRSRLELSSCVGTSVTPLLLKTTGGAAHRSETMETR
RCYLAVTAAADRQCLRRCRREPCRCCFAGEENSHTCTDGAFGRFASGIGNSNAGFVLGTRKSLLKFKISFCFLGENDNDSGEEDIFEILEDHFGVFNTNNWTKKGESSKH
GYNEEPNEEVSKFYTLLNDAEKELYPGCKQSRRLTTKEWRQAHLYILKNCDDVLPYIGEHIPALQHVDARNAQRKHDKEFIEWFESHVTKLYNKGSNEVDKQLLDLARGP
SQEVMYYKGYIVNGFRFRIKDADDLRKTQNSGVVVRGTNNREYFGVLHEIIKLQYMRGNNVVLFKCKWWDINSRGKGIKVDDHGLTSVNMTYTFAVNEPFVLACQSEQVV
YLEDRRNPTWYFVLKVDPRDYYKIPVIKDVDCQEEDGEKMKNFSSHRAKIKFQDRNSINSHENSTVGATISTASTKSKTQVGLEASSSFTGEIGSTTSYERTVNGGPRKV
RGPTRNLKLVRLPHGIRFEVSWRNKRPVGDNADIFKSQCTILARQVNFTPLQERFKVEGHETSVLRQLNRSYNNWRDSLKKKWLYKYDTVAEALANIPPEITREDAEYLA
NLWKSTEYQEMCERNKVNRSKLNVYHTCGSQSIQQRVEKESELKSYASQVMDGTLDMNNDEIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSKVRDLERRLNESEQRLQE
SKRQRKYEGSGGSYDVGSYPHVISVFLQKCSATGKRPPTKNRVIFSQSSSLPQLSLTLKTKCSHKRVLETEEDSTEDSVVLFLWKPLKKSSSKHLLFSKVKFAAMIIGNV
SDKPSICSRASTNGYGGAGQYSTRAFTCSACFLFSSIIFYVDPVSGALAVTGLTDLLDAFLPSLGCRAIYSDGVVTITSAGSLPYPYLTGLLAIPDSLMMRLGFAQWHHY
WPRIASLAFPVMQKILQIASVLLPVLDLQNQEDNLPSAEMSRHWMSLNCFLIEFHSN