CuGenDBv2

Sgr020513 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr020513
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: transcription factor IBH1-like 1
Genome location: tig00153533:1030184..1030723
RNA-Seq Expression: Sgr020513
Synteny: Sgr020513
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016020 - membrane (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domains:
IPR044549 - Transcription factor IBH1-like, bHLH domain
IPR044660 - Transcription factor IBH1-like
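
The genome location above is given in a `contig:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention that this page does not state), the span can be computed as follows. The helper names `parse_location` and `span_length` are hypothetical, not part of any database API.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)

def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

contig, start, end = parse_location("tig00153533:1030184..1030723")
print(contig, span_length(start, end))
```

Under that assumption the span is 540 bp, which corresponds to 180 codons and is consistent with the 179-residue protein sequence listed under Sequences plus a stop codon.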


Homology
GenBank top hits (e value, % identity, alignment)
KAA0048635.1 transcription factor IBH1 [Cucumis melo var. makuwa] | e value: 2.8e-51 | identity: 66.48%
Query:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNRPIYEKLQTS-SGLLRRRKKTNS
        M NPNKLKQQFLK WLVGL + TSS T M+ L+RKKAIK+SAD AMA+ RNGTT WS++IIAKS+K G  P E +LNR      +TS   LLR+++    
Subjt:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNRPIYEKLQTS-SGLLRRRKKTNS

Query:  LQKMGRRVRRRVA-RSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLAS-CK
        L+KMGR++ R++A RS LP SKAL S++AKRLV KRTKVLRSL+PGGEFM+DE LLIEEALDYI FL+AQVDGMRFLA+ CK
Subjt:  LQKMGRRVRRRVA-RSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLAS-CK

XP_004149100.1 transcription factor IBH1-like 1 [Cucumis sativus] | e value: 2.3e-50 | identity: 66.3%
Query:  MTNPNKLKQQFLKKWLVGLHACTSSNTA-MDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNR-PIYEKLQTSSGLLRRRKKTN
        M NPNKLKQQFLKKWLVGL + TSS+T  M+ L+RKKAIK+SAD AMA  R GTT WS++IIAKS+K G  P + +LNR  IY        LLR+++   
Subjt:  MTNPNKLKQQFLKKWLVGLHACTSSNTA-MDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNR-PIYEKLQTSSGLLRRRKKTN

Query:  SLQKMGRRVRRRVA-RSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLAS--CK
         LQKMGR++ RR+A RS LP SK L  ++AKRLV KRTKVLRSL+PGGEFMEDEVLLIEEALDYI FL+AQVDGMRFLA+  CK
Subjt:  SLQKMGRRVRRRVA-RSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLAS--CK

XP_008452339.1 PREDICTED: uncharacterized protein LOC103493395 [Cucumis melo] | e value: 2.0e-49 | identity: 65.38%
Query:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNRPIYEKLQTS-SGLLRRRKKTNS
        M N NKLKQQFLK WLVGL + TSS T M+ L+RKKAIK+SAD AMA+ RNGTT WS++IIAK +K G  P E +LNR      +TS   LLR+++    
Subjt:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNRPIYEKLQTS-SGLLRRRKKTNS

Query:  LQKMGRRVRRRVA-RSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLAS-CK
        L+KMGR++ R++A RS LP SKAL S++AKRLV KRTKVLRSL+PGGEFM+DE LLIEEALDYI FL+AQVDGMRFLA+ CK
Subjt:  LQKMGRRVRRRVA-RSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLAS-CK

XP_022153351.1 transcription factor IBH1-like 1 [Momordica charantia] | e value: 3.5e-54 | identity: 70.33%
Query:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNRPIYEKLQTSSGLLRRRKKTNSL
        M NPNKLKQ+FL+KW V L   T+SNT MD+ ERKKAIK+SADLAMASARNGTT WSRAIIAKSMK  +VPAE++L+ P++    +  G    RKKT  +
Subjt:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNRPIYEKLQTSSGLLRRRKKTNSL

Query:  QKMG-RRVRRRVAR--SHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLASCK
        Q  G RRV RR  R  +  PPSKA A  VAKRLV KR KVLRSLVPGGEFMEDEVLLIEE LDYISFLRAQVDGMRFLASCK
Subjt:  QKMG-RRVRRRVAR--SHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLASCK

XP_038890559.1 transcription factor IBH1-like 1 [Benincasa hispida] | e value: 2.6e-57 | identity: 74.03%
Query:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNR-PIYEKLQTSSGLLRRRKKTNS
        M NPNKLKQQFLKKWLVGL + TSSNT M+ILERKKAIK+SAD AMA+ RNG+T WSRAIIAKS+K GQ P E + NR  IY KLQ      R++  T +
Subjt:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNR-PIYEKLQTSSGLLRRRKKTNS

Query:  LQKMGRRVRRRVARSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLAS-CK
        L KMGRRV R+VARS  PP KALA +VAKRLV KRTKVLRSLVPGGEFMEDEVLLIEEALDYI+FL+AQVDGMRFLA+ CK
Subjt:  LQKMGRRVRRRVARSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLAS-CK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LM92 Uncharacterized protein | e value: 1.1e-50 | identity: 66.3%
Query:  MTNPNKLKQQFLKKWLVGLHACTSSNTA-MDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNR-PIYEKLQTSSGLLRRRKKTN
        M NPNKLKQQFLKKWLVGL + TSS+T  M+ L+RKKAIK+SAD AMA  R GTT WS++IIAKS+K G  P + +LNR  IY        LLR+++   
Subjt:  MTNPNKLKQQFLKKWLVGLHACTSSNTA-MDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNR-PIYEKLQTSSGLLRRRKKTN

Query:  SLQKMGRRVRRRVA-RSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLAS--CK
         LQKMGR++ RR+A RS LP SK L  ++AKRLV KRTKVLRSL+PGGEFMEDEVLLIEEALDYI FL+AQVDGMRFLA+  CK
Subjt:  SLQKMGRRVRRRVA-RSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLAS--CK

A0A1S3BT01 uncharacterized protein LOC103493395 | e value: 9.6e-50 | identity: 65.38%
Query:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNRPIYEKLQTS-SGLLRRRKKTNS
        M N NKLKQQFLK WLVGL + TSS T M+ L+RKKAIK+SAD AMA+ RNGTT WS++IIAK +K G  P E +LNR      +TS   LLR+++    
Subjt:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNRPIYEKLQTS-SGLLRRRKKTNS

Query:  LQKMGRRVRRRVA-RSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLAS-CK
        L+KMGR++ R++A RS LP SKAL S++AKRLV KRTKVLRSL+PGGEFM+DE LLIEEALDYI FL+AQVDGMRFLA+ CK
Subjt:  LQKMGRRVRRRVA-RSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLAS-CK

A0A5D3CK75 Transcription factor IBH1 | e value: 1.3e-51 | identity: 66.48%
Query:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNRPIYEKLQTS-SGLLRRRKKTNS
        M NPNKLKQQFLK WLVGL + TSS T M+ L+RKKAIK+SAD AMA+ RNGTT WS++IIAKS+K G  P E +LNR      +TS   LLR+++    
Subjt:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNRPIYEKLQTS-SGLLRRRKKTNS

Query:  LQKMGRRVRRRVA-RSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLAS-CK
        L+KMGR++ R++A RS LP SKAL S++AKRLV KRTKVLRSL+PGGEFM+DE LLIEEALDYI FL+AQVDGMRFLA+ CK
Subjt:  LQKMGRRVRRRVA-RSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLAS-CK

A0A6J1DKD7 transcription factor IBH1-like 1 | e value: 1.7e-54 | identity: 70.33%
Query:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNRPIYEKLQTSSGLLRRRKKTNSL
        M NPNKLKQ+FL+KW V L   T+SNT MD+ ERKKAIK+SADLAMASARNGTT WSRAIIAKSMK  +VPAE++L+ P++    +  G    RKKT  +
Subjt:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNRPIYEKLQTSSGLLRRRKKTNSL

Query:  QKMG-RRVRRRVAR--SHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLASCK
        Q  G RRV RR  R  +  PPSKA A  VAKRLV KR KVLRSLVPGGEFMEDEVLLIEE LDYISFLRAQVDGMRFLASCK
Subjt:  QKMG-RRVRRRVAR--SHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLASCK

A0A6J1HLT3 transcription factor IBH1-like 1 | e value: 1.7e-43 | identity: 62.01%
Query:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNRPIYEKLQTSSGLLRRRKKTNSL
        M NP  LK+ FLKKW++GL+  T SNT M+ LERKKAIK S+DLAMA+ RNG T WSRAIIAKS+   Q P E  L                RRK T  L
Subjt:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNRPIYEKLQTSSGLLRRRKKTNSL

Query:  QKMGRRVRRRVARSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLASCK
         K+GR++ RR   +H   S+++A SVAKRLV+KRTKVLRSLVPGGEFMEDE LLI+EALDYISFLR QVDGMRFLA+CK
Subjt:  QKMGRRVRRRVARSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLASCK

SwissProt top hits (e value, % identity, alignment)
Q9M0B9 Transcription factor IBH1-like 1 | e value: 3.5e-25 | identity: 41.21%
Query:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQ---------VPAELLLNRPIYEKLQTSSGLL
        M   + + ++FLKKW +GL     S     + ERKKAIKLSAD+AMAS R GTT WSRA+I K+  +           + AE L+N+ + +K      ++
Subjt:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQ---------VPAELLLNRPIYEKLQTSSGLL

Query:  RRRKKTNSLQKMGRRVRRRVARSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMR
        RR KK          + RR ++S    +   A++ AKRLV++RT+ LR++VPGGE M ++VLL++E LDYI  L+ QV+ MR
Subjt:  RRRKKTNSLQKMGRRVRRRVARSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMR

Arabidopsis top hits (e value, % identity, alignment)
AT1G09250.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 3.5e-04 | identity: 29.32%
Query:  KLKQQFLKKWLVG----LHAC--------------TSSNTAMDILERKKA--IKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNRPIYEKLQ
        ++ ++ LK+W       ++AC              T+SN   D L    A  I+ +AD  +A++  GTT WSRAI+A      +V A+L  +R    K +
Subjt:  KLKQQFLKKWLVG----LHAC--------------TSSNTAMDILERKKA--IKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNRPIYEKLQ

Query:  TSSGLLRRRKKTNSLQKMGRRVRRRVARSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLA
         S+G  + RK                 R  LP             V ++ K+L  LVPG   +    LL +EA DYI+ L  QV  M  LA
Subjt:  TSSGLLRRRKKTNSLQKMGRRVRRRVARSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLA

AT3G17100.1 sequence-specific DNA binding transcription factors | e value: 4.6e-04 | identity: 29.68%
Query:  SSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMK-QGQVPAELLLNRPIYEKLQTSSGLLRRRKKTNSLQKMGRRVRRRVARSHLPPSKAL
        SS+++  I    +A++  AD A+A A  G T WSRAI++K++K + +      ++ P    L T  G +R +K+        R    R+    LP     
Subjt:  SSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMK-QGQVPAELLLNRPIYEKLQTSSGLLRRRKKTNSLQKMGRRVRRRVARSHLPPSKAL

Query:  ASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLAS
                V+++ KVL  LVPG       V+L EE  DYI+ +  Q+  M  + S
Subjt:  ASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLAS

AT4G30410.1 sequence-specific DNA binding transcription factors | e value: 2.5e-26 | identity: 41.21%
Query:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQ---------VPAELLLNRPIYEKLQTSSGLL
        M   + + ++FLKKW +GL     S     + ERKKAIKLSAD+AMAS R GTT WSRA+I K+  +           + AE L+N+ + +K      ++
Subjt:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQ---------VPAELLLNRPIYEKLQTSSGLL

Query:  RRRKKTNSLQKMGRRVRRRVARSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMR
        RR KK          + RR ++S    +   A++ AKRLV++RT+ LR++VPGGE M ++VLL++E LDYI  L+ QV+ MR
Subjt:  RRRKKTNSLQKMGRRVRRRVARSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMR

AT4G30410.2 sequence-specific DNA binding transcription factors | e value: 2.5e-26 | identity: 41.21%
Query:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQ---------VPAELLLNRPIYEKLQTSSGLL
        M   + + ++FLKKW +GL     S     + ERKKAIKLSAD+AMAS R GTT WSRA+I K+  +           + AE L+N+ + +K      ++
Subjt:  MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQ---------VPAELLLNRPIYEKLQTSSGLL

Query:  RRRKKTNSLQKMGRRVRRRVARSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMR
        RR KK          + RR ++S    +   A++ AKRLV++RT+ LR++VPGGE M ++VLL++E LDYI  L+ QV+ MR
Subjt:  RRRKKTNSLQKMGRRVRRRVARSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMR

AT5G57780.1 EXPRESSED IN: 18 plant structures | e value: 1.9e-26 | identity: 42.7%
Query:  MTNPNKLKQQFLKKWLVGLHACTSS-NTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNRPIYEKLQTSSGLLRRRKKTNS
        M   N +KQ+F+KKW+  LH   SS    +++ ERK AI+LS+DLAMA+ARNG+T WSRA+I++S   G   A   + R I +K +      R + + N 
Subjt:  MTNPNKLKQQFLKKWLVGLHACTSS-NTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNRPIYEKLQTSSGLLRRRKKTNS

Query:  LQKMGRRVRRRVARSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLAS
        L++ G                   +  AK  VRKRT +L+SLVPGGE ++D+  LI E LDYI +LRAQVD MR +A+
Subjt:  LQKMGRRVRRRVARSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLAS


Sequences
CDS sequence
ATGACCAATCCCAACAAGCTCAAACAACAATTTCTCAAGAAATGGCTGGTGGGTCTGCACGCCTGTACCTCTTCAAACACAGCCATGGACATCTTAGAGAGAAAGAAAGC
CATAAAGCTGTCCGCCGACTTGGCCATGGCTTCGGCAAGAAACGGCACAACCTTTTGGAGCCGAGCCATCATTGCCAAATCCATGAAACAAGGCCAAGTTCCAGCCGAGC
TCCTCTTGAATCGACCCATTTATGAGAAGCTTCAGACATCCTCGGGTCTGCTCAGAAGGAGGAAGAAGACGAATTCGCTGCAAAAGATGGGCCGCAGAGTCCGTCGAAGA
GTGGCGAGAAGTCACTTGCCGCCATCGAAGGCTTTGGCGAGCTCAGTTGCAAAGAGATTGGTTCGGAAAAGAACGAAAGTTCTGAGGAGTTTGGTTCCTGGAGGAGAGTT
TATGGAGGATGAAGTTTTGTTGATTGAAGAAGCCCTAGATTATATATCGTTTCTTCGAGCTCAGGTTGATGGAATGCGATTTCTTGCTAGCTGCAAATAA
mRNA sequence
ATGACCAATCCCAACAAGCTCAAACAACAATTTCTCAAGAAATGGCTGGTGGGTCTGCACGCCTGTACCTCTTCAAACACAGCCATGGACATCTTAGAGAGAAAGAAAGC
CATAAAGCTGTCCGCCGACTTGGCCATGGCTTCGGCAAGAAACGGCACAACCTTTTGGAGCCGAGCCATCATTGCCAAATCCATGAAACAAGGCCAAGTTCCAGCCGAGC
TCCTCTTGAATCGACCCATTTATGAGAAGCTTCAGACATCCTCGGGTCTGCTCAGAAGGAGGAAGAAGACGAATTCGCTGCAAAAGATGGGCCGCAGAGTCCGTCGAAGA
GTGGCGAGAAGTCACTTGCCGCCATCGAAGGCTTTGGCGAGCTCAGTTGCAAAGAGATTGGTTCGGAAAAGAACGAAAGTTCTGAGGAGTTTGGTTCCTGGAGGAGAGTT
TATGGAGGATGAAGTTTTGTTGATTGAAGAAGCCCTAGATTATATATCGTTTCTTCGAGCTCAGGTTGATGGAATGCGATTTCTTGCTAGCTGCAAATAA
Protein sequence
MTNPNKLKQQFLKKWLVGLHACTSSNTAMDILERKKAIKLSADLAMASARNGTTFWSRAIIAKSMKQGQVPAELLLNRPIYEKLQTSSGLLRRRKKTNSLQKMGRRVRRR
VARSHLPPSKALASSVAKRLVRKRTKVLRSLVPGGEFMEDEVLLIEEALDYISFLRAQVDGMRFLASCK
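
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a sequence containing only A, C, G and T. The function name `translate` is illustrative and is not taken from this database or any library.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS listed above.
print(translate("ATGACCAATCCCAACAAGCTCAAA"))  # MTNPNKLK
```

Passing the full CDS (line breaks included) should reproduce the protein sequence shown here; a mismatch would point to a frame shift or a non-standard genetic code.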