; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr030325 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr030325
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionbZIP transcription factor 53
Genome locationtig00153640:2326390..2329698
RNA-Seq ExpressionSgr030325
SyntenySgr030325
Gene Ontology termsGO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155257.1 bZIP transcription factor 53 [Momordica charantia]3.9e-5691.43Show/hide
Query:  MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLN
        MASIQR+GS GSNG SQS IPDERKRKRMQSNRESARRSRM+KQKQLEDL SEV RLEIANNQ LQSISAKEQAFIQVDNMNNVLRAQAMELTDRL+SLN
Subjt:  MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLN

Query:  SVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPPVIADI
        SVL IVEEVSGLAMDIPEIPDPLLKPWELSRPVPPV+AD+
Subjt:  SVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPPVIADI

XP_022998313.1 bZIP transcription factor 53-like [Cucurbita maxima]3.8e-5188.15Show/hide
Query:  MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLN
        MASI RQ S GSNGGS   IPDERKRKRMQSNRESARRSRM+KQKQ+EDLT EVSRL+IANNQ LQSI AKEQAF+QVDNMNNVLRAQAMELTDRL+SLN
Subjt:  MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLN

Query:  SVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPP
        SVLQIVEEVSGLAMDIPEIPDPLLKPWE SRPV P
Subjt:  SVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPP

XP_023525883.1 bZIP transcription factor 53-like [Cucurbita pepo subsp. pepo]1.3e-5188.89Show/hide
Query:  MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLN
        MASI RQ S GSNGGS   IPDERKRKRMQSNRESARRSRM+KQKQ+EDLT EVSRL+IANNQ LQSI AKEQAF+QVDNMNNVLRAQAMELTDRLQSLN
Subjt:  MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLN

Query:  SVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPP
        SVLQIVEEVSGLAMDIPEIPDPLLKPWE SRPV P
Subjt:  SVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPP

XP_023529126.1 bZIP transcription factor 53-like [Cucurbita pepo subsp. pepo]1.3e-5187.5Show/hide
Query:  MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLN
        MASI R+ S GSNGGSQ  IPDERKRKRMQSNRESARRSRMKKQKQ+EDLT E+SRL+IANNQ LQSI AKEQAF+QVDNMNNVLRAQAMELTDRL+SLN
Subjt:  MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLN

Query:  SVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPPV
        SVLQIVEEVSGLAMDIPEIPDPLLKPWE SRP  PV
Subjt:  SVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPPV

XP_038905019.1 bZIP transcription factor 53 [Benincasa hispida]7.6e-5288.24Show/hide
Query:  MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLN
        MASI RQ SSGSNGGS S +PDERKRKRMQSNRESARRSRM+KQKQLEDL  EVSRL+IANNQ +QSI AKEQAF+QVDNMNNVLRAQAMELTDRL+SLN
Subjt:  MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLN

Query:  SVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPPV
        SVL IVEEVSGLAMDIPEIPDPLLKPWELSRPV PV
Subjt:  SVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPPV

TrEMBL top hitse value%identityAlignment
A0A6J1DNV4 bZIP transcription factor 531.9e-5691.43Show/hide
Query:  MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLN
        MASIQR+GS GSNG SQS IPDERKRKRMQSNRESARRSRM+KQKQLEDL SEV RLEIANNQ LQSISAKEQAFIQVDNMNNVLRAQAMELTDRL+SLN
Subjt:  MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLN

Query:  SVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPPVIADI
        SVL IVEEVSGLAMDIPEIPDPLLKPWELSRPVPPV+AD+
Subjt:  SVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPPVIADI

A0A6J1F1P1 bZIP transcription factor 53-like7.0e-5186.03Show/hide
Query:  MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLN
        MASI R+ S GSNGG Q  IPDERKRKRMQSNRESARRSRMKKQKQ+EDLT E+SRL+IANNQ LQSI AKEQAF+QVDNMNNVLRAQAMELTDRL+SLN
Subjt:  MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLN

Query:  SVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPPV
        SVLQIVE+VSGLAMDIPEIPDPLLKPWE SRP  PV
Subjt:  SVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPPV

A0A6J1GCN2 bZIP transcription factor 53-like4.1e-5187.41Show/hide
Query:  MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLN
        MASI RQ S GSNGGS   IPDERKRKRMQSNRESARRSRM+KQKQ+EDLT EVSRL+IANNQ LQSI AKEQAF+QVDNMNNVLRAQAMELTDRL+SLN
Subjt:  MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLN

Query:  SVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPP
        SVLQIVEEVSGLAMDIPEIPDPLLKPW+ SRPV P
Subjt:  SVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPP

A0A6J1J475 bZIP transcription factor 53-like1.6e-5085.29Show/hide
Query:  MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLN
        MASI R+ S GSNGGSQ  IPDERKRKRMQSNRESARRSRMKKQKQ+EDLT E+SRL++ANNQ LQSI AKEQAF+QVDNMNNVLRAQA+EL DRL+SLN
Subjt:  MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLN

Query:  SVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPPV
        SVLQIVEEVSGLAMDIPEIPDPLLKPWE SRP  PV
Subjt:  SVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPPV

A0A6J1KGE0 bZIP transcription factor 53-like1.8e-5188.15Show/hide
Query:  MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLN
        MASI RQ S GSNGGS   IPDERKRKRMQSNRESARRSRM+KQKQ+EDLT EVSRL+IANNQ LQSI AKEQAF+QVDNMNNVLRAQAMELTDRL+SLN
Subjt:  MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLN

Query:  SVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPP
        SVLQIVEEVSGLAMDIPEIPDPLLKPWE SRPV P
Subjt:  SVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPP

SwissProt top hitse value%identityAlignment
C0Z2L5 bZIP transcription factor 447.8e-1542.45Show/hide
Query:  SNGGSQSVIP-----DERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLNSVLQIV
        +N GS+S +      DERKRKR QSNRESARRSRM+KQK L+DLT++V+ L   N Q +  I+   Q ++ ++  N++LRAQ +EL  RLQSLN ++  V
Subjt:  SNGGSQSVIP-----DERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLNSVLQIV

Query:  E-EVSGLAMDIPE------IPDPLLKPWELSRPVPPVIA
        E   SG  M+  +      + D ++ P  L     P++A
Subjt:  E-EVSGLAMDIPE------IPDPLLKPWELSRPVPPVIA

O65683 bZIP transcription factor 111.3e-1450Show/hide
Query:  SNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLNSVLQIVE
        ++ GS+  + ++RKRKRM SNRESARRSRMKKQK L+DLT++V+ L+  N + + S+S   Q ++ V+  N+VLRAQ  EL  RLQSLN +++ ++
Subjt:  SNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLNSVLQIVE

P24068 Ocs element-binding factor 17.5e-1848.03Show/hide
Query:  SSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLNSVLQIVEE
        +SGS+G   S     R+ KR  SNRESARRSR++KQ+ L++L  EV+RL+  N +           + +V+  N VLRA+A EL DRL+S+N VL++VEE
Subjt:  SSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLNSVLQIVEE

Query:  VSGLAMDIPE---IPDPLLKPWELSRP
         SG+AMDI E     DPLL+PW+L  P
Subjt:  VSGLAMDIPE---IPDPLLKPWELSRP

Q9LZP8 bZIP transcription factor 538.0e-2853.96Show/hide
Query:  MASIQRQGSSGS-NGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSL
        M S+Q Q S  S N    + + DERKRKRM SNRESARRSRM+KQKQL DL +EV+ L+  N +  + +    + +I++++ NNVLRAQA ELTDRL+SL
Subjt:  MASIQRQGSSGS-NGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSL

Query:  NSVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPPVIA
        NSVL++VEE+SG A+DIPEIP+ +  PW++  P+ P+ A
Subjt:  NSVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPPVIA

Q9SI15 bZIP transcription factor 27.8e-1545.9Show/hide
Query:  ASIQRQGSS---GSNGGSQSVIP-DERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQ
        +S  R  SS   G+N  S SV+  DERKRKRM SNRESARRSRM+KQK ++DLT+++++L   N Q L S++   Q ++++   N+VL AQ  EL+ RLQ
Subjt:  ASIQRQGSS---GSNGGSQSVIP-DERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQ

Query:  SLNSVLQIVEEVSGLAMDIPEI
        SLN ++ +V+  +G    + +I
Subjt:  SLNSVLQIVEEVSGLAMDIPEI

Arabidopsis top hitse value%identityAlignment
AT1G75390.1 basic leucine-zipper 445.5e-1642.45Show/hide
Query:  SNGGSQSVIP-----DERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLNSVLQIV
        +N GS+S +      DERKRKR QSNRESARRSRM+KQK L+DLT++V+ L   N Q +  I+   Q ++ ++  N++LRAQ +EL  RLQSLN ++  V
Subjt:  SNGGSQSVIP-----DERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLNSVLQIV

Query:  E-EVSGLAMDIPE------IPDPLLKPWELSRPVPPVIA
        E   SG  M+  +      + D ++ P  L     P++A
Subjt:  E-EVSGLAMDIPE------IPDPLLKPWELSRPVPPVIA

AT1G75390.2 basic leucine-zipper 442.0e-1050Show/hide
Query:  SNGGSQSVIP-----DERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQ
        +N GS+S +      DERKRKR QSNRESARRSRM+KQK L+DLT++V+ L   N Q +  I+   Q ++ ++  N++LRAQ
Subjt:  SNGGSQSVIP-----DERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQ

AT2G18160.1 basic leucine-zipper 25.5e-1645.9Show/hide
Query:  ASIQRQGSS---GSNGGSQSVIP-DERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQ
        +S  R  SS   G+N  S SV+  DERKRKRM SNRESARRSRM+KQK ++DLT+++++L   N Q L S++   Q ++++   N+VL AQ  EL+ RLQ
Subjt:  ASIQRQGSS---GSNGGSQSVIP-DERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQ

Query:  SLNSVLQIVEEVSGLAMDIPEI
        SLN ++ +V+  +G    + +I
Subjt:  SLNSVLQIVEEVSGLAMDIPEI

AT3G62420.1 basic region/leucine zipper motif 535.7e-2953.96Show/hide
Query:  MASIQRQGSSGS-NGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSL
        M S+Q Q S  S N    + + DERKRKRM SNRESARRSRM+KQKQL DL +EV+ L+  N +  + +    + +I++++ NNVLRAQA ELTDRL+SL
Subjt:  MASIQRQGSSGS-NGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSL

Query:  NSVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPPVIA
        NSVL++VEE+SG A+DIPEIP+ +  PW++  P+ P+ A
Subjt:  NSVLQIVEEVSGLAMDIPEIPDPLLKPWELSRPVPPVIA

AT4G34590.1 G-box binding factor 69.4e-1650Show/hide
Query:  SNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLNSVLQIVE
        ++ GS+  + ++RKRKRM SNRESARRSRMKKQK L+DLT++V+ L+  N + + S+S   Q ++ V+  N+VLRAQ  EL  RLQSLN +++ ++
Subjt:  SNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLNSVLQIVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTATTCAAAGGCAAGGCAGCTCGGGATCTAATGGAGGTTCGCAATCTGTGATTCCTGATGAGAGGAAGAGGAAGCGGATGCAATCTAACAGAGAATCGGCTCG
GAGATCTCGGATGAAGAAGCAGAAGCAATTAGAGGACCTCACAAGCGAGGTAAGCCGACTAGAAATTGCGAATAATCAGCATCTGCAGAGCATTAGTGCCAAGGAGCAAG
CATTTATCCAGGTCGACAACATGAACAATGTTCTCAGGGCTCAGGCTATGGAACTTACTGATCGCTTGCAGTCCCTGAACTCGGTCCTCCAGATTGTGGAGGAGGTTAGC
GGGCTTGCCATGGATATTCCGGAAATTCCCGATCCTCTCTTGAAGCCGTGGGAACTCTCTCGGCCAGTTCCCCCCGTCATAGCTGACATTCTCAGGGTTGTCCTTGCAGT
AGTTCTCCAATGGATCCGACGACTCCTTCTTCCTGTCACGAGCGTGGCTCGCAGCTGCGCTCAGCTCCTCGACTTCATCCCACGCCGCAGCGCACTCGCCGCTCACGGGG
TCGCCTGCGCACGCCTCCTCTGCATTCTTGATGCTCTCTTCTACCTTGTCGGATATCTTATCCGGTGCGGCGTGAACTGGTGGAGCTCTCAACCGAATGGCCGACTGCCT
CCACGGGCGGCGGAGCCACCCGGCATTGAGGGCCTGCACTTTAGGAGTCTCAGCTGCCTTGGCCAAGATCCTAGGACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTATTCAAAGGCAAGGCAGCTCGGGATCTAATGGAGGTTCGCAATCTGTGATTCCTGATGAGAGGAAGAGGAAGCGGATGCAATCTAACAGAGAATCGGCTCG
GAGATCTCGGATGAAGAAGCAGAAGCAATTAGAGGACCTCACAAGCGAGGTAAGCCGACTAGAAATTGCGAATAATCAGCATCTGCAGAGCATTAGTGCCAAGGAGCAAG
CATTTATCCAGGTCGACAACATGAACAATGTTCTCAGGGCTCAGGCTATGGAACTTACTGATCGCTTGCAGTCCCTGAACTCGGTCCTCCAGATTGTGGAGGAGGTTAGC
GGGCTTGCCATGGATATTCCGGAAATTCCCGATCCTCTCTTGAAGCCGTGGGAACTCTCTCGGCCAGTTCCCCCCGTCATAGCTGACATTCTCAGGGTTGTCCTTGCAGT
AGTTCTCCAATGGATCCGACGACTCCTTCTTCCTGTCACGAGCGTGGCTCGCAGCTGCGCTCAGCTCCTCGACTTCATCCCACGCCGCAGCGCACTCGCCGCTCACGGGG
TCGCCTGCGCACGCCTCCTCTGCATTCTTGATGCTCTCTTCTACCTTGTCGGATATCTTATCCGGTGCGGCGTGAACTGGTGGAGCTCTCAACCGAATGGCCGACTGCCT
CCACGGGCGGCGGAGCCACCCGGCATTGAGGGCCTGCACTTTAGGAGTCTCAGCTGCCTTGGCCAAGATCCTAGGACTTGA
Protein sequenceShow/hide protein sequence
MASIQRQGSSGSNGGSQSVIPDERKRKRMQSNRESARRSRMKKQKQLEDLTSEVSRLEIANNQHLQSISAKEQAFIQVDNMNNVLRAQAMELTDRLQSLNSVLQIVEEVS
GLAMDIPEIPDPLLKPWELSRPVPPVIADILRVVLAVVLQWIRRLLLPVTSVARSCAQLLDFIPRRSALAAHGVACARLLCILDALFYLVGYLIRCGVNWWSSQPNGRLP
PRAAEPPGIEGLHFRSLSCLGQDPRT