CuGenDBv2

MS012133 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS012133
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: GATA transcription factor-like protein
Genome location: scaffold708:536199..537004
RNA-Seq Expression: MS012133
Synteny: MS012133
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits
KAG6575793.1 hypothetical protein SDJN03_26432, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 2.1e-94; identity: 76.25%)
Query:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE
        M SRLTAIA    W FSLAQ  RLRR   T  T RTADPSVHA DDN PAVPSGEPE+SQ+  EPD+AK+NY+R+DS +G  NGPF P KAQY SSPRLE
Subjt:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE

Query:  STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ
        +T V Q SKPITQQKR H TV+DDVSC+GFDGGP P ++  R   R+E+E+DEREYYKHHKASPLAEIEF DTRKPITRATDGTAYDGGGKDVIGWLPEQ
Subjt:  STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ

Query:  VDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF
         DT +DSL+RATEIWKQNA+RGDPDAPQSRVLRALRGEQF
Subjt:  VDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF

KAG7014335.1 hypothetical protein SDJN02_24512 [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 5.1e-93; identity: 75.42%)
Query:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE
        M SRLTAIA    W FSLAQ  RLRR   T  T RTADPSVHA DDN PAVPSGEPE+SQ+  EPD+AK+NY+R+DS +G  NGPF P KAQY SSPRLE
Subjt:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE

Query:  STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ
        +T V Q SKPITQQKR H TV+ DVSC+GFDGGP P ++  R   R+E+EED REYYKHHKASPLAEIEF DTRKPIT ATDGTAYDGGGKDVIGWLPEQ
Subjt:  STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ

Query:  VDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF
         DT +DSL+RATEIWKQNA+RGDPDAPQSRVLRALRGEQF
Subjt:  VDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF

XP_022151127.1 uncharacterized protein LOC111019128 [Momordica charantia] (e-value: 1.5e-129; identity: 98.33%)
Query:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE
        MQSRLTAIAPNSKWAFSLAQLHRLRRGLA PETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE
Subjt:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE

Query:  STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ
        STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPE+KDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ
Subjt:  STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ

Query:  VDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF
        VDTAEDSLRR TEIWK+NAVRGDPDAPQSRVLRALRGEQF
Subjt:  VDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF

XP_022991237.1 uncharacterized protein LOC111487953 [Cucurbita maxima] (e-value: 2.7e-94; identity: 77.5%)
Query:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE
        M SRLTAIA    WAFSLAQ  RLRR   T  T RTADPSVHA DDN PAVPSGEPE+SQ+  EPD AKANY  +DS +G  NGPF P KAQY SSPRLE
Subjt:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE

Query:  STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ
        +T V Q SKPITQQKR H TV+ DVSC+GFDGGP PE++  R   R+E+EEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVI WLPEQ
Subjt:  STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ

Query:  VDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF
         DT +DSLRRATEIWKQNA+RGDPDAPQSRVLRALRGEQF
Subjt:  VDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF

XP_023548846.1 uncharacterized protein LOC111807374 [Cucurbita pepo subsp. pepo] (e-value: 2.7e-94; identity: 76.67%)
Query:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE
        M SRLTAIA    W+FSLAQ  RLRR   T  T RTADPSVHA DDN PAVPSGEPE+SQ+  EPD+AKANY+R+DS +G  NGPF P KAQY SSPRLE
Subjt:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE

Query:  STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ
        +T V Q SKPITQQKR H TV+ DVSC+GFDGGP P ++  R   R+E+EEDEREYYKHHKASPLAEIEF DTRKPITRATDGTAYDGGGKDVIGWLPEQ
Subjt:  STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ

Query:  VDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF
         DT +DSLRRA EIWKQNA+RGDPDAPQSRVLRALRGEQF
Subjt:  VDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF

TrEMBL top hits
A0A0A0K9G7 Uncharacterized protein (e-value: 1.1e-85; identity: 72.54%)
Query:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHA---VDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGK-GAKNGPFGPTKAQYGSS
        MQS L AIAP S WAF + Q   LRRG  T  T RTADPS+HA    DDNDPAV SGEPE+SQ+  EPDNAKANYDR D  K G   GPFG   AQ+ SS
Subjt:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHA---VDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGK-GAKNGPFGPTKAQYGSS

Query:  PRLESTGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGW
        PRLE+T VGQ SKPITQQKR H   IDDVSC+G  GGP  + K+ R T  +EEEED R+YYKHHKASPLAEIEFADTRKPITRATDGTAYDG    VIGW
Subjt:  PRLESTGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGW

Query:  LPEQVDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF
        LPEQVDT +DSLRRATEIWKQNA+RGDPDAPQSRVLRALRGE+F
Subjt:  LPEQVDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF

A0A5D3D3D5 Uncharacterized protein (e-value: 6.5e-86; identity: 72.11%)
Query:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDD--NDPAVPSGEPEKSQEVSEPDNAKANYD-REDSGKGAKNGPFGPTKAQYGSSP
        MQSRL AIAP S WA  + QL  LRRG  T  TGRTADPSVHA DD  NDP+V SGEPE+SQ+  EPDNAKANY+ R+D  +G  NGPFGP+KAQ+ SSP
Subjt:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDD--NDPAVPSGEPEKSQEVSEPDNAKANYD-REDSGKGAKNGPFGPTKAQYGSSP

Query:  RLESTGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRT---GREEEEEDE-----REYYKHHKASPLAEIEFADTRKPITRATDGTAYDGG
        RLE+T VGQ SKPITQQKR H   IDDVSC+G  GGP  E K+ R T    +EEEEE+E     R+YYKHHKASPLAEIEF DTRKPITRATDGTA  G 
Subjt:  RLESTGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRT---GREEEEEDE-----REYYKHHKASPLAEIEFADTRKPITRATDGTAYDGG

Query:  GKDVIGWLPEQVDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF
        GK VIGWLPEQVDT +DSLRRATEIWKQNA+RGDPDAPQSRVLRALRGE F
Subjt:  GKDVIGWLPEQVDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF

A0A6J1DCP1 uncharacterized protein LOC111019128 (e-value: 7.4e-130; identity: 98.33%)
Query:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE
        MQSRLTAIAPNSKWAFSLAQLHRLRRGLA PETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE
Subjt:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE

Query:  STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ
        STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPE+KDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ
Subjt:  STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ

Query:  VDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF
        VDTAEDSLRR TEIWK+NAVRGDPDAPQSRVLRALRGEQF
Subjt:  VDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF

A0A6J1GPT4 uncharacterized protein LOC111456388 (e-value: 2.5e-93; identity: 75%)
Query:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE
        M SRLTAIA    W FSLAQ  RLRR   T  T RTADPSVHA DDN PAVPSGEPE+SQ+  EPD+AK+NY+R+DS +G  NGPF P KAQY SSPRLE
Subjt:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE

Query:  STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ
        +T V Q SKPITQQKR H TV+ DVSC+GFDGGP P ++  R   R+E+++DEREYYKHHKASPLAEIEF DTRKPITRATDGTAYDGGGKD+IGWLPEQ
Subjt:  STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ

Query:  VDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF
         DT +DSL+RATEIWKQNA+RGDPDAPQSRVLRALRGEQF
Subjt:  VDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF

A0A6J1JL82 uncharacterized protein LOC111487953 (e-value: 1.3e-94; identity: 77.5%)
Query:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE
        M SRLTAIA    WAFSLAQ  RLRR   T  T RTADPSVHA DDN PAVPSGEPE+SQ+  EPD AKANY  +DS +G  NGPF P KAQY SSPRLE
Subjt:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLE

Query:  STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ
        +T V Q SKPITQQKR H TV+ DVSC+GFDGGP PE++  R   R+E+EEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVI WLPEQ
Subjt:  STGVGQPSKPITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQ

Query:  VDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF
         DT +DSLRRATEIWKQNA+RGDPDAPQSRVLRALRGEQF
Subjt:  VDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF

SwissProt top hits
No hits found

Arabidopsis top hits
AT1G02700.1 unknown protein (e-value: 7.9e-52; identity: 50.4%)
Query:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDDN-DPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRL
        MQSRL A A  ++         RL  G +T  +GRTADP +HA +D  DPA+   +PE   +V+ P  A      +      +  P  P K+   ++ +L
Subjt:  MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDDN-DPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRL

Query:  ESTGVGQPSKPITQQKRPHGTV----IDDVSCVGFDGGPAP----EKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGK
        EST VG PS+P  QQKR + T     +D VSC G DG P P    E ++ RR  RE+E E ++E+YKHHKASPL+EIEFADTRKPIT+ATDGTAY   GK
Subjt:  ESTGVGQPSKPITQQKRPHGTV----IDDVSCVGFDGGPAP----EKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGK

Query:  DVIGWLPEQVDTAEDSLRRATEIWKQNAVRGDPDA-PQSRVLRALRGEQF
        DVIGWLPEQ+DTAE+SL +AT I+K+NA RGDP+  P SR+LR +RGE F
Subjt:  DVIGWLPEQVDTAEDSLRRATEIWKQNAVRGDPDA-PQSRVLRALRGEQF

AT4G02140.1 unknown protein (e-value: 4.7e-04; identity: 35.79%)
Query:  RLRRGLATPETGRTADPSVHAVDDND-PAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLESTGVGQPSKPITQQKR
        RL     +  TGRTADP +HA +D D P++   +PE   +V+ P         +    G    P  P K    +S +LEST VG P+    QQKR
Subjt:  RLRRGLATPETGRTADPSVHAVDDND-PAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLESTGVGQPSKPITQQKR
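
Note on reading the alignments: in each block the unlabeled middle line marks identical residues with a letter, conservative substitutions with "+", and other mismatches with a space. The following is a minimal sketch, not part of the database output, of how a percent identity can be recounted from one Query line and its middle line. The function name `midline_identity` and the assumption that the 8-column "Query:  " prefix has already been stripped from both lines are illustrative choices.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent identity for one alignment block.

    `query` is the aligned query string (gaps included) and `midline` is the
    line printed beneath it, both with the 8-column label prefix removed.
    """
    if not query:
        raise ValueError("empty query line")
    # Trailing mismatches may be lost as stripped whitespace, so pad the midline.
    midline = midline.ljust(len(query))[:len(query)]
    # Letters mark identical columns; '+' and ' ' mark non-identical ones.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Last block of the XP_022151127.1 alignment shown above.
query = "VDTAEDSLRRATEIWKQNAVRGDPDAPQSRVLRALRGEQF"
midline = "VDTAEDSLRR TEIWK+NAVRGDPDAPQSRVLRALRGEQF"
print(midline_identity(query, midline))  # 95.0
```

To reproduce the identity reported for a whole hit, the identical columns and the aligned lengths of all blocks would need to be summed before dividing.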


Sequences
CDS sequence
ATGCAATCCAGATTGACGGCGATAGCGCCGAATTCGAAGTGGGCCTTCTCTCTGGCCCAATTACACCGTCTCCGGCGAGGCCTGGCCACGCCGGAGACTGGTCGGACGGC
TGACCCCTCCGTTCATGCCGTCGACGACAACGATCCCGCCGTTCCTTCCGGTGAACCCGAAAAATCACAGGAAGTTTCAGAACCGGATAATGCCAAAGCCAATTACGACA
GAGAGGATTCCGGGAAAGGGGCAAAAAATGGGCCGTTTGGGCCAACGAAGGCCCAATACGGTTCCTCCCCGCGTTTAGAGAGCACGGGTGTGGGCCAGCCCTCGAAGCCC
ATCACCCAGCAGAAGAGGCCCCACGGGACGGTGATCGACGACGTGAGCTGCGTCGGGTTCGACGGCGGGCCGGCGCCGGAGAAGAAGGACGGCCGACGGACCGGCAGAGA
AGAGGAGGAGGAAGACGAGAGAGAGTACTACAAGCACCACAAGGCATCGCCGTTGGCGGAGATCGAGTTCGCGGATACGCGGAAGCCGATAACACGAGCGACGGACGGGA
CGGCCTACGACGGCGGCGGGAAAGACGTGATCGGATGGCTGCCGGAGCAGGTGGACACGGCGGAGGATTCACTCCGGCGAGCGACGGAGATTTGGAAACAGAACGCCGTC
CGCGGGGACCCCGATGCTCCCCAATCCAGGGTTCTTAGGGCTTTGCGTGGGGAACAGTTT

mRNA sequence
ATGCAATCCAGATTGACGGCGATAGCGCCGAATTCGAAGTGGGCCTTCTCTCTGGCCCAATTACACCGTCTCCGGCGAGGCCTGGCCACGCCGGAGACTGGTCGGACGGC
TGACCCCTCCGTTCATGCCGTCGACGACAACGATCCCGCCGTTCCTTCCGGTGAACCCGAAAAATCACAGGAAGTTTCAGAACCGGATAATGCCAAAGCCAATTACGACA
GAGAGGATTCCGGGAAAGGGGCAAAAAATGGGCCGTTTGGGCCAACGAAGGCCCAATACGGTTCCTCCCCGCGTTTAGAGAGCACGGGTGTGGGCCAGCCCTCGAAGCCC
ATCACCCAGCAGAAGAGGCCCCACGGGACGGTGATCGACGACGTGAGCTGCGTCGGGTTCGACGGCGGGCCGGCGCCGGAGAAGAAGGACGGCCGACGGACCGGCAGAGA
AGAGGAGGAGGAAGACGAGAGAGAGTACTACAAGCACCACAAGGCATCGCCGTTGGCGGAGATCGAGTTCGCGGATACGCGGAAGCCGATAACACGAGCGACGGACGGGA
CGGCCTACGACGGCGGCGGGAAAGACGTGATCGGATGGCTGCCGGAGCAGGTGGACACGGCGGAGGATTCACTCCGGCGAGCGACGGAGATTTGGAAACAGAACGCCGTC
CGCGGGGACCCCGATGCTCCCCAATCCAGGGTTCTTAGGGCTTTGCGTGGGGAACAGTTT

Protein sequence
MQSRLTAIAPNSKWAFSLAQLHRLRRGLATPETGRTADPSVHAVDDNDPAVPSGEPEKSQEVSEPDNAKANYDREDSGKGAKNGPFGPTKAQYGSSPRLESTGVGQPSKP
ITQQKRPHGTVIDDVSCVGFDGGPAPEKKDGRRTGREEEEEDEREYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQVDTAEDSLRRATEIWKQNAV
RGDPDAPQSRVLRALRGEQF
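
The CDS and protein sequences listed above are expected to correspond under the standard genetic code. A minimal sketch for checking this is given below; it is not part of the database record, and the helper name `translate` and the choice to ignore a trailing partial codon are assumptions made for illustration.

```python
# Standard genetic code, laid out in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon.

    Whitespace (such as the line breaks in the listing) is removed first,
    and a trailing partial codon is ignored. Any base other than A, C, G
    or T raises KeyError.
    """
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 nucleotides of the CDS sequence listed above.
cds_start = "ATGCAATCCAGATTGACGGCGATAGCGCCG"
print(translate(cds_start))  # MQSRLTAIAP
```

The printed residues match the first ten residues of the protein sequence; passing the full CDS text to `translate` allows the same comparison over the whole record.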