CuGenDBv2

Sgr015758 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr015758
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: GATA transcription factor-like protein
Genome location: tig00005725:58284..59105
RNA-Seq Expression: Sgr015758
Synteny: Sgr015758
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6575793.1 hypothetical protein SDJN03_26432, partial [Cucurbita argyrosperma subsp. sororia]; e-value 1.7e-91; identity 74.39%
Query:  MLSRLTATVAKSNWAFSPAQFQCLRR-GLPSAPTGRTANPDVHAVDDNDPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQYA
        M SRLTA  +K NW FS AQFQ LRR GL    T RTA+P VHA DDN PAV SGEPE+SQD  EPD A++NY+R+DS Q D      NGPFAPPK QYA
Subjt:  MLSRLTATVAKSNWAFSPAQFQCLRR-GLPSAPTGRTANPDVHAVDDNDPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQYA

Query:  SSPRLESTAVGQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVI
        SSPRLE+T V Q SKPITQQKR H TV+DDVSCIG  GGP   ++ +R  DRKEQE+D+R+YYKHHKASPLAEIEF DTRKPITRATDGTAYDGGGKDVI
Subjt:  SSPRLESTAVGQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVI

Query:  GWLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
        GWLPEQ DT +DSL+RATEIWKQNAMRGDPDAPQSRVLRALRGE+F
Subjt:  GWLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF

XP_022151127.1 uncharacterized protein LOC111019128 [Momordica charantia]; e-value 3.1e-101; identity 79.18%
Query:  MLSRLTATVAKSNWAFSPAQFQCLRRGLPSAPTGRTANPDVHAVDDNDPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQYAS
        M SRLTA    S WAFS AQ   LRRGL    TGRTA+P VHAVDDNDPAV SGEPEKSQ+V+EPD+A+ANYDREDS     GK  KNGPF P K QY S
Subjt:  MLSRLTATVAKSNWAFSPAQFQCLRRGLPSAPTGRTANPDVHAVDDNDPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQYAS

Query:  SPRLESTAVGQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIG
        SPRLEST VGQPSKPITQQKR HGTVIDDVSC+G  GGP  E+KD RRT R+E+EED+R+YYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIG
Subjt:  SPRLESTAVGQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIG

Query:  WLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
        WLPEQ+DTAEDSLRR TEIWK+NA+RGDPDAPQSRVLRALRGE+F
Subjt:  WLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF

XP_022991237.1 uncharacterized protein LOC111487953 [Cucurbita maxima]; e-value 4.9e-91; identity 75.61%
Query:  MLSRLTATVAKSNWAFSPAQFQCLRR-GLPSAPTGRTANPDVHAVDDNDPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQYA
        M SRLTA  +K NWAFS AQFQ LRR GL    T RTA+P VHA DDN PAV SGEPE+SQD  EPD A+ANY  +DS Q D      NGPFAPPK QYA
Subjt:  MLSRLTATVAKSNWAFSPAQFQCLRR-GLPSAPTGRTANPDVHAVDDNDPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQYA

Query:  SSPRLESTAVGQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVI
        SSPRLE+T V Q SKPITQQKR H TV+ DVSCIG  GGP  E++ +R  DRKEQEED+R+YYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVI
Subjt:  SSPRLESTAVGQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVI

Query:  GWLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
         WLPEQ DT +DSLRRATEIWKQNAMRGDPDAPQSRVLRALRGE+F
Subjt:  GWLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF

XP_023548846.1 uncharacterized protein LOC111807374 [Cucurbita pepo subsp. pepo]; e-value 2.2e-91; identity 74.8%
Query:  MLSRLTATVAKSNWAFSPAQFQCLRR-GLPSAPTGRTANPDVHAVDDNDPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQYA
        M SRLTA  +K NW+FS AQFQ LRR GL    T RTA+P VHA DDN PAV SGEPE+SQD  EPD A+ANY+R+DS Q D      NGPFAPPK QYA
Subjt:  MLSRLTATVAKSNWAFSPAQFQCLRR-GLPSAPTGRTANPDVHAVDDNDPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQYA

Query:  SSPRLESTAVGQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVI
        SSPRLE+T V Q SKPITQQKR H TV+ DVSCIG  GGP   ++ +R  DRKEQEED+R+YYKHHKASPLAEIEF DTRKPITRATDGTAYDGGGKDVI
Subjt:  SSPRLESTAVGQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVI

Query:  GWLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
        GWLPEQ DT +DSLRRA EIWKQNAMRGDPDAPQSRVLRALRGE+F
Subjt:  GWLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF

XP_038899333.1 uncharacterized protein LOC120086662 isoform X1 [Benincasa hispida]; e-value 1.7e-91; identity 74.29%
Query:  MLSRLTATVAKSNWAFSPAQFQCLRRGLPSAPTGRTANPDVHAVDDNDPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQYAS
        M SRLTA   KSNWA S AQFQ LRR   +  T RTA+P VHA DDNDPAV SGEPE+SQD  EPD+ +ANY+R+D    D      NGPF  PK Q+AS
Subjt:  MLSRLTATVAKSNWAFSPAQFQCLRRGLPSAPTGRTANPDVHAVDDNDPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQYAS

Query:  SPRLESTAVGQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIG
        SPRLE+  VGQ SKPITQQKR   TV D+VSCIGVYGGPL + K++R T+ KEQEED+RDYYKHHKASPLAEIEFADTRKPITRATDGTAYDG GKDVIG
Subjt:  SPRLESTAVGQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIG

Query:  WLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
        WLPEQLDT +DSLRRATEIWKQNAMRGDPDAPQSR+LRALRGE+F
Subjt:  WLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0K9G7 Uncharacterized protein; e-value 7.9e-87; identity 71.77%
Query:  MLSRLTATVAKSNWAFSPAQFQCLRRGLPSAPTGRTANPDVHA---VDDNDPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQ
        M S L A   KSNWAF   QFQ LRRG  +  T RTA+P +HA    DDNDPAV SGEPE+SQD  EPD+A+ANYDR D    D  +    GPF  P  Q
Subjt:  MLSRLTATVAKSNWAFSPAQFQCLRRGLPSAPTGRTANPDVHA---VDDNDPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQ

Query:  YASSPRLESTAVGQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKD
        +ASSPRLE+T VGQ SKPITQQKR H   IDDVSCIGVYGGPL + K++R T+ KE+EED+RDYYKHHKASPLAEIEFADTRKPITRATDGTAYDG    
Subjt:  YASSPRLESTAVGQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKD

Query:  VIGWLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
        VIGWLPEQ+DT +DSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
Subjt:  VIGWLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF

A0A1S3BR22 uncharacterized protein LOC103492778; e-value 1.3e-84; identity 69.26%
Query:  MLSRLTATVAKSNWAFSPAQFQCLRRGLPSAPTGRTANPDVHAVDD--NDPAVSSGEPEKSQDVAEPDDAEANYD-REDSAQADLGKEGKNGPFAPPKPQ
        M SRL A   +SNWA    QFQ LRRG     TGRTA+P VHA DD  NDP+V SGEPE+SQD  EPD+A+ANY+ R+D  Q D      NGPF P K Q
Subjt:  MLSRLTATVAKSNWAFSPAQFQCLRRGLPSAPTGRTANPDVHAVDD--NDPAVSSGEPEKSQDVAEPDDAEANYD-REDSAQADLGKEGKNGPFAPPKPQ

Query:  YASSPRLESTAVGQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQE---------EDDRDYYKHHKASPLAEIEFADTRKPITRATDG
        +ASSPRLE+T VGQ SKPITQQKR H   IDDVSCIGVYGGPL E K+ R T+ K++E         ED+RDYYKHHKASPLAEIEF DTRKPITRATDG
Subjt:  YASSPRLESTAVGQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQE---------EDDRDYYKHHKASPLAEIEFADTRKPITRATDG

Query:  TAYDGGGKDVIGWLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
        TA  G GK VIGWLPEQ+DT +DSLRRATEIWKQNAMRGDPDAPQSRVLRALRGE+F
Subjt:  TAYDGGGKDVIGWLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF

A0A6J1DCP1 uncharacterized protein LOC111019128; e-value 1.5e-101; identity 79.18%
Query:  MLSRLTATVAKSNWAFSPAQFQCLRRGLPSAPTGRTANPDVHAVDDNDPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQYAS
        M SRLTA    S WAFS AQ   LRRGL    TGRTA+P VHAVDDNDPAV SGEPEKSQ+V+EPD+A+ANYDREDS     GK  KNGPF P K QY S
Subjt:  MLSRLTATVAKSNWAFSPAQFQCLRRGLPSAPTGRTANPDVHAVDDNDPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQYAS

Query:  SPRLESTAVGQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIG
        SPRLEST VGQPSKPITQQKR HGTVIDDVSC+G  GGP  E+KD RRT R+E+EED+R+YYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIG
Subjt:  SPRLESTAVGQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIG

Query:  WLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
        WLPEQ+DTAEDSLRR TEIWK+NA+RGDPDAPQSRVLRALRGE+F
Subjt:  WLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF

A0A6J1GPT4 uncharacterized protein LOC111456388; e-value 2.0e-90; identity 73.17%
Query:  MLSRLTATVAKSNWAFSPAQFQCLRR-GLPSAPTGRTANPDVHAVDDNDPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQYA
        M SRLTA  +K NW FS AQFQ LRR GL    T RTA+P VHA DDN PAV SGEPE+SQD  EPD A++NY+R+DS Q D      NGPFAPPK QYA
Subjt:  MLSRLTATVAKSNWAFSPAQFQCLRR-GLPSAPTGRTANPDVHAVDDNDPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQYA

Query:  SSPRLESTAVGQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVI
        SSPRLE+T V Q SKPITQQKR H TV+ DVSCIG  GGP   ++ +R  DRKEQ++D+R+YYKHHKASPLAEIEF DTRKPITRATDGTAYDGGGKD+I
Subjt:  SSPRLESTAVGQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVI

Query:  GWLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
        GWLPEQ DT +DSL+RATEIWKQNAMRGDPDAPQSRVLRALRGE+F
Subjt:  GWLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF

A0A6J1JL82 uncharacterized protein LOC111487953; e-value 2.4e-91; identity 75.61%
Query:  MLSRLTATVAKSNWAFSPAQFQCLRR-GLPSAPTGRTANPDVHAVDDNDPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQYA
        M SRLTA  +K NWAFS AQFQ LRR GL    T RTA+P VHA DDN PAV SGEPE+SQD  EPD A+ANY  +DS Q D      NGPFAPPK QYA
Subjt:  MLSRLTATVAKSNWAFSPAQFQCLRR-GLPSAPTGRTANPDVHAVDDNDPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQYA

Query:  SSPRLESTAVGQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVI
        SSPRLE+T V Q SKPITQQKR H TV+ DVSCIG  GGP  E++ +R  DRKEQEED+R+YYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVI
Subjt:  SSPRLESTAVGQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVI

Query:  GWLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
         WLPEQ DT +DSLRRATEIWKQNAMRGDPDAPQSRVLRALRGE+F
Subjt:  GWLPEQLDTAEDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT1G02700.1 unknown protein; e-value 2.4e-51; identity 49.61%
Query:  MMLSRLTATVAKSNWAFSPAQFQCLRRGLPSAPTGRTANPDVHAVDDN-DPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQY
        MM SRL A  + +     P   + L  G  S+ +GRTA+P++HA +D  DPA+   +PE   DVA P  A A    +D+ +  L ++    P  PPK   
Subjt:  MMLSRLTATVAKSNWAFSPAQFQCLRRGLPSAPTGRTANPDVHAVDDN-DPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQY

Query:  ASSPRLESTAVGQPSKPITQQKRTHGTV----IDDVSCIGVYGGPLSEDKD--DRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYD
        A++ +LEST VG PS+P  QQKR + T     +D VSC G+ G P   D+   + +  R+++ E D+++YKHHKASPL+EIEFADTRKPIT+ATDGTAY 
Subjt:  ASSPRLESTAVGQPSKPITQQKRTHGTV----IDDVSCIGVYGGPLSEDKD--DRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYD

Query:  GGGKDVIGWLPEQLDTAEDSLRRATEIWKQNAMRGDPDA-PQSRVLRALRGEEF
          GKDVIGWLPEQLDTAE+SL +AT I+K+NA RGDP+  P SR+LR +RGE F
Subjt:  GGGKDVIGWLPEQLDTAEDSLRRATEIWKQNAMRGDPDA-PQSRVLRALRGEEF

AT4G02140.1 unknown protein; e-value 6.8e-06; identity 41.94%
Query:  SAPTGRTANPDVHAVDDND-PAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQYASSPRLESTAVGQPSKPITQQKR
        S+ TGRTA+P++HA +D D P++   +PE   DVA P    A+ D  D       KE    P +PPK    +S +LEST VG P+    QQKR
Subjt:  SAPTGRTANPDVHAVDDND-PAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQYASSPRLESTAVGQPSKPITQQKR


Sequences
CDS sequence
ATGATGTTATCGAGATTGACGGCGACGGTAGCCAAGTCGAATTGGGCCTTCTCTCCGGCCCAATTCCAATGTCTCCGTCGAGGTCTGCCGTCGGCTCCGACTGGTCGCAC
GGCTAACCCTGACGTTCATGCCGTCGATGACAACGATCCTGCCGTTTCATCCGGTGAACCTGAAAAATCGCAGGATGTTGCAGAACCAGATGATGCTGAAGCCAACTACG
ACAGAGAAGATTCTGCACAGGCAGATCTCGGCAAAGAAGGAAAAAATGGGCCGTTTGCACCACCGAAGCCCCAGTACGCTTCCTCCCCTCGGTTAGAGAGCACGGCGGTA
GGCCAGCCCTCGAAGCCAATCACTCAGCAAAAGAGAACCCACGGGACGGTGATCGACGACGTGAGCTGCATTGGAGTATACGGCGGCCCGTTGTCGGAGGACAAAGACGA
CCGACGAACCGACAGAAAAGAACAAGAAGAAGACGACAGAGATTACTACAAGCACCACAAAGCGTCGCCGTTGGCCGAAATCGAGTTTGCGGATACACGGAAGCCGATAA
CCAGAGCGACGGACGGGACGGCTTACGACGGCGGTGGGAAAGACGTGATCGGATGGCTGCCGGAGCAGCTGGACACGGCGGAGGATTCACTCCGGAGAGCGACGGAGATA
TGGAAACAAAACGCCATGCGTGGGGACCCTGATGCTCCGCAATCGAGGGTTCTTAGGGCTTTACGTGGCGAAGAGTTTTAA
mRNA sequence
ATGATGTTATCGAGATTGACGGCGACGGTAGCCAAGTCGAATTGGGCCTTCTCTCCGGCCCAATTCCAATGTCTCCGTCGAGGTCTGCCGTCGGCTCCGACTGGTCGCAC
GGCTAACCCTGACGTTCATGCCGTCGATGACAACGATCCTGCCGTTTCATCCGGTGAACCTGAAAAATCGCAGGATGTTGCAGAACCAGATGATGCTGAAGCCAACTACG
ACAGAGAAGATTCTGCACAGGCAGATCTCGGCAAAGAAGGAAAAAATGGGCCGTTTGCACCACCGAAGCCCCAGTACGCTTCCTCCCCTCGGTTAGAGAGCACGGCGGTA
GGCCAGCCCTCGAAGCCAATCACTCAGCAAAAGAGAACCCACGGGACGGTGATCGACGACGTGAGCTGCATTGGAGTATACGGCGGCCCGTTGTCGGAGGACAAAGACGA
CCGACGAACCGACAGAAAAGAACAAGAAGAAGACGACAGAGATTACTACAAGCACCACAAAGCGTCGCCGTTGGCCGAAATCGAGTTTGCGGATACACGGAAGCCGATAA
CCAGAGCGACGGACGGGACGGCTTACGACGGCGGTGGGAAAGACGTGATCGGATGGCTGCCGGAGCAGCTGGACACGGCGGAGGATTCACTCCGGAGAGCGACGGAGATA
TGGAAACAAAACGCCATGCGTGGGGACCCTGATGCTCCGCAATCGAGGGTTCTTAGGGCTTTACGTGGCGAAGAGTTTTAA
Protein sequence
MMLSRLTATVAKSNWAFSPAQFQCLRRGLPSAPTGRTANPDVHAVDDNDPAVSSGEPEKSQDVAEPDDAEANYDREDSAQADLGKEGKNGPFAPPKPQYASSPRLESTAV
GQPSKPITQQKRTHGTVIDDVSCIGVYGGPLSEDKDDRRTDRKEQEEDDRDYYKHHKASPLAEIEFADTRKPITRATDGTAYDGGGKDVIGWLPEQLDTAEDSLRRATEI
WKQNAMRGDPDAPQSRVLRALRGEEF