; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh06G004170 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh06G004170
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPLATZ transcription factor family protein
Genome locationCmo_Chr06:2002948..2004076
RNA-Seq ExpressionCmoCh06G004170
SyntenyCmoCh06G004170
Gene Ontology termsNA
InterPro domainsIPR006734 - PLATZ transcription factor


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144595.1 uncharacterized protein LOC111014242 isoform X1 [Momordica charantia]1.6e-8986.39Show/hide
Query:  MMKQKS-MSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVM
        MMKQKS MSAPPWLEPLL TAFFSICHTHG++ARSERNMYCLDCH DAFCFYCRSSHH DHQVIQIRRSSYHDVVRVA+IE++LDISGVQTYVINSARVM
Subjt:  MMKQKS-MSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVM

Query:  FLNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS
        FLNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCK         + G+ RNGDA F LE KKEA+VMERREGISSRRRKGIPHRAPFGS
Subjt:  FLNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS

XP_022144597.1 uncharacterized protein LOC111014242 isoform X2 [Momordica charantia]6.0e-8986.32Show/hide
Query:  MKQKS-MSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMF
        MKQKS MSAPPWLEPLL TAFFSICHTHG++ARSERNMYCLDCH DAFCFYCRSSHH DHQVIQIRRSSYHDVVRVA+IE++LDISGVQTYVINSARVMF
Subjt:  MKQKS-MSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMF

Query:  LNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS
        LNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCK         + G+ RNGDA F LE KKEA+VMERREGISSRRRKGIPHRAPFGS
Subjt:  LNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS

XP_022951161.1 uncharacterized protein LOC111454064 isoform X1 [Cucurbita moschata]3.2e-9894.21Show/hide
Query:  MMKQKSMSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMF
        MMKQKSMSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMF
Subjt:  MMKQKSMSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMF

Query:  LNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS
        LNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCK         + GMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS
Subjt:  LNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS

XP_023540023.1 uncharacterized protein LOC111800522 isoform X2 [Cucurbita pepo subsp. pepo]1.2e-9794.18Show/hide
Query:  MKQKSMSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMFL
        MKQKSMSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMFL
Subjt:  MKQKSMSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMFL

Query:  NERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS
        NERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCK         + GMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS
Subjt:  NERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS

XP_038903534.1 uncharacterized protein LOC120090101 isoform X1 [Benincasa hispida]6.0e-8986.39Show/hide
Query:  MMKQKS-MSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVM
        MMKQKS MSAPPWLEPLL T FFSICHTHG++ARSERNMYCLDCH DAFCFYCRSSHH DHQVIQIRRSSYHDVVRVA+IE+ALDISGVQTYVINSARVM
Subjt:  MMKQKS-MSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVM

Query:  FLNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS
        FLNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCK         + G+ RNGDASF+LE KKEAM +ERREGISSRRRKGIPHRAPFGS
Subjt:  FLNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS

TrEMBL top hitse value%identityAlignment
A0A6J1CS33 uncharacterized protein LOC111014242 isoform X17.7e-9086.39Show/hide
Query:  MMKQKS-MSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVM
        MMKQKS MSAPPWLEPLL TAFFSICHTHG++ARSERNMYCLDCH DAFCFYCRSSHH DHQVIQIRRSSYHDVVRVA+IE++LDISGVQTYVINSARVM
Subjt:  MMKQKS-MSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVM

Query:  FLNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS
        FLNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCK         + G+ RNGDA F LE KKEA+VMERREGISSRRRKGIPHRAPFGS
Subjt:  FLNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS

A0A6J1CTQ1 uncharacterized protein LOC111014242 isoform X37.2e-8886.41Show/hide
Query:  MSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMFLNERPQ
        MSAPPWLEPLL TAFFSICHTHG++ARSERNMYCLDCH DAFCFYCRSSHH DHQVIQIRRSSYHDVVRVA+IE++LDISGVQTYVINSARVMFLNERPQ
Subjt:  MSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMFLNERPQ

Query:  PKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS
        PKAGKGGAHICEICGRSLLDPFRFCSLGCK         + G+ RNGDA F LE KKEA+VMERREGISSRRRKGIPHRAPFGS
Subjt:  PKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS

A0A6J1CU41 uncharacterized protein LOC111014242 isoform X22.9e-8986.32Show/hide
Query:  MKQKS-MSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMF
        MKQKS MSAPPWLEPLL TAFFSICHTHG++ARSERNMYCLDCH DAFCFYCRSSHH DHQVIQIRRSSYHDVVRVA+IE++LDISGVQTYVINSARVMF
Subjt:  MKQKS-MSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMF

Query:  LNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS
        LNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCK         + G+ RNGDA F LE KKEA+VMERREGISSRRRKGIPHRAPFGS
Subjt:  LNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS

A0A6J1GGT0 uncharacterized protein LOC111454064 isoform X11.5e-9894.21Show/hide
Query:  MMKQKSMSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMF
        MMKQKSMSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMF
Subjt:  MMKQKSMSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMF

Query:  LNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS
        LNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCK         + GMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS
Subjt:  LNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS

A0A6J1KYH1 uncharacterized protein LOC111498296 isoform X11.5e-9894.21Show/hide
Query:  MMKQKSMSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMF
        MMKQKSMSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMF
Subjt:  MMKQKSMSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMF

Query:  LNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS
        LNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCK         + GMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS
Subjt:  LNERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS

SwissProt top hitse value%identityAlignment
Q1G3Q4 Protein RGF1 INDUCIBLE TRANSCRIPTION FACTOR 14.3e-2946.88Show/hide
Query:  PPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMFLNERPQPKA
        P WL+ L    FF  C  H  + ++ERN+ CLDC C + C +C  S H+ H+++Q+RR  YHDVVR+ D+++ +D S VQ Y INSA+V+F+ +RPQ + 
Subjt:  PPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMFLNERPQPKA

Query:  GKGGAHICEICGRSLLDPFRFCSLGCKV
         KG  + C  C RSL +P+  CSLGCKV
Subjt:  GKGGAHICEICGRSLLDPFRFCSLGCKV

Arabidopsis top hitse value%identityAlignment
AT1G21000.1 PLATZ transcription factor family protein9.7e-4554.97Show/hide
Query:  KQKSMSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMFLN
        +++  ++PPWL P+L  ++F  C  H +S ++E N++CLDC  +AFC YC    HKDH+V+QIRRSSYH+VVRV +I++ +DI+ VQTY+INSA+++FLN
Subjt:  KQKSMSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMFLN

Query:  ERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKV-----SHHSLSFSISG
        ERPQP+ GKG  + CEIC RSLLD FRFCSLGCK+        SL+FS+ G
Subjt:  ERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKV-----SHHSLSFSISG

AT1G32700.1 PLATZ transcription factor family protein1.7e-4950.96Show/hide
Query:  KQKSMSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMFLN
        ++ + + P WL+PLL   FF  C  H +S +SE NMYCLDC     C  C  S HKDH  IQIRRSSYHDV+RV++I++ LDI+GVQTYVINSA+V+FLN
Subjt:  KQKSMSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMFLN

Query:  ERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHS------------------LSFSISGMNRNGDA---SFSLETKKEAMVMERREGISSRRRKG
        ERPQP+ GKG  + CE+C RSL+D FRFCSLGCK+S  S                   S SI  + +N D    SF+  T   + V  R     ++RRKG
Subjt:  ERPQPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHS------------------LSFSISGMNRNGDA---SFSLETKKEAMVMERREGISSRRRKG

Query:  IPHRAPFG
        IPHRAPFG
Subjt:  IPHRAPFG

AT1G76590.1 PLATZ transcription factor family protein1.1e-4555.41Show/hide
Query:  SMSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMFLNERP
        +++ PPWL P+L   +F  C  H  S +SE NM+CLDC  +AFC YC   +H++H+V+QIRRSSYH+VVRV +I++ +DIS VQTY+INSAR++FLNERP
Subjt:  SMSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMFLNERP

Query:  QPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETK
        QP+ GKG  + CEIC RSLLD FRFCSLGCK         + GM R+   +FSL  K
Subjt:  QPKAGKGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETK

AT2G27930.1 PLATZ transcription factor family protein4.7e-5556.86Show/hide
Query:  MSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMFLNERPQ
        M  P WLE LL T FFSIC  H E+ R+E NM+CL C   AFCFYCRSS H DH V+QIRRSSYHDVVRV++IE ALDI GVQTYVINSARV+FLNERPQ
Subjt:  MSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMFLNERPQ

Query:  PKAGKGGA---------HICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGI-----------SSRRRKGIPHRA
        PK    GA         + CE C R+LLDPFRFCSLGCKV          GM +N       E ++E +  ER++             +SRRRKGIPHRA
Subjt:  PKAGKGGA---------HICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGI-----------SSRRRKGIPHRA

Query:  PFGS
        PF S
Subjt:  PFGS

AT4G17900.1 PLATZ transcription factor family protein1.6e-4750.77Show/hide
Query:  PPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMFLNERPQPKA
        PPWL+PLL   FF  C  HG+S +SE NMYCLDC     C  C  +HHKDH+ IQIRRSSYHDV+RV +I++ LDI G+QTYVINSA+V+FLNERPQP+ 
Subjt:  PPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMFLNERPQPKA

Query:  GKGGAHICEICGRSLL-DPFRFCSLGCKVSHHSLSFSISGMN---RNGDASFSLETKKEAMVMERREGISS-----------RRRKGIPHRAPFG
        GKG  + C++C RSL+ D FRFCSLGCK++  S  F     N      D+S S+   K    ++     +            +RRKGIPHR+P G
Subjt:  GKGGAHICEICGRSLL-DPFRFCSLGCKVSHHSLSFSISGMN---RNGDASFSLETKKEAMVMERREGISS-----------RRRKGIPHRAPFG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAAGCAAAAATCAATGTCGGCGCCGCCGTGGTTGGAGCCATTGTTGAACACGGCGTTCTTCTCAATCTGCCACACGCACGGCGAATCCGCTAGGAGTGAGCGGAA
TATGTACTGTCTTGATTGCCACTGCGACGCATTTTGCTTCTATTGCCGCTCATCTCACCACAAGGATCATCAAGTAATTCAGATAAGGAGATCTTCGTATCATGATGTAG
TAAGAGTGGCAGATATCGAAGAAGCTTTGGATATAAGTGGCGTTCAAACTTATGTGATTAATAGCGCTAGAGTCATGTTCTTGAACGAGAGGCCTCAACCCAAAGCTGGT
AAAGGAGGAGCACATATTTGTGAAATTTGTGGGAGAAGCTTATTGGATCCATTTCGATTCTGTTCACTCGGCTGTAAGGTTAGCCACCATTCCCTTTCCTTTTCGATTTC
GGGGATGAACAGAAATGGGGATGCCAGCTTTAGCTTGGAGACGAAGAAAGAAGCAATGGTAATGGAAAGAAGAGAAGGAATTTCATCAAGGAGAAGAAAAGGCATTCCTC
ATAGGGCACCTTTTGGGTCCTAA
mRNA sequenceShow/hide mRNA sequence
AAGATCCAAAAGCAGCGTAAGCGCCAAACTTTGTCATCTCTGTCAAATTCGATCTGACGGCTGTAATGAAGCCTCAGCCACCAAATCCATCTCCACCGTCCATAGGCTCC
TGCTTTCCACAACACATTAACCTCGTCTCTTCAGCCTTGGCCTTTTCATTTTCGTTTCAATTCTCAGAAATTGGCATCGGTTTTCTCTCTGAGTTATGATAATAAGGCAA
AAGATCACATTTTTTTGGGTACCCATTTGAGAATAAATATGATGAAGCAAAAATCAATGTCGGCGCCGCCGTGGTTGGAGCCATTGTTGAACACGGCGTTCTTCTCAATC
TGCCACACGCACGGCGAATCCGCTAGGAGTGAGCGGAATATGTACTGTCTTGATTGCCACTGCGACGCATTTTGCTTCTATTGCCGCTCATCTCACCACAAGGATCATCA
AGTAATTCAGATAAGGAGATCTTCGTATCATGATGTAGTAAGAGTGGCAGATATCGAAGAAGCTTTGGATATAAGTGGCGTTCAAACTTATGTGATTAATAGCGCTAGAG
TCATGTTCTTGAACGAGAGGCCTCAACCCAAAGCTGGTAAAGGAGGAGCACATATTTGTGAAATTTGTGGGAGAAGCTTATTGGATCCATTTCGATTCTGTTCACTCGGC
TGTAAGGTTAGCCACCATTCCCTTTCCTTTTCGATTTCGGGGATGAACAGAAATGGGGATGCCAGCTTTAGCTTGGAGACGAAGAAAGAAGCAATGGTAATGGAAAGAAG
AGAAGGAATTTCATCAAGGAGAAGAAAAGGCATTCCTCATAGGGCACCTTTTGGGTCCTAA
Protein sequenceShow/hide protein sequence
MMKQKSMSAPPWLEPLLNTAFFSICHTHGESARSERNMYCLDCHCDAFCFYCRSSHHKDHQVIQIRRSSYHDVVRVADIEEALDISGVQTYVINSARVMFLNERPQPKAG
KGGAHICEICGRSLLDPFRFCSLGCKVSHHSLSFSISGMNRNGDASFSLETKKEAMVMERREGISSRRRKGIPHRAPFGS