; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10013201 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10013201
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionZyxin-like
Genome locationChr01:27713316..27714786
RNA-Seq ExpressionHG10013201
SyntenyHG10013201
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004133795.1 DNA-directed RNA polymerase II subunit RPB1 [Cucumis sativus]2.0e-9878.23Show/hide
Query:  MAGR-FGRSFYRFSSANRPLAPGANTYGQESAQYEGRQYSSAGRDTSVEPRARSPPISPRR-DQPLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPKT
        MAGR FGR  YRFSSANRPLAP  NT GQ+SAQY+GR+Y S+ RDTS+EPR+  P ++ RR DQPLP SPTYSIKK TSPPSSP YRAPAAR ISSP KT
Subjt:  MAGR-FGRSFYRFSSANRPLAPGANTYGQESAQYEGRQYSSAGRDTSVEPRARSPPISPRR-DQPLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPKT

Query:  VDEYPKYKPTTQPRSPEAKPKP-VIHKA-VDKVTKSDRHHESSKTVSSHKVQ-QPNEINIKGENVGAVMEIVESSKREGGHVIKK-KEKTREILNNNDIG
        VDEYPKYKP TQPRSPEAK KP + HK+ V+KVTKSDR+HESSKT+SSHK Q QPN INIKGENVGAVMEIVESSKREGGH+IKK KE  R ILNNND+ 
Subjt:  VDEYPKYKPTTQPRSPEAKPKP-VIHKA-VDKVTKSDRHHESSKTVSSHKVQ-QPNEINIKGENVGAVMEIVESSKREGGHVIKK-KEKTREILNNNDIG

Query:  NDQNNEASKTTNSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHHIKY
        NDQNNEASK  NSS PTNTFLN+NFQSVNNSLLYNA LTH DPGLHL FSRNPTGER  VD  KKQHH +Y
Subjt:  NDQNNEASKTTNSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHHIKY

XP_008437848.1 PREDICTED: uncharacterized protein LOC103483156 [Cucumis melo]1.8e-9979.26Show/hide
Query:  MAGR-FGRSFYRFSSANRPLAPGANTYGQESAQYEGRQYSSAGRDTSVEPRARSPPISPRRDQ-PLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPKT
        MAGR FGR  YRFSSANRPLAP  NT GQ+SAQY+ RQY SA RDTS+EPR+  P IS R+D  PLP SPTYSIKK TSPPSSP YR  AAR ISSPPK 
Subjt:  MAGR-FGRSFYRFSSANRPLAPGANTYGQESAQYEGRQYSSAGRDTSVEPRARSPPISPRRDQ-PLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPKT

Query:  VDEYPKYKPTTQPRSPEAKPKPVIHK--AVDKVTKSDRHHESSKTVSSHKVQQPNEINIKGENVGAVMEIVESSKREGGHVIKK-KEKTREILNNNDIGN
        VDEYPKYKP TQPRSPEAK KP IHK   V+KVTKSDR+HE SK VSSHK QQPN INIKGENVGAVMEIVESSKREGGH+IKK KE  R ILNNND+ N
Subjt:  VDEYPKYKPTTQPRSPEAKPKPVIHK--AVDKVTKSDRHHESSKTVSSHKVQQPNEINIKGENVGAVMEIVESSKREGGHVIKK-KEKTREILNNNDIGN

Query:  DQNNEASKTTNSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHHIKY
        DQNNEASKTTNSS PTNTFLN+NFQSVNNSLLYNA LT+ DPGLHL+FSRNPTG+RF VD DKKQHH KY
Subjt:  DQNNEASKTTNSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHHIKY

XP_022147459.1 serine/arginine repetitive matrix protein 1-like [Momordica charantia]4.2e-8867.52Show/hide
Query:  MAGRFGRSFYRFSSANRPLAPGANTYGQESAQYEGRQYSSAGRDTSVEPRARSPPISPRRDQPLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPKTVD
        MAGR+GRSFYRFSS NRPLAPG NT  Q+SAQY+GRQY SA RD+SVEPRARSPP SPRR+   P SPTYS+KK  SPPSSP YRAPAAR +SSPP+ VD
Subjt:  MAGRFGRSFYRFSSANRPLAPGANTYGQESAQYEGRQYSSAGRDTSVEPRARSPPISPRRDQPLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPKTVD

Query:  EYPKYKPTTQPRSPEAKPKPVIHKAVDK-VTKSDRHHESSKTVSSHKVQQ--PNEINIKGENVGAVMEIVESSKREGGHVIKKKEKTREILNNNDIGNDQ
        EYPKYKPTTQPRSPEA  KPVI+KA++K  TKSDR+ E+ KT SS K QQ  PN INI GEN+GAVMEIV+S KREGGH+I+KKE     L++ND  N+ 
Subjt:  EYPKYKPTTQPRSPEAKPKPVIHKAVDK-VTKSDRHHESSKTVSSHKVQQ--PNEINIKGENVGAVMEIVESSKREGGHVIKKKEKTREILNNNDIGNDQ

Query:  NNE------ASKTTNSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHHIKY
        + +       S +++SS P NTFLN+NFQSVNNSLLYNA+L H DPGLHL F+RNP G+RF    D K+HH KY
Subjt:  NNE------ASKTTNSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHHIKY

XP_038879892.1 uncharacterized protein At1g10890-like isoform X1 [Benincasa hispida]3.8e-10582.16Show/hide
Query:  MAGR-FGRSFYRFSSANRPLAP-GANTYGQESAQYEGRQYSSAGRDTSVEPRARS-PPISPRRDQPLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPK
        MAGR FGR  YRFSSANRPLAP  ANT GQ+SAQY+GRQY SA RDTS+EPRARS PP+SPRRDQPLP SPTYSIKK TSPP SP YRAPAARVISSPPK
Subjt:  MAGR-FGRSFYRFSSANRPLAP-GANTYGQESAQYEGRQYSSAGRDTSVEPRARS-PPISPRRDQPLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPK

Query:  TVDEYPKYKPTTQPRSPEAKPKPVIHKAVDKVTKSDRHHESSKTVSSHKVQQPNEINIKGENVGAVMEIVESSKREG-GHVIKKKEKTREILNNNDIGND
        TVDEY KYKPTTQPRSPEAK KP I K VDKVTKSDRHHESSKTVSSHKVQQPN INIKG+NVGAVMEIVESSKREG GHVIKKKE  RE+LN ++  ND
Subjt:  TVDEYPKYKPTTQPRSPEAKPKPVIHKAVDKVTKSDRHHESSKTVSSHKVQQPNEINIKGENVGAVMEIVESSKREG-GHVIKKKEKTREILNNNDIGND

Query:  QNNEASKTTNSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHHIKY
        Q NEASKT NSS P +TFLN+NFQSVNNSLL+NATL H DPGLHL FS NPTG+R TVD DKKQHH KY
Subjt:  QNNEASKTTNSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHHIKY

XP_038879901.1 uncharacterized protein LOC120071611 isoform X2 [Benincasa hispida]4.5e-9879.18Show/hide
Query:  MAGR-FGRSFYRFSSANRPLAP-GANTYGQESAQYEGRQYSSAGRDTSVEPRARS-PPISPRRDQPLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPK
        MAGR FGR  YRFSSANRPLAP  ANT GQ+SAQY+GRQY SA RDTS+EPRARS PP+SPRRDQPLP SPTYSIKK TSPP SP YRAPAARVISSPPK
Subjt:  MAGR-FGRSFYRFSSANRPLAP-GANTYGQESAQYEGRQYSSAGRDTSVEPRARS-PPISPRRDQPLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPK

Query:  TVDEYPKYKPTTQPRSPEAKPKPVIHKAVDKVTKSDRHHESSKTVSSHKVQQPNEINIKGENVGAVMEIVESSKREG-GHVIKKKEKTREILNNNDIGND
        TVDEY K        SPEAK KP I K VDKVTKSDRHHESSKTVSSHKVQQPN INIKG+NVGAVMEIVESSKREG GHVIKKKE  RE+LN ++  ND
Subjt:  TVDEYPKYKPTTQPRSPEAKPKPVIHKAVDKVTKSDRHHESSKTVSSHKVQQPNEINIKGENVGAVMEIVESSKREG-GHVIKKKEKTREILNNNDIGND

Query:  QNNEASKTTNSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHHIKY
        Q NEASKT NSS P +TFLN+NFQSVNNSLL+NATL H DPGLHL FS NPTG+R TVD DKKQHH KY
Subjt:  QNNEASKTTNSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHHIKY

TrEMBL top hitse value%identityAlignment
A0A0A0L3R5 Uncharacterized protein9.7e-9978.23Show/hide
Query:  MAGR-FGRSFYRFSSANRPLAPGANTYGQESAQYEGRQYSSAGRDTSVEPRARSPPISPRR-DQPLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPKT
        MAGR FGR  YRFSSANRPLAP  NT GQ+SAQY+GR+Y S+ RDTS+EPR+  P ++ RR DQPLP SPTYSIKK TSPPSSP YRAPAAR ISSP KT
Subjt:  MAGR-FGRSFYRFSSANRPLAPGANTYGQESAQYEGRQYSSAGRDTSVEPRARSPPISPRR-DQPLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPKT

Query:  VDEYPKYKPTTQPRSPEAKPKP-VIHKA-VDKVTKSDRHHESSKTVSSHKVQ-QPNEINIKGENVGAVMEIVESSKREGGHVIKK-KEKTREILNNNDIG
        VDEYPKYKP TQPRSPEAK KP + HK+ V+KVTKSDR+HESSKT+SSHK Q QPN INIKGENVGAVMEIVESSKREGGH+IKK KE  R ILNNND+ 
Subjt:  VDEYPKYKPTTQPRSPEAKPKP-VIHKA-VDKVTKSDRHHESSKTVSSHKVQ-QPNEINIKGENVGAVMEIVESSKREGGHVIKK-KEKTREILNNNDIG

Query:  NDQNNEASKTTNSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHHIKY
        NDQNNEASK  NSS PTNTFLN+NFQSVNNSLLYNA LTH DPGLHL FSRNPTGER  VD  KKQHH +Y
Subjt:  NDQNNEASKTTNSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHHIKY

A0A1S3AVL1 uncharacterized protein LOC1034831568.8e-10079.26Show/hide
Query:  MAGR-FGRSFYRFSSANRPLAPGANTYGQESAQYEGRQYSSAGRDTSVEPRARSPPISPRRDQ-PLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPKT
        MAGR FGR  YRFSSANRPLAP  NT GQ+SAQY+ RQY SA RDTS+EPR+  P IS R+D  PLP SPTYSIKK TSPPSSP YR  AAR ISSPPK 
Subjt:  MAGR-FGRSFYRFSSANRPLAPGANTYGQESAQYEGRQYSSAGRDTSVEPRARSPPISPRRDQ-PLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPKT

Query:  VDEYPKYKPTTQPRSPEAKPKPVIHK--AVDKVTKSDRHHESSKTVSSHKVQQPNEINIKGENVGAVMEIVESSKREGGHVIKK-KEKTREILNNNDIGN
        VDEYPKYKP TQPRSPEAK KP IHK   V+KVTKSDR+HE SK VSSHK QQPN INIKGENVGAVMEIVESSKREGGH+IKK KE  R ILNNND+ N
Subjt:  VDEYPKYKPTTQPRSPEAKPKPVIHK--AVDKVTKSDRHHESSKTVSSHKVQQPNEINIKGENVGAVMEIVESSKREGGHVIKK-KEKTREILNNNDIGN

Query:  DQNNEASKTTNSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHHIKY
        DQNNEASKTTNSS PTNTFLN+NFQSVNNSLLYNA LT+ DPGLHL+FSRNPTG+RF VD DKKQHH KY
Subjt:  DQNNEASKTTNSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHHIKY

A0A5A7U5I8 Zyxin-like8.8e-10079.26Show/hide
Query:  MAGR-FGRSFYRFSSANRPLAPGANTYGQESAQYEGRQYSSAGRDTSVEPRARSPPISPRRDQ-PLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPKT
        MAGR FGR  YRFSSANRPLAP  NT GQ+SAQY+ RQY SA RDTS+EPR+  P IS R+D  PLP SPTYSIKK TSPPSSP YR  AAR ISSPPK 
Subjt:  MAGR-FGRSFYRFSSANRPLAPGANTYGQESAQYEGRQYSSAGRDTSVEPRARSPPISPRRDQ-PLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPKT

Query:  VDEYPKYKPTTQPRSPEAKPKPVIHK--AVDKVTKSDRHHESSKTVSSHKVQQPNEINIKGENVGAVMEIVESSKREGGHVIKK-KEKTREILNNNDIGN
        VDEYPKYKP TQPRSPEAK KP IHK   V+KVTKSDR+HE SK VSSHK QQPN INIKGENVGAVMEIVESSKREGGH+IKK KE  R ILNNND+ N
Subjt:  VDEYPKYKPTTQPRSPEAKPKPVIHK--AVDKVTKSDRHHESSKTVSSHKVQQPNEINIKGENVGAVMEIVESSKREGGHVIKK-KEKTREILNNNDIGN

Query:  DQNNEASKTTNSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHHIKY
        DQNNEASKTTNSS PTNTFLN+NFQSVNNSLLYNA LT+ DPGLHL+FSRNPTG+RF VD DKKQHH KY
Subjt:  DQNNEASKTTNSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHHIKY

A0A6J1D126 serine/arginine repetitive matrix protein 1-like2.0e-8867.52Show/hide
Query:  MAGRFGRSFYRFSSANRPLAPGANTYGQESAQYEGRQYSSAGRDTSVEPRARSPPISPRRDQPLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPKTVD
        MAGR+GRSFYRFSS NRPLAPG NT  Q+SAQY+GRQY SA RD+SVEPRARSPP SPRR+   P SPTYS+KK  SPPSSP YRAPAAR +SSPP+ VD
Subjt:  MAGRFGRSFYRFSSANRPLAPGANTYGQESAQYEGRQYSSAGRDTSVEPRARSPPISPRRDQPLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPKTVD

Query:  EYPKYKPTTQPRSPEAKPKPVIHKAVDK-VTKSDRHHESSKTVSSHKVQQ--PNEINIKGENVGAVMEIVESSKREGGHVIKKKEKTREILNNNDIGNDQ
        EYPKYKPTTQPRSPEA  KPVI+KA++K  TKSDR+ E+ KT SS K QQ  PN INI GEN+GAVMEIV+S KREGGH+I+KKE     L++ND  N+ 
Subjt:  EYPKYKPTTQPRSPEAKPKPVIHKAVDK-VTKSDRHHESSKTVSSHKVQQ--PNEINIKGENVGAVMEIVESSKREGGHVIKKKEKTREILNNNDIGNDQ

Query:  NNE------ASKTTNSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHHIKY
        + +       S +++SS P NTFLN+NFQSVNNSLLYNA+L H DPGLHL F+RNP G+RF    D K+HH KY
Subjt:  NNE------ASKTTNSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHHIKY

A0A6J1INF1 uncharacterized protein LOC111479075 isoform X13.2e-4951.1Show/hide
Query:  MAGRFGRSFYRFSSANRPLAPGANTYGQESAQYEGRQYSSAGRDTSVEPRARSPPISPRRDQPLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPKTVD
        MA RFGRS YRFSS NRP AP                   A R  S EPR  SP   PR                   P+SP    P++R+I+SPP  V 
Subjt:  MAGRFGRSFYRFSSANRPLAPGANTYGQESAQYEGRQYSSAGRDTSVEPRARSPPISPRRDQPLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPKTVD

Query:  EYPKYKPTTQPRSPEAKPKPVIHKAVDKVTKSDRHHESSKTVSSHKVQQPNEINIKGENVGAVMEIVESSKREGGHVIKKKEKTREILNN-NDIGNDQNN
        +YPK        SPE K K ++HK V+K  KS+R+ +S +T    K QQPN INI GENVGAVMEIVESSK EGGHV+KKKE  R +++N +D  NDQ+ 
Subjt:  EYPKYKPTTQPRSPEAKPKPVIHKAVDKVTKSDRHHESSKTVSSHKVQQPNEINIKGENVGAVMEIVESSKREGGHVIKKKEKTREILNN-NDIGNDQNN

Query:  EASK---------TTNSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHH
        +ASK           +SS PT TFLNNNFQSVNNSLL++A+L H DPGLHL FSRN TG+RFT+D DKKQHH
Subjt:  EASK---------TTNSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G46630.1 unknown protein1.2e-0828.46Show/hide
Query:  SVEPRARSPPISPRRDQPLPVSPTYSIKKTTSPPSSPRYR-APAARVISSPPKTVDEYPKYKPTTQPRSPEAKPKPVIHKA-------------------
        ++ PR  + P SP        S T    KT SP  S  +R AP+ RV+S            + TTQ     A+     H+                    
Subjt:  SVEPRARSPPISPRRDQPLPVSPTYSIKKTTSPPSSPRYR-APAARVISSPPKTVDEYPKYKPTTQPRSPEAKPKPVIHKA-------------------

Query:  VDKVTKSDRH--------HESSKTVSSHKVQQPNEINIKGENVGAVMEIVES--SKREGGHVIKKK-------EKTREILNNNDIGNDQNNEASKTT---
             ++  H        H    +  S  +     I I GEN GAVMEI+ S    + GG             EK R + +++   +D+     KTT   
Subjt:  VDKVTKSDRH--------HESSKTVSSHKVQQPNEINIKGENVGAVMEIVES--SKREGGHVIKKK-------EKTREILNNNDIGNDQNNEASKTT---

Query:  ----NSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNP
            NS+ P   F+N+N Q +NNS++YN+T +HHDPG+HL  SR P
Subjt:  ----NSSTPTNTFLNNNFQSVNNSLLYNATLTHHDPGLHLTFSRNP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGGTCGTTTTGGTCGTTCGTTTTACCGTTTTTCTTCTGCAAACCGACCCCTAGCTCCCGGTGCCAATACTTATGGCCAAGAGTCAGCTCAGTATGAGGGCCGCCA
GTATTCTTCAGCTGGCCGAGACACGTCGGTGGAACCCCGGGCTCGGTCGCCACCGATTTCGCCGAGAAGAGATCAGCCCCTTCCTGTTTCTCCCACGTACTCCATCAAGA
AGACTACTTCGCCGCCGTCTTCTCCGCGGTACAGGGCTCCGGCTGCTCGTGTCATCAGCTCGCCGCCAAAGACGGTAGATGAATACCCCAAGTATAAACCTACTACTCAA
CCCAGGTCGCCGGAGGCAAAGCCGAAACCAGTGATCCACAAGGCGGTCGATAAGGTGACGAAATCAGACCGTCATCATGAATCGAGCAAAACGGTGTCGTCCCACAAAGT
GCAACAACCAAATGAAATAAACATTAAAGGAGAAAATGTTGGGGCAGTAATGGAAATTGTTGAGTCATCGAAACGGGAAGGGGGACATGTTATAAAGAAGAAAGAGAAAA
CAAGAGAAATATTAAATAACAACGATATTGGCAATGATCAAAACAATGAAGCTTCAAAAACAACTAATTCATCAACGCCAACAAACACTTTCTTGAACAACAATTTTCAG
AGTGTCAACAATTCCCTTCTTTACAATGCAACTTTGACTCACCATGACCCCGGTTTACACCTCACTTTCTCTCGAAACCCGACCGGTGAACGGTTCACTGTTGATCATGA
CAAGAAACAACACCACATAAAATACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTGGTCGTTTTGGTCGTTCGTTTTACCGTTTTTCTTCTGCAAACCGACCCCTAGCTCCCGGTGCCAATACTTATGGCCAAGAGTCAGCTCAGTATGAGGGCCGCCA
GTATTCTTCAGCTGGCCGAGACACGTCGGTGGAACCCCGGGCTCGGTCGCCACCGATTTCGCCGAGAAGAGATCAGCCCCTTCCTGTTTCTCCCACGTACTCCATCAAGA
AGACTACTTCGCCGCCGTCTTCTCCGCGGTACAGGGCTCCGGCTGCTCGTGTCATCAGCTCGCCGCCAAAGACGGTAGATGAATACCCCAAGTATAAACCTACTACTCAA
CCCAGGTCGCCGGAGGCAAAGCCGAAACCAGTGATCCACAAGGCGGTCGATAAGGTGACGAAATCAGACCGTCATCATGAATCGAGCAAAACGGTGTCGTCCCACAAAGT
GCAACAACCAAATGAAATAAACATTAAAGGAGAAAATGTTGGGGCAGTAATGGAAATTGTTGAGTCATCGAAACGGGAAGGGGGACATGTTATAAAGAAGAAAGAGAAAA
CAAGAGAAATATTAAATAACAACGATATTGGCAATGATCAAAACAATGAAGCTTCAAAAACAACTAATTCATCAACGCCAACAAACACTTTCTTGAACAACAATTTTCAG
AGTGTCAACAATTCCCTTCTTTACAATGCAACTTTGACTCACCATGACCCCGGTTTACACCTCACTTTCTCTCGAAACCCGACCGGTGAACGGTTCACTGTTGATCATGA
CAAGAAACAACACCACATAAAATACTAG
Protein sequenceShow/hide protein sequence
MAGRFGRSFYRFSSANRPLAPGANTYGQESAQYEGRQYSSAGRDTSVEPRARSPPISPRRDQPLPVSPTYSIKKTTSPPSSPRYRAPAARVISSPPKTVDEYPKYKPTTQ
PRSPEAKPKPVIHKAVDKVTKSDRHHESSKTVSSHKVQQPNEINIKGENVGAVMEIVESSKREGGHVIKKKEKTREILNNNDIGNDQNNEASKTTNSSTPTNTFLNNNFQ
SVNNSLLYNATLTHHDPGLHLTFSRNPTGERFTVDHDKKQHHIKY