; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10013939 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10013939
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionUPF0307 protein plu4061 isoform X2
Genome locationChr02:6261672..6264465
RNA-Seq ExpressionHG10013939
SyntenyHG10013939
Gene Ontology termsNA
InterPro domainsIPR006839 - Ribosome-associated, YjgA
IPR023153 - PSPTO4464-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7024147.1 hypothetical protein SDJN02_12960, partial [Cucurbita argyrosperma subsp. argyrosperma]2.7e-10070.99Show/hide
Query:  MSHMVRALRQWPMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESVDDDSDARKSRNQLKREARRAV
        M+HMVRALR WPM+Q HC GC VHHFL SSPPWVAKRIDSRRL+LATVHSAR EVQYGSKGLRL KA APA+ QEDESVD+D D RKSRNQLKREARRAV
Subjt:  MSHMVRALRQWPMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESVDDDSDARKSRNQLKREARRAV

Query:  QWGMDLAAFSTPQIKRILR------------------GLGVMSEKESEGS---------------SITLATKAGDHKILQKLCASV---DDEVSNSVYEE
        QWGMDLAAFSTPQIKRILR                  G  V   K  + S               S+  ATK GDH +LQ L  SV   +DE ++S YEE
Subjt:  QWGMDLAAFSTPQIKRILR------------------GLGVMSEKESEGS---------------SITLATKAGDHKILQKLCASV---DDEVSNSVYEE

Query:  EEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL
        EEEEGPHVDI TRWLDGLVSKD N+TNEIYSLQTVEFDRQELRRLVRKVH +EERKAA EENEDEVN AIT A KPLARFLCRMAKQLP YEL
Subjt:  EEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL

XP_004136378.1 uncharacterized protein LOC101214378 [Cucumis sativus]9.3e-10172.97Show/hide
Query:  MSHMVRALRQWP-MVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESV-DDDSDARKSRNQLKREARR
        MSHMVRALRQWP MVQKHC GCAVHHFL SSPPWVAKRI SRRLSLATVHSAR EVQY SKGLRL KAPA AKSQE ES+ DDD D RKSRNQLKREARR
Subjt:  MSHMVRALRQWP-MVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESV-DDDSDARKSRNQLKREARR

Query:  AVQWGMDLAAFSTPQIKRILR------------------GLGVMSEKESEGSSI-----------------TLATKAGDHKILQKLCASVDDEVSNSVY-
        AVQWGMDLA FST QIKRIL                   G  V   K  + + I                   +TKAGDHKILQ+LCASVDDEVS  VY 
Subjt:  AVQWGMDLAAFSTPQIKRILR------------------GLGVMSEKESEGSSI-----------------TLATKAGDHKILQKLCASVDDEVSNSVY-

Query:  -EEEEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL
         EEEEEEGPHVDIATRWLDGL+SK+N IT EIYSLQTVEFDRQELRRLVRKVH +EERKAAIEEN DEVNTA+TNA KPLARFLCRMAKQLPS EL
Subjt:  -EEEEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL

XP_022937103.1 uncharacterized protein LOC111443507 [Cucurbita moschata]9.3e-10171.33Show/hide
Query:  MSHMVRALRQWPMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESVDDDSDARKSRNQLKREARRAV
        M HMVRALR WPM+Q HC GC VHHFL SSPPWVAKRIDSRRL+LATVHSAR EVQYGSKGLRL KA APA+ QEDESVD+D D RKSRNQLKREARRAV
Subjt:  MSHMVRALRQWPMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESVDDDSDARKSRNQLKREARRAV

Query:  QWGMDLAAFSTPQIKRILR------------------GLGVMSEKESEGS---------------SITLATKAGDHKILQKLCASV---DDEVSNSVYEE
        QWGMDLAAFSTPQIKRILR                  G  V   K  + S               S+  ATK GDH +LQ L  SV   DDE ++S YEE
Subjt:  QWGMDLAAFSTPQIKRILR------------------GLGVMSEKESEGS---------------SITLATKAGDHKILQKLCASV---DDEVSNSVYEE

Query:  EEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL
        EEEEGPHVDI TRWLDGLVSKD N+TNEIYSLQTVEFDRQELRRLVRKVH +EERKAA EENEDEVN AIT A KPLARFLCRMAKQLP YEL
Subjt:  EEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL

XP_023535041.1 uncharacterized protein LOC111796585 [Cucurbita pepo subsp. pepo]1.2e-10071.33Show/hide
Query:  MSHMVRALRQWPMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESVDDDSDARKSRNQLKREARRAV
        M HMVRALR WPM+Q HC GC VHHFL SSPPWVAKRIDSRRL+LATVHSAR EVQYGSKGLRL KA APA+ QEDESVD+D D RKSRNQLKREARRAV
Subjt:  MSHMVRALRQWPMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESVDDDSDARKSRNQLKREARRAV

Query:  QWGMDLAAFSTPQIKRILR------------------GLGVMSEKESEGS---------------SITLATKAGDHKILQKLCASV---DDEVSNSVYEE
        QWGMDLAAFSTPQIKRILR                  G  V   K  + S               S+  ATK GDH +LQ L  SV   DDE ++S YEE
Subjt:  QWGMDLAAFSTPQIKRILR------------------GLGVMSEKESEGS---------------SITLATKAGDHKILQKLCASV---DDEVSNSVYEE

Query:  EEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL
        EEE+GPHVDIATRWLDGLVSKD N+TNEIYSLQTVEFDRQELRRLVRKVH +EERKAA EENEDEVN AIT A KPLARFLCRMAKQLP YEL
Subjt:  EEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL

XP_038897014.1 UPF0307 protein ECA0281 isoform X1 [Benincasa hispida]4.2e-10172.26Show/hide
Query:  MSHMVRALRQWPMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESVDDDSDARKSRNQLKREARRAV
        MSHMVRALRQWPM+QKH  GCAV H   S  PWV KR DSRRLSLATVHSAR EVQ  SKGLRLPKAPAPAKSQEDESV+DDSD RKSRNQLKREARRAV
Subjt:  MSHMVRALRQWPMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESVDDDSDARKSRNQLKREARRAV

Query:  QWGMDLAAFSTPQIKRILR------------------GLGVMSEKESEGSSI-----------------TLATKAGDHKILQKLCASVDDEVSNSVYEEE
        QWGMDLAAFS PQIKRIL                   G  V   K  + + I                   ATK GDHKILQKLCASVDD+VS SVYEEE
Subjt:  QWGMDLAAFSTPQIKRILR------------------GLGVMSEKESEGSSI-----------------TLATKAGDHKILQKLCASVDDEVSNSVYEEE

Query:  EEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL
        EEEGPHV+IATRWLDGL+SKDNNITNEIYSLQTVEFDRQELRRLVRKV  IE++KAAIEEN DEVN  ITNA KPLA FLCR+AKQLPSYEL
Subjt:  EEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL

TrEMBL top hitse value%identityAlignment
A0A0A0LH61 Uncharacterized protein4.5e-10172.97Show/hide
Query:  MSHMVRALRQWP-MVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESV-DDDSDARKSRNQLKREARR
        MSHMVRALRQWP MVQKHC GCAVHHFL SSPPWVAKRI SRRLSLATVHSAR EVQY SKGLRL KAPA AKSQE ES+ DDD D RKSRNQLKREARR
Subjt:  MSHMVRALRQWP-MVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESV-DDDSDARKSRNQLKREARR

Query:  AVQWGMDLAAFSTPQIKRILR------------------GLGVMSEKESEGSSI-----------------TLATKAGDHKILQKLCASVDDEVSNSVY-
        AVQWGMDLA FST QIKRIL                   G  V   K  + + I                   +TKAGDHKILQ+LCASVDDEVS  VY 
Subjt:  AVQWGMDLAAFSTPQIKRILR------------------GLGVMSEKESEGSSI-----------------TLATKAGDHKILQKLCASVDDEVSNSVY-

Query:  -EEEEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL
         EEEEEEGPHVDIATRWLDGL+SK+N IT EIYSLQTVEFDRQELRRLVRKVH +EERKAAIEEN DEVNTA+TNA KPLARFLCRMAKQLPS EL
Subjt:  -EEEEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL

A0A1S4E5I7 UPF0307 protein Asuc_0809 isoform X17.2e-9971.72Show/hide
Query:  MSHMVRALRQW-PMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESV-DDDSDARKSRNQLKREARR
        MSHMVRALRQW PM+QKHC GCAVHHFLS SPPWVAKRI SRRLSLATVHSAR EVQY SKGLRL KAPA AKSQEDES+ DDDSD RKSRNQLKREARR
Subjt:  MSHMVRALRQW-PMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESV-DDDSDARKSRNQLKREARR

Query:  AVQWGMDLAAFSTPQIKRIL----------------RGLG----------------VMSEKESEGSSITL---ATKAGDHKILQKLCASVDDEVSNSVY-
        AVQWGMDLA FST QIKRIL                + LG                ++ + + +   + +   ATKAGDHKILQ+LCASVDDEVS SV+ 
Subjt:  AVQWGMDLAAFSTPQIKRIL----------------RGLG----------------VMSEKESEGSSITL---ATKAGDHKILQKLCASVDDEVSNSVY-

Query:  --EEEEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL
          EEEEEEGPHVD+ATRW DGL+SKDN IT EIYS QTVEFDRQELRRLVRKVH +EERKAAIEEN DEVN AITNA KPLARFL RMAKQLPS EL
Subjt:  --EEEEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL

A0A1S4E5J2 UPF0307 protein plu4061 isoform X25.5e-9971.96Show/hide
Query:  MSHMVRALRQW-PMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESV-DDDSDARKSRNQLKREARR
        MSHMVRALRQW PM+QKHC GCAVHHFLS SPPWVAKRI SRRLSLATVHSAR EVQY SKGLRL KAPA AKSQEDES+ DDDSD RKSRNQLKREARR
Subjt:  MSHMVRALRQW-PMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESV-DDDSDARKSRNQLKREARR

Query:  AVQWGMDLAAFSTPQIKRIL----------------RGLG----------------VMSEKESEGSSITL---ATKAGDHKILQKLCASVDDEVSNSVY-
        AVQWGMDLA FST QIKRIL                + LG                ++ + + +   + +   ATKAGDHKILQ+LCASVDDEVS SV+ 
Subjt:  AVQWGMDLAAFSTPQIKRIL----------------RGLG----------------VMSEKESEGSSITL---ATKAGDHKILQKLCASVDDEVSNSVY-

Query:  -EEEEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL
         EEEEEEGPHVD+ATRW DGL+SKDN IT EIYS QTVEFDRQELRRLVRKVH +EERKAAIEEN DEVN AITNA KPLARFL RMAKQLPS EL
Subjt:  -EEEEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL

A0A6J1FF35 uncharacterized protein LOC1114435074.5e-10171.33Show/hide
Query:  MSHMVRALRQWPMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESVDDDSDARKSRNQLKREARRAV
        M HMVRALR WPM+Q HC GC VHHFL SSPPWVAKRIDSRRL+LATVHSAR EVQYGSKGLRL KA APA+ QEDESVD+D D RKSRNQLKREARRAV
Subjt:  MSHMVRALRQWPMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESVDDDSDARKSRNQLKREARRAV

Query:  QWGMDLAAFSTPQIKRILR------------------GLGVMSEKESEGS---------------SITLATKAGDHKILQKLCASV---DDEVSNSVYEE
        QWGMDLAAFSTPQIKRILR                  G  V   K  + S               S+  ATK GDH +LQ L  SV   DDE ++S YEE
Subjt:  QWGMDLAAFSTPQIKRILR------------------GLGVMSEKESEGS---------------SITLATKAGDHKILQKLCASV---DDEVSNSVYEE

Query:  EEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL
        EEEEGPHVDI TRWLDGLVSKD N+TNEIYSLQTVEFDRQELRRLVRKVH +EERKAA EENEDEVN AIT A KPLARFLCRMAKQLP YEL
Subjt:  EEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL

A0A6J1IM05 uncharacterized protein LOC1114768126.5e-10070.99Show/hide
Query:  MSHMVRALRQWPMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESVDDDSDARKSRNQLKREARRAV
        M HMVRALR WPM+Q HC GC VHHFL SSPPWVAKRIDS RL+LATVHSAR EVQ+GSKGLRL KA APA+ QEDESVD+D D RKSRNQLKREARRAV
Subjt:  MSHMVRALRQWPMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESVDDDSDARKSRNQLKREARRAV

Query:  QWGMDLAAFSTPQIKRILR------------------GLGVMSEKESEGS---------------SITLATKAGDHKILQKLCASV---DDEVSNSVYEE
        QWGMDLAAFSTPQIKRILR                  G  V   K  + S               S+  ATK GDH  LQ L  SV   DDE ++S YEE
Subjt:  QWGMDLAAFSTPQIKRILR------------------GLGVMSEKESEGS---------------SITLATKAGDHKILQKLCASV---DDEVSNSVYEE

Query:  EEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL
        EEEEGPHVDIATRWLDGLVSKD N+TNEIYSLQTVEFDRQELRRLVRKVH +EERKAA EENEDEVN AIT A KPLARFLCRMAKQLP YEL
Subjt:  EEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G24175.1 unknown protein8.8e-3339.08Show/hide
Query:  PKAPAPAKSQEDESVDDD---SDARKSRNQLKREARRAVQWGMDLAAFSTPQIKRILRGLGVMSE------------------------------KESEG
        P+A  P     +E  D+D   SD+ +SRNQ KR+ARRAV+WGM+LA+FS  Q+K+IL+   +  E                              +E E 
Subjt:  PKAPAPAKSQEDESVDDD---SDARKSRNQLKREARRAVQWGMDLAAFSTPQIKRILRGLGVMSE------------------------------KESEG

Query:  ---SSITLATKAGDHKILQKLCASV-----------DDEVSNSVYEEEEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTI-
            ++  ATK GDH  LQ L +S            DD+      +EEE    +  +A RW DGL+S++  +T E+YSLQ+V+FDRQELR+LVRKV  + 
Subjt:  ---SSITLATKAGDHKILQKLCASV-----------DDEVSNSVYEEEEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTI-

Query:  EERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPS
        E+RK   EE + EV  A+  A K L +FLC MAKQ+ S
Subjt:  EERKAAIEENEDEVNTAITNATKPLARFLCRMAKQLPS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCATATGGTTCGGGCTCTCCGGCAGTGGCCGATGGTGCAGAAACATTGTCGTGGTTGCGCCGTACATCATTTTCTCTCCTCATCTCCGCCGTGGGTGGCCAAAAG
AATCGACTCTCGTCGACTATCTTTAGCTACCGTTCATTCTGCTCGTGGCGAAGTCCAATATGGATCAAAAGGACTCAGATTACCCAAAGCTCCAGCACCAGCCAAATCCC
AAGAAGATGAGAGCGTCGATGATGATTCGGATGCTAGAAAGAGCCGCAACCAGCTTAAACGGGAAGCTCGACGAGCCGTCCAATGGGGCATGGATCTTGCGGCCTTCTCC
ACTCCTCAAATTAAACGCATCCTTAGAGGTTTGGGAGTGATGTCAGAGAAGGAAAGCGAAGGCAGTTCAATTACATTGGCCACAAAAGCCGGTGACCATAAGATACTACA
GAAATTGTGTGCTTCAGTAGATGATGAAGTTTCAAATTCTGTATACGAGGAGGAGGAAGAAGAGGGTCCGCATGTGGACATCGCTACAAGATGGCTTGACGGGCTAGTCA
GTAAGGACAACAACATTACAAATGAAATTTATTCACTACAAACTGTTGAATTTGACCGTCAGGAGCTGCGGAGACTTGTTCGAAAAGTTCATACGATTGAAGAACGCAAG
GCAGCAATTGAAGAGAATGAGGATGAAGTCAATACCGCGATAACAAACGCCACAAAGCCCCTTGCTCGTTTCCTTTGTAGAATGGCGAAACAGTTGCCCTCCTATGAACT
CTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTCATATGGTTCGGGCTCTCCGGCAGTGGCCGATGGTGCAGAAACATTGTCGTGGTTGCGCCGTACATCATTTTCTCTCCTCATCTCCGCCGTGGGTGGCCAAAAG
AATCGACTCTCGTCGACTATCTTTAGCTACCGTTCATTCTGCTCGTGGCGAAGTCCAATATGGATCAAAAGGACTCAGATTACCCAAAGCTCCAGCACCAGCCAAATCCC
AAGAAGATGAGAGCGTCGATGATGATTCGGATGCTAGAAAGAGCCGCAACCAGCTTAAACGGGAAGCTCGACGAGCCGTCCAATGGGGCATGGATCTTGCGGCCTTCTCC
ACTCCTCAAATTAAACGCATCCTTAGAGGTTTGGGAGTGATGTCAGAGAAGGAAAGCGAAGGCAGTTCAATTACATTGGCCACAAAAGCCGGTGACCATAAGATACTACA
GAAATTGTGTGCTTCAGTAGATGATGAAGTTTCAAATTCTGTATACGAGGAGGAGGAAGAAGAGGGTCCGCATGTGGACATCGCTACAAGATGGCTTGACGGGCTAGTCA
GTAAGGACAACAACATTACAAATGAAATTTATTCACTACAAACTGTTGAATTTGACCGTCAGGAGCTGCGGAGACTTGTTCGAAAAGTTCATACGATTGAAGAACGCAAG
GCAGCAATTGAAGAGAATGAGGATGAAGTCAATACCGCGATAACAAACGCCACAAAGCCCCTTGCTCGTTTCCTTTGTAGAATGGCGAAACAGTTGCCCTCCTATGAACT
CTAG
Protein sequenceShow/hide protein sequence
MSHMVRALRQWPMVQKHCRGCAVHHFLSSSPPWVAKRIDSRRLSLATVHSARGEVQYGSKGLRLPKAPAPAKSQEDESVDDDSDARKSRNQLKREARRAVQWGMDLAAFS
TPQIKRILRGLGVMSEKESEGSSITLATKAGDHKILQKLCASVDDEVSNSVYEEEEEEGPHVDIATRWLDGLVSKDNNITNEIYSLQTVEFDRQELRRLVRKVHTIEERK
AAIEENEDEVNTAITNATKPLARFLCRMAKQLPSYEL