; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10019406 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10019406
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionUncharacterised protein family (UPF0114)
Genome locationChr04:21473081..21475948
RNA-Seq ExpressionHG10019406
SyntenyHG10019406
Gene Ontology termsGO:0009706 - chloroplast inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005134 - Uncharacterised protein family UPF0114


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7018071.1 hypothetical protein SDJN02_19937, partial [Cucurbita argyrosperma subsp. argyrosperma]5.1e-11581.63Show/hide
Query:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYPCASLSSSSSESKSAKSSSPTDNLVSSNGTSPPFVEPPRAPDSNFTYAFANPTASGGSLHPILGFMQS
        MALSA SPLP  VS GF RPQ PTT  PRF Y  ASLSSSSS S+SAK SSPTDN   SNGTS PFVEP RA DSNF+YAFANP+A G +LHPILGFMQS
Subjt:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYPCASLSSSSSESKSAKSSSPTDNLVSSNGTSPPFVEPPRAPDSNFTYAFANPTASGGSLHPILGFMQS

Query:  TESSIER--------------------------GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDR
        TESSIER                          GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDR
Subjt:  TESSIER--------------------------GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDR

Query:  ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE
        ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATG+DLLSYSVCIFLSSASLYILHNLHRPE
Subjt:  ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE

XP_008461820.1 PREDICTED: uncharacterized protein LOC103500330 [Cucumis melo]2.1e-11683.28Show/hide
Query:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYP-CASLSSSSSESKSAKSSSPTDNLV-SSNGTSPP-FVEPPRAPDSNFTYAFANPT-ASGGSLHPILG
        MA SAFSPLPFPVSLGFRRPQSPTT  PR  +P  ASLSSSSSESKSAKSSSPTDNLV SSNGT+PP FVEP   PDSNF YAF NPT A+  SLHPILG
Subjt:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYP-CASLSSSSSESKSAKSSSPTDNLV-SSNGTSPP-FVEPPRAPDSNFTYAFANPT-ASGGSLHPILG

Query:  FMQSTESSIER--------------------------GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPP
        FMQSTESSIER                          GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPP
Subjt:  FMQSTESSIER--------------------------GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPP

Query:  PVDRALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE
         VDRALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATG+DLLSYSVCIFLSSASLYILHNLHRPE
Subjt:  PVDRALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE

XP_022934188.1 uncharacterized protein LOC111441432 [Cucurbita moschata]7.8e-11681.98Show/hide
Query:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYPCASLSSSSSESKSAKSSSPTDNLVSSNGTSPPFVEPPRAPDSNFTYAFANPTASGGSLHPILGFMQS
        MA SA SPLP  VS GF RPQ PTT  PRF Y  ASLSSSSSESKSAK SSPT N   SNGTS PFVEP RA DSNF+YAFANP+A GG+LHPILGFMQS
Subjt:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYPCASLSSSSSESKSAKSSSPTDNLVSSNGTSPPFVEPPRAPDSNFTYAFANPTASGGSLHPILGFMQS

Query:  TESSIER--------------------------GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDR
        TESSIER                          GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDR
Subjt:  TESSIER--------------------------GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDR

Query:  ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE
        ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATG+DLLSYSVCIFLSSASLYILHNLHRPE
Subjt:  ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE

XP_022983902.1 uncharacterized protein LOC111482385 [Cucurbita maxima]6.6e-11581.63Show/hide
Query:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYPCASLSSSSSESKSAKSSSPTDNLVSSNGTSPPFVEPPRAPDSNFTYAFANPTASGGSLHPILGFMQS
        MALS  SPLP  VS GF RPQ PT   PRF Y  ASLSSSSSESKSAK SSPT N   SNGTS PFVEP RA DSNF+YAFANP+A GG+LHPILGFMQS
Subjt:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYPCASLSSSSSESKSAKSSSPTDNLVSSNGTSPPFVEPPRAPDSNFTYAFANPTASGGSLHPILGFMQS

Query:  TESSIER--------------------------GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDR
        TESSIER                          GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDR
Subjt:  TESSIER--------------------------GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDR

Query:  ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE
        ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATG+DLLSYSVCIFLSSASLYILHNLHRPE
Subjt:  ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE

XP_038905839.1 uncharacterized protein LOC120091793 [Benincasa hispida]1.4e-12586.22Show/hide
Query:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYPCASLSSSSSESKSAKSSSPTDNLVSSNGTSPPFVEPPRAPDSNFTYAFANPTASGGSLHPILGFMQS
        MA SAFSPLPFPVSLGF RPQS TT LPRF YPCASLSSSSSES SAKSS PTDNLVSSNGTSPPFVEP RAPDSNF+YAFANP+ +GGSLHPILGFMQS
Subjt:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYPCASLSSSSSESKSAKSSSPTDNLVSSNGTSPPFVEPPRAPDSNFTYAFANPTASGGSLHPILGFMQS

Query:  TESSIER--------------------------GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDR
        TESSIER                          GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDR
Subjt:  TESSIER--------------------------GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDR

Query:  ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE
        ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATG+DLLSYSVCIFLSSASLYILHNLHRPE
Subjt:  ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE

TrEMBL top hitse value%identityAlignment
A0A0A0LCC7 Uncharacterized protein7.2e-11580.97Show/hide
Query:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYP-CASLSSSSSESKSAKSSSPTDNLV-SSNGTSPP---FVEPPRAPDSNFTYAFANPT-ASGGSLHPI
        MA SAFSPLPFP+SLGFRRPQSPTT  PR P+P  +SLSSSSSESKSAKSSSPTDNLV SSNGT+PP   FV+P   P SNFTYAF NPT  +  SLHPI
Subjt:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYP-CASLSSSSSESKSAKSSSPTDNLV-SSNGTSPP---FVEPPRAPDSNFTYAFANPT-ASGGSLHPI

Query:  LGFMQSTESSIER--------------------------GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDE
        LGFMQS ESSIER                          GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDE
Subjt:  LGFMQSTESSIER--------------------------GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDE

Query:  PPPVDRALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE
        PP VDRAL+GSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATG+DLLSYSVCIFLSSASLYILHNLHRPE
Subjt:  PPPVDRALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE

A0A1S3CFG0 uncharacterized protein LOC1035003301.0e-11683.28Show/hide
Query:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYP-CASLSSSSSESKSAKSSSPTDNLV-SSNGTSPP-FVEPPRAPDSNFTYAFANPT-ASGGSLHPILG
        MA SAFSPLPFPVSLGFRRPQSPTT  PR  +P  ASLSSSSSESKSAKSSSPTDNLV SSNGT+PP FVEP   PDSNF YAF NPT A+  SLHPILG
Subjt:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYP-CASLSSSSSESKSAKSSSPTDNLV-SSNGTSPP-FVEPPRAPDSNFTYAFANPT-ASGGSLHPILG

Query:  FMQSTESSIER--------------------------GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPP
        FMQSTESSIER                          GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPP
Subjt:  FMQSTESSIER--------------------------GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPP

Query:  PVDRALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE
         VDRALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATG+DLLSYSVCIFLSSASLYILHNLHRPE
Subjt:  PVDRALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE

A0A6J1DH02 uncharacterized protein LOC1110203581.1e-11280.21Show/hide
Query:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYPCASLSSSSSESKSAKSSSPTDNLVSSNGTSPPFVEPPRAPDSNFTYAFANPTASGGSLHPILGFMQS
        MALSA S  PF VSLGF RP        R PYPCASLSSSSSESKSAK+SSPTDN+VSSNGTS P VEP RA DSNF YAFANP+  GG+ H ILGFMQS
Subjt:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYPCASLSSSSSESKSAKSSSPTDNLVSSNGTSPPFVEPPRAPDSNFTYAFANPTASGGSLHPILGFMQS

Query:  TESSIER--------------------------GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDR
        TESSIER                          GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPD PP VDR
Subjt:  TESSIER--------------------------GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDR

Query:  ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE
        ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATG+DLLSYSVCIFLSSASLYILHNLHRPE
Subjt:  ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE

A0A6J1F6Z4 uncharacterized protein LOC1114414323.8e-11681.98Show/hide
Query:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYPCASLSSSSSESKSAKSSSPTDNLVSSNGTSPPFVEPPRAPDSNFTYAFANPTASGGSLHPILGFMQS
        MA SA SPLP  VS GF RPQ PTT  PRF Y  ASLSSSSSESKSAK SSPT N   SNGTS PFVEP RA DSNF+YAFANP+A GG+LHPILGFMQS
Subjt:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYPCASLSSSSSESKSAKSSSPTDNLVSSNGTSPPFVEPPRAPDSNFTYAFANPTASGGSLHPILGFMQS

Query:  TESSIER--------------------------GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDR
        TESSIER                          GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDR
Subjt:  TESSIER--------------------------GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDR

Query:  ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE
        ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATG+DLLSYSVCIFLSSASLYILHNLHRPE
Subjt:  ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE

A0A6J1J0L4 uncharacterized protein LOC1114823853.2e-11581.63Show/hide
Query:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYPCASLSSSSSESKSAKSSSPTDNLVSSNGTSPPFVEPPRAPDSNFTYAFANPTASGGSLHPILGFMQS
        MALS  SPLP  VS GF RPQ PT   PRF Y  ASLSSSSSESKSAK SSPT N   SNGTS PFVEP RA DSNF+YAFANP+A GG+LHPILGFMQS
Subjt:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYPCASLSSSSSESKSAKSSSPTDNLVSSNGTSPPFVEPPRAPDSNFTYAFANPTASGGSLHPILGFMQS

Query:  TESSIER--------------------------GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDR
        TESSIER                          GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDR
Subjt:  TESSIER--------------------------GCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDR

Query:  ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE
        ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATG+DLLSYSVCIFLSSASLYILHNLHRPE
Subjt:  ALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G19390.1 Uncharacterised protein family (UPF0114)1.2e-3439.76Show/hide
Query:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYPCASLSSSSSESKSAKSSSPTDNLVSSNGTSPPFVEPPRAPDSNFTYAFANPTASGGSLHPILGFMQS
        + +S+F  L  P         S  +   R P   AS S+S+S S S      T   V+SN T+    E          Y+    T  G      LG +  
Subjt:  MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYPCASLSSSSSESKSAKSSSPTDNLVSSNGTSPPFVEPPRAPDSNFTYAFANPTASGGSLHPILGFMQS

Query:  TESSIERGCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDRALKGSSLFGMFALKERPKWMKISSLD
        +     +GC+Y+ D++  Y       ++ G+++  LVEAID+YL GTVML+FG+GLY LFISN+   E    D     SSLFGMF LKERP+W+++ S+ 
Subjt:  TESSIERGCVYICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDRALKGSSLFGMFALKERPKWMKISSLD

Query:  ELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLH
        ELKTK+GHVIVM+LL+ +F++SK V I +  DLL  SV IF SSA L++L  L+
Subjt:  ELKTKVGHVIVMILLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLH

AT5G13720.1 Uncharacterised protein family (UPF0114)1.1e-7267.62Show/hide
Query:  RFPYPCASLSSSSSESKSAKSSSPT------DNLVSSNGT---SPPFVEPPRAPDSN-----FTYAFANPTASGGSL-HPILGFMQSTESSIERGCVYIC
        RF  P A+L SS  ES SA SS PT      + L SS GT     PF +  R+ +SN     F + F    A GGSL   +L F+         GCVYI 
Subjt:  RFPYPCASLSSSSSESKSAKSSSPT------DNLVSSNGT---SPPFVEPPRAPDSN-----FTYAFANPTASGGSL-HPILGFMQSTESSIERGCVYIC

Query:  DAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDRALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMI
        +AYKVYW++C KGIHTGQMVLRLVEAIDVYLAGTVMLIF MGLYGLFIS+   D PP  DRAL+ SSLFGMFA+KERPKWMKISSLDELKTKVGHVIVMI
Subjt:  DAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDRALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMI

Query:  LLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE
        LLVKMFERSKMVTIATG+DLLSYSVCIFLSSASLYILHNLH+ E
Subjt:  LLVKMFERSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCTTTCTGCTTTCTCCCCGCTTCCATTCCCCGTCTCCTTAGGATTCCGCCGCCCTCAATCTCCGACCACCATGCTTCCTCGCTTTCCCTACCCTTGTGCTTCTTT
AAGCTCTTCCTCTTCCGAATCTAAATCCGCCAAATCATCCTCCCCCACTGATAATCTCGTTTCCTCCAATGGAACTTCCCCTCCCTTCGTTGAACCCCCCAGAGCTCCCG
ACTCCAATTTCACCTACGCCTTTGCTAACCCTACTGCTTCCGGTGGTTCTCTTCACCCCATTCTTGGGTTTATGCAATCCACTGAATCCTCAATTGAGAGGGGCTGCGTT
TATATTTGTGATGCATATAAAGTTTACTGGTCAAGCTGTGTCAAAGGGATTCACACCGGACAAATGGTTCTGCGACTTGTTGAAGCTATTGATGTATATCTTGCTGGAAC
CGTCATGTTAATCTTTGGGATGGGCCTATATGGATTGTTTATCAGTAATGTGTCTCCTGATGAACCTCCTCCTGTTGATCGTGCCCTGAAAGGATCCTCACTGTTTGGAA
TGTTTGCCTTGAAGGAGAGGCCAAAATGGATGAAAATTAGCTCTCTTGATGAGCTGAAAACAAAAGTCGGACATGTCATTGTCATGATTCTTTTAGTCAAAATGTTCGAG
AGAAGCAAGATGGTAACGATAGCAACTGGTGTCGATCTACTCAGTTATTCCGTCTGTATTTTCCTGTCTTCTGCATCTTTATACATCCTCCATAATCTACACAGGCCAGA
ATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGCTTTCTGCTTTCTCCCCGCTTCCATTCCCCGTCTCCTTAGGATTCCGCCGCCCTCAATCTCCGACCACCATGCTTCCTCGCTTTCCCTACCCTTGTGCTTCTTT
AAGCTCTTCCTCTTCCGAATCTAAATCCGCCAAATCATCCTCCCCCACTGATAATCTCGTTTCCTCCAATGGAACTTCCCCTCCCTTCGTTGAACCCCCCAGAGCTCCCG
ACTCCAATTTCACCTACGCCTTTGCTAACCCTACTGCTTCCGGTGGTTCTCTTCACCCCATTCTTGGGTTTATGCAATCCACTGAATCCTCAATTGAGAGGGGCTGCGTT
TATATTTGTGATGCATATAAAGTTTACTGGTCAAGCTGTGTCAAAGGGATTCACACCGGACAAATGGTTCTGCGACTTGTTGAAGCTATTGATGTATATCTTGCTGGAAC
CGTCATGTTAATCTTTGGGATGGGCCTATATGGATTGTTTATCAGTAATGTGTCTCCTGATGAACCTCCTCCTGTTGATCGTGCCCTGAAAGGATCCTCACTGTTTGGAA
TGTTTGCCTTGAAGGAGAGGCCAAAATGGATGAAAATTAGCTCTCTTGATGAGCTGAAAACAAAAGTCGGACATGTCATTGTCATGATTCTTTTAGTCAAAATGTTCGAG
AGAAGCAAGATGGTAACGATAGCAACTGGTGTCGATCTACTCAGTTATTCCGTCTGTATTTTCCTGTCTTCTGCATCTTTATACATCCTCCATAATCTACACAGGCCAGA
ATAG
Protein sequenceShow/hide protein sequence
MALSAFSPLPFPVSLGFRRPQSPTTMLPRFPYPCASLSSSSSESKSAKSSSPTDNLVSSNGTSPPFVEPPRAPDSNFTYAFANPTASGGSLHPILGFMQSTESSIERGCV
YICDAYKVYWSSCVKGIHTGQMVLRLVEAIDVYLAGTVMLIFGMGLYGLFISNVSPDEPPPVDRALKGSSLFGMFALKERPKWMKISSLDELKTKVGHVIVMILLVKMFE
RSKMVTIATGVDLLSYSVCIFLSSASLYILHNLHRPE