CuGenDBv2

HG10012693 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10012693
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Mitochondrial intermediate peptidase
Genome location: Chr01:23444228..23449059
RNA-Seq Expression: HG10012693
Synteny: HG10012693
Gene Ontology terms: NA
InterPro domains: NA
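
The genome location above is written as chromosome:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span could be parsed and measured as shown below. The helper name `parse_location` is hypothetical and is not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'Chr01:23444228..23449059' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("Chr01:23444228..23449059")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under a 0-based, half-open convention the length would instead be end - start, so the convention is worth confirming before using the number.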


Homology
GenBank top hits (e value, % identity, alignment)
KAG6582303.1 hypothetical protein SDJN03_22305, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.0e-122; % identity: 86.52)
Query:  MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKEL
        MGEALF LEQVL+SKQNSLTIEEAN+LQTCKSKAVRDFTFG L+GGGVTWAGTWRLNKF+RL+LSGGA  L GL RFSRSL+SCVDHILALDGSRMQKEL
Subjt:  MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKEL

Query:  ANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEF
        ANI+VT+YHNDPRTMQ ISKHF+YEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTH NDPKDNLH N H+DSSNRDS+ +Q DSYG+PDDKGNA   EF
Subjt:  ANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEF

Query:  KPVLTKPGTDAATADPLDCIFGTLAREEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQTMPTNFEHV
         PVLTKPG DAATADPLD IFGTL REEEIQHSSASSPSPKSH RS+RYNRRHRR NQTMPT+FEHV
Subjt:  KPVLTKPGTDAATADPLDCIFGTLAREEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQTMPTNFEHV

XP_022956077.1 uncharacterized protein LOC111457878 [Cucurbita moschata] (e value: 2.5e-125; % identity: 88.01)
Query:  MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKEL
        MGEALF LEQVLRSKQNSLTIEEAN+LQTCKSKAVRDFTFG L+GGGVTWAGTWRLNKF+RLNLSGGA AL GL RFSRSL+SCVDHILALDGSRMQKEL
Subjt:  MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKEL

Query:  ANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEF
        ANI+VT+YHNDPRTMQHISKHF+YEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTH NDPKDNLH N H+DSSNRDS+ +Q DSYG+PDDKGNA   EF
Subjt:  ANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEF

Query:  KPVLTKPGTDAATADPLDCIFGTLAREEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQTMPTNFEHV
         PVLTKPG DAATADPLD IFGTL REEEIQHSSASSPSPKSH RS+RYNRRHRR NQTMPT+FEHV
Subjt:  KPVLTKPGTDAATADPLDCIFGTLAREEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQTMPTNFEHV

XP_022980008.1 uncharacterized protein LOC111479542 [Cucurbita maxima] (e value: 6.2e-124; % identity: 87.64)
Query:  MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKEL
        MGEALF LEQVLRSKQNSLTIEEAN+LQTCKSKAVRDFTFG L+GGGVTWAGTWRLNKF+RLNLSGGA AL GL RFSRSL+SCVDHILALDGSRMQKEL
Subjt:  MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKEL

Query:  ANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEF
        ANI+VT+ HNDPRTMQHISKHF+YEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQR H NDPKDNLH N H+DSSNRDS+ +Q DSYGEPDDKGNA   EF
Subjt:  ANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEF

Query:  KPVLTKPGTDAATADPLDCIFGTLAREEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQTMPTNFEHV
         PVLTKPG DAATADPLD IFGTL REEEIQHSSASSPSPKSH RS+RYNRRHRR NQTMPT+FEHV
Subjt:  KPVLTKPGTDAATADPLDCIFGTLAREEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQTMPTNFEHV

XP_023527180.1 uncharacterized protein LOC111790494 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 4.3e-125; % identity: 87.64)
Query:  MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKEL
        MGEALF LEQVLRSKQNSLTIEEAN+LQTCKSKAVRDFTFG L+GGGVTWAGTWRLNKF+RLNLSGGA AL GL RFSRSL+SCVDHILALDGSRMQKEL
Subjt:  MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKEL

Query:  ANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEF
        ANI+VT+YHNDPRTMQHISKHF+YEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTH NDPKDNLH N H+DSSNRDS+ +Q DSYG+PDDKGNA   EF
Subjt:  ANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEF

Query:  KPVLTKPGTDAATADPLDCIFGTLAREEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQTMPTNFEHV
         PVLTKPG DAATADPLD IFGT+ REEEIQHSSASSPSPKSH RS+RYNRRHRR NQTMPT+FEHV
Subjt:  KPVLTKPGTDAATADPLDCIFGTLAREEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQTMPTNFEHV

XP_038878005.1 uncharacterized protein LOC120070209 isoform X1 [Benincasa hispida] (e value: 8.4e-129; % identity: 89.96)
Query:  MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKEL
        MGEALF LEQVLRSKQNSLTIEEANLLQTC+SKAVRDFT GGL+GGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFS SL SCVDHILAL GSRMQKEL
Subjt:  MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKEL

Query:  ANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEF
        ANI+VTRYHNDPR MQ ISKHFYYEEVFDDSTLDRPKIRWR RNFFSDDVAHAQRT DNDPKDNLH NSH+DSSNRDSSA+Q DSYG+PDDKGNA   E 
Subjt:  ANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEF

Query:  KPVLTKPGTDAATADPLDCIFGTLAR--EEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQTMPTNFEHV
        KPVLTKPGTDA T DPLDCIFGTLAR  EEEIQHSSASSPSPKSHSRSRRYNRRHR+ NQTMPTNFEHV
Subjt:  KPVLTKPGTDAATADPLDCIFGTLAR--EEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQTMPTNFEHV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L4T1 Uncharacterized protein (e value: 3.6e-117; % identity: 82.4)
Query:  MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKEL
        MGE L  LE VLRSK N LTIEEA LLQTC+SKAVRDFTFGG+LGGG+TWAG WRLNKF RLNLS GAA+LCG WRFSRSLNSCVD+ILALDGSRMQKEL
Subjt:  MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKEL

Query:  ANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEF
        ANI+VTRYHNDP  MQ+ISKHFYYEEVFDDST DRPKIRWRYRNFFSDDVAH+QRTH ND  +N+HENSH     RDSSA+QGDSYG+PDD GNA  HEF
Subjt:  ANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEF

Query:  KPVLTKPGTDAATADPLDCIFGTLAREEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQTMPTNFEHV
        KPVLTKPGTDAATADPLDCIFGTLAR+EEIQ+S+ S PSPK HSRSRRYNRRHR+DN T  TNFEHV
Subjt:  KPVLTKPGTDAATADPLDCIFGTLAREEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQTMPTNFEHV

A0A1S3AWL2 uncharacterized protein LOC103483703 isoform X1 (e value: 1.0e-119; % identity: 83.52)
Query:  MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKEL
        MGE L  LE VLRSK N LTIEEA LLQTC+SKAVRDFTFGG+LGGG+TWAGTWRLNKF RLNLSGGAAALCG WRFSRSLNSCVD+IL+LDGSRMQKEL
Subjt:  MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKEL

Query:  ANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEF
        ANI+VTRYHNDPR MQ+ISKHF+YEEVFDDST DRPKIRWRYRNFFSDDVAH+QRTH ND  +N+HENSH     RDSSAHQ DSYG+ DDKGNA  HEF
Subjt:  ANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEF

Query:  KPVLTKPGTDAATADPLDCIFGTLAREEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQTMPTNFEHV
        KPVLTK GTD+ATADPLDCIFGTLAREEEIQHS+ S+PSPK HSRSRRYNRRHR+DNQT PTNFE+V
Subjt:  KPVLTKPGTDAATADPLDCIFGTLAREEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQTMPTNFEHV

A0A6J1C8I6 uncharacterized protein LOC111009363 isoform X1 (e value: 1.4e-116; % identity: 85.33)
Query:  MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKEL
        MGEALF LEQVLRSKQNSLTIEEA LLQTCKSKAVRDFTFG L GGGVTWAGTWRLNKFIRLNLSGGAAAL GLWRFSRSLNSCVDHILALDGSRMQKEL
Subjt:  MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKEL

Query:  ANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEF
        ANI+VT+YHNDPRTMQHISKHFYYE+VFDDSTLDRP+IRWRYRNFFSDDVAH QRTHDND K+NLH NSH+ SSN DS+++Q  SY EPDDKGNA   EF
Subjt:  ANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEF

Query:  KPVLTKPGTDAATADPLDCIFGTLAREEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQT
        KPVLTKPGTD ATADPLDC+FG LA+ EEIQHS++S+ + KSHSRSRRY+RRHRR NQT
Subjt:  KPVLTKPGTDAATADPLDCIFGTLAREEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQT

A0A6J1GVC2 uncharacterized protein LOC111457878 (e value: 1.2e-125; % identity: 88.01)
Query:  MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKEL
        MGEALF LEQVLRSKQNSLTIEEAN+LQTCKSKAVRDFTFG L+GGGVTWAGTWRLNKF+RLNLSGGA AL GL RFSRSL+SCVDHILALDGSRMQKEL
Subjt:  MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKEL

Query:  ANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEF
        ANI+VT+YHNDPRTMQHISKHF+YEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTH NDPKDNLH N H+DSSNRDS+ +Q DSYG+PDDKGNA   EF
Subjt:  ANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEF

Query:  KPVLTKPGTDAATADPLDCIFGTLAREEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQTMPTNFEHV
         PVLTKPG DAATADPLD IFGTL REEEIQHSSASSPSPKSH RS+RYNRRHRR NQTMPT+FEHV
Subjt:  KPVLTKPGTDAATADPLDCIFGTLAREEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQTMPTNFEHV

A0A6J1IXZ4 uncharacterized protein LOC111479542 (e value: 3.0e-124; % identity: 87.64)
Query:  MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKEL
        MGEALF LEQVLRSKQNSLTIEEAN+LQTCKSKAVRDFTFG L+GGGVTWAGTWRLNKF+RLNLSGGA AL GL RFSRSL+SCVDHILALDGSRMQKEL
Subjt:  MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKEL

Query:  ANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEF
        ANI+VT+ HNDPRTMQHISKHF+YEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQR H NDPKDNLH N H+DSSNRDS+ +Q DSYGEPDDKGNA   EF
Subjt:  ANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEF

Query:  KPVLTKPGTDAATADPLDCIFGTLAREEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQTMPTNFEHV
         PVLTKPG DAATADPLD IFGTL REEEIQHSSASSPSPKSH RS+RYNRRHRR NQTMPT+FEHV
Subjt:  KPVLTKPGTDAATADPLDCIFGTLAREEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQTMPTNFEHV

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G05430.1 unknown protein (e value: 1.6e-16; % identity: 32.01)
Query:  ALFALEQVLRSK--QNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNK---FIRLNLSGGAAA----LCGLWRFSRSLNSCVDHILALDGS
        AL  L  VL SK  Q  +T EE+  + +C  KA+    F   +GGG+TW  T +L K     R+ L+ G AA    +   W  S+   S +DHIL+ D +
Subjt:  ALFALEQVLRSK--QNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNK---FIRLNLSGGAAA----LCGLWRFSRSLNSCVDHILALDGS

Query:  RMQKELANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFS------DDV--AHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSY
        RMQKEL N++V     +    Q +SKHFY E V+ D   D+P++RWR R  F+      DDV    +QR  +  P  +    S    +++     Q  S 
Subjt:  RMQKELANIIVTRYHNDPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFS------DDV--AHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSY

Query:  GEPDDKGNAHVHEFKPVLTKPGTDAATADPLDCIFGTLAREEEIQHSSASSPSPKSHSR-SRRYNRRHRRDNQTMPTN
              GN+              + A  D LD +FG     E I     S  + K+ +R  +R  RR R  N+   TN
Subjt:  GEPDDKGNAHVHEFKPVLTKPGTDAATADPLDCIFGTLAREEEIQHSSASSPSPKSHSR-SRRYNRRHRRDNQTMPTN
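
In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, '+' marks a conservative substitution, and a space marks any other mismatch or a gap. The sketch below counts identities in such a midline. The function name `midline_identity` is hypothetical, and whether the page's % identity column divides by the alignment length or by the query length is not stated here, so the denominator is passed in explicitly.

```python
def midline_identity(midline, aligned_length):
    """Return (identical_count, percent_identity) for a BLAST midline string."""
    # Letters mark identical residues; '+' and spaces are not identities.
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, 100.0 * identical / aligned_length

# First 20 columns of the midline from the first GenBank hit above.
segment = "MGEALF LEQVL+SKQNSLT"
count, percent = midline_identity(segment, len(segment))
print(count, percent)
```

This covers only a 20-column excerpt; reproducing a reported value would require the full midline with its leading and trailing spaces intact.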


Sequences
CDS sequence
ATGGGCGAAGCTTTATTTGCACTCGAACAAGTTCTCAGGTCCAAACAGAACAGCTTGACGATCGAGGAAGCGAATTTGCTCCAAACATGTAAGTCTAAGGCTGTTCGAGA
TTTTACCTTTGGAGGACTCCTTGGAGGTGGCGTGACATGGGCAGGAACATGGAGGCTGAATAAATTCATTCGGCTAAATCTTTCTGGAGGAGCTGCTGCGCTATGTGGAT
TATGGAGATTTAGCAGGTCCCTAAATTCATGTGTCGATCATATTCTTGCACTGGATGGAAGTAGGATGCAAAAGGAATTGGCTAATATTATAGTGACGAGATATCACAAT
GATCCTCGTACCATGCAGCACATATCCAAGCATTTTTATTATGAGGAAGTATTTGACGATTCAACCTTGGACCGGCCAAAAATAAGGTGGCGTTATCGAAATTTCTTTAG
TGATGATGTTGCTCATGCTCAGAGGACGCATGACAATGACCCTAAGGACAATTTGCATGAAAACTCCCACAATGACTCATCCAACCGTGATTCCAGTGCCCACCAGGGTG
ATTCCTATGGTGAGCCTGATGACAAAGGAAATGCTCATGTTCATGAGTTCAAGCCAGTCCTTACTAAGCCTGGCACCGATGCTGCGACCGCAGACCCTCTAGATTGTATT
TTTGGTACATTGGCCAGAGAAGAAGAAATTCAACACTCAAGTGCCTCTAGTCCATCTCCCAAATCTCACTCTCGCAGCAGAAGATACAATCGTCGGCATCGAAGAGATAA
CCAGACAATGCCAACAAACTTTGAACATGTGTAA
mRNA sequence
ATGGGCGAAGCTTTATTTGCACTCGAACAAGTTCTCAGGTCCAAACAGAACAGCTTGACGATCGAGGAAGCGAATTTGCTCCAAACATGTAAGTCTAAGGCTGTTCGAGA
TTTTACCTTTGGAGGACTCCTTGGAGGTGGCGTGACATGGGCAGGAACATGGAGGCTGAATAAATTCATTCGGCTAAATCTTTCTGGAGGAGCTGCTGCGCTATGTGGAT
TATGGAGATTTAGCAGGTCCCTAAATTCATGTGTCGATCATATTCTTGCACTGGATGGAAGTAGGATGCAAAAGGAATTGGCTAATATTATAGTGACGAGATATCACAAT
GATCCTCGTACCATGCAGCACATATCCAAGCATTTTTATTATGAGGAAGTATTTGACGATTCAACCTTGGACCGGCCAAAAATAAGGTGGCGTTATCGAAATTTCTTTAG
TGATGATGTTGCTCATGCTCAGAGGACGCATGACAATGACCCTAAGGACAATTTGCATGAAAACTCCCACAATGACTCATCCAACCGTGATTCCAGTGCCCACCAGGGTG
ATTCCTATGGTGAGCCTGATGACAAAGGAAATGCTCATGTTCATGAGTTCAAGCCAGTCCTTACTAAGCCTGGCACCGATGCTGCGACCGCAGACCCTCTAGATTGTATT
TTTGGTACATTGGCCAGAGAAGAAGAAATTCAACACTCAAGTGCCTCTAGTCCATCTCCCAAATCTCACTCTCGCAGCAGAAGATACAATCGTCGGCATCGAAGAGATAA
CCAGACAATGCCAACAAACTTTGAACATGTGTAA
Protein sequence
MGEALFALEQVLRSKQNSLTIEEANLLQTCKSKAVRDFTFGGLLGGGVTWAGTWRLNKFIRLNLSGGAAALCGLWRFSRSLNSCVDHILALDGSRMQKELANIIVTRYHN
DPRTMQHISKHFYYEEVFDDSTLDRPKIRWRYRNFFSDDVAHAQRTHDNDPKDNLHENSHNDSSNRDSSAHQGDSYGEPDDKGNAHVHEFKPVLTKPGTDAATADPLDCI
FGTLAREEEIQHSSASSPSPKSHSRSRRYNRRHRRDNQTMPTNFEHV
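
The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code applies (the page does not state which table was used), the first and last codons of the CDS can be checked against the protein as follows. The names `CODON_TABLE` and `translate` are illustrative only.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 30 and last 15 nucleotides of the CDS sequence listed above.
print(translate("ATGGGCGAAGCTTTATTTGCACTCGAACAA"))
print(translate("TTTGAACATGTGTAA"))
```

To translate the whole CDS, the wrapped lines listed above would first need to be joined into a single string.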