; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g16650 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g16650
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr8:12758319..12761332
RNA-Seq ExpressionMoc08g16650
SyntenyMoc08g16650
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR022546 - Uncharacterised protein family Ycf68


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GEV30041.1 uncharacterized protein ycf68 [Tanacetum cinerariifolium]1.4e-7546.43Show/hide
Query:  IQRMQALSGMIGRKASVGGFLNPPSNPRAQPWTGDGNYQAGVRSAFGNADTGGAWLSSAHATAGDKPEEGEDDVKSSCPLCPGRHTCYNGRDKGSHTAVN
        ++RMQALSGMIGRKASVG  L+ P                  R AFGNADTGGAWLSSA ATAGDKPEEGEDDVKSSC LCPGRHTCYNG+DKGS +   
Subjt:  IQRMQALSGMIGRKASVGGFLNPPSNPRAQPWTGDGNYQAGVRSAFGNADTGGAWLSSAHATAGDKPEEGEDDVKSSCPLCPGRHTCYNGRDKGSHTAVN

Query:  SFSGLVHTARHTMGAGHARSRYLNRKEGDAEGRASDWSEVVTRRIDGAIQVRSNVDPTFYSLVGSGRSGGDHL-------GSSLLENPYIPYQYMDSYLS
          +   H       A                G AS+    + RRIDGAIQVRSNVD TF SLVGSGR     L       G+S L             + 
Subjt:  SFSGLVHTARHTMGAGHARSRYLNRKEGDAEGRASDWSEVVTRRIDGAIQVRSNVDPTFYSLVGSGRSGGDHL-------GSSLLENPYIPYQYMDSYLS

Query:  STGLGLEKAAINRIFLILPSRKRKEELEIL-FPFFRRDQEIGSSRLKKDLRVSRVGPGGFLNAFFFLLIGVISQRLAMVRKKGGTSTLGERSTTESCAMI
             + K +   +       K+  +L +L  P            LKKDLRVSRVGPGG LNAFFFLLI VISQRLAMVRKKGGTSTLGERST ES    
Subjt:  STGLGLEKAAINRIFLILPSRKRKEELEIL-FPFFRRDQEIGSSRLKKDLRVSRVGPGGFLNAFFFLLIGVISQRLAMVRKKGGTSTLGERSTTESCAMI

Query:  YFTSEVSGSSPGWPSYTKEKNKRIEEASDSFMQAPLGSGGYSSVGRAPLLQLGRCDYGLDDGQLVRSSMDRTWTVVGVGGFLRVPSSGIPGEEDQVGPCE
                                                                                    GVG   RV   GIPGEEDQVGP E
Subjt:  YFTSEVSGSSPGWPSYTKEKNKRIEEASDSFMQAPLGSGGYSSVGRAPLLQLGRCDYGLDDGQLVRSSMDRTWTVVGVGGFLRVPSSGIPGEEDQVGPCE

Query:  QLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGCLRYPGVAD
        QLDALSPFNPLSEMRQKE KSMDR H LHPV TTR PQG LR+ GV D
Subjt:  QLDALSPFNPLSEMRQKEGKSMDRPHRLHPVGTTRSPQGCLRYPGVAD

KAF1876869.1 hypothetical protein Lal_00033161 [Lupinus albus]3.3e-7756.57Show/hide
Query:  MQALSGMIGRKASVGGFLNPPSNPRAQPWTGDGNYQAGV------------------------------------------------------RSAFGNA
        MQALSGMIGRKASVGGFL+ PSNPRAQPWTG GNYQAGV                                                      R AFGNA
Subjt:  MQALSGMIGRKASVGGFLNPPSNPRAQPWTGDGNYQAGV------------------------------------------------------RSAFGNA

Query:  DTGGAWLSSAHATAGDKPEEGEDDVKSSCPLCPGRHTCYNGRDKGSHTAVNSFSGLVHTARHTMGAGHARSRYLNRKEGDAEGRASDWSEVVT-RRIDGA
        DTGGAWLSSA ATAGDKPEEGEDDVKSSCPLCPGRHTCYNGRDKGS +                G G    R   R       RA D   VV   RIDGA
Subjt:  DTGGAWLSSAHATAGDKPEEGEDDVKSSCPLCPGRHTCYNGRDKGSHTAVNSFSGLVHTARHTMGAGHARSRYLNRKEGDAEGRASDWSEVVT-RRIDGA

Query:  IQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLGLEKAAINRIFLILPSRKRKEELEILFPFFRRDQEIGSSRLKKDLRVSRVG
        IQVRSNVDPTFYSLVGSGRSGGDH GSSLLEN YIPYQYMDSYLSSTGLG                                       LKKDLRVSRVG
Subjt:  IQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLGLEKAAINRIFLILPSRKRKEELEILFPFFRRDQEIGSSRLKKDLRVSRVG

Query:  PGGFLNAFFFLLIGVISQRLAMVRKKG
        PGG LNAFFFLLIGVISQR AMVRKKG
Subjt:  PGGFLNAFFFLLIGVISQRLAMVRKKG

KZV53052.1 hypothetical protein F511_23525 [Dorcoceras hygrometricum]8.7e-9475.46Show/hide
Query:  RIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLGLEKAAINRIFLILPSRKRKEELEILFPFFRRDQEIGSSRLKKDLR
        RIDGAI+VRSNVDPTF SLVGSGRSGGDH GSSLLENPYIPYQ M+SYLSST  GLEKAAINRI LILPS  RK E EILFP FRRDQEIGSSRLKKDLR
Subjt:  RIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLGLEKAAINRIFLILPSRKRKEELEILFPFFRRDQEIGSSRLKKDLR

Query:  VSRVGPGGFLNAFFFLLIGVISQRLAMVRKKGGTSTLGERSTTESCAMIYFTSEVSGSSPGWPSYTKEKNKRIEEASDSFMQAPLGSGGYSSVGRAPLLQ
        VSRVGPGG LNAFFFLLIGVISQ LA VRKKGGTSTLGERSTTE C +   +     ++PG       K  RIEEASD FM APLGSGGYSSVGRAPLLQ
Subjt:  VSRVGPGGFLNAFFFLLIGVISQRLAMVRKKGGTSTLGERSTTESCAMIYFTSEVSGSSPGWPSYTKEKNKRIEEASDSFMQAPLGSGGYSSVGRAPLLQ

Query:  LGRCDYGLDDGQLVRSSMDRTWTVVGVGGFLRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGK
        LGRCDYG  +        D T T +G       PSS IPGEEDQVGPCEQLDALSPFNPLSEMRQK+ K
Subjt:  LGRCDYGLDDGQLVRSSMDRTWTVVGVGGFLRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGK

OIV94276.1 hypothetical protein TanjilG_00025 [Lupinus angustifolius]6.7e-10281.22Show/hide
Query:  PDGAMPRG-----AAAVIQRMQALSGMIGRKASVGGFLNPPSNPRAQPWTGDGNYQAGV---------------RSAFGNADTGGAWLSSAHATAGDKPE
        PDGAMPRG     AA V QRMQALSGMIGRKAS+GGFL+ PSNPRAQPWTG GNYQAGV               R AFGNADTGGAWLSSA ATAGDKPE
Subjt:  PDGAMPRG-----AAAVIQRMQALSGMIGRKASVGGFLNPPSNPRAQPWTGDGNYQAGV---------------RSAFGNADTGGAWLSSAHATAGDKPE

Query:  EGEDDVKSSCPLCPGRHTCYNGRDKGS--------------HTAVNSFSGLVHTARHTMGAGHARSRYLNRKEGDAEGRASDWSEVVTRRIDGAIQVRSN
        EGEDDVKSSCPLCPGRHTCYNGRDKGS              HTAVNSF GLVHTARHTMGAGHARSRYLNRKEGDAEGRASDWSEVVTRRIDGAIQVRSN
Subjt:  EGEDDVKSSCPLCPGRHTCYNGRDKGS--------------HTAVNSFSGLVHTARHTMGAGHARSRYLNRKEGDAEGRASDWSEVVTRRIDGAIQVRSN

Query:  VDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLGL
        VDPTFYSLVGSGRSGGDH GSSLLEN YIPYQYMDSYLSSTGLG+
Subjt:  VDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLGL

RDY14309.1 putative protein ycf68, partial [Mucuna pruriens]3.8e-8156.61Show/hide
Query:  PDGAMPRG-------AAAVIQRMQALSGMIGRKASVGGFLNPPSNPRAQPWTGDGNYQAGVRS---------------AFGNADTGGAWLSSAHATAGDK
        PDGAMPRG       AAAV QRMQALSGMIGRKASVGGFL+ PSNPRAQPWTG GNYQAGVR+               AFGNADTGGAWLSSA ATAGDK
Subjt:  PDGAMPRG-------AAAVIQRMQALSGMIGRKASVGGFLNPPSNPRAQPWTGDGNYQAGVRS---------------AFGNADTGGAWLSSAHATAGDK

Query:  PEEGEDDVKSSCPLCPGRHTCYNGRDKGS-----------------HTAVNSFSGLVHTARHTMGAGHARSRYLNRKEGDAEGRASDWSEVVTRRIDGAI
        PEEGEDDVKSSCPLCPGRHTCYNGRDKGS                 HTAVNSF GLVHTARHTMGAGHARSRYLNRKEGDAEGRASDWSEVVT  +D ++
Subjt:  PEEGEDDVKSSCPLCPGRHTCYNGRDKGS-----------------HTAVNSFSGLVHTARHTMGAGHARSRYLNRKEGDAEGRASDWSEVVTRRIDGAI

Query:  QVRSNVDPTFYSLVGS---GRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLGLEKAAINRIFLILPSRKRKEELEILFPF---FRRDQEIGSSRLKK---
           S  DP  + + G+     S  D  G S        + +     +  G  +     +R F+   S   +  LE  F F    +R  E  ++ L +   
Subjt:  QVRSNVDPTFYSLVGS---GRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLGLEKAAINRIFLILPSRKRKEELEILFPF---FRRDQEIGSSRLKK---

Query:  -------------DLRVSRVGPGGFLNAFFFLLIGVISQRLAMVRKKG
                     DLRVSRV PGG LNAF FLLIGVISQRL MV+KKG
Subjt:  -------------DLRVSRVGPGGFLNAFFFLLIGVISQRLAMVRKKG

TrEMBL top hitse value%identityAlignment
A0A2N9GIA5 Uncharacterized protein ycf681.7e-12772.78Show/hide
Query:  PDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLNPPSNPRAQPWTGDGNYQAGV----------------------------------------------
        PDGAMPRG AAVIQRMQALSGMIGRKASVGGFL+PPSNPRAQPWTG GNYQAGV                                              
Subjt:  PDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLNPPSNPRAQPWTGDGNYQAGV----------------------------------------------

Query:  --------RSAFGNADTGGAWLSSAHATAGDKPEEGEDDVKSSCPLCPGRHTCYNGRDKGSHTAVNSFSGLVHTARHTMGAGHARSRYLNRKEGDAEGRA
                R AFGNADTGGAWLSSA ATAGDKPEEGEDDVKSSCPLCPGRHTCYNGRDKG HTAVNSF GLVHTARHTMGAGHARSRYLN KEGDAEGRA
Subjt:  --------RSAFGNADTGGAWLSSAHATAGDKPEEGEDDVKSSCPLCPGRHTCYNGRDKGSHTAVNSFSGLVHTARHTMGAGHARSRYLNRKEGDAEGRA

Query:  SDWSEVVTRRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLGLEKAAINRIFLILPSRKRKEELEILFPFFRRDQEIG
        SDWSEVVTRRIDGAIQVRSNVDPTFYS+VGSGRSGGDH GSSLLENPYIPYQYMDSYLSSTGLGLEKAAINRIFLILP   RKEE+EILFP FRRDQEIG
Subjt:  SDWSEVVTRRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLGLEKAAINRIFLILPSRKRKEELEILFPFFRRDQEIG

Query:  SSRLK---------------KDLR---VSRVGPGGFLNAFFFLLIGVISQRLAMVRKKGG
        SS  K               + LR   +S +G  G LNAFFFLLIGVISQRLAMVRKK G
Subjt:  SSRLK---------------KDLR---VSRVGPGGFLNAFFFLLIGVISQRLAMVRKKGG

A0A2N9HJI4 Uncharacterized protein ycf685.5e-8664.81Show/hide
Query:  PDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLNPPSNPRAQPWTGDGNYQAG-----------------------------------------------
        PDGAMPRG AAVIQRMQALSGMIGRKASVGGFL+PPSNPRAQPWTG GNYQAG                                               
Subjt:  PDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLNPPSNPRAQPWTGDGNYQAG-----------------------------------------------

Query:  -----------------VRSAFGNADTGGAWLSSAHATAGDKPEEGEDDVKSSCPLCPGRHTCYNGRDKGSHTAVNS---------FSGLVHTARHTMGA
                          R AFGNADTGGAWLSSA ATAGDKPEEGEDDVKSSCPLCPGRHT      K     +           F GLVHTARHTMGA
Subjt:  -----------------VRSAFGNADTGGAWLSSAHATAGDKPEEGEDDVKSSCPLCPGRHTCYNGRDKGSHTAVNS---------FSGLVHTARHTMGA

Query:  GHARSRYLNRKEGDAEGRASDWSEVV----TRRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLG
        GHARSRYLN KEGDAEGRASDWSEVV    TRRIDGAIQVRSNVDPTFYS+VGSGRSGGDH GSSLLENPYIPYQYMDSYLSSTGLG
Subjt:  GHARSRYLNRKEGDAEGRASDWSEVV----TRRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLG

A0A2N9HP93 Uncharacterized protein ycf681.9e-11861.36Show/hide
Query:  PDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLNPPSNPRAQPWTGDGNYQAGV----------------------------------------------
        PDGAMPRG AAVIQRMQALSGMIGRKASVGGFL+PPSNPRAQPWTG GNYQAGV                                              
Subjt:  PDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLNPPSNPRAQPWTGDGNYQAGV----------------------------------------------

Query:  --------RSAFGNADTGGAWLSSAHATAGDKPEEGEDDVKSSCPLCPGRHTCYNGRDKGSHTAVNSFSGLVHTARHTMGAGHARSRYLNRKEGDAEGRA
                R AFGNADTGGAWLSSA ATAGDKPEEGEDDVKSSCPLCPGRHTCYNGRDKG HTAVNSF GLVHTARHTMGAGHARSRYLN KEGDAEGRA
Subjt:  --------RSAFGNADTGGAWLSSAHATAGDKPEEGEDDVKSSCPLCPGRHTCYNGRDKGSHTAVNSFSGLVHTARHTMGAGHARSRYLNRKEGDAEGRA

Query:  SDWSEVVTR-------------------------------------------------------------------RIDGAIQVRSNVDPTFYSLVGSGR
        SDWSEVVTR                                                                   RIDGAIQVRSNVDPTFYS+VGSGR
Subjt:  SDWSEVVTR-------------------------------------------------------------------RIDGAIQVRSNVDPTFYSLVGSGR

Query:  SGGDHLGSSLLENPYIPYQYMDSYLSSTGLGLEKAAINRIFLILPSRKRKEELEILFPFFRRDQEIGSSRLK---------------KDLR---VSRVGP
        SGGDH GSSLLENPYIPYQYMDSYLSSTGLGLEKAAINRIFLILP   RKEE+EILFP FRRDQEIGSS  K               + LR   +S +G 
Subjt:  SGGDHLGSSLLENPYIPYQYMDSYLSSTGLGLEKAAINRIFLILPSRKRKEELEILFPFFRRDQEIGSSRLK---------------KDLR---VSRVGP

Query:  GGFLNAFFFLLIGVISQRLAMVRKKGG
         G LNAFFFLLIGVISQRLAMVRKK G
Subjt:  GGFLNAFFFLLIGVISQRLAMVRKKGG

A0A2Z7D0R2 Uncharacterized protein ycf684.2e-9475.46Show/hide
Query:  RIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLGLEKAAINRIFLILPSRKRKEELEILFPFFRRDQEIGSSRLKKDLR
        RIDGAI+VRSNVDPTF SLVGSGRSGGDH GSSLLENPYIPYQ M+SYLSST  GLEKAAINRI LILPS  RK E EILFP FRRDQEIGSSRLKKDLR
Subjt:  RIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLGLEKAAINRIFLILPSRKRKEELEILFPFFRRDQEIGSSRLKKDLR

Query:  VSRVGPGGFLNAFFFLLIGVISQRLAMVRKKGGTSTLGERSTTESCAMIYFTSEVSGSSPGWPSYTKEKNKRIEEASDSFMQAPLGSGGYSSVGRAPLLQ
        VSRVGPGG LNAFFFLLIGVISQ LA VRKKGGTSTLGERSTTE C +   +     ++PG       K  RIEEASD FM APLGSGGYSSVGRAPLLQ
Subjt:  VSRVGPGGFLNAFFFLLIGVISQRLAMVRKKGGTSTLGERSTTESCAMIYFTSEVSGSSPGWPSYTKEKNKRIEEASDSFMQAPLGSGGYSSVGRAPLLQ

Query:  LGRCDYGLDDGQLVRSSMDRTWTVVGVGGFLRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGK
        LGRCDYG  +        D T T +G       PSS IPGEEDQVGPCEQLDALSPFNPLSEMRQK+ K
Subjt:  LGRCDYGLDDGQLVRSSMDRTWTVVGVGGFLRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMRQKEGK

A0A4P1QSZ5 Uncharacterized protein ycf683.2e-10281.22Show/hide
Query:  PDGAMPRG-----AAAVIQRMQALSGMIGRKASVGGFLNPPSNPRAQPWTGDGNYQAGV---------------RSAFGNADTGGAWLSSAHATAGDKPE
        PDGAMPRG     AA V QRMQALSGMIGRKAS+GGFL+ PSNPRAQPWTG GNYQAGV               R AFGNADTGGAWLSSA ATAGDKPE
Subjt:  PDGAMPRG-----AAAVIQRMQALSGMIGRKASVGGFLNPPSNPRAQPWTGDGNYQAGV---------------RSAFGNADTGGAWLSSAHATAGDKPE

Query:  EGEDDVKSSCPLCPGRHTCYNGRDKGS--------------HTAVNSFSGLVHTARHTMGAGHARSRYLNRKEGDAEGRASDWSEVVTRRIDGAIQVRSN
        EGEDDVKSSCPLCPGRHTCYNGRDKGS              HTAVNSF GLVHTARHTMGAGHARSRYLNRKEGDAEGRASDWSEVVTRRIDGAIQVRSN
Subjt:  EGEDDVKSSCPLCPGRHTCYNGRDKGS--------------HTAVNSFSGLVHTARHTMGAGHARSRYLNRKEGDAEGRASDWSEVVTRRIDGAIQVRSN

Query:  VDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLGL
        VDPTFYSLVGSGRSGGDH GSSLLEN YIPYQYMDSYLSSTGLG+
Subjt:  VDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLGL

SwissProt top hitse value%identityAlignment
P03938 Uncharacterized protein ycf681.9e-0678.79Show/hide
Query:  SEVVTRRIDGAIQVRSNVDPTFYSLVGSGRSGG
        ++++ RRIDGAIQVRS+VD TFYSLVGSGRSGG
Subjt:  SEVVTRRIDGAIQVRSNVDPTFYSLVGSGRSGG

P52807 Uncharacterized protein ycf684.4e-1670.97Show/hide
Query:  SEVVTRRIDGAIQVRSNVDPTFYSLVGSGRSGGD---HLGSSLLENPYIPYQYMDSYLSSTG
        +E++ RRIDGAIQVRSN DPTFYS V  G  G     HLGSSLLENPYIPYQ MD YLS TG
Subjt:  SEVVTRRIDGAIQVRSNVDPTFYSLVGSGRSGGD---HLGSSLLENPYIPYQYMDSYLSSTG

Q49KT9 Uncharacterized protein ycf682.4e-2294.64Show/hide
Query:  RRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLG
        RRIDGAIQVRSNVDPTFYSLVGSGRSGGD  GSSLLENPYIPYQ MDSYLSSTGLG
Subjt:  RRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLG

Q6L3C9 Uncharacterized protein ycf681.9e-0678.79Show/hide
Query:  SEVVTRRIDGAIQVRSNVDPTFYSLVGSGRSGG
        ++++ RRIDGAIQVRS+VD TFYSLVGSGRSGG
Subjt:  SEVVTRRIDGAIQVRSNVDPTFYSLVGSGRSGG

Q85WV9 Uncharacterized protein ycf682.6e-1670.97Show/hide
Query:  SEVVTRRIDGAIQVRSNVDPTFYSLVGSGRSGGD---HLGSSLLENPYIPYQYMDSYLSSTG
        +E++ RRIDGAIQVRSN DPTFYS V  G  GG    H GSSLLENPYIPYQ MD YLS TG
Subjt:  SEVVTRRIDGAIQVRSNVDPTFYSLVGSGRSGGD---HLGSSLLENPYIPYQYMDSYLSSTG

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTGACGGAGCAATGCCGCGTGGAGCAGCCGCAGTAATACAGAGGATGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCTGTAGGTGGCTTTTTAAAT
CCGCCGTCAAATCCCAGGGCTCAACCCTGGACAGGCGATGGAAACTACCAAGCTGGAGTACGGAGTGCCTTCGGGAACGCGGACACAGGTGGTGCATGGCTGTCG
TCAGCTCATGCCACTGCTGGTGATAAGCCGGAGGAAGGTGAGGATGACGTCAAGTCATCATGCCCCTTATGCCCTGGGCGACACACGTGCTACAATGGCCGGGAC
AAAGGGTCCCATACGGCGGTGAATTCGTTCTCGGGCCTTGTACACACCGCCCGTCACACTATGGGAGCTGGCCATGCCCGAAGTCGTTACCTTAACCGCAAGGAG
GGGGATGCCGAAGGCAGGGCTAGTGACTGGAGTGAAGTCGTAACAAGGAGGATAGATGGGGCGATTCAGGTGAGATCCAATGTAGATCCAACTTTCTATTCACTC
GTGGGATCCGGGCGGTCCGGGGGGGACCACCTCGGCTCCTCTCTTCTCGAGAATCCATACATCCCTTATCAGTATATGGACAGCTATCTCTCGAGCACAGGTTTA
GGTCTGGAGAAAGCTGCAATCAATAGGATTTTCTTAATCCTCCCTTCCCGAAAGCGAAAGGAAGAACTTGAAATTCTTTTTCCTTTCTTCCGCAGGGACCAGGAG
ATTGGATCTAGCCGTTTGAAAAAGGATCTTAGAGTGTCTAGGGTTGGACCAGGAGGGTTTCTTAACGCCTTCTTTTTTCTTCTCATCGGAGTTATTTCACAAAGA
CTTGCCATGGTAAGGAAGAAGGGGGGAACAAGCACACTTGGAGAGCGCAGTACAACGGAGAGTTGTGCGATGATTTACTTCACGAGCGAGGTCTCTGGTTCAAGT
CCAGGATGGCCCAGCTACACCAAGGAAAAGAATAAAAGAATAGAAGAAGCATCTGACTCCTTCATGCAGGCCCCACTTGGCTCGGGGGGATATAGCTCAGTTGGT
AGAGCTCCGCTCTTGCAATTGGGTCGTTGCGATTACGGGTTGGATGATGGGCAGTTGGTCAGATCTAGTATGGATCGTACATGGACGGTAGTTGGAGTCGGCGGC
TTTCTTAGGGTTCCCTCATCTGGGATCCCTGGGGAAGAGGATCAAGTTGGCCCTTGCGAACAGCTTGATGCACTATCTCCCTTCAACCCTTTGAGCGAAATGCGG
CAAAAGGAAGGAAAATCCATGGACCGACCCCATCGTCTCCACCCCGTAGGAACTACGAGATCACCCCAAGGATGCCTTCGGTATCCAGGGGTCGCGGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTGACGGAGCAATGCCGCGTGGAGCAGCCGCAGTAATACAGAGGATGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCTGTAGGTGGCTTTTTAAAT
CCGCCGTCAAATCCCAGGGCTCAACCCTGGACAGGCGATGGAAACTACCAAGCTGGAGTACGGAGTGCCTTCGGGAACGCGGACACAGGTGGTGCATGGCTGTCG
TCAGCTCATGCCACTGCTGGTGATAAGCCGGAGGAAGGTGAGGATGACGTCAAGTCATCATGCCCCTTATGCCCTGGGCGACACACGTGCTACAATGGCCGGGAC
AAAGGGTCCCATACGGCGGTGAATTCGTTCTCGGGCCTTGTACACACCGCCCGTCACACTATGGGAGCTGGCCATGCCCGAAGTCGTTACCTTAACCGCAAGGAG
GGGGATGCCGAAGGCAGGGCTAGTGACTGGAGTGAAGTCGTAACAAGGAGGATAGATGGGGCGATTCAGGTGAGATCCAATGTAGATCCAACTTTCTATTCACTC
GTGGGATCCGGGCGGTCCGGGGGGGACCACCTCGGCTCCTCTCTTCTCGAGAATCCATACATCCCTTATCAGTATATGGACAGCTATCTCTCGAGCACAGGTTTA
GGTCTGGAGAAAGCTGCAATCAATAGGATTTTCTTAATCCTCCCTTCCCGAAAGCGAAAGGAAGAACTTGAAATTCTTTTTCCTTTCTTCCGCAGGGACCAGGAG
ATTGGATCTAGCCGTTTGAAAAAGGATCTTAGAGTGTCTAGGGTTGGACCAGGAGGGTTTCTTAACGCCTTCTTTTTTCTTCTCATCGGAGTTATTTCACAAAGA
CTTGCCATGGTAAGGAAGAAGGGGGGAACAAGCACACTTGGAGAGCGCAGTACAACGGAGAGTTGTGCGATGATTTACTTCACGAGCGAGGTCTCTGGTTCAAGT
CCAGGATGGCCCAGCTACACCAAGGAAAAGAATAAAAGAATAGAAGAAGCATCTGACTCCTTCATGCAGGCCCCACTTGGCTCGGGGGGATATAGCTCAGTTGGT
AGAGCTCCGCTCTTGCAATTGGGTCGTTGCGATTACGGGTTGGATGATGGGCAGTTGGTCAGATCTAGTATGGATCGTACATGGACGGTAGTTGGAGTCGGCGGC
TTTCTTAGGGTTCCCTCATCTGGGATCCCTGGGGAAGAGGATCAAGTTGGCCCTTGCGAACAGCTTGATGCACTATCTCCCTTCAACCCTTTGAGCGAAATGCGG
CAAAAGGAAGGAAAATCCATGGACCGACCCCATCGTCTCCACCCCGTAGGAACTACGAGATCACCCCAAGGATGCCTTCGGTATCCAGGGGTCGCGGACTGA
Protein sequenceShow/hide protein sequence
MPDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLNPPSNPRAQPWTGDGNYQAGVRSAFGNADTGGAWLSSAHATAGDKPEEGEDDVKSSCPLCPGRHTCYNGRD
KGSHTAVNSFSGLVHTARHTMGAGHARSRYLNRKEGDAEGRASDWSEVVTRRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGL
GLEKAAINRIFLILPSRKRKEELEILFPFFRRDQEIGSSRLKKDLRVSRVGPGGFLNAFFFLLIGVISQRLAMVRKKGGTSTLGERSTTESCAMIYFTSEVSGSS
PGWPSYTKEKNKRIEEASDSFMQAPLGSGGYSSVGRAPLLQLGRCDYGLDDGQLVRSSMDRTWTVVGVGGFLRVPSSGIPGEEDQVGPCEQLDALSPFNPLSEMR
QKEGKSMDRPHRLHPVGTTRSPQGCLRYPGVAD