; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg022904 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg022904
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionAutophagy-related protein
Genome locationscaffold64:84196..88557
RNA-Seq ExpressionSpg022904
SyntenySpg022904
Gene Ontology termsGO:0006914 - autophagy (biological process)
GO:0009767 - photosynthetic electron transport chain (biological process)
GO:0018298 - protein-chromophore linkage (biological process)
GO:0050821 - protein stabilization (biological process)
GO:0009523 - photosystem II (cellular component)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0016168 - chlorophyll binding (molecular function)
GO:0042301 - phosphate ion binding (molecular function)
InterPro domainsIPR022546 - Uncharacterised protein family Ycf68


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAD3336706.1 hypothetical protein E3N88_32225 [Mikania micrantha]2.8e-7089.47Show/hide
Query:  ATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGA
        ATLGLR GPDSYGRQQWG FRNGRKPDGAMP G AAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQ W GGGNYQAGVRGANGIR P+SPS KRWILG 
Subjt:  ATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGA

Query:  VRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARA
        VRIDPCS VANALSIPP ET  GLDMPRILLKERG+FGNADTGGAWLSSA A
Subjt:  VRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARA

KAD5317821.1 hypothetical protein E3N88_17767 [Mikania micrantha]4.7e-7395.3Show/hide
Query:  GLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRI
        G RHGPDSYGRQQWGIFRNGRKPDGAMPRG AAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQ WTGGGNYQAGVRGANGIR PS+PSRKRWILGAVRI
Subjt:  GLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRI

Query:  DPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARA
        DPCSAVANALSI PGETLPGLDMPRILLKERGAF NADTGGAWLSSARA
Subjt:  DPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARA

KAF1876761.1 hypothetical protein Lal_00044212 [Lupinus albus]1.3e-6285.33Show/hide
Query:  MQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNA
        MQALSGMIGRKASVGGFLS PSNPRAQPWTGGGNYQAGV GANGIRYPSSPSRKRWILG VRIDPC+AVANALSIPPGETLPGLDMPRILLKERGAFGNA
Subjt:  MQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNA

Query:  DTGGAWLSSARAVRLPVISRRKVRMTSSHHAPYALGDTRATMAGTKGRDP
        DTGGAWLSSARA         +  + SS   PYALGDTRATMAGTK RDP
Subjt:  DTGGAWLSSARAVRLPVISRRKVRMTSSHHAPYALGDTRATMAGTKGRDP

KAF1876869.1 hypothetical protein Lal_00033161 [Lupinus albus]2.2e-8374.36Show/hide
Query:  MQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNA
        MQALSGMIGRKASVGGFLS PSNPRAQPWTGGGNYQAGV GANGIRYPSSPSRKRWILG VRIDPC+AVANALSIPPGETLPGLDMPRILLKERGAFGNA
Subjt:  MQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNA

Query:  DTGGAWLSSARAVR--LPVISRRKVR----MTSSHHAPYALGD--TRATMAGTKGRDPARHGVLLLFEPGFETKLLL--RRIDGAIQVRSNVDPTFYSLV
        DTGGAWLSSARA     P      V+    +    H  Y   D  +R+  AG  GR PAR  +  + E   +  +++   RIDGAIQVRSNVDPTFYSLV
Subjt:  DTGGAWLSSARAVR--LPVISRRKVR----MTSSHHAPYALGD--TRATMAGTKGRDPARHGVLLLFEPGFETKLLL--RRIDGAIQVRSNVDPTFYSLV

Query:  GSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGL
        GSGRSGGDH GSSLLEN YIPYQYMDSYLSSTGL
Subjt:  GSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGL

OIV94276.1 hypothetical protein TanjilG_00025 [Lupinus angustifolius]9.8e-7151.27Show/hide
Query:  MKDNSELAL-MNAGGMLNTCKS---------DGKWCFQWRTAT-------LGLRHGPDSYGRQQWGIFRNGRKPDGAMPRG-----AAAVIQRMQALSGM
        MK+NSE AL  N  G+ N  ++         D +W               L   HGPDSYGRQQWGIFRNGRKPDGAMPRG     AA V QRMQALSGM
Subjt:  MKDNSELAL-MNAGGMLNTCKS---------DGKWCFQWRTAT-------LGLRHGPDSYGRQQWGIFRNGRKPDGAMPRG-----AAAVIQRMQALSGM

Query:  IGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWL
        IGRKAS+GGFLS PSNPRAQPWTGGGNYQAGV                                        TLPGLDMPRILLKERGAFGNADTGGAWL
Subjt:  IGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWL

Query:  SSARAV--------------RLPVISRRK-----------------VRMTSSHH-APYALGDTRATMAGTKGRDPARHGVLLLFEPGFE------TKLLL
        SSARA                 P+   R                     T  HH A  +      T   T G   AR   L   E   E      ++++ 
Subjt:  SSARAV--------------RLPVISRRK-----------------VRMTSSHH-APYALGDTRATMAGTKGRDPARHGVLLLFEPGFE------TKLLL

Query:  RRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGL
        RRIDGAIQVRSNVDPTFYSLVGSGRSGGDH GSSLLEN YIPYQYMDSYLSSTGL
Subjt:  RRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGL

TrEMBL top hitse value%identityAlignment
A0A2N9G477 Uncharacterized protein ycf688.6e-9763.34Show/hide
Query:  TATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILG
        T+TLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRG AAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILG
Subjt:  TATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILG

Query:  AVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARAVR--LPVISRRKVR----MTSSHHAPYALGDTRA-TMAGTKGRDPAR
        AVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARA     P      V+    +    H  Y   D  + +  G  G  P +
Subjt:  AVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARAVR--LPVISRRKVR----MTSSHHAPYALGDTRA-TMAGTKGRDPAR

Query:  HGVLLLFEPGFE------------TKL----LLRRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLL-ENPYIPYQYMDSYLSSTGLEKAAINRIFLI
          + L+   G               KL    LL  +DG ++ +   + T +      R        S   +   +P++           EKAAINRIFLI
Subjt:  HGVLLLFEPGFE------------TKL----LLRRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLL-ENPYIPYQYMDSYLSSTGLEKAAINRIFLI

Query:  LPSRKRKEEREILFPFFRRDQEIGSSRKKNVWLINNSLLGR
        LP   RKEE EILFP FRRDQEIGSS KKN WLINNSLLGR
Subjt:  LPSRKRKEEREILFPFFRRDQEIGSSRKKNVWLINNSLLGR

A0A2N9GIA5 Uncharacterized protein ycf682.4e-13970.9Show/hide
Query:  MKDNSELALMNAGGMLNTCKSDG-KWCFQWR-------------TATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVG
        MKDNSE AL      ++  ++ G K C   +               TLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRG AAVIQRMQALSGMIGRKASVG
Subjt:  MKDNSELALMNAGGMLNTCKSDG-KWCFQWR-------------TATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVG

Query:  GFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARAV--
        GFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARA   
Subjt:  GFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARAV--

Query:  ------------RLPVISRRKV---RMTSSHHAPYALGDTRATMAGTKGRDPARHGVLLLFEPGFE------TKLLLRRIDGAIQVRSNVDPTFYSLVGS
                      P+   R          H A  +      T   T G   AR   L   E   E      ++++ RRIDGAIQVRSNVDPTFYS+VGS
Subjt:  ------------RLPVISRRKV---RMTSSHHAPYALGDTRATMAGTKGRDPARHGVLLLFEPGFE------TKLLLRRIDGAIQVRSNVDPTFYSLVGS

Query:  GRSGGDHLGSSLLENPYIPYQYMDSYLSST--GLEKAAINRIFLILPSRKRKEEREILFPFFRRDQEIGSSRKKNVWLINNSLLGRRPPQSLRTPPISAM
        GRSGGDH GSSLLENPYIPYQYMDSYLSST  GLEKAAINRIFLILP   RKEE EILFP FRRDQEIGSS KKN WLINNSLLG RPPQSLR PPISAM
Subjt:  GRSGGDHLGSSLLENPYIPYQYMDSYLSST--GLEKAAINRIFLILPSRKRKEEREILFPFFRRDQEIGSSRKKNVWLINNSLLGRRPPQSLRTPPISAM

Query:  GC
        GC
Subjt:  GC

A0A2N9HJI4 Uncharacterized protein ycf686.1e-11966.49Show/hide
Query:  MKDNSELAL------------------------MNAGGMLNTCKSDGKWCFQWRTATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALS
        MKDNSE AL                        MNAGGMLNTCKSDGKWCFQWRTATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRG AAVIQRMQALS
Subjt:  MKDNSELAL------------------------MNAGGMLNTCKSDGKWCFQWRTATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALS

Query:  GMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAG----------VRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERG
        GMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAG           RGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERG
Subjt:  GMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAG----------VRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERG

Query:  AFGNADTGGAWLSSARAV--------------RLPVISRRK------VRMTSSHHAPYALGDTRATMAG-------TKGRDPARHGVLLLFEPGFE----
        AFGNADTGGAWLSSARA                 P+   R       ++  S   A    G       G       T G   AR   L   E   E    
Subjt:  AFGNADTGGAWLSSARAV--------------RLPVISRRK------VRMTSSHHAPYALGDTRATMAG-------TKGRDPARHGVLLLFEPGFE----

Query:  ------TKLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLEKAAINR
              T    RRIDGAIQVRSNVDPTFYS+VGSGRSGGDH GSSLLENPYIPYQYMDSYLSSTGL  A++ +
Subjt:  ------TKLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLEKAAINR

A0A2N9HP93 Uncharacterized protein ycf688.2e-14866.67Show/hide
Query:  LALMNAGGMLNTCKSDGKWCFQWRTATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNY
        L+ MNAGGMLNTCKSDGKWCFQWRTATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRG AAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNY
Subjt:  LALMNAGGMLNTCKSDGKWCFQWRTATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNY

Query:  QAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARAV---------------------R
        QAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARA                      R
Subjt:  QAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARAV---------------------R

Query:  LPVISRRKVRMTSSHHAPYALGDTRATM--------------------------------------------------------------------AGTK
            + R    T+ +  P  +   R TM                                                                    AG  
Subjt:  LPVISRRKVRMTSSHHAPYALGDTRATM--------------------------------------------------------------------AGTK

Query:  GRDPARHGVLLLFEPGFETKLLL--RRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSST--GLEKAAINRIFLILPSRKRK
        GR PAR  +  + E   +  +++   RIDGAIQVRSNVDPTFYS+VGSGRSGGDH GSSLLENPYIPYQYMDSYLSST  GLEKAAINRIFLILP   RK
Subjt:  GRDPARHGVLLLFEPGFETKLLL--RRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSST--GLEKAAINRIFLILPSRKRK

Query:  EEREILFPFFRRDQEIGSSRKKNVWLINNSLLGRRPPQSLRTPPISAMGC
        EE EILFP FRRDQEIGSS KKN WLINNSLLG RPPQSLR PPISAMGC
Subjt:  EEREILFPFFRRDQEIGSSRKKNVWLINNSLLGRRPPQSLRTPPISAMGC

A0A6A5MVI7 Uncharacterized protein ycf681.1e-8374.36Show/hide
Query:  MQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNA
        MQALSGMIGRKASVGGFLS PSNPRAQPWTGGGNYQAGV GANGIRYPSSPSRKRWILG VRIDPC+AVANALSIPPGETLPGLDMPRILLKERGAFGNA
Subjt:  MQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNA

Query:  DTGGAWLSSARAVR--LPVISRRKVR----MTSSHHAPYALGD--TRATMAGTKGRDPARHGVLLLFEPGFETKLLL--RRIDGAIQVRSNVDPTFYSLV
        DTGGAWLSSARA     P      V+    +    H  Y   D  +R+  AG  GR PAR  +  + E   +  +++   RIDGAIQVRSNVDPTFYSLV
Subjt:  DTGGAWLSSARAVR--LPVISRRKVR----MTSSHHAPYALGD--TRATMAGTKGRDPARHGVLLLFEPGFETKLLL--RRIDGAIQVRSNVDPTFYSLV

Query:  GSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGL
        GSGRSGGDH GSSLLEN YIPYQYMDSYLSSTGL
Subjt:  GSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGL

SwissProt top hitse value%identityAlignment
P03938 Uncharacterized protein ycf682.5e-0893.75Show/hide
Query:  KLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGG
        KLLLRRIDGAIQVRS+VD TFYSLVGSGRSGG
Subjt:  KLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGG

P52807 Uncharacterized protein ycf686.2e-2073.91Show/hide
Query:  LFEPGFETKLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGGD---HLGSSLLENPYIPYQYMDSYLSSTG
        L EPGF T+L+LRRIDGAIQVRSN DPTFYS V  G  G     HLGSSLLENPYIPYQ MD YLS TG
Subjt:  LFEPGFETKLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGGD---HLGSSLLENPYIPYQYMDSYLSSTG

Q49KT9 Uncharacterized protein ycf682.0e-2685.71Show/hide
Query:  GFETKLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLEKAAINR
        GFETK LLRRIDGAIQVRSNVDPTFYSLVGSGRSGGD  GSSLLENPYIPYQ MDSYLSSTGL  A++ +
Subjt:  GFETKLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLEKAAINR

Q6L3C9 Uncharacterized protein ycf682.5e-0893.75Show/hide
Query:  KLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGG
        KLLLRRIDGAIQVRS+VD TFYSLVGSGRSGG
Subjt:  KLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGG

Q85WV9 Uncharacterized protein ycf684.3e-2175.36Show/hide
Query:  LFEPGFETKLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGGD---HLGSSLLENPYIPYQYMDSYLSSTG
        LFEPGF T+L+LRRIDGAIQVRSN DPTFYS V  G  GG    H GSSLLENPYIPYQ MD YLS TG
Subjt:  LFEPGFETKLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGGD---HLGSSLLENPYIPYQYMDSYLSSTG

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGACAATTCCGAATTAGCTTTGATGAACGCTGGCGGCATGCTTAACACATGCAAGTCGGACGGGAAGTGGTGTTTCCAGTGGCGGACGGCCACACTGGGACTGAG
ACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAATGCCGCGTGGAGCAGCCGCGGTAATACAGAGGATGCAAG
CGTTATCCGGAATGATTGGGCGTAAAGCGTCTGTAGGTGGCTTTTTAAGTCCGCCGTCAAATCCCAGGGCTCAACCCTGGACAGGCGGTGGAAACTACCAAGCTGGAGTA
CGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGCGCTGTGCGTATCGACCCGTGCAGTGCTGTAGCTAACGCGTTAAGTATCCC
GCCTGGGGAAACCTTACCAGGGCTTGACATGCCGCGAATCCTCTTGAAAGAGAGGGGTGCCTTCGGGAACGCGGACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGCCG
TAAGACTGCCGGTGATAAGCCGGAGGAAGGTGAGGATGACGTCAAGTCATCATGCCCCTTATGCCCTGGGCGACACACGTGCTACAATGGCCGGGACAAAGGGTCGTGAT
CCCGCGAGGCATGGCGTACTTCTCCTGTTCGAACCGGGGTTTGAAACCAAACTTCTCCTCAGGAGGATAGATGGGGCGATTCAGGTGAGATCCAATGTAGATCCAACTTT
CTATTCACTCGTGGGATCCGGGCGGTCCGGGGGGGACCACCTCGGCTCCTCTCTTCTCGAGAATCCATACATCCCTTATCAGTATATGGACAGCTATCTCTCGAGCACAG
GTCTGGAGAAAGCTGCAATCAATAGGATTTTCTTAATCCTCCCTTCCCGAAAGCGAAAGGAAGAACGTGAAATTCTTTTTCCTTTCTTCCGCAGGGACCAGGAGATTGGA
TCTAGCCGTAAGAAGAATGTTTGGCTGATAAATAACTCACTTCTTGGTCGTCGACCCCCTCAGTCACTACGAACACCCCCGATCAGTGCAATGGGATGTGCGATGATTTA
CTTCACGGGCGAGGTCTCTGGTTCAAGTCCAGGATGGCCCAGCTACGCCAAGGAAAAGAATAAAAGAATAGAAGAAGCATCTGACTCCTTCATGCAGGCCCCACTTGGCT
CGGGGGGATATAGCTCAGTTGGTAGAGCTCCGCTCTTGCAATTGGGTCGTTGCGATTACGGGTTGGATGTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGACAATTCCGAATTAGCTTTGATGAACGCTGGCGGCATGCTTAACACATGCAAGTCGGACGGGAAGTGGTGTTTCCAGTGGCGGACGGCCACACTGGGACTGAG
ACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAATGCCGCGTGGAGCAGCCGCGGTAATACAGAGGATGCAAG
CGTTATCCGGAATGATTGGGCGTAAAGCGTCTGTAGGTGGCTTTTTAAGTCCGCCGTCAAATCCCAGGGCTCAACCCTGGACAGGCGGTGGAAACTACCAAGCTGGAGTA
CGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGCGCTGTGCGTATCGACCCGTGCAGTGCTGTAGCTAACGCGTTAAGTATCCC
GCCTGGGGAAACCTTACCAGGGCTTGACATGCCGCGAATCCTCTTGAAAGAGAGGGGTGCCTTCGGGAACGCGGACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGCCG
TAAGACTGCCGGTGATAAGCCGGAGGAAGGTGAGGATGACGTCAAGTCATCATGCCCCTTATGCCCTGGGCGACACACGTGCTACAATGGCCGGGACAAAGGGTCGTGAT
CCCGCGAGGCATGGCGTACTTCTCCTGTTCGAACCGGGGTTTGAAACCAAACTTCTCCTCAGGAGGATAGATGGGGCGATTCAGGTGAGATCCAATGTAGATCCAACTTT
CTATTCACTCGTGGGATCCGGGCGGTCCGGGGGGGACCACCTCGGCTCCTCTCTTCTCGAGAATCCATACATCCCTTATCAGTATATGGACAGCTATCTCTCGAGCACAG
GTCTGGAGAAAGCTGCAATCAATAGGATTTTCTTAATCCTCCCTTCCCGAAAGCGAAAGGAAGAACGTGAAATTCTTTTTCCTTTCTTCCGCAGGGACCAGGAGATTGGA
TCTAGCCGTAAGAAGAATGTTTGGCTGATAAATAACTCACTTCTTGGTCGTCGACCCCCTCAGTCACTACGAACACCCCCGATCAGTGCAATGGGATGTGCGATGATTTA
CTTCACGGGCGAGGTCTCTGGTTCAAGTCCAGGATGGCCCAGCTACGCCAAGGAAAAGAATAAAAGAATAGAAGAAGCATCTGACTCCTTCATGCAGGCCCCACTTGGCT
CGGGGGGATATAGCTCAGTTGGTAGAGCTCCGCTCTTGCAATTGGGTCGTTGCGATTACGGGTTGGATGTCTAA
Protein sequenceShow/hide protein sequence
MKDNSELALMNAGGMLNTCKSDGKWCFQWRTATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGV
RGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARAVRLPVISRRKVRMTSSHHAPYALGDTRATMAGTKGRD
PARHGVLLLFEPGFETKLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLEKAAINRIFLILPSRKRKEEREILFPFFRRDQEIG
SSRKKNVWLINNSLLGRRPPQSLRTPPISAMGCAMIYFTGEVSGSSPGWPSYAKEKNKRIEEASDSFMQAPLGSGGYSSVGRAPLLQLGRCDYGLDV