; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020289 (gene) of Snake gourd v1 genome

Gene IDTan0020289
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionAutophagy-related protein
Genome locationContig00034:125108..128148
RNA-Seq ExpressionTan0020289
SyntenyTan0020289
Gene Ontology termsGO:0006914 - autophagy (biological process)
GO:0015979 - photosynthesis (biological process)
GO:0050821 - protein stabilization (biological process)
GO:0009523 - photosystem II (cellular component)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0042301 - phosphate ion binding (molecular function)
InterPro domainsIPR022546 - Uncharacterised protein family Ycf68


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAD3336706.1 hypothetical protein E3N88_32225 [Mikania micrantha]2.4e-7089.47Show/hide
Query:  ATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGA
        ATLGLR GPDSYGRQQWG FRNGRKPDGAMP G AAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQ W GGGNYQAGVRGANGIR P+SPS KRWILG 
Subjt:  ATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGA

Query:  VRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARA
        VRIDPCS VANALSIPP ET  GLDMPRILLKERG+FGNADTGGAWLSSA A
Subjt:  VRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARA

KAD5317821.1 hypothetical protein E3N88_17767 [Mikania micrantha]4.0e-7395.3Show/hide
Query:  GLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRI
        G RHGPDSYGRQQWGIFRNGRKPDGAMPRG AAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQ WTGGGNYQAGVRGANGIR PS+PSRKRWILGAVRI
Subjt:  GLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRI

Query:  DPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARA
        DPCSAVANALSI PGETLPGLDMPRILLKERGAF NADTGGAWLSSARA
Subjt:  DPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARA

KAF1876761.1 hypothetical protein Lal_00044212 [Lupinus albus]1.1e-6285.33Show/hide
Query:  MQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNA
        MQALSGMIGRKASVGGFLS PSNPRAQPWTGGGNYQAGV GANGIRYPSSPSRKRWILG VRIDPC+AVANALSIPPGETLPGLDMPRILLKERGAFGNA
Subjt:  MQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNA

Query:  DTGGAWLSSARAVRLPVISRRKVRMTSSHHAPYALGDTRATMAGTKGRDP
        DTGGAWLSSARA         +  + SS   PYALGDTRATMAGTK RDP
Subjt:  DTGGAWLSSARAVRLPVISRRKVRMTSSHHAPYALGDTRATMAGTKGRDP

KAF1876869.1 hypothetical protein Lal_00033161 [Lupinus albus]1.9e-8374.36Show/hide
Query:  MQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNA
        MQALSGMIGRKASVGGFLS PSNPRAQPWTGGGNYQAGV GANGIRYPSSPSRKRWILG VRIDPC+AVANALSIPPGETLPGLDMPRILLKERGAFGNA
Subjt:  MQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNA

Query:  DTGGAWLSSARAVR--LPVISRRKVR----MTSSHHAPYALGD--TRATMAGTKGRDPARHGVLLLFEPGFETKLLL--RRIDGAIQVRSNVDPTFYSLV
        DTGGAWLSSARA     P      V+    +    H  Y   D  +R+  AG  GR PAR  +  + E   +  +++   RIDGAIQVRSNVDPTFYSLV
Subjt:  DTGGAWLSSARAVR--LPVISRRKVR----MTSSHHAPYALGD--TRATMAGTKGRDPARHGVLLLFEPGFETKLLL--RRIDGAIQVRSNVDPTFYSLV

Query:  GSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGL
        GSGRSGGDH GSSLLEN YIPYQYMDSYLSSTGL
Subjt:  GSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGL

OIV94276.1 hypothetical protein TanjilG_00025 [Lupinus angustifolius]8.4e-7151.27Show/hide
Query:  MKDNSELAL-MNAGGMLNTCKS---------DGKWCFQWRTAT-------LGLRHGPDSYGRQQWGIFRNGRKPDGAMPRG-----AAAVIQRMQALSGM
        MK+NSE AL  N  G+ N  ++         D +W               L   HGPDSYGRQQWGIFRNGRKPDGAMPRG     AA V QRMQALSGM
Subjt:  MKDNSELAL-MNAGGMLNTCKS---------DGKWCFQWRTAT-------LGLRHGPDSYGRQQWGIFRNGRKPDGAMPRG-----AAAVIQRMQALSGM

Query:  IGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWL
        IGRKAS+GGFLS PSNPRAQPWTGGGNYQAGV                                        TLPGLDMPRILLKERGAFGNADTGGAWL
Subjt:  IGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWL

Query:  SSARAV--------------RLPVISRRK-----------------VRMTSSHH-APYALGDTRATMAGTKGRDPARHGVLLLFEPGFE------TKLLL
        SSARA                 P+   R                     T  HH A  +      T   T G   AR   L   E   E      ++++ 
Subjt:  SSARAV--------------RLPVISRRK-----------------VRMTSSHH-APYALGDTRATMAGTKGRDPARHGVLLLFEPGFE------TKLLL

Query:  RRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGL
        RRIDGAIQVRSNVDPTFYSLVGSGRSGGDH GSSLLEN YIPYQYMDSYLSSTGL
Subjt:  RRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGL

TrEMBL top hitse value%identityAlignment
A0A2N9GIA5 Uncharacterized protein ycf685.4e-11668.47Show/hide
Query:  MKDNSELALMNAGGMLNTCKSDG-KWCFQWR-------------TATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVG
        MKDNSE AL      ++  ++ G K C   +               TLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRG AAVIQRMQALSGMIGRKASVG
Subjt:  MKDNSELALMNAGGMLNTCKSDG-KWCFQWR-------------TATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVG

Query:  GFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARAV--
        GFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARA   
Subjt:  GFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARAV--

Query:  ------------RLPVISRRKV---RMTSSHHAPYALGDTRATMAGTKGRDPARHGVLLLFEPGFE------TKLLLRRIDGAIQVRSNVDPTFYSLVGS
                      P+   R          H A  +      T   T G   AR   L   E   E      ++++ RRIDGAIQVRSNVDPTFYS+VGS
Subjt:  ------------RLPVISRRKV---RMTSSHHAPYALGDTRATMAGTKGRDPARHGVLLLFEPGFE------TKLLLRRIDGAIQVRSNVDPTFYSLVGS

Query:  GRSGGDHLGSSLLENPYIPYQYMDSYLSST--GLEKAAINRIFLILPSRKKK
        GRSGGDH GSSLLENPYIPYQYMDSYLSST  GLEKAAINRIFLILP RK++
Subjt:  GRSGGDHLGSSLLENPYIPYQYMDSYLSST--GLEKAAINRIFLILPSRKKK

A0A2N9HJI4 Uncharacterized protein ycf684.0e-11966.49Show/hide
Query:  MKDNSELAL------------------------MNAGGMLNTCKSDGKWCFQWRTATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALS
        MKDNSE AL                        MNAGGMLNTCKSDGKWCFQWRTATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRG AAVIQRMQALS
Subjt:  MKDNSELAL------------------------MNAGGMLNTCKSDGKWCFQWRTATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALS

Query:  GMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAG----------VRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERG
        GMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAG           RGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERG
Subjt:  GMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAG----------VRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERG

Query:  AFGNADTGGAWLSSARAV--------------RLPVISRRK------VRMTSSHHAPYALGDTRATMAG-------TKGRDPARHGVLLLFEPGFE----
        AFGNADTGGAWLSSARA                 P+   R       ++  S   A    G       G       T G   AR   L   E   E    
Subjt:  AFGNADTGGAWLSSARAV--------------RLPVISRRK------VRMTSSHHAPYALGDTRATMAG-------TKGRDPARHGVLLLFEPGFE----

Query:  ------TKLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLEKAAINR
              T    RRIDGAIQVRSNVDPTFYS+VGSGRSGGDH GSSLLENPYIPYQYMDSYLSSTGL  A++ +
Subjt:  ------TKLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLEKAAINR

A0A2N9HP93 Uncharacterized protein ycf681.9e-12464Show/hide
Query:  LALMNAGGMLNTCKSDGKWCFQWRTATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNY
        L+ MNAGGMLNTCKSDGKWCFQWRTATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRG AAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNY
Subjt:  LALMNAGGMLNTCKSDGKWCFQWRTATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNY

Query:  QAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARAV---------------------R
        QAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARA                      R
Subjt:  QAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARAV---------------------R

Query:  LPVISRRKVRMTSSHHAPYALGDTRATM--------------------------------------------------------------------AGTK
            + R    T+ +  P  +   R TM                                                                    AG  
Subjt:  LPVISRRKVRMTSSHHAPYALGDTRATM--------------------------------------------------------------------AGTK

Query:  GRDPARHGVLLLFEPGFETKLLL--RRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSST--GLEKAAINRIFLILPSRKKK
        GR PAR  +  + E   +  +++   RIDGAIQVRSNVDPTFYS+VGSGRSGGDH GSSLLENPYIPYQYMDSYLSST  GLEKAAINRIFLILP RK++
Subjt:  GRDPARHGVLLLFEPGFETKLLL--RRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSST--GLEKAAINRIFLILPSRKKK

A0A2N9ID32 Uncharacterized protein ycf683.3e-8187.01Show/hide
Query:  LALMNAGGMLNTCKSDGKWCFQWRTATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNY
        L+ MNAGGMLNTCKSDGKWCFQWRTATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRG AAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNY
Subjt:  LALMNAGGMLNTCKSDGKWCFQWRTATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNY

Query:  QAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARA
        QAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGE +P     +++         A TGGAWLSSARA
Subjt:  QAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARA

A0A6A5MVI7 Uncharacterized protein ycf689.3e-8474.36Show/hide
Query:  MQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNA
        MQALSGMIGRKASVGGFLS PSNPRAQPWTGGGNYQAGV GANGIRYPSSPSRKRWILG VRIDPC+AVANALSIPPGETLPGLDMPRILLKERGAFGNA
Subjt:  MQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGVRGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNA

Query:  DTGGAWLSSARAVR--LPVISRRKVR----MTSSHHAPYALGD--TRATMAGTKGRDPARHGVLLLFEPGFETKLLL--RRIDGAIQVRSNVDPTFYSLV
        DTGGAWLSSARA     P      V+    +    H  Y   D  +R+  AG  GR PAR  +  + E   +  +++   RIDGAIQVRSNVDPTFYSLV
Subjt:  DTGGAWLSSARAVR--LPVISRRKVR----MTSSHHAPYALGD--TRATMAGTKGRDPARHGVLLLFEPGFETKLLL--RRIDGAIQVRSNVDPTFYSLV

Query:  GSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGL
        GSGRSGGDH GSSLLEN YIPYQYMDSYLSSTGL
Subjt:  GSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGL

SwissProt top hitse value%identityAlignment
P03938 Uncharacterized protein ycf682.1e-0893.75Show/hide
Query:  KLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGG
        KLLLRRIDGAIQVRS+VD TFYSLVGSGRSGG
Subjt:  KLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGG

P52807 Uncharacterized protein ycf685.4e-2073.91Show/hide
Query:  LFEPGFETKLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGGD---HLGSSLLENPYIPYQYMDSYLSSTG
        L EPGF T+L+LRRIDGAIQVRSN DPTFYS V  G  G     HLGSSLLENPYIPYQ MD YLS TG
Subjt:  LFEPGFETKLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGGD---HLGSSLLENPYIPYQYMDSYLSSTG

Q49KT9 Uncharacterized protein ycf681.7e-2685.71Show/hide
Query:  GFETKLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLEKAAINR
        GFETK LLRRIDGAIQVRSNVDPTFYSLVGSGRSGGD  GSSLLENPYIPYQ MDSYLSSTGL  A++ +
Subjt:  GFETKLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLEKAAINR

Q6L3C9 Uncharacterized protein ycf682.1e-0893.75Show/hide
Query:  KLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGG
        KLLLRRIDGAIQVRS+VD TFYSLVGSGRSGG
Subjt:  KLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGG

Q85WV9 Uncharacterized protein ycf683.7e-2175.36Show/hide
Query:  LFEPGFETKLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGGD---HLGSSLLENPYIPYQYMDSYLSSTG
        LFEPGF T+L+LRRIDGAIQVRSN DPTFYS V  G  GG    H GSSLLENPYIPYQ MD YLS TG
Subjt:  LFEPGFETKLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGGD---HLGSSLLENPYIPYQYMDSYLSSTG

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGACAATTCCGAATTAGCTTTGATGAACGCTGGCGGCATGCTTAACACATGCAAGTCGGACGGGAAGTGGTGTTTCCAGTGGCGGACGGCCACACTGGGACTGAG
ACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAATGCCGCGTGGAGCAGCCGCGGTAATACAGAGGATGCAAG
CGTTATCCGGAATGATTGGGCGTAAAGCGTCTGTAGGTGGCTTTTTAAGTCCGCCGTCAAATCCCAGGGCTCAACCCTGGACAGGCGGTGGAAACTACCAAGCTGGAGTA
CGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGCGCTGTGCGTATCGACCCGTGCAGTGCTGTAGCTAACGCGTTAAGTATCCC
GCCTGGGGAAACCTTACCAGGGCTTGACATGCCGCGAATCCTCTTGAAAGAGAGGGGTGCCTTCGGGAACGCGGACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGCCG
TAAGACTGCCGGTGATAAGCCGGAGGAAGGTGAGGATGACGTCAAGTCATCATGCCCCTTATGCCCTGGGCGACACACGTGCTACAATGGCCGGGACAAAGGGTCGTGAT
CCCGCGAGGCATGGCGTACTTCTCCTGTTCGAACCGGGGTTTGAAACCAAACTTCTGCTCAGGAGGATAGATGGGGCGATTCAGGTGAGATCCAATGTAGATCCAACTTT
CTATTCACTCGTGGGATCCGGGCGGTCCGGGGGGGACCACCTCGGCTCCTCTCTTCTCGAGAATCCATACATCCCTTATCAGTATATGGACAGCTATCTCTCGAGCACAG
GTCTGGAGAAAGCTGCAATCAATAGGATTTTCTTAATCCTCCCTTCCCGAAAGAAAAAAGAAAGGACTCCAGGATGGCCCAGCTACGCCAAGGAAAAGAATAAAAGAATA
GAAGAAGCATCTGACTCCTTCATGCAGGCCCCACTTGGCTCGGGGGGATATAGCTCAGTTGGTAGAGCTCCGCTCTTGCAATTGGGTCGTTGCGATTACGGGTTGGATGT
CTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGACAATTCCGAATTAGCTTTGATGAACGCTGGCGGCATGCTTAACACATGCAAGTCGGACGGGAAGTGGTGTTTCCAGTGGCGGACGGCCACACTGGGACTGAG
ACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAATGCCGCGTGGAGCAGCCGCGGTAATACAGAGGATGCAAG
CGTTATCCGGAATGATTGGGCGTAAAGCGTCTGTAGGTGGCTTTTTAAGTCCGCCGTCAAATCCCAGGGCTCAACCCTGGACAGGCGGTGGAAACTACCAAGCTGGAGTA
CGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGCGCTGTGCGTATCGACCCGTGCAGTGCTGTAGCTAACGCGTTAAGTATCCC
GCCTGGGGAAACCTTACCAGGGCTTGACATGCCGCGAATCCTCTTGAAAGAGAGGGGTGCCTTCGGGAACGCGGACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGCCG
TAAGACTGCCGGTGATAAGCCGGAGGAAGGTGAGGATGACGTCAAGTCATCATGCCCCTTATGCCCTGGGCGACACACGTGCTACAATGGCCGGGACAAAGGGTCGTGAT
CCCGCGAGGCATGGCGTACTTCTCCTGTTCGAACCGGGGTTTGAAACCAAACTTCTGCTCAGGAGGATAGATGGGGCGATTCAGGTGAGATCCAATGTAGATCCAACTTT
CTATTCACTCGTGGGATCCGGGCGGTCCGGGGGGGACCACCTCGGCTCCTCTCTTCTCGAGAATCCATACATCCCTTATCAGTATATGGACAGCTATCTCTCGAGCACAG
GTCTGGAGAAAGCTGCAATCAATAGGATTTTCTTAATCCTCCCTTCCCGAAAGAAAAAAGAAAGGACTCCAGGATGGCCCAGCTACGCCAAGGAAAAGAATAAAAGAATA
GAAGAAGCATCTGACTCCTTCATGCAGGCCCCACTTGGCTCGGGGGGATATAGCTCAGTTGGTAGAGCTCCGCTCTTGCAATTGGGTCGTTGCGATTACGGGTTGGATGT
CTAA
Protein sequenceShow/hide protein sequence
MKDNSELALMNAGGMLNTCKSDGKWCFQWRTATLGLRHGPDSYGRQQWGIFRNGRKPDGAMPRGAAAVIQRMQALSGMIGRKASVGGFLSPPSNPRAQPWTGGGNYQAGV
RGANGIRYPSSPSRKRWILGAVRIDPCSAVANALSIPPGETLPGLDMPRILLKERGAFGNADTGGAWLSSARAVRLPVISRRKVRMTSSHHAPYALGDTRATMAGTKGRD
PARHGVLLLFEPGFETKLLLRRIDGAIQVRSNVDPTFYSLVGSGRSGGDHLGSSLLENPYIPYQYMDSYLSSTGLEKAAINRIFLILPSRKKKERTPGWPSYAKEKNKRI
EEASDSFMQAPLGSGGYSSVGRAPLLQLGRCDYGLDV