; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10003361 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10003361
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionSenescence-associated protein
Genome locationChr08:32934..49285
RNA-Seq ExpressionHG10003361
SyntenyHG10003361
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6384451.1 hypothetical protein SASPL_155737 [Salvia splendens]1.1e-10361.56Show/hide
Query:  RVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRL-----NPHRSMPQVDRRTGS--SPFHIRLGHIAGPHPLPSRQFQALFDSLFKV
        RVEWGA RP PGAR CRS  E         R   H R+ G+    +L      P  ++ +   R+    SPFHIR GHIAGPHPLPSRQFQALFDSLFKV
Subjt:  RVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRL-----NPHRSMPQVDRRTGS--SPFHIRLGHIAGPHPLPSRQFQALFDSLFKV

Query:  LFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGI
        LFIFPSRYLFAIGLSP+FSLG+NLPPDWGCIPKQPDS TAPR ATGS  +GALTLSGAPFQGT ARSAAEDASPDYNSD   ARFSSWA PGSLAVT+GI
Subjt:  LFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGI

Query:  L-------------------CRDDGRRGLEFKPSARRCAREASVCPRPRNEPTWGVAWGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRY
        L                   CR D RRGL+F+P+    ARE S+ PRPR EP  GV  GA MRDAQADVPSA+ LRAQLAFKDS+V GILQFTPSIAFRY
Subjt:  L-------------------CRDDGRRGLEFKPSARRCAREASVCPRPRNEPTWGVAWGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRY

Query:  VLHRCESRDIRCRESLQI-----APPTK--NGHAPP-----------PIESRKSSQSVNPYYVWTCAGGTTR
        VLHRCESRDIRCRES ++     APPT    G AP             + SR   + V P + WT   G  R
Subjt:  VLHRCESRDIRCRESLQI-----APPTK--NGHAPP-----------PIESRKSSQSVNPYYVWTCAGGTTR

XP_022933354.1 uncharacterized protein LOC111440692 [Cucurbita moschata]2.4e-10394.06Show/hide
Query:  DSLVRVSRRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFK
        DSLVRVSRRVEWGAHRPMPGARRCRS PEGA CQPRSGRRRLH+RNKGLG GRRLNPHRSMPQVD RTG  PFHIRLGHIAGPHPLPSRQFQALFDSLFK
Subjt:  DSLVRVSRRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFK

Query:  VLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRG
        VLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARS AEDASPDYNS+ E ARFSSWALPGSLAVT+G
Subjt:  VLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRG

Query:  IL
        IL
Subjt:  IL

XP_022975713.1 uncharacterized protein LOC111475769 [Cucurbita maxima]1.3e-10193.07Show/hide
Query:  DSLVRVSRRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFK
        DSLVRVSRRVEWGAHRPMPGARRCRS PEGA CQPRSGRRRLH+RNKGLG GRRLNPHRSMPQVD RTG  PFHIRLGHI GPHPLPSRQFQALFDSLFK
Subjt:  DSLVRVSRRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFK

Query:  VLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRG
        VLFIFPSRYLFAIGLSPIFSLGQNLPPDWG IPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARS AEDASPDYNS+ E ARFSSWALPGSLAVT+G
Subjt:  VLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRG

Query:  IL
        IL
Subjt:  IL

XP_023520622.1 uncharacterized protein LOC111784033 [Cucurbita pepo subsp. pepo]9.5e-9292.97Show/hide
Query:  MPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSP
        MPGARRCRS P+GA CQPRSGRRRLH+RNKGLG GRRLNPHRSMPQVD RTG SPFHIRL HIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSP
Subjt:  MPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSP

Query:  IFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL
        IFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARS AEDASPDYNS+ E ARFSSWALPGSLAVT+GIL
Subjt:  IFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL

XP_023552350.1 uncharacterized protein LOC111810043 [Cucurbita pepo subsp. pepo]1.6e-10293.56Show/hide
Query:  DSLVRVSRRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFK
        DSLVRVSRRVEWGAHRPMPGARRCRS P+GA CQPRSGRRRLH+RNKGLG GRRLNPHRSMPQVD RTG SPFHIRL HIAGPHPLPSRQFQALFDSLFK
Subjt:  DSLVRVSRRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFK

Query:  VLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRG
        VLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARS AEDASPDYNS+ E ARFSSWALPGSLAVT+G
Subjt:  VLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRG

Query:  IL
        IL
Subjt:  IL

TrEMBL top hitse value%identityAlignment
A0A6J1EZI8 uncharacterized protein LOC1114406921.2e-10394.06Show/hide
Query:  DSLVRVSRRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFK
        DSLVRVSRRVEWGAHRPMPGARRCRS PEGA CQPRSGRRRLH+RNKGLG GRRLNPHRSMPQVD RTG  PFHIRLGHIAGPHPLPSRQFQALFDSLFK
Subjt:  DSLVRVSRRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFK

Query:  VLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRG
        VLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARS AEDASPDYNS+ E ARFSSWALPGSLAVT+G
Subjt:  VLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRG

Query:  IL
        IL
Subjt:  IL

A0A6J1IK28 uncharacterized protein LOC1114757696.4e-10293.07Show/hide
Query:  DSLVRVSRRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFK
        DSLVRVSRRVEWGAHRPMPGARRCRS PEGA CQPRSGRRRLH+RNKGLG GRRLNPHRSMPQVD RTG  PFHIRLGHI GPHPLPSRQFQALFDSLFK
Subjt:  DSLVRVSRRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFK

Query:  VLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRG
        VLFIFPSRYLFAIGLSPIFSLGQNLPPDWG IPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARS AEDASPDYNS+ E ARFSSWALPGSLAVT+G
Subjt:  VLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRG

Query:  IL
        IL
Subjt:  IL

A0A6N2KB50 Uncharacterized protein (Fragment)4.9e-9462.46Show/hide
Query:  SPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAA
        SPFHIR G IAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLG+NLPPDWGCIPKQPDS TAPRGA GS  +GALTLSGAPFQGT A SAA
Subjt:  SPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAA

Query:  EDASPDYNSDVEDARFSSWALPGSLAVTRGIL----------------------------------------CRDDGRRGLEFKPSARRCAREASVCPRP
        EDASPDYNS+   ARFSSWA PGSLAVTRGIL                                        CR + RRGL+F+P+    ARE S+ PRP
Subjt:  EDASPDYNSDVEDARFSSWALPGSLAVTRGIL----------------------------------------CRDDGRRGLEFKPSARRCAREASVCPRP

Query:  RNEPTWGVAWGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRESLQI-----APPTK--NGHAPP-----------P
        R EP  GV  GA MRD QADVPS RR RAQLAFKDS+V GILQFTPSIAFRYVLHRCESRDIRCRES ++     APPT    G AP             
Subjt:  RNEPTWGVAWGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRESLQI-----APPTK--NGHAPP-----------P

Query:  IESRKSSQSVNPYYVWTCAGGTTRP
        + SR   + V P + WT   G   P
Subjt:  IESRKSSQSVNPYYVWTCAGGTTRP

A0A6N2MTU4 Uncharacterized protein1.4e-10168.71Show/hide
Query:  RPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGL
        R   GARR     +GARC PRS RR LHRR KG GLGR  +PHRS P+ DRRTG        G  + PHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGL
Subjt:  RPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGL

Query:  SPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL-------------
        SPIFSLG+NLPPDWGCIPKQPDS TAPRGA GS  +GALTLSGAPFQGT A SAAEDASPDYNS+   ARFSSWA PGSLAVTRGIL             
Subjt:  SPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL-------------

Query:  -----------CRDDGRRGLEFKPSARRCAREASVCPRPRNEPTWGVAWGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRD
                   CR + RRGL+F+P+    ARE S+ PRPR EP  GV  GA MRD QADVPS RR RAQLAFKDS+V GILQFTPSIAFRYVLHRCESRD
Subjt:  -----------CRDDGRRGLEFKPSARRCAREASVCPRPRNEPTWGVAWGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRD

Query:  IRCRESLQIA
        IRCRES +++
Subjt:  IRCRESLQIA

A0A6N2NG36 Uncharacterized protein3.0e-9961.14Show/hide
Query:  RPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGR-RLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIG
        R   GARR     +GARC PRS RR LHRR KG GLGR   + H      DR    SPFHIR G IAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIG
Subjt:  RPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGR-RLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIG

Query:  LSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL------------
        LSPIFSLG+NLPPDWGCIPKQPDS TAPRGA GS  +GALTLSGAPFQGT A SAAEDASPDYNS+   ARFSSWA PGSLAVTRGIL            
Subjt:  LSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL------------

Query:  -------CRDDGRRGLEFKPSARRCAREASVCPRPRNEPTWGVAWGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCR
               CR + RRGL+F+P+    A +    PRPR EP  GV  GA MRD QADVPS RR RAQLAFKDS+V GILQFTPSIAFRYVLHRCESRDIRCR
Subjt:  -------CRDDGRRGLEFKPSARRCAREASVCPRPRNEPTWGVAWGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCR

Query:  ESLQIAPPTKNGHAPPPIESRKSSQSVNPYYVWTCAGGTTRPVKARSASPAEGTSRPVLTIGGPIDPT
        ES +++   K G  P P   R   ++ +  + W  + G    V+ R+ S A+ +        G + P+
Subjt:  ESLQIAPPTKNGHAPPPIESRKSSQSVNPYYVWTCAGGTTRPVKARSASPAEGTSRPVLTIGGPIDPT

SwissProt top hitse value%identityAlignment
Q6CQE5 Protein TAR11.5e-1068Show/hide
Query:  FPPDNFKHYLTLFSKSFSSFPRGTCLLSVSRPYLALDRIYRPIGAAFPNN
        FP +NF ++ TLFSK FSSF   TC LSVSR YLALD IY P+ AAFPNN
Subjt:  FPPDNFKHYLTLFSKSFSSFPRGTCLLSVSRPYLALDRIYRPIGAAFPNN

Q8TGM6 Protein TAR11.5e-1068Show/hide
Query:  FPPDNFKHYLTLFSKSFSSFPRGTCLLSVSRPYLALDRIYRPIGAAFPNN
        FP +NF ++ TLFSK FSSF   TC LSVSR YLALD IY P+ AAFPNN
Subjt:  FPPDNFKHYLTLFSKSFSSFPRGTCLLSVSRPYLALDRIYRPIGAAFPNN

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGGACCTGAGACTTCTCAAGCATATCATGCCGCCGAATGCACCGCATCCGAGAGAGCGAGCACCGACAACGCGGCCCGCAACGAGCCTTGGAAAGGCCGAAGCGG
AACGCGACGCGGGCTTTGGGGTTCGTCGTGCACCCAACGAGGGTGCCGACAGAAAAGGACTAAAGAGACGCGACGCTTCCATCACGTGAGGTACCGATGCAAGAACCGAT
CGATCGCGGCCGCTCGAAATACTTCAGGGCACGTCAATGAGGAGGAGCTGACGCCGACAGTTCGATGCCCGAGCACCGAGCCTACCGACCCAAACTACAGAATCACCACT
CACGCGCCGTACGCATTCGAGCCCGGGCAACGCTCGACTATCAGCACCGAGCCTACAGAAAAGCAAGGGTGCAGAGTCGTCGGGCGAGGCCGAGCACGCAACGCCGTGCG
CGGTCTTTCCTTTCCCTTTCCATTCTCGATCACTTTAGTTTTGTTTCCAATGGTTGACCCCAAGCACATGCCTGCCCACCTGCATAGCATTGTGGGCGGGCATGGGGGGC
ACGCAAGCTGGGCATGGGGCGCACCCCCCACCCATGCAGGGGTGCTTGGTGGCATGGCACGTGGAGGGTGCAAGGTGGGTGAGGAGGCGCGGGGTGGGGCTGACTCCTTG
GTCCGTGTTTCAAGACGGGTCGAATGGGGAGCCCACAGGCCGATGCCAGGAGCGCGCAGATGCCGAAGCCCGCCCGAAGGCGCGCGCTGCCAGCCACGATCGGGACGACG
ACGTCTCCACAGGCGTAACAAAGGCCTGGGCTTAGGCCGCCGTCTCAATCCGCATCGGTCCATGCCCCAAGTCGATCGGCGGACCGGCTCATCACCGTTCCACATCCGAC
TGGGGCACATCGCCGGCCCCATCCGCTTCCCTCCCGACAATTTCAAGCACTATTTGACTCTCTTTTCAAAGTCCTTTTCATCTTTCCCTCGCGGTACTTGTTTGCTATCG
GTCTCTCGCCCATATTTAGCCTTGGACAGAATTTACCGCCCGATTGGGGCTGCATTCCCAAACAACCGACTCGTTGACAGCGCCTCGTGGTGCGACAGGGTCCGAGCGCA
ACGGGGCTCTCACCCTCTCTCGCCCCCTTCCAGGGGACTTGTGCCCGGTCCGCCGCTGAGGACGCTTCTCCAGACTACAATTCGGACGTCGAGGACGCCCGATTCTCAAG
CTGGGCTCTTCCCGGTTCGCTCGCCGTTACTAGGGGAATCCTTGAGTCCACAAGGAGGCGGCGTCAAATTCGCGACGCGGTACCGAGGTTGGATCAACCACCGTAGTGTC
GCGACGACGGGCGCCGAGGACTCGAATTTAAGCCATCCGCGCGACGGTGCGCACGGGAGGCCAGCGTGTGCCCCCGCCCGCGCAACGAGCCCACATGGGGGGTTGCGTGG
GGGGGCAGCGATGCGTGACGCCCAGGCAGACGTGCCCTCGGCCAGAAGGCTCCGGGCGCAACTTGCGTTCAAAGACTCGGTGGTTCGCGGGATCCTGCAATTCACACCAA
GTATCGCATTTCGCTACGTTCTTCATCGATGCGAGAGCCGAGATATCCGTTGCCGAGAGTCGTTACAAATCGCTCCACCAACTAAGAACGGCCATGCACCACCACCCATA
GAATCAAGAAAGAGCTCTCAGTCTGTCAATCCTTACTATGTCTGGACCTGCTCGCATGCCGTCACCTACTGGCGTCCGTTAGGACAACAGAGGGCGCCGGATCAACGCGG
GGAGCGAGCGTCATTCGTCGAAACAATCTGTAAAGGCAACACGTTTGAGAGACTTCTCAAGCATATCATGCCGCCGAATGCACCGCATCCGAGAGAGCGAGCACCGACAA
CGCGGCCCGCAACGAGCCTTGGAAAGGCCGAAGCGGAACGCGACGCGGGCTTTGGGGTTCGTCGTGCACCCAACGAGGGGTGCCGACAGAAAAGGACTAAAGAGACGCGA
CGCTTCCATCACGTGAGAATCACCACTCACGCGCCGTACGCATTCGAGCCCGGGCAACGCTCGACTATCAGCACCGAGCCTACAGAAAAGCAAGGGTGCAGAGTCGTCGG
GCGAGGCCGAGCACGCAACGCCGTGCGCGGTCTTTCCTTTCCCTTTCCATTCTCGATCACTTTAGTTTTGTTTCCAATGGTTGACCCCAAGCACATGCCTGCCCACCTGC
ATAGCATTGTGGGCGGGCATGGGGGGCACGCAAGCTGGGCATGGGGCGCACCCCCACCCATGCAGGAGTGGGTGTCACGACACATAGGAGGTGGGTGTCACGACACACAG
GAGGTGGGTGTCATGGTACGCACAGGAGGTGAATGTCATTGCATTAACGCAACACAGGGGTGCTTGGTGGCATGGCACGTGGAGGGGTGCAAGGTGGACTCCTTGGTCCG
TGTTTCAAGACGGGTCGAATGGGGAGCCCACAGGCCGATGCCAGGAGCGCGCAGATGCCGAAGCCCGCCCGAAGGCGCGCGCTGCCAGCCACGATCGGGACGACGACGTC
TCCACAGGCGTAACAAAGGCCTGGGCTTAGGCCGCCGTCTCAATCCGCATCGGTCCATGCCCCAAGTCGATCGGCGGACCGGCTCATCACCGTTCCACATCCGACTGGGG
CACATCGCCGGCCCCCATCCGCTTCCCTCCCGACAATTTCAAGCACTATTTGACTCTCTTTTCAAAGTCCTTTTCATCTTTCCCTCGCGGTACTTGTTTGCTATCGGTCT
CTCGCCCATATTTAGCCTTGGACAGAATTTACCGCCCGATTGGGGCTGCATTCCCAAACAACCCGACTCGTTGACAGCGCCTCGTGGTGCGACAGGGTCCGAGCGCAACG
GGGCTCTCACCCTCTCTGGCGCCCCCTTCCAGGGGACTTGTGCCCGGTCCGCCGCTGAGGACGCTTCTCCAGACTACAATTCGGACGTCGAGGACGCCCGATTCTCAAGC
TGGGCTCTTCCCGGTTCGCTCGCCGTTACTAGGGGAATCCTTTGTCGCGACGACGGGCGCCGAGGACTCGAATTTAAGCCATCCGCGCGACGGTGCGCACGGGAGGCCAG
CGTGTGCCCCCGCCCGCGCAACGAGCCCACATGGGGGGTTGCGTGGGGGGCAGCGATGCGTGACGCCCAGGCAGACGTGCCCTCGGCCAGAAGGCTCCGGGCGCAACTTG
CGTTCAAAGACTCGGTGGTTCGCGGGATCCTGCAATTCACACCAAGTATCGCATTTCGCTACGTTCTTCATCGATGCGAGAGCCGAGATATCCGTTGCCGAGAGTCGTTA
CAAATCGCTCCACCAACTAAGAACGGCCATGCACCACCACCCATAGAATCAAGAAAGAGCTCTCAGTCTGTCAATCCTTACTATGTCTGGACCTGCGCCGGAGGCACGAC
CCGGCCAGTTAAGGCCAGGAGCGCATCGCCGGCAGAAGGGACGAGCCGACCGGTGCTCACCATAGGCGGACCGATCGACCCAACCCAAGGTCCAACTACGAGCTTTTTAA
CTGCAACAACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGGACCTGAGACTTCTCAAGCATATCATGCCGCCGAATGCACCGCATCCGAGAGAGCGAGCACCGACAACGCGGCCCGCAACGAGCCTTGGAAAGGCCGAAGCGG
AACGCGACGCGGGCTTTGGGGTTCGTCGTGCACCCAACGAGGGTGCCGACAGAAAAGGACTAAAGAGACGCGACGCTTCCATCACGTGAGGTACCGATGCAAGAACCGAT
CGATCGCGGCCGCTCGAAATACTTCAGGGCACGTCAATGAGGAGGAGCTGACGCCGACAGTTCGATGCCCGAGCACCGAGCCTACCGACCCAAACTACAGAATCACCACT
CACGCGCCGTACGCATTCGAGCCCGGGCAACGCTCGACTATCAGCACCGAGCCTACAGAAAAGCAAGGGTGCAGAGTCGTCGGGCGAGGCCGAGCACGCAACGCCGTGCG
CGGTCTTTCCTTTCCCTTTCCATTCTCGATCACTTTAGTTTTGTTTCCAATGGTTGACCCCAAGCACATGCCTGCCCACCTGCATAGCATTGTGGGCGGGCATGGGGGGC
ACGCAAGCTGGGCATGGGGCGCACCCCCCACCCATGCAGGGGTGCTTGGTGGCATGGCACGTGGAGGGTGCAAGGTGGGTGAGGAGGCGCGGGGTGGGGCTGACTCCTTG
GTCCGTGTTTCAAGACGGGTCGAATGGGGAGCCCACAGGCCGATGCCAGGAGCGCGCAGATGCCGAAGCCCGCCCGAAGGCGCGCGCTGCCAGCCACGATCGGGACGACG
ACGTCTCCACAGGCGTAACAAAGGCCTGGGCTTAGGCCGCCGTCTCAATCCGCATCGGTCCATGCCCCAAGTCGATCGGCGGACCGGCTCATCACCGTTCCACATCCGAC
TGGGGCACATCGCCGGCCCCATCCGCTTCCCTCCCGACAATTTCAAGCACTATTTGACTCTCTTTTCAAAGTCCTTTTCATCTTTCCCTCGCGGTACTTGTTTGCTATCG
GTCTCTCGCCCATATTTAGCCTTGGACAGAATTTACCGCCCGATTGGGGCTGCATTCCCAAACAACCGACTCGTTGACAGCGCCTCGTGGTGCGACAGGGTCCGAGCGCA
ACGGGGCTCTCACCCTCTCTCGCCCCCTTCCAGGGGACTTGTGCCCGGTCCGCCGCTGAGGACGCTTCTCCAGACTACAATTCGGACGTCGAGGACGCCCGATTCTCAAG
CTGGGCTCTTCCCGGTTCGCTCGCCGTTACTAGGGGAATCCTTGAGTCCACAAGGAGGCGGCGTCAAATTCGCGACGCGGTACCGAGGTTGGATCAACCACCGTAGTGTC
GCGACGACGGGCGCCGAGGACTCGAATTTAAGCCATCCGCGCGACGGTGCGCACGGGAGGCCAGCGTGTGCCCCCGCCCGCGCAACGAGCCCACATGGGGGGTTGCGTGG
GGGGGCAGCGATGCGTGACGCCCAGGCAGACGTGCCCTCGGCCAGAAGGCTCCGGGCGCAACTTGCGTTCAAAGACTCGGTGGTTCGCGGGATCCTGCAATTCACACCAA
GTATCGCATTTCGCTACGTTCTTCATCGATGCGAGAGCCGAGATATCCGTTGCCGAGAGTCGTTACAAATCGCTCCACCAACTAAGAACGGCCATGCACCACCACCCATA
GAATCAAGAAAGAGCTCTCAGTCTGTCAATCCTTACTATGTCTGGACCTGCTCGCATGCCGTCACCTACTGGCGTCCGTTAGGACAACAGAGGGCGCCGGATCAACGCGG
GGAGCGAGCGTCATTCGTCGAAACAATCTGTAAAGGCAACACGTTTGAGAGACTTCTCAAGCATATCATGCCGCCGAATGCACCGCATCCGAGAGAGCGAGCACCGACAA
CGCGGCCCGCAACGAGCCTTGGAAAGGCCGAAGCGGAACGCGACGCGGGCTTTGGGGTTCGTCGTGCACCCAACGAGGGGTGCCGACAGAAAAGGACTAAAGAGACGCGA
CGCTTCCATCACGTGAGAATCACCACTCACGCGCCGTACGCATTCGAGCCCGGGCAACGCTCGACTATCAGCACCGAGCCTACAGAAAAGCAAGGGTGCAGAGTCGTCGG
GCGAGGCCGAGCACGCAACGCCGTGCGCGGTCTTTCCTTTCCCTTTCCATTCTCGATCACTTTAGTTTTGTTTCCAATGGTTGACCCCAAGCACATGCCTGCCCACCTGC
ATAGCATTGTGGGCGGGCATGGGGGGCACGCAAGCTGGGCATGGGGCGCACCCCCACCCATGCAGGAGTGGGTGTCACGACACATAGGAGGTGGGTGTCACGACACACAG
GAGGTGGGTGTCATGGTACGCACAGGAGGTGAATGTCATTGCATTAACGCAACACAGGGGTGCTTGGTGGCATGGCACGTGGAGGGGTGCAAGGTGGACTCCTTGGTCCG
TGTTTCAAGACGGGTCGAATGGGGAGCCCACAGGCCGATGCCAGGAGCGCGCAGATGCCGAAGCCCGCCCGAAGGCGCGCGCTGCCAGCCACGATCGGGACGACGACGTC
TCCACAGGCGTAACAAAGGCCTGGGCTTAGGCCGCCGTCTCAATCCGCATCGGTCCATGCCCCAAGTCGATCGGCGGACCGGCTCATCACCGTTCCACATCCGACTGGGG
CACATCGCCGGCCCCCATCCGCTTCCCTCCCGACAATTTCAAGCACTATTTGACTCTCTTTTCAAAGTCCTTTTCATCTTTCCCTCGCGGTACTTGTTTGCTATCGGTCT
CTCGCCCATATTTAGCCTTGGACAGAATTTACCGCCCGATTGGGGCTGCATTCCCAAACAACCCGACTCGTTGACAGCGCCTCGTGGTGCGACAGGGTCCGAGCGCAACG
GGGCTCTCACCCTCTCTGGCGCCCCCTTCCAGGGGACTTGTGCCCGGTCCGCCGCTGAGGACGCTTCTCCAGACTACAATTCGGACGTCGAGGACGCCCGATTCTCAAGC
TGGGCTCTTCCCGGTTCGCTCGCCGTTACTAGGGGAATCCTTTGTCGCGACGACGGGCGCCGAGGACTCGAATTTAAGCCATCCGCGCGACGGTGCGCACGGGAGGCCAG
CGTGTGCCCCCGCCCGCGCAACGAGCCCACATGGGGGGTTGCGTGGGGGGCAGCGATGCGTGACGCCCAGGCAGACGTGCCCTCGGCCAGAAGGCTCCGGGCGCAACTTG
CGTTCAAAGACTCGGTGGTTCGCGGGATCCTGCAATTCACACCAAGTATCGCATTTCGCTACGTTCTTCATCGATGCGAGAGCCGAGATATCCGTTGCCGAGAGTCGTTA
CAAATCGCTCCACCAACTAAGAACGGCCATGCACCACCACCCATAGAATCAAGAAAGAGCTCTCAGTCTGTCAATCCTTACTATGTCTGGACCTGCGCCGGAGGCACGAC
CCGGCCAGTTAAGGCCAGGAGCGCATCGCCGGCAGAAGGGACGAGCCGACCGGTGCTCACCATAGGCGGACCGATCGACCCAACCCAAGGTCCAACTACGAGCTTTTTAA
CTGCAACAACTTAA
Protein sequenceShow/hide protein sequence
MSGPETSQAYHAAECTASERASTDNAARNEPWKGRSGTRRGLWGSSCTQRGCRQKRTKETRRFHHVRYRCKNRSIAAARNTSGHVNEEELTPTVRCPSTEPTDPNYRITT
HAPYAFEPGQRSTISTEPTEKQGCRVVGRGRARNAVRGLSFPFPFSITLVLFPMVDPKHMPAHLHSIVGGHGGHASWAWGAPPTHAGVLGGMARGGCKVGEEARGGADSL
VRVSRRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPIRFPPDNFKHYLTLFSKSFSSFPRGTCLLS
VSRPYLALDRIYRPIGAAFPNNRLVDSASWCDRVRAQRGSHPLSPPSRGLVPGPPLRTLLQTTIRTSRTPDSQAGLFPVRSPLLGESLSPQGGGVKFATRYRGWINHRSV
ATTGAEDSNLSHPRDGAHGRPACAPARATSPHGGLRGGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRESLQIAPPTKNGHAPPPI
ESRKSSQSVNPYYVWTCSHAVTYWRPLGQQRAPDQRGERASFVETICKGNTFERLLKHIMPPNAPHPRERAPTTRPATSLGKAEAERDAGFGVRRAPNEGCRQKRTKETR
RFHHVRITTHAPYAFEPGQRSTISTEPTEKQGCRVVGRGRARNAVRGLSFPFPFSITLVLFPMVDPKHMPAHLHSIVGGHGGHASWAWGAPPPMQEWVSRHIGGGCHDTQ
EVGVMVRTGGECHCINATQGCLVAWHVEGCKVDSLVRVSRRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLG
HIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSS
WALPGSLAVTRGILCRDDGRRGLEFKPSARRCAREASVCPRPRNEPTWGVAWGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRESL
QIAPPTKNGHAPPPIESRKSSQSVNPYYVWTCAGGTTRPVKARSASPAEGTSRPVLTIGGPIDPTQGPTTSFLTATT