CuGenDBv2

HG10003363 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10003363
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: C2H2-type domain-containing protein
Genome location: Chr08:57929..65060
RNA-Seq Expression: HG10003363
Synteny: HG10003363
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
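
The genome location above follows a `chromosome:start..end` pattern. The sketch below shows one way to split such a string into its parts for downstream use. The function name `parse_location`, the `GenomeLocation` type, and the `span` property are hypothetical and not part of CuGenDB, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'chrom:start..end' location string."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'Chr08:57929..65060'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in: {text!r}")
    return GenomeLocation(chrom, start, end)


loc = parse_location("Chr08:57929..65060")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Rejecting malformed strings with `ValueError` keeps a bad record from silently producing wrong coordinates.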


Homology
GenBank top hits (e value, % identity, alignment)
KAG6384451.1 hypothetical protein SASPL_155737 [Salvia splendens] (e value: 7.7e-104; % identity: 61.56)
Query:  RVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRL-----NPHRSMPQVDRRTGS--SPFHIRLGHIAGPHPLPSRQFQALFDSLFKV
        RVEWGA RP PGAR CRS  E         R   H R+ G+    +L      P  ++ +   R+    SPFHIR GHIAGPHPLPSRQFQALFDSLFKV
Subjt:  RVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRL-----NPHRSMPQVDRRTGS--SPFHIRLGHIAGPHPLPSRQFQALFDSLFKV

Query:  LFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGI
        LFIFPSRYLFAIGLSP+FSLG+NLPPDWGCIPKQPDS TAPR ATGS  +GALTLSGAPFQGT ARSAAEDASPDYNSD   ARFSSWA PGSLAVT+GI
Subjt:  LFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGI

Query:  L-------------------CRDDGRRGLEFKPSARRCAREASVCPRPRNEPTWGVAWGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRY
        L                   CR D RRGL+F+P+    ARE S+ PRPR EP  GV  GA MRDAQADVPSA+ LRAQLAFKDS+V GILQFTPSIAFRY
Subjt:  L-------------------CRDDGRRGLEFKPSARRCAREASVCPRPRNEPTWGVAWGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRY

Query:  VLHRCESRDIRCRESLQI-----APPTK--NGHAPP-----------PIESRKSSQSVNPYYVWTCAGGTTR
        VLHRCESRDIRCRES ++     APPT    G AP             + SR   + V P + WT   G  R
Subjt:  VLHRCESRDIRCRESLQI-----APPTK--NGHAPP-----------PIESRKSSQSVNPYYVWTCAGGTTR

XP_022933354.1 uncharacterized protein LOC111440692 [Cucurbita moschata] (e value: 2.5e-99; % identity: 92.42)
Query:  RFLSRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFI
        R   RVEWGAHRPMPGARRCRS PEGA CQPRSGRRRLH+RNKGLG GRRLNPHRSMPQVD RTG  PFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFI
Subjt:  RFLSRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFI

Query:  FPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL
        FPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARS AEDASPDYNS+ E ARFSSWALPGSLAVT+GIL
Subjt:  FPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL

XP_022975713.1 uncharacterized protein LOC111475769 [Cucurbita maxima] (e value: 1.4e-97; % identity: 91.41)
Query:  RFLSRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFI
        R   RVEWGAHRPMPGARRCRS PEGA CQPRSGRRRLH+RNKGLG GRRLNPHRSMPQVD RTG  PFHIRLGHI GPHPLPSRQFQALFDSLFKVLFI
Subjt:  RFLSRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFI

Query:  FPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL
        FPSRYLFAIGLSPIFSLGQNLPPDWG IPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARS AEDASPDYNS+ E ARFSSWALPGSLAVT+GIL
Subjt:  FPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL

XP_023520622.1 uncharacterized protein LOC111784033 [Cucurbita pepo subsp. pepo] (e value: 5.1e-92; % identity: 92.97)
Query:  MPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSP
        MPGARRCRS P+GA CQPRSGRRRLH+RNKGLG GRRLNPHRSMPQVD RTG SPFHIRL HIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSP
Subjt:  MPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSP

Query:  IFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL
        IFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARS AEDASPDYNS+ E ARFSSWALPGSLAVT+GIL
Subjt:  IFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL

XP_023552350.1 uncharacterized protein LOC111810043 [Cucurbita pepo subsp. pepo] (e value: 1.7e-98; % identity: 91.92)
Query:  RFLSRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFI
        R   RVEWGAHRPMPGARRCRS P+GA CQPRSGRRRLH+RNKGLG GRRLNPHRSMPQVD RTG SPFHIRL HIAGPHPLPSRQFQALFDSLFKVLFI
Subjt:  RFLSRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFI

Query:  FPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL
        FPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARS AEDASPDYNS+ E ARFSSWALPGSLAVT+GIL
Subjt:  FPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL

TrEMBL top hits (e value, % identity, alignment)
A0A6J1EZI8 uncharacterized protein LOC111440692 (e value: 1.2e-99; % identity: 92.42)
Query:  RFLSRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFI
        R   RVEWGAHRPMPGARRCRS PEGA CQPRSGRRRLH+RNKGLG GRRLNPHRSMPQVD RTG  PFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFI
Subjt:  RFLSRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFI

Query:  FPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL
        FPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARS AEDASPDYNS+ E ARFSSWALPGSLAVT+GIL
Subjt:  FPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL

A0A6N2KB50 Uncharacterized protein (Fragment) (e value: 2.3e-93; % identity: 51)
Query:  PLGAGRPLLLVGNRAMGACVAS----SPDSDLEAFSHNPAHGSFAPLAFQPSAMTNCANQRFLSRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHR
        P G  RPL     R +G C  +    SP +D  + +H+    + +P AFQ                        PG R    PP      PRS       
Subjt:  PLGAGRPLLLVGNRAMGACVAS----SPDSDLEAFSHNPAHGSFAPLAFQPSAMTNCANQRFLSRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHR

Query:  RNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRG
                   + H      DR    SPFHIR G IAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLG+NLPPDWGCIPKQPDS TAPRG
Subjt:  RNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRG

Query:  ATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL----------------------------------------CR
        A GS  +GALTLSGAPFQGT A SAAEDASPDYNS+   ARFSSWA PGSLAVTRGIL                                        CR
Subjt:  ATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL----------------------------------------CR

Query:  DDGRRGLEFKPSARRCAREASVCPRPRNEPTWGVAWGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRESLQI----
         + RRGL+F+P+    ARE S+ PRPR EP  GV  GA MRD QADVPS RR RAQLAFKDS+V GILQFTPSIAFRYVLHRCESRDIRCRES ++    
Subjt:  DDGRRGLEFKPSARRCAREASVCPRPRNEPTWGVAWGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRESLQI----

Query:  -APPTK--NGHAPP-----------PIESRKSSQSVNPYYVWTCAGGTTRP
         APPT    G AP             + SR   + V P + WT   G   P
Subjt:  -APPTK--NGHAPP-----------PIESRKSSQSVNPYYVWTCAGGTTRP

A0A6N2KB50 Uncharacterized protein (Fragment) (e value: 1.5e-20; % identity: 53.6)
Query:  SWVGARGARDESERRRAESQWIVAARPLCHLQYPVAYLSRLQRILPVARWKLRYKAAPATHPSRGLSQRHVPLG------AGRPLLLVGNRAMGA-CVAS
        SW   RG+ DESER  AESQWIVAARPLCHLQ PVA+LSRL+RILP ARW+L +KAA A  P RGL QRHVP G      A  P   V N  +G  C   
Subjt:  SWVGARGARDESERRRAESQWIVAARPLCHLQYPVAYLSRLQRILPVARWKLRYKAAPATHPSRGLSQRHVPLG------AGRPLLLVGNRAMGA-CVAS

Query:  SPDSDLEAFSHNPAHGSFAPLAFQP
           +D+E    N A  ++ P A  P
Subjt:  SPDSDLEAFSHNPAHGSFAPLAFQP

A0A6N2MTU4 Uncharacterized protein (e value: 1.0e-101; % identity: 68.71)
Query:  RPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGL
        R   GARR     +GARC PRS RR LHRR KG GLGR  +PHRS P+ DRRTG        G  + PHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGL
Subjt:  RPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIGL

Query:  SPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL-------------
        SPIFSLG+NLPPDWGCIPKQPDS TAPRGA GS  +GALTLSGAPFQGT A SAAEDASPDYNS+   ARFSSWA PGSLAVTRGIL             
Subjt:  SPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL-------------

Query:  -----------CRDDGRRGLEFKPSARRCAREASVCPRPRNEPTWGVAWGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRD
                   CR + RRGL+F+P+    ARE S+ PRPR EP  GV  GA MRD QADVPS RR RAQLAFKDS+V GILQFTPSIAFRYVLHRCESRD
Subjt:  -----------CRDDGRRGLEFKPSARRCAREASVCPRPRNEPTWGVAWGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRD

Query:  IRCRESLQIA
        IRCRES +++
Subjt:  IRCRESLQIA

A0A6N2NG36 Uncharacterized protein (e value: 2.1e-99; % identity: 61.14)
Query:  RPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGR-RLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIG
        R   GARR     +GARC PRS RR LHRR KG GLGR   + H      DR    SPFHIR G IAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIG
Subjt:  RPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGR-RLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFIFPSRYLFAIG

Query:  LSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL------------
        LSPIFSLG+NLPPDWGCIPKQPDS TAPRGA GS  +GALTLSGAPFQGT A SAAEDASPDYNS+   ARFSSWA PGSLAVTRGIL            
Subjt:  LSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL------------

Query:  -------CRDDGRRGLEFKPSARRCAREASVCPRPRNEPTWGVAWGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCR
               CR + RRGL+F+P+    A +    PRPR EP  GV  GA MRD QADVPS RR RAQLAFKDS+V GILQFTPSIAFRYVLHRCESRDIRCR
Subjt:  -------CRDDGRRGLEFKPSARRCAREASVCPRPRNEPTWGVAWGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCR

Query:  ESLQIAPPTKNGHAPPPIESRKSSQSVNPYYVWTCAGGTTRPVKARSASPAEGTSRPVLTIGGPIDPT
        ES +++   K G  P P   R   ++ +  + W  + G    V+ R+ S A+ +        G + P+
Subjt:  ESLQIAPPTKNGHAPPPIESRKSSQSVNPYYVWTCAGGTTRPVKARSASPAEGTSRPVLTIGGPIDPT

A0A6N2NG36 Uncharacterized protein (e value: 1.4e-13; % identity: 88)
Query:  GAGRPLLLVGNRAMGACVASSPDSDLEAFSHNPAHGSFAPLAFQPSAMTN
        GA RPLL VGNR  GA VASSPDSDLEAFSHNP HGSFAPLAFQPSAMTN
Subjt:  GAGRPLLLVGNRAMGACVASSPDSDLEAFSHNPAHGSFAPLAFQPSAMTN

A0A6N2NG36 Uncharacterized protein (e value: 6.8e-98; % identity: 91.41)
Query:  RFLSRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFI
        R   RVEWGAHRPMPGARRCRS PEGA CQPRSGRRRLH+RNKGLG GRRLNPHRSMPQVD RTG  PFHIRLGHI GPHPLPSRQFQALFDSLFKVLFI
Subjt:  RFLSRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQALFDSLFKVLFI

Query:  FPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL
        FPSRYLFAIGLSPIFSLGQNLPPDWG IPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARS AEDASPDYNS+ E ARFSSWALPGSLAVT+GIL
Subjt:  FPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGIL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCCGCCGAATGCACCGCATCCGAGAGAGCGAGCACCGACAACGCGGCCCGCAACGAGCCTTGGAAAGGCCGAAGCGGAACGCGACGCGGGCTTTGGGGTTCGTCGTGC
ACCCAACGAGGGGTGCCGACAGAAAAGGACTAAAGAGACGCGACGCTTCCATCACGTGAGGTACCGATGCAAGAACCGATCGATCGCGGCCGCTCGAAATACTTCAGGGC
ACGTCAATGAGGAGGAGCTGACGCCGACAGTTCGATGCCCGAGCACCGAGCCTACCGACCCAAACTACAGAATCACCACTCACGCGCCGTACGCATTCGAGCCCGGGCAA
CGCTCGACTATCAGCACCGAGCCTACAGAAAAGCAAGGGTGCAGAGTCGTCGGGCGAGGCCGAGCACGCAACGCCGTGCGCGGTCTTTCCTTTCCCTTTCCATTCTCGAT
CACTTTAGTTTTGTTTCCAATGGTTGACCCCAAGCACATGCCTGCCCACCTGCATAGCATTGTGGGCGGGCATGGGGGGCACGCAAGCTGGGCATGGGGCGCACCCCCCA
CCCATGCAGTCTACAAAAAAAATTTCAAAACAATCCAAGCATTGGAAGATAGATTTCACGTTTCATGGGTGTTGGGTGTCACGACACACAGGAGTTGGGTGTCACGACAC
ATAGGAGGTGGGTGTCACGACACACAGGAGGTGGGTGTCATGGTACGCACAGGAGGTGAATGTCATTGCATTAACGCAACACAGGGGTGCTTGGTGGCATGGCACGTGGA
GGGGTGCAAGGTGTGGGGCTGGGTGGGCATGGTGCGTGCGGGGTGGGCATGGGGGGCGCCCATGAGGTGGGGCTGGGTGGGCACGGGGCGCACGCACGGGGGGCTGGGTG
GGCATGGGGGCGCCCATGAGGTGGGGCTGGGTGGGCATGGGGGCGCCCATGAGGTGGGGCTGGGTGGGCATGGGGCGCAAGCACGGGGGGCAGGGTGGGCATGTGGGGCG
CCCGTGAGGTGGAGCTGGGTGGGCGCGCGGGGTGCAAGGGACGAATCGGAGCGACGAAGGGCTGAATCTCAGTGGATCGTGGCAGCAAGGCCACTCTGCCACTTACAATA
CCCTGTCGCGTATTTAAGTCGTCTGCAAAGGATTCTACCCGTCGCTCGGTGGAAATTACGTTACAAGGCGGCCCCCGCGACTCATCCGTCACGAGGGCTTAGCCAACGAC
ACGTGCCTTTGGGGGCCGGAAGGCCCCTACTGCTGGTCGGCAATCGAGCGATGGGCGCATGCGTCGCTTCTAGCCCGGATTCTGACTTAGAGGCGTTCAGTCATAATCCA
GCGCACGGTAGCTTCGCGCCACTGGCTTTTCAACCAAGCGCGATGACCAATTGTGCGAATCAACGGTTCCTCTCACGGGTCGAATGGGGAGCCCACAGGCCGATGCCAGG
AGCGCGCAGATGCCGAAGCCCGCCCGAAGGCGCGCGCTGCCAGCCACGATCGGGACGACGACGTCTCCACAGGCGTAACAAAGGCCTGGGCTTAGGCCGCCGTCTCAATC
CGCATCGGTCCATGCCCCAAGTCGATCGGCGGACCGGCTCATCACCGTTCCACATCCGACTGGGGCACATCGCCGGCCCCCATCCGCTTCCCTCCCGACAATTTCAAGCA
CTATTTGACTCTCTTTTCAAAGTCCTTTTCATCTTTCCCTCGCGGTACTTGTTTGCTATCGGTCTCTCGCCCATATTTAGCCTTGGACAGAATTTACCGCCCGATTGGGG
CTGCATTCCCAAACAACCCGACTCGTTGACAGCGCCTCGTGGTGCGACAGGGTCCGAGCGCAACGGGGCTCTCACCCTCTCTGGCGCCCCCTTCCAGGGGACTTGTGCCC
GGTCCGCCGCTGAGGACGCTTCTCCAGACTACAATTCGGACGTCGAGGACGCCCGATTCTCAAGCTGGGCTCTTCCCGGTTCGCTCGCCGTTACTAGGGGAATCCTTTGT
CGCGACGACGGGCGCCGAGGACTCGAATTTAAGCCATCCGCGCGACGGTGCGCACGGGAGGCCAGCGTGTGCCCCCGCCCGCGCAACGAGCCCACATGGGGGGTTGCGTG
GGGGGCAGCGATGCGTGACGCCCAGGCAGACGTGCCCTCGGCCAGAAGGCTCCGGGCGCAACTTGCGTTCAAAGACTCGGTGGTTCGCGGGATCCTGCAATTCACACCAA
GTATCGCATTTCGCTACGTTCTTCATCGATGCGAGAGCCGAGATATCCGTTGCCGAGAGTCGTTACAAATCGCTCCACCAACTAAGAACGGCCATGCACCACCACCCATA
GAATCAAGAAAGAGCTCTCAGTCTGTCAATCCTTACTATGTCTGGACCTGCGCCGGAGGCACGACCCGGCCAGTTAAGGCCAGGAGCGCATCGCCGGCAGAAGGGACGAG
CCGACCGGTGCTCACCATAGGCGGACCGATCGACCCAACCCAAGGTCCAACTACGAGCTTTTTAACTGCAACAACTTAA
mRNA sequence
ATGCCGCCGAATGCACCGCATCCGAGAGAGCGAGCACCGACAACGCGGCCCGCAACGAGCCTTGGAAAGGCCGAAGCGGAACGCGACGCGGGCTTTGGGGTTCGTCGTGC
ACCCAACGAGGGGTGCCGACAGAAAAGGACTAAAGAGACGCGACGCTTCCATCACGTGAGGTACCGATGCAAGAACCGATCGATCGCGGCCGCTCGAAATACTTCAGGGC
ACGTCAATGAGGAGGAGCTGACGCCGACAGTTCGATGCCCGAGCACCGAGCCTACCGACCCAAACTACAGAATCACCACTCACGCGCCGTACGCATTCGAGCCCGGGCAA
CGCTCGACTATCAGCACCGAGCCTACAGAAAAGCAAGGGTGCAGAGTCGTCGGGCGAGGCCGAGCACGCAACGCCGTGCGCGGTCTTTCCTTTCCCTTTCCATTCTCGAT
CACTTTAGTTTTGTTTCCAATGGTTGACCCCAAGCACATGCCTGCCCACCTGCATAGCATTGTGGGCGGGCATGGGGGGCACGCAAGCTGGGCATGGGGCGCACCCCCCA
CCCATGCAGTCTACAAAAAAAATTTCAAAACAATCCAAGCATTGGAAGATAGATTTCACGTTTCATGGGTGTTGGGTGTCACGACACACAGGAGTTGGGTGTCACGACAC
ATAGGAGGTGGGTGTCACGACACACAGGAGGTGGGTGTCATGGTACGCACAGGAGGTGAATGTCATTGCATTAACGCAACACAGGGGTGCTTGGTGGCATGGCACGTGGA
GGGGTGCAAGGTGTGGGGCTGGGTGGGCATGGTGCGTGCGGGGTGGGCATGGGGGGCGCCCATGAGGTGGGGCTGGGTGGGCACGGGGCGCACGCACGGGGGGCTGGGTG
GGCATGGGGGCGCCCATGAGGTGGGGCTGGGTGGGCATGGGGGCGCCCATGAGGTGGGGCTGGGTGGGCATGGGGCGCAAGCACGGGGGGCAGGGTGGGCATGTGGGGCG
CCCGTGAGGTGGAGCTGGGTGGGCGCGCGGGGTGCAAGGGACGAATCGGAGCGACGAAGGGCTGAATCTCAGTGGATCGTGGCAGCAAGGCCACTCTGCCACTTACAATA
CCCTGTCGCGTATTTAAGTCGTCTGCAAAGGATTCTACCCGTCGCTCGGTGGAAATTACGTTACAAGGCGGCCCCCGCGACTCATCCGTCACGAGGGCTTAGCCAACGAC
ACGTGCCTTTGGGGGCCGGAAGGCCCCTACTGCTGGTCGGCAATCGAGCGATGGGCGCATGCGTCGCTTCTAGCCCGGATTCTGACTTAGAGGCGTTCAGTCATAATCCA
GCGCACGGTAGCTTCGCGCCACTGGCTTTTCAACCAAGCGCGATGACCAATTGTGCGAATCAACGGTTCCTCTCACGGGTCGAATGGGGAGCCCACAGGCCGATGCCAGG
AGCGCGCAGATGCCGAAGCCCGCCCGAAGGCGCGCGCTGCCAGCCACGATCGGGACGACGACGTCTCCACAGGCGTAACAAAGGCCTGGGCTTAGGCCGCCGTCTCAATC
CGCATCGGTCCATGCCCCAAGTCGATCGGCGGACCGGCTCATCACCGTTCCACATCCGACTGGGGCACATCGCCGGCCCCCATCCGCTTCCCTCCCGACAATTTCAAGCA
CTATTTGACTCTCTTTTCAAAGTCCTTTTCATCTTTCCCTCGCGGTACTTGTTTGCTATCGGTCTCTCGCCCATATTTAGCCTTGGACAGAATTTACCGCCCGATTGGGG
CTGCATTCCCAAACAACCCGACTCGTTGACAGCGCCTCGTGGTGCGACAGGGTCCGAGCGCAACGGGGCTCTCACCCTCTCTGGCGCCCCCTTCCAGGGGACTTGTGCCC
GGTCCGCCGCTGAGGACGCTTCTCCAGACTACAATTCGGACGTCGAGGACGCCCGATTCTCAAGCTGGGCTCTTCCCGGTTCGCTCGCCGTTACTAGGGGAATCCTTTGT
CGCGACGACGGGCGCCGAGGACTCGAATTTAAGCCATCCGCGCGACGGTGCGCACGGGAGGCCAGCGTGTGCCCCCGCCCGCGCAACGAGCCCACATGGGGGGTTGCGTG
GGGGGCAGCGATGCGTGACGCCCAGGCAGACGTGCCCTCGGCCAGAAGGCTCCGGGCGCAACTTGCGTTCAAAGACTCGGTGGTTCGCGGGATCCTGCAATTCACACCAA
GTATCGCATTTCGCTACGTTCTTCATCGATGCGAGAGCCGAGATATCCGTTGCCGAGAGTCGTTACAAATCGCTCCACCAACTAAGAACGGCCATGCACCACCACCCATA
GAATCAAGAAAGAGCTCTCAGTCTGTCAATCCTTACTATGTCTGGACCTGCGCCGGAGGCACGACCCGGCCAGTTAAGGCCAGGAGCGCATCGCCGGCAGAAGGGACGAG
CCGACCGGTGCTCACCATAGGCGGACCGATCGACCCAACCCAAGGTCCAACTACGAGCTTTTTAACTGCAACAACTTAA
Protein sequence
MPPNAPHPRERAPTTRPATSLGKAEAERDAGFGVRRAPNEGCRQKRTKETRRFHHVRYRCKNRSIAAARNTSGHVNEEELTPTVRCPSTEPTDPNYRITTHAPYAFEPGQ
RSTISTEPTEKQGCRVVGRGRARNAVRGLSFPFPFSITLVLFPMVDPKHMPAHLHSIVGGHGGHASWAWGAPPTHAVYKKNFKTIQALEDRFHVSWVLGVTTHRSWVSRH
IGGGCHDTQEVGVMVRTGGECHCINATQGCLVAWHVEGCKVWGWVGMVRAGWAWGAPMRWGWVGTGRTHGGLGGHGGAHEVGLGGHGGAHEVGLGGHGAQARGAGWACGA
PVRWSWVGARGARDESERRRAESQWIVAARPLCHLQYPVAYLSRLQRILPVARWKLRYKAAPATHPSRGLSQRHVPLGAGRPLLLVGNRAMGACVASSPDSDLEAFSHNP
AHGSFAPLAFQPSAMTNCANQRFLSRVEWGAHRPMPGARRCRSPPEGARCQPRSGRRRLHRRNKGLGLGRRLNPHRSMPQVDRRTGSSPFHIRLGHIAGPHPLPSRQFQA
LFDSLFKVLFIFPSRYLFAIGLSPIFSLGQNLPPDWGCIPKQPDSLTAPRGATGSERNGALTLSGAPFQGTCARSAAEDASPDYNSDVEDARFSSWALPGSLAVTRGILC
RDDGRRGLEFKPSARRCAREASVCPRPRNEPTWGVAWGAAMRDAQADVPSARRLRAQLAFKDSVVRGILQFTPSIAFRYVLHRCESRDIRCRESLQIAPPTKNGHAPPPI
ESRKSSQSVNPYYVWTCAGGTTRPVKARSASPAEGTSRPVLTIGGPIDPTQGPTTSFLTATT