CuGenDBv2

HG10023232 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10023232
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Unknown protein
Genome location: Chr05:32404205..32407526
RNA-Seq Expression: HG10023232
Synteny: HG10023232
Gene Ontology terms: NA
InterPro domains: NA
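The genome location string above can be split into a chromosome name and two coordinates. The following is a minimal sketch, not part of the database itself: the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re


def parse_location(loc):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr05:32404205..32407526")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 3322 bp on Chr05.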


Homology
GenBank top hits (e value, % identity, alignment)
KAG6575620.1 hypothetical protein SDJN03_26259, partial [Cucurbita argyrosperma subsp. sororia] (e value: 9.3e-95; % identity: 75.46)
Query:  MSDEEWVDVALSDDSLVVDLLLRLNRPPS-----PPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSS------GGGGGATSVSGGFV
        MSDEEWV+VALSDDSLVVDLLLRLNRPP      PPL LDWSVRQPRSK IL RHASDSA KNSD+AARASPTTPLTWSS      GGGGGATS+SGGFV
Subjt:  MSDEEWVDVALSDDSLVVDLLLRLNRPPS-----PPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSS------GGGGGATSVSGGFV

Query:  DASDAARS------------------KTLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQPQLQ
        DASDAARS                  KTLGELKEEE LLLKERRSL+DALAT+R++VEKQR +NGSLKKMKL+LE QQ       S V +E N DQPQLQ
Subjt:  DASDAARS------------------KTLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQPQLQ

Query:  MPLRSICSSTPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQEDSGTEALYRMS
           RSIC++TPIGVAAFGCNGVDASYQLTLPN+SCKLQEMGTLGTVRLLPDLNLPFQ+DSG EALYRMS
Subjt:  MPLRSICSSTPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQEDSGTEALYRMS

XP_023548546.1 uncharacterized protein LOC111807180 isoform X4 [Cucurbita pepo subsp. pepo] (e value: 9.3e-95; % identity: 76.32)
Query:  MSDEEWVDVALSDDSLVVDLLLRLNRPPS-----PPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSS--GGGGGATSVSGGFVDASD
        MSDEEWV+VALSDDSLVVDLLLRLNRPP      PPL LDWSVRQPRSK IL RHASDSA KNSD+AARASPTTPLTWSS  GGGGGATS+SGGFVDASD
Subjt:  MSDEEWVDVALSDDSLVVDLLLRLNRPPS-----PPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSS--GGGGGATSVSGGFVDASD

Query:  AARSK-------------------TLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQPQLQMPL
        AARSK                   TLGELKEEE LLLKERRSL DALAT+R++VEKQR +NGSLKKMKL+LE QQ       S V ++ N DQPQLQ   
Subjt:  AARSK-------------------TLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQPQLQMPL

Query:  RSICSSTPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQEDSGTEALYRMS
        RSIC++TPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQ+DSG EALYRMS
Subjt:  RSICSSTPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQEDSGTEALYRMS

XP_023548547.1 uncharacterized protein LOC111807180 isoform X5 [Cucurbita pepo subsp. pepo] (e value: 7.1e-95; % identity: 76.6)
Query:  MSDEEWVDVALSDDSLVVDLLLRLNRPPS-----PPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSS--GGGGGATSVSGGFVDASD
        MSDEEWV+VALSDDSLVVDLLLRLNRPP      PPL LDWSVRQPRSK IL RHASDSA KNSD+AARASPTTPLTWSS  GGGGGATS+SGGFVDASD
Subjt:  MSDEEWVDVALSDDSLVVDLLLRLNRPPS-----PPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSS--GGGGGATSVSGGFVDASD

Query:  AARS------------------KTLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQPQLQMPLR
        AARS                  KTLGELKEEE LLLKERRSL DALAT+R++VEKQR +NGSLKKMKL+LE QQ       S V ++ N DQPQLQ   R
Subjt:  AARS------------------KTLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQPQLQMPLR

Query:  SICSSTPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQEDSGTEALYRMS
        SIC++TPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQ+DSG EALYRMS
Subjt:  SICSSTPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQEDSGTEALYRMS

XP_038896159.1 uncharacterized protein LOC120084450 isoform X1 [Benincasa hispida] (e value: 6.2e-107; % identity: 84.11)
Query:  MSDEEWVDVALSDDSLVVDLLLRLNRPPSPPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSSGGGGGATSVSGGFVDASDAARS---
        MSDEEWVDVALSDDSLVVDLLLRLNRPPSPPLPL+WSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWS          SGGFVDASDAARS   
Subjt:  MSDEEWVDVALSDDSLVVDLLLRLNRPPSPPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSSGGGGGATSVSGGFVDASDAARS---

Query:  ---------------KTLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQPQLQMPLRSICSSTP
                       KTLGELKEEEVLLLKERRSL+DALAT+RLSVEKQRAMNGSLKKMKLDLESQQAIE + TSAVLEEAN DQPQLQMP RSIC++T 
Subjt:  ---------------KTLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQPQLQMPLRSICSSTP

Query:  IGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQEDSGTEALYRMS
        IGVAAFGCN VDASYQLTLPNVSCKLQE+GTLGTVRLLPDLNLPFQEDSGTEALYRMS
Subjt:  IGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQEDSGTEALYRMS

XP_038896160.1 uncharacterized protein LOC120084450 isoform X2 [Benincasa hispida] (e value: 2.7e-110; % identity: 90.42)
Query:  MSDEEWVDVALSDDSLVVDLLLRLNRPPSPPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSSGGGGGATSVSGGFVDASDAARSKTL
        MSDEEWVDVALSDDSLVVDLLLRLNRPPSPPLPL+WSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWS          SGGFVDASDAARSKTL
Subjt:  MSDEEWVDVALSDDSLVVDLLLRLNRPPSPPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSSGGGGGATSVSGGFVDASDAARSKTL

Query:  GELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQPQLQMPLRSICSSTPIGVAAFGCNGVDASYQLT
        GELKEEEVLLLKERRSL+DALAT+RLSVEKQRAMNGSLKKMKLDLESQQAIE + TSAVLEEAN DQPQLQMP RSIC++T IGVAAFGCN VDASYQLT
Subjt:  GELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQPQLQMPLRSICSSTPIGVAAFGCNGVDASYQLT

Query:  LPNVSCKLQEMGTLGTVRLLPDLNLPFQEDSGTEALYRMS
        LPNVSCKLQE+GTLGTVRLLPDLNLPFQEDSGTEALYRMS
Subjt:  LPNVSCKLQEMGTLGTVRLLPDLNLPFQEDSGTEALYRMS

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CDM0 uncharacterized protein LOC103499830 (e value: 1.2e-92; % identity: 77.99)
Query:  MSDEEWVDVALSDDSLVVDLLLRLNRPPSPPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSSGGGGGATSVSGGFVDASDAARS---
        MSD EWVDVALSDDSLVVDLLLRLNRPPSPPLPLDWSVRQPRSKPILPRH        SDSAARASPTTPLTWSS GGGG     GGFVDASDAARS   
Subjt:  MSDEEWVDVALSDDSLVVDLLLRLNRPPSPPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSSGGGGGATSVSGGFVDASDAARS---

Query:  ---------------KTLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQP-QLQMPLRSICSST
                       KTLGELKEEEVLLLKERRSL+DALAT+RLSVEKQRAMNGSLKK+KLDLESQQAIEMV TSAV  EAN +QP QLQ P RSICS+T
Subjt:  ---------------KTLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQP-QLQMPLRSICSST

Query:  PIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQEDSGTEALYRMS
        PI        G DASYQLT+PNVSCKLQE+GTLGTVRLLPDLNLPFQEDS TEALYRMS
Subjt:  PIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQEDSGTEALYRMS

A0A6J1GPA3 uncharacterized protein LOC111456207 isoform X1 (e value: 2.2e-94; % identity: 75.75)
Query:  MSDEEWVDVALSDDSLVVDLLLRLNRPPS-----PPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSSGGG----GGATSVSGGFVDA
        MSDEEWV+VALSDDSLVVDLLLRLNRPP      PPL LDWSVRQPRSK IL RHASDSA KNSD+AARASPTTPLTWSSGGG    GGATS+SGGFVDA
Subjt:  MSDEEWVDVALSDDSLVVDLLLRLNRPPS-----PPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSSGGG----GGATSVSGGFVDA

Query:  SDAARSK-------------------TLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQPQLQM
        SDAARSK                   TLGELKEEE LLLKERRSL+DALAT+R++VEKQR +NGSLKKMKL+LE QQ       S V +E N DQ QLQ 
Subjt:  SDAARSK-------------------TLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQPQLQM

Query:  PLRSICSSTPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQEDSGTEALYRMS
          RSIC++TPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQ+DSG EALYRMS
Subjt:  PLRSICSSTPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQEDSGTEALYRMS

A0A6J1GQQ9 uncharacterized protein LOC111456207 isoform X2 (e value: 1.7e-94; % identity: 76.03)
Query:  MSDEEWVDVALSDDSLVVDLLLRLNRPPS-----PPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSSGGG----GGATSVSGGFVDA
        MSDEEWV+VALSDDSLVVDLLLRLNRPP      PPL LDWSVRQPRSK IL RHASDSA KNSD+AARASPTTPLTWSSGGG    GGATS+SGGFVDA
Subjt:  MSDEEWVDVALSDDSLVVDLLLRLNRPPS-----PPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSSGGG----GGATSVSGGFVDA

Query:  SDAARS------------------KTLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQPQLQMP
        SDAARS                  KTLGELKEEE LLLKERRSL+DALAT+R++VEKQR +NGSLKKMKL+LE QQ       S V +E N DQ QLQ  
Subjt:  SDAARS------------------KTLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQPQLQMP

Query:  LRSICSSTPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQEDSGTEALYRMS
         RSIC++TPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQ+DSG EALYRMS
Subjt:  LRSICSSTPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQEDSGTEALYRMS

A0A6J1JT79 uncharacterized protein LOC111488119 isoform X5 (e value: 2.9e-94; % identity: 76.14)
Query:  MSDEEWVDVALSDDSLVVDLLLRLNRPPS-----PPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSSGG-GGGATSVSGGFVDASDA
        MSDEEWV+VALSDDSLVVDLLLRLNRPP      PPL LDWSVRQPRSK IL RHASDSA KN D+AARASPTTPLTWSSGG GGGATS+SGGFVDASDA
Subjt:  MSDEEWVDVALSDDSLVVDLLLRLNRPPS-----PPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSSGG-GGGATSVSGGFVDASDA

Query:  ARS------------------KTLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQPQLQMPLRS
        ARS                  KTLGELKEEE LLLKERRSL+DALAT+R++VEKQR +NGSLKKMKL+LE QQ       S +  E N DQPQLQ   +S
Subjt:  ARS------------------KTLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQPQLQMPLRS

Query:  ICSSTPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQEDSGTEALYRMS
        IC++TPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQ+DSG EALYRMS
Subjt:  ICSSTPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQEDSGTEALYRMS

A0A6J1JV44 uncharacterized protein LOC111488119 isoform X4 (e value: 3.8e-94; % identity: 75.85)
Query:  MSDEEWVDVALSDDSLVVDLLLRLNRPPS-----PPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSSGG-GGGATSVSGGFVDASDA
        MSDEEWV+VALSDDSLVVDLLLRLNRPP      PPL LDWSVRQPRSK IL RHASDSA KN D+AARASPTTPLTWSSGG GGGATS+SGGFVDASDA
Subjt:  MSDEEWVDVALSDDSLVVDLLLRLNRPPS-----PPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSSGG-GGGATSVSGGFVDASDA

Query:  ARSK-------------------TLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQPQLQMPLR
        ARSK                   TLGELKEEE LLLKERRSL+DALAT+R++VEKQR +NGSLKKMKL+LE QQ       S +  E N DQPQLQ   +
Subjt:  ARSK-------------------TLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQPQLQMPLR

Query:  SICSSTPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQEDSGTEALYRMS
        SIC++TPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQ+DSG EALYRMS
Subjt:  SICSSTPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLLPDLNLPFQEDSGTEALYRMS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G15800.1 unknown protein (e value: 1.2e-18; % identity: 42.02)
Query:  MSDEEWVDVALSDDSLVVD-LLLRLNRPPSPP---------LPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWS-----SGGGGGATSV-
        M+  +W+  A+ DDSLV + L+  L+  PS P         L L WSVRQPR+K    R       K  D   RASPTTPL+WS     SGGGGGA +  
Subjt:  MSDEEWVDVALSDDSLVVD-LLLRLNRPPSPP---------LPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWS-----SGGGGGATSV-

Query:  -----SGGFVDASDAARS------------------KTLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQ
             S G V  S+A RS                  KTL +LKEEE +LLKER  L + LAT++  +++QRA N SLK  KL  ESQ+
Subjt:  -----SGGFVDASDAARS------------------KTLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQ

AT1G80610.1 unknown protein (e value: 4.3e-21; % identity: 39.82)
Query:  MSDEEWVDVALSDDSLVVDLLLRL------NRPPSPPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWS-----SGGGG------GATS
        MS E W+ VA+SDDS+V + LLRL      NR  + PL L WSVRQ RSK                   RASPTTPL+WS     SGGGG      GAT+
Subjt:  MSDEEWVDVALSDDSLVVDLLLRL------NRPPSPPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWS-----SGGGG------GATS

Query:  VSGGFVDASDAA------------------------------RSKTLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEM
        + G  ++ S AA                              + KTL ELKEEE++LLKE   L++ LA +R  +E+QRA N +LKKMK   ESQ A+  
Subjt:  VSGGFVDASDAA------------------------------RSKTLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEM

Query:  VATSAVLEEANFDQPQLQMPL
          T    + ++F  P L MPL
Subjt:  VATSAVLEEANFDQPQLQMPL

AT4G32030.1 unknown protein (e value: 1.7e-14; % identity: 36.56)
Query:  EEWVDVALSDDSLVVDLLLRL--------NRPPSPPLPLDWSVRQPRSKPILPRHASDS----ALKNSDSAARASPTTPLTWSSG---GGGGATSVSGGF
        ++WV VA++DD LVV+LLLRL        + P     PL W +RQ RS+    R         +LK    + RASP TPL+WS G   GGG A+  + GF
Subjt:  EEWVDVALSDDSLVVDLLLRL--------NRPPSPPLPLDWSVRQPRSKPILPRHASDS----ALKNSDSAARASPTTPLTWSSG---GGGGATSVSGGF

Query:  VDASDAA---------------------------RSKTLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLES
         D S  A                           + K+  ELK EE L LKER  LE  +A++R + ++Q   N  LK++KLDL S
Subjt:  VDASDAA---------------------------RSKTLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLES

AT4G32030.2 unknown protein (e value: 3.2e-08; % identity: 40.54)
Query:  EEWVDVALSDDSLVVDLLLRL--------NRPPSPPLPLDWSVRQPRSKPILPRHASDS----ALKNSDSAARASPTTPLTWSSG---GGGGATSVSGGF
        ++WV VA++DD LVV+LLLRL        + P     PL W +RQ RS+    R         +LK    + RASP TPL+WS G   GGG A+  + GF
Subjt:  EEWVDVALSDDSLVVDLLLRL--------NRPPSPPLPLDWSVRQPRSKPILPRHASDS----ALKNSDSAARASPTTPLTWSSG---GGGGATSVSGGF

Query:  VDASDAARSKT
         D S  A   T
Subjt:  VDASDAARSKT
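
In BLAST-style alignment blocks such as the ones listed here, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a positive-scoring substitution, and a space for a mismatch or gap. The sketch below counts those marks for a single block. The function name `midline_stats` is hypothetical, and the input is assumed to be one query line and its middle line with the 8-character `Query:  ` prefix already removed.

```python
def midline_stats(query, midline):
    """Count identical and positive columns in one alignment block.

    Both strings must cover the same columns; the label prefix is
    ASSUMED to have been stripped by the caller.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = 0
    positive = 0
    for mark in midline:
        if mark == "+":
            positive += 1          # conservative substitution
        elif mark != " ":
            identical += 1         # letter on the midline: identical residue
            positive += 1          # identities also count as positives
    return {"columns": len(query), "identical": identical, "positive": positive}


# Final block of the AT4G32030.2 alignment shown above.
stats = midline_stats("VDASDAARSKT", " D S  A   T")
print(stats)
```

A per-block count like this will generally differ from the % identity reported for the hit, which is computed over the whole alignment.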

AT5G25210.1 unknown protein2.3e-0629.05Show/hide
Query:  MSDEEWVDVALSDDSLVVDLLLRLNRPPSPPLP----LDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSSGGGGGATSVSGGFVDASDAAR
        +S ++W   A+ D  +V +LL++L        P    L W ++QPRS+   PR  S+S         R SP+TPL+W SGG GG++S   G+VD  +A  
Subjt:  MSDEEWVDVALSDDSLVVDLLLRLNRPPSPPLP----LDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSSGGGGGATSVSGGFVDASDAAR

Query:  SKTLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLD
         +            +    S    ++++R    ++   N +LK+MK++
Subjt:  SKTLGELKEEEVLLLKERRSLEDALATIRLSVEKQRAMNGSLKKMKLD


Sequences
CDS sequence
ATGTCTGACGAGGAATGGGTCGACGTTGCCCTCTCTGATGACTCTCTCGTCGTCGACTTACTTCTTCGTCTCAACCGTCCTCCCTCTCCGCCGCTCCCTCTCGACTGGTC
CGTTCGTCAGCCTCGCTCTAAGCCGATTCTTCCTCGCCATGCCTCCGATTCTGCACTCAAAAACTCCGATTCCGCCGCCAGAGCCAGTCCGACCACTCCCCTCACTTGGA
GCAGCGGCGGCGGTGGTGGAGCTACCTCCGTCAGCGGCGGATTTGTCGACGCATCCGACGCCGCAAGATCTAAGACACTAGGAGAACTTAAAGAGGAGGAAGTTTTGCTA
TTAAAGGAAAGAAGAAGCTTGGAAGATGCCTTGGCTACCATAAGGCTCTCTGTGGAAAAACAAAGGGCTATGAATGGAAGCTTGAAGAAAATGAAGCTTGATCTCGAATC
ACAGCAAGCGATTGAAATGGTTGCAACGTCTGCTGTTCTGGAGGAGGCAAATTTCGACCAACCTCAACTACAGATGCCACTGAGATCTATATGCAGCTCTACGCCCATCG
GAGTCGCTGCTTTCGGTTGCAATGGCGTTGATGCTTCTTACCAATTAACACTGCCAAATGTTTCTTGCAAACTACAGGAGATGGGAACTTTAGGGACTGTTCGTTTATTA
CCCGATCTTAATTTGCCTTTTCAGGAGGATTCTGGCACTGAGGCCCTATACCGAATGAGCTAG
mRNA sequence
ATGTCTGACGAGGAATGGGTCGACGTTGCCCTCTCTGATGACTCTCTCGTCGTCGACTTACTTCTTCGTCTCAACCGTCCTCCCTCTCCGCCGCTCCCTCTCGACTGGTC
CGTTCGTCAGCCTCGCTCTAAGCCGATTCTTCCTCGCCATGCCTCCGATTCTGCACTCAAAAACTCCGATTCCGCCGCCAGAGCCAGTCCGACCACTCCCCTCACTTGGA
GCAGCGGCGGCGGTGGTGGAGCTACCTCCGTCAGCGGCGGATTTGTCGACGCATCCGACGCCGCAAGATCTAAGACACTAGGAGAACTTAAAGAGGAGGAAGTTTTGCTA
TTAAAGGAAAGAAGAAGCTTGGAAGATGCCTTGGCTACCATAAGGCTCTCTGTGGAAAAACAAAGGGCTATGAATGGAAGCTTGAAGAAAATGAAGCTTGATCTCGAATC
ACAGCAAGCGATTGAAATGGTTGCAACGTCTGCTGTTCTGGAGGAGGCAAATTTCGACCAACCTCAACTACAGATGCCACTGAGATCTATATGCAGCTCTACGCCCATCG
GAGTCGCTGCTTTCGGTTGCAATGGCGTTGATGCTTCTTACCAATTAACACTGCCAAATGTTTCTTGCAAACTACAGGAGATGGGAACTTTAGGGACTGTTCGTTTATTA
CCCGATCTTAATTTGCCTTTTCAGGAGGATTCTGGCACTGAGGCCCTATACCGAATGAGCTAG
Protein sequence
MSDEEWVDVALSDDSLVVDLLLRLNRPPSPPLPLDWSVRQPRSKPILPRHASDSALKNSDSAARASPTTPLTWSSGGGGGATSVSGGFVDASDAARSKTLGELKEEEVLL
LKERRSLEDALATIRLSVEKQRAMNGSLKKMKLDLESQQAIEMVATSAVLEEANFDQPQLQMPLRSICSSTPIGVAAFGCNGVDASYQLTLPNVSCKLQEMGTLGTVRLL
PDLNLPFQEDSGTEALYRMS
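
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly, and the function name `translate` is hypothetical. For brevity it translates only the last line of the CDS, which starts on a codon boundary because the preceding 660 nucleotides are a multiple of three.

```python
from itertools import product

# Standard genetic code (ASSUMED), with codons ordered over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Last line of the CDS above; the final codon TAG is the stop.
tail = "CCCGATCTTAATTTGCCTTTTCAGGAGGATTCTGGCACTGAGGCCCTATACCGAATGAGCTAG"
print(translate(tail))
```

The translated tail ends in a stop codon and otherwise matches the last line of the protein sequence (PDLNLPFQEDSGTEALYRMS).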