CuGenDBv2

Csor.00g220110 (gene) of Silver-seed gourd (wild; sororia) v1 genome

Gene ID: Csor.00g220110
Organism: Cucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
Description: golgin subfamily B member 1-like
Genome location: Csor_Chr04:6332978..6336086
RNA-Seq Expression: Csor.00g220110
Synteny: Csor.00g220110
Gene Ontology terms: NA
InterPro domains: NA
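
The genome location field follows a seqid:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the genomic span of the gene could be derived as follows. The function name parse_location is hypothetical and not part of any CuGenDB tooling.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'seqid:start..end' into (seqid, start, end).

    Assumption: coordinates are 1-based and inclusive.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

seqid, start, end = parse_location("Csor_Chr04:6332978..6336086")
span = end - start + 1  # inclusive span in base pairs
print(seqid, span)
```

Under that assumption the span is 3109 bp, which is longer than the 1143 nt CDS listed for this gene.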


Homology
GenBank top hits (e value, % identity, alignment)
KAG6601032.1 hypothetical protein SDJN03_06265, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.43e-267 | % identity: 100
Query:  MDVWSTQYSHSICQRSSPQCSKDLSSSKSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSSCPSREMKFKVK
        MDVWSTQYSHSICQRSSPQCSKDLSSSKSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSSCPSREMKFKVK
Subjt:  MDVWSTQYSHSICQRSSPQCSKDLSSSKSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSSCPSREMKFKVK

Query:  HAQMEMPNDYHSSGPVRPCSRTCYDSSGNSSTSSSTVSNGVLARYIDGEQHREINGSMNKYSQRSNGWRPPGAQCLPPSSTTAKIADKPDSIHQEKLNVL
        HAQMEMPNDYHSSGPVRPCSRTCYDSSGNSSTSSSTVSNGVLARYIDGEQHREINGSMNKYSQRSNGWRPPGAQCLPPSSTTAKIADKPDSIHQEKLNVL
Subjt:  HAQMEMPNDYHSSGPVRPCSRTCYDSSGNSSTSSSTVSNGVLARYIDGEQHREINGSMNKYSQRSNGWRPPGAQCLPPSSTTAKIADKPDSIHQEKLNVL

Query:  FLVSCLKKWENMDLGMTHLEVLQRPLLTDSHNIMLCLKRPLKSLVKIYPSKLQTMGTMHYNSIVNIQRSELEAETLFSRLLKEKLYSKELEVEQLQAELV
        FLVSCLKKWENMDLGMTHLEVLQRPLLTDSHNIMLCLKRPLKSLVKIYPSKLQTMGTMHYNSIVNIQRSELEAETLFSRLLKEKLYSKELEVEQLQAELV
Subjt:  FLVSCLKKWENMDLGMTHLEVLQRPLLTDSHNIMLCLKRPLKSLVKIYPSKLQTMGTMHYNSIVNIQRSELEAETLFSRLLKEKLYSKELEVEQLQAELV

Query:  TAVRGNDILKCEIQNVMDSLSCLTHTMKYLELQDYLSKLQALEEDILLKEGQITILKDTLGSKSIEFLAPPSSTWEFQLQ
        TAVRGNDILKCEIQNVMDSLSCLTHTMKYLELQDYLSKLQALEEDILLKEGQITILKDTLGSKSIEFLAPPSSTWEFQLQ
Subjt:  TAVRGNDILKCEIQNVMDSLSCLTHTMKYLELQDYLSKLQALEEDILLKEGQITILKDTLGSKSIEFLAPPSSTWEFQLQ

KAG7031838.1 hypothetical protein SDJN02_05879, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 4.07e-70 | % identity: 47.65
Query:  QRSNGWRPPGAQCLPPSSTTAKIADKPDSIHQEKLNVLFLVS----------------------------CLKKWENMDLG-------------------
        QRSNGWRPPGAQCLPPSSTTA+IADKPDSIHQEKL++   +S                             + K  + +LG                   
Subjt:  QRSNGWRPPGAQCLPPSSTTAKIADKPDSIHQEKLNVLFLVS----------------------------CLKKWENMDLG-------------------

Query:  ---------MTHLEVLQRPLLTDS-----------------------------HNIMLCLK-----------RPLKSLVKIYPSKLQTMGTM--------
                 + HL+     LL +S                              +I  CL            R  K  ++     LQ +  +        
Subjt:  ---------MTHLEVLQRPLLTDS-----------------------------HNIMLCLK-----------RPLKSLVKIYPSKLQTMGTM--------

Query:  ------------------HYNSIVNIQRSELEAETLFSRLLKEKLYSKELEVEQLQAELVTAVRGNDILKCEIQNVMDSLSCLTHTMKYLELQDYLSKLQ
                           Y +  +  RSELEAETLFSRLLKEKLYSKELEVEQLQAELVTAVRGNDILKCEIQNVMDSLSCLTHTMKYLELQDYLSKLQ
Subjt:  ------------------HYNSIVNIQRSELEAETLFSRLLKEKLYSKELEVEQLQAELVTAVRGNDILKCEIQNVMDSLSCLTHTMKYLELQDYLSKLQ

Query:  ALEEDILLKEGQITILKDTLGSKSIEFLAPPSSTWEFQLQ
        ALEE+ILLKEGQITILKDTLGSKSIEFLAPPSSTWEF+LQ
Subjt:  ALEEDILLKEGQITILKDTLGSKSIEFLAPPSSTWEFQLQ

XP_022956804.1 uncharacterized protein LOC111458393 [Cucurbita moschata] | e value: 1.21e-163 | % identity: 63.05
Query:  SSPQCSKDLSSSKSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSSCPSREMKFKVKHAQMEMPNDYHSSGP
        SSPQCSKDLSS KSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSSCPSREMKFKVKHAQMEMPNDYHSSGP
Subjt:  SSPQCSKDLSSSKSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSSCPSREMKFKVKHAQMEMPNDYHSSGP

Query:  VRPCSRTCYDSSGNSSTSSSTVSNGVLARYIDGEQHREINGSMNKYSQRSNGWRPPGAQCLPPSSTTAKIADKPD-------------------------
        VRPCSRTCYDSSGNSSTSSSTVSNGVLARYIDGEQHR INGSMNKYSQRSNGWRPP AQCLPPSSTTA+IADKP                          
Subjt:  VRPCSRTCYDSSGNSSTSSSTVSNGVLARYIDGEQHREINGSMNKYSQRSNGWRPPGAQCLPPSSTTAKIADKPD-------------------------

Query:  -----SIHQEKLNVLFLVSCLKKWENMDLG----------------------------MTHLEVLQRPLLTDS-------------------------HN
             SI +  +N L     + K  + +LG                            + HL+     LL +S                          +
Subjt:  -----SIHQEKLNVLFLVSCLKKWENMDLG----------------------------MTHLEVLQRPLLTDS-------------------------HN

Query:  IMLCL---------------KRPLKSLV------------KIYPSKLQTM--GTMHYNSIVNIQ--RSELEAETLFSRLLKEKLYSKELEVEQLQAELVT
        I  CL               K  ++SL             K  PS L +     +  NS       RSELEAETLFSRLL+EKLYSKELEVEQLQAELVT
Subjt:  IMLCL---------------KRPLKSLV------------KIYPSKLQTM--GTMHYNSIVNIQ--RSELEAETLFSRLLKEKLYSKELEVEQLQAELVT

Query:  AVRGNDILKCEIQNVMDSLSCLTHTMKYLELQDYLSKLQALEEDILLKEGQITILKDTLGSKSIEFLAPPSSTWEFQLQ
        AVRGNDILKCEIQNVMDSLSCLTHTMK LELQDYLSKLQALEE+ILLKEGQITILKDTLGSKSIEFLAPPSS WEF+LQ
Subjt:  AVRGNDILKCEIQNVMDSLSCLTHTMKYLELQDYLSKLQALEEDILLKEGQITILKDTLGSKSIEFLAPPSSTWEFQLQ

XP_038893373.1 protein Daple isoform X2 [Benincasa hispida] | e value: 1.54e-66 | % identity: 77.51
Query:  SSKSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSS---CPSREMKFKVKHAQMEMPNDYHSSGPVRPCSRT
        S++S KQIDDSERSST PKLRRT SLSSAAFRDQGQI+FY SSDPSRSPGNASSG +RQHEQSS    PSREM+FK K  Q+EMP+DY++SGPVRPCSRT
Subjt:  SSKSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSS---CPSREMKFKVKHAQMEMPNDYHSSGPVRPCSRT

Query:  CYDSSGNSSTSSSTVSNGVLARYIDGEQHREINGSMNKYSQRSNGWRPPGAQCLPPSSTTAKIADKPDS
        CYDSSGNSS S S+VSN VL RYIDGEQH+EINGSMNK  QR+NGWRPP AQCL  +ST+A I DKP S
Subjt:  CYDSSGNSSTSSSTVSNGVLARYIDGEQHREINGSMNKYSQRSNGWRPPGAQCLPPSSTTAKIADKPDS

XP_038893374.1 uncharacterized protein LOC120082186 isoform X3 [Benincasa hispida] | e value: 5.88e-67 | % identity: 77.51
Query:  SSKSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSS---CPSREMKFKVKHAQMEMPNDYHSSGPVRPCSRT
        S++S KQIDDSERSST PKLRRT SLSSAAFRDQGQI+FY SSDPSRSPGNASSG +RQHEQSS    PSREM+FK K  Q+EMP+DY++SGPVRPCSRT
Subjt:  SSKSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSS---CPSREMKFKVKHAQMEMPNDYHSSGPVRPCSRT

Query:  CYDSSGNSSTSSSTVSNGVLARYIDGEQHREINGSMNKYSQRSNGWRPPGAQCLPPSSTTAKIADKPDS
        CYDSSGNSS S S+VSN VL RYIDGEQH+EINGSMNK  QR+NGWRPP AQCL  +ST+A I DKP S
Subjt:  CYDSSGNSSTSSSTVSNGVLARYIDGEQHREINGSMNKYSQRSNGWRPPGAQCLPPSSTTAKIADKPDS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KNP2 Uncharacterized protein | e value: 5.41e-64 | % identity: 72.63
Query:  SSPQCSKDLSSSKSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSS---CPSREMKFKVKHAQMEMPNDYHS
        SS +      S++S K IDDSER  T PKLRRT SLSSAAFRDQGQI+FY SSDPSRSPGN+SSG +RQHE SS    PSREM+F  K  QMEMPNDY++
Subjt:  SSPQCSKDLSSSKSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSS---CPSREMKFKVKHAQMEMPNDYHS

Query:  SGPVRPCSRTCYDSSGNSSTSSSTVSNGVLARYIDGEQHREINGSMNKYSQRSNGWRPPGAQCLPPSSTTAKIADKPDS
        SG +RP SRTCYDSSGNSSTS S+VSN VL RYIDGEQH+EINGSM+K SQRSNGWRPP AQCLP +STTA I DKP S
Subjt:  SGPVRPCSRTCYDSSGNSSTSSSTVSNGVLARYIDGEQHREINGSMNKYSQRSNGWRPPGAQCLPPSSTTAKIADKPDS

A0A1S3BDK7 rho-associated protein kinase 1 | e value: 3.52e-62 | % identity: 68.42
Query:  TQYSHSICQR-SSPQCSKDLSSSKSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSSC---PSREMKFKVKH
        T + H    R SS +      S++S K IDDSER    PKLRRT SLSSAAFRDQGQ++FY SSDPSR+PGN+SSG ++Q E SSC   PSREM+FK K 
Subjt:  TQYSHSICQR-SSPQCSKDLSSSKSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSSC---PSREMKFKVKH

Query:  AQMEMPNDYHSSGPVRPCSRTCYDSSGNSSTSSSTVSNGVLARYIDGEQHREINGSMNKYSQRSNGWRPPGAQCLPPSSTTAKIADKPDS
         QMEMPNDY++SG VRP SR CYDSSGNSSTS S+VSN VL RYIDGEQH+EINGSMNK SQR+NGWRPP AQCLP +STTA I DKP S
Subjt:  AQMEMPNDYHSSGPVRPCSRTCYDSSGNSSTSSSTVSNGVLARYIDGEQHREINGSMNKYSQRSNGWRPPGAQCLPPSSTTAKIADKPDS

A0A6J1FM18 golgin subfamily B member 1-like | e value: 7.21e-66 | % identity: 75.42
Query:  SSPQCSKDLSSSKSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSS---CPSREMKFKVKHAQMEMPNDYHS
        SSPQ SKDL S K  +QIDD+ERS + PKLRRT SLSSAAFRDQGQINF    DPSRSPGNASS S+RQHEQSS    PSREM+FKVK  Q E+PNDY++
Subjt:  SSPQCSKDLSSSKSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSS---CPSREMKFKVKHAQMEMPNDYHS

Query:  SGPVRPCSRTCYDSSGNSSTSSSTVSNGVLARYIDGEQHREINGSMNKYSQRSNGWRPPGAQCLPPSSTTAKIADKPDS
        SG  RPCSRT YDSSGNS+T+SS VSN VL RYIDGEQH+EINGS NKYSQR+NGWRPP AQCLPPSSTTA I D P S
Subjt:  SGPVRPCSRTCYDSSGNSSTSSSTVSNGVLARYIDGEQHREINGSMNKYSQRSNGWRPPGAQCLPPSSTTAKIADKPDS

A0A6J1H041 uncharacterized protein LOC111458393 | e value: 5.88e-164 | % identity: 63.05
Query:  SSPQCSKDLSSSKSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSSCPSREMKFKVKHAQMEMPNDYHSSGP
        SSPQCSKDLSS KSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSSCPSREMKFKVKHAQMEMPNDYHSSGP
Subjt:  SSPQCSKDLSSSKSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSSCPSREMKFKVKHAQMEMPNDYHSSGP

Query:  VRPCSRTCYDSSGNSSTSSSTVSNGVLARYIDGEQHREINGSMNKYSQRSNGWRPPGAQCLPPSSTTAKIADKPD-------------------------
        VRPCSRTCYDSSGNSSTSSSTVSNGVLARYIDGEQHR INGSMNKYSQRSNGWRPP AQCLPPSSTTA+IADKP                          
Subjt:  VRPCSRTCYDSSGNSSTSSSTVSNGVLARYIDGEQHREINGSMNKYSQRSNGWRPPGAQCLPPSSTTAKIADKPD-------------------------

Query:  -----SIHQEKLNVLFLVSCLKKWENMDLG----------------------------MTHLEVLQRPLLTDS-------------------------HN
             SI +  +N L     + K  + +LG                            + HL+     LL +S                          +
Subjt:  -----SIHQEKLNVLFLVSCLKKWENMDLG----------------------------MTHLEVLQRPLLTDS-------------------------HN

Query:  IMLCL---------------KRPLKSLV------------KIYPSKLQTM--GTMHYNSIVNIQ--RSELEAETLFSRLLKEKLYSKELEVEQLQAELVT
        I  CL               K  ++SL             K  PS L +     +  NS       RSELEAETLFSRLL+EKLYSKELEVEQLQAELVT
Subjt:  IMLCL---------------KRPLKSLV------------KIYPSKLQTM--GTMHYNSIVNIQ--RSELEAETLFSRLLKEKLYSKELEVEQLQAELVT

Query:  AVRGNDILKCEIQNVMDSLSCLTHTMKYLELQDYLSKLQALEEDILLKEGQITILKDTLGSKSIEFLAPPSSTWEFQLQ
        AVRGNDILKCEIQNVMDSLSCLTHTMK LELQDYLSKLQALEE+ILLKEGQITILKDTLGSKSIEFLAPPSS WEF+LQ
Subjt:  AVRGNDILKCEIQNVMDSLSCLTHTMKYLELQDYLSKLQALEEDILLKEGQITILKDTLGSKSIEFLAPPSSTWEFQLQ

A0A6J1JRR9 protein Daple-like | e value: 2.51e-64 | % identity: 75.42
Query:  SSPQCSKDLSSSKSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSS---CPSREMKFKVKHAQMEMPNDYHS
        SSPQ SKDL S K  +QIDD+ERS + PKLRRT SLSSAAFRDQG+INF    DPSRSPGNASS S+RQHEQSS    PSREM+FKVK  Q E+PNDY +
Subjt:  SSPQCSKDLSSSKSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSS---CPSREMKFKVKHAQMEMPNDYHS

Query:  SGPVRPCSRTCYDSSGNSSTSSSTVSNGVLARYIDGEQHREINGSMNKYSQRSNGWRPPGAQCLPPSSTTAKIADKPDS
        SG VRPCSRTCYDSSGN +TSSS VSN VL RYIDGEQH+EINGS NKY QR+NGWRPP AQCLPPSSTTA I D P S
Subjt:  SGPVRPCSRTCYDSSGNSSTSSSTVSNGVLARYIDGEQHREINGSMNKYSQRSNGWRPPGAQCLPPSSTTAKIADKPDS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G39300.1 unknown protein | e value: 9.7e-20 | % identity: 39.43
Query:  SIVNIQRSELEAETLFSRLLKEKLYSKELEVEQLQAELVTAVRGNDILKCEIQNVMDSLSCLTHTMKYLELQ----------------------------
        S+    R+EL AETL + LL+EKLYSKE E+EQL AE+   VRGN++L+CEIQNV+D+LS   H +K L+LQ                            
Subjt:  SIVNIQRSELEAETLFSRLLKEKLYSKELEVEQLQAELVTAVRGNDILKCEIQNVMDSLSCLTHTMKYLELQ----------------------------

Query:  -------------------DYLS-------KLQALEEDILLKEGQITILKDTLGSKSIEFL--APPSSTWEFQLQ
                           D  S       K++ LEED L KEGQITILKDTLGS+  + L  +P  S  +F +Q
Subjt:  -------------------DYLS-------KLQALEEDILLKEGQITILKDTLGSKSIEFL--APPSSTWEFQLQ

AT2G39300.2 unknown protein | e value: 9.7e-20 | % identity: 39.43
Query:  SIVNIQRSELEAETLFSRLLKEKLYSKELEVEQLQAELVTAVRGNDILKCEIQNVMDSLSCLTHTMKYLELQ----------------------------
        S+    R+EL AETL + LL+EKLYSKE E+EQL AE+   VRGN++L+CEIQNV+D+LS   H +K L+LQ                            
Subjt:  SIVNIQRSELEAETLFSRLLKEKLYSKELEVEQLQAELVTAVRGNDILKCEIQNVMDSLSCLTHTMKYLELQ----------------------------

Query:  -------------------DYLS-------KLQALEEDILLKEGQITILKDTLGSKSIEFL--APPSSTWEFQLQ
                           D  S       K++ LEED L KEGQITILKDTLGS+  + L  +P  S  +F +Q
Subjt:  -------------------DYLS-------KLQALEEDILLKEGQITILKDTLGSKSIEFL--APPSSTWEFQLQ

AT3G55060.1 unknown protein | e value: 8.2e-19 | % identity: 34
Query:  LKRPLKSLVKIYPSKLQTMGT-------MHYNSIVNIQRSELEAETLFSRLLKEKLYSKELEVEQLQAELVTAVRGNDILKCEIQNVMDSLSCLTHTMKY
        LKR L+++  +  S  ++  +           S+    R+EL AETL + L++EKLYSKE E+EQLQAEL  AVRGN+IL+CE+Q+ +D+LS  TH +K 
Subjt:  LKRPLKSLVKIYPSKLQTMGT-------MHYNSIVNIQRSELEAETLFSRLLKEKLYSKELEVEQLQAELVTAVRGNDILKCEIQNVMDSLSCLTHTMKY

Query:  LE--------------------------LQDYLSK--------------------------------LQALEEDILLKEGQITILKDTLGSKSIEFLAPP
        L+                          L   LSK                                ++ LEE +L KEG+ITIL+DT+GSK +  L+ P
Subjt:  LE--------------------------LQDYLSK--------------------------------LQALEEDILLKEGQITILKDTLGSKSIEFLAPP

AT3G55060.1 unknown protein | e value: 2.2e-03 | % identity: 29.26
Query:  CSKDLSSSKSRKQIDDSERSSTSPK----------LRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSSCPSREMKFKVKHAQMEMPND
        C K+  S    +    +E+   SPK          LRR+ S SSA F       F  +S    +     S  RR++  S C + E + + +  + +    
Subjt:  CSKDLSSSKSRKQIDDSERSSTSPK----------LRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSSCPSREMKFKVKHAQMEMPND

Query:  YHSSGPVRPCSRTCYDSSGNSSTSSSTVSNGVLARYIDGEQHRE-----INGSMNKYSQRSNGWR-PPGAQCLPPSSTTAKIADKPDS
                   +  +DSSG+SS+ SS VS+ VL RYIDGE+H E      N S +  S+  N  R PP  Q   P+S +    +K  S
Subjt:  YHSSGPVRPCSRTCYDSSGNSSTSSSTVSNGVLARYIDGEQHRE-----INGSMNKYSQRSNGWR-PPGAQCLPPSSTTAKIADKPDS
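
In the alignments above, the middle line of each block repeats the residue where query and subject are identical, shows "+" for a conservative substitution, and is blank otherwise. As a rough sketch of how identical positions in a single block could be counted from that middle line, the helper below (midline_identity, a hypothetical name) is applied to the final block of the KAG7031838.1 hit. The % identity values reported per hit cover the whole alignment and may be computed differently by the database, so a per-block figure is not expected to reproduce them.

```python
def midline_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical positions, aligned columns) for one alignment block.

    A column counts as identical when the midline repeats the query residue;
    '+' (conservative substitution), blanks, and gap columns are not counted.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for q, m in zip(query, midline) if q != "-" and q == m)
    return identical, len(query)

# Final block of the KAG7031838.1 alignment, copied from the record.
query = "ALEEDILLKEGQITILKDTLGSKSIEFLAPPSSTWEFQLQ"
midline = "ALEE+ILLKEGQITILKDTLGSKSIEFLAPPSSTWEF+LQ"

identical, columns = midline_identity(query, midline)
print(f"{identical}/{columns} identical ({100 * identical / columns:.1f}%)")
```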


Sequences
CDS sequence
ATGGACGTTTGGTCGACGCAATATTCTCACTCAATTTGTCAGCGAAGTAGTCCCCAATGTTCCAAGGATTTGTCTTCTTCCAAGTCCAGGAAGCAAATAGATGATAGTGA
AAGGTCCAGCACCAGTCCTAAACTTAGAAGGACCTGGTCATTATCTTCCGCTGCATTTAGAGACCAAGGTCAAATAAACTTTTATAGTTCGAGTGATCCAAGTAGATCTC
CTGGTAATGCTAGCAGTGGCTCCAGACGGCAACATGAACAGTCATCTTGTCCATCTCGAGAGATGAAATTCAAGGTAAAGCATGCGCAGATGGAAATGCCAAATGATTAC
CATAGCTCGGGACCTGTTAGGCCATGCTCCAGAACTTGCTATGATTCTTCAGGTAATTCTTCCACTAGCTCCAGTACTGTTTCAAATGGAGTCTTAGCCCGCTACATTGA
TGGTGAACAACATCGGGAAATAAATGGATCCATGAATAAGTATTCTCAGAGAAGTAACGGGTGGCGGCCTCCTGGAGCACAGTGTCTGCCACCTTCTTCAACAACAGCTA
AAATTGCAGATAAACCAGATTCTATTCATCAAGAGAAGCTAAATGTTCTCTTCCTCGTTTCTTGTCTGAAGAAGTGGGAGAATATGGATTTGGGAATGACTCACCTCGAA
GTATTGCAGAGACCGTTGTTAACAGACTCTCACAACATCATGCTGTGCCTAAAGCGACCTCTAAAGAGCTTGGTGAAAATATACCCATCAAAGTTACAGACAATGGGAAC
AATGCACTACAACTCAATAGTCAATATCCAGAGATCTGAGCTAGAAGCAGAAACTTTATTTTCAAGACTATTGAAAGAGAAACTATACTCTAAGGAGCTGGAAGTGGAGC
AGTTGCAAGCTGAACTGGTGACAGCAGTAAGAGGGAATGACATACTAAAATGTGAAATCCAGAATGTAATGGATAGCCTTTCCTGCCTCACTCATACGATGAAATATCTC
GAACTTCAGGATTATTTGTCAAAGCTACAGGCATTGGAAGAGGACATTTTGCTGAAGGAAGGTCAGATAACAATCTTGAAAGACACACTTGGGAGTAAATCTATTGAATT
TCTTGCTCCTCCCAGTTCTACGTGGGAATTTCAACTGCAGTAA
mRNA sequence
ATGGACGTTTGGTCGACGCAATATTCTCACTCAATTTGTCAGCGAAGTAGTCCCCAATGTTCCAAGGATTTGTCTTCTTCCAAGTCCAGGAAGCAAATAGATGATAGTGA
AAGGTCCAGCACCAGTCCTAAACTTAGAAGGACCTGGTCATTATCTTCCGCTGCATTTAGAGACCAAGGTCAAATAAACTTTTATAGTTCGAGTGATCCAAGTAGATCTC
CTGGTAATGCTAGCAGTGGCTCCAGACGGCAACATGAACAGTCATCTTGTCCATCTCGAGAGATGAAATTCAAGGTAAAGCATGCGCAGATGGAAATGCCAAATGATTAC
CATAGCTCGGGACCTGTTAGGCCATGCTCCAGAACTTGCTATGATTCTTCAGGTAATTCTTCCACTAGCTCCAGTACTGTTTCAAATGGAGTCTTAGCCCGCTACATTGA
TGGTGAACAACATCGGGAAATAAATGGATCCATGAATAAGTATTCTCAGAGAAGTAACGGGTGGCGGCCTCCTGGAGCACAGTGTCTGCCACCTTCTTCAACAACAGCTA
AAATTGCAGATAAACCAGATTCTATTCATCAAGAGAAGCTAAATGTTCTCTTCCTCGTTTCTTGTCTGAAGAAGTGGGAGAATATGGATTTGGGAATGACTCACCTCGAA
GTATTGCAGAGACCGTTGTTAACAGACTCTCACAACATCATGCTGTGCCTAAAGCGACCTCTAAAGAGCTTGGTGAAAATATACCCATCAAAGTTACAGACAATGGGAAC
AATGCACTACAACTCAATAGTCAATATCCAGAGATCTGAGCTAGAAGCAGAAACTTTATTTTCAAGACTATTGAAAGAGAAACTATACTCTAAGGAGCTGGAAGTGGAGC
AGTTGCAAGCTGAACTGGTGACAGCAGTAAGAGGGAATGACATACTAAAATGTGAAATCCAGAATGTAATGGATAGCCTTTCCTGCCTCACTCATACGATGAAATATCTC
GAACTTCAGGATTATTTGTCAAAGCTACAGGCATTGGAAGAGGACATTTTGCTGAAGGAAGGTCAGATAACAATCTTGAAAGACACACTTGGGAGTAAATCTATTGAATT
TCTTGCTCCTCCCAGTTCTACGTGGGAATTTCAACTGCAGTAA
Protein sequence
MDVWSTQYSHSICQRSSPQCSKDLSSSKSRKQIDDSERSSTSPKLRRTWSLSSAAFRDQGQINFYSSSDPSRSPGNASSGSRRQHEQSSCPSREMKFKVKHAQMEMPNDY
HSSGPVRPCSRTCYDSSGNSSTSSSTVSNGVLARYIDGEQHREINGSMNKYSQRSNGWRPPGAQCLPPSSTTAKIADKPDSIHQEKLNVLFLVSCLKKWENMDLGMTHLE
VLQRPLLTDSHNIMLCLKRPLKSLVKIYPSKLQTMGTMHYNSIVNIQRSELEAETLFSRLLKEKLYSKELEVEQLQAELVTAVRGNDILKCEIQNVMDSLSCLTHTMKYL
ELQDYLSKLQALEEDILLKEGQITILKDTLGSKSIEFLAPPSSTWEFQLQ
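
The CDS and protein sequences above are related by translation. A minimal sketch of that step, assuming the standard nuclear genetic code (the record does not name a translation table) and using the hypothetical helper translate, is shown on the first and last codons of the CDS:

```python
from itertools import product

# Standard genetic code in TCAG order (assumed; the record does not specify a table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First six and last three codons of the CDS listed above.
print(translate("ATGGACGTTTGGTCGACG"))  # MDVWST
print(translate("CTGCAGTAA"))           # LQ*
```

The first call reproduces the start of the protein sequence (MDVWST), and the second its final two residues followed by the stop codon.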