; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0000523 (gene) of Chayote v1 genome

Gene IDSed0000523
OrganismSechium edule (Chayote v1)
DescriptionDUF789 domain-containing protein
Genome locationLG06:43707978..43712691
RNA-Seq ExpressionSed0000523
SyntenySed0000523
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573941.1 hypothetical protein SDJN03_27828, partial [Cucurbita argyrosperma subsp. sororia]3.3e-4239.63Show/hide
Query:  FGRDGGDERFYDSSKAR---LNRRSDRLCKPQERASV-------------------DSLAADEASEPVSISNPEPVVS-LSNLER-LQS------ALFLS
        FGR  G++RFYDSS+AR   L+R++DRLC+PQE AS                    D + +DEA++PV    P+P VS LSNLER LQS      A FLS
Subjt:  FGRDGGDERFYDSSKAR---LNRRSDRLCKPQERASV-------------------DSLAADEASEPVSISNPEPVVS-LSNLER-LQS------ALFLS

Query:  KSELRGWRMSDLKRQPYFML------------------------CGVVQNYVPYLSGIQLYGMLLG----------------------------HQIFGC
        KS LRGW+ SD +RQPYF+L                         GVVQ YVPYLSGIQLYG                                 +I  C
Subjt:  KSELRGWRMSDLKRQPYFML------------------------CGVVQNYVPYLSGIQLYGMLLG----------------------------HQIFGC

Query:  LE-----GNGVRKVIRMDRLSLSDQLLGLHEDCSSDEAES---------------------------SILISK------MWHSNNGPC-----FSYPVYG
         E        +   +RMDRLSL DQ LG  EDCSSDEAES                           S L S+      +   +  PC       YP+Y 
Subjt:  LE-----GNGVRKVIRMDRLSLSDQLLGLHEDCSSDEAES---------------------------SILISK------MWHSNNGPC-----FSYPVYG

Query:  IRSGKTLKDLDAYFLTYHSLHTAIGG----------------------------SYKFKGSSMWFRNGGVDHQLANKLLLE
        I +G+TLKDLDA FLTYHSLHTA+GG                            SYKFKGSS+W RNGGV+HQLANKL  E
Subjt:  IRSGKTLKDLDAYFLTYHSLHTAIGG----------------------------SYKFKGSSMWFRNGGVDHQLANKLLLE

XP_022150656.1 uncharacterized protein LOC111018737 [Momordica charantia]1.9e-4239.01Show/hide
Query:  FGRDGGDERFYDSSKAR---LNRRSDRLCKPQERASV-------DS---------LAADEASEPVSI--SNPEPVVS-LSNLER-LQS------ALFLSK
        FGR  G++RFYDSS+AR   L+R++DRLC+PQE AS        DS         +A+DEA++PV++   NP+PVVS LSNLER LQS      A F SK
Subjt:  FGRDGGDERFYDSSKAR---LNRRSDRLCKPQERASV-------DS---------LAADEASEPVSI--SNPEPVVS-LSNLER-LQS------ALFLSK

Query:  SELRGWRMSDLKRQPYFML------------------------CGVVQNYVPYLSGIQLYGM----------------LLGHQIFGCLEGNGVRKV----
        S LRGWR  D + QPYF+L                         GVVQ YVPYLSGIQLYGM                       G  +    R++    
Subjt:  SELRGWRMSDLKRQPYFML------------------------CGVVQNYVPYLSGIQLYGM----------------LLGHQIFGCLEGNGVRKV----

Query:  -------------IRMDRLSLSDQLLGLHEDCSSDEAE------------------------------------------SSILISKMWHSNNGPCFSYP
                     +R+DRLSL DQ +GLHEDCSSDEAE                                          S  L+   W S       YP
Subjt:  -------------IRMDRLSLSDQLLGLHEDCSSDEAE------------------------------------------SSILISKMWHSNNGPCFSYP

Query:  VYGIRSGKTLKDLDAYFLTYHSLHTAIGG-----------------------------SYKFKGSSMWFRNGGVDHQLANKL
        +Y I +G+TLKDLDA FLTYHSLHTAI G                             SYKFKGSS+W RNGGV+HQLAN L
Subjt:  VYGIRSGKTLKDLDAYFLTYHSLHTAIGG-----------------------------SYKFKGSSMWFRNGGVDHQLANKL

XP_022968119.1 uncharacterized protein LOC111467452 isoform X1 [Cucurbita maxima]1.3e-4139.06Show/hide
Query:  FGRDGGDERFYDSSKAR---LNRRSDRLCKPQERASV-------------------DSLAADEASEPVSISNPEPVVS-LSNLER-LQS------ALFLS
        FGR  G++RFYDSS+AR   L+R++DRLC+PQE AS                    D + +DEA++P+    P+P VS LSNLER LQS      A FLS
Subjt:  FGRDGGDERFYDSSKAR---LNRRSDRLCKPQERASV-------------------DSLAADEASEPVSISNPEPVVS-LSNLER-LQS------ALFLS

Query:  KSELRGWRMSDLKRQPYFML------------------------CGVVQNYVPYLSGIQLYGMLLG----------------------------HQIFGC
        KS LRGW+ SD +RQP+F+L                         GVVQ YVPYLSGIQLYG                                 +I  C
Subjt:  KSELRGWRMSDLKRQPYFML------------------------CGVVQNYVPYLSGIQLYGMLLG----------------------------HQIFGC

Query:  LE-----GNGVRKVIRMDRLSLSDQLLGLHEDCSSDEAES---------------------------SILISK------MWHSNNGPC-----FSYPVYG
         E        +   +RMDRLSL DQ LG  EDCSSDEAES                           S L S+      M   +  PC       YP+Y 
Subjt:  LE-----GNGVRKVIRMDRLSLSDQLLGLHEDCSSDEAES---------------------------SILISK------MWHSNNGPC-----FSYPVYG

Query:  IRSGKTLKDLDAYFLTYHSLHTAIGG-------------------------------SYKFKGSSMWFRNGGVDHQLANKLLLE
        I +G+TLKDLDA FLTYHSLHTA+GG                               SYKFKGSS+W RNGGV+HQLANKL  E
Subjt:  IRSGKTLKDLDAYFLTYHSLHTAIGG-------------------------------SYKFKGSSMWFRNGGVDHQLANKLLLE

XP_022968120.1 uncharacterized protein LOC111467452 isoform X2 [Cucurbita maxima]5.6e-4239.37Show/hide
Query:  FGRDGGDERFYDSSKAR---LNRRSDRLCKPQERASV-------------------DSLAADEASEPVSISNPEPVVS-LSNLER-LQS------ALFLS
        FGR  G++RFYDSS+AR   L+R++DRLC+PQE AS                    D + +DEA++P+    P+P VS LSNLER LQS      A FLS
Subjt:  FGRDGGDERFYDSSKAR---LNRRSDRLCKPQERASV-------------------DSLAADEASEPVSISNPEPVVS-LSNLER-LQS------ALFLS

Query:  KSELRGWRMSDLKRQPYFML------------------------CGVVQNYVPYLSGIQLYGMLLG----------------------------HQIFGC
        KS LRGW+ SD +RQP+F+L                         GVVQ YVPYLSGIQLYG                                 +I  C
Subjt:  KSELRGWRMSDLKRQPYFML------------------------CGVVQNYVPYLSGIQLYGMLLG----------------------------HQIFGC

Query:  LE-----GNGVRKVIRMDRLSLSDQLLGLHEDCSSDEAES---------------------------SILISK------MWHSNNGPC-----FSYPVYG
         E        +   +RMDRLSL DQ LG  EDCSSDEAES                           S L S+      M   +  PC       YP+Y 
Subjt:  LE-----GNGVRKVIRMDRLSLSDQLLGLHEDCSSDEAES---------------------------SILISK------MWHSNNGPC-----FSYPVYG

Query:  IRSGKTLKDLDAYFLTYHSLHTAIGG----------------------------SYKFKGSSMWFRNGGVDHQLANKLLLE
        I +G+TLKDLDA FLTYHSLHTA+GG                            SYKFKGSS+W RNGGV+HQLANKL  E
Subjt:  IRSGKTLKDLDAYFLTYHSLHTAIGG----------------------------SYKFKGSSMWFRNGGVDHQLANKLLLE

XP_038892909.1 uncharacterized protein LOC120081811 [Benincasa hispida]5.1e-4338.99Show/hide
Query:  FGRDGGDERFYDSSKAR---LNRRSDRLCKPQERASV---------------DSLAADEASEPVSISNPEPVVS-LSNLERL-------QSALFLSKSEL
        FGR  G++RFYDSS+AR   L+R++DRLC PQ+ AS                D L +DEA++PV  SN +PVVS LSNLER         SA FLSKS L
Subjt:  FGRDGGDERFYDSSKAR---LNRRSDRLCKPQERASV---------------DSLAADEASEPVSISNPEPVVS-LSNLERL-------QSALFLSKSEL

Query:  RGWRMSDLKRQPYFML------------------------CGVVQNYVPYLSGIQLYGM----------------LLGHQIFGCLEGNGVRKV-------
        RGWR  DL+ QPYF+L                         GVVQ YVPYLSGIQLY M                       G  +    R++       
Subjt:  RGWRMSDLKRQPYFML------------------------CGVVQNYVPYLSGIQLYGM----------------LLGHQIFGCLEGNGVRKV-------

Query:  ----------IRMDRLSLSDQLLGLHEDCSSDEAES--------------------SILISKMWHSNNGPCFS--------------------YPVYGIR
                  +RMDRLSL DQ LGLHEDCSSDEAES                      L  K+  S+   CF                     YP+Y I 
Subjt:  ----------IRMDRLSLSDQLLGLHEDCSSDEAES--------------------SILISKMWHSNNGPCFS--------------------YPVYGIR

Query:  SGKTLKDLDAYFLTYHSLHTAIG-----------------------------GSYKFKGSSMWFRNGGVDHQLANKL
        +G+TLKDLDA FLTYHSLHT I                               SYKF GSS+W RNGGV+HQLAN L
Subjt:  SGKTLKDLDAYFLTYHSLHTAIG-----------------------------GSYKFKGSSMWFRNGGVDHQLANKL

TrEMBL top hitse value%identityAlignment
A0A6J1DC61 uncharacterized protein LOC1110187379.4e-4339.01Show/hide
Query:  FGRDGGDERFYDSSKAR---LNRRSDRLCKPQERASV-------DS---------LAADEASEPVSI--SNPEPVVS-LSNLER-LQS------ALFLSK
        FGR  G++RFYDSS+AR   L+R++DRLC+PQE AS        DS         +A+DEA++PV++   NP+PVVS LSNLER LQS      A F SK
Subjt:  FGRDGGDERFYDSSKAR---LNRRSDRLCKPQERASV-------DS---------LAADEASEPVSI--SNPEPVVS-LSNLER-LQS------ALFLSK

Query:  SELRGWRMSDLKRQPYFML------------------------CGVVQNYVPYLSGIQLYGM----------------LLGHQIFGCLEGNGVRKV----
        S LRGWR  D + QPYF+L                         GVVQ YVPYLSGIQLYGM                       G  +    R++    
Subjt:  SELRGWRMSDLKRQPYFML------------------------CGVVQNYVPYLSGIQLYGM----------------LLGHQIFGCLEGNGVRKV----

Query:  -------------IRMDRLSLSDQLLGLHEDCSSDEAE------------------------------------------SSILISKMWHSNNGPCFSYP
                     +R+DRLSL DQ +GLHEDCSSDEAE                                          S  L+   W S       YP
Subjt:  -------------IRMDRLSLSDQLLGLHEDCSSDEAE------------------------------------------SSILISKMWHSNNGPCFSYP

Query:  VYGIRSGKTLKDLDAYFLTYHSLHTAIGG-----------------------------SYKFKGSSMWFRNGGVDHQLANKL
        +Y I +G+TLKDLDA FLTYHSLHTAI G                             SYKFKGSS+W RNGGV+HQLAN L
Subjt:  VYGIRSGKTLKDLDAYFLTYHSLHTAIGG-----------------------------SYKFKGSSMWFRNGGVDHQLANKL

A0A6J1G225 uncharacterized protein LOC111449961 isoform X21.0e-4139.37Show/hide
Query:  FGRDGGDERFYDSSKAR---LNRRSDRLCKPQERASV-------------------DSLAADEASEPVSISNPEPVVS-LSNLER-LQS------ALFLS
        FGR  G++RFYDSS+AR   L+R++DRLC+PQE AS                    D + +DEA++PV    P+P VS LSNLER LQS      A FLS
Subjt:  FGRDGGDERFYDSSKAR---LNRRSDRLCKPQERASV-------------------DSLAADEASEPVSISNPEPVVS-LSNLER-LQS------ALFLS

Query:  KSELRGWRMSDLKRQPYFML------------------------CGVVQNYVPYLSGIQLYGMLLG----------------------------HQIFGC
        KS LRGW+ SD +RQPYF+L                         GVVQ YVPYLSGIQLYG                                 +I  C
Subjt:  KSELRGWRMSDLKRQPYFML------------------------CGVVQNYVPYLSGIQLYGMLLG----------------------------HQIFGC

Query:  LE-----GNGVRKVIRMDRLSLSDQLLGLHEDCSSDEAES---------------------------SILISK------MWHSNNGPC-----FSYPVYG
         E        +   +RMDRLSL DQ LG  EDCSSDEAES                           S L S+      +   +  PC       YP+Y 
Subjt:  LE-----GNGVRKVIRMDRLSLSDQLLGLHEDCSSDEAES---------------------------SILISK------MWHSNNGPC-----FSYPVYG

Query:  IRSGKTLKDLDAYFLTYHSLHTAIGG----------------------------SYKFKGSSMWFRNGGVDHQLANKLLLE
        I +G+TLKDLDA FLTYH LHTA+GG                            SYKFKGSS+W RNGGV+HQLANKL  E
Subjt:  IRSGKTLKDLDAYFLTYHSLHTAIGG----------------------------SYKFKGSSMWFRNGGVDHQLANKLLLE

A0A6J1G242 uncharacterized protein LOC111449961 isoform X12.3e-4139.06Show/hide
Query:  FGRDGGDERFYDSSKAR---LNRRSDRLCKPQERASV-------------------DSLAADEASEPVSISNPEPVVS-LSNLER-LQS------ALFLS
        FGR  G++RFYDSS+AR   L+R++DRLC+PQE AS                    D + +DEA++PV    P+P VS LSNLER LQS      A FLS
Subjt:  FGRDGGDERFYDSSKAR---LNRRSDRLCKPQERASV-------------------DSLAADEASEPVSISNPEPVVS-LSNLER-LQS------ALFLS

Query:  KSELRGWRMSDLKRQPYFML------------------------CGVVQNYVPYLSGIQLYGMLLG----------------------------HQIFGC
        KS LRGW+ SD +RQPYF+L                         GVVQ YVPYLSGIQLYG                                 +I  C
Subjt:  KSELRGWRMSDLKRQPYFML------------------------CGVVQNYVPYLSGIQLYGMLLG----------------------------HQIFGC

Query:  LE-----GNGVRKVIRMDRLSLSDQLLGLHEDCSSDEAES---------------------------SILISK------MWHSNNGPC-----FSYPVYG
         E        +   +RMDRLSL DQ LG  EDCSSDEAES                           S L S+      +   +  PC       YP+Y 
Subjt:  LE-----GNGVRKVIRMDRLSLSDQLLGLHEDCSSDEAES---------------------------SILISK------MWHSNNGPC-----FSYPVYG

Query:  IRSGKTLKDLDAYFLTYHSLHTAIGG-------------------------------SYKFKGSSMWFRNGGVDHQLANKLLLE
        I +G+TLKDLDA FLTYH LHTA+GG                               SYKFKGSS+W RNGGV+HQLANKL  E
Subjt:  IRSGKTLKDLDAYFLTYHSLHTAIGG-------------------------------SYKFKGSSMWFRNGGVDHQLANKLLLE

A0A6J1HWA7 uncharacterized protein LOC111467452 isoform X22.7e-4239.37Show/hide
Query:  FGRDGGDERFYDSSKAR---LNRRSDRLCKPQERASV-------------------DSLAADEASEPVSISNPEPVVS-LSNLER-LQS------ALFLS
        FGR  G++RFYDSS+AR   L+R++DRLC+PQE AS                    D + +DEA++P+    P+P VS LSNLER LQS      A FLS
Subjt:  FGRDGGDERFYDSSKAR---LNRRSDRLCKPQERASV-------------------DSLAADEASEPVSISNPEPVVS-LSNLER-LQS------ALFLS

Query:  KSELRGWRMSDLKRQPYFML------------------------CGVVQNYVPYLSGIQLYGMLLG----------------------------HQIFGC
        KS LRGW+ SD +RQP+F+L                         GVVQ YVPYLSGIQLYG                                 +I  C
Subjt:  KSELRGWRMSDLKRQPYFML------------------------CGVVQNYVPYLSGIQLYGMLLG----------------------------HQIFGC

Query:  LE-----GNGVRKVIRMDRLSLSDQLLGLHEDCSSDEAES---------------------------SILISK------MWHSNNGPC-----FSYPVYG
         E        +   +RMDRLSL DQ LG  EDCSSDEAES                           S L S+      M   +  PC       YP+Y 
Subjt:  LE-----GNGVRKVIRMDRLSLSDQLLGLHEDCSSDEAES---------------------------SILISK------MWHSNNGPC-----FSYPVYG

Query:  IRSGKTLKDLDAYFLTYHSLHTAIGG----------------------------SYKFKGSSMWFRNGGVDHQLANKLLLE
        I +G+TLKDLDA FLTYHSLHTA+GG                            SYKFKGSS+W RNGGV+HQLANKL  E
Subjt:  IRSGKTLKDLDAYFLTYHSLHTAIGG----------------------------SYKFKGSSMWFRNGGVDHQLANKLLLE

A0A6J1HYQ4 uncharacterized protein LOC111467452 isoform X16.1e-4239.06Show/hide
Query:  FGRDGGDERFYDSSKAR---LNRRSDRLCKPQERASV-------------------DSLAADEASEPVSISNPEPVVS-LSNLER-LQS------ALFLS
        FGR  G++RFYDSS+AR   L+R++DRLC+PQE AS                    D + +DEA++P+    P+P VS LSNLER LQS      A FLS
Subjt:  FGRDGGDERFYDSSKAR---LNRRSDRLCKPQERASV-------------------DSLAADEASEPVSISNPEPVVS-LSNLER-LQS------ALFLS

Query:  KSELRGWRMSDLKRQPYFML------------------------CGVVQNYVPYLSGIQLYGMLLG----------------------------HQIFGC
        KS LRGW+ SD +RQP+F+L                         GVVQ YVPYLSGIQLYG                                 +I  C
Subjt:  KSELRGWRMSDLKRQPYFML------------------------CGVVQNYVPYLSGIQLYGMLLG----------------------------HQIFGC

Query:  LE-----GNGVRKVIRMDRLSLSDQLLGLHEDCSSDEAES---------------------------SILISK------MWHSNNGPC-----FSYPVYG
         E        +   +RMDRLSL DQ LG  EDCSSDEAES                           S L S+      M   +  PC       YP+Y 
Subjt:  LE-----GNGVRKVIRMDRLSLSDQLLGLHEDCSSDEAES---------------------------SILISK------MWHSNNGPC-----FSYPVYG

Query:  IRSGKTLKDLDAYFLTYHSLHTAIGG-------------------------------SYKFKGSSMWFRNGGVDHQLANKLLLE
        I +G+TLKDLDA FLTYHSLHTA+GG                               SYKFKGSS+W RNGGV+HQLANKL  E
Subjt:  IRSGKTLKDLDAYFLTYHSLHTAIGG-------------------------------SYKFKGSSMWFRNGGVDHQLANKLLLE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)3.9e-0943.37Show/hide
Query:  YPVYGIRSGKTLKDLDAYFLTYHSLHTAIGG----------------------------SYKFKGSSMWFRNGGVDHQLANKL
        YP+Y I +G TLKDLDA FLTYHSLHT   G                            SYK +G S+W   GG  HQLAN L
Subjt:  YPVYGIRSGKTLKDLDAYFLTYHSLHTAIGG----------------------------SYKFKGSSMWFRNGGVDHQLANKL

AT2G01260.1 Protein of unknown function (DUF789)2.5e-1129.6Show/hide
Query:  VVQNYVPYLSGIQLYGMLLGHQIFGCL----------------------EGNGVRKVIRMDRLSLSDQLLGLHEDCSSDEAE------------------
        V+Q YVP LS IQ+Y     H +   L                      + +  R   R+D +SL DQ     ED SSD+ E                  
Subjt:  VVQNYVPYLSGIQLYGMLLGHQIFGCL----------------------EGNGVRKVIRMDRLSLSDQLLGLHEDCSSDEAE------------------

Query:  ------------------------SSILISKMWHSNNGPCFSYPVYGIRSGKTLKDLDAYFLTYHSLHTAIGG--------------------------S
                                S  L+   W S       YP+Y I +G TLKDLDA FLTYHSLHT+ GG                          S
Subjt:  ------------------------SSILISKMWHSNNGPCFSYPVYGIRSGKTLKDLDAYFLTYHSLHTAIGG--------------------------S

Query:  YKFKGSSMWFRNGGVDHQLANKL
        YKF+G S+W   GG +HQL N L
Subjt:  YKFKGSSMWFRNGGVDHQLANKL

AT2G01260.2 Protein of unknown function (DUF789)5.7e-0830.11Show/hide
Query:  VVQNYVPYLSGIQLYGMLLGHQIFGCL----------------------EGNGVRKVIRMDRLSLSDQLLGLHEDCSSDEAE------------------
        V+Q YVP LS IQ+Y     H +   L                      + +  R   R+D +SL DQ     ED SSD+ E                  
Subjt:  VVQNYVPYLSGIQLYGMLLGHQIFGCL----------------------EGNGVRKVIRMDRLSLSDQLLGLHEDCSSDEAE------------------

Query:  ------------------------SSILISKMWHSNNGPCFSYPVYGIRSGKTLKDLDAYFLTYHSLHTAIGGSYK
                                S  L+   W S       YP+Y I +G TLKDLDA FLTYHSLHT+ GG  K
Subjt:  ------------------------SSILISKMWHSNNGPCFSYPVYGIRSGKTLKDLDAYFLTYHSLHTAIGGSYK

AT4G16100.1 Protein of unknown function (DUF789)1.7e-0425.17Show/hide
Query:  DSSKARLNRRSDRLCKPQER--ASVDSLAADEASEPVSISNPEPV--VSLSNLER-------LQSALFLSKSELRGWRMSDLKRQPYFML----------
        +  K +     DR  K +E+     +  +  + S P  +S+       + SNL R       + S   L  +  +GWR  + + +PYF+L          
Subjt:  DSSKARLNRRSDRLCKPQER--ASVDSLAADEASEPVSISNPEPV--VSLSNLER-------LQSALFLSKSELRGWRMSDLKRQPYFML----------

Query:  --------------CGVVQNYVPYLSGIQLY-----GMLLGHQIFGCLEGNGVRKVI---RMDRLSLSDQL--LGLHE----DCSSDEAESSI-------
                        VVQ YVPYLSGIQLY           ++    +G+  R +      D   LS  L    L E      SSDE+E+S        
Subjt:  --------------CGVVQNYVPYLSGIQLY-----GMLLGHQIFGCLEGNGVRKVI---RMDRLSLSDQL--LGLHE----DCSSDEAESSI-------

Query:  --------------LISKMWH-SNNGPCFS-----------------YPVYGIRSGKTLKDLDAYFLTYHSLHTAIGGSYKFKGSS
                      L  K+ + S+  P                    YP+Y I  G++L++LDA FLT+HSL T   G+   +G S
Subjt:  --------------LISKMWH-SNNGPCFS-----------------YPVYGIRSGKTLKDLDAYFLTYHSLHTAIGGSYKFKGSS

AT5G49220.1 Protein of unknown function (DUF789)4.5e-0531.25Show/hide
Query:  VQNYVPYLSGIQLYGMLLGH------QIFGCLEGNGVRKVIRMD-------RLSLSDQLLGLHEDCSSDEAE----------------------------
        VQ YVPYLSGIQLY   L           G  EG+   + + +D       R+SL DQ   +    SS EAE                            
Subjt:  VQNYVPYLSGIQLYGMLLGH------QIFGCLEGNGVRKVIRMD-------RLSLSDQLLGLHEDCSSDEAE----------------------------

Query:  --------------SSILISKMWHSNNGPCFSYPVYGIRSGKTLKDLDAYFLTYHSLHTA
                      S  L+   W S +     YP+Y I  G TL++LDA FLT+HSL TA
Subjt:  --------------SSILISKMWHSNNGPCFSYPVYGIRSGKTLKDLDAYFLTYHSLHTA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACGATCGTTCCGCCATTTTCGTCCCCTTCAAGTTCGTCGTCGTCGTCGTCTTCGGTTAGGGTTCGGTCGCGACGGGGGAGATGAGAGGTTTTACGATTCATCGAA
AGCGAGGCTCAATCGTCGAAGTGATAGGCTCTGTAAACCTCAAGAACGCGCTTCGGTTGATTCTTTGGCGGCTGATGAAGCTTCTGAACCAGTTTCCATTTCTAATCCTG
AGCCGGTTGTTTCTCTAAGTAATCTCGAGCGTTTGCAGTCGGCTCTGTTTCTCTCTAAGAGTGAGTTGAGAGGTTGGAGGATGAGTGATTTGAAGAGGCAACCTTACTTT
ATGCTTTGTGGTGTGGTTCAAAATTATGTGCCGTACTTGTCTGGTATTCAATTGTATGGGATGTTATTAGGCCACCAAATCTTTGGTTGTTTGGAAGGCAATGGGGTGAG
GAAAGTGATTAGGATGGATAGATTGTCTTTGAGTGACCAACTTTTGGGACTTCATGAAGATTGCTCTAGTGATGAGGCTGAATCTTCGATTCTCATCTCCAAGATGTGGC
ACAGTAATAATGGACCTTGCTTCTCGTACCCAGTTTACGGGATACGAAGTGGGAAAACATTAAAGGATCTTGATGCTTACTTTCTCACATACCATTCTCTACACACGGCA
ATCGGAGGTTCATACAAGTTTAAAGGATCATCAATGTGGTTCAGAAATGGTGGAGTTGATCATCAATTGGCAAACAAGCTCTTGCTGGAAAATAAATGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAACGATCGTTCCGCCATTTTCGTCCCCTTCAAGTTCGTCGTCGTCGTCGTCTTCGGTTAGGGTTCGGTCGCGACGGGGGAGATGAGAGGTTTTACGATTCATCGAA
AGCGAGGCTCAATCGTCGAAGTGATAGGCTCTGTAAACCTCAAGAACGCGCTTCGGTTGATTCTTTGGCGGCTGATGAAGCTTCTGAACCAGTTTCCATTTCTAATCCTG
AGCCGGTTGTTTCTCTAAGTAATCTCGAGCGTTTGCAGTCGGCTCTGTTTCTCTCTAAGAGTGAGTTGAGAGGTTGGAGGATGAGTGATTTGAAGAGGCAACCTTACTTT
ATGCTTTGTGGTGTGGTTCAAAATTATGTGCCGTACTTGTCTGGTATTCAATTGTATGGGATGTTATTAGGCCACCAAATCTTTGGTTGTTTGGAAGGCAATGGGGTGAG
GAAAGTGATTAGGATGGATAGATTGTCTTTGAGTGACCAACTTTTGGGACTTCATGAAGATTGCTCTAGTGATGAGGCTGAATCTTCGATTCTCATCTCCAAGATGTGGC
ACAGTAATAATGGACCTTGCTTCTCGTACCCAGTTTACGGGATACGAAGTGGGAAAACATTAAAGGATCTTGATGCTTACTTTCTCACATACCATTCTCTACACACGGCA
ATCGGAGGTTCATACAAGTTTAAAGGATCATCAATGTGGTTCAGAAATGGTGGAGTTGATCATCAATTGGCAAACAAGCTCTTGCTGGAAAATAAATGCTGA
Protein sequenceShow/hide protein sequence
MERSFRHFRPLQVRRRRRLRLGFGRDGGDERFYDSSKARLNRRSDRLCKPQERASVDSLAADEASEPVSISNPEPVVSLSNLERLQSALFLSKSELRGWRMSDLKRQPYF
MLCGVVQNYVPYLSGIQLYGMLLGHQIFGCLEGNGVRKVIRMDRLSLSDQLLGLHEDCSSDEAESSILISKMWHSNNGPCFSYPVYGIRSGKTLKDLDAYFLTYHSLHTA
IGGSYKFKGSSMWFRNGGVDHQLANKLLLENKC