; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0009987 (gene) of Chayote v1 genome

Gene IDSed0009987
OrganismSechium edule (Chayote v1)
DescriptionProtein SOGA3-like
Genome locationLG09:38217647..38219694
RNA-Seq ExpressionSed0009987
SyntenySed0009987
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058663.1 protein SOGA3-like [Cucumis melo var. makuwa]2.9e-12577.04Show/hide
Query:  MNDMDHHAQISMNQN--VWNHPPNLLPVLQLENRGMFLQTSAAFNFNQSGG-----FRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKKS
        MNDMD H QI MN N  VW  PPNL PVL LENRGMF Q S               FRKGK  D RR D  +T YK SNWNELQYQNRLKS+RF+PKKKS
Subjt:  MNDMDHHAQISMNQN--VWNHPPNLLPVLQLENRGMFLQTSAAFNFNQSGG-----FRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKKS

Query:  GYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDE-------EGGGSGDSDVE
         YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVL TPIFSPLREVLVDMAKEEWGLDGYGSMKGLIR+RS KDY+DE       E G SGDSDVE
Subjt:  GYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDE-------EGGGSGDSDVE

Query:  GHLEVERRLDHDLSRFEMICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMDNNKEETAASENAFGN
        GHLEVERRLDHDLSRFEMICP S GEE STLLE+RVDD+DCHI+QLEEENLTLKERVFFMERELEDL+ RVQ LE +GW  HL DNNKEETAASENAF N
Subjt:  GHLEVERRLDHDLSRFEMICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMDNNKEETAASENAFGN

Query:  GGIGHDCYETSINDTGNE
        GG  H C E SIND GNE
Subjt:  GGIGHDCYETSINDTGNE

KAE8646411.1 hypothetical protein Csa_016850 [Cucumis sativus]6.4e-12577.04Show/hide
Query:  MNDMDHHAQISMNQN--VWNHPPNLLPVLQLENRGMFLQTSAAFNFNQSGG-----FRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKKS
        MNDMD H QI MN N  VW  PPNL PVL LENRG FLQ S       S       FRKGK  D RR D  + +YK SNWNELQYQNRLKS+RF+PKKKS
Subjt:  MNDMDHHAQISMNQN--VWNHPPNLLPVLQLENRGMFLQTSAAFNFNQSGG-----FRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKKS

Query:  GYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDE-------EGGGSGDSDVE
         YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVL TPIFSPLREVLVDMAKEEWGLDGYGSMKGLIR+RS KDY+DE       E G SGDSDVE
Subjt:  GYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDE-------EGGGSGDSDVE

Query:  GHLEVERRLDHDLSRFEMICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMDNNKEETAASENAFGN
        GHLEVERRLDHDLSRFEMICP S GEE STLLE+RVDD+DCHI+QLEEENLTLKERVFFMERELEDL+ RVQ LE +GW  HLMDNNKEETAASENAF N
Subjt:  GHLEVERRLDHDLSRFEMICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMDNNKEETAASENAFGN

Query:  GGIGHDCYETSINDTGNE
        GG    C E SIND GNE
Subjt:  GGIGHDCYETSINDTGNE

KAG6593334.1 hypothetical protein SDJN03_12810, partial [Cucurbita argyrosperma subsp. sororia]8.4e-12578.21Show/hide
Query:  MNDMDHHAQISMNQN--VWNHPPNLLPVLQLENRGMFLQ-----TSAAFNFNQSGGFRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKKS
        MNDMD H QI MNQN  VW HPPNL PVL LENRGMFL       S A   NQ    RKG+W D RR  SIS+AYKPSNWNELQYQNRLKS+RFHPKKK 
Subjt:  MNDMDHHAQISMNQN--VWNHPPNLLPVLQLENRGMFLQ-----TSAAFNFNQSGGFRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKKS

Query:  GYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDEEGGGSGDSDVEGHLEVER
         YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVL TPIFSPLREVLVDMAKEEWGLDGYGSMKGLIR+RS +D +++E GGSGDSDVEGHLEVER
Subjt:  GYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDEEGGGSGDSDVEGHLEVER

Query:  RLDHDLSRFEMICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMDN-NKEETAASENAFGNGGIGHD
        RLDHDLSRFEMICPISGGEE S++LESRVDDQD HIAQL+EENLTLKERVFFMERELE+L+ RVQFLE +G C  LMDN +K+ETAASENA  NG IGH 
Subjt:  RLDHDLSRFEMICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMDN-NKEETAASENAFGNGGIGHD

Query:  CYETSINDTGNE
        C +   N  GNE
Subjt:  CYETSINDTGNE

XP_022144844.1 uncharacterized protein LOC111014426 [Momordica charantia]3.0e-13080.7Show/hide
Query:  MNDMDHHAQISMNQN--VWNHPPNLLPVLQLENRGMFLQ-----TSAAFNFNQSGGFRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKKS
        MNDMD  AQI MN+N  VW  PP LLP   LENRGMFLQ      S+A   NQ  GFRKGK  D RR DS++T YKPSNWNELQY NRLKS+RF+PKKKS
Subjt:  MNDMDHHAQISMNQN--VWNHPPNLLPVLQLENRGMFLQ-----TSAAFNFNQSGGFRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKKS

Query:  GYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDE----EGGGSGDSDVEGHL
         YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVL TPIFSPLREVLVDMAKEEWG+DGYGSMKGLIR+RS KDY+DE    E GGSGDSDVEGHL
Subjt:  GYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDE----EGGGSGDSDVEGHL

Query:  EVERRLDHDLSRFEMICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMD-NNKEETAASENAFGNGG
        EVERRLDHDLSRFEMICP SGGEE STLLESRVDDQD HIAQLEEENLTLKERVFFMERELEDL+ RVQ LE +GW LH +D NNKEETAASENAF NGG
Subjt:  EVERRLDHDLSRFEMICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMD-NNKEETAASENAFGNGG

Query:  IGHDCYETSINDTGNE
        IGH C ETSIND GNE
Subjt:  IGHDCYETSINDTGNE

XP_038899694.1 uncharacterized protein LOC120086952 [Benincasa hispida]1.5e-12677.99Show/hide
Query:  MNDMDHHAQISMNQN--VWNHPPNLLPVLQ-LENRGMFLQTSAAF-----NFNQSGGFRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKK
        MNDMD H QI +N N  VW  PPNL PVL  LENR MFLQ S          N   GFRKGK  D RR D  +TAYKPSNWNELQYQNRLKS+RF+PKKK
Subjt:  MNDMDHHAQISMNQN--VWNHPPNLLPVLQ-LENRGMFLQTSAAF-----NFNQSGGFRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKK

Query:  SGYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDEEG------GGSGDSDVE
        S YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVL TPIFSPLREVLVDMAKEEWGLDGYGSMKGLIR+RS KDY+DEE       G SGDSDVE
Subjt:  SGYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDEEG------GGSGDSDVE

Query:  GHLEVERRLDHDLSRFEMICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMDNNKEETAASENAFGN
        GHLEVERRLDHDLSRFEMICP SG EE STLLESRVDD+DCHI+QLEEENLTLKERVFFMERELEDL+TR+Q LE +GW L LMDNNKEETAASENAF N
Subjt:  GHLEVERRLDHDLSRFEMICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMDNNKEETAASENAFGN

Query:  GGIGHDCYETSINDTGNE
        GG  H   E SIN  GNE
Subjt:  GGIGHDCYETSINDTGNE

TrEMBL top hitse value%identityAlignment
A0A0A0K9V7 Uncharacterized protein1.7e-12376.92Show/hide
Query:  HAQISMNQN--VWNHPPNLLPVLQLENRGMFLQTSAAFNFNQSGG-----FRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKKSGYRFPP
        H QI MN N  VW  PPNL PVL LENRG FLQ S       S       FRKGK  D RR D  + +YK SNWNELQYQNRLKS+RF+PKKKS YRFPP
Subjt:  HAQISMNQN--VWNHPPNLLPVLQLENRGMFLQTSAAFNFNQSGG-----FRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKKSGYRFPP

Query:  FAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDE-------EGGGSGDSDVEGHLEVE
        FAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVL TPIFSPLREVLVDMAKEEWGLDGYGSMKGLIR+RS KDY+DE       E G SGDSDVEGHLEVE
Subjt:  FAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDE-------EGGGSGDSDVEGHLEVE

Query:  RRLDHDLSRFEMICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMDNNKEETAASENAFGNGGIGHD
        RRLDHDLSRFEMICP S GEE STLLE+RVDD+DCHI+QLEEENLTLKERVFFMERELEDL+ RVQ LE +GW  HLMDNNKEETAASENAF NGG    
Subjt:  RRLDHDLSRFEMICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMDNNKEETAASENAFGNGGIGHD

Query:  CYETSINDTGNE
        C E SIND GNE
Subjt:  CYETSINDTGNE

A0A5D3CGI4 Protein SOGA3-like1.4e-12577.04Show/hide
Query:  MNDMDHHAQISMNQN--VWNHPPNLLPVLQLENRGMFLQTSAAFNFNQSGG-----FRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKKS
        MNDMD H QI MN N  VW  PPNL PVL LENRGMF Q S               FRKGK  D RR D  +T YK SNWNELQYQNRLKS+RF+PKKKS
Subjt:  MNDMDHHAQISMNQN--VWNHPPNLLPVLQLENRGMFLQTSAAFNFNQSGG-----FRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKKS

Query:  GYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDE-------EGGGSGDSDVE
         YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVL TPIFSPLREVLVDMAKEEWGLDGYGSMKGLIR+RS KDY+DE       E G SGDSDVE
Subjt:  GYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDE-------EGGGSGDSDVE

Query:  GHLEVERRLDHDLSRFEMICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMDNNKEETAASENAFGN
        GHLEVERRLDHDLSRFEMICP S GEE STLLE+RVDD+DCHI+QLEEENLTLKERVFFMERELEDL+ RVQ LE +GW  HL DNNKEETAASENAF N
Subjt:  GHLEVERRLDHDLSRFEMICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMDNNKEETAASENAFGN

Query:  GGIGHDCYETSINDTGNE
        GG  H C E SIND GNE
Subjt:  GGIGHDCYETSINDTGNE

A0A6J1CTG2 uncharacterized protein LOC1110144261.4e-13080.7Show/hide
Query:  MNDMDHHAQISMNQN--VWNHPPNLLPVLQLENRGMFLQ-----TSAAFNFNQSGGFRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKKS
        MNDMD  AQI MN+N  VW  PP LLP   LENRGMFLQ      S+A   NQ  GFRKGK  D RR DS++T YKPSNWNELQY NRLKS+RF+PKKKS
Subjt:  MNDMDHHAQISMNQN--VWNHPPNLLPVLQLENRGMFLQ-----TSAAFNFNQSGGFRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKKS

Query:  GYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDE----EGGGSGDSDVEGHL
         YRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVL TPIFSPLREVLVDMAKEEWG+DGYGSMKGLIR+RS KDY+DE    E GGSGDSDVEGHL
Subjt:  GYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDE----EGGGSGDSDVEGHL

Query:  EVERRLDHDLSRFEMICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMD-NNKEETAASENAFGNGG
        EVERRLDHDLSRFEMICP SGGEE STLLESRVDDQD HIAQLEEENLTLKERVFFMERELEDL+ RVQ LE +GW LH +D NNKEETAASENAF NGG
Subjt:  EVERRLDHDLSRFEMICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMD-NNKEETAASENAFGNGG

Query:  IGHDCYETSINDTGNE
        IGH C ETSIND GNE
Subjt:  IGHDCYETSINDTGNE

A0A6J1H988 uncharacterized protein LOC1114612442.7e-12177.67Show/hide
Query:  MNQN--VWNHPPNLLPVLQLENRGMFLQ-----TSAAFNFNQSGGFRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKKSGYRFPPFAPRN
        MNQN  VW HPPNL PVL LENRGMFL       S A   NQ    RKG+W D +R  SIS+AYKPSNWNELQYQNRLKS+RFHPKKK  YRFPPFAPRN
Subjt:  MNQN--VWNHPPNLLPVLQLENRGMFLQ-----TSAAFNFNQSGGFRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKKSGYRFPPFAPRN

Query:  TTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDEEGGGSGDSDVEGHLEVERRLDHDLSRFEM
        TTSFLIRAKRSGGIASLVSP PVTPAVL TPIFSPLREVLVDMAKEEWGLDGYGSMKGLIR+RS +D +++E GGSGDSDVEGHLEVERRLDHDLSRFEM
Subjt:  TTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDEEGGGSGDSDVEGHLEVERRLDHDLSRFEM

Query:  ICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMDNNKEETAASENAFGNGGIGHDCYETSINDTGNE
        ICPISGGEE S++LESRVDDQD HIAQL+EENLTLKERVFFMERELE+L+ RVQFLE +G CL + +NNK+ETAASENA  NG IGH C +   N  GNE
Subjt:  ICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMDNNKEETAASENAFGNGGIGHDCYETSINDTGNE

A0A6J1KRB8 uncharacterized protein LOC1114979718.4e-12378.33Show/hide
Query:  MNQN--VWNHPPNLLPVLQLENRGMFLQ-----TSAAFNFNQSGGFRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKKSGYRFPPFAPRN
        MNQN  VW HPPNL PVL LENRGMFL       S A   NQ    RKG+WFD RR  SIS+AYKPSNWNELQYQNRLKS+RFHPKKK  YRFPPFAPRN
Subjt:  MNQN--VWNHPPNLLPVLQLENRGMFLQ-----TSAAFNFNQSGGFRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKKSGYRFPPFAPRN

Query:  TTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDEEGGGSGDSDVEGHLEVERRLDHDLSRFEM
        TTSFLIRAKRSGGIASLVSPYPVTPAVL TPIFSPLREVLVDMAKEEWGLDGYGSMKGLIR+RS +D +++E GGSGDSDVEGHLEVERRLDHDLSRFEM
Subjt:  TTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDEEGGGSGDSDVEGHLEVERRLDHDLSRFEM

Query:  ICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMDNNKEETAASENAFGNGGIGHDCYETSINDTGNE
        ICPISGGEE S++LESRVDDQD HIAQL+EENLTLKERVFFMERELE+L+ RVQFLE +G CL + +NNK+ETA SENA  NG IGH C +   N  GNE
Subjt:  ICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMDNNKEETAASENAFGNGGIGHDCYETSINDTGNE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G19900.1 PRLI-interacting factor, putative5.9e-7664.94Show/hide
Query:  YKPSNWNELQYQNRLKSKRFHPKKKSGYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVR
        YKP   NELQ QNRLK+++F+PKKK G R+ P+APRNTTSF+IRAK+SGGIA LVSP PVTPAVL TP+FSP REVL DMAKEEWG+DGYGSMKGLIR+R
Subjt:  YKPSNWNELQYQNRLKSKRFHPKKKSGYRFPPFAPRNTTSFLIRAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVR

Query:  S----LKDY--DDEEGGGSGDSDVEGHLEVERRLDHDLSRFEMICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLE
        +    L+ Y  DDE+ GGS +SDVE H+EVERRLDHDLSRFEMI P  GG E++ +LE+RVDDQD HIAQLEEENLTLKER+F MEREL D++ R+Q+LE
Subjt:  S----LKDY--DDEEGGGSGDSDVEGHLEVERRLDHDLSRFEMICPISGGEEHSTLLESRVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLE

Query:  NQGWCLHLMDNNKEETAASENAFGNGGIGHD
         +   +   D N+E       + G+   G D
Subjt:  NQGWCLHLMDNNKEETAASENAFGNGGIGHD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGATATGGATCATCATGCCCAGATCTCGATGAATCAAAACGTTTGGAATCATCCTCCAAATTTGCTTCCAGTTCTTCAGCTTGAGAATCGAGGTATGTTTCTTCA
AACCTCTGCAGCTTTTAATTTCAATCAAAGCGGAGGTTTCAGAAAGGGGAAATGGTTTGATACGAGGCGAAATGATTCGATTTCAACAGCCTATAAACCCTCGAATTGGA
ACGAATTACAGTACCAAAATCGCTTGAAATCTAAGCGTTTTCATCCCAAGAAGAAATCCGGTTACCGTTTTCCTCCATTCGCGCCTCGTAATACCACTTCCTTCTTGATT
CGTGCCAAACGATCCGGCGGAATCGCGTCGTTGGTTTCGCCGTATCCGGTAACCCCTGCTGTGTTGTCCACTCCCATATTTTCGCCCCTTAGGGAAGTTCTTGTTGATAT
GGCTAAGGAGGAGTGGGGTCTAGATGGTTATGGCTCGATGAAGGGTCTGATTCGGGTTCGATCTTTGAAAGACTACGACGACGAGGAAGGCGGGGGATCGGGGGATAGCG
ATGTGGAAGGGCATTTGGAGGTGGAGAGACGGCTGGATCATGATTTAAGTAGGTTTGAAATGATTTGCCCAATTTCTGGAGGTGAAGAACATAGTACTCTTTTGGAGAGT
AGAGTGGATGATCAAGATTGTCACATAGCTCAGCTTGAGGAGGAGAATTTGACATTGAAGGAAAGGGTATTTTTCATGGAGAGAGAATTGGAAGATCTAAAAACCAGGGT
TCAATTCTTGGAAAATCAAGGTTGGTGCCTGCATCTCATGGATAATAACAAAGAGGAAACCGCAGCTTCGGAGAACGCCTTTGGCAACGGTGGCATCGGCCATGACTGCT
ACGAGACAAGCATCAATGACACTGGGAATGAATGA
mRNA sequenceShow/hide mRNA sequence
GAAACAGAGAAGTTTCGCAAACGAAATTTCCGCCATTGATGGAGCTCAAGAGTTTTATGTGTTCATAATCCTCTTTTCTACTCCAATTTCAGAATCATAATCCTTCAATT
TCTTCACCAAAACAGCGATTGAGATTCATAATTAATATCTTCAAATAAATTTCCTCAGCAATTGTTTCAGGAAAACTTCTCTGTTTCCGATTCCTAGGGTTCCTGGTTTA
AACTCGCATGTCCTTAATTCCTCGTTACAATTAGCAGTTTCTTTCTCGCTTTTTTTGTTTATGTGTTCTTGTTTCTGTAATTGGCGTTTCAGAAATTTGATTTTGCTTCG
ATTAGATGAATGATATGGATCATCATGCCCAGATCTCGATGAATCAAAACGTTTGGAATCATCCTCCAAATTTGCTTCCAGTTCTTCAGCTTGAGAATCGAGGTATGTTT
CTTCAAACCTCTGCAGCTTTTAATTTCAATCAAAGCGGAGGTTTCAGAAAGGGGAAATGGTTTGATACGAGGCGAAATGATTCGATTTCAACAGCCTATAAACCCTCGAA
TTGGAACGAATTACAGTACCAAAATCGCTTGAAATCTAAGCGTTTTCATCCCAAGAAGAAATCCGGTTACCGTTTTCCTCCATTCGCGCCTCGTAATACCACTTCCTTCT
TGATTCGTGCCAAACGATCCGGCGGAATCGCGTCGTTGGTTTCGCCGTATCCGGTAACCCCTGCTGTGTTGTCCACTCCCATATTTTCGCCCCTTAGGGAAGTTCTTGTT
GATATGGCTAAGGAGGAGTGGGGTCTAGATGGTTATGGCTCGATGAAGGGTCTGATTCGGGTTCGATCTTTGAAAGACTACGACGACGAGGAAGGCGGGGGATCGGGGGA
TAGCGATGTGGAAGGGCATTTGGAGGTGGAGAGACGGCTGGATCATGATTTAAGTAGGTTTGAAATGATTTGCCCAATTTCTGGAGGTGAAGAACATAGTACTCTTTTGG
AGAGTAGAGTGGATGATCAAGATTGTCACATAGCTCAGCTTGAGGAGGAGAATTTGACATTGAAGGAAAGGGTATTTTTCATGGAGAGAGAATTGGAAGATCTAAAAACC
AGGGTTCAATTCTTGGAAAATCAAGGTTGGTGCCTGCATCTCATGGATAATAACAAAGAGGAAACCGCAGCTTCGGAGAACGCCTTTGGCAACGGTGGCATCGGCCATGA
CTGCTACGAGACAAGCATCAATGACACTGGGAATGAATGAGCTGACAAGATCAAGGTTGGTGCCTGCATCTCCTGGACAATAACAAAGAGGAAATTGCAGCTTCTGAGAA
CGCATTTGATAACGGTGGCATCGGCCACGCTTGCCACGAGACGAGCATCAATGACACTGGGAATGAATGAGATGACAAGTCTCAGTCACTCATGAGGTTAGTTCGTGGAA
TACACTTTGCTGCATTAGTATTGACTCTTGATCATGTTTTAGGATTTTAGCTTTGACTTTCAGTTTGTGTAGTTTAGTGACTTGTTTTTTTATACTTTTTCCTTGTGGGT
CTGATTCAAATCTCAATCTTTTGACCTCAAGGATAAGAAATAGATAGATATCATTAGTGTACAGAGCTAATTCTTTGATGTATGAACTTTGTTCTATCTGTTCTTTTGTT
TATGGATCTCAATCTTGTGGTCTTGTTCTCTCTTGATATAATAGAACTCTGAAGACTTGAGTCATCTTCTTGCAAGTTGCAAATGTATTGTAGATTAGAGCTGACTGTGT
TTTTGGTTCTGATCTTTATTTTCTAATGGAACACACTTCACGAAGTGTTCGATTGCAGTGCCCATAACCATCCTGGAATGCTTTAGTAGTAGCCGAGAGGAGGACTTAGG
GACCACAACTTGTGGTTACCTAACTAGTAGTTCTCGACAATCAAACGTTGTAAAGTTAGGCGATTTGTCTATGAGAATAGTCGAGATGCGTGTAAGTTGGCAATTCCAGG
CTTTGTATCAGCTCTTTTGGCTGTGGGAGCCTGGTTCTTCAATTCAATTGGTGATTTCTGAAGCCAAT
Protein sequenceShow/hide protein sequence
MNDMDHHAQISMNQNVWNHPPNLLPVLQLENRGMFLQTSAAFNFNQSGGFRKGKWFDTRRNDSISTAYKPSNWNELQYQNRLKSKRFHPKKKSGYRFPPFAPRNTTSFLI
RAKRSGGIASLVSPYPVTPAVLSTPIFSPLREVLVDMAKEEWGLDGYGSMKGLIRVRSLKDYDDEEGGGSGDSDVEGHLEVERRLDHDLSRFEMICPISGGEEHSTLLES
RVDDQDCHIAQLEEENLTLKERVFFMERELEDLKTRVQFLENQGWCLHLMDNNKEETAASENAFGNGGIGHDCYETSINDTGNE