; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10019316 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10019316
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionBEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein .
Genome locationChr04:20230935..20237454
RNA-Seq ExpressionHG10019316
SyntenyHG10019316
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6581290.1 hypothetical protein SDJN03_21292, partial [Cucurbita argyrosperma subsp. sororia]5.7e-13391.94Show/hide
Query:  MGVESNSAPPPPP---TSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDD
        MGVESNS PPPPP   +SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLY RDEMS QYSPMSEDSDD
Subjt:  MGVESNSAPPPPP---TSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDD

Query:  CRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGP
        CRFCET+TNLFPSQSDSSVPTSPVSPYRYQRPFSG+TPSTGTNTSLGC+T PVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMR QP GP
Subjt:  CRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGP

Query:  SSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSMGVSEPEYNEEKSCKDLNRDMKDMNSESGG
        SSMELPYCSMPEPGP+IEAEER CS +KSLVDERVYQL ECSSMGVSEPEYNE+KSCKDLNR+MKD  SESGG
Subjt:  SSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSMGVSEPEYNEEKSCKDLNRDMKDMNSESGG

XP_004152357.1 uncharacterized protein LOC101212915 isoform X1 [Cucumis sativus]3.2e-13694.51Show/hide
Query:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSA PPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCS-TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSS
        CET+TNLFPSQSDSSVPTSPVSPYRYQRPFSGV PSTGTNTSLGCS TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMR QPPGPSS
Subjt:  CETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCS-TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSS

Query:  MELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--GVSEPEYNEEKSCKDLNRDMKDMNSESGG
        MELPYCSMPEPGP+IEAE+RPCSC+KSLVDERVYQLEECSSM  GVSE EYNE+KSCKDLNRDMKD  S SGG
Subjt:  MELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--GVSEPEYNEEKSCKDLNRDMKDMNSESGG

XP_008454305.1 PREDICTED: uncharacterized protein LOC103494744 isoform X1 [Cucumis melo]5.0e-13793.38Show/hide
Query:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSAPPPPPTSSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSM
        CET+TNLFPSQSDSSVPTSPVSPYRYQRPFSG+ PS GTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMR QP GPSSM
Subjt:  CETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSM

Query:  ELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--GVSEPEYNEEKSCKDLNRDMKDMNSESGG
        ELPYCSMPEPGP+IEAE+RPCSC+KSLVDERVYQLEECSSM  GVSE EYNE+KSCKDLNRDMKD  S+SGG
Subjt:  ELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--GVSEPEYNEEKSCKDLNRDMKDMNSESGG

XP_022934215.1 uncharacterized protein LOC111441454 isoform X1 [Cucurbita moschata]4.3e-13392.28Show/hide
Query:  MGVESNSAPPPPP--TSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDC
        MGVESNS PPPPP  +SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLY RDEMS QYSPMSEDSDDC
Subjt:  MGVESNSAPPPPP--TSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDC

Query:  RFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPS
        RFCET+TNLFPSQSDSSVPTSPVSPYRYQRPFSG+TPSTGTNTSLGC+T PVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMR QP GPS
Subjt:  RFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPS

Query:  SMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSMGVSEPEYNEEKSCKDLNRDMKDMNSESGG
        SMELPYCSMPEPGP+IEAEER CS +KSLVDERVYQL ECSSMGVSEPEYNE+KSCKDLNR+MKD  SESGG
Subjt:  SMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSMGVSEPEYNEEKSCKDLNRDMKDMNSESGG

XP_038904570.1 uncharacterized protein LOC120090942 [Benincasa hispida]2.9e-13794.81Show/hide
Query:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSAPPP   SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQR+EMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSM
        CET+TNLFP+QSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMR QP GPSSM
Subjt:  CETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSM

Query:  ELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSMGVSEPEYNEEKSCKDLNRDMKDMNSESGG
        ELPYCSMPEPGP+IEAEERPCSC+KSLVDER +QLEECSSMGVSEPEYNEEKSCKDLNRDMKD  SESGG
Subjt:  ELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSMGVSEPEYNEEKSCKDLNRDMKDMNSESGG

TrEMBL top hitse value%identityAlignment
A0A0A0KWN0 Uncharacterized protein1.6e-13694.51Show/hide
Query:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSA PPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCS-TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSS
        CET+TNLFPSQSDSSVPTSPVSPYRYQRPFSGV PSTGTNTSLGCS TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMR QPPGPSS
Subjt:  CETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCS-TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSS

Query:  MELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--GVSEPEYNEEKSCKDLNRDMKDMNSESGG
        MELPYCSMPEPGP+IEAE+RPCSC+KSLVDERVYQLEECSSM  GVSE EYNE+KSCKDLNRDMKD  S SGG
Subjt:  MELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--GVSEPEYNEEKSCKDLNRDMKDMNSESGG

A0A1S3BXT2 uncharacterized protein LOC103494744 isoform X12.4e-13793.38Show/hide
Query:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSAPPPPPTSSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSM
        CET+TNLFPSQSDSSVPTSPVSPYRYQRPFSG+ PS GTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMR QP GPSSM
Subjt:  CETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSM

Query:  ELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--GVSEPEYNEEKSCKDLNRDMKDMNSESGG
        ELPYCSMPEPGP+IEAE+RPCSC+KSLVDERVYQLEECSSM  GVSE EYNE+KSCKDLNRDMKD  S+SGG
Subjt:  ELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--GVSEPEYNEEKSCKDLNRDMKDMNSESGG

A0A5A7TRC2 Uncharacterized protein2.4e-13793.38Show/hide
Query:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSAPPPPPTSSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSM
        CET+TNLFPSQSDSSVPTSPVSPYRYQRPFSG+ PS GTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMR QP GPSSM
Subjt:  CETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSM

Query:  ELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--GVSEPEYNEEKSCKDLNRDMKDMNSESGG
        ELPYCSMPEPGP+IEAE+RPCSC+KSLVDERVYQLEECSSM  GVSE EYNE+KSCKDLNRDMKD  S+SGG
Subjt:  ELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--GVSEPEYNEEKSCKDLNRDMKDMNSESGG

A0A6J1F722 uncharacterized protein LOC111441454 isoform X12.1e-13392.28Show/hide
Query:  MGVESNSAPPPPP--TSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDC
        MGVESNS PPPPP  +SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLY RDEMS QYSPMSEDSDDC
Subjt:  MGVESNSAPPPPP--TSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDC

Query:  RFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPS
        RFCET+TNLFPSQSDSSVPTSPVSPYRYQRPFSG+TPSTGTNTSLGC+T PVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMR QP GPS
Subjt:  RFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPS

Query:  SMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSMGVSEPEYNEEKSCKDLNRDMKDMNSESGG
        SMELPYCSMPEPGP+IEAEER CS +KSLVDERVYQL ECSSMGVSEPEYNE+KSCKDLNR+MKD  SESGG
Subjt:  SMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSMGVSEPEYNEEKSCKDLNRDMKDMNSESGG

A0A6J1J464 uncharacterized protein LOC111482488 isoform X11.5e-13191.85Show/hide
Query:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF
        MGVESNSAPPPPP SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGD LSENLMDSPARSESMLY RDEMS QYSPMSEDSDDCRF
Subjt:  MGVESNSAPPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRF

Query:  CETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSM
        CET+TNLFPSQSD+SVPTSPVSPYRYQRPFSG+TPST TNTSLGC+TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMR QP GPSSM
Subjt:  CETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSM

Query:  ELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSMGVSEPEYNEEKSCKDLNRDMKDMNSESGG
        ELPYCSMPEPGP+IEAEER CS +KSLVDERVYQL ECS+MGVSEPEYNE+KSCKDLNR+MKD   ESGG
Subjt:  ELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSMGVSEPEYNEEKSCKDLNRDMKDMNSESGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G25920.1 BEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein (TAIR:AT2G25910.2)3.4e-5956.27Show/hide
Query:  PPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCE--TATNL
        PP P S S   SP GKR RDP+DEVYLDN  S KRYLSEIMA SLNGLTVGD L  N+++SPARSES LY RD++S QYSPMSEDSD+ RFCE  TAT  
Subjt:  PPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCE--TATNL

Query:  FPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNT----------SLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGP
          S    S PTSPVSPYRYQRP +       + T          S+  + +  T+ Q  QRGSD+EGRFPSSPSDICHS DLRR ALLRSVQMRTQP G 
Subjt:  FPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNT----------SLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGP

Query:  SSMELPYCSMPEPGPS-IEAEERPCSCLKSLVDERVYQL-EECSSMGVSEPEYNEEKSCKDLN
        SS           GPS I+ EER CS  KS+ ++R Y   E+     VS    ++ KSCK L+
Subjt:  SSMELPYCSMPEPGPS-IEAEERPCSCLKSLVDERVYQL-EECSSMGVSEPEYNEEKSCKDLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGTAGAATCGAACTCGGCGCCGCCGCCGCCGCCAACGTCCTCGTCTTCTACACCATCTCCCAGCGGGAAGAGGGCCAGAGATCCCGACGATGAAGTTTATCTCGA
CAATTTCCACTCTCATAAACGCTACCTCAGTGAGATAATGGCTTCGAGTTTGAATGGATTGACGGTTGGAGACCCCCTTTCAGAGAATCTCATGGATTCCCCTGCAAGGT
CAGAGTCCATGCTTTATCAAAGGGATGAAATGTCCTGGCAATATTCTCCTATGTCAGAAGATTCAGATGACTGCCGGTTTTGTGAGACAGCTACGAACTTATTTCCCTCG
CAGTCTGATAGTAGTGTACCTACCAGCCCAGTTTCTCCATACCGATATCAGAGGCCATTTAGTGGGGTGACTCCTTCAACAGGTACCAACACTTCACTTGGATGTTCTAC
TAGTCCCGTCACTAGCTTGCAGCCCCATCAACGTGGATCAGATTCTGAGGGCCGTTTCCCATCATCTCCAAGCGATATATGTCACTCAGCAGATCTGAGAAGGGCTGCGC
TCCTGCGTTCGGTACAAATGAGAACACAACCTCCCGGTCCATCATCTATGGAGTTGCCATATTGCTCCATGCCTGAGCCTGGGCCTAGTATAGAAGCTGAAGAGCGGCCA
TGTTCTTGCTTAAAATCGTTGGTTGATGAAAGAGTTTATCAACTCGAGGAATGCTCGTCAATGGGAGTGTCCGAGCCTGAATATAATGAAGAAAAATCATGCAAGGACTT
GAATAGGGACATGAAGGACATGAACAGTGAGTCCGGAGGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCGTAGAATCGAACTCGGCGCCGCCGCCGCCGCCAACGTCCTCGTCTTCTACACCATCTCCCAGCGGGAAGAGGGCCAGAGATCCCGACGATGAAGTTTATCTCGA
CAATTTCCACTCTCATAAACGCTACCTCAGTGAGATAATGGCTTCGAGTTTGAATGGATTGACGGTTGGAGACCCCCTTTCAGAGAATCTCATGGATTCCCCTGCAAGGT
CAGAGTCCATGCTTTATCAAAGGGATGAAATGTCCTGGCAATATTCTCCTATGTCAGAAGATTCAGATGACTGCCGGTTTTGTGAGACAGCTACGAACTTATTTCCCTCG
CAGTCTGATAGTAGTGTACCTACCAGCCCAGTTTCTCCATACCGATATCAGAGGCCATTTAGTGGGGTGACTCCTTCAACAGGTACCAACACTTCACTTGGATGTTCTAC
TAGTCCCGTCACTAGCTTGCAGCCCCATCAACGTGGATCAGATTCTGAGGGCCGTTTCCCATCATCTCCAAGCGATATATGTCACTCAGCAGATCTGAGAAGGGCTGCGC
TCCTGCGTTCGGTACAAATGAGAACACAACCTCCCGGTCCATCATCTATGGAGTTGCCATATTGCTCCATGCCTGAGCCTGGGCCTAGTATAGAAGCTGAAGAGCGGCCA
TGTTCTTGCTTAAAATCGTTGGTTGATGAAAGAGTTTATCAACTCGAGGAATGCTCGTCAATGGGAGTGTCCGAGCCTGAATATAATGAAGAAAAATCATGCAAGGACTT
GAATAGGGACATGAAGGACATGAACAGTGAGTCCGGAGGGTAG
Protein sequenceShow/hide protein sequence
MGVESNSAPPPPPTSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPS
QSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERP
CSCLKSLVDERVYQLEECSSMGVSEPEYNEEKSCKDLNRDMKDMNSESGG