CuGenDBv2

Spg038826 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg038826
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: BEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein.
Genome location: scaffold12:4239494..4243500
RNA-Seq Expression: Spg038826
Synteny: Spg038826
Gene Ontology terms: NA
InterPro domains: NA
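
The genome location field above follows a `scaffold:start..end` pattern. As a minimal sketch, it can be split into its parts and turned into a genomic span. The names `Location` and `parse_location` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Number of bases covered under the inclusive-coordinate assumption.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'scaffold12:4239494..4243500'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(match.group(1), start, end)


loc = parse_location("scaffold12:4239494..4243500")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under that assumption the gene region spans 4007 bases of scaffold12.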


Homology
GenBank top hits (e value, % identity, alignment)
KAG6581290.1 hypothetical protein SDJN03_21292, partial [Cucurbita argyrosperma subsp. sororia]; e value: 1.1e-133; % identity: 93.36
Query:  MGVESNSAAAPP----SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDD
        MGVESNS   PP    SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMS QYSPMSEDSDD
Subjt:  MGVESNSAAAPP----SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDD

Query:  CRFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGP
        CRFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+ PSTGTNTSLGC+T PVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GP
Subjt:  CRFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGP

Query:  SSMELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYNEQKSCKDLNRDMKDSESGG
        SSMELPYCSMPEPGPNIEAEER  S IKSLVDERVYQL ECSSMGVSEPEYNEQKSCKDLNR+MKDSESGG
Subjt:  SSMELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYNEQKSCKDLNRDMKDSESGG

XP_004152357.1 uncharacterized protein LOC101212915 isoform X1 [Cucumis sativus]; e value: 9.9e-138; % identity: 95.93
Query:  MGVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSA  PP+SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCS-TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCS TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
Subjt:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCS-TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSM--GVSEPEYNEQKSCKDLNRDMKDSESGG
        ELPYCSMPEPGPNIEAE+RP SCIKSLVDERVYQLEECSSM  GVSE EYNEQKSCKDLNRDMKDS SGG
Subjt:  ELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSM--GVSEPEYNEQKSCKDLNRDMKDSESGG

XP_008454305.1 PREDICTED: uncharacterized protein LOC103494744 isoform X1 [Cucumis melo]; e value: 3.5e-135; % identity: 94.07
Query:  MGVESNSA-AAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF
        MGVESNSA   PP+SSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSA-AAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+APS GTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSM
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSM--GVSEPEYNEQKSCKDLNRDMKDSESGG
        ELPYCSMPEPGPNIEAE+RP SCIKSLVDERVYQLEECSSM  GVSE EYNEQKSCKDLNRDMKDS+SGG
Subjt:  ELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSM--GVSEPEYNEQKSCKDLNRDMKDSESGG

XP_022934215.1 uncharacterized protein LOC111441454 isoform X1 [Cucurbita moschata]; e value: 8.7e-134; % identity: 93.7
Query:  MGVESNSAAAPP---SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDC
        MGVESNS   PP   SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMS QYSPMSEDSDDC
Subjt:  MGVESNSAAAPP---SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDC

Query:  RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPS
        RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+ PSTGTNTSLGC+T PVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPS
Subjt:  RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPS

Query:  SMELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYNEQKSCKDLNRDMKDSESGG
        SMELPYCSMPEPGPNIEAEER  S IKSLVDERVYQL ECSSMGVSEPEYNEQKSCKDLNR+MKDSESGG
Subjt:  SMELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYNEQKSCKDLNRDMKDSESGG

XP_038904570.1 uncharacterized protein LOC120090942 [Benincasa hispida]; e value: 4.9e-137; % identity: 95.51
Query:  MGVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNS  APP SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLY R+EMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME
        ETSTNLFP+QSDSSVPTSPVSPYRYQRPFSGV PSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSME
Subjt:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME

Query:  LPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYNEQKSCKDLNRDMKDSESGG
        LPYCSMPEPGPNIEAEERP SCIKSLVDER +QLEECSSMGVSEPEYNE+KSCKDLNRDMKDSESGG
Subjt:  LPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYNEQKSCKDLNRDMKDSESGG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KWN0 Uncharacterized protein; e value: 4.8e-138; % identity: 95.93
Query:  MGVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSA  PP+SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRFC
Subjt:  MGVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCS-TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCS TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
Subjt:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCS-TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSM--GVSEPEYNEQKSCKDLNRDMKDSESGG
        ELPYCSMPEPGPNIEAE+RP SCIKSLVDERVYQLEECSSM  GVSE EYNEQKSCKDLNRDMKDS SGG
Subjt:  ELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSM--GVSEPEYNEQKSCKDLNRDMKDSESGG

A0A1S3BXT2 uncharacterized protein LOC103494744 isoform X1; e value: 1.7e-135; % identity: 94.07
Query:  MGVESNSA-AAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF
        MGVESNSA   PP+SSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSA-AAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+APS GTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSM
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSM--GVSEPEYNEQKSCKDLNRDMKDSESGG
        ELPYCSMPEPGPNIEAE+RP SCIKSLVDERVYQLEECSSM  GVSE EYNEQKSCKDLNRDMKDS+SGG
Subjt:  ELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSM--GVSEPEYNEQKSCKDLNRDMKDSESGG

A0A5A7TRC2 Uncharacterized protein; e value: 1.7e-135; % identity: 94.07
Query:  MGVESNSA-AAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF
        MGVESNSA   PP+SSSSTPSPSGKRARDP+DEVYLDNFHSHKRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLY RDEMSWQYSPMSEDSDDCRF
Subjt:  MGVESNSA-AAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM
        CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+APS GTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSM
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSM

Query:  ELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSM--GVSEPEYNEQKSCKDLNRDMKDSESGG
        ELPYCSMPEPGPNIEAE+RP SCIKSLVDERVYQLEECSSM  GVSE EYNEQKSCKDLNRDMKDS+SGG
Subjt:  ELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSM--GVSEPEYNEQKSCKDLNRDMKDSESGG

A0A6J1F722 uncharacterized protein LOC111441454 isoform X1; e value: 4.2e-134; % identity: 93.7
Query:  MGVESNSAAAPP---SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDC
        MGVESNS   PP   SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMS QYSPMSEDSDDC
Subjt:  MGVESNSAAAPP---SSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDC

Query:  RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPS
        RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSG+ PSTGTNTSLGC+T PVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPS
Subjt:  RFCETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPS

Query:  SMELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYNEQKSCKDLNRDMKDSESGG
        SMELPYCSMPEPGPNIEAEER  S IKSLVDERVYQL ECSSMGVSEPEYNEQKSCKDLNR+MKDSESGG
Subjt:  SMELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYNEQKSCKDLNRDMKDSESGG

A0A6J1J464 uncharacterized protein LOC111482488 isoform X1; e value: 2.7e-133; % identity: 93.26
Query:  MGVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC
        MGVESNSA  PP SSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGD LSENLMDSPARSESMLYLRDEMS QYSPMSEDSDDCRFC
Subjt:  MGVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFC

Query:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME
        ETSTNLFPSQSD+SVPTSPVSPYRYQRPFSG+ PST TNTSLGC+TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPSSME
Subjt:  ETSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSME

Query:  LPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYNEQKSCKDLNRDMKDSESGG
        LPYCSMPEPGPNIEAEER  S IKSLVDERVYQL ECS+MGVSEPEYNEQKSCKDLNR+MKD ESGG
Subjt:  LPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQLEECSSMGVSEPEYNEQKSCKDLNRDMKDSESGG

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G25920.1 BEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein (TAIR:AT2G25910.2); e value: 3.1e-57; % identity: 53.16
Query:  GVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCE
        G      A  P    S  SP GKR RDP+DEVYLDN  S KRYLSEIMA SLNGLTVGD L  N+++SPARSES LY RD++S QYSPMSEDSD+ RFCE
Subjt:  GVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCE

Query:  --TSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNT----------SLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQM
          T+T    S    S PTSPVSPYRYQRP +       + T          S+  + +  T+ Q  QRGSD+EGRFPSSPSDICHS DLRR ALLRSVQM
Subjt:  --TSTNLFPSQSDSSVPTSPVSPYRYQRPFSGVAPSTGTNT----------SLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQM

Query:  RAQPPGPSSMELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQL-EECSSMGVSEPEYNEQKSCKDLN
        R QP G SS   P         NI+ EER   C KS+ ++R Y   E+     VS    ++ KSCK L+
Subjt:  RAQPPGPSSMELPYCSMPEPGPNIEAEERPYSCIKSLVDERVYQL-EECSSMGVSEPEYNEQKSCKDLN
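
In the alignment blocks above, the unlabelled middle row carries a letter where query and subject agree, a `+` for a conservative substitution, and a space for a mismatch or gap. A rough sketch of how a percent identity could be recomputed from those rows follows. The function name `percent_identity` is hypothetical, and it assumes identity is defined as identical columns divided by total alignment columns, which may differ in detail from how the figures listed in this record were produced.

```python
from typing import Iterable, Tuple


def percent_identity(blocks: Iterable[Tuple[str, str]]) -> float:
    """Estimate % identity from (query_row, midline_row) pairs.

    Only the aligned text is passed in, without the 'Query:' label.
    The midline may be shorter than the query row if trailing spaces
    were stripped, so the query row supplies the column count.
    """
    identical = 0
    columns = 0
    for query_row, midline_row in blocks:
        columns += len(query_row)
        # A letter in the midline marks an identical column;
        # '+' and ' ' do not count as identities.
        identical += sum(1 for ch in midline_row if ch.isalpha())
    if columns == 0:
        raise ValueError("no alignment columns supplied")
    return 100.0 * identical / columns


# Toy example (not taken from this record): 6 identical columns out of 8.
print(percent_identity([("MGVESANS", "MGV+S NS")]))
```

The midline is used rather than a direct query-to-subject comparison because the Subjt rows as captured in this record read the same as the Query rows.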


Sequences
CDS sequence
ATGGGTGTCGAATCGAACTCGGCGGCGGCGCCGCCATCGTCGTCGTCTTCTACGCCATCTCCGAGCGGGAAGAGGGCCCGAGATCCTGACGATGAAGTTTATCTCGACAA
TTTCCACTCTCACAAGCGCTACCTCAGTGAGATAATGGCTTCTAGTTTGAATGGATTGACGGTTGGGGATCCCCTTTCAGAGAATCTCATGGATTCCCCAGCAAGGTCGG
AGTCCATGCTTTATCTAAGGGATGAAATGTCCTGGCAATATTCTCCTATGTCTGAAGATTCAGATGACTGCCGGTTTTGTGAGACATCCACAAACTTATTTCCCTCGCAG
TCTGATAGTAGTGTACCCACCAGTCCAGTCTCTCCATACCGATATCAGAGGCCATTCAGCGGGGTGGCTCCTTCAACAGGTACTAATACTTCACTTGGATGTTCTACTAG
TCCCGTCACTAGCTTGCAGCCCCATCAACGTGGATCAGACTCCGAGGGTCGTTTCCCATCGTCTCCTAGTGATATATGCCACTCAGCAGACTTGAGAAGGGCTGCGCTCC
TGCGTTCGGTACAAATGAGAGCACAACCTCCTGGTCCATCATCTATGGAGTTGCCATATTGCTCAATGCCTGAGCCTGGACCTAATATAGAAGCTGAAGAGCGGCCATAT
TCTTGCATCAAATCGTTAGTCGATGAAAGAGTTTATCAACTTGAGGAATGCTCCTCAATGGGAGTGTCCGAACCTGAATATAATGAACAGAAATCATGCAAGGACTTGAA
CAGGGATATGAAAGACAGTGAGTCTGGAGGGTAG
mRNA sequence
ATGGGTGTCGAATCGAACTCGGCGGCGGCGCCGCCATCGTCGTCGTCTTCTACGCCATCTCCGAGCGGGAAGAGGGCCCGAGATCCTGACGATGAAGTTTATCTCGACAA
TTTCCACTCTCACAAGCGCTACCTCAGTGAGATAATGGCTTCTAGTTTGAATGGATTGACGGTTGGGGATCCCCTTTCAGAGAATCTCATGGATTCCCCAGCAAGGTCGG
AGTCCATGCTTTATCTAAGGGATGAAATGTCCTGGCAATATTCTCCTATGTCTGAAGATTCAGATGACTGCCGGTTTTGTGAGACATCCACAAACTTATTTCCCTCGCAG
TCTGATAGTAGTGTACCCACCAGTCCAGTCTCTCCATACCGATATCAGAGGCCATTCAGCGGGGTGGCTCCTTCAACAGGTACTAATACTTCACTTGGATGTTCTACTAG
TCCCGTCACTAGCTTGCAGCCCCATCAACGTGGATCAGACTCCGAGGGTCGTTTCCCATCGTCTCCTAGTGATATATGCCACTCAGCAGACTTGAGAAGGGCTGCGCTCC
TGCGTTCGGTACAAATGAGAGCACAACCTCCTGGTCCATCATCTATGGAGTTGCCATATTGCTCAATGCCTGAGCCTGGACCTAATATAGAAGCTGAAGAGCGGCCATAT
TCTTGCATCAAATCGTTAGTCGATGAAAGAGTTTATCAACTTGAGGAATGCTCCTCAATGGGAGTGTCCGAACCTGAATATAATGAACAGAAATCATGCAAGGACTTGAA
CAGGGATATGAAAGACAGTGAGTCTGGAGGGTAG
Protein sequence
MGVESNSAAAPPSSSSSTPSPSGKRARDPDDEVYLDNFHSHKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSWQYSPMSEDSDDCRFCETSTNLFPSQ
SDSSVPTSPVSPYRYQRPFSGVAPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPPGPSSMELPYCSMPEPGPNIEAEERPY
SCIKSLVDERVYQLEECSSMGVSEPEYNEQKSCKDLNRDMKDSESGG
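
The CDS and protein sequences above are related by translation. The following is a minimal sketch of that step, assuming the standard genetic code and a reading frame starting at the first base. The function name `translate` is hypothetical and this is not necessarily how the database derived its protein sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGGGTGTCGAATCGAACTCG"))  # MGVESNS
```

Passing the full CDS, with its line breaks, to `translate` gives a string that can be compared against the protein sequence listed here.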